This paper presents a method of improving the quality of subcategorization frames (SCFs) acquired from corpora in order to augment a lexicon of a lexicalized grammar. We first estimate a confidence value that a word can have each SCF, and create an SCF con
References
HorstBischof,AlesLeonardis,andAlexanderSelb.1999.MDLprincipleforrobustvectorquantization.PatternAnalysisandApplications,2(1):59–rgelex-iconsfornaturallanguageprocessing:putationalLinguistics,13(4):203–218.
putationalLinguistics,19(2):243–262.TedBriscoeandJhonCarroll.1997.Automaticextrac-tionofsubcategorizationfromcorpora.InProc.ofthe fthANLP,pages356–363.
JhonCarrollandAlexC.Fang.2004.TheautomaticacquisitionofverbsubcategorizationsandtheirimpactontheperformanceofanHPSGparser.InProc.ofthe rstijc-NLP,pages107–114.
DanFlickinger.2000.Onbuildingamoreef cientgrammarbyexploitingtypes.NaturalLanguageEn-gineering,6(1):15–28.
EdwardW.Forgy.1965.Clusteranalysisofmultivariatedata:Ef ciencyvs.interpretabilityofclassi cations.Biometrics,21:768–780.
AndrewGelman,JohnB.Carlin,HalS.Stern,andDon-aldB.Rubin,editors.1995.BayesianDataAnalysis.ChapmanandHall.RalphGrishman,CatherineMacleod,lexsyntax:Buildingacomputationallexi-con.InProc.ofthe15thCOLING,pages268–272.GregHamerly.2003.Learningstructureandconceptsindatathroughdataclustering.Ph.D.thesis,UniversityofCalifornila,SanDiego.AnnaKorhonen,YuvalKrymolowski,andZvikaMarx.2003.Clusteringpolysemicsubcategorizationframedistributionssemantically.InProc.ofthe41stACL,pages64–71.AnnaKorhonen.2002.SubcategorizationAcquisition.Ph.D.thesis,UniversityofCambridge.ChristopherD.Manning.1993.Automaticacquisitionofalargesubcategorizationdictionaryfromcorpora.InProc.ofthe31stACL,pages235–242.AnoopSarkarandDanielZeman.2000.AutomaticextractionofsubcategorizationframesforCzech.InProc.of18thCOLING,pages691–697.
AnoopSarkar,FeiXia,andAravindK.Joshi.2000.Someexperimentsonindicatorsofparsingcomplexityforlexicalizedgrammars.InProc.ofthe18thCOL-INGworkshop,pages37–42.
YvesSchabes,AnneAbeill´e,andAravindK.Joshi.1988.Parsingstrategieswith‘lexicalized’grammars:applicationtoTreeAdjoiningGrammars.InProc.ofthe12thCOLING,pages578–583.
SabineSchulteimWaldeandChrisBrew.2002.Induc-ingGermansemanticverbclassesfrompurelysyntac-ticsubcategorisationinformation.InProc.ofthe41stACL,pages223–230.
NaftaliTishby,FernandC.Pereira,andWilliamBialek.1999.Theinformationbottleneckmethod.InProc.ofthe37thACL,pages368–377.YoshimasaTsuruokaandTakashiChikayama.2001.Es-timatingreliabilityofcontextualevidencesindecision-listclassi ersunderbayesianlearning.InProc.ofthesixthNLPRS,pages701–707.XTAGResearchGroup.2001.AlexicalizedTreeAd-joiningGrammarforEnglish.TechnicalReportIRCS-01-03,IRCS,UniversityofPennsylvania.