資源描述:
《foundations_of_statistical_natural_language_processing》由會(huì)員上傳分享,免費(fèi)在線閱讀,更多相關(guān)內(nèi)容在學(xué)術(shù)論文-天天文庫。
1、pPrefaceTheneedforathoroughtextbookforStatisticalNaturalLanguagePro-cessinghardlyneedstobearguedforintheageofon-lineinformation,electroniccommunicationandtheWorldWideWeb.Increasingly,busi-nesses,governmentagenciesandindividualsareconfrontedwithlargeamoun
2、tsoftextthatarecriticalforworkingandliving,butnotwellenoughunderstoodtogettheenormousvalueoutofthemthattheypo-tentiallyhide.Atthesametime,theavailabilityoflargetextcorporahaschangedthescienti?capproachtolanguageinlinguisticsandcognitivescience.Phenomenat
3、hatwerenotdetectableorseemeduninterestinginstudyingtoydomainsandindividualsentenceshavemovedintothecenter?eldofwhatisconsideredimportanttoexplain.Whereasasrecentlyastheearly1990squantitativemethodswereseenassoinadequateforlinguisticsthatanimportanttextbo
4、okformathematicallinguisticsdidnotcovertheminanyway,theyarenowincreasinglyseenascrucialforlinguistictheory.Inthisbookwehavetriedtoachieveabalancebetweentheoryandpractice,andbetweenintuitionandrigor.Weattempttogroundap-proachesintheoreticalideas,bothmathe
5、maticalandlinguistic,butsi-multaneouslywetrytonotletthematerialgettoodry,andtrytoshowhowtheoreticalideashavebeenusedtosolvepracticalproblems.Todothis,we?rstpresentkeyconceptsinprobabilitytheory,statistics,infor-mationtheory,andlinguisticsinordertogivestu
6、dentsthefoundationstounderstandthe?eldandcontributetoit.Thenwedescribetheprob-lemsthatareaddressedinStatisticalNaturalLanguageProcessing(NLP),liketagginganddisambiguation,andaselectionofimportantworksoiipxxxPrefacethatstudentsaregroundedintheadvancesthat
7、havebeenmadeand,havingunderstoodthespecialproblemsthatlanguageposes,canmovethe?eldforward.Whenwedesignedthebasicstructureofthebook,wehadtomakeanumberofdecisionsaboutwhattoincludeandhowtoorganizethematerial.Akeycriterionwastokeepthebooktoamanageablesize.(
8、Wedidn'tentirelysucceed!)Thusthebookisnotacompleteintroductiontoprobabilitytheory,informationtheory,statistics,andthemanyotherareasofmathematicsthatareusedinStatisticalNLP.Wehavetriedtocoverthosetopicsthatseemmostimportant