Southeast University
Doctoral Dissertation
Research on Web Access Mining
Name: Song Aibo
Degree applied for: Doctor
Major: Computer Application Technology
Advisor: Dong Yisheng
2003-08-01

... based on Petri nets. Neither of these two methods requires user participation or cooperation; both are fully automatic. The Petri-net-based method is intuitive and does not need to recompute the recommendation set repeatedly; moreover, Petri nets naturally describe the concurrent display and concurrent browsing of multiple contents on the web.

Keywords: data mining, web access mining, personalization, site management, Petri net

Research on Web Access Mining

Abstract

Today, the World Wide Web is rapidly emerging as an important medium for the dissemination, exchange, and retrieval of information. According to most predictions, the majority of human information will be available on the Web in ten years. These huge amounts of data raise a grand challenge, namely, how to turn the Web into a more useful information utility.

At present, the main tools for getting information are still search engines. Today's search engines, however, are plagued by the following four problems: low precision; low recall; a limited query interface based only on keyword-oriented search; and no customization for individual users. These problems, in turn, can be attributed to the following characteristics of the Web. First and foremost, the Web is a huge, diverse, and dynamic collection of interlinked hypertext documents. Furthermore, it is widely believed that 99% of the information on the Web is of no interest to 99% of the people. Second, except for hyperlinks, the Web is largely unstructured. Finally, most information on the Web is in the form of HTML documents, for which analysis and extraction of content is very difficult. Therefore, it is not easy to overcome all the problems caused by search engines.

In this thesis, we use web access mining to discover user browsing patterns such as aims, interests, and preferences. These patterns are then used to improve the structure of web sites and the manner of web service. Thus, we can help users get what they need more easily through personalized information service and automated site administration. Mining web access data is also important for E-commerce. Its significance lies in improving customer relationship management, assisting in decision making and security management, and helping merchandisers implement a one-to-one marketing strategy.

The dissertation is composed of the following parts:

(1) We discuss various problems met during data preparation in web access mining, then give a simple method to identify user access transactions according to host addr