資源描述:
《基于個性Agent的協(xié)作強化學(xué)習(xí)模型研究》由會員上傳分享,免費在線閱讀,更多相關(guān)內(nèi)容在行業(yè)資料-天天文庫。
1、江蘇大學(xué)碩士學(xué)位論文基于個性Agent的協(xié)作強化學(xué)習(xí)模型研究姓名:嚴(yán)耀華申請學(xué)位級別:碩士專業(yè):計算機(jī)應(yīng)用技術(shù)指導(dǎo)教師:程顯毅20080606江蘇大學(xué)碩士研究生畢業(yè)論文ABSTRACTWiththedeVelopmentofComputernetworksandteclulologlesofartificialintelligence,theagentstIldyh弱becomeahotspotofdistribut|edanijciCialiIltelligencestudy.AgenttheOry,technolog
2、y,inpaniculaurmeMAStedmOlogynOtoIllytometheoryofdistmutedapplicationstosolVenewproblemsproVide鋤effectivewayforcomprehensiVeandaccIlratestudy0fmecharacteriStiCsofdistributedcomputingsystemproVidesareasonablemodel0ftheconCept0fbrir培tousthedesignalldoperationcanbere
3、al沱edi11medistribution卸diIlthe0penenViro衄emmanewsoftwaresystemmodel,adescriptionofmecomplexphenomenon,mestlldyofcomplexsystems,calclllationofcomplexad?。簦椋郑澹幔穑穑颍铮幔悖瑁停粒樱辏牛椋颍螅簦铮睿簦瑁澹猓幔螅椋螅铮妫椋螅螅酰澹?,defmitionofasiIl酉eAgent,Agentgiventocenainactsa11dpar鋤eters鋤dmen,thede觚
4、tionbet、)lreentheAgent柚dAgemaIldtheenViromentistIleiIlteractionbe似,eenthemles;FiIlaLlly,tlleAgentisⅡ1eiIlteractionbetweenmeactiVitiesoftllesolutiont0Ⅱleproblemcapaci哆Thus,stmcturaldesi?。粒纾澹睿簦幔睿洌粒纾澹睿簦茫埃欤欤幔猓铮颍幔欤椋铮睿猓澹?、)lreentheMAStechnologyistlleC0re.nispaperViewoft
5、heCu盯emIackofMulti—AgemcollaboratiVeresearchpersonali哆tendencies鋤dpersonalit)rtraitmodel,thispaperpresentsCRLBP((boperativeReinforCementLeamingb弱edPersonali哆)model,f而manotherperspectiVe,tosolVemeMulti—Agentcollaboration.Themainworkindudes:(1)themodelofAgentpe啪nal
6、咄personal時andbehaviorwillbebundled耐thInfomationAgent,aIldisdesc曲edmdetail鋤dpersonal也edinfonnationFonnalAgent.AgentmoreoftheVarioustasl【smatchiIlgtheroleofposition.’(2)themodelwiUbeiIltroducedt0thecharactcrAgentCollaborationagreement,basedonpersonalityAgentcollabo
7、rationrei響rcememleamingmodel(CRLBP),themodelwillbeasin甜eAgentf幻mthetraditionalperspectiVeofthereiIlfb-cementleam血g,?。穑欤椋澹洌簦铮簦瑁澹粒纾澹睿粢粒铮酰穑螅悖埃保欤幔猓铮颍幔簦椋铮睿颍澹椋睿剩妫铮颍茫澹恚澹睿簦欤澹幔睿欤椋睿?,theCRLBPModelcomp撕son麗ththetraditionalmodelexp舐men_ts,iIlthepassandinterceptedtheball,CRLBP
8、thanthetraditionalmodelhasadVantages.(3)theprobabili鑼distrib們onfunction鋤d鋤endedthrougllthe嬲sessment刪on,afornlalpersonalityAgent,鋤dsimulationeXperimentthatdonot