计科专业毕业设计外文文献翻译SQL.docx
- 文档编号:26411177
- 上传时间:2023-06-19
- 格式:DOCX
- 页数:14
- 大小:24.58KB
计科专业毕业设计外文文献翻译SQL.docx
《计科专业毕业设计外文文献翻译SQL.docx》由会员分享,可在线阅读,更多相关《计科专业毕业设计外文文献翻译SQL.docx(14页珍藏版)》请在冰豆网上搜索。
计科专业毕业设计外文文献翻译SQL
毕业设计(论文)
外文文献翻译
专业
理学院
学生姓名
班级
计科092
学号
200901051
指导教师
英文原文
IntroductiontoDataMining
Abstract:
Microsoft®SQLServer™2005providesanintegratedenvironmentforcreatingandworkingwithdataminingmodels.This tutorialusesfourscenarios,targeted mailing, forecasting, market basket,and sequence clustering,todemonstratehowtousetheminingmodelalgorithms,miningmodelviewers,anddataminingtools thatareincludedinthisreleaseofSQLServer.
Introduction
ThedataminingtutorialisdesignedtowalkyouthroughtheprocessofcreatingdataminingmodelsinMicrosoftSQLServer2005.ThedataminingalgorithmsandtoolsinSQLServer2005makeiteasytobuildacomprehensivesolutionforavarietyofprojects,includingmarketbasketanalysis,forecastinganalysis,andtargetedmailinganalysis.Thescenariosforthesesolutionsareexplainedingreaterdetaillaterinthetutorial.
ThemostvisiblecomponentsinSQLServer2005aretheworkspacesthatyouusetocreateandworkwithdataminingmodels.Theonlineanalyticalprocessing(OLAP)anddataminingtoolsareconsolidatedintotwoworkingenvironments:
BusinessIntelligenceDevelopmentStudioandSQLServerManagementStudio.UsingBusinessIntelligenceDevelopmentStudio,youcandevelopanAnalysisServicesprojectdisconnectedfromtheserver.Whentheprojectisready,youcandeployittotheserver.Youcanalsoworkdirectlyagainsttheserver.ThemainfunctionofSQLServerManagementStudioistomanagetheserver.Eachenvironmentisdescribedinmoredetaillaterinthisintroduction.Formoreinformationonchoosingbetweenthetwoenvironments,see"ChoosingBetweenSQLServerManagementStudioandBusinessIntelligenceDevelopmentStudio"inSQLServerBooksOnline.
Allofthedataminingtoolsexistinthedataminingeditor.Usingtheeditoryoucanmanageminingmodels,createnewmodels,viewmodels,comparemodels,andcreatepredictionsbasedonexistingmodels.
Afteryoubuildaminingmodel,youwillwanttoexploreit,lookingforinterestingpatternsandrules.Eachminingmodelviewerintheeditoriscustomizedtoexploremodelsbuiltwithaspecificalgorithm.Formoreinformationabouttheviewers,see"ViewingaDataMiningModel"inSQLServerBooksOnline.
Oftenyourprojectwillcontainseveralminingmodels,sobeforeyoucanuseamodeltocreatepredictions,youneedtobeabletodeterminewhichmodelisthemostaccurate.Forthisreason,theeditorcontainsamodelcomparisontoolcalledtheMiningAccuracyCharttab.Usingthistoolyoucancomparethepredictiveaccuracyofyourmodelsanddeterminethebestmodel.
Tocreatepredictions,youwillusetheDataMiningExtensions(DMX)language.DMXextendsSQL,containingcommandstocreate,modify,andpredictagainstminingmodels.FormoreinformationaboutDMX,see"DataMiningExtensions(DMX)Reference"inSQLServerBooksOnline.Becausecreatingapredictioncanbecomplicated,thedataminingeditorcontainsatoolcalledPredictionQueryBuilder,whichallowsyoutobuildqueriesusingagraphicalinterface.YoucanalsoviewtheDMXcodethatisgeneratedbythequerybuilder.
Justasimportantasthetoolsthatyouusetoworkwithandcreatedataminingmodelsarethemechanicsbywhichtheyarecreated.Thekeytocreatingaminingmodelisthedataminingalgorithm.Thealgorithmfindspatternsinthedatathatyoupassit,andittranslatesthemintoaminingmodel—itistheenginebehindtheprocess.
Someofthemostimportantstepsincreatingadataminingsolutionareconsolidating,cleaning,andpreparingthedatatobeusedtocreatetheminingmodels.SQLServer2005includestheDataTransformationServices(DTS)workingenvironment,whichcontainstoolsthatyoucanusetoclean,validate,andprepareyourdata.FormoreinformationonusingDTSinconjunctionwithadataminingsolution,see"DTSDataMiningTasksandTransformations"inSQLServerBooksOnline.
InordertodemonstratetheSQLServerdataminingfeatures,thistutorialusesanewsampledatabasecalledAdventureWorksDW.ThedatabaseisincludedwithSQLServer2005,anditsupportsOLAPanddataminingfunctionality.Inordertomakethesampledatabaseavailable,youneedtoselectthesampledatabaseattheinstallationtimeinthe“Advanced”dialogforcomponentselection.
AdventureWorks
AdventureWorksDWisbasedonafictionalbicyclemanufacturingcompanynamedAdventureWorksCycles.AdventureWorksproducesanddistributesmetalandcompositebicyclestoNorthAmerican,European,andAsiancommercialmarkets.ThebaseofoperationsislocatedinBothell,Washingtonwith500employees,andseveralregionalsalesteamsarelocatedthroughouttheirmarketbase.
AdventureWorkssellsproductswholesaletospecialtyshopsandtoindividualsthroughtheInternet.Forthedataminingexercises,youwillworkwiththeAdventureWorksDWInternetsalestables,whichcontainrealisticpatternsthatworkwellfordataminingexercises.
FormoreinformationonAdventureWorksCyclessee"SampleDatabasesandBusinessScenarios"inSQLServerBooksOnline.
DatabaseDetails
TheInternetsalesschemacontainsinformationabout9,242customers.Thesecustomersliveinsixcountries,whicharecombinedintothreeregions:
NorthAmerica(83%)
Europe(12%)
Australia(7%)
Thedatabasecontainsdataforthreefiscalyears:
2002,2003,and2004.
Theproductsinthedatabasearebrokendownbysubcategory,model,andproduct.
BusinessIntelligenceDevelopmentStudio
BusinessIntelligenceDevelopmentStudioisasetoftoolsdesignedforcreatingbusinessintelligenceprojects.BecauseBusinessIntelligenceDevelopmentStudiowascreatedasanIDEenvironmentinwhichyoucancreateacompletesolution,youworkdisconnectedfromtheserver.Youcanchangeyourdataminingobjectsasmuchasyouwant,butthechangesarenotreflectedontheserveruntilafteryoudeploytheproject.
WorkinginanIDEisbeneficialforthefollowingreasons:
TheAnalysisServicesprojectistheentrypointforabusinessintelligencesolution.AnAnalysisServicesprojectencapsulatesminingmodelsandOLAPcubes,alongwithsupplementalobjectsthatmakeuptheAnalysisServicesdatabase.FromBusinessIntelligenceDevelopmentStudio,youcancreateandeditAnalysisServicesobjectswithinaprojectanddeploytheprojecttotheappropriateAnalysisServicesserverorservers.
IfyouareworkingwithanexistingAnalysisServicesproject,youcanalsouseBusinessIntelligenceDevelopmentStudiotoworkconnectedtheserver.Inthisway,changesarereflecteddirectlyontheserverwithouthavingtodeploythesolution.
SQLServerManagementStudio
SQLServerManagementStudioisacollectionofadministrativeandscriptingtoolsforworkingwithMicrosoftSQLServercomponents.ThisworkspacediffersfromBusinessIntelligenceDevelopmentStudiointhatyouareworkinginaconnectedenvironmentwhereactionsarepropagatedtotheserverassoonasyousaveyourwork.
Afterthedatahasbeencleanedandpreparedfordatamining,mostofthetasksassociatedwithcreatingadataminingsolutionareperformedwithinBusinessIntelligenceDevelopmentStudio.UsingtheBusinessIntelligenceDevelopmentStudiotools,youdevelopandtestthedataminingsolution,usinganiterativeprocesstodeterminewhichmodelsworkbestforagivensituation.Whenthedeveloperissatisfiedwiththesolution,itisdeployedtoanAnalysisServicesserver.Fromthispoint,thefocusshiftsfromdevelopmenttomaintenanceanduse,andthusSQLServerManagementStudio.UsingSQLServerManagementStudio,youcanadministeryourdatabaseandperformsomeofthesamefunctionsasinBusinessIntelligenceDevelopmentStudio,suchasviewing,andcreatingpredictionsfromminingmodels.
DataTransformationServices
DataTransformationServices(DTS)comprisestheExtract,Transform,andLoad(ETL)toolsinSQLServer2005.Thesetoolscanbeusedtoperformsomeofthemostimportanttasksindatamining:
cleaningandpreparingthedataformodelcreation.Indatamining,youtypicallyperformrepetitivedatatransformationstocleanthedatabeforeusingthedatatotrainaminingmodel.UsingthetasksandtransformationsinDTS,youcancombinedatapreparationandmodelcreationintoasingleDTSpackage.
DTSalsoprovidesDTSDesignertohelpyoueasilybuildandrunpackagescontainingallofthetasksandtransformations.UsingDTSDesigner,youcandeploythepackagestoaserverandrunthemonaregularlyscheduledbasis.Thisisusefulif,forexample,youcollectdataweeklydataandwanttoperformthesamecleaningtransformationseachtimeinanautomatedfashion.
YoucanworkwithaDataTransformationprojectandanAnalysisServicesprojecttogetheraspartofabusinessintelligencesolution,byaddingeachprojecttoasolutioninBusinessIntelligenceDevelopmentStudio.
MiningModelAlgorithms
Dataminingalgorithmsarethefoundationfromwhichminingmodelsarecreated.ThevarietyofalgorithmsincludedinSQLServer2005allowsyoutoperformmanytypesofanalysis.Formorespecificinformationaboutthealgorithmsandhowtheycanbeadjustedusingparameters,see"DataMiningAlgorithms"inSQLServerBooksOnline.
MicrosoftDecisionTrees
TheMicrosoftDecisionTreesalgorithmsupportsbothclassificationandregressionanditworkswellforpredictivemodeling.Usingthealgorithm,youcanpredictbothdiscreteandcontinuousattributes.
Inbuildingamodel,thealgorithmexamineshoweachinputattributeinthedatasetaffectstheresultofthepredictedattribute,andthenitusestheinputattributeswiththestrongestrelationshiptocreateaseriesofsplits,callednodes
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 专业 毕业设计 外文 文献 翻译 SQL