学科分类
/ 3
41 个结果
  • 简介:Sequentialpatternminingisanimportantdataminingproblemwithbroadapplications.However,itisalsoachallengingproblemsincetheminingmayhavetogenerateorexamineacombinatoriallyexplosivenumberofintermediatesubsequences.Recentstudieshavedevelopedtwomajorclassesofsequentialpatternminingmethods:(1)acandidategeneration-and-testapproach,representedby(i)GSP,ahorizontalformat-basedsequentialpatternminingmethod,and(ii)SPADE,averticalformat-basedmethod;and(2)apattern-growthmethod,representedbyPrefixSpananditsfurtherextensions,suchasgSpanforminingstructuredpatterns.Inthisstudy,weperformasystematicintroductionandpresentationofthepattern-growthmethodologyandstudyitsprinciplesandextensions.Wefirstintroducetwointerestingpattern-growthalgorithms,FreeSpanandPrefixSpan,forefficientsequentialpatternmining.ThenweintroducegSpanforminingstructuredpatternsusingthesamemethodology.Theirrelativeperformanceinlargedatabasesispresentedandanalyzed.Severalextensionsofthesemethodsarealsodiscussedinthepaper,includingminingmulti-level,multi-dimensionalpatternsandminingconstraint-basedpatterns.

  • 标签: 数据挖掘 顺序方向挖掘 可量测性 性能分析
  • 简介:Geological Prospecting and Mining in TibetGeologicalProspectingandMininginTibet¥DONDUINAMGYISeptember1,1995markedthe30thanniv...

  • 标签:
  • 简介:HuainanCoalMiningBureau,aspeciallargecoalenterpriseandastatekeycoalproductionbase,issituatedincentral-northpartofAnhuiProvince.Thearea,well-knownas"thecoalcapitalofEastChina",aboundsincoalresources,andtheprovencoalreserveisestimatedtobeupto70billiontonswithcompletevarietiesandsuperiorquality.Bytheyearof2010,theannualproductioncapacitywillreach30milliontons.Thereareexcellentinvestmentenvironmentandconvenientcommunicationandtransportation

  • 标签:
  • 简介:Withmassiveamountsofdatastoredindatabases,mininginformationandknowledgeindatabaseshasbecomeanimportantissueinrecentresearch.Researchersinmanydifferentfieldshaveshowngreatinterestindateminingandknowledgediscoveryindatabases.Severalemergingapplicationsininformationprovidingservices,suchasdatawarehousingandon-lineservicesovertheInternet,alsocallforvariousdataminingandknowledgediscoverytchniquestounderstandusedbehaviorbetter,toimprovetheserviceprovided,andtoincreasethebusinessopportunities.Inresponsetosuchademand,thisarticleistoprovideacomprehensivesurveyonthedataminingandknowledgediscorverytechniquesdevelopedrecently,andintroducesomerealapplicationsystemsaswell.Inconclusion,thisarticlealsolistssomeproblemsandchallengesforfurtherresearch.

  • 标签: 数据库 知识发现 机器学习 数据开采
  • 简介:Landresourcesarefacingcrisesofbeingmisused,especiallyforanintersectionareabetweentownandcountry,andlandcontrolhastobeenforced.Thispaperpresentsadevelopmentofdataminingmethodforlandcontrol.Avector-matchmethodfortheprerequisiteofdataminingi.e.,datacleaningisproposed,whichdealswithbothcharacterandnumericdataviavectorizingcharacter-stringandmatchingnumber.Aminimaldecisionalgorithmofroughsetisusedtodiscovertheknowledgehiddeninthedatawarehouse.Inordertomonitorlandusedynamicallyandaccurately,itissuggestedtosetupareal-timelandcontrolsystembasedonGPS,digitalphotogrammetryandonlinedatamining.Finally,themeansisappliedintheintersectionareabetweentownandcountryofWuhancity,andasetofknowledgeaboutlandcontrolisdiscovered.

  • 标签: LAND CONTROL DATA MINING vector-match method
  • 简介:Thispaperpresentsafault-detectionmethodbasedonthephasespacereconstructionanddataminingapproachesforthecomplexelectronicsystem.TheapproachforthephasespacereconstructionofchaotictimeseriesisacombinationalgorithmofmultipleautocorrelationandΓ-test,bywhichthequasi-optimalembeddingdimensionandtimedelaycanbeobtained.Thedataminingalgorithm,whichcalculatestheradiusofgyrationofunit-masspointaroundthecentreofmassinthephasespace,candistinguishthefaultparameterfromthechaotictimeseriesoutputbythetestedsystem.Theexperimentalresultsdepictthatthisfaultdetectionmethodcancorrectlydetectthefaultphenomenaofelectronicsystem.

  • 标签: 数据采集 故障检测 混沌时间序列 相位空间重建 拓扑结构
  • 简介:OutlierminingisanimportantaspectindataminingandtheoutlierminingbasedonCookdistanceismostcommonlyused.Butweknowthatwhenthedatahavemulticollinearity,thetraditionalCookmethodisnolongereffective.Consideringtheexcellenceoftheprincipalcomponentestimation,weuseittosubstitutetheleastsquaresestimation,andthengivetheCookdistancemeasurementbasedonprincipalcomponentestimation,whichcanbeusedinoutliermining.Atthesametime,wehavedonesomeresearchonrelatedtheoriesandapplicationproblems.

  • 标签: 外露层采矿 基本成分估计 库克距离 数字化矿业 线性回归模型
  • 简介:Asemi-structureddocumenthasmorestructuredinformationcomparedtoanordinarydocument,andtherelationamongsemi-structureddocumentscanbefullyutilized.Inordertotakeadvantageofthestructureandlinkinformationinasemi-structureddocumentforbettermining,astructuredlinkvectormodel(SLVM)ispresentedinthispaper,whereavectorrepresentsadocument,andvectors'elementsaredeterminedbyterms,documentstructureandneighboringdocuments.TextminingbasedonSLVMisdescribedintheprocedureofK-meansforbriefnessandclarity:calculatingdocumentsimilarityandcalculatingclustercenter.TheclusteringbasedonSLVMperformssignificantlybetterthanthatbasedonaconventionalvectorspacemodelintheexperiments,anditsFvalueincreasesfrom0.65-0.73to0.82-0.86.

  • 标签: HTML语言 XML语言 半结构文件模型 版本开采 结构信息
  • 简介:Inthispaper,ARMiner,adataminingtoolbasedonassociationrules,isintroduced.Beginningwiththesystemarchitecture,thecharacteristicsandfunctionsaredis-cussedindetails,includingdatatransfer,concepthierarchygeneralization,miningruleswithnegativeitemsandthere-developmentofthesystem.Anexampleofthetool'sapplicationisalsoshown.Finally,someissuesforfutureresearcharepresented.

  • 标签: ARMiner 数据开采工具 机器学习
  • 简介:ThebackdoororinformationleakofWebserverscanbedetectedbyusingWebMiningtechniquesonsomeabnormalWeblogandWebapplicationlogdata.ThesecurityofWebserverscanbeenhancedandthedamageofillegalaccesscanbeavoided.Firstly,thesystemfordiscoveringthepatternsofinformationleakagesinCGIscriptsfromWeblogdatawasproposed.Secondly,thosepatternsforsystemadministratorstomodifytheircodesandenhancetheirWebsitesecuritywereprovided.Thefollowingaspectsweredescribed:oneistocombinewebapplicationlogwithweblogtoextractmoreinformation,sowebdataminingcouldbeusedtomineweblogfordiscoveringtheinformationthatfirewallandInformationDetectionSystemcannotfind.AnotherapproachistoproposeanoperationmoduleofwebsitetoenhanceWebsitesecurity.Inclusterserversession,Density-BasedClusteringtechniqueisusedtoreduceresourcecostandobtainbetterefficiency.

  • 标签: WEB 网络安全 数据挖掘 计算机网络 逻辑推理
  • 简介:Riversareoneofthemostessentialsourcesofsandandgravelsupplyforcivilworks.However,undesirableeffectsofirregularin-streammininghavebeenreportedonnaturalsources,environmentandinfrastructuresclosetorivers.Therefore,itisnecessarytofindtheeffectsofminingonriversinmoredetails.Thisresearchconcentratesonmining-pitmigrationphenomenonanditseffectsonthechannelbed.Thispaperreportsanexperimentalstudyonthemigrationofrectangularminingpitsandvariationoflongitudinalprofileinthechannelbedcomposedofratheruniformsediments.Differentvaluesofwidthsandlengthswereusedforpitwhilepitdepthsandflowvariableswerekeptconstant.Theresultsshowthatthemigrationspeedchangeswiththelength/widthratioofthepit.Themigrationspeedinconvectionperiodishigherthanthatindiffusionperiod.Inaddition,byincreasingthelengthorwidth,fillingrateofpitincreases,wheretheeffectofwidthismoreimportantthantheeffectofthelength.Alsoisreportedinthispaperafieldstudyonthechangesofthreepitsexcavatedatdifferentlocationsofariver.Somesimilaritiesbetweenthepitmigrationinthestraightreachoftheriverandthatoftheexperimentalworkisrealizedandpresented.

  • 标签: PIT migration SAND and GRAVEL mining
  • 简介:工作流执行的有时历史性的信息被需要分析企业过程。进程采矿为在执行捕获企业进程从事件日志瞄准析取信息。在这篇论文,一个过程采矿算法被建议基于是工作流逻辑和工作流语义的一个基于同步的模型的onSynchro网。Withthis采矿算法在容易基于模型,象不可见的任务那样的问题和短环的罐头被处理。一个过程采矿例子被举说明算法,并且评估也被给。

  • 标签: 工作流 过程采集 逻辑性 PETRI网
  • 简介:WiththedeliveryofagreatdealremotesensingdatatolandfromLandsatconstantly,RemoteSensingSatelliteGroundStationaccumulatesabundantsatelliteremotesensingdata.Forlackofeffectivedatamining(DM)andknowledgeDiscoveryfromDatabases(KDDtechnique)tothesedata,mostpartoftheinformationcannotbeusedefficiently.TechnicalinnovationandimprovementofthetraditionalDMandKDD,studyofthedataminingandKDDwillbothincreasetheinterpretationlevelandintelligentized,andmoreoverexploreandutilizetheremotesensinginformationatthemaximumdegree.BasedonthetraditionaldataminingandKDD,theauthorsprobedthetechnicalflowofDMandKDDoftheremotesensing,designedthesystematicalframeworkofmulti-sourcesremotesensingDM,putforwardaprototypeEstablishedabaseforfurtherexploringandsystem.ofmulti-sourcesremotesensingDMsystem.developingmulti-sourcesremotesensingDMsystem.

  • 标签: 数据提炼 遥感数据 知识发现 KDD
  • 简介:CitiesbasedonminingaredistinctivefromothercitiesinChina.Theirheavydependenceonminerals,arelativelyundiversifiedindustrialstructure,seriouslydamagedecologicalenvironmentandtheratherlowdegreeofopennesshaveallreducedtheircompetitiveness,andseverelyconstrainedandhinderedtheirsustainabledevelopment.Inthispapertheauthorswillstudymining-basedcitiesfromtheperspectiveofsustainabledevelopment,firstbyhavingacriticalreviewoftheirfeatures,andthenbyresearchingintostrategicoptionstosupporttheirsustainabledevelopment.

  • 标签:
  • 简介:Prosodiccontrolisanimportantpartofspeechsynthesissystem.Prosodicparameterschoicerightorwronginfluencesthequalityofsyntheticspeechdirectly.Atpresent,texttospeechsystemhaslesseffectivedescribetoreflectdatarelationshipsinthecorpus.Anewresearchapproach-dataminingtechnologytodiscoverthoserelationshipsbyassociationrulesmodelingispresented.Andanewalgorithmforgeneratingassociationrulesofprosodicparametersincludingpitchparametersanddurationparametersfromcorpusisdeveloped.Theoutputrulesimprovethecorrectnessofsyllablechoiceintexttospeechsystem.

  • 标签: 物质文明 坚持 优质 体育 建筑
  • 简介:Expressedsequencetags(ESTs)arewidelyusedingenesurveyresearchtheseyears.TheESTPipelineSystem,softwaredevelopedbyHangzhouGenomicsInstitute(HGI),canautomaticallyanalyzedifferentscalarESTsequencesbysuitablemethods.Alltheanalysisreports,includingthoseofvectormasking,sequenceassembly,geneannotation,GeneOntologyclassification,andsomeotheranalyses,canbebrowsedandsearchedaswellasdownloadedintheExcelformatfromthewebinterface,savingresearcheffortsfromroutinedataprocessingforbiologicalrulesembeddedinthedata.

  • 标签: 表达序列标签 管道系统 信号加工 EST
  • 简介:ThereisabundanceofMercurymineresurcesintheFanjinshanMountain,Miningmercuryhasalonghistorythere,TheconcentrationofgeseousHgproducedinsmeltingHereaches20-50mg/m^3inthetailgas.Becausemercuryelementisaneasilytransferringmicroelement,thepapertalksabouttheeffectofmercuryinHgmininginGuizhouProvinceonalpinesoil,analysesHgcontentinalpinesoilat2000mofrelativeelevationintheHgminingarea,andexploresforcausesoftheHgpollution.

  • 标签: 水银 汞矿区 高山土 湿性沉降 尾气 汞沉降
  • 简介:Somefactorssuchascontinuouspricefallingofrareearthconcentrates,increaseinproductioncostandincreasinginvestmentinsafetyandenvironmentalprotectionpreventedthedevelopmentofminingindustry.Tosafeguardtheinterestsofminingcompaniesandfacilitatethehealthydevelopmentofrareearthindustry,REMiningSocietyofMianningCountyheldageneralmeetingrecently.FollowingresolutionswerepassedonJuly13th:1.SinceJuly14th2005,thelowestprotectivemarketpricesofrareear...

  • 标签: