简介:Traditionally,SQLquerylanguageisusedtosearchthedataindatabases.However,itisinappropriateforend-users,sinceitiscomplexandhardtolearn.Itistheneedofend-user,searchingindatabaseswithkeywords,likeinwebsearchengines.Thispaperpresentsasurveyofworkonkeywordsearchindatabases.ItalsoincludesabriefintroductiontotheSEEKERsystemwhichhasbeendeveloped.
简介:Althoughtheproteinsequence-structuregapcontinuestoenlargeduetothedevelopmentofhigh-throughputsequencingtools,theproteinstructureuniversetendstobecompletewithoutproteinswithnovelstructuralfoldsdepositedintheproteindatabank(PDB)recently.Inthiswork,weidentifyaproteinstructuraldictionary(Frag-K)composedofasetofbackbonefragmentsrangingfrom4to20residuesasthestructural"keywords"thatcaneffectivelydistinguishbetweenmajorproteinfolds.Wefirstlyapplyrandomizedspectralclusteringandrandomforestalgorithmstoconstructrepresentativeandsensitiveproteinfragmentlibrariesfromalargescaleofhigh-quality,non-homologousproteinstructuresavailableinPDB.Weanalyzetheimpactsofclusteringcut-offsontheperformanceofthefragmenthbraries.Then,theFrag-KfragmentsareemployedasstructuralfeaturestoclassifyproteinstructuresinmajorproteinfoldsdefinedbySCOP(StructuralClassificationofProteins).OurresultsshowthatastructuraldictionarywithN4004-to20-residueFrag-KfragmentsiscapableofclassifyingmajorSCOPfoldswithhighaccuracy.
简介:Basedonthecovariantprolongationstructuretechnique,weconstructtheintegrablehigher-orderdeformationsofthe(2+1)-dimensionalHeisenbergferromagnetmodelandobtaintheirsu(2)×R(λ)prolongationstructures.ByassociatingthesedeformedmultidimensionalHeisenbergferromagnetmodelswiththemovingspacecurveinEuclideanspaceandusingtheHasimotofunction,wederivetheirgeometricalequivalentcounterparts,i.e.,higher-order(2+1)-dimensionalnonlinearSchrdingerequations.