BLASTX nr result
ID: Atractylodes21_contig00004772
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00004772
         (2576 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15120.3| unnamed protein product [Vitis vinifera]           278   4e-72
emb|CBI27291.3| unnamed protein product [Vitis vinifera]           266   3e-68
ref|XP_002514656.1| hypothetical protein RCOM_1469830 [Ricinus c...   262   4e-67
ref|XP_004138319.1| PREDICTED: uncharacterized protein LOC101218...   243   2e-61
ref|XP_003528223.1| PREDICTED: uncharacterized protein LOC100801...   238   5e-60

>emb|CBI15120.3| unnamed protein product [Vitis vinifera]
          Length = 676

 Score = 278 bits (712), Expect = 4e-72
 Identities = 227/702 (32%), Positives = 320/702 (45%), Gaps = 98/702 (13%)
 Frame = +3

Query: 663  EMDIWMVAVAAGAGYIARHFKKL----KQASGQIASESSLEGSFSVXXXXXXXXXXXXXX 830
            +MDIW+V AA A Y+++ ++K+ + +S S E
Sbjct: 2    KMDIWIVVAAASAVYLSKQWQKILRHRENSSESFCGTSDFEKPNPASTWWLPDKKSCP-- 59

Query: 831  FRRLAKRENLRQD-DFQFRN------------EGVSSTEEASTSEFDGEKGVISGNREGG 971
            R LA R+ L +D + N +G + E AS DGE V GN +
Sbjct: 60   LRSLAWRKKLDEDISIEKENISKGEVSEMTQLDGTPAAEVASACGADGENLVSLGNHKDC 119

Query: 972  SVFSASSLFPGFIGSDGVLGDVEGVRVS---EGMGDLLHKXXXXXXXXXXXXXXXXXXA- 1139
            +V SSL GF +D + + +G+R++ + GD +H+
Sbjct: 120  NVLFISSLANGFSRNDNLQENEDGIRMTGDIDSSGDSVHEPSTVNVGPFLGLLRKGRALR 179

Query: 1140 --------IRPLSSLESCLMAQLYTEHVEMKEYMSSSFASLSMQTVRPFLVTDGSKVINR 1295
            RP+SSLESCLMAQLY EH EM+EY+ SS S TVRP +DGS++IN+
Sbjct: 180  TKRLHARFARPISSLESCLMAQLYNEHAEMEEYVLSSLQSPPAPTVRPLFTSDGSRIINK 239

Query: 1296 ASGDA---FIAQNKKNL--QIYSEENSAFSGVPSPPNISSLEI---------RGKDIANS 1433
            ASGD+ +I NL + Y E GVPS P + S+E RG+
Sbjct: 240  ASGDSPSMWIGTGDDNLHKEAYLEGKETAIGVPSLPKLRSMEQPRKMKLKTGRGQRQRFG 299

Query: 1434 RANTTVAAKHFNLRRGSSDAALLLCLGMSFGIISSFMENRREVEKLNRLLKQTQSLVQDL 1613
            ++ + KHF+ + GS D +L CLG+S G+I S + N+REV+++ LL++T++LVQDL
Sbjct: 300  SSSKMINRKHFHSQGGSPDGMVLFCLGISIGMIFSQISNKREVDEMKELLQRTENLVQDL 359

Query: 1614 EEELEMKDSLTMKEVTIEDHESQRIGGGSSNDGSPHSVSYEHNQEQSTNSNNKESVVQKG 1793
            +EELEMKDSLT+KE+ ED+ESQ Y H+ NS
Sbjct: 360  QEELEMKDSLTVKELANEDYESQ----------------YTHDHAFEENS---------- 393

Query: 1794 KDESFSKIXXXXXXXXXXXXXXXXXXXXXRRISNLVELDPDFEPNVAQGELRADMLDTQT 1973
            E SKI R++SNLVELDPD +VAQGEL+ +M+ QT
Sbjct: 394  --ELMSKIEAELEAELERLELSMNSSCLERKLSNLVELDPDIVADVAQGELKVEMVGRQT 451

Query: 1974 NTNEDGR---GSSTTTHSANYAVSPRELSLRLHEVLESQ----------XXXXXXXXXXX 2114
            +D S++TTHSANYAVSPRELSLRLHEV++S+
Sbjct: 452  GGQDDSYQDVSSTSTTHSANYAVSPRELSLRLHEVIQSRLEERVKELETELQNSNRKLQF 511

Query: 2115 XXSKRTQNQKSWSSSDAG-------------DENPVDEPVVLNLSGEALDAYNEACNEFA 2255
            S+R + ++ S G + NP+ + V+NL DAYNE +
Sbjct: 512  MESERLTCYEEFADSILGSSTQRTPIALPVEESNPIAQLSVVNLPE---DAYNEVEDFMK 568

Query: 2256 KFXXXXXXXXXXXXGKGRLLQLLDVEGTPMYQKE-------------------------- 2357
            + L +G+ ++ E
Sbjct: 569  ETQSESEDFPSTAYRNNHQDGLHQFDGSVLWDHEDLLKRQRRTFIPQEVRTLEEHILMDR 628

Query: 2358 ---SGTSGDEEDEMEKLLIKHIVEKARQGSAVVLNAQRALFS 2474
            + DE D+ KLLI+ IVEK+RQGS +VLN QRALFS
Sbjct: 629  ELSGDENNDEADDEMKLLIQQIVEKSRQGSPIVLNVQRALFS 670

>emb|CBI27291.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score = 266 bits (679), Expect = 3e-68
 Identities = 223/668 (33%), Positives = 302/668 (45%), Gaps = 65/668 (9%)
 Frame = +3

Query: 666  MDIWMVAVAAGAGYIARHFKKLKQASGQIASESSLEGSFSVXXXXXXXXXXXXXXFRRLA 845
            MD+W+VA AAGAGY+A++++ S + SL G S FRRL
Sbjct: 1    MDLWVVAAAAGAGYLAKYWQNF---SNEKEGSLSLSGE-SKPRNLLQQIQERNGPFRRLR 56

Query: 846  KRENLRQDDFQFRNEGVS---STEEASTSEFDGE-KGVISGNREGGSVFSASSLFPGFIG 1013
            +++ D N G +T AST++FDG+ K SGN +
Sbjct: 57   QKQLGIDDILGSENLGEIGDLATGVASTNDFDGKGKAEFSGNSDN--------------- 101

Query: 1014 SDGVLGDVEGVRVSEGMGDLLHKXXXXXXXXXXXXXXXXXXAIRPLSSLESCLMAQLYTE 1193
            L DV L + +I+PL SLESC+ +QLY E
Sbjct: 102  ----LTDV-----------LTTQETRFKRHKAFRSRRPLEYSIKPLDSLESCVASQLYRE 146

Query: 1194 HVEMKEYMSSSFASLSMQTVRPFLVTDGSKVINRASGDAFIAQ-----NKKNLQ----IY 1346
            H M EY SS S TVRP L TDGS+VI+R G +F A+ KK L +
Sbjct: 147  HENMNEYEFSSLPSPYTPTVRPLLATDGSRVISRVGGGSFRAELQFESEKKKLHKGDGTH 206

Query: 1347 SEENSAFSGVPSPPNISSLEIRGKDIANSRANTTVAAKHFNLR--------RGSSDAALL 1502
            EEN G+P P+I SL++ K N R + +R +GS + L
Sbjct: 207  LEENETLLGLPPLPSIGSLQLPWKLKQNVRKGRVQRSSSPGIRASSEAFHSQGSPNGMFL 266

Query: 1503 LCLGMSFGIISSFMENRREVEKLNRLLKQTQSLVQDLEEELEMKDSLTMKEVTIEDHESQ 1682
            LG++ G++++ + ++REV+KLN LKQT++LV+DL EELEMKD LT+K++T E+ ++Q
Sbjct: 267  FFLGITIGVMATLVASKREVDKLNEQLKQTENLVEDLHEELEMKDFLTVKDLTNEEPDNQ 326

Query: 1683 RIGGGSSNDGSPHSVSYEHNQEQSTNSNNKESVVQKGKDESFSKIXXXXXXXXXXXXXXX 1862
            N+ N E + SKI
Sbjct: 327  -------------------------NTENSELI---------SKIEAELEAELERLEQNM 352

Query: 1863 XXXXXXRRISNLVELDPDFEPNVAQGELRADMLDTQTNTNEDGR---GSSTTTHSANYAV 2033
            R+S+ V LDPDF +V QGELR D L+ Q D S+T HSANYAV
Sbjct: 353  KASSSLERLSDFVGLDPDFIADVVQGELRVDTLNRQPGEQSDSDLDVSGSSTHHSANYAV 412

Query: 2034 SPRELSLRLHEVLESQXXXXXXXXXXXXXSKR-------------TQNQKSWSSSDAGDE 2174
            SPRELSLRLHEV++S+ + + T+ S S + E
Sbjct: 413  SPRELSLRLHEVIQSRLEARIIELEAALQNNQKRPHSMRQESIISTRELYSEVGSLSTQE 472

Query: 2175 NPV------DEPVVLNLSGEALDAYNEACNEFAKFXXXXXXXXXXXXGKGRLLQLLDVEG 2336
            +P+ + P+V+NLSGE L+AYNEAC E +K G G + D+ G
Sbjct: 473  SPIFMDEGTNHPMVINLSGETLEAYNEACEEISKM-----LRTGGNGGNGGSIMHQDITG 527

Query: 2337 TPMYQK----------ESGTS------------GDEEDEMEKLLIKHIVEKARQGSAVVL 2450
            + E TS DEEDEM KLLIK IVEK RQGSA VL
Sbjct: 528  ERWLSRNLVCDMIRSWEERTSRSWGSNEVGESEDDEEDEMGKLLIKQIVEKTRQGSAAVL 587

Query: 2451 NAQRALFS 2474
            +AQR L+S
Sbjct: 588  HAQRMLYS 595

>ref|XP_002514656.1| hypothetical protein RCOM_1469830 [Ricinus communis]
 gi|223546260|gb|EEF47762.1| hypothetical protein RCOM_1469830 [Ricinus communis]
          Length = 677

 Score = 262 bits (669), Expect = 4e-67
 Identities = 230/698 (32%), Positives = 326/698 (46%), Gaps = 95/698 (13%)
 Frame = +3

Query: 666  MDIWMVAVAAGAGYIARHFKKLKQASGQIASESSLEGSFSVXXXXXXXXXXXXXXFRRLA 845
            MD+W+V AA AGYIA++++ + + + S E S ++L
Sbjct: 1    MDLWVVGAAAAAGYIAKYWQNVSRDKDSL---SGSELSKCEKHENPSFPLCRSTRRKKLT 57

Query: 846  KRENLRQDDFQFRNEGVSSTEEASTSEFDGEKGVISGNREGGSVFSASSLFPGFI----- 1010
            N + +S E S+S + ++G SG+ + ++ S SS G
Sbjct: 58   GAANTGEKFSDMYRLDSASELEVSSSAYAEKRGS-SGDYDESNILSLSSPSSGVTMNEIL 116

Query: 1011 ---GSDGVLGDVEGVRVSE---GMGDLLHKXXXXXXXXXXXXXXXXXXAIRPLSSLESCL 1172
            GSD L D G G D H+ I+PL+SLESCL
Sbjct: 117  IGNGSDNGLRDDTGDNSGSPCTGEMDSFHESKMKTSSLRTKNIHGHF--IKPLNSLESCL 174

Query: 1173 MAQLYTEHVEMKEYMSSSFASLSMQTVRPFLVTDGSKVINRASGDAFIAQ-NKKNLQIYS 1349
            MAQLY EH +M+EY+ S F S S + +RP LVTDG+++INR + D+F A + ++
Sbjct: 175  MAQLYKEHTKMEEYVLSVFPSPSKK-MRPLLVTDGNQIINRLNNDSFSASIGTDDNRLPK 233

Query: 1350 EENSAFSGVPSPPNISSLEI------RGKDIANSRANTTVAA---KHFNLRRGSSDAALL 1502
            +EN+ GVP P IS + + +++ N R + + A +HF+
Sbjct: 234  DENAC--GVPPLPKISKFNVPKKIKFKAREVCNGRFSNSHKAGSGRHFH----------- 280

Query: 1503 LCLGMSFGIISSFMENRREVEKLNRLLKQTQSLVQDLEEELEMKDSLTMKEVTIEDHESQ 1682
            + GI SS + +RREV+KL LLKQT++LVQDL+EELEMKDSLT+KE+ E ESQ
Sbjct: 281  ---SQNVGIASSLLASRREVDKLQDLLKQTENLVQDLQEELEMKDSLTVKELADEKCESQ 337

Query: 1683 RIGGGSSNDGSPHSVSYEHNQEQSTNSNNKESVVQKGKD--ESFSKIXXXXXXXXXXXXX 1856
            S + + + + N STN++ KES ++ +D E SKI
Sbjct: 338  DTCENSLHYRALNPLFPLQNVNNSTNNDGKESQSERAEDNSEDMSKIEAELEAELERLGL 397

Query: 1857 XXXXXXXXRRISNLVELDPDFEPNVAQGELRADMLD----TQTNTNEDGRGSSTTTHSAN 2024
            R +S+ VELDPD ++AQGELR DM + Q ++ D G + TTHS N
Sbjct: 398  NMNKSSLERILSDGVELDPDVIADLAQGELRVDMFNGRAVCQPESDRDKSG-TPTTHSGN 456

Query: 2025 YAVSPRELSLRLHEVLESQXXXXXXXXXXXXXS-----------------KRTQNQKSWS 2153
            YAVSP+ELSLRLHEV+ESQ + K + N+ ++S
Sbjct: 457  YAVSPQELSLRLHEVIESQLEERVKQLEMALQNSQQKVLLMESEHKNVWRKFSNNELTYS 516

Query: 2154 SSDAGDENPVDE--------PVVLNLSGEALDAYNEACNEFAK----------------- 2258
            S +E+P+ E P+V+NLSG+AL+AYNEA E K
Sbjct: 517  SD---EESPITEENINSTAQPLVMNLSGDALEAYNEAYEELMKINESEEDDSPSTAYESM 573

Query: 2259 --FXXXXXXXXXXXXGKGRLLQLLDVEGTPMYQKESG--------------------TSG 2372
            + G + +L + + P ++ G TS
Sbjct: 574  HPYNQIMLQCCQDGATNGSMTRLRNNKEIPSSRESCGSQLKVAIKHSQGIQKLLYDDTSE 633

Query: 2373 DE----EDEMEKLLIKHIVEKARQGSAVVLNAQRALFS 2474
            DE DEMEK LIK IVEK R+GS VVLNAQR LFS
Sbjct: 634  DENSGSSDEMEKQLIKQIVEKTRKGSPVVLNAQRLLFS 671

>ref|XP_004138319.1| PREDICTED: uncharacterized protein LOC101218206 [Cucumis sativus]
 gi|449477525|ref|XP_004155048.1| PREDICTED: uncharacterized protein LOC101225278 [Cucumis sativus]
          Length = 683

 Score = 243 bits (619), Expect = 2e-61
 Identities = 220/698 (31%), Positives = 318/698 (45%), Gaps = 95/698 (13%)
 Frame = +3

Query: 666  MDIWMVAVAAGAGYIARHFKKLKQ---ASGQIASESSLEGSFSVXXXXXXXXXXXXXXFR 836
            MD+W+VA AAGAG +A++++KL + S Q++S +S G F
Sbjct: 1    MDLWVVATAAGAGCLAKYWQKLLKDGNTSSQMSSGNSSNGELG----------SLDHPFH 50

Query: 837  RLAKRENLRQDDFQFRNEGVSSTEE-------ASTSEFDGEKGVISGNREGGSVFSASSL 995
            + +R D E ++ + AS S FD EK GN + + S S+L
Sbjct: 51   QTEQRTKASGDIHAGEEEVLNGRDYVGSRFNVASISGFDCEKMDNLGNCQEYNGLSVSNL 110

Query: 996  ----------FPGFIGSDGVLGDVEGVRVSEGMGDLLHKXXXXXXXXXXXXXXXXXXA-- 1139
            P G + V V++ M D L
Sbjct: 111  PLELSTTTSNDPQTFGHRSSVN----VNVNDNMIDQLPCSSSRELNCFRPTMRKIGSLRH 166

Query: 1140 -------IRPLSSLESCLMAQLYTEHVEMKEYMSSSFASLSMQTVRPFLVTDGSKVINRA 1298
            IRPLSSLESC+++ LY +HVEM+EY SF S S T+R F+V DG+++++R
Sbjct: 167  KQSYGRFIRPLSSLESCVLSHLYKDHVEMEEYFLHSFQSPSKSTMRRFVVNDGTRIVSRR 226

Query: 1299 SGDAFIAQNKKNLQIYSEE-----NSAFSGVPSPPNISSLEIRGK-DIANSRAN---TTV 1451
            D+F Q + + +E N G+P P I SL+ DI R ++
Sbjct: 227  VRDSFSVQVDMDASNFRKEPFIGKNRKAYGIPLLPKIQSLKTSEMIDINGGRRQSGASSA 286

Query: 1452 AAKHFNLRRGSSDAALLLCLGMSFGIISSFMENRREVEKLNRLLKQTQSLVQDLEEELEM 1631
            + H + D +L CLG+S G+I SFM+N+RE++KL LL+ T++LVQDL+EELEM
Sbjct: 287  SEMHNKKFLHAKDRMILFCLGISVGLI-SFMQNKREIDKLKELLRHTENLVQDLQEELEM 345

Query: 1632 KDSLTMKEVTIEDHESQRIGGGSSNDGSPHSVSYEHNQEQSTNSNNKESVVQKGKD--ES 1805
            KDSLT+KE++ E+ ES I S G + N S S++KE ++ +S
Sbjct: 346  KDSLTVKELSNENCESVGISENSFFGGK------DQNLNPSAKSDDKELFKPNPEEDSDS 399

Query: 1806 FSKIXXXXXXXXXXXXXXXXXXXXXRRISNLVELDPDFEPNVAQGELRADMLDTQTNTNE 1985
            SKI +R S+L ELD +F + ++GELRADM+ + +
Sbjct: 400  LSKIEAELEAELQRLGLNTETSSTDKRFSDLHELDQEFTVDFSEGELRADMISELSPKLQ 459

Query: 1986 DGRGSSTTTHSANYAVSPRELSLRLHEVLESQ----------XXXXXXXXXXXXXSKRTQ 2135
            + +S T S NY VSP ELS+RLHEV++S+ +KRT
Sbjct: 460  RNQDASEFTSSGNYTVSPWELSVRLHEVIQSRLEARVRELETALENSERRLHHIEAKRTD 519

Query: 2136 NQKSWSSSD----AGDENPVDEPVVLNLSGEALDAYNEACNEFAKF-------------- 2261
            + K ++ ++ + +E+ +P+V+NLSGEALDAYN+A +E
Sbjct: 520  SWKEFTHNEMLHSSSEESLTAQPLVMNLSGEALDAYNDAYSELMDMDDSEEETIDSPSTG 579

Query: 2262 -------------XXXXXXXXXXXXGKGRLLQLLDVEGTPMYQKESGT----------SG 2372
            G L ++L E K GT S
Sbjct: 580  DESKHSESQTTVNSHPFSVQNGKRNGSISLGRILVEEKMKNSYKMFGTMKGESNEIDGSE 639

Query: 2373 DE----EDEMEKLLIKHIVEKARQGSAVVLNAQRALFS 2474
            DE +DE+EK LIK IVEK R GS VV NAQR LFS
Sbjct: 640  DESSDYDDEIEKQLIKQIVEKTRMGSPVVRNAQRWLFS 677

>ref|XP_003528223.1| PREDICTED: uncharacterized protein LOC100801395 [Glycine max]
          Length = 598

 Score = 238 bits (608), Expect = 5e-60
 Identities = 206/607 (33%), Positives = 290/607 (47%), Gaps = 78/607 (12%)
 Frame = +3

Query: 888  EGVSSTEEASTSEFDGEKGVISGNREGGSVFSASSLF-------------------PGFI 1010
            +G+ + E AST DGEK N V S S+L GF+
Sbjct: 2    DGLLTGEVASTRGIDGEKLRHFRNYNKHDVLSLSNLAMPLSPYDDVDDVDDGNEQSSGFV 61

Query: 1011 GSDGVL---GDVEGVRVSEGMGDLLHKXXXXXXXXXXXXXXXXXXAIRPLSSLESCLMAQ 1181
            G G L E V ++ G HK A RPLSSLESC MAQ
Sbjct: 62   GDHGFLFPDSSAEVVPINNSSG---HKTFLKMKHLSGR-------ANRPLSSLESCFMAQ 111

Query: 1182 LYTEHVEMKEYMSSSFASLSMQTVRPFLVTDGSKVINRASGDAF-IAQNKKNLQIYSE-- 1352
            LY EH E +EY+ SS +S S T R FLV++GS++INRA+ + F + K ++ E
Sbjct: 112  LYKEHAETEEYVFSSLSSPSTAT-RSFLVSNGSQIINRANNNLFSVPIGSKEYKLQKEAG 170

Query: 1353 --ENSAFSGVPSPPNISSLE--------IRGKDIANSRANTTVAAKHFNLRRGSSDAALL 1502
            ++ GV S P I S+ + G+ S ++ + K +R D L
Sbjct: 171  QVKDENVFGVSSLPKIISVNDTKETFNAVIGRSRRLSSSDNVSSGKI--IRTQQFDKTFL 228

Query: 1503 LCLGMSFGIISSFMENRREVEKLNRLLKQTQSLVQDLEEELEMKDSLTMKEVTIEDHESQ 1682
            LG+SFG+I+S M N+RE++KL LLKQ ++LVQDL+EELEMKDS+T+KE+ E++ S
Sbjct: 229  FSLGISFGMITSIMANKREIDKLRELLKQNENLVQDLQEELEMKDSMTVKELQNENYGSL 288

Query: 1683 RIGGGSSNDGSPHSVSYEHNQEQSTNSNNKESVVQK--GKDESFSKIXXXXXXXXXXXXX 1856
            SS D + S E + + S ++KES QK ES SKI
Sbjct: 289  DTLDHSSYDKELNEFSPEKHVDNSPRIDSKESYHQKVEQSSESMSKIEAELEAELERLGL 348

Query: 1857 XXXXXXXXRRISNLVELDPDFEPNVAQGELRADMLDTQTN----TNEDGRGSSTTTHSAN 2024
            ++S LVELDP+F + +QGELRADM+ + + +NED T AN
Sbjct: 349  DMNASSLEGKLSELVELDPEFVADFSQGELRADMVSGKDSLHPKSNED--AGDATPLPAN 406

Query: 2025 YAVSPRELSLRLHEVLESQXXXXXXXXXXXXXS-----------KRTQNQKSWSSSDAGD 2171
            YAV P ELSLRLHEV++SQ + + + QK+ S S D
Sbjct: 407  YAVLPHELSLRLHEVIQSQLEQRVKELEIALENSQRKVQLFESKQESYLQKASSFSKEND 466

Query: 2172 E-NPVDEPVVLNLSGEALDAYNEACNEFAKF----------------------XXXXXXX 2282
            + + + +P++LNLSGEALDAYNEA E K
Sbjct: 467  DCDLMSQPLILNLSGEALDAYNEAYEELIKINDSEENSPLGIHDSSDHQEDSHANDWHAL 526

Query: 2283 XXXXXGKGRLLQLLDVEGTPMYQKE---SGTSGDEEDEMEKLLIKHIVEKARQGSAVVLN 2453
            G ++ +EG+ +Y+ + T G ++ E+E+ LI+ IVE+ ++GS V N
Sbjct: 527  GVQHGGANGSSKVTMMEGS-IYELDGTADETCGFDDTEVEQQLIRQIVERTKKGSPVFKN 585

Query: 2454 AQRALFS 2474
            AQR L+S
Sbjct: 586  AQRILYS 592