BLASTX nr result
ID: Dioscorea21_contig00003377
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00003377 (1755 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|2...   154   5e-35
ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arab...   154   7e-35
ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820...   149   3e-33
gb|AAD15341.1| hypothetical protein [Arabidopsis thaliana] gi|72...   149   3e-33
ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789...   145   4e-32

>ref|XP_002321060.1| predicted protein [Populus trichocarpa] gi|222861833|gb|EEE99375.1| predicted protein [Populus trichocarpa]
Length = 549

Score = 154 bits (390), Expect = 5e-35
Identities = 152/536 (28%), Positives = 230/536 (42%), Gaps = 36/536 (6%)
Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFP----ASWTPCT 1473
            MGF V L ++FPQVD +L+AVA +HS+D + A + +L +V+PS A PC
Sbjct: 1    MGFSTVYKCLTDVFPQVDARILKAVAIEHSKDADIAAEVVLSEVIPSLSRHSAAPSPPCE 60

Query: 1472 SNSSDWGHQGVEEAHAQWPSESGLLSSGIACNSDFQVNDSMTCMDKPVGRDAEYETKTNG 1293
            S G E E+GL ++ + ++ ++ G+ T+G
Sbjct: 61   DTSPSLPLDGQTEQE----EETGLRHRQVSLVKSVRSSEPGLIAEEDDGKTE----LTSG 112

Query: 1292 MGYPTNSTEANSNMHPKERFVEFGVCLPSDHTNDFK-----VNSMNETLAGPDLQEVSAC 1128
            + ++ + N P + +PS D + + E G ++VS
Sbjct: 113  VNDGDSTHQENRQDQP--------IVVPSGANADTNQLQGHIETEQEEETGLRHRQVSLV 164

Query: 1127 ----SAGPTSIHENSAMQDKYT----DTATSHGAQDPFASILNKSIEHLKEFPSQREIEN 972
            S+ P I E + + T D ++H ++ S + Q IE+
Sbjct: 165  KSVRSSEPGLIAEEDDGKTELTGGVNDGDSTHQEIRQDQPVVVPSGANADTNQLQGHIES 224

Query: 971  DDNL---SSQGDIGSFDVNGQSSSIASVTDLFEG-------------IKYLEDLTIDAKN 840
            D+ + Q G + I DL G I+ LE++ AK+
Sbjct: 225  DELILLGKPQHQEGISQPGSSQTLILVSNDLLLGVNAENMNSKQYRQIELLEEIVEAAKD 284

Query: 839  SKVILTSAMESTINMEKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDM 660
            +K L SAMES +NM KEVEL E A+QAK EA+ G D LV V NDM
Sbjct: 285  NKKTLFSAMESVMNMMKEVELQEISAEQAKEEAARGGLDILVEVEKLKQMLVHAKEANDM 344

Query: 659  HAEEVYGERSMLAAEAQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXX 480
            HA EVYGE+++LA E + LQ+ LLS+S+ERD +L I+DE+R TLE+R
Sbjct: 345  HAGEVYGEKAILATEVRELQARLLSLSDERDNALAILDEMRQTLESRLAAAEELRKTAEL 404

Query: 479  XXXXXXXXXXXXXADGRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIAS 300
            A+ + + LM+ G ++D L+ EI+
Sbjct: 405  EKLEKEETARNALAEQEIIMEKVVQESKILQKEAEENAKLQEFLMDRGCVVDTLQGEISV 464

Query: 299  ILKDVMTLKERVDGHIPLSRSASHASVTSNLVSLSTELSD--SQIAPSEG-TSRNP 141
            I +DV LKER D +PLS+S S + + L S + + S +A G TS P
Sbjct: 465  ICQDVRLLKERFDERVPLSKSVSSSQTSCILASSGSSIKSMASNLAAETGETSELP 520

>ref|XP_002874890.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata] gi|297320727|gb|EFH51149.1| hypothetical protein ARALYDRAFT_490272 [Arabidopsis lyrata subsp. lyrata]
Length = 559

Score = 154 bits (389), Expect = 7e-35
Identities = 135/516 (26%), Positives = 231/516 (44%), Gaps = 31/516 (6%)
Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFPASWTPCTSNSS 1461
            MG++ V +L E+FPQ+D +L+AVA +H +D N A ++ +++P F N +
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDANEAAAVVVSEIVPFF-------YPNLA 53

Query: 1460 DWGHQGVEEAHAQWPSE-SGLLSSGIACNSDFQVNDSMTCMDKPVGRDAEYETKT----- 1299
            D Q P++ + +G+ S+ S + P+ D ++E++
Sbjct: 54   DNSTQPENRTPGNVPNKVERAMQNGVLSGSE--TGSSSSSGSIPLAVDCDHESRAPITES 111

Query: 1298 ----NGMGY--PTNSTEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNET--------- 1164
            N + + P + SN E + S++ F+ + + +
Sbjct: 112  ISSRNQLTHVMPNVDLDIQSNAKIGLSGSEESGVVSSENPVSFQAGAKSTSHGCQGVGFH 171

Query: 1163 LAGPDLQEVSACSAGPTSIH------ENSAMQDKYTDTATSHGAQDPFASILNKSIEHLK 1002
            + G + E S S ++H +NSAM K G+ D + S+ ++
Sbjct: 172  ITGSNQAEASTSSESEDAVHKLVYPADNSAMTQKSPPLQIRFGSIDIVNETSSGSLA-VE 230

Query: 1001 EFPSQREIENDDNLSSQGDI----GSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSK 834
            ++ N +++S+G + G ++ G SS+ S + I +LE + DAK++K
Sbjct: 231  NSDAELSGSNLVDVTSKGSLAVENGDPELVGAFSSVVSRSTQGCNIVHLEQIIEDAKSNK 290

Query: 833  VILTSAMESTINMEKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHA 654
            L + MES +N+ +EVEL E+ A++AK +AS G DTL V NDM A
Sbjct: 291  KTLFTVMESIMNLMREVELQEKDAEKAKEDASRGGFDTLDKVEELKKMLEHAKEANDMDA 350

Query: 653  EEVYGERSMLAAEAQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXX 474
            EVYGERS+L E L++ LL++S ERDKSL ++DE+R LE R
Sbjct: 351  GEVYGERSILTTEVNELENRLLNLSEERDKSLSVLDEMREVLEIRLAAALEIKNAAEQEK 410

Query: 473  XXXXXXXXXXXADGRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASIL 294
            A+ + LM+HG+I+D L+ EI+ I
Sbjct: 411  QEKEGSARMAFAEQEAIMEKVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVIC 470

Query: 293  KDVMTLKERVDGHIPLSRSASHASVTSNLVSLSTEL 186
            +D+ LKE+ D +PLS+S + + + L S ++ +
Sbjct: 471  QDIRHLKEKFDNRVPLSQSITSSQTSCKLASSASSM 506

>ref|XP_003536443.1| PREDICTED: uncharacterized protein LOC100820331 [Glycine max]
Length = 546

Score = 149 bits (375), Expect = 3e-33
Identities = 144/526 (27%), Positives = 231/526 (43%), Gaps = 17/526 (3%)
Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLP----SFPASWTPCT 1473
            MGF V L EIFPQVD +L+AVA +H +D + A +L +V+P PA+ P
Sbjct: 1    MGFNSVYRNLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVLAEVIPFMSKKLPAAIPPQH 60

Query: 1472 SNSS-------DWGHQGVEEAHAQWPSE-----SGLLSSGIACNSDFQVNDSMTCMDKPV 1329
            ++ + +G H Q + S LS+G CNS + MD
Sbjct: 61   NDHGAPLDVEVESEEEGNRLRHCQRVDDVNVGPSSTLSNG--CNSKDDT-EKFLGMDDIK 117

Query: 1328 GRDAEYETKTNGMGYPTNS-TEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNETLAGP 1152
            D + N +G N + SN +E E P D + ++S ++ P
Sbjct: 118  ELDIFQNAEDNFIGETLNEIAQEMSNGFIQEEDNENFERQPVDFDCENLISSADDYDVTP 177

Query: 1151 DLQEVSACSAGPTSIHENSAMQDKYTDTATSHGAQDPFASILNKSIEHLKEFPSQREIEN 972
            + C + + A + + T + ++D S L+ S ++EN
Sbjct: 178  S-HRLEECETYLIELESSEAQEVCHVQGDTLN-SKDSLQSELDAGSSTAGGNTS--DVEN 233

Query: 971  DDNLSSQGDIGSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSKVILTSAMESTINME 792
            D+ S G Q S ++ + DL LE++ +AK +K L S+MES IN+
Sbjct: 234  DNGAKSAGS--------QYSQVSRI-DL------LEEIIDEAKTNKKTLFSSMESLINLM 278

Query: 791  KEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAEA 612
            +EVE+ E+ A+QA +EA+ G + L + NDMHA EVYGE+++LA E
Sbjct: 279  REVEVQEKAAEQANMEAATGGSNILARIEEYKTMLVQAKEANDMHAGEVYGEKAILATEL 338

Query: 611  QGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXADG 432
            + LQS LL +S+ERDKSL I+DE+R+ LE R +
Sbjct: 339  KELQSRLLGLSDERDKSLAILDEMRHILEERLAAAEESRKAAEQQKLEKEESARKALVEQ 398

Query: 431  RLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGHI 252
            L++ G+++D+L+ EI+ I +D+ LKE+ D ++
Sbjct: 399  ERLVEMVVHESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANL 458

Query: 251  PLSRSASHASVTSNLVSLSTELSDSQIAPSEGTSRNPHSDGVSVSS 114
            PLS+S + + + L S S + S+ S + S G+ +S
Sbjct: 459  PLSKSFTSSQTSCKLASSG---SSHKTLASDAGSDHSESSGIRKTS 501

>gb|AAD15341.1| hypothetical protein [Arabidopsis thaliana] gi|7269773|emb|CAB77773.1| hypothetical protein [Arabidopsis thaliana]
Length = 539

Score = 149 bits (375), Expect = 3e-33
Identities = 133/503 (26%), Positives = 223/503 (44%), Gaps = 18/503 (3%)
Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLPSFPASWTPCTSNSS 1461
            MG++ V +L E+FPQ+D +L+AVA +H +DVN A ++ +++P F N +
Sbjct: 1    MGYKAVYRSLTELFPQIDARLLKAVAIEHPKDVNEAAAVVVSEIVPFF-------YPNLA 53

Query: 1460 DWGHQGVEEAHAQWPSESGLLSSGIACNSDFQVNDSMTCMDKPVGR---------DAEYE 1308
            D Q + P+E S + + F+ +++ + + V + + +
Sbjct: 54   DSSTQPENKTPGNVPTEEMGGSYSGSASMAFEYHETRAPVTESVSKRNQLTHVMPNVVVD 113

Query: 1307 TKTNGMGYPTNSTEANSNMHPKERFVEFGVCLPSDHTNDFKVNSMNETLAGPDLQEVSAC 1128
            + G + S E+ + G D + +S E S
Sbjct: 114  IQRKGKIGLSGSDESGVVSSEPPVSCQAGAKSTGDDWQGVEFHSTGNQA------EASTS 167

Query: 1127 SAGPTSIHENSAMQDKYTDTATSHGAQDPFASI--LNK-SIEHLKEFPSQREIENDDNLS 957
            + ++H+ D T SH Q F SI +N+ S L S E+ + +
Sbjct: 168  ADSEDAVHKLVYPADNLAITQNSHPLQIRFGSIDVVNETSSGSLAVENSDAELSGSNLVD 227

Query: 956  --SQGDI----GSFDVNGQSSSIASVTDLFEGIKYLEDLTIDAKNSKVILTSAMESTINM 795
            S+G + G +++G SS+ + + + +LE + DAK++K L + MES +N+
Sbjct: 228  EISKGSLADENGDPELDGAVSSVGNRSTQGCNMVHLEQIIEDAKSNKRTLFTVMESIMNL 287

Query: 794  EKEVELHEERAKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAE 615
            +EVEL E+ A++AK +AS+ G DTL V NDM A EVYGERS+L E
Sbjct: 288  MREVELQEKEAEKAKEDASIGGFDTLDKVEELKKMLEHAKEANDMAAGEVYGERSILTTE 347

Query: 614  AQGLQSWLLSVSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXAD 435
            L++ L+S+S ERD SL ++DE+R LE R A+
Sbjct: 348  VNELENRLISLSEERDNSLSVLDEMRVDLEIRLATALGIKNAAEQEKQEKEGSARKAFAE 407

Query: 434  GRLFAASIFXXXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGH 255
            + LM+HG+I+D L+ EI+ I +D+ LKE+ D
Sbjct: 408  QEAIMERVVQESKLLQQEAEENSKLREFLMDHGRIVDSLQGEISVICQDIRHLKEKFDNR 467

Query: 254  IPLSRSASHASVTSNLVSLSTEL 186
            +PLS+S S + + L S ++ +
Sbjct: 468  VPLSQSISSSQTSCKLASSASSM 490

>ref|XP_003520215.1| PREDICTED: uncharacterized protein LOC100789476 [Glycine max]
Length = 603

Score = 145 bits (365), Expect = 4e-32
Identities = 149/571 (26%), Positives = 240/571 (42%), Gaps = 68/571 (11%)
Frame = -2

Query: 1640 MGFEDVLAALVEIFPQVDYCMLQAVASQHSEDVNAALDFILHDVLP----SFPASWTPCT 1473
            MGF V +L EIFPQVD +L+AVA +H +D + A ++ +V+P PA+ P
Sbjct: 1    MGFNSVYRSLQEIFPQVDPRLLRAVAIEHPKDADLAAGIVIAEVIPFMSKKLPAAIPPQH 60

Query: 1472 SN----------SSDWGH-----QGVEEAHAQWPSESGLLSSGIACNSDFQ-VNDSMTCM 1341
            +N S + G+ Q V++ S +S + +D+ V D +
Sbjct: 61   NNYVASLNVEVESEEEGNRLRHRQLVDDVTVGPSSAPHSISVEVIKTADYSFVPDLNEAL 120

Query: 1340 DKPV----GRDAEYE------------TKTNGMGYPTN----------STEANSNMHPK- 1242
            DK G D E + N G N S E N N +
Sbjct: 121  DKSTMSNDGTDKFLEMNDIKELDIYQNAEDNFSGETLNEIAQEMSNGFSQEDNENFERRF 180

Query: 1241 -----ERFVEFGVC--LPSDHTNDFKVNSMNETLAGPDLQEVSACSAGPTSIHENSAMQD 1083
            E + G+C + H N K + N D + S + S++ D
Sbjct: 181  VDVDCENLISSGICQEMEPKHNNLSKEAASNNG----DGNRIGNDSNEMGWLEVVSSLVD 236

Query: 1082 KYTDTATSHGAQDPFASILNKSIEHLKEFPSQREIEND-----DNLSSQGDIGSFDVNGQ 918
            Y D TSH ++ ++ E P ++ D D+L S+ GS
Sbjct: 237  DY-DATTSHRLEECETYLIELETS---EAPKVCHVQGDALNYKDSLQSELVAGSSSTGDN 292

Query: 917  SSSIAS-VTDLFEGIKY--------LEDLTIDAKNSKVILTSAMESTINMEKEVELHEER 765
            +S + + G +Y LE++ +AK +K +L S+MES IN+ +EVEL E+
Sbjct: 293  TSDVEDDIGAKNAGSQYSHVCRIDLLEEIIDEAKTNKKMLFSSMESLINLMREVELQEKA 352

Query: 764  AKQAKIEASLAGEDTLVHVXXXXXXXXXXXXRNDMHAEEVYGERSMLAAEAQGLQSWLLS 585
            A+QA +EA+ G + L + NDMH+ EVYGE+++L E + LQS LL
Sbjct: 353  AEQANMEAATGGSNILARIEEYKTMVVQANEANDMHSGEVYGEKAILTTELKELQSRLLG 412

Query: 584  VSNERDKSLGIIDEIRNTLETRXXXXXXXXXXXXXXXXXXXXXXXXXXADGRLFAASIFX 405
            +S+ERD+SL I+DEIR+ LE R +
Sbjct: 413  LSDERDRSLAILDEIRHILEVRLAAAEELRKAAEQLKLEKEESARKALVEQERLVEKVVH 472

Query: 404  XXXXXXXXXXXXXXXXXXLMNHGQIIDILRLEIASILKDVMTLKERVDGHIPLSRSASHA 225
            L++ G+++D+L+ EI+ I +D+ LKE+ D ++PLS+S + +
Sbjct: 473  ESQRLQQEAEENSKLQEFLIDRGRVVDMLQGEISVICQDIKLLKEKFDANLPLSKSFTSS 532

Query: 224  SVTSNLVSLSTELSDSQIAPSEGTSRNPHSD 132
            + L S + S +A G+ + S+
Sbjct: 533  QTSCKLASSGS--SHKTLASDAGSEHSESSE 561
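The per-hit summary lines in this report (Score, Expect, Identities, Positives, Gaps, Frame) have a regular shape, so they can be pulled out with a regular expression. The following is a minimal sketch, not part of BLAST itself: the names `HSP_RE` and `parse_hsp_summaries` are hypothetical, and the pattern assumes the legacy plain-text layout shown in this report (whitespace between fields may be spaces or newlines; the Gaps field is treated as optional).

```python
import re

# Hypothetical pattern for the summary block of one alignment in
# legacy BLASTX plain-text output, as it appears in this report.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsp_summaries(report):
    """Return one dict per alignment summary found in the report text."""
    hits = []
    for m in HSP_RE.finditer(report):
        length = int(m["length"])
        ident = int(m["ident"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(m["evalue"]),
            "identities": ident,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "align_length": length,
            "frame": int(m["frame"]),
            # Recomputed from the counts rather than the rounded percentage.
            "pct_identity": 100.0 * ident / length,
        })
    return hits


# Summary block of the first alignment, copied from the report.
sample = (
    "Score = 154 bits (390), Expect = 5e-35 "
    "Identities = 152/536 (28%), Positives = 230/536 (42%), "
    "Gaps = 36/536 (6%) Frame = -2"
)
hits = parse_hsp_summaries(sample)
print(hits[0]["evalue"], hits[0]["frame"])
```

Recomputing the percentage from the raw counts avoids relying on the rounded value printed in parentheses.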