BLASTX nr result
ID: Dioscorea21_contig00011349
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00011349
         (1562 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAO23078.1| polyprotein [Glycine max]                              191   3e-46
ref|XP_003553022.1| PREDICTED: uncharacterized protein LOC100788...   136   1e-29
ref|XP_003525745.1| PREDICTED: uncharacterized protein LOC100809...   135   4e-29
emb|CAN68669.1| hypothetical protein VITISV_039388 [Vitis vinifera]   134   7e-29
ref|XP_003524238.1| PREDICTED: uncharacterized protein LOC100782...   132   2e-28

>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score = 191 bits (486), Expect = 3e-46
 Identities = 101/247 (40%), Positives = 154/247 (62%), Gaps = 19/247 (7%)
 Frame = +3

Query: 315 TRFKDLAAKM----DTIVAVMDQHEERIKLLEQSFATIFKYIE----NHKAQSSNINAMQ 470
           TR K++ A++    D I  V D  +  I+ LE +     + IE     + +Q S +NA+
Sbjct: 5   TRMKEVYAELKKNADAITRVSDDLQNHIERLEATNHAQMEKIEVMQSTNDSQFSQLNAVM 64

Query: 471 ----------PLSQPVSDLASPTKPRNAIPL-PFQIDFPKFDGSDALHWIFRAEQFFDLY 617
                     P+S   +  +   + R++  +   ++DFP+FDG + + WIF+AEQFFD Y
Sbjct: 65  SQVLQRLQNIPMSSHGASNSQKEQQRSSFQVRSVKLDFPRFDGKNVMDWIFKAEQFFDYY 124

Query: 618 DISDPDRVNIAAIHMEGEVIPWFQLLEKCEELVSWSSLAKAVKLAYGPSLFESPRYALFK 797
              D DR+ IA++H++ +V+PW+Q+L+K E   SW +  +A++L +GPS ++ PR  LFK
Sbjct: 125 ATPDADRLIIASVHLDQDVVPWYQMLQKTEPFSSWQAFTRALELDFGPSAYDCPRATLFK 184

Query: 798 LQQHDTVSNYYLSFIELASQVEGLSSAALVDCFVSGLEREIQRDVILWQPESLTKAVALA 977
           L Q  TV+ YY+ F  L ++V+GLS+ A++DCFVSGL+ EI RDV   +P +LTKAVALA
Sbjct: 185 LNQSATVNEYYMQFTALVNRVDGLSAEAILDCFVSGLQEEISRDVKAMEPRTLTKAVALA 244

Query: 978 KFFEEKY 998
           K FEEKY
Sbjct: 245 KLFEEKY 251

>ref|XP_003553022.1| PREDICTED: uncharacterized protein
           LOC100788433 [Glycine max]
          Length = 1433

 Score = 136 bits (343), Expect = 1e-29
 Identities = 74/224 (33%), Positives = 125/224 (55%), Gaps = 6/224 (2%)
 Frame = +3

Query: 363 MDQHEERIKLLEQSFATIFKYIENHKAQSSNINAM------QPLSQPVSDLASPTKPRNA 524
           M  H  R    ++    I +   +H   +S ++A+      Q  + PVS    P   RN
Sbjct: 1   MADHNTRKTTTDRLEEAISRLSLHHSELASKVDAILDRLDRQSSASPVSSPKPPPASRNH 60

Query: 525 IPLPFQIDFPKFDGSDALHWIFRAEQFFDLYDISDPDRVNIAAIHMEGEVIPWFQLLEKC 704
           +    +ID P+FDGSD + WIF+  Q F+     + +RV +A+ +++G  + W+Q + +
Sbjct: 61  V----KIDVPRFDGSDPVGWIFKITQLFEYQGTPEEERVTLASFYLDGPALSWYQWMFRN 116

Query: 705 EELVSWSSLAKAVKLAYGPSLFESPRYALFKLQQHDTVSNYYLSFIELASQVEGLSSAAL 884
             + SWSS  +A++  + P+ ++ P+ ALFKL Q  +V++Y + F  LA+++ GL    L
Sbjct: 117 GFITSWSSFLQALESRFAPTYYDDPKGALFKLTQTGSVNDYLIEFERLANRIFGLPPPFL 176

Query: 885 VDCFVSGLEREIQRDVILWQPESLTKAVALAKFFEEKYQPHTQP 1016
           + CFVSGL  +I+R+V+  QP S  +A ALAK  E+K +   +P
Sbjct: 177 LSCFVSGLAPDIRREVLALQPISFPQATALAKLQEDKLRDRPRP 220

>ref|XP_003525745.1| PREDICTED: uncharacterized protein LOC100809540
           [Glycine max]
          Length = 1232

 Score = 135 bits (339), Expect = 4e-29
 Identities = 64/160 (40%), Positives = 99/160 (61%)
 Frame = +3

Query: 540 QIDFPKFDGSDALHWIFRAEQFFDLYDISDPDRVNIAAIHMEGEVIPWFQLLEKCEELVS 719
           +++ P+FDG D L WIF+  QFFD   +SD +R+ +A+ +MEG  + WFQ + +   L S
Sbjct: 2   KLEVPRFDGKDPLGWIFKITQFFDYQGVSDAERLTVASFYMEGPALCWFQWMSRNGFLTS 61

Query: 720 WSSLAKAVKLAYGPSLFESPRYALFKLQQHDTVSNYYLSFIELASQVEGLSSAALVDCFV 899
           W ++ +A++  + PS ++ P  ALFK+QQ  TV+ Y   F  LA++V GL+    + CF+
Sbjct: 62  WQAMLQALETRFAPSYYDDPYGALFKIQQRGTVNEYLSEFERLANRVVGLAPPLSLSCFI 121

Query: 900 SGLEREIQRDVILWQPESLTKAVALAKFFEEKYQPHTQPH 1019
           SGL  E+ R+V   QP  L +A+ALAK  E+K     + H
Sbjct: 122 SGLNPELHREVQALQPMCLPQAMALAKLQEDKLLDRRRNH 161

>emb|CAN68669.1| hypothetical protein VITISV_039388 [Vitis vinifera]
          Length = 1360

 Score = 134 bits (337), Expect = 7e-29
 Identities = 73/240 (30%), Positives = 132/240 (55%), Gaps = 6/240 (2%)
 Frame = +3

Query: 315 TRFKDLAAKMDTIVAVMDQHE---ERIKLLEQSFATIFKYIENHKAQSSNINAMQPLSQP 485
           TR K  A   + +  ++ +HE   +++    Q+  T  + +   ++Q+++ +   P ++
Sbjct: 3   TRGKTNAEFRNDVNEILARHESSFDQVNAALQAVLTELQTLRASRSQNTSPSETNPFARD 62

Query: 486 VSDLASPTKPRNAIPLPFQ---IDFPKFDGSDALHWIFRAEQFFDLYDISDPDRVNIAAI 656
            S      +       P Q   + FPKF+G D   WI++AEQ+FD  +I+   +V +A+
Sbjct: 63  ESSHPHTARSNTINDHPHQHLKLSFPKFNGDDPTGWIYKAEQYFDFMNITPGQQVQLASF 122

Query: 657 HMEGEVIPWFQLLEKCEELVSWSSLAKAVKLAYGPSLFESPRYALFKLQQHDTVSNYYLS 836
           H+EG  + W + L K    ++W    KAV+L +GP+ +E P  AL +L+Q  TV+ Y  +
Sbjct: 123 HLEGIALQWHRWLTKFRGPLTWDEFTKAVQLRFGPTDYEDPSEALTRLKQTTTVAAYQEA 182

Query: 837 FIELASQVEGLSSAALVDCFVSGLEREIQRDVILWQPESLTKAVALAKFFEEKYQPHTQP 1016
           F +L+ +V+GL    L+ CF++GL+ EI+ DV + QP +L   + +A+  EE+ Q   +P
Sbjct: 183 FEKLSHRVDGLPENFLIGCFIAGLQDEIRIDVKIKQPRTLADTIGVARLIEERNQLQRKP 242

>ref|XP_003524238.1| PREDICTED: uncharacterized protein LOC100782971
           [Glycine max]
          Length = 1863

 Score = 132 bits (333), Expect = 2e-28
 Identities = 62/152 (40%), Positives = 97/152 (63%)
 Frame = +3

Query: 540 QIDFPKFDGSDALHWIFRAEQFFDLYDISDPDRVNIAAIHMEGEVIPWFQLLEKCEELVS 719
           ++D P FDG D L WIF+  QFF+     + +R+ +A+ +++G  + WFQ + +   + S
Sbjct: 31  KLDVPCFDGHDPLGWIFKISQFFEYQGTPEEERITVASFYLDGAALSWFQWMHQNGFITS 90

Query: 720 WSSLAKAVKLAYGPSLFESPRYALFKLQQHDTVSNYYLSFIELASQVEGLSSAALVDCFV 899
           W +L +A++  + PS ++ PR ALFKL Q  +V+ Y   F  LA+++ GLS   L+ CFV
Sbjct: 91  WPALLQAIETRFAPSFYDDPRGALFKLTQRASVTEYLTEFERLANRIVGLSPPMLLSCFV 150

Query: 900 SGLEREIQRDVILWQPESLTKAVALAKFFEEK 995
           SGL  EI+R+V  +QP SL  A+ALA+  E+K
Sbjct: 151 SGLNTEIRREVQAFQPVSLPHAMALARLQEDK 182
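
The per-hit statistics above (bit score, E-value, identities, positives) can be pulled out of the report text with a small script. The following is a minimal sketch, not part of the BLAST distribution: the names `STATS_RE` and `parse_hit_stats` are hypothetical, the pattern assumes the legacy text layout shown in this report, and the percentages are recomputed from the counts rather than read from the rounded values in parentheses.

```python
import re

# Hypothetical helper for the legacy BLAST text layout shown in this report,
# e.g. "Score = 191 bits (486), Expect = 3e-46 Identities = 101/247 (40%), ..."
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[^\s,]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<cols>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hit_stats(text):
    """Return the statistics of the first hit found in `text` as a dict."""
    m = STATS_RE.search(text)
    if m is None:
        raise ValueError("no 'Score = ... Identities = ...' block found")
    evalue = m.group("evalue")
    # Assumption: very small E-values may be printed without a leading
    # digit (for example "e-100"); prefix "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    ident, cols, pos = int(m.group("ident")), int(m.group("cols")), int(m.group("pos"))
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(evalue),
        "identities": ident,
        "positives": pos,
        "alignment_columns": cols,
        "pct_identity": 100.0 * ident / cols,
        "pct_positive": 100.0 * pos / cols,
    }


# Statistics of the first hit (gb|AAO23078.1|), copied from the report.
SAMPLE = (
    "Score = 191 bits (486), Expect = 3e-46 "
    "Identities = 101/247 (40%), Positives = 154/247 (62%), "
    "Gaps = 19/247 (7%) Frame = +3"
)

if __name__ == "__main__":
    for key, value in parse_hit_stats(SAMPLE).items():
        print(key, value)
```

Because the pattern only uses `\s` between fields, it should work on the report whether the statistics sit on one line or are wrapped across several. For anything beyond a quick check, a maintained BLAST parser is likely a better choice than a hand-written regular expression.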