BLASTX nr result
ID: Mentha25_contig00053956
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00053956
         (1250 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]   263   1e-67
emb|CAN64225.1| hypothetical protein VITISV_016222 [Vitis vinifera]   247   7e-63
emb|CAN65540.1| hypothetical protein VITISV_029946 [Vitis vinifera]   246   1e-62
ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500...   234   6e-59
ref|XP_006596754.1| PREDICTED: uncharacterized protein LOC102663...   225   4e-56
ref|XP_006580020.1| PREDICTED: uncharacterized protein LOC100820...   224   6e-56
ref|XP_003524766.2| PREDICTED: uncharacterized protein LOC100820...   224   6e-56
ref|XP_006579313.1| PREDICTED: uncharacterized protein LOC102665...   223   2e-55
ref|XP_006575768.1| PREDICTED: uncharacterized protein LOC102663...   222   2e-55
ref|XP_004501782.1| PREDICTED: uncharacterized protein LOC101501...   222   2e-55
ref|XP_006586460.1| PREDICTED: uncharacterized protein LOC102664...   221   7e-55
ref|XP_006596695.1| PREDICTED: uncharacterized protein LOC102666...   219   2e-54
emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera]   219   2e-54
ref|XP_006575821.1| PREDICTED: uncharacterized protein LOC102670...   219   3e-54
gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsi...   219   3e-54
ref|XP_006606864.1| PREDICTED: uncharacterized protein LOC102669...   218   6e-54
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   218   6e-54
ref|XP_006575837.1| PREDICTED: uncharacterized protein LOC102664...   216   2e-53
emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera]   216   2e-53
ref|XP_006603194.1| PREDICTED: uncharacterized protein LOC102665...   215   3e-53

>emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]
          Length = 970

 Score = 263 bits (672), Expect = 1e-67
 Identities = 132/279 (47%), Positives = 181/279 (64%), Gaps = 5/279 (1%)
 Frame = +2

Query: 428  EDSSHPLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEAND 607
            EDS+ P +LH D+PG VLVS LTG+NY WSR+M AL AKNK+ +DGSI PE++D
Sbjct: 22   EDSTSPYFLHNLDHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPCPESDD 81

Query: 608  PMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQK 787
            + WIRCNSMV+SW+ NS I SLLY + AV IWNDLR RF Q + R++ +K+
Sbjct: 82   LLFGTWIRCNSMVISWILNSVHKDIADSLLYFDTAVGIWNDLRDRFCQSNGPRIFQIKKH 141

Query: 788  LFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFDMHFL 967
            L +L QGSLDV+TYYT L+I+WDE FQP C CG + W ++QQ+++ M FL
Sbjct: 142  LIALSQGSLDVSTYYTRLKILWDELKGFQPLPECACGTMK-----TWMEFQQQEYVMQFL 196

Query: 968  MGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHG-----DSVFPSPTLSEPV 1132
            MGLNES+ ++ ++ M+P P KVFSLV Q+ERQ +G DSV + + S
Sbjct: 197  MGLNESFVQTRSQILMMEPLPPIAKVFSLVAQDERQCSINYGLYTPPDSVAANDSNST-- 254

Query: 1133 GMLTNASQGNYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + +A++ N +D+ C+HCG HTVDKC+++ G+
Sbjct: 255  -VAISAARLNSKPKKDRPTCSHCGILGHTVDKCYKLYGY 292

>emb|CAN64225.1| hypothetical protein VITISV_016222 [Vitis vinifera]
          Length = 987

 Score = 247 bits (631), Expect = 7e-63
 Identities = 129/280 (46%), Positives = 170/280 (60%), Gaps = 4/280 (1%)
 Frame = +2

Query: 422  PPEDSSHPLYLHPSDNPGAVLVSELL--TGSNYLDWSRSMQTALLAKNKLGLVDGSIQRP 595
            P ED S P +LH D+P LVS L +GSNY W RSM TAL AKNKLG +DG+I R
Sbjct: 14   PMEDHSSPYFLHNGDHPSLSLVSLSLAGSGSNYHSWCRSMVTALNAKNKLGFIDGTISRL 73

Query: 596  EANDPMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYH 775
            A D + +W RCNSMV+SWL NS +I S+LY ++IWNDL RF QG R++
Sbjct: 74   AATDLLAGLWSRCNSMVISWLSNSVCKEIAESILYHETTIEIWNDLYERFHQGSGPRIFE 133

Query: 776  LKQKLFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFD 955
            +KQK+ + QG VNTYYT + +WDE +F+ C+CGG R + + QQ++
Sbjct: 134  IKQKILAHTQGLAYVNTYYTRQKSLWDELREFKAIPVCNCGGMRV-----YMEDQQRESV 188

Query: 956  MHFLMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQR-FALHGDSVFPSPTLSEPV 1132
            M FL+GLNES+AP++ ++ M P P KVFSLV+QEERQR + F +P S
Sbjct: 189  MQFLLGLNESFAPIRAQILLMKPTPPLNKVFSLVVQEERQRSLTISNSPAFTAPVSSRFQ 248

Query: 1133 GMLTNASQGNYSRPR-DKLYCTHCGKTNHTVDKCFQIRGF 1249
            +S N SR R D+ CTHC HTVD+C++I G+
Sbjct: 249  AASRASSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGY 288

>emb|CAN65540.1| hypothetical protein VITISV_029946 [Vitis vinifera]
          Length = 1240

 Score = 246 bits (628), Expect = 1e-62
 Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 4/280 (1%)
 Frame = +2

Query: 422  PPEDSSHPLYLHPSDNPGAVLVSELL--TGSNYLDWSRSMQTALLAKNKLGLVDGSIQRP 595
            P ED S P +LH D+P LVS L +GSNY W RSM TAL AKNKLG +DG+I R
Sbjct: 14   PMEDHSSPYFLHNGDHPSLSLVSLSLAXSGSNYHSWXRSMVTALNAKNKLGFIDGTISRL 73

Query: 596  EANDPMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYH 775
            A D + W RCNSMV+SWL NS +I S+LY ++IWNDL RF QG R++
Sbjct: 74   AATDLLAGXWSRCNSMVISWLSNSVCKEIAESILYHETTIEIWNDLYERFHQGSGPRIFE 133

Query: 776  LKQKLFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFD 955
            +KQK+ + QG VNTYYT + +WDE +F+ C+CGG R + + QQ++
Sbjct: 134  IKQKILAHTQGLAYVNTYYTRQKSLWDELREFKAIPVCNCGGMRV-----YMEDQQRESV 188

Query: 956  MHFLMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQR-FALHGDSVFPSPTLSEPV 1132
            M FL+GLNES+AP++ ++ M P P KVFSLV+QEERQR + F +P S
Sbjct: 189  MQFLLGLNESFAPIRAQILLMKPTPPLNKVFSLVVQEERQRSLTISNSPAFTAPVSSRFQ 248

Query: 1133 GMLTNASQGNYSRPR-DKLYCTHCGKTNHTVDKCFQIRGF 1249
            +S N SR R D+ CTHC HTVD+C++I G+
Sbjct: 249  AASRASSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGY 288

>ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500638 [Cicer arietinum]
          Length = 379

 Score = 234 bits (597), Expect = 6e-59
 Identities = 112/278 (40%), Positives = 172/278 (61%), Gaps = 5/278 (1%)
 Frame = +2

Query: 431  DSSHPLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDP 610
            D P ++HPSDNPG LVS L +N+ WSR+M +L +KNK G V G+I RP+ D
Sbjct: 37   DMMDPFFMHPSDNPGLALVSPPLNNTNFHSWSRAMLVSLRSKNKSGFVLGTISRPKDTDR 96

Query: 611  MLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKL 790
            + W RCN+MV+SW+RNS I S++++++A +IW++L R+ QGD R+ L++++
Sbjct: 97   LSMAWDRCNTMVMSWIRNSLESDIAQSIMWMDSAAEIWHELNDRYHQGDIFRISDLQEEI 156

Query: 791  FSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDC-GGCRCRSAVKWRDYQQKDFDMHFL 967
            + L+QG + Y+TNL+ +W E +F P C C C C K R+Y++ D+ +HFL
Sbjct: 157  YGLRQGDSSITIYFTNLKKLWQELENFFPLPSCSCTPTCSCNLLPKIREYRENDYVIHFL 216

Query: 968  MGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFA----LHGDSVFPSPTLSEPVG 1135
            GLNE Y+P+++ +M M+P P+ KVFS++LQ+ERQ F+ L +V + + G
Sbjct: 217  KGLNEQYSPVRSQIMLMEPLPTISKVFSMLLQQERQFFSHTEELKTVAVVSNHSRGFGRG 276

Query: 1136 MLTNASQGNYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + +G+ SR R CTHC K+ H VD CF+ G+
Sbjct: 277  SSLGSGRGSGSRGRGYKICTHCNKSGHMVDVCFKKHGY 314

>ref|XP_006596754.1| PREDICTED: uncharacterized protein LOC102663057 [Glycine max]
          Length = 456

 Score = 225 bits (573), Expect = 4e-56
 Identities = 113/270 (41%), Positives = 167/270 (61%), Gaps = 1/270 (0%)
 Frame = +2

Query: 443  PLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNM 622
            P YLHPS+NP LVS LL +NY WSRS+ TAL AKNK+ VDGS+ RP +N +
Sbjct: 14   PYYLHPSENPAIALVSPLLDPTNYNSWSRSVLTALSAKNKVEFVDGSLPRPASNHRLYAA 73

Query: 623  WIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLK 802
            W R N+MVVSWL +S IR S+L+++NAV IW DL+ R+SQGD + L+ KL S+K
Sbjct: 74   WKRANNMVVSWLVHSVATSIRQSILWMDNAVDIWKDLKARYSQGDLLCISDLQHKLASIK 133

Query: 803  QGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGG-CRCRSAVKWRDYQQKDFDMHFLMGLN 979
            QG++++ Y+T LR IWDE ++P C C C C + ++ + + +D M F+ GLN
Sbjct: 134  QGNMNITDYFTKLRTIWDELESYRPDLVCTCASKCSCDALIEAKKRKDQDRIMEFMRGLN 193

Query: 980  ESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLSEPVGMLTNASQG 1159
            + Y ++++++ MDP PS KVFS + Q+ERQ + S+ +G L+ +
Sbjct: 194  DQYNHVRSNILMMDPLPSISKVFSYMAQQERQLAS------------SDALGNLSLVNVA 241

Query: 1160 NYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            +R + C++CG+ NHTV+ C++ GF
Sbjct: 242  ASTRSSNS--CSYCGRDNHTVETCYKKNGF 269

>ref|XP_006580020.1| PREDICTED: uncharacterized protein LOC100820019 isoform X5 [Glycine max]
          Length = 395

 Score = 224 bits (571), Expect = 6e-56
 Identities = 121/297 (40%), Positives = 171/297 (57%), Gaps = 29/297 (9%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP L+S +L +NY WSRSM TAL AKNK+ VDGS P D W
Sbjct: 13   LYLHPSENPSTALISPVLDSTNYHSWSRSMITALSAKNKIEFVDGSAPEPLKTDRTYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ + IW DL++R+SQGD R++ L+Q+ +L+Q
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSILWMDKSEDIWRDLKSRYSQGDLLRIFDLQQEASTLRQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LR+IWDE +F+P C C C C + + +D M FL GLNE
Sbjct: 133  GALSVTKYFTWLRVIWDEIENFRPNPVCTCNIRCSCSAFAIIAQRKLEDRAMQFLRGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLS-EPVGMLTNASQ- 1156
            Y +++H++ MDP P+ K+FS V+Q+ERQ L G+S SP L+ EP + NA++
Sbjct: 193  QYINIRSHVLLMDPIPAISKIFSYVVQQERQ---LLGNS---SPNLNFEPKDVSINATKT 246

Query: 1157 -----GNYSRPRDKLY---------------------CTHCGKTNHTVDKCFQIRGF 1249
            G ++ Y CTHCGK HT+D C++ G+
Sbjct: 247  ICDHCGRIGHTKNVCYKKHGMPLNHEARNKSMGGRKTCTHCGKIGHTIDVCYRKHGY 303

>ref|XP_003524766.2| PREDICTED: uncharacterized protein LOC100820019 isoform X1 [Glycine max]
 gi|571455200|ref|XP_006580017.1| PREDICTED: uncharacterized protein LOC100820019 isoform X2 [Glycine max]
 gi|571455202|ref|XP_006580018.1| PREDICTED: uncharacterized protein LOC100820019 isoform X3 [Glycine max]
 gi|571455204|ref|XP_006580019.1| PREDICTED: uncharacterized protein LOC100820019 isoform X4 [Glycine max]
          Length = 495

 Score = 224 bits (571), Expect = 6e-56
 Identities = 121/297 (40%), Positives = 171/297 (57%), Gaps = 29/297 (9%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP L+S +L +NY WSRSM TAL AKNK+ VDGS P D W
Sbjct: 13   LYLHPSENPSTALISPVLDSTNYHSWSRSMITALSAKNKIEFVDGSAPEPLKTDRTYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ + IW DL++R+SQGD R++ L+Q+ +L+Q
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSILWMDKSEDIWRDLKSRYSQGDLLRIFDLQQEASTLRQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LR+IWDE +F+P C C C C + + +D M FL GLNE
Sbjct: 133  GALSVTKYFTWLRVIWDEIENFRPNPVCTCNIRCSCSAFAIIAQRKLEDRAMQFLRGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLS-EPVGMLTNASQ- 1156
            Y +++H++ MDP P+ K+FS V+Q+ERQ L G+S SP L+ EP + NA++
Sbjct: 193  QYINIRSHVLLMDPIPAISKIFSYVVQQERQ---LLGNS---SPNLNFEPKDVSINATKT 246

Query: 1157 -----GNYSRPRDKLY---------------------CTHCGKTNHTVDKCFQIRGF 1249
            G ++ Y CTHCGK HT+D C++ G+
Sbjct: 247  ICDHCGRIGHTKNVCYKKHGMPLNHEARNKSMGGRKTCTHCGKIGHTIDVCYRKHGY 303

>ref|XP_006579313.1| PREDICTED: uncharacterized protein LOC102665903 [Glycine max]
          Length = 395

 Score = 223 bits (567), Expect = 2e-55
 Identities = 118/291 (40%), Positives = 163/291 (56%), Gaps = 23/291 (7%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLH S+NP LVS +L +NY WSRSM AL AKNK+ +DGS P D M W
Sbjct: 13   LYLHLSENPATALVSPVLDSTNYHSWSRSMVIALSAKNKVEFIDGSAPEPLKTDRMHGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ A +IW+DL++R+SQGD R+ L+Q+ ++KQ
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSILWMDKAEEIWHDLKSRYSQGDLLRISDLQQEASTMKQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            GSL V Y+T LR+IWDE +F+P C C C C + + +D M FL GLNE
Sbjct: 133  GSLTVTEYFTRLRVIWDEIENFRPDPICSCNIRCSCNAFTIIAQRKLEDRAMQFLRGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSP---------TLSEPVG 1135
            YA +++H++ MDP PS K+ S V Q+ERQ G S+ P T + G
Sbjct: 193  QYANIRSHVLLMDPIPSISKILSYVAQQERQLLGNTGPSINFEPKDISINAAKTTCDFCG 252

Query: 1136 MLTNASQGNYSRPR-------------DKLYCTHCGKTNHTVDKCFQIRGF 1249
            + + Y + + CTHCGK HTVD C++ G+
Sbjct: 253  RIGHVESACYKKHEVPSNYDAKNKSNIGRKTCTHCGKIGHTVDFCYRKHGY 303

>ref|XP_006575768.1| PREDICTED: uncharacterized protein LOC102663845 [Glycine max]
          Length = 482

 Score = 222 bits (566), Expect = 2e-55
 Identities = 117/291 (40%), Positives = 162/291 (55%), Gaps = 23/291 (7%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L +NY WSRSM TAL AKNKL VDG+ P D + W
Sbjct: 13   LYLHPSENPATALVSPVLDSTNYHSWSRSMVTALSAKNKLEFVDGTAPEPLKTDRLYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ A IW DL++R+SQGD R+ L+Q+ +LKQ
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSVLWMDKAEDIWRDLKSRYSQGDLLRISDLQQEASTLKQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L + Y+T LR+IWDE F+P C C C C + + +D M FL GLNE
Sbjct: 133  GALSITEYFTRLRVIWDEIESFRPDPICTCNVRCSCSVSTIIGQRKLEDRAMQFLRGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQ---------RFALHGDSVFPSPTLSEPVG 1135
            Y +++H++ MDP P K+FS V Q+ERQ F S+ + + E G
Sbjct: 193  QYTNIRSHVLLMDPIPPISKIFSYVAQQERQLLGNCSPNLNFEAKEISINTARSACEYCG 252

Query: 1136 M-------------LTNASQGNYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + ++ + Y + CTHCGK HTVD C++ G+
Sbjct: 253  RSGHTESVCYKKHGMPSSHETRYKSNGGRKTCTHCGKMGHTVDVCYRKHGY 303

>ref|XP_004501782.1| PREDICTED: uncharacterized protein LOC101501608 [Cicer arietinum]
          Length = 362

 Score = 222 bits (566), Expect = 2e-55
 Identities = 113/293 (38%), Positives = 173/293 (59%), Gaps = 20/293 (6%)
 Frame = +2

Query: 428  EDSSHPLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEAND 607
            +D++H Y+HP++NP VLVS +L G NY W+R+M +L KNK G VDGSI P+A +
Sbjct: 12   QDTTHDYYIHPNENPSLVLVSPILEGPNYHGWARAMAMSLQMKNKFGFVDGSIPCPDAPN 71

Query: 608  PMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQK 787
            M+ W RCN++V+SW+ + +I +S+L+++ A W DL+ RFSQGD R+ L Q
Sbjct: 72   QMIPAWKRCNNLVLSWINHFVSHEIATSILWIDTAAAAWKDLKDRFSQGDSVRISQLHQD 131

Query: 788  LFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGG-CRCRSAVKWRDYQQKDFDMHF 964
            L+S+ Q L V YYT ++I+WDE +++P C C C + + Y+ D + F
Sbjct: 132  LYSMHQSDLTVTAYYTKMKILWDELCNYRPIPECQSVTLCCCDVSKTLKKYRDNDCVLCF 191

Query: 965  LMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEER--QRFALHGDSVFPSPT------L 1120
            L GLN++Y+ +++ ++ MDP PS K+FS+++Q+ER Q L SV +
Sbjct: 192  LRGLNDNYSAVRSQILLMDPLPSLTKIFSMIIQQERQLQTSPLPESSVMAAQVPQQVSYQ 251

Query: 1121 SEPVGMLTNASQGNYS----RPRD-------KLYCTHCGKTNHTVDKCFQIRG 1246
            ++P +N+ +G S +PR CTHCG+TNHT+D CF I G
Sbjct: 252  NKPSYSSSNSGRGKASYQGNQPRHSGGKVGVNRQCTHCGRTNHTIDTCFLIHG 304

>ref|XP_006586460.1| PREDICTED: uncharacterized protein LOC102664915 [Glycine max]
          Length = 393

 Score = 221 bits (562), Expect = 7e-55
 Identities = 121/296 (40%), Positives = 166/296 (56%), Gaps = 28/296 (9%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L SNY WSRSM TAL AKNK+ V+G P +D W
Sbjct: 12   LYLHPSENPAVALVSPVLDSSNYHSWSRSMITALSAKNKVEFVNGKALEPLKSDRTYGAW 71

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN++VVSWL +S IR S+L+++ A +IWNDL++R++QGD RV L+Q+ S+KQ
Sbjct: 72   SRCNNIVVSWLVHSVSISIRQSVLWMDRAEEIWNDLKSRYAQGDLLRVSELQQEASSIKQ 131

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            GSL V Y+T LR+IWDE +F+P C C C C +++D M FL GLNE
Sbjct: 132  GSLSVTKYFTKLRVIWDEIENFRPDPICRCTVKCTCLVLTTMAQRKREDHAMQFLRGLNE 191

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLSEPVGMLTNASQG- 1159
            Y+ +++H++ MDP P+ K+FS V Q+ERQ L G++ S L G NA +
Sbjct: 192  QYSNIRSHVLLMDPIPTIPKIFSYVAQQERQ---LTGNNSISSFNLESKEGSSINAVKSV 248

Query: 1160 ----------------------NYSRP----RDKLYCTHCGKTNHTVDKCFQIRGF 1249
            NY + CT+CGK HTV+ C++ G+
Sbjct: 249  CEFCGCIGHNESICYKKNGLPPNYDGKGKGYNTRKICTYCGKLGHTVEVCYKKHGY 304

>ref|XP_006596695.1| PREDICTED: uncharacterized protein LOC102666161 [Glycine max]
          Length = 368

 Score = 219 bits (558), Expect = 2e-54
 Identities = 114/274 (41%), Positives = 158/274 (57%), Gaps = 7/274 (2%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS++P LVS +L +NY WSRSM TAL AKNK+ VDGS P D M W
Sbjct: 13   LYLHPSESPAIALVSPVLDATNYHSWSRSMITALSAKNKVEFVDGSAPEPLKTDRMYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ A +IW DL++R+SQGD R+ L+Q+ ++KQ
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSILWMDKAEEIWRDLKSRYSQGDLLRISDLQQEASTMKQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LR+IWDE +F+P C C C C + + +D M FL GLNE
Sbjct: 133  GALSVTEYFTRLRVIWDEIENFRPNPTCFCNIRCSCSALAIIAQRKLEDRAMQFLHGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLSEPVGMLTNASQGN 1162
            Y +++H++ MDP P+ K+FS V+Q+ERQ +L N S
Sbjct: 193  QYGNIRSHVLLMDPLPAISKIFSYVVQQERQ--------------------LLGNVSSNL 232

Query: 1163 YSRPRD------KLYCTHCGKTNHTVDKCFQIRG 1246
            PRD K+ C CG+T H + C++ G
Sbjct: 233  NLEPRDISINTAKVVCDFCGRTGHLENVCYKKHG 266

>emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera]
          Length = 1031

 Score = 219 bits (558), Expect = 2e-54
 Identities = 123/284 (43%), Positives = 162/284 (57%), Gaps = 4/284 (1%)
 Frame = +2

Query: 410  PGGFPPEDSSHPLYLHPSDNPGAVLVSELL--TGSNYLDWSRSMQTALLAKNKLGLVDGS 583
            P FP ED S P +LH D+ LVS L +GSNY W RSM T L AKNKL +DG+
Sbjct: 10   PFQFPMEDHSSPYFLHNGDHTSLSLVSFSLVGSGSNYHSWCRSMVTTLNAKNKLRFIDGT 69

Query: 584  IQRPEANDPMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDA 763
            I RP A D + W RCNSMV+SWL NS +I S+LY ++IWNDL RF QG
Sbjct: 70   ISRPVATDLLAGPWSRCNSMVISWLSNSVCKEISESILYHETTIEIWNDLYERFHQGSGP 129

Query: 764  RVYHLKQKLFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQ 943
            R++ LKQK+ + QGS D+ +F+ C+CGG R + + QQ
Sbjct: 130  RIFELKQKILAHTQGSADLQ--------------EFKAIPVCNCGGMRV-----YMEDQQ 170

Query: 944  KDFDMHFLMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDS-VFPSPTL 1120
            ++ M FL+GLNES+ P++ ++ M+P P KVFSLV+QEERQR +S F +P
Sbjct: 171  RESVMQFLLGLNESFIPIRAQILLMEPTPLLNKVFSLVVQEERQRSLTTSNSPAFTAPVS 230

Query: 1121 SEPVGMLTNASQGNYSRPR-DKLYCTHCGKTNHTVDKCFQIRGF 1249
            S +S N SR R D+ CTHC HTVD+C++I G+
Sbjct: 231  SRFQAASRASSPTNSSRSRKDRPLCTHCNILGHTVDQCYKIHGY 274

>ref|XP_006575821.1| PREDICTED: uncharacterized protein LOC102670485 [Glycine max]
          Length = 697

 Score = 219 bits (557), Expect = 3e-54
 Identities = 115/291 (39%), Positives = 165/291 (56%), Gaps = 23/291 (7%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L +NY WSRSM TAL KNK+ +DGS +P D M W
Sbjct: 13   LYLHPSENPTIALVSPVLDSTNYHSWSRSMVTALSTKNKVEFIDGSAPKPLKTDRMHGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L+++ A +IW D+++R+SQGD R+ L+Q+ ++KQ
Sbjct: 73   RRCNNMVVSWIVHSVATSIRQSILWMDKAEEIWCDMKSRYSQGDLLRISDLQQEASTMKQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            GSL + Y+T LR+IWDE +F+P C C C + + +D M FL GLNE
Sbjct: 133  GSLTITEYFTRLRMIWDEIENFRPDPIYSCNIRCSCTAFTIIAQRKLEDRAMQFLRGLNE 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGD---------SVFPSPTLSEPVG 1135
            YA +++H++ MDP P+ K+FS V Q+ERQ G S+ + T+ + G
Sbjct: 193  QYANIRSHVLLMDPIPAISKIFSYVAQQERQLLGNTGPGINFEPKDISINAAKTICDFCG 252

Query: 1136 MLTNASQGNYSR-------------PRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + + Y + + CTHCGK HTVD C++ G+
Sbjct: 253  RIGHVESVCYKKHGVPSNYDAKNKSNNGRKTCTHCGKLGHTVDVCYRKHGY 303

>gb|AAD15368.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
 gi|17065314|gb|AAL32811.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
 gi|21387147|gb|AAM47977.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 411

 Score = 219 bits (557), Expect = 3e-54
 Identities = 106/280 (37%), Positives = 169/280 (60%), Gaps = 5/280 (1%)
 Frame = +2

Query: 425  PEDSSHPLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEAN 604
            P++ +PL LH SD+PG +V+ +L GSNY WS +M+ +L AKNKLG VDGS+ RP +
Sbjct: 60   PDNVDNPLILHSSDHPGLSIVAHVLDGSNYNSWSIAMRISLDAKNKLGFVDGSLLRPSVD 119

Query: 605  DPMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQ 784
            D +W RCNSMV SW+ N +I S+LY +AV++W DL TRF + R Y L+Q
Sbjct: 120  DSTFRIWSRCNSMVKSWILNVVNKEIYDSILYYEDAVEMWTDLFTRFRVNNLPRKYQLEQ 179

Query: 785  KLFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFDMHF 964
            + +LKQGSL+++TY+T + +W++ + + +S C C + + + + F
Sbjct: 180  AVMTLKQGSLNLSTYFTKKKTLWEQLLNTKTRS---VKKCDCDQVKELLEDAETSRVIQF 236

Query: 965  LMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLSEPVGMLT 1144
            LMGLN+ + + + +++M P P ++++++ Q+E QR H PSP + G+LT
Sbjct: 237  LMGLNDDFNTIMSQILNMKPRPGLNEIYNMLDQDESQRLVGHASKPTPSPAAFQTQGLLT 296

Query: 1145 N-----ASQGNYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            +QGN+ +P+ CTHC + HTVDKC+++ G+
Sbjct: 297  EQNPILMAQGNFKKPK----CTHCNRIGHTVDKCYKVHGY 332

>ref|XP_006606864.1| PREDICTED: uncharacterized protein LOC102669025 [Glycine max]
          Length = 355

 Score = 218 bits (554), Expect = 6e-54
 Identities = 117/291 (40%), Positives = 165/291 (56%), Gaps = 23/291 (7%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L +NY WSRSM TAL AKNKL VDGS P D W
Sbjct: 13   LYLHPSENPSIALVSPVLDSTNYHSWSRSMITALSAKNKLEFVDGSAPEPLKTDRTYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MV+SW+ +S IR S+L+++ A IW DL++R+SQGD R+ L+Q+ +L+Q
Sbjct: 73   RRCNNMVLSWIVHSVATSIRQSILWMDKAEDIWRDLKSRYSQGDLLRISDLQQEASTLRQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LR+IWDE +F+P C C C C + + +D M FL GLN+
Sbjct: 133  GTLSVTEYFTRLRVIWDEIENFRPDPACTCNIRCSCSAFAIIAQRKLEDRAMQFLRGLND 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQ---------RFALHGDSVFPSPTLSEPVG 1135
            Y +++H++ MDP P+ K+FS V Q+ER+ F S+ + T+ + G
Sbjct: 193  QYTNIRSHVLLMDPIPAITKIFSYVAQQERKLLGNSSPNLNFDPKDVSINVAKTICDYCG 252

Query: 1136 MLTNASQGNYSR--------PRDK-----LYCTHCGKTNHTVDKCFQIRGF 1249
            + + Y + R+K CTHCGK HTVD C++ G+
Sbjct: 253  RIGHTENICYKKHGMPLNHETRNKSTGGRKSCTHCGKMGHTVDVCYRKHGY 303

>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
 gi|7268152|emb|CAB78488.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score = 218 bits (554), Expect = 6e-54
 Identities = 107/281 (38%), Positives = 161/281 (57%), Gaps = 4/281 (1%)
 Frame = +2

Query: 419  FPPEDSSHPLYLHPSDNPGAVLVSE-LLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRP 595
            +P + +P YLH +D+ G +LVS+ L T S++ W RS+ AL +NKLG ++G+I +P
Sbjct: 25   YPVDQYENPYYLHSADHAGLILVSDRLTTASDFHSWRRSILMALNVRNKLGFINGTITKP 84

Query: 596  EANDPMLNMWIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYH 775
            + W RCN +V +WL NS +I SLLY+ IWN+L +RF Q D R++
Sbjct: 85   PEDHRDFGAWSRCNDIVSTWLMNSVDKKIGQSLLYIATVQGIWNNLLSRFKQDDAPRIFD 144

Query: 776  LKQKLFSLKQGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFD 955
            ++QKL ++QGS+D++TYYT L +W+E+ ++ C CG C C +AVKW QQ+
Sbjct: 145  IEQKLSKIEQGSMDISTYYTALLTLWEEHRNYVELPVCTCGRCECDAAVKWEHLQQRSRV 204

Query: 956  MHFLMGLNESYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFA---LHGDSVFPSPTLSE 1126
            FL LNE + + H++ + P P+ + F++V Q+ERQR DSV T
Sbjct: 205  TKFLKELNEGFDQTRRHILMLKPIPTIKEAFNMVTQDERQRNVKPLTRVDSVAFQNTSMI 264

Query: 1127 PVGMLTNASQGNYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + N RP K CTHCGK HT+ KC+++ G+
Sbjct: 265  NEDENAYVAAYNTVRPNQKPICTHCGKVGHTIQKCYKVHGY 305

>ref|XP_006575837.1| PREDICTED: uncharacterized protein LOC102664271 [Glycine max]
          Length = 395

 Score = 216 bits (550), Expect = 2e-53
 Identities = 121/297 (40%), Positives = 166/297 (55%), Gaps = 29/297 (9%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L +NY W RSM TAL AKNKL VDGS P D W
Sbjct: 13   LYLHPSENPSIALVSPVLDSTNYHSWIRSMITALSAKNKLEFVDGSAPEPLKTDRTYGAW 72

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            CN+MV+SW+ +S IR S+L+++ A IW DL+TR+SQGD R+ L+Q+ +L+Q
Sbjct: 73   RWCNNMVLSWIVHSVAASIRQSILWMDKAEDIWRDLKTRYSQGDLLRISDLQQEASTLRQ 132

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LR+IWDE +F+P C C C C + + +D M FL GLN+
Sbjct: 133  GTLSVTEYFTRLRVIWDEIENFRPDPACTCNVRCSCSAFAIIAQRKLEDRAMQFLRGLND 192

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLS-EPVGMLTNASQ- 1156
            Y +++H++ MDP P+ K+FS V Q+ERQ L G+S SP L+ EP + NA++
Sbjct: 193  QYTNIRSHVLLMDPIPAITKIFSYVAQQERQ---LLGNS---SPNLNFEPKDVSINAAKT 246

Query: 1157 -----GNYSRPRDKLY---------------------CTHCGKTNHTVDKCFQIRGF 1249
            G + Y CTHCG+ HTVD C++ G+
Sbjct: 247  ICDYCGRIGHTENICYKKHGMPLNHETRNKGTSGRKSCTHCGRMGHTVDVCYRKHGY 303

>emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera]
          Length = 334

 Score = 216 bits (550), Expect = 2e-53
 Identities = 108/270 (40%), Positives = 163/270 (60%), Gaps = 1/270 (0%)
 Frame = +2

Query: 443  PLYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNM 622
            P LH SD+PG VLVS++L G NY WSR+M+ +L AK+K+G V GSI+ P + D
Sbjct: 20   PFSLHHSDHPGMVLVSKVLEGDNYSTWSRAMRISLSAKDKIGFVTGSIKPPSSTDDSFPS 79

Query: 623  WIRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLK 802
            W RCN MV+SWL NS P I SS++Y A +IW DLR RFSQG+D+R+Y +K+ + +
Sbjct: 80   WQRCNDMVISWLLNSIHPDIASSVIYAETASEIWADLRERFSQGNDSRIYQIKRDIVEHR 139

Query: 803  QGSLDVNTYYTNLRIIWDEYTDFQPKSWCDCGGCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            QG ++ YYT L+ DE + + C CGG K ++ +K+ M FLMGLN+
Sbjct: 140  QGQQSISVYYTKLKAFXDELSSYHEVLSCSCGGLE-----KLKERDEKERVMQFLMGLND 194

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQ-RFALHGDSVFPSPTLSEPVGMLTNASQG 1159
            SYA ++ ++ M P P V+SLVLQ+E+Q +L+ + L++ T+A
Sbjct: 195  SYAAIRGQILLMXPLPDTRXVYSLVLQQEKQVEVSLNNGNKNHYAMLADRDNKATSAH-- 252

Query: 1160 NYSRPRDKLYCTHCGKTNHTVDKCFQIRGF 1249
            + + L+C++C + H+++KC+ + GF
Sbjct: 253  XVQKQKTPLHCSYCDRDXHSIEKCYYLHGF 282

>ref|XP_006603194.1| PREDICTED: uncharacterized protein LOC102665260 [Glycine max]
          Length = 741

 Score = 215 bits (548), Expect = 3e-53
 Identities = 119/297 (40%), Positives = 168/297 (56%), Gaps = 29/297 (9%)
 Frame = +2

Query: 446  LYLHPSDNPGAVLVSELLTGSNYLDWSRSMQTALLAKNKLGLVDGSIQRPEANDPMLNMW 625
            LYLHPS+NP LVS +L SNY WSRSM TAL AKNK+ ++G+ P D + W
Sbjct: 12   LYLHPSENPVVALVSPVLDSSNYHSWSRSMVTALSAKNKVEFINGNAPEPLRTDRTYSAW 71

Query: 626  IRCNSMVVSWLRNSTIPQIRSSLLYLNNAVQIWNDLRTRFSQGDDARVYHLKQKLFSLKQ 805
            RCN+MVVSW+ +S IR S+L++N A +IWNDL++R++QGD R+ L+Q+ S+KQ
Sbjct: 72   SRCNNMVVSWIVHSVSVAIRQSILWMNRAEEIWNDLKSRYAQGDLLRISDLQQEASSMKQ 131

Query: 806  GSLDVNTYYTNLRIIWDEYTDFQPKSWCDCG-GCRCRSAVKWRDYQQKDFDMHFLMGLNE 982
            G+L V Y+T LRIIWDE +F+P C C C C + +D M FL LNE
Sbjct: 132  GTLSVTEYFTKLRIIWDEIENFRPDPTCSCTIKCTCSVLTIIAQQKLEDRAMQFLRRLNE 191

Query: 983  SYAPLKTHLMSMDPFPSFGKVFSLVLQEERQRFALHGDSVFPSPTLS------------- 1123
            Y+ +++H++ M+P P+ K+FS V Q+ER+ L G + F + +L
Sbjct: 192  QYSNVRSHVLLMEPMPTIPKIFSYVAQQERK---LSGINSFSNLSLESKENISINVVKVT 248

Query: 1124 -EPVGMLTNASQGNYSR--------PRDKLY------CTHCGKTNHTVDKCFQIRGF 1249
            E G + + Y + R K Y CTHCGK HT+D C++ G+
Sbjct: 249  CEFCGRIGHTESVCYKKHGVPTSYEGRRKTYNRNGKMCTHCGKIGHTIDVCYKKHGY 305