BLASTX nr result
ID: Rehmannia29_contig00023129
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00023129
         (1078 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_010274374.1| PREDICTED: uncharacterized protein LOC104609...   296   6e-89
ref|XP_021682122.1| uncharacterized protein LOC110666060 [Hevea ...   278   2e-87
ref|XP_021682117.1| uncharacterized protein LOC110666057 [Hevea ...   267   3e-83
gb|PNX79703.1| retrovirus-related Pol polyprotein from transposo...   271   3e-82
ref|XP_020535115.1| uncharacterized protein LOC110009491 [Jatrop...   261   6e-81
gb|PKI60126.1| hypothetical protein CRG98_019470, partial [Punic...   257   4e-80
gb|OMO88814.1| Reverse transcriptase, RNA-dependent DNA polymera...   266   2e-79
gb|KZV23217.1| Cysteine-rich RLK (receptor-like protein kinase) ...   275   2e-79
ref|XP_021602663.1| uncharacterized protein LOC110607810 [Maniho...   258   3e-79
ref|XP_021602662.1| uncharacterized protein LOC110607809 [Maniho...   258   3e-79
gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymera...   263   9e-79
gb|PNX84630.1| retrovirus-related Pol polyprotein from transposo...   259   2e-78
ref|XP_014628734.1| PREDICTED: uncharacterized protein LOC106794...   269   6e-78
gb|PNY03100.1| retrovirus-related Pol polyprotein from transposo...   260   9e-78
gb|OMO49485.1| Reverse transcriptase, RNA-dependent DNA polymera...   256   1e-77
gb|PNX77479.1| retrovirus-related Pol polyprotein from transposo...   259   6e-77
ref|XP_019107755.1| PREDICTED: uncharacterized protein LOC109136...
249 2e-76 gb|PNX84365.1| retrovirus-related Pol polyprotein from transposo... 253 9e-76 gb|PNX93131.1| retrovirus-related Pol polyprotein from transposo... 261 2e-75 dbj|GAU37804.1| hypothetical protein TSUD_276210, partial [Trifo... 254 2e-75 >ref|XP_010274374.1| PREDICTED: uncharacterized protein LOC104609701 [Nelumbo nucifera] Length = 946 Score = 296 bits (758), Expect = 6e-89 Identities = 150/281 (53%), Positives = 184/281 (65%), Gaps = 2/281 (0%) Frame = +1 Query: 55 IALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASS--VDSTPGTPYPLPTFPFVLSPSL 228 + LR R K P+W++D+ PT +S S+ + Y PTFP+ SP Sbjct: 494 LTLRHSTRQKFKPAWMNDFVSNVIVLTNLPTVITTSNTTTSSGSSAYTPPTFPYHKSPIF 553 Query: 229 CPTYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGC 408 TY L+N+SS+ EP SY QA + +W++A+N EL A E N TW L LP KKAIG Sbjct: 554 TNTYIYLLSNVSSVPEPSSYYQARKNEKWIEAINKELQAFESNNTWELVPLPPKKKAIGS 613 Query: 409 KWVYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWV 588 KWVYKVK DG++D KARLVAKGY+QI G+DY D FSPVAK+VTVR+ +A+A K W Sbjct: 614 KWVYKVKYLLDGTIDSYKARLVAKGYHQIEGVDYNDSFSPVAKVVTVRIFLAIAIAKNWA 673 Query: 589 LYQLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKV 768 L+QLDINN FLHGYLDE++++Q QGYTKA +V LK+SLYGLKQA RQWNV+FC K+ Sbjct: 674 LHQLDINNAFLHGYLDEEVFIQPPQGYTKAKPHEVSLLKRSLYGLKQASRQWNVKFCVKL 733 Query: 769 LAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPS 891 AYGFTQSAHDHCLFTK SF GTH S Sbjct: 734 QAYGFTQSAHDHCLFTKSTSSSFLALLLYIDDVLVTGTHES 774 >ref|XP_021682122.1| uncharacterized protein LOC110666060 [Hevea brasiliensis] Length = 396 Score = 278 bits (711), Expect = 2e-87 Identities = 141/219 (64%), Positives = 159/219 (72%) Frame = +1 Query: 325 MNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGI 504 M EL ALE+N TW LTSLP GKKAIGCKWVYKVK KP+G VDR KARLVAKGYNQ+ G+ Sbjct: 1 MQQELMALERNNTWELTSLPIGKKAIGCKWVYKVKCKPNGEVDRFKARLVAKGYNQMEGL 60 Query: 505 DYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACA 684 DY D FSPVAKLVTVR+ IALAT KQW +YQLDINN FLHGYLDE+ YMQ QGY KA Sbjct: 61 
DYKDRFSPVAKLVTVRLFIALATTKQWPIYQLDINNAFLHGYLDEEDYMQPPQGYDKAKP 120 Query: 685 GQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXX 864 GQVC LK+SLYGLKQA RQWN+EF + + GFTQS HD+CLFT+ +F Sbjct: 121 GQVCLLKRSLYGLKQASRQWNLEFTSFLKSLGFTQSQHDYCLFTQDTAANFMALLIYVND 180 Query: 865 XXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 GT + FK+ L F IKDLG KYFLG+E+A Sbjct: 181 VIITGTSSEAIAAFKAALHSKFTIKDLGFMKYFLGLEVA 219 >ref|XP_021682117.1| uncharacterized protein LOC110666057 [Hevea brasiliensis] Length = 396 Score = 267 bits (683), Expect = 3e-83 Identities = 137/219 (62%), Positives = 155/219 (70%) Frame = +1 Query: 325 MNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGI 504 M EL ALE+N TW LTSLP GKKAIGCKWVYKVK KP+G VDR KARL AKGYNQ+ G+ Sbjct: 1 MQQELMALERNNTWELTSLPIGKKAIGCKWVYKVKCKPNGEVDRFKARLFAKGYNQMEGL 60 Query: 505 DYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACA 684 DY D FSPVAKLVTVR+ IALAT KQW +YQLDINN FLHGYLDE+ YMQ QGY KA Sbjct: 61 DYKDKFSPVAKLVTVRLFIALATTKQWPIYQLDINNAFLHGYLDEEDYMQPPQGYDKAKP 120 Query: 685 GQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXX 864 G VC LK+SLYGLKQA RQWN+EF + + GFTQS HD+CLFT+ +F Sbjct: 121 GLVCLLKRSLYGLKQASRQWNLEFTSFLKSLGFTQSQHDYCLFTQDTAANFMALLIYVTD 180 Query: 865 XXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 GT + K+ L F IK LG KYFLG+E+A Sbjct: 181 VIITGTSFEAIAAVKAALHSKFTIKGLGFMKYFLGLEVA 219 >gb|PNX79703.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 598 Score = 271 bits (692), Expect = 3e-82 Identities = 141/312 (45%), Positives = 188/312 (60%) Frame = +1 Query: 55 IALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCP 234 I+ RK +RH+ PS L DY TPYP+ F + L P Sbjct: 28 ISSRKSIRHRKLPSHLLDYHCNAVVHK---------------TPYPISNF--LSHDLLSP 70 Query: 235 TYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKW 414 +Y+ F ++ + EP SYA+A ++ W+ AMN EL+AL KN TWI+ LP G K IG KW Sbjct: 71 SYSKFCLSILADHEPNSYAEASKHECWIQAMNNELTALAKNNTWIIVDLPDGAKPIGRKW 130 Query: 415 
VYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLY 594 VYK+KRK DGS+DR KARLVAKGYNQI G+D+ FSPVAK+ T+R V+A+A++K W ++ Sbjct: 131 VYKIKRKADGSIDRYKARLVAKGYNQIEGVDFFQTFSPVAKMSTIRTVLAVASIKNWHIH 190 Query: 595 QLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLA 774 QLD++N FLHG L+E +YM++ QG T A QVC+LKKSLYGL+QA R+W + ++ Sbjct: 191 QLDVDNAFLHGDLEEDVYMKAPQGLTGVSANQVCKLKKSLYGLRQASRKWYEKLSQFLIT 250 Query: 775 YGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLA 954 G+ Q D LFTK F G +D KS L + F IKDLG+ Sbjct: 251 IGYAQMTSDPTLFTKSTSSDFSVLLVYVDDIVLTGNCMDEIDATKSQLHKAFRIKDLGIL 310 Query: 955 KYFLGMEIAWGQ 990 K+FLG+E+A Q Sbjct: 311 KFFLGLEVAHSQ 322 >ref|XP_020535115.1| uncharacterized protein LOC110009491 [Jatropha curcas] Length = 385 Score = 261 bits (667), Expect = 6e-81 Identities = 126/223 (56%), Positives = 155/223 (69%) Frame = +1 Query: 313 WVDAMNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQ 492 WV+AMN EL++LE+N TWILT LP K +GCKWVY++K DGS+DR KARLVAKGYNQ Sbjct: 3 WVNAMNNELASLEQNNTWILTDLPPNTKPVGCKWVYRIKYNADGSIDRYKARLVAKGYNQ 62 Query: 493 IFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYT 672 + G+DYL FSPVAKLVTVR+ + +A W + LDINN +LHG +DE IYMQ GY Sbjct: 63 LLGLDYLHTFSPVAKLVTVRVFLTIAVANSWSVQHLDINNAYLHGTIDEDIYMQVPPGYD 122 Query: 673 KACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXX 852 KA GQVC+L++SLYGLKQA RQWN E +L+ GFTQS+ DHCLFTK G SF Sbjct: 123 KAAEGQVCKLQRSLYGLKQAGRQWNKELTTSLLSQGFTQSSFDHCLFTKGCGASFFALLV 182 Query: 853 XXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 +L+ K+ LD+ F IK+LG KYFLG+E+A Sbjct: 183 YVDGCLITSPSVTLISQLKTYLDQKFTIKNLGDVKYFLGIEVA 225 >gb|PKI60126.1| hypothetical protein CRG98_019470, partial [Punica granatum] Length = 336 Score = 257 bits (657), Expect = 4e-80 Identities = 134/308 (43%), Positives = 181/308 (58%) Frame = +1 Query: 55 IALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCP 234 + LR+ R P D+ P ++ S DS+ GT YP+ F + + Sbjct: 10 VNLRRSKRVSSLPKHFKDFIVHTARHKTPPPSSFISTDSS-GTSYPIEKF--IDYSGISD 66 
Query: 235 TYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKW 414 + FLA + S EP SY +A + W AM E+ ALE N+TW + LP K+ IGCKW Sbjct: 67 HHRAFLAAIDSDSEPTSYREAVKDQRWRIAMAEEIRALELNKTWTIEQLPPSKRPIGCKW 126 Query: 415 VYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLY 594 VYKVKR+ DGS++R KARLV KG+ Q+ G+DY + F+PVAKLVTVR ++ +A K+W ++ Sbjct: 127 VYKVKRRADGSIERYKARLVVKGFTQVEGVDYCETFAPVAKLVTVRCLLTVAVAKEWQIH 186 Query: 595 QLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLA 774 Q+++NN FLH LDE++YM+ G++ + G VC L+KSLYGL+QA R W +F D + Sbjct: 187 QMNVNNAFLHRDLDEEVYMELPPGFSTSRNGNVCRLRKSLYGLRQASRNWFSKFADALRQ 246 Query: 775 YGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLA 954 YGF QS DH LFT G F G S D FK LD+ F IKDLG Sbjct: 247 YGFIQSGADHSLFTFTRGTIFLGVLVYVDDLIIVGNSRSHCDSFKGYLDKCFRIKDLGPL 306 Query: 955 KYFLGMEI 978 KYFLG+E+ Sbjct: 307 KYFLGIEV 314 >gb|OMO88814.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus capsularis] Length = 724 Score = 266 bits (681), Expect = 2e-79 Identities = 139/307 (45%), Positives = 187/307 (60%) Frame = +1 Query: 61 LRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCPTY 240 L + R++ PP +L Y + +SS S GT YP+ F + + L TY Sbjct: 210 LARSQRNRRPPPYLQYYECSKVRRQP---SQSSSTTSGSGTRYPISNF--LSTHRLSSTY 264 Query: 241 TCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKWVY 420 + F++N++S+ EP+SY++A + P W A++AEL ALE N+TW + LP K +GCKWV+ Sbjct: 265 STFVSNITSIAEPKSYSEAIKDPNWKAAIDAELHALEANKTWSIVDLPPHKSPVGCKWVF 324 Query: 421 KVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQL 600 KVK K DGS++R KARLVAKGY Q GID+ + F+PVAK+ TVR ++A+A+ K W LYQL Sbjct: 325 KVKYKSDGSIERYKARLVAKGYTQQEGIDFHETFAPVAKMTTVRCLLAIASTKNWPLYQL 384 Query: 601 DINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAYG 780 D+ N LHG LDE++YM G T VC+L KSLYGLKQA QW +F +L YG Sbjct: 385 DVQNALLHGDLDEEVYMSLPPGVTSKGENSVCKLHKSLYGLKQASLQWFAKFSTALLTYG 444 Query: 781 FTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKY 960 F QS D+ LF K F G + L+D K+ L R 
F+IKDLG KY Sbjct: 445 FVQSRSDYSLFIKSSKTDFVAILVYVDDIVITGNNSKLIDSVKNALQRQFSIKDLGSLKY 504 Query: 961 FLGMEIA 981 FLG+E+A Sbjct: 505 FLGLEVA 511 >gb|KZV23217.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum] Length = 1406 Score = 275 bits (702), Expect = 2e-79 Identities = 140/308 (45%), Positives = 180/308 (58%) Frame = +1 Query: 58 ALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCPT 237 A ++ RH+ PP W++DY V S TPY L + + + P Sbjct: 829 ATKQTSRHRTPPRWMNDY-----------------VCSHSSTPYGLEKY--LSHKYISPA 869 Query: 238 YTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKWV 417 Y+ FL +S EP+SY +A P W DAM AE++ALE N TW + +P GKKAIGC+WV Sbjct: 870 YSSFLTAISQSTEPKSYKEAATDPNWRDAMAAEIAALEANNTWTIVQIPPGKKAIGCRWV 929 Query: 418 YKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQ 597 YK+K + DG++DR KARLVAKGY Q +GIDY D FSPVAK+VTVR +I +A K W L+Q Sbjct: 930 YKIKYRSDGTIDRYKARLVAKGYTQQYGIDYQDTFSPVAKIVTVRCIITIAAAKAWPLHQ 989 Query: 598 LDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAY 777 +D+ N FL G LDE IYM G+ K C+L KSLYGLKQA RQWN +FC + Sbjct: 990 MDVTNAFLQGDLDEDIYMTIPPGFGKQSPNLACKLLKSLYGLKQASRQWNTKFCQVLAQA 1049 Query: 778 GFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAK 957 G+ QS HDH +F+K DG G + K +L + IKDLG K Sbjct: 1050 GYKQSQHDHSMFSKQDGPRITILIVYVDDIVITGNDNDSISQLKLHLHKHLHIKDLGPLK 1109 Query: 958 YFLGMEIA 981 YFLG+E+A Sbjct: 1110 YFLGIEVA 1117 >ref|XP_021602663.1| uncharacterized protein LOC110607810 [Manihot esculenta] Length = 421 Score = 258 bits (659), Expect = 3e-79 Identities = 130/219 (59%), Positives = 154/219 (70%) Frame = +1 Query: 325 MNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGI 504 M EL AL+ N TW LT+LPK KKAIGCKWVYKVK KP+G V+R KARLVAKGYNQI G+ Sbjct: 1 MRQELQALKTNNTWTLTTLPKNKKAIGCKWVYKVKFKPNGDVERYKARLVAKGYNQIEGL 60 Query: 505 DYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACA 684 DY D FSPVAKL TVR++I +AT KQW ++QLDINN FLHGYL+E++ + QGY KA Sbjct: 61 
DYKDRFSPVAKLTTVRILIVMATSKQWPIFQLDINNAFLHGYLNEEVCLTPPQGYHKAQP 120 Query: 685 GQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXX 864 GQVC LK+SLYGLKQA RQWN+EF + + GF QS HD+CLFT+ G+ Sbjct: 121 GQVCLLKRSLYGLKQASRQWNIEFTNFLKKLGFVQSPHDYCLFTQHTGNLILILLVYVDD 180 Query: 865 XXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 G ++ K LD F IKDLG KYFLG+EIA Sbjct: 181 VILTGNSIDAINKCKLALDSKFTIKDLGPMKYFLGLEIA 219 >ref|XP_021602662.1| uncharacterized protein LOC110607809 [Manihot esculenta] Length = 421 Score = 258 bits (659), Expect = 3e-79 Identities = 129/219 (58%), Positives = 154/219 (70%) Frame = +1 Query: 325 MNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGI 504 M EL AL+ N TW LT+LPK KKAIGCKWVYKVK KP+G ++R KARLVAKGYNQI G+ Sbjct: 1 MRQELQALKTNNTWTLTTLPKNKKAIGCKWVYKVKFKPNGDIERYKARLVAKGYNQIEGL 60 Query: 505 DYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACA 684 DY D FSPVAKL TVR++I +AT KQW ++QLDINN FLHGYL+E++ + QGY KA Sbjct: 61 DYKDRFSPVAKLTTVRILIVMATSKQWPIFQLDINNAFLHGYLNEEVCLTPPQGYHKAQP 120 Query: 685 GQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXX 864 GQVC LK+SLYGLKQA RQWN+EF + GF QS HD+CLFT+ G+ Sbjct: 121 GQVCLLKRSLYGLKQASRQWNIEFTNFFKKLGFVQSPHDYCLFTQHTGNLILILLVYVDD 180 Query: 865 XXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 G +++ K LD F IKDLG KYFLG+EIA Sbjct: 181 VILTGNSIDIINKCKLALDSKFTIKDLGPMKYFLGLEIA 219 >gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus capsularis] Length = 666 Score = 263 bits (673), Expect = 9e-79 Identities = 138/307 (44%), Positives = 186/307 (60%) Frame = +1 Query: 61 LRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCPTY 240 L + R++ PP +L Y + +SS S GT YP+ F + + L TY Sbjct: 90 LARSQRNRRPPPYLQYYECSKVRRQP---SQSSSTTSGSGTRYPISNF--LSTHRLSSTY 144 Query: 241 TCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKWVY 420 + F++N++S+ EPQSY++A + P W A++AEL ALE N+TW + LP K +GCKWV+ Sbjct: 145 STFVSNITSIAEPQSYSEAIKDPNWKAAIDAELHALEANKTWSIVDLPPHKSPVGCKWVF 204 Query: 421 
KVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQL 600 KVK K GS++R KARLVAKGY Q GID+ + F+PVAK+ TVR ++A+A+ K W LYQL Sbjct: 205 KVKYKSYGSIERYKARLVAKGYTQQEGIDFHETFAPVAKMTTVRCLLAIASTKNWPLYQL 264 Query: 601 DINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAYG 780 D+ N LHG LDE++YM G T VC+L KSLYGL+QA QW +F +L YG Sbjct: 265 DVQNALLHGDLDEEVYMSLPPGVTSKGENSVCKLHKSLYGLRQASLQWFAKFSTALLTYG 324 Query: 781 FTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKY 960 F QS D+ LF K F G + L+D K+ L R F+IKDLG KY Sbjct: 325 FVQSRSDYSLFIKSSKTDFVAILVYVDDIVITGNNSKLIDSVKNALQRQFSIKDLGSLKY 384 Query: 961 FLGMEIA 981 FLG+E+A Sbjct: 385 FLGLEVA 391 >gb|PNX84630.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 543 Score = 259 bits (662), Expect = 2e-78 Identities = 144/309 (46%), Positives = 183/309 (59%) Frame = +1 Query: 55 IALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCP 234 I +RK R + P L+DY + A S S+ G YP+ F + L Sbjct: 175 IVIRKSTRMRSQPGHLNDYVCNL--------SDAYSKSSSQGMLYPISNFHSCAN--LST 224 Query: 235 TYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKW 414 ++T F+ ++++ EP +Y QA WV AMNAEL AL++N+TWIL P K IG KW Sbjct: 225 SHTKFVLSVNNDVEPNTYHQASLQDCWVQAMNAELHALQQNKTWILVDAPPNVKPIGSKW 284 Query: 415 VYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLY 594 VYKVK K DGS++R KARLVAKGY Q+ GID+ + FSPVAK+ TVR +IALA +K W L+ Sbjct: 285 VYKVKHKADGSIERYKARLVAKGYTQVEGIDFFETFSPVAKITTVRTLIALAAIKSWHLH 344 Query: 595 QLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLA 774 QLD+NN FLHG L E++YM QG T + QVC+L KSLYGLKQA R+W + +LA Sbjct: 345 QLDVNNAFLHGELQEEVYMSIPQGVTTSKPNQVCKLLKSLYGLKQASRKWYEKLTSVLLA 404 Query: 775 YGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLA 954 G+ QS+ DH LFT SF G K LD LF IKDLG Sbjct: 405 QGYMQSSSDHSLFTLHKDSSFTALLVYVDDIILAGDSHDEFLHIKKLLDDLFRIKDLGQL 464 Query: 955 KYFLGMEIA 981 KYFLG+E+A Sbjct: 465 KYFLGIEVA 473 >ref|XP_014628734.1| PREDICTED: uncharacterized protein LOC106794221 
[Glycine max] Length = 1150 Score = 269 bits (688), Expect = 6e-78 Identities = 135/284 (47%), Positives = 180/284 (63%) Frame = +1 Query: 154 ASSVDSTPGTPYPLPTFPFVLSPSLCPTYTCFLANLSSMQEPQSYAQACQYPEWVDAMNA 333 ++++ + PGT +P+ + + L +Y F+ N+S+ EP+SY +AC++ WV AM+ Sbjct: 282 SATITNHPGTKHPISSV--ISYNKLSSSYHSFILNISANSEPKSYNEACKHDSWVQAMHD 339 Query: 334 ELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYL 513 E+SALE+N TW+LT LP+ K IGCKWVYK+K DGS++R KARLVAKGY QI G+DYL Sbjct: 340 EISALERNNTWVLTDLPQHKNVIGCKWVYKIKHNSDGSIERYKARLVAKGYTQIEGLDYL 399 Query: 514 DVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACAGQV 693 D FSPVAK+ TVR+++ALA + W L QLD+NN FLHG L E++YM G A GQV Sbjct: 400 DTFSPVAKITTVRLLLALAALNNWYLRQLDVNNAFLHGGLHEEVYMVLPPGMKSAKPGQV 459 Query: 694 CELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXX 873 C+L++SLYGLKQA RQW + +GF QS D+ LFT +SF Sbjct: 460 CKLQRSLYGLKQASRQWYARLSTFLAHHGFKQSVADYSLFTLKKANSFTALLVYVDDIVL 519 Query: 874 XGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIAWGQC*VGI 1005 G S++ LD F IKDLG KYFLG E+A G + I Sbjct: 520 SGNDLSVISSITKLLDDTFKIKDLGNLKYFLGFEVARGTSGINI 563 >gb|PNY03100.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 629 Score = 260 bits (664), Expect = 9e-78 Identities = 136/318 (42%), Positives = 192/318 (60%) Frame = +1 Query: 52 PIALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLC 231 P LR+ R+ PP +L++ P ++A+++ S+ YP+ ++ V ++ Sbjct: 37 PEPLRRSTRNSHPPPFLTENYYCNLTSATLPDSSAATLSSS-SCKYPISSY--VSYQNIS 93 Query: 232 PTYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCK 411 + FL NLS++ EP Y +A W A+NAELSALEKN TW L LP K AIGCK Sbjct: 94 SAHNHFLFNLSTIPEPTCYEKAVCDENWKTAINAELSALEKNNTWKLVPLPLHKHAIGCK 153 Query: 412 WVYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVL 591 WV+K+K DG+++R KARLVAKGY Q GIDY+D FSPV K+ T+R+++A+A + W L Sbjct: 154 WVFKLKLHADGTIERYKARLVAKGYTQTEGIDYMDTFSPVVKMTTIRVLLAVAAAQNWPL 213 Query: 592 YQLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVL 771 YQLD+N FLHG 
L+E++YMQ G + + VC+L++SLYGLKQA RQWN + + + Sbjct: 214 YQLDVNTAFLHGDLNEEVYMQPPPGLSLPHSNLVCKLQRSLYGLKQASRQWNTKLTETLT 273 Query: 772 AYGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGL 951 A G+ QS D+ LFTK GT + + + K+ LD F+IKDLG Sbjct: 274 ASGYVQSKSDYSLFTKQASSGLTIILVYVDDLVLGGTDSNEIQNIKALLDEKFSIKDLGY 333 Query: 952 AKYFLGMEIAWGQC*VGI 1005 KYFLG E+A Q + + Sbjct: 334 LKYFLGFEVARTQAGISL 351 >gb|OMO49485.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus capsularis] Length = 506 Score = 256 bits (655), Expect = 1e-77 Identities = 132/275 (48%), Positives = 175/275 (63%) Frame = +1 Query: 157 SSVDSTPGTPYPLPTFPFVLSPSLCPTYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAE 336 S S T YPL + T+ FLA +SS+ EP+S++QA ++ W DAM E Sbjct: 209 SPAPSANSTVYPLSHT--ISYSKFFNTHVAFLAAISSIDEPKSFSQAVKHVHWRDAMEKE 266 Query: 337 LSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLD 516 +SALE N TW LTSLP K+AI KW+YKVK PDG+V+R KARLVAKGY QI G+D+ + Sbjct: 267 ISALEANHTWTLTSLPPVKRAIDSKWIYKVKFNPDGTVERYKARLVAKGYTQIEGVDFHE 326 Query: 517 VFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVC 696 F+PVAKLVT++ ++A+A+V+ W L+QLD+NN FLHG L+E++YM+ Q + K +VC Sbjct: 327 TFAPVAKLVTIQCLLAVASVRNWELHQLDVNNAFLHGDLEEEVYMKIPQDFAKQGEHRVC 386 Query: 697 ELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXX 876 L+KSLYGLKQA R W +F +L GF QS DH LFT G+S+ Sbjct: 387 RLQKSLYGLKQASRNWYQKFTQALLTAGFIQSTSDHSLFTSTQGESYIAVLIYVDDVIVT 446 Query: 877 GTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEIA 981 G + + K L+ F IKDLG KYFLG+E+A Sbjct: 447 GNDSTKIAWLKEYLNTKFQIKDLGQLKYFLGLEVA 481 >gb|PNX77479.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 671 Score = 259 bits (661), Expect = 6e-77 Identities = 139/311 (44%), Positives = 183/311 (58%) Frame = +1 Query: 49 APIALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSL 228 AP LR+ R PPS D+ + S S T YPL + V L Sbjct: 276 APPDLRRSSRISRPPSHHRDFKTYHAAILGH----SDSSSSMSSTRYPLQRY--VSYSGL 329 Query: 229 
CPTYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGC 408 TY F+ N+S + EP +Y QAC WV AMN+E+ ALE+N+TW + LP G++ IGC Sbjct: 330 SDTYRHFVNNISLLVEPTTYEQACHDSNWVAAMNSEIQALEENKTWSMVPLPLGQRPIGC 389 Query: 409 KWVYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWV 588 KWV+K+K DG+++R KARLVAKG+ Q GIDY D F+PVAKL+TVR ++A+A V+ W Sbjct: 390 KWVFKIKYNADGTIERHKARLVAKGFTQREGIDYKDTFAPVAKLITVRCLLAIAAVRHWP 449 Query: 589 LYQLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKV 768 L+Q+D+ N FLHG L E++YM GY + VC L KSLYGLKQA R W F + Sbjct: 450 LHQMDVQNAFLHGDLVEEVYMLPPPGYCRQGENVVCRLHKSLYGLKQASRSWFQRFSCAI 509 Query: 769 LAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLG 948 GF QS D+ LFT+V GDS G + + ++D K L+ F IKDLG Sbjct: 510 QEIGFQQSKADYSLFTQVRGDSITVVLLYVDDMVITGNNETTINDLKKFLNSCFKIKDLG 569 Query: 949 LAKYFLGMEIA 981 + KYFLG+E+A Sbjct: 570 VLKYFLGIEVA 580 >ref|XP_019107755.1| PREDICTED: uncharacterized protein LOC109136272 [Beta vulgaris subsp. vulgaris] Length = 381 Score = 249 bits (637), Expect = 2e-76 Identities = 128/218 (58%), Positives = 148/218 (67%) Frame = +1 Query: 325 MNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRKPDGSVDRCKARLVAKGYNQIFGI 504 MN EL ALE+NETW LT LP GKKAIG KWVYK K KPDG+++R KARLVA GY Q+ G Sbjct: 1 MNKELKALEENETWDLTVLPSGKKAIGSKWVYKTKLKPDGTIERHKARLVAIGYQQVEGQ 60 Query: 505 DYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNVFLHGYLDEKIYMQSTQGYTKACA 684 D+ FSPVAKL TVR+VIALATV W L QLD+NN FLHG+LDE++YM + GYTKA Sbjct: 61 DFTQTFSPVAKLATVRIVIALATVHNWSLCQLDVNNAFLHGFLDEEVYMLPSAGYTKAKK 120 Query: 685 GQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSAHDHCLFTKVDGDSFXXXXXXXXX 864 G+VC LKKSLYGLKQA RQWN E +L+ GF QS D+ LFT+ F Sbjct: 121 GEVCRLKKSLYGLKQASRQWNKELARFLLSLGFQQSKQDYSLFTRTKNGEFLVILVYVDD 180 Query: 865 XXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGMEI 978 GT +D+ K LD F IKDLG YFLG+EI Sbjct: 181 MMVTGTSLFQIDEVKQALDLAFTIKDLGDLNYFLGVEI 218 >gb|PNX84365.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 562 Score = 253 bits (646), Expect = 9e-76 Identities 
= 137/307 (44%), Positives = 179/307 (58%) Frame = +1 Query: 61 LRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCPTY 240 +RK R K PPS+L DY A+S S+ P F+ L + Sbjct: 16 IRKSDRVKHPPSYLQDYHTKILGNISHSAPDATSPSSSQ---CKFPISSFISYNHLSSAH 72 Query: 241 TCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKWVY 420 + NLS++ EP SY +A W A+N EL+AL KN+TW L LP KKAIGCKWV+ Sbjct: 73 KHYALNLSTLTEPSSYEEAMCDKNWESAVNVELAALLKNKTWDLVKLPPHKKAIGCKWVF 132 Query: 421 KVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQL 600 K+K DG+V+R KARLVAKG+ Q GIDY D FSPV K+ TVR +A+A + W L+QL Sbjct: 133 KLKLHADGTVERYKARLVAKGFTQTEGIDYTDTFSPVVKMTTVRTFLAIAASQNWPLFQL 192 Query: 601 DINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAYG 780 D+N FLHG LDE++YM+ G + A VC+L++SLYGLKQA RQWN + + +L+ G Sbjct: 193 DVNTTFLHGDLDEEVYMKPPPGLSLAQPDLVCKLQRSLYGLKQASRQWNAKLTETLLSSG 252 Query: 781 FTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKY 960 + QS D+ LFTK F GT + + K+ LD F+IKDLG KY Sbjct: 253 YIQSKADYSLFTKNTSTGFTAILVYVDDLVLGGTDINEIHQLKALLDNKFSIKDLGSLKY 312 Query: 961 FLGMEIA 981 FLG E+A Sbjct: 313 FLGFEVA 319 >gb|PNX93131.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 982 Score = 261 bits (666), Expect = 2e-75 Identities = 135/302 (44%), Positives = 187/302 (61%) Frame = +1 Query: 76 RHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCPTYTCFLA 255 R K PS+LSD+ + S+ S+ GT YP+ +F + L P+++ F + Sbjct: 394 RIKHRPSYLSDFVCS--------ASDDSAKSSSTGTIYPISSFHSL--SQLSPSHSVFTS 443 Query: 256 NLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKWVYKVKRK 435 +L+ EP++Y +AC+ W+ AM +EL AL + TW + LP K IG KWVYK+K K Sbjct: 444 SLTQHTEPRTYTEACKSQHWIQAMTSELEALARTGTWKIVDLPPNVKPIGSKWVYKIKHK 503 Query: 436 PDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLYQLDINNV 615 DG+++R KARLVAKGYNQ+ G+D+ D FSPVAKL TVRM++A+A++K W L+QLD+NN Sbjct: 504 SDGTIERYKARLVAKGYNQVEGLDFFDTFSPVAKLTTVRMLLAIASIKGWFLHQLDVNNA 563 Query: 616 FLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLAYGFTQSA 795 
FLHG L E +YM G + QVC+L KSLYGLKQA R+W + ++ G+TQS+ Sbjct: 564 FLHGDLQENVYMSIPDGVQCSKPNQVCKLLKSLYGLKQASRKWYEKLTSLLVKEGYTQSS 623 Query: 796 HDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLAKYFLGME 975 DH LFT D+F GT ++ K+ LD F IKDLG+ KYFLG+E Sbjct: 624 SDHSLFTISQQDNFTALLIYVDDIILAGTSLQEINRIKNILDTHFKIKDLGVVKYFLGLE 683 Query: 976 IA 981 +A Sbjct: 684 VA 685 >dbj|GAU37804.1| hypothetical protein TSUD_276210, partial [Trifolium subterraneum] Length = 633 Score = 254 bits (649), Expect = 2e-75 Identities = 133/317 (41%), Positives = 186/317 (58%) Frame = +1 Query: 55 IALRKGVRHKVPPSWLSDYXXXXXXXXXXPTAAASSVDSTPGTPYPLPTFPFVLSPSLCP 234 + LR+ R+ PP++L DY T SS + P + P F+ ++ Sbjct: 104 VPLRQSTRNCHPPTYLQDYYCNHLSN----TIHDSSGNMEPSSSCKYPISSFISYQNISS 159 Query: 235 TYTCFLANLSSMQEPQSYAQACQYPEWVDAMNAELSALEKNETWILTSLPKGKKAIGCKW 414 + +L N+S++ EP Y +A W A+ AEL+ALEKN TW L SLP K +IGCKW Sbjct: 160 AHKHYLLNISTISEPTCYEKAICDENWRTAIQAELTALEKNNTWKLVSLPPHKHSIGCKW 219 Query: 415 VYKVKRKPDGSVDRCKARLVAKGYNQIFGIDYLDVFSPVAKLVTVRMVIALATVKQWVLY 594 V+K+K G+++R KARLVAKGY Q GIDYLD FSPV K+ T+RM++A+A + W LY Sbjct: 220 VFKLKLHASGTIERYKARLVAKGYTQTEGIDYLDTFSPVVKMTTIRMLLAIAASENWPLY 279 Query: 595 QLDINNVFLHGYLDEKIYMQSTQGYTKACAGQVCELKKSLYGLKQAPRQWNVEFCDKVLA 774 QLD+N FLHG L+E++YMQ G + VC+L++SLYGLKQA +QWN + + + + Sbjct: 280 QLDVNTAFLHGDLNEEVYMQPPPGLALSNPNLVCKLQRSLYGLKQASKQWNTKLTETLTS 339 Query: 775 YGFTQSAHDHCLFTKVDGDSFXXXXXXXXXXXXXGTHPSLLDDFKSNLDRLFAIKDLGLA 954 G+ QS D+ LFTK F GT + + + K+ LD F+IKDLG Sbjct: 340 SGYVQSKSDYSLFTKQASTGFTVILVCVDDLVLGGTDSTEIQNIKALLDAKFSIKDLGSL 399 Query: 955 KYFLGMEIAWGQC*VGI 1005 KYFLG E+A Q + + Sbjct: 400 KYFLGFEVARTQAGISL 416