BLASTX nr result
ID: Rehmannia31_contig00025671
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00025671 (1166 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|OMO78632.1| Integrase, catalytic core [Corchorus capsularis]       343   e-104
gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus cap...   341   e-103
ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-r...   324   e-101
gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifo...   323   3e-99
ref|XP_010274374.1| PREDICTED: uncharacterized protein LOC104609...   322   2e-98
gb|KYP37906.1| Retrovirus-related Pol polyprotein from transposo...   325   2e-98
gb|PNX84630.1| retrovirus-related Pol polyprotein from transposo...   311   3e-98
gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposo...   327   6e-98
gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) ...   327   1e-97
ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804...   317   2e-97
dbj|GAU49938.1| hypothetical protein TSUD_290950 [Trifolium subt...   323   2e-97
gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposo...   320   3e-97
emb|CAN74229.1| hypothetical protein VITISV_000584 [Vitis vinifera]   320   1e-96
ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804...   317   1e-96
ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804...   317   3e-96
gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymera...
309 6e-96 emb|CAN81016.1| hypothetical protein VITISV_025518 [Vitis vinifera] 321 2e-95 gb|KYP77128.1| Retrovirus-related Pol polyprotein from transposo... 307 2e-94 ref|XP_010526681.1| PREDICTED: uncharacterized protein LOC104804... 317 2e-94 ref|XP_010526680.1| PREDICTED: uncharacterized protein LOC104804... 317 2e-94 >gb|OMO78632.1| Integrase, catalytic core [Corchorus capsularis] Length = 1247 Score = 343 bits (879), Expect = e-104 Identities = 190/405 (46%), Positives = 257/405 (63%), Gaps = 22/405 (5%) Frame = +3 Query: 18 KGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPLKDV-- 191 + KF ARA+ C+FLG+ G KGY+++++ T +IF SRDVVF E FPFQ SP + Sbjct: 618 RDKFHARATTCLFLGYPHGQKGYKVYDLTTHKIFTSRDVVFCEHIFPFQDKNSPNHNTST 677 Query: 192 --PLPVPAHDIGEET----------------EGTDTQDVEPTPPPPA-QQLGRGHRIKHK 314 P+P+P D E E D+ D T PP Q R R++ Sbjct: 678 TTPIPLPIFDDTESMSLGSAPHTTTMMTHTPEILDSNDTTNTNTPPTIPQENRPVRVRKL 737 Query: 315 PAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLDCFPYTYSSILSPAYSAFLTQITESQS 491 P+ +++ + T + + Y L F +YS SP +++FL IT+ + Sbjct: 738 PSRYHNFHVDLPGNNKSTPTSSNNASSGMSYPLVNF-LSYSKFNSP-HTSFLMAITQ-HN 794 Query: 492 EPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVD 671 EPT+Y +A++D W EAM +EL ALEQN TW L LP GKKAIGSKWVYKIK +SDG+++ Sbjct: 795 EPTSYKQAIKDTHWQEAMKKELEALEQNHTWTLEQLPTGKKAIGSKWVYKIKYHSDGTIE 854 Query: 672 RYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYL 851 RYKARLVAKGY QVEG+D+ E+F PVAKL +VR ++A+ K+WELH +D++NAFLHG L Sbjct: 855 RYKARLVAKGYTQVEGLDYTETFPPVAKLTTVRTLLAVVAAKSWELHQLDVHNAFLHGDL 914 Query: 852 EEEVYMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLF 1031 ++E+YM PP GY + VC L++SLYGLKQASRQW +FS +L+FGFIQS D+ LF Sbjct: 915 DKEIYMKPPPGYLSSNDNRVCRLRKSLYGLKQASRQWYAKFSTAILNFGFIQSKADSSLF 974 Query: 1032 VKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 + G F ALL+YVDDV+I N+ + ++LK+YL F IKDLG Sbjct: 975 LHHKGTSFTALLVYVDDVIIASNNNSHTKALKEYLDAWFHIKDLG 1019 Score = 173 bits (438), Expect = 1e-43 Identities = 83/184 (45%), Positives = 126/184 (68%), Gaps = 
1/184 (0%) Frame = +3 Query: 618 IGSKWVYKIKRNSDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQK 797 +G KWV+K K +DGS++R KARLVAKG+ QV G+DF+E+FSPV K ++R+++ +A + Sbjct: 4 VGCKWVFKTKTKADGSLERLKARLVAKGFNQVPGVDFLETFSPVVKPATIRVVLTIALAR 63 Query: 798 NWELHHIDINNAFLHGYLEEEVYMIPPEGYFKAK-KGEVCLLKRSLYGLKQASRQWNLEF 974 +WE+ +D+ NAFLHG+L E V+M P G+ ++ VC L ++LYGL+QA R W F Sbjct: 64 DWEIRQLDVKNAFLHGFLNEPVFMTQPPGFQNSQHPNYVCKLNKALYGLRQAPRVWFDRF 123 Query: 975 SQKLLSFGFIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTI 1154 S LLSFGF S + LFV ++ + LL+YVDD+++TG++ ++ L + F++ Sbjct: 124 STFLLSFGFTCSVAGSSLFVLQSSRGTILLLLYVDDIILTGSNSHFLRDFIAALGREFSM 183 Query: 1155 KDLG 1166 KDLG Sbjct: 184 KDLG 187 >gb|OMO65653.1| hypothetical protein CCACVL1_21443 [Corchorus capsularis] Length = 1245 Score = 341 bits (875), Expect = e-103 Identities = 191/400 (47%), Positives = 255/400 (63%), Gaps = 19/400 (4%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPLKD---VP 194 KF R+S+C+F+G+ G KGYR++++ T +IF+SRDV FYE QFPF+ + D VP Sbjct: 571 KFSPRSSKCIFVGYPNGTKGYRVYDLTTKKIFVSRDVRFYENQFPFENTSTSTNDQTVVP 630 Query: 195 LPVPAHDIGEETEGTDTQD-VEPTPP-----------PPAQQLGRGHRIKHKPAWLNDYV 338 LP E+T+ + T D + P PP PP Q R R K +P L+D V Sbjct: 631 LPAL-----EDTDLSITHDSIPPNPPQEQPQPHPPTNPPNQPSTRPQRTKTRPKRLDDCV 685 Query: 339 TNSIHTAHLIETPA----ESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTY 506 N+ + + +P+ E+ T Y L F +Y + S ++ AFL I+ + EP ++ Sbjct: 686 CNN---SKVDNSPSSLTHEASSGTLYSLSNF-ISYDNFHS-SHKAFLAAIS-LRDEPKSF 739 Query: 507 GEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKAR 686 +AV+ W EAM +EL ALE N TW L +LPP KK IG KW++KIK SDG+++RYKAR Sbjct: 740 SQAVKSPQWREAMQKELAALENNNTWTLETLPPRKKPIGCKWIFKIKYKSDGTIERYKAR 799 Query: 687 LVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVY 866 VAKGY Q+EG+DF E+F+PVAKLV+VR ++A+A KNWELH +D+NNAFLHG L+EEVY Sbjct: 800 FVAKGYNQIEGMDFHETFAPVAKLVTVRCLLAIAAIKNWELHQLDVNNAFLHGDLDEEVY 859 Query: 867 MIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTG 1046 M P GY VC 
+++SLYGLKQASR W +F LL FGFIQST D LF TG Sbjct: 860 MSLPPGYGDKNDSRVCRVRKSLYGLKQASRNWFAKFFAALLEFGFIQSTVDYSLFTLTTG 919 Query: 1047 DDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 FL +L+YVDD++I G+ I+SLK +L F IKDLG Sbjct: 920 SSFLVVLVYVDDLIIAGDDSVRIRSLKQHLDSRFHIKDLG 959 >ref|XP_012486681.1| PREDICTED: LOW QUALITY PROTEIN: retrovirus-related Pol polyprotein from transposon TNT 1-94 [Gossypium raimondii] Length = 683 Score = 324 bits (830), Expect = e-101 Identities = 177/397 (44%), Positives = 241/397 (60%), Gaps = 9/397 (2%) Frame = +3 Query: 3 TTGPHKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPL 182 T H+ KFD RA +CVFLG+ P VKGY L ++ T IF+SR+V F+ET FPF H Sbjct: 13 TLSAHRKKFDPRAKQCVFLGYKPHVKGYILLDIETRAIFVSRNVTFHETIFPFLQHSLNN 72 Query: 183 KDVPLPVPAHDI-------GEETEGTDTQDVEPTPPPPAQQLGRGHRIKHKPAWLNDYVT 341 P+ + A D + TD PP R R + P++L DY Sbjct: 73 PTTPVGLLASDTIYDSPISPPQPSSTDQSSSTSHPPTQPSTSSRPQRNRRPPSYLQDY-- 130 Query: 342 NSIHTAHLIETPAESVKP-TPYRL-DCFPYTYSSILSPAYSAFLTQITESQSEPTTYGEA 515 + PA + P TP+ + +C Y LSP + F I+ S EP TY +A Sbjct: 131 ------QHYQLPAATNHPGTPHSIFNCISYHN---LSPQHLHFTLAISAS-IEPKTYKQA 180 Query: 516 VQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARLVA 695 + W EAM E+ ALEQN TW +T+LPPGK G KWV+++K +DGS +RYKARLVA Sbjct: 181 SKFTHWNEAMQAEINALEQNNTWTMTTLPPGKTPXGCKWVFRVKHRADGSTERYKARLVA 240 Query: 696 KGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYMIP 875 KGY Q+ G+D+ ++FSPVAK+ +VR+++ALAT ++W + +D+NNAFLHG L E+VYM+P Sbjct: 241 KGYTQI-GVDYFDTFSPVAKITTVRLLLALATSRHWHIQQLDVNNAFLHGDLNEDVYMLP 299 Query: 876 PEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGDDF 1055 P G F +VC L +S+YGLKQASRQW + + L+S G+IQST D+ +F K+ +DF Sbjct: 300 PPG-FSHDSTKVCKLHKSIYGLKQASRQWFSKLTTALISLGYIQSTADHSMFTKKHSEDF 358 Query: 1056 LALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 LLIYVDD+++TG S I +K +L F IKDLG Sbjct: 359 TVLLIYVDDIILTGTSSPEIMKVKQFLDTTFRIKDLG 395 >gb|PNX93928.1| hypothetical protein L195_g017092, partial [Trifolium pratense] Length = 865 Score = 323 bits (827), Expect = 
3e-99 Identities = 183/420 (43%), Positives = 243/420 (57%), Gaps = 39/420 (9%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFP----------FQAHP 173 KFD RA +F+G+ G KGY++++ T F+SRDV F ET FP +HP Sbjct: 165 KFDPRAKRGIFVGYPTGQKGYKIYDPETKTFFVSRDVKFCETNFPSIPNTSEPNLISSHP 224 Query: 174 S--PLKDVPLPVPAHDIGEETEGTDTQD------------------VEPTP------PPP 275 S + D+P P +H ++T+ T + VEPTP PP Sbjct: 225 SYEAIDDLPSPTSSHHQSQQTDIPSTHEPNSPSHITTETSSAASPIVEPTPLTTHTTDPP 284 Query: 276 AQ---QLGRGHRIKHKPAWLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILS 446 Q+ + R KH P W NDY ++ + +TP+E + R Y S +S Sbjct: 285 TPFIPQVRKSVRDKHPPIWHNDYHMST----QVNKTPSEPTSGSGTRYPLSHYLSYSRIS 340 Query: 447 PAYSAFLTQITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGS 626 + AFL IT + EP +Y +AV D W +AMN EL ALEQN TW L LP G K IG Sbjct: 341 SSNCAFLANIT-AHREPQSYDQAVHDPLWQDAMNAELEALEQNNTWSLVPLPSGHKPIGC 399 Query: 627 KWVYKIKRNSDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWE 806 KWVYKIK SDG+++RYKARLVAKGY QVEGID+ E+FSP AK+ ++R ++ +A +NW Sbjct: 400 KWVYKIKYKSDGTIERYKARLVAKGYTQVEGIDYQETFSPTAKVTTLRCLLTVAAARNWF 459 Query: 807 LHHIDINNAFLHGYLEEEVYMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKL 986 +H +D+ NAFLHG L E VYM PP G + + VC L +SLYGLKQASR W FS+ + Sbjct: 460 IHQLDVQNAFLHGDLHELVYMEPPPGLRRQGENVVCRLNKSLYGLKQASRNWFSTFSEVI 519 Query: 987 LSFGFIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 G+ QS D LF K G F A+LIYVDD+L+TGN + ++ LK++L + F IKDLG Sbjct: 520 QKAGYQQSKADYSLFTKSQGTSFTAVLIYVDDILLTGNDLQEMKRLKEFLLKRFRIKDLG 579 >ref|XP_010274374.1| PREDICTED: uncharacterized protein LOC104609701 [Nelumbo nucifera] Length = 946 Score = 322 bits (826), Expect = 2e-98 Identities = 180/391 (46%), Positives = 235/391 (60%), Gaps = 49/391 (12%) Frame = +3 Query: 111 EIFLSRDVVFYETQFPFQAHPSPLKDVPLPVPAHD--IGEETEGTDT---QDVE------ 257 + + SRDVVF+E FPF + LP+P HD + + + T Q+V+ Sbjct: 394 QFYTSRDVVFHENVFPFLNVNNDSSKASLPIPFHDPILFDSNHLSTTPLKQNVDSAIDHI 453 Query: 258 PTPPP---------------------------------PAQQLGRGHRIKHKPAWLNDYV 338 PT P P L R K 
KPAW+ND+V Sbjct: 454 PTISPSTTIESIHIEIISDPIQPLNPIQSSFDTHHSDIPILTLRHSTRQKFKPAWMNDFV 513 Query: 339 TNSIHTAHL-----IETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTT 503 +N I +L S + Y FPY S I + Y L+ ++ S EP++ Sbjct: 514 SNVIVLTNLPTVITTSNTTTSSGSSAYTPPTFPYHKSPIFTNTYIYLLSNVS-SVPEPSS 572 Query: 504 YGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKA 683 Y +A +++ WIEA+N+EL A E N TW L LPP KKAIGSKWVYK+K DG++D YKA Sbjct: 573 YYQARKNEKWIEAINKELQAFESNNTWELVPLPPKKKAIGSKWVYKVKYLLDGTIDSYKA 632 Query: 684 RLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEV 863 RLVAKGY Q+EG+D+ +SFSPVAK+V+VR+ +A+A KNW LH +DINNAFLHGYL+EEV Sbjct: 633 RLVAKGYHQIEGVDYNDSFSPVAKVVTVRIFLAIAIAKNWALHQLDINNAFLHGYLDEEV 692 Query: 864 YMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRT 1043 ++ PP+GY KAK EV LLKRSLYGLKQASRQWN++F KL ++GF QS HD+CLF K T Sbjct: 693 FIQPPQGYTKAKPHEVSLLKRSLYGLKQASRQWNVKFCVKLQAYGFTQSAHDHCLFTKST 752 Query: 1044 GDDFLALLIYVDDVLITGNSVTLIQSLKDYL 1136 FLALL+Y+DDVL+TG + IQ L ++ Sbjct: 753 SSSFLALLLYIDDVLVTGTHESEIQKLSQFV 783 >gb|KYP37906.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1097 Score = 325 bits (833), Expect = 2e-98 Identities = 182/398 (45%), Positives = 243/398 (61%), Gaps = 14/398 (3%) Frame = +3 Query: 15 HKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPF---------QA 167 ++ KFD R CVFLGF P VKG L+++N+ E FLSR V ++E FPF Q Sbjct: 482 NRKKFDPRGRRCVFLGFKPQVKGSILYDLNSRETFLSRHVEYFEHIFPFLPTSPLDLTQT 541 Query: 168 HPSPLKDVPLPVPAHDIGEETEGTDTQD-VEPTPPPPAQQLGRGHRIKHKPAWLNDY--V 338 P PLP+ T T T V PPPP Q + R + P++L+DY Sbjct: 542 ISLPRHQPPLPIDTDPTPLSTNTTPTSSPVSVVPPPPFVQ--KSTRPRKLPSYLHDYHHT 599 Query: 339 TNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTYGEAV 518 + H + I P S+ +YS+ LSP+ AF I+ S EP +Y EA+ Sbjct: 600 LLTTHNSPTISQPLYSIHNH--------ISYSN-LSPSQKAFSLSIS-SIKEPNSYVEAI 649 Query: 519 QDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARLVAK 698 QD++W A+ EL ALE+N TWILT LPP K+ +G KWV+K+K NSDG+++R+KARLVAK Sbjct: 650 
QDESWKTAIQTELTALEKNNTWILTPLPPNKQVVGCKWVFKLKFNSDGTIERHKARLVAK 709 Query: 699 GYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYMIPP 878 GY Q EG+D++++FSPV K+ +VR ++A+AT KNW +H +D+N FLHG L EEVYM PP Sbjct: 710 GYTQTEGLDYLDTFSPVVKMTTVRTLLAVATAKNWHIHQLDVNTTFLHGDLHEEVYMTPP 769 Query: 879 EGYFKA--KKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGDD 1052 G + + VC L +SLYGLKQASRQWN + + L+ GF QS D LF K+ G Sbjct: 770 PGLTVSPHQSNCVCKLVKSLYGLKQASRQWNAKLTSVLIDSGFKQSMADYSLFTKQFGAK 829 Query: 1053 FLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F A+L+YVDD+++ GN T I +K L Q FTIKDLG Sbjct: 830 FTAILVYVDDLVLAGNDPTEINYIKSLLDQKFTIKDLG 867 >gb|PNX84630.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Trifolium pratense] Length = 543 Score = 311 bits (798), Expect = 3e-98 Identities = 175/422 (41%), Positives = 246/422 (58%), Gaps = 38/422 (9%) Frame = +3 Query: 15 HKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPLKDVP 194 H+ KFD+RA + V+LG GVKG L +++T IF+SR+V ++E P+Q H P + Sbjct: 51 HRTKFDSRARKAVYLGHQSGVKGAVLLDLHTKSIFISRNVTYHEHILPYQNHTPPFQWSY 110 Query: 195 LPVPAHDIGEETEGTDT----------------------QDVEPTP------------PP 272 + H ++ +DT D++ TP PP Sbjct: 111 HSI--HPTSDDISASDTITPSTPVFDDDFVVTTQSLAPNSDIQTTPHTSDSPPPSAPSPP 168 Query: 273 PAQQ----LGRGHRIKHKPAWLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSI 440 P+ + + R++ +P LNDYV N + A+ + + P C + Sbjct: 169 PSDNNDIVIRKSTRMRSQPGHLNDYVCN-LSDAYSKSSSQGMLYPISNFHSC------AN 221 Query: 441 LSPAYSAFLTQITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAI 620 LS +++ F+ + + EP TY +A W++AMN EL AL+QN+TWIL PP K I Sbjct: 222 LSTSHTKFVLSVN-NDVEPNTYHQASLQDCWVQAMNAELHALQQNKTWILVDAPPNVKPI 280 Query: 621 GSKWVYKIKRNSDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKN 800 GSKWVYK+K +DGS++RYKARLVAKGY QVEGIDF E+FSPVAK+ +VR +IALA K+ Sbjct: 281 GSKWVYKVKHKADGSIERYKARLVAKGYTQVEGIDFFETFSPVAKITTVRTLIALAAIKS 340 Query: 801 WELHHIDINNAFLHGYLEEEVYMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQ 980 W LH +D+NNAFLHG L+EEVYM P+G +K +VC L +SLYGLKQASR+W + + Sbjct: 341 
WHLHQLDVNNAFLHGELQEEVYMSIPQGVTTSKPNQVCKLLKSLYGLKQASRKWYEKLTS 400 Query: 981 KLLSFGFIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKD 1160 LL+ G++QS+ D+ LF F ALL+YVDD+++ G+S +K L LF IKD Sbjct: 401 VLLAQGYMQSSSDHSLFTLHKDSSFTALLVYVDDIILAGDSHDEFLHIKKLLDDLFRIKD 460 Query: 1161 LG 1166 LG Sbjct: 461 LG 462 >gb|KYP61022.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1316 Score = 327 bits (838), Expect = 6e-98 Identities = 179/399 (44%), Positives = 244/399 (61%), Gaps = 11/399 (2%) Frame = +3 Query: 3 TTGPHKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHP-SP 179 T H+ K D RA C+FLGF P KGY LFN++T + +SR+V+F+E FP P SP Sbjct: 641 TITSHRTKLDPRAHPCIFLGFKPHTKGYLLFNLHTHGLLVSRNVLFHEDHFPSFTKPHSP 700 Query: 180 LKDVPLPV----------PAHDIGEETEGTDTQDVEPTPPPPAQQLGRGHRIKHKPAWLN 329 P+P+ P+ I E ++ T D +PPP L R R + P +L Sbjct: 701 SFSSPVPIHYNYVDYPTFPSSSIVESSD-PPTSDQHSSPPP----LRRSTRPRRPPTYLQ 755 Query: 330 DYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTYG 509 D+ H A + A S T R + +LSP++ ++ I+ S +EP + Sbjct: 756 DF-----HGAFTSTSTAHS--STGIRHPLHSFLSYDLLSPSFHHYVFSIS-SVTEPKNFA 807 Query: 510 EAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARL 689 EA + +W++AM+ E+ ALE N TW+LT+LPP K AIG +WVYK+K +DGS+DRYKARL Sbjct: 808 EASKSDSWLKAMHEEIFALEANNTWVLTTLPPHKTAIGCRWVYKVKHKADGSIDRYKARL 867 Query: 690 VAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYM 869 VAKGY Q+EG+DF ++FSPVAKL +VR++++LA NW L +D+NNAFLHG L EEVYM Sbjct: 868 VAKGYTQMEGLDFFDTFSPVAKLTTVRLLLSLAAINNWHLKQLDVNNAFLHGDLNEEVYM 927 Query: 870 IPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGD 1049 P G + G+VC L+RSLYGLKQASRQW S L+ G++ S D+ LF+K + Sbjct: 928 QLPPGLTPSFPGQVCRLQRSLYGLKQASRQWYARLSSFLIQHGYVPSPSDHSLFLKCSPA 987 Query: 1050 DFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 A+LIYVDD+++ GN +T I L LH F IKDLG Sbjct: 988 TTTAILIYVDDIVLAGNDLTEIHHLTSLLHTTFQIKDLG 1026 >gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum] Length = 1404 
Score = 327 bits (838), Expect = 1e-97 Identities = 184/387 (47%), Positives = 238/387 (61%), Gaps = 6/387 (1%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQ-AHPSPLKDVPLP 200 KF RA CVF+G+ PG KGY+L N+ T EIF+SRDV+F+E FP+Q P L D+ Sbjct: 756 KFSPRAIRCVFIGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPYQNTSPMSLSDMTFE 815 Query: 201 VPAHDIGEETEGTDTQDVEPTPPPPAQQLGRGHRIKHKPAWLNDYVTNSIHTAHLIETPA 380 V + + + P+ P AQQ R R + P+ L DY SI TP Sbjct: 816 V-----------SPSSQITPSIPADAQQHSRTSRPHNTPSHLRDYHCYSI------STPC 858 Query: 381 ESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTYGEAVQDKAWIEAMNRELL 560 + P P S LS ++ AF+ I+ S EPTT+ +AV W +AM+ EL Sbjct: 859 STSTAHPIH----PLVNYSKLSSSHRAFVQNIS-SILEPTTFSQAVSLPEWRQAMDEELK 913 Query: 561 ALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARLVAKGYLQVEGIDFMESF 740 ALE N TW + SLP GK A+G +WVYK K +DGS+ RYKARLVAKGY Q EG+D++E+F Sbjct: 914 ALELNHTWSIVSLPQGKSAVGCRWVYKAKFAADGSLQRYKARLVAKGYTQQEGLDYLETF 973 Query: 741 SPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYMIPPEGYFKAKKGE---- 908 SPVAKLV+VR ++ALA + W L +D+NNAFLHG L EEVYM P G+ +GE Sbjct: 974 SPVAKLVTVRTLLALAAVRGWFLIQLDVNNAFLHGDLTEEVYMTLPPGF--CSEGELPSR 1031 Query: 909 -VCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGDDFLALLIYVDDV 1085 VC L +S+YGLKQASRQW +FS LLS GFIQS DN LF++ + FLAL++YVDD+ Sbjct: 1032 AVCKLHKSIYGLKQASRQWFAKFSSTLLSIGFIQSHADNSLFIRSDKNIFLALVVYVDDI 1091 Query: 1086 LITGNSVTLIQSLKDYLHQLFTIKDLG 1166 +I N LKD+L+ F +KDLG Sbjct: 1092 VIATNDQNAASELKDFLNSKFKLKDLG 1118 >ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804180 isoform X6 [Tarenaya hassleriana] Length = 789 Score = 317 bits (811), Expect = 2e-97 Identities = 182/416 (43%), Positives = 248/416 (59%), Gaps = 35/416 (8%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS--PLKDVPL 197 KF+ RA VFLG+ GVKGY++ ++++ + +SR+VVF+ET FPF++ P P D P Sbjct: 330 KFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNVVFHETTFPFKSFPQSQPALD-PF 388 Query: 198 PVPAHDIGEETEG----TDTQDVEPT----PPPPAQQLG------------------RGH 299 P E+ + + + P P P LG R Sbjct: 389 
PQSVSPFFYESISPQNLSSSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQ 448 Query: 300 RIKHKPAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLD-CFPYTYSSILSPAYSAFLTQ 473 R PA+L+DY H + P TPY L C Y +LSP+Y F Sbjct: 449 RQSKTPAYLSDY-----HCYLISHNSTPHPNPVTPYPLSACLTY---DLLSPSYRTFALN 500 Query: 474 ITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRN 653 IT + EP +Y +A + ++W +AM EL AL + TW + +LP GK A+G KWV+K K N Sbjct: 501 ITTAP-EPQSYTQAAKFESWRQAMKLELEALIRTNTWSICTLPDGKNAVGCKWVFKTKYN 559 Query: 654 SDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNA 833 +DGS++R+KARLVAKGY Q+EG+DF E+FSPVAK+ +VR+++ALA + NW ++ +D++NA Sbjct: 560 ADGSIERHKARLVAKGYTQLEGVDFSETFSPVAKMTTVRVLLALAAKYNWLINQMDVSNA 619 Query: 834 FLHGYLEEEVYMIPPEGYF-----KAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFG 998 FL+G L+EE+YM P GY K VC L+RSLYGLKQASRQWN + SQ LL+ G Sbjct: 620 FLNGDLDEEIYMKLPPGYSDLQGEKVSSSSVCKLQRSLYGLKQASRQWNQKLSQVLLAAG 679 Query: 999 FIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F+Q DN LF+KR G +A+L+YVDD+LITGN +I LKD LH F IKDLG Sbjct: 680 FLQVQSDNSLFLKRNGSQLVAVLVYVDDLLITGNDAAMISDLKDTLHSSFEIKDLG 735 >dbj|GAU49938.1| hypothetical protein TSUD_290950 [Trifolium subterraneum] Length = 1137 Score = 323 bits (828), Expect = 2e-97 Identities = 184/407 (45%), Positives = 243/407 (59%), Gaps = 23/407 (5%) Frame = +3 Query: 15 HKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPL---- 182 H+ K R +CVFLG+ GVKG L ++NT EIF+SR+V Y+ FP+Q S + Sbjct: 531 HRTKLSPRGRKCVFLGYKQGVKGTVLLDLNTKEIFISRNVTHYDHFFPYQTTNSKIHWHY 590 Query: 183 -KDVPLPVPAHDIGEETEGTDT--QDVEPTP-----------PPPAQQLGRGHRIKHKPA 320 + P+ DI TDT D TP PP+Q R HR KHKP+ Sbjct: 591 HSNFECEHPS-DITSNPPSTDTFCDDTTKTPFIDVTNDIISNVPPSQPSDRPHREKHKPS 649 Query: 321 WLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSI-----LSPAYSAFLTQITES 485 +L D+V NS S P PY SS LSP+++A++ +T+ Sbjct: 650 YLEDFVCNS------------STSLEPSSSSGIPYPISSFHSLAHLSPSHTAYIVSLTQ- 696 Query: 486 QSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGS 665 +EP TY EA + WI+AMN +L AL + TW + LP + IG +WVYKIK NSDGS Sbjct: 697 
HTEPKTYLEACKSDHWIQAMNTKLEALSRTGTWKIVDLPSNVRPIGCRWVYKIKHNSDGS 756 Query: 666 VDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHG 845 V+RYKARLVAKGY Q+EGIDF ++FSPVAKL +VR+++ALA+ K W LH +D+NNAFLHG Sbjct: 757 VERYKARLVAKGYTQIEGIDFFDTFSPVAKLTTVRLLLALASIKGWFLHQLDVNNAFLHG 816 Query: 846 YLEEEVYMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNC 1025 L+E+VYM+ PEG K + C L +SLYGLKQASR+W + + L+ G+ QST D Sbjct: 817 DLQEDVYMVVPEGVPCVKPNQACKLLKSLYGLKQASRKWYEKLTSLLVREGYTQSTADYS 876 Query: 1026 LFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 LF F ALLIYVDD++++G S+T I+ +K L IKDLG Sbjct: 877 LFTLNREGQFTALLIYVDDIILSGTSMTEIERIKTILDCNIKIKDLG 923 >gb|KYP34298.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1002 Score = 320 bits (821), Expect = 3e-97 Identities = 180/398 (45%), Positives = 242/398 (60%), Gaps = 14/398 (3%) Frame = +3 Query: 15 HKGKFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPF---------QA 167 ++ KFD R CVFLGF P VKG L+++N+ E FLSR V ++E FPF Q Sbjct: 326 NRKKFDPRGRRCVFLGFKPQVKGSILYDLNSRETFLSRHVEYFEHIFPFLPTSPLDLTQT 385 Query: 168 HPSPLKDVPLPVPAHDIGEETEGTDTQD-VEPTPPPPAQQLGRGHRIKHKPAWLNDY--V 338 P PLP+ T T T V PPPP + + R + P++L+DY Sbjct: 386 ISLPRHQPPLPIDTDPTPLSTNTTPTSSPVSVVPPPPFVR--KSTRPRKLPSYLHDYHHT 443 Query: 339 TNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTYGEAV 518 + H + I P S+ +YS+ LSP+ AF I+ S EP +Y EA+ Sbjct: 444 LLTTHNSPTISQPLYSIHNH--------ISYSN-LSPSQKAFSLSIS-SIKEPNSYVEAI 493 Query: 519 QDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARLVAK 698 QD++W A+ EL ALE+N TWILT LPP K+ +G KWV+K+K NSDG+++R+KARLVAK Sbjct: 494 QDESWKTAIQTELTALEKNNTWILTPLPPNKQVVGCKWVFKLKFNSDGTIERHKARLVAK 553 Query: 699 GYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYMIPP 878 GY Q E +D++++FSPV K+ +VR ++A+AT KNW +H +D+N FLHG L EEVYM PP Sbjct: 554 GYTQTETLDYLDTFSPVVKMTTVRTLLAVATAKNWHIHQLDVNTTFLHGDLHEEVYMTPP 613 Query: 879 EGYFKA--KKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGDD 1052 G + + VC L +SLYGLKQASRQWN + + L+ 
GF QS D LF K+ G Sbjct: 614 PGLTVSPHQSNCVCKLVKSLYGLKQASRQWNAKLTSVLIDSGFKQSMADYSLFTKQFGAK 673 Query: 1053 FLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F A+L+YVDD+++ GN T I +K L Q FTIKDLG Sbjct: 674 FTAILVYVDDLVLAGNDPTEINYIKSLLDQKFTIKDLG 711 >emb|CAN74229.1| hypothetical protein VITISV_000584 [Vitis vinifera] Length = 1039 Score = 320 bits (819), Expect = 1e-96 Identities = 189/410 (46%), Positives = 245/410 (59%), Gaps = 29/410 (7%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPLKD----V 191 KFD RA C+F+G+ G KGYR++++ T + F S DVVF+E FPF +P + + Sbjct: 367 KFDQRARRCIFVGYPLGQKGYRVYDLETNKFFSSXDVVFHEHIFPFHTNPQEEQHDVVVL 426 Query: 192 PLPVPAHD-IGEETEGTDTQDVEP---------------------TPPPPAQQLGRGHRI 305 PLP +++ I ET D P +PPPPA + R RI Sbjct: 427 PLPQTSYEPITTETTKPQADDQPPPLLSSLESTSNERTLDLDTIVSPPPPATR--RSDRI 484 Query: 306 KHKPAWLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITES 485 K L ++ + HTA + + + S+ T + L Y + LSP Y F+ IT + Sbjct: 485 KQPNVXLRNF--HLYHTAKVXSSQSSSLSGTRHPLT--RYISYAQLSPKYRNFVCAIT-T 539 Query: 486 QSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGS 665 EPTTY +AV D W EAM EL ALEQN TW LT LP G + IG KWVYKIK NSDG+ Sbjct: 540 LVEPTTYEQAVLDPKWQEAMAAELHALEQNHTWTLTPLPSGHRPIGCKWVYKIKYNSDGT 599 Query: 666 VDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHG 845 V+RYKARLVAKG+ Q EGID+ E+FSPVAKL +VR ++A+A ++W LH +D+ NAFLHG Sbjct: 600 VERYKARLVAKGFTQREGIDYKETFSPVAKLTTVRCLLAIAAVRHWSLHQMDVQNAFLHG 659 Query: 846 YLEEEVYMIPPEGYFKAKKGE---VCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTH 1016 L EEVYM P G+ ++GE VC +SLYGLKQASR W +FS + GF QS Sbjct: 660 DLLEEVYMQLPPGF--XRQGETPMVCRXNKSLYGLKQASRSWFXKFSATIQQDGFXQSRA 717 Query: 1017 DNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 D LF K +G+ F +LIYVDD++I GN +I LK+ LH F IKDLG Sbjct: 718 DYSLFTKISGNSFTXVLIYVDDMIIXGNDENVIAXLKESLHTKFRIKDLG 767 >ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804180 isoform X5 [Tarenaya hassleriana] Length = 886 Score = 317 bits (811), Expect = 1e-96 Identities = 
182/416 (43%), Positives = 248/416 (59%), Gaps = 35/416 (8%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS--PLKDVPL 197 KF+ RA VFLG+ GVKGY++ ++++ + +SR+VVF+ET FPF++ P P D P Sbjct: 427 KFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNVVFHETTFPFKSFPQSQPALD-PF 485 Query: 198 PVPAHDIGEETEG----TDTQDVEPT----PPPPAQQLG------------------RGH 299 P E+ + + + P P P LG R Sbjct: 486 PQSVSPFFYESISPQNLSSSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQ 545 Query: 300 RIKHKPAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLD-CFPYTYSSILSPAYSAFLTQ 473 R PA+L+DY H + P TPY L C Y +LSP+Y F Sbjct: 546 RQSKTPAYLSDY-----HCYLISHNSTPHPNPVTPYPLSACLTY---DLLSPSYRTFALN 597 Query: 474 ITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRN 653 IT + EP +Y +A + ++W +AM EL AL + TW + +LP GK A+G KWV+K K N Sbjct: 598 ITTAP-EPQSYTQAAKFESWRQAMKLELEALIRTNTWSICTLPDGKNAVGCKWVFKTKYN 656 Query: 654 SDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNA 833 +DGS++R+KARLVAKGY Q+EG+DF E+FSPVAK+ +VR+++ALA + NW ++ +D++NA Sbjct: 657 ADGSIERHKARLVAKGYTQLEGVDFSETFSPVAKMTTVRVLLALAAKYNWLINQMDVSNA 716 Query: 834 FLHGYLEEEVYMIPPEGYF-----KAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFG 998 FL+G L+EE+YM P GY K VC L+RSLYGLKQASRQWN + SQ LL+ G Sbjct: 717 FLNGDLDEEIYMKLPPGYSDLQGEKVSSSSVCKLQRSLYGLKQASRQWNQKLSQVLLAAG 776 Query: 999 FIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F+Q DN LF+KR G +A+L+YVDD+LITGN +I LKD LH F IKDLG Sbjct: 777 FLQVQSDNSLFLKRNGSQLVAVLVYVDDLLITGNDAAMISDLKDTLHSSFEIKDLG 832 >ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804180 isoform X4 [Tarenaya hassleriana] Length = 940 Score = 317 bits (811), Expect = 3e-96 Identities = 182/416 (43%), Positives = 248/416 (59%), Gaps = 35/416 (8%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS--PLKDVPL 197 KF+ RA VFLG+ GVKGY++ ++++ + +SR+VVF+ET FPF++ P P D P Sbjct: 481 KFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNVVFHETTFPFKSFPQSQPALD-PF 539 Query: 198 PVPAHDIGEETEG----TDTQDVEPT----PPPPAQQLG------------------RGH 299 P E+ + + + P P P LG R 
Sbjct: 540 PQSVSPFFYESISPQNLSSSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQ 599 Query: 300 RIKHKPAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLD-CFPYTYSSILSPAYSAFLTQ 473 R PA+L+DY H + P TPY L C Y +LSP+Y F Sbjct: 600 RQSKTPAYLSDY-----HCYLISHNSTPHPNPVTPYPLSACLTY---DLLSPSYRTFALN 651 Query: 474 ITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRN 653 IT + EP +Y +A + ++W +AM EL AL + TW + +LP GK A+G KWV+K K N Sbjct: 652 ITTAP-EPQSYTQAAKFESWRQAMKLELEALIRTNTWSICTLPDGKNAVGCKWVFKTKYN 710 Query: 654 SDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNA 833 +DGS++R+KARLVAKGY Q+EG+DF E+FSPVAK+ +VR+++ALA + NW ++ +D++NA Sbjct: 711 ADGSIERHKARLVAKGYTQLEGVDFSETFSPVAKMTTVRVLLALAAKYNWLINQMDVSNA 770 Query: 834 FLHGYLEEEVYMIPPEGYF-----KAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFG 998 FL+G L+EE+YM P GY K VC L+RSLYGLKQASRQWN + SQ LL+ G Sbjct: 771 FLNGDLDEEIYMKLPPGYSDLQGEKVSSSSVCKLQRSLYGLKQASRQWNQKLSQVLLAAG 830 Query: 999 FIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F+Q DN LF+KR G +A+L+YVDD+LITGN +I LKD LH F IKDLG Sbjct: 831 FLQVQSDNSLFLKRNGSQLVAVLVYVDDLLITGNDAAMISDLKDTLHSSFEIKDLG 886 >gb|OMP02866.1| Reverse transcriptase, RNA-dependent DNA polymerase [Corchorus capsularis] Length = 666 Score = 309 bits (792), Expect = 6e-96 Identities = 168/380 (44%), Positives = 232/380 (61%), Gaps = 17/380 (4%) Frame = +3 Query: 78 KGYRLFNMNTGEIFLSRDVVFYETQFPFQAH---PSPLKDVPLPVPAHDIGEETEGTDTQ 248 KGYRL++++ E +SRDVVF E FPFQ P+P + +PLP+P + T + Sbjct: 4 KGYRLYDLSNQEYLVSRDVVFQENIFPFQQSRTPPTPSQVLPLPIPDNHSFNSLPSTPIE 63 Query: 249 DVEPTP--------------PPPAQQLGRGHRIKHKPAWLNDYVTNSIHTAHLIETPAES 386 TP Q L R R + P +L Y + + + S Sbjct: 64 SPNETPIISNDSSLNEISLPSNEDQPLARSQRNRRPPPYLQYYECSKVRRQPSQSSSTTS 123 Query: 387 VKPTPYRLDCFPYTYSSILSPAYSAFLTQITESQSEPTTYGEAVQDKAWIEAMNRELLAL 566 T Y + F T+ LS YS F++ IT S +EP +Y EA++D W A++ EL AL Sbjct: 124 GSGTRYPISNFLSTHR--LSSTYSTFVSNIT-SIAEPQSYSEAIKDPNWKAAIDAELHAL 180 Query: 567 EQNETWILTSLPPGKKAIGSKWVYKIKRNSDGSVDRYKARLVAKGYLQVEGIDFMESFSP 746 E N+TW + LPP K +G KWV+K+K S 
GS++RYKARLVAKGY Q EGIDF E+F+P Sbjct: 181 EANKTWSIVDLPPHKSPVGCKWVFKVKYKSYGSIERYKARLVAKGYTQQEGIDFHETFAP 240 Query: 747 VAKLVSVRMIIALATQKNWELHHIDINNAFLHGYLEEEVYMIPPEGYFKAKKGEVCLLKR 926 VAK+ +VR ++A+A+ KNW L+ +D+ NA LHG L+EEVYM P G + VC L + Sbjct: 241 VAKMTTVRCLLAIASTKNWPLYQLDVQNALLHGDLDEEVYMSLPPGVTSKGENSVCKLHK 300 Query: 927 SLYGLKQASRQWNLEFSQKLLSFGFIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSV 1106 SLYGL+QAS QW +FS LL++GF+QS D LF+K + DF+A+L+YVDD++ITGN+ Sbjct: 301 SLYGLRQASLQWFAKFSTALLTYGFVQSRSDYSLFIKSSKTDFVAILVYVDDIVITGNNS 360 Query: 1107 TLIQSLKDYLHQLFTIKDLG 1166 LI S+K+ L + F+IKDLG Sbjct: 361 KLIDSVKNALQRQFSIKDLG 380 >emb|CAN81016.1| hypothetical protein VITISV_025518 [Vitis vinifera] Length = 1461 Score = 321 bits (823), Expect = 2e-95 Identities = 191/410 (46%), Positives = 248/410 (60%), Gaps = 29/410 (7%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPSPLKD----V 191 KFD RA C+F+G+ G KGYR++++ T + F S DVVF+E FPF +P + + Sbjct: 775 KFDQRARRCIFVGYPLGQKGYRVYDLXTNKFFSSXDVVFHEHIFPFHTNPQEEQHDVVVL 834 Query: 192 PLPVPAHD-IGEETEGTDTQDVEP---------------------TPPPPAQQLGRGHRI 305 PLP +++ I ET D P +PPPP + R RI Sbjct: 835 PLPQTSYEPITTETTKPQADDQPPPLLSSLESTSNERTLXLDTIVSPPPPTTR--RSDRI 892 Query: 306 KHKPAWLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAFLTQITES 485 K L ++ + HTA + + + S+ T + L Y + LSP Y F+ IT + Sbjct: 893 KQPNVHLRNF--HLYHTAKVASSQSSSLSGTRHPLT--RYISYAQLSPKYRNFVCAIT-T 947 Query: 486 QSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRNSDGS 665 EPTTY +AV D W EAM EL ALEQN TW LT LP G + IG KWVYKIK NSDG+ Sbjct: 948 LVEPTTYEQAVLDPKWQEAMAAELHALEQNHTWTLTPLPYGHRPIGCKWVYKIKYNSDGT 1007 Query: 666 VDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNAFLHG 845 V+RYKARLVAKG+ Q EGID+ E+FSPVAKL +VR ++A+A ++W LH +D+ NAFLHG Sbjct: 1008 VERYKARLVAKGFTQREGIDYKETFSPVAKLTTVRCLLAIAAVRHWSLHQMDVQNAFLHG 1067 Query: 846 YLEEEVYMIPPEGYFKAKKGE---VCLLKRSLYGLKQASRQWNLEFSQKLLSFGFIQSTH 1016 L EEVYM P G+ ++GE VC L +SLYGLKQASR W +FS + GF QS Sbjct: 1068 
DLLEEVYMQLPLGF--RQQGETPMVCRLNKSLYGLKQASRSWFRKFSATIQQDGFHQSRA 1125 Query: 1017 DNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 D LF K +G+ F A+LIYVDD++ITGN +I +LK+ LH F IKDLG Sbjct: 1126 DYSLFTKISGNSFTAVLIYVDDMIITGNDENVIAALKESLHTKFRIKDLG 1175 >gb|KYP77128.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 732 Score = 307 bits (787), Expect = 2e-94 Identities = 174/414 (42%), Positives = 244/414 (58%), Gaps = 33/414 (7%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS-PLKDVPLP 200 KFD R C+F+G+ G KGY+++++ EI +SRDV+F E +FPF+A + LK V P Sbjct: 136 KFDERGRACIFMGYPRGQKGYKVYDIEKKEIQISRDVIFCEHEFPFKAEKTIMLKQVTPP 195 Query: 201 VPAHDIGEETEGTDT--------QDVEPTPPPPAQQLG---------------------- 290 V EETE T QDVE T A + Sbjct: 196 VL-----EETEDIATREAKCLVEQDVEETRGESAHEENHDVESLQERNTASGSVENEHNK 250 Query: 291 --RGHRIKHKPAWLNDYVTNSIHTAHLIETPAESVKPTPYRLDCFPYTYSSILSPAYSAF 464 R R++H P L +Y + + ++ S Y L + +Y + S +Y A Sbjct: 251 ESRSRRMRHPPRHLEEYEVDLPPSITRFQSDPPSGNSVVYPLSSY-LSYDNF-SHSYKAL 308 Query: 465 LTQITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKI 644 L I+ SEP + +AV+ + W EA +E+ ALE+NETW L +LPPGK A+ SKWV+KI Sbjct: 309 LATIS-LHSEPKNFSQAVKHECWREATKKEIEALEKNETWTLEALPPGKNAVDSKWVFKI 367 Query: 645 KRNSDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDI 824 K +G ++RYKARLVA+G+ QVEG+DF E+F+PVAKLV++R ++ +AT WE+H +D+ Sbjct: 368 KYKPNGEIERYKARLVARGFTQVEGVDFHETFAPVAKLVTLRCLLTIATTNGWEVHQLDV 427 Query: 825 NNAFLHGYLEEEVYMIPPEGYFKAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFGFI 1004 NNAFLHG LEEEVYM P+G+ K + VC L++SLYG +QASR W +F+ L GF Sbjct: 428 NNAFLHGELEEEVYMCIPQGFAKEGETRVCKLRKSLYGRRQASRNWYQKFTNALNKVGFR 487 Query: 1005 QSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 QS D+ LF+ + FL LIYVDDV++ GN +Q +K+YL + F+IKDLG Sbjct: 488 QSRADHSLFIYKWEGVFLVALIYVDDVILVGNEQETMQQMKNYLDREFSIKDLG 541 >ref|XP_010526681.1| PREDICTED: uncharacterized protein LOC104804180 isoform X3 [Tarenaya hassleriana] Length = 1238 Score = 317 bits (811), 
Expect = 2e-94 Identities = 182/416 (43%), Positives = 248/416 (59%), Gaps = 35/416 (8%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS--PLKDVPL 197 KF+ RA VFLG+ GVKGY++ ++++ + +SR+VVF+ET FPF++ P P D P Sbjct: 816 KFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNVVFHETTFPFKSFPQSQPALD-PF 874 Query: 198 PVPAHDIGEETEG----TDTQDVEPT----PPPPAQQLG------------------RGH 299 P E+ + + + P P P LG R Sbjct: 875 PQSVSPFFYESISPQNLSSSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQ 934 Query: 300 RIKHKPAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLD-CFPYTYSSILSPAYSAFLTQ 473 R PA+L+DY H + P TPY L C Y +LSP+Y F Sbjct: 935 RQSKTPAYLSDY-----HCYLISHNSTPHPNPVTPYPLSACLTY---DLLSPSYRTFALN 986 Query: 474 ITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRN 653 IT + EP +Y +A + ++W +AM EL AL + TW + +LP GK A+G KWV+K K N Sbjct: 987 ITTAP-EPQSYTQAAKFESWRQAMKLELEALIRTNTWSICTLPDGKNAVGCKWVFKTKYN 1045 Query: 654 SDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNA 833 +DGS++R+KARLVAKGY Q+EG+DF E+FSPVAK+ +VR+++ALA + NW ++ +D++NA Sbjct: 1046 ADGSIERHKARLVAKGYTQLEGVDFSETFSPVAKMTTVRVLLALAAKYNWLINQMDVSNA 1105 Query: 834 FLHGYLEEEVYMIPPEGYF-----KAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFG 998 FL+G L+EE+YM P GY K VC L+RSLYGLKQASRQWN + SQ LL+ G Sbjct: 1106 FLNGDLDEEIYMKLPPGYSDLQGEKVSSSSVCKLQRSLYGLKQASRQWNQKLSQVLLAAG 1165 Query: 999 FIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F+Q DN LF+KR G +A+L+YVDD+LITGN +I LKD LH F IKDLG Sbjct: 1166 FLQVQSDNSLFLKRNGSQLVAVLVYVDDLLITGNDAAMISDLKDTLHSSFEIKDLG 1221 >ref|XP_010526680.1| PREDICTED: uncharacterized protein LOC104804180 isoform X2 [Tarenaya hassleriana] Length = 1244 Score = 317 bits (811), Expect = 2e-94 Identities = 182/416 (43%), Positives = 248/416 (59%), Gaps = 35/416 (8%) Frame = +3 Query: 24 KFDARASECVFLGFVPGVKGYRLFNMNTGEIFLSRDVVFYETQFPFQAHPS--PLKDVPL 197 KF+ RA VFLG+ GVKGY++ ++++ + +SR+VVF+ET FPF++ P P D P Sbjct: 785 KFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNVVFHETTFPFKSFPQSQPALD-PF 843 Query: 198 
PVPAHDIGEETEG----TDTQDVEPT----PPPPAQQLG------------------RGH 299 P E+ + + + P P P LG R Sbjct: 844 PQSVSPFFYESISPQNLSSSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQ 903 Query: 300 RIKHKPAWLNDYVTNSIHTAHLIETPAESVKP-TPYRLD-CFPYTYSSILSPAYSAFLTQ 473 R PA+L+DY H + P TPY L C Y +LSP+Y F Sbjct: 904 RQSKTPAYLSDY-----HCYLISHNSTPHPNPVTPYPLSACLTY---DLLSPSYRTFALN 955 Query: 474 ITESQSEPTTYGEAVQDKAWIEAMNRELLALEQNETWILTSLPPGKKAIGSKWVYKIKRN 653 IT + EP +Y +A + ++W +AM EL AL + TW + +LP GK A+G KWV+K K N Sbjct: 956 ITTAP-EPQSYTQAAKFESWRQAMKLELEALIRTNTWSICTLPDGKNAVGCKWVFKTKYN 1014 Query: 654 SDGSVDRYKARLVAKGYLQVEGIDFMESFSPVAKLVSVRMIIALATQKNWELHHIDINNA 833 +DGS++R+KARLVAKGY Q+EG+DF E+FSPVAK+ +VR+++ALA + NW ++ +D++NA Sbjct: 1015 ADGSIERHKARLVAKGYTQLEGVDFSETFSPVAKMTTVRVLLALAAKYNWLINQMDVSNA 1074 Query: 834 FLHGYLEEEVYMIPPEGYF-----KAKKGEVCLLKRSLYGLKQASRQWNLEFSQKLLSFG 998 FL+G L+EE+YM P GY K VC L+RSLYGLKQASRQWN + SQ LL+ G Sbjct: 1075 FLNGDLDEEIYMKLPPGYSDLQGEKVSSSSVCKLQRSLYGLKQASRQWNQKLSQVLLAAG 1134 Query: 999 FIQSTHDNCLFVKRTGDDFLALLIYVDDVLITGNSVTLIQSLKDYLHQLFTIKDLG 1166 F+Q DN LF+KR G +A+L+YVDD+LITGN +I LKD LH F IKDLG Sbjct: 1135 FLQVQSDNSLFLKRNGSQLVAVLVYVDDLLITGNDAAMISDLKDTLHSSFEIKDLG 1190