BLASTX nr result
ID: Chrysanthemum21_contig00026258
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00026258
         (2208 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_019183756.1| PREDICTED: uncharacterized protein LOC109178...   203   8e-52
ref|XP_023759396.1| uncharacterized protein LOC111907817 [Lactuc...   186   1e-45
ref|XP_017249749.1| PREDICTED: uncharacterized protein LOC108220...   171   6e-44
ref|XP_012572663.1| PREDICTED: uncharacterized protein LOC105852...   169   4e-40
ref|XP_012575677.1| PREDICTED: uncharacterized protein LOC105853...   169   5e-40
ref|XP_023757987.1| uncharacterized protein LOC111906461 [Lactuc...   165   2e-38
ref|XP_023767590.1| uncharacterized protein LOC111916195 [Lactuc...   164   4e-38
dbj|GBC54550.1| gag-pol fusion protein [Rhizophagus irregularis ...   157   7e-38
ref|XP_023731306.1| uncharacterized protein LOC111879058 [Lactuc...   147   2e-36
gb|KYP35691.1| Retrovirus-related Pol polyprotein from transposo...   159   2e-36
gb|KYP37030.1| Retrovirus-related Pol polyprotein from transposo...   159   3e-36
ref|XP_023745845.1| uncharacterized protein LOC111894012 [Lactuc...   154   3e-36
gb|KYP60954.1| Copia protein, partial [Cajanus cajan]                 152   4e-36
gb|KYP40629.1| Copia protein, partial [Cajanus cajan]                 148   1e-35
ref|XP_017228914.1| PREDICTED: uncharacterized protein LOC108204...   153   6e-35
gb|KZV36891.1| retrovirus-related Pol polyprotein from transposo...   153   7e-35
gb|KYP40635.1| Copia protein [Cajanus cajan]                          149   2e-34
ref|XP_019194891.1| PREDICTED: uncharacterized protein LOC109188...
149 2e-34 gb|KYP64004.1| Retrovirus-related Pol polyprotein from transposo... 152 2e-34 gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-ty... 152 3e-34 >ref|XP_019183756.1| PREDICTED: uncharacterized protein LOC109178675 [Ipomoea nil] Length = 760 Score = 203 bits (516), Expect = 8e-52 Identities = 187/689 (27%), Positives = 290/689 (42%), Gaps = 69/689 (10%) Frame = +1 Query: 271 SDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGSTTVPTPRRKRDDEFT 450 S ++ WKIRM+ + S +H + +W+ I P M V K D++T Sbjct: 17 SKFDDWKIRMQALLCS-IHDE-MWDVITTRPIIVMMVNTQVAAQGAGAEQMIPKPKDQYT 74 Query: 451 DEENAKELIDMQAASILSQGLPRHVFNILNQTETGKEIWDNLELLMKGSGLTEQRKKEEL 630 EE + +D A +IL L +F + + + EIW L L KG + K Sbjct: 75 SEERTRANLDNVARNILYNSLDDSLFPRVRKCKNAMEIWKVLMELGKGDEQEKDNKLTVT 134 Query: 631 FDEYERFKAIGNETIHDYFVRFHKLANDLKITKIQIPTHQQNTKFLNNLPPYWAKYVTGV 810 ++E FK + NETI D +RF +L DL ++ ++N K L LP W V + Sbjct: 135 MKKFEDFKMLPNETIMDMELRFTRLMGDLTDLGKELSEKEKNLKILRGLPKSWEIKVIAM 194 Query: 811 KQNKDISTSTYVELYTYLKAYE-PHALKTLKKQEQTSSIMDPLAYLAHQNSTTVASPTSS 987 + N+D+ T++ ++++ LKAYE H K +++ E + +A +A Q +T S+ Sbjct: 195 RDNRDMKTTSTAKIFSDLKAYEFEHEPKDVEESETRN-----VALVASQQATNSNRSKSN 249 Query: 988 SFSTQVPQQQALTGTEAMLATMQQLVNLMSGFQKQFPPTNNQLRTSSNPRSHATVHEGQI 1167 S +Q AL V M F ++ N Q S+ H HE Sbjct: 250 SCEFLSDEQYAL------------FVRKMKRFMRK---DNFQ---DSHRSGHRKQHE--- 288 Query: 1168 ITETVQRKAPGNVSYAGTSGNKSYGQMTDRYGKKVICYNCRGEGHV-------------- 1305 +S S G+ D +++CYNCR GH Sbjct: 289 -----------------SSPQTSKGETPD---SQLLCYNCRKPGHFKANFPHPIVSKHQD 328 Query: 1306 ARQCKEPKRAKDTQYHKD--------------KLMLSEAKDRGVKLDAEAEAFLADVECT 1443 + K P + +Y ++ K M+ V+ ++ + + +D E T Sbjct: 329 SNATKSPSKETGERYKRNDKPESSSSRNERRKKAMVVNETSDNVEAESSSSSSSSDDEST 388 Query: 1444 EPLDESLALTTTTAFQVNHEDAYDSDVDE-GPHASAAFMANLSSTTDANGASS------- 1599 E L L + Q + + D DE H+S+ + A SS S Sbjct: 389 EEEKGLLCLFS----QESEDLCLMVDEDEVNSHSSSCYSAESSSVGSQTSNESVTEMMRR 444 Query: 1600 --------SKINE-----------------------VVQMVLWYLDSGCSRHMTGDRSKL 1686 SK+ E + + 
V+WY+DSGCSRHMTGD++ L Sbjct: 445 IKVIKNTYSKLKEENSRLMISYNALRQVRVENIKLAIDKQVIWYVDSGCSRHMTGDKTML 504 Query: 1687 INYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVEGLKHNLFSVGQFCDAGLEV 1866 N+ + V FG + G GD I V YV+GLK NL S QFCD G +V Sbjct: 505 SNFKEVDGPKVIFGGENSGKTRGKGDIIKQGITIQDVSYVDGLKFNLLSTSQFCDKGYKV 564 Query: 1867 AFRRHTCHIRN-KDMIDLLQGSRSTNLYSISLNNLMAASPVCLLSKASSTKSWLWHRRLN 2043 F ++ C I N KD L G R +Y IS N+ + + VC ++K+++ S W+ +L+ Sbjct: 565 EFSKNECKIVNTKDGKIALTGQRKKKMYVISWNS--SNANVCFVAKSNADLSQEWNNKLS 622 Query: 2044 HLNFGTLNELARKDLVRGLPKLKYDKEHL 2130 HLN +N+LA+++LVRG K HL Sbjct: 623 HLNLKIINKLAKRNLVRGCFIHNNGKNHL 651 >ref|XP_023759396.1| uncharacterized protein LOC111907817 [Lactuca sativa] Length = 904 Score = 186 bits (472), Expect = 1e-45 Identities = 118/318 (37%), Positives = 168/318 (52%), Gaps = 8/318 (2%) Frame = +1 Query: 1243 QMTDRYGKKVICYNCRGEGHVARQCKEPKRAKDTQYHKDKLMLSEAKDRGVKLDAEAEAF 1422 Q D + +IC+NC+G H AR+C RAK+ KD ++ D KL+ + Sbjct: 422 QKDDSDEEVIICHNCKGTNHYAREC----RAKNKTKIKDSAYYAQRADELKKLENQ---- 473 Query: 1423 LADVECTEPLDESLALTTTTAFQVNHEDAYDSDVDEGP-HASAAFMANLSSTTDANGASS 1599 ++ AL V + D + + P ++ F+A + + A Sbjct: 474 ----------EKQRALMAIHEPSVEYWPTSDDEAEHEPTQSNFCFVAGVEIPSRAPNVIE 523 Query: 1600 SKINE-VVQMVLWYLDSGCSRHMTGDRSKLINYVDNF--IGT---VRFGNDEFATIDGYG 1761 +N WY+DSGCS+HMTG+R NY+ +F I T V FGN+ A I GYG Sbjct: 524 QVLNAGEPNDDTWYIDSGCSKHMTGNR----NYLRDFKPIQTNQDVTFGNNMKAKIKGYG 579 Query: 1762 DYKLGDTIITRVYYVEGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDLLQGS-RST 1938 + G+ I +V +V+ LKHNL SV Q CD LEV F + I + D++ S R+ Sbjct: 580 NITNGNFTIKKVAFVDDLKHNLISVSQLCDNNLEVLFTKQRSLIMDAKTKDVIVDSDRAG 639 Query: 1939 NLYSISLNNLMAASPVCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYD 2118 N+Y + ++ + +CLLSKA + SWLWHRRL+HLNFG +N+L DLVRGLP LK D Sbjct: 640 NMYPLDMDLIYGKPDICLLSKAPADISWLWHRRLSHLNFGYINKLIGDDLVRGLPLLKLD 699 Query: 2119 KEHLCPSCQLGKSKKSSH 2172 E LC +C+ GK KS+H Sbjct: 700 NETLCAACEKGKLSKSTH 717 Score = 79.7 bits (195), Expect = 2e-11 Identities = 86/315 (27%), Positives = 139/315 (44%), 
Gaps = 11/315 (3%) Frame = +1 Query: 226 SSAAGTDNRPPMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGS 405 +++ G+ +R P+L +Y W RM + + + +W + EG P E Sbjct: 15 ANSLGSGSRAPILIPEEYNSWVGRMNLHLNAI--NEDVWKCV-EGTYVTP-------ENM 64 Query: 406 TTVPTPRRKRDDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTE--TGKEIWDNLE 579 T+ T ++ T E K+L ++QA L G+P + + ++ T +IW+NL+ Sbjct: 65 ATLAT------NQATQVEITKKL-ELQAKKELVSGIPHSILSQMDDIILLTANQIWENLK 117 Query: 580 LLMKGSGLTEQRKKEELFDEYERFKAIGNETIHDYFVRFHKLA---NDLKITKIQIPTHQ 750 G+ K+ + +E++ FK + +ETIHD RF+ + N+L I K Q H+ Sbjct: 118 NRFCGNKRIIGNKRTSVLNEFDNFKMLSSETIHDAHDRFNLIMVKMNNLGIKKTQ---HE 174 Query: 751 QNTKFLNNLPPYWAKYVTGVKQNKDISTSTYVELYTYLKAYEPHALKTLKKQEQTSSIMD 930 N KFLNNL W ++ N I T + LY L++YE ++ Sbjct: 175 INLKFLNNLFESWKMVKLIIQGNPAIHTESLYNLYGELQSYESSI-----DPPTIAAFGG 229 Query: 931 PLAYLA--HQNSTTVASPTSSSFS-TQVPQQQALTGTEAMLATMQQLVNLMSGFQKQ-FP 1098 PLA ++ QN T + F+ T Q QA A QQL L++ Q F Sbjct: 230 PLALVSTTSQNQTPFNDQNFNHFNQTTSFQNQAFQSDSNDEADYQQLCALVANTNLQRFI 289 Query: 1099 PTNNQ--LRTSSNPR 1137 P + Q R + PR Sbjct: 290 PNHGQSNFRPNFQPR 304 >ref|XP_017249749.1| PREDICTED: uncharacterized protein LOC108220476 [Daucus carota subsp. 
sativus] Length = 310 Score = 171 bits (432), Expect = 6e-44 Identities = 88/172 (51%), Positives = 112/172 (65%), Gaps = 1/172 (0%) Frame = +1 Query: 1630 LWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVE 1809 LWYLDS CSRHMTGD + L YV+ ++ FG+D GYG D II V V+ Sbjct: 139 LWYLDSECSRHMTGDPTLLTKYVEKAGPSITFGDDIKGYTLGYGLIPNKDVIIEDVSLVD 198 Query: 1810 GLKHNLFSVGQFCDAGLEVAFRRHTCHIRNK-DMIDLLQGSRSTNLYSISLNNLMAASPV 1986 GLKHNL SV Q CD GL+V F C + NK D +L G R N+Y N+ + S Sbjct: 199 GLKHNLLSVSQLCDKGLQVWFPNAACVVSNKKDNNVVLNGVRKGNVYIADFNSARSKSLT 258 Query: 1987 CLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSC 2142 CL SKASS +SWLW++RL+HLNF T+N+L RKDLVRG+PKL+++K+ LC +C Sbjct: 259 CLFSKASSDESWLWNKRLSHLNFKTMNDLIRKDLVRGIPKLEFNKDGLCGAC 310 >ref|XP_012572663.1| PREDICTED: uncharacterized protein LOC105852301 [Cicer arietinum] Length = 752 Score = 169 bits (427), Expect = 4e-40 Identities = 129/458 (28%), Positives = 207/458 (45%), Gaps = 2/458 (0%) Frame = +1 Query: 787 WAKYVTGVKQNKDISTSTYVELYTYLKAYEPHALKTLKKQEQTSSIMDPLAYLAHQNSTT 966 W +T + +++D++ T L+ L+ +E L+ L + E S L+ N + Sbjct: 8 WQPKITAIAESRDLAKMTTATLFGKLREHEME-LQRLDESEMESRKKKGLSLKVQANQSK 66 Query: 967 VASPTSSSFSTQVPQQQALTGTEAMLATMQQLVNLMSGFQKQFPPTNNQLRTSSNPRSHA 1146 + S + S+ S+ ++ ++ L+ F+K +N+ R S+ + Sbjct: 67 IESDSCSNESSSDNEEP-------------EIGLLVKKFKKFLKKKDNKFRKPSSSK--- 110 Query: 1147 TVHEGQIITETVQRKAPGNVSYAGTSGNKSYGQMTDRYGKKVICYNCRGEGHVARQCKEP 1326 TS NK ++ CY C GH+ +C + Sbjct: 111 ------------------------TSDNK-----------QITCYECGKTGHIKSECYKL 135 Query: 1327 KRAKDTQYHKDKLMLSEAKDRGVKLDAEAEAFLADVECTEPLDESLALTTTTAFQVNHED 1506 + +K+K S+ K+ K +A++A + N E Sbjct: 136 Q-------NKNKAAKSKGKEPVTKTK---KAYIA-------------------WNDNDES 166 Query: 1507 AYDSDVDEGPHASAAFMANLSSTTDANGASSSKINEVVQMVLWYLDSGCSRHMTGDRSKL 1686 + SD +E A+ MAN S ++ +S + WYLDSGCS+HMTGD+SK Sbjct: 167 SASSDEEE---ANMCLMANSDSESEKEVCLTSTKHS------WYLDSGCSKHMTGDKSKF 217 Query: 1687 INYVDNFIGTVRFGNDEFATIDGYGDY-KLGDTIITRVYYVEGLKHNLFSVGQFCDAGLE 1863 ++ G V++G++ I G GD T+I V 
YVEGLKHNL S+ Q CD G + Sbjct: 218 LSLTLKEGGFVKYGDNNRGKIIGIGDIGNESTTVIKNVLYVEGLKHNLLSISQLCDKGFQ 277 Query: 1864 VAFRRHTCHIRNKDMIDL-LQGSRSTNLYSISLNNLMAASPVCLLSKASSTKSWLWHRRL 2040 V+F +C I +KD ++ L G R N+Y + N++ +A CLLS T WLWH+R+ Sbjct: 278 VSFSSQSCIIEHKDDKNIKLIGDRINNIYMLDFNSVPSAV-CCLLSNQDET--WLWHKRI 334 Query: 2041 NHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGK 2154 H++ LN+L K LV GLP K+ K+ LC SC+ K Sbjct: 335 AHIHINHLNKLVSKQLVIGLPNRKFSKDRLCDSCEKSK 372 >ref|XP_012575677.1| PREDICTED: uncharacterized protein LOC105853126 [Cicer arietinum] Length = 796 Score = 169 bits (427), Expect = 5e-40 Identities = 129/458 (28%), Positives = 207/458 (45%), Gaps = 2/458 (0%) Frame = +1 Query: 787 WAKYVTGVKQNKDISTSTYVELYTYLKAYEPHALKTLKKQEQTSSIMDPLAYLAHQNSTT 966 W +T + +++D++ T L+ L+ +E L+ L + E S L+ N + Sbjct: 8 WQPKITAIAESRDLAKMTTATLFGKLREHEME-LQRLDESEMESRKKKGLSLKVQANQSK 66 Query: 967 VASPTSSSFSTQVPQQQALTGTEAMLATMQQLVNLMSGFQKQFPPTNNQLRTSSNPRSHA 1146 + S + S+ S+ ++ ++ L+ F+K +N+ R S+ + Sbjct: 67 IESDSCSNESSSDNEEP-------------EIGLLVKKFKKFLKKKDNKFRKPSSSK--- 110 Query: 1147 TVHEGQIITETVQRKAPGNVSYAGTSGNKSYGQMTDRYGKKVICYNCRGEGHVARQCKEP 1326 TS NK ++ CY C GH+ +C + Sbjct: 111 ------------------------TSDNK-----------QITCYECGKTGHIKSECYKL 135 Query: 1327 KRAKDTQYHKDKLMLSEAKDRGVKLDAEAEAFLADVECTEPLDESLALTTTTAFQVNHED 1506 + +K+K S+ K+ K +A++A + N E Sbjct: 136 Q-------NKNKAAKSKGKEPVTKTK---KAYIA-------------------WNDNDES 166 Query: 1507 AYDSDVDEGPHASAAFMANLSSTTDANGASSSKINEVVQMVLWYLDSGCSRHMTGDRSKL 1686 + SD +E A+ MAN S ++ +S + WYLDSGCS+HMTGD+SK Sbjct: 167 SASSDEEE---ANMCLMANSDSESEKEVCLTSTKHS------WYLDSGCSKHMTGDKSKF 217 Query: 1687 INYVDNFIGTVRFGNDEFATIDGYGDY-KLGDTIITRVYYVEGLKHNLFSVGQFCDAGLE 1863 ++ G V++G++ I G GD T+I V YVEGLKHNL S+ Q CD G + Sbjct: 218 LSLTLKEGGFVKYGDNNRGKIIGIGDIGNESTTVIKNVLYVEGLKHNLLSISQLCDKGFQ 277 Query: 1864 VAFRRHTCHIRNKDMIDL-LQGSRSTNLYSISLNNLMAASPVCLLSKASSTKSWLWHRRL 2040 V+F +C I +KD ++ L G R N+Y + N++ +A CLLS T WLWH+R+ Sbjct: 278 
VSFSSQSCIIEHKDDKNIKLIGDRINNIYMLDFNSVPSAV-CCLLSNQDET--WLWHKRI 334 Query: 2041 NHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGK 2154 H++ LN+L K LV GLP K+ K+ LC SC+ K Sbjct: 335 AHIHINHLNKLVSKQLVIGLPNRKFSKDRLCDSCEKSK 372 >ref|XP_023757987.1| uncharacterized protein LOC111906461 [Lactuca sativa] Length = 950 Score = 165 bits (417), Expect = 2e-38 Identities = 90/186 (48%), Positives = 118/186 (63%), Gaps = 6/186 (3%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNF--IGT---VRFGNDEFATIDGYGDYKLGDTIITRV 1797 WY+DSGCS+HMTG+R NY+ +F I T V FGN+ A I GYG+ G+ I +V Sbjct: 697 WYIDSGCSKHMTGNR----NYLRDFKPIQTNQDVTFGNNMKAKIKGYGNITNGNFTIKKV 752 Query: 1798 YYVEGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDLLQGS-RSTNLYSISLNNLMA 1974 +V LKHNL SV Q CD LEV F + I + D++ S R+ N+Y + ++ + Sbjct: 753 AFVNDLKHNLISVSQQCDNNLEVLFTKQRSLIMDAKTKDVIVDSDRAGNMYPLDMDLIYG 812 Query: 1975 ASPVCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGK 2154 +CLLSKA + SWLWHRRL+HLNFG +N+L DLVRGLP LK D E LC +C+ GK Sbjct: 813 KPDICLLSKAPADISWLWHRRLSHLNFGYINKLIGDDLVRGLPLLKLDNETLCAACEKGK 872 Query: 2155 SKKSSH 2172 KS+H Sbjct: 873 LSKSTH 878 Score = 86.3 bits (212), Expect = 1e-13 Identities = 99/391 (25%), Positives = 154/391 (39%), Gaps = 8/391 (2%) Frame = +1 Query: 256 PMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGSTTVPTPRRKR 435 P+L +Y W RM + + + +W + EG P E T+ T + + Sbjct: 17 PILIPEEYNLWVGRMNLHLNAI--NEDIWKCV-EGTYVTP-------ENMATLATNQATQ 66 Query: 436 DDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTE--TGKEIWDNLELLMKGSGLTE 609 D +++QA L GLP + + +N T +IW+NL+ G+ Sbjct: 67 TDIIRK-------LELQAKKELVSGLPHSILSQMNDIIMLTVNQIWENLKNRFCGNKRII 119 Query: 610 QRKKEELFDEYERFKAIGNETIHDYFVRFHKLA---NDLKITKIQIPTHQQNTKFLNNLP 780 K+ + +E++ FK + +ETIHD RF+ + N+L I K Q H+ N KFLNNL Sbjct: 120 GNKRTSVLNEFDNFKMLSSETIHDAHDRFNLIMVKMNNLGIKKTQ---HEINLKFLNNLF 176 Query: 781 PYWAKYVTGVKQNKDISTSTYVELYTYLKAYEPHALKTLKKQEQTSSIMDPLAYLA--HQ 954 W ++ N I T + LY L++YE ++ PLA ++ Q Sbjct: 177 ESWKMVKLIIQGNPAIHTESLYNLYGELQSYESSI-----DPPTIAAFGGPLALVSTTSQ 231 Query: 955 
NSTTVASPTSSSFS-TQVPQQQALTGTEAMLATMQQLVNLMSGFQKQFPPTNNQLRTSSN 1131 N T + F+ T Q QA A QQL FQ+ + Q +T + Sbjct: 232 NQTPFNDQNFNHFNQTTSFQNQAFQSDSNDEADYQQL------FQQNHSQPSQQAQTQTP 285 Query: 1132 PRSHATVHEGQIITETVQRKAPGNVSYAGTSGNKSYGQMTDRYGKKVICYNCRGEGHVAR 1311 R Q D + +IC+NC+G H AR Sbjct: 286 ERLPIK------------------------------SQKDDSDEEVIICHNCKGTNHYAR 315 Query: 1312 QCKEPKRAKDTQYHKDKLMLSEAKDRGVKLD 1404 +C RAK+ KD ++ D KL+ Sbjct: 316 EC----RAKNKTKIKDSAYYAQRADELKKLE 342 >ref|XP_023767590.1| uncharacterized protein LOC111916195 [Lactuca sativa] Length = 934 Score = 164 bits (414), Expect = 4e-38 Identities = 88/186 (47%), Positives = 119/186 (63%), Gaps = 6/186 (3%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNF--IGT---VRFGNDEFATIDGYGDYKLGDTIITRV 1797 WY+DSGCS+HMTG+R NY+ +F I T V F N+ A I+GYG+ G+ I +V Sbjct: 744 WYIDSGCSKHMTGNR----NYLRDFKPIQTNQDVTFDNNMKAKINGYGNITNGNFTIKKV 799 Query: 1798 YYVEGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDLLQGS-RSTNLYSISLNNLMA 1974 +V+ LKHNL SV Q CD LEV F + I + D++ S R+ N+Y + ++ + Sbjct: 800 AFVDDLKHNLISVSQLCDNNLEVLFTKQRSLIMDAKTKDVIVDSDRAGNMYPLDMDLIYG 859 Query: 1975 ASPVCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGK 2154 +CLLSKA + SWLWHRRL+HLNFG +N+L DLVRGLP LK D E LC +C+ G+ Sbjct: 860 KPDICLLSKAPADISWLWHRRLSHLNFGYINKLIGDDLVRGLPLLKLDNETLCAACEKGR 919 Query: 2155 SKKSSH 2172 KS+H Sbjct: 920 LSKSTH 925 Score = 75.5 bits (184), Expect = 3e-10 Identities = 60/222 (27%), Positives = 103/222 (46%), Gaps = 5/222 (2%) Frame = +1 Query: 226 SSAAGTDNRPPMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGS 405 +++ G+ +R P+L +Y W RM + + + +W + EG P E Sbjct: 15 ANSLGSGSRAPILIPEEYNSWVGRMNLHLNAI--NEDIWKCV-EGTYVTP-------ENM 64 Query: 406 TTVPTPRRKRDDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTE--TGKEIWDNLE 579 T+ T + + D +++QA L G+P + + ++ T +IW+NL+ Sbjct: 65 ATLATNQATQTDIIRK-------LELQAKKELVSGIPHSILSQMDDIMMLTANQIWENLK 117 Query: 580 LLMKGSGLTEQRKKEELFDEYERFKAIGNETIHDYFVRFHKLA---NDLKITKIQIPTHQ 750 G+ K+ + +E++ FK + +ETIHD RF+ + N+L I K Q H+ Sbjct: 118 
NRFCGNKRIIGNKRTSVLNEFDNFKMLSSETIHDAHDRFNLIMVKMNNLGIKKTQ---HE 174 Query: 751 QNTKFLNNLPPYWAKYVTGVKQNKDISTSTYVELYTYLKAYE 876 N KFLNNL W ++ N I T + LY L++YE Sbjct: 175 INLKFLNNLFESWKMVKLIIQGNPAIHTESLYNLYGELQSYE 216 >dbj|GBC54550.1| gag-pol fusion protein [Rhizophagus irregularis DAOM 181602] Length = 457 Score = 157 bits (398), Expect = 7e-38 Identities = 82/178 (46%), Positives = 110/178 (61%), Gaps = 1/178 (0%) Frame = +1 Query: 1663 MTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVEGLKHNLFSVGQ 1842 MTGD S L +V+ ++ FG+D GYG + II V V GLKHNL S+ Q Sbjct: 1 MTGDSSLLTKFVEKAGPSITFGDDSKGYTMGYGLIAKENVIIDEVALVSGLKHNLLSISQ 60 Query: 1843 FCDAGLEVAFRRHTCHI-RNKDMIDLLQGSRSTNLYSISLNNLMAASPVCLLSKASSTKS 2019 CD G +V F C + + D +L G R N+Y N++ + S C LSKASS S Sbjct: 61 LCDKGYKVNFTPAACVVTKGDDNNVVLIGQRKGNVYVADFNSVKSESITCFLSKASSDDS 120 Query: 2020 WLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKKSSHPLKTVNT 2193 WLWH+RL+HLNF TLNEL +KDLVRG+PKL++ K+ LC +CQ GK K+SS K++++ Sbjct: 121 WLWHKRLSHLNFKTLNELVKKDLVRGIPKLEFSKDGLCGACQQGKQKRSSFKSKSLSS 178 >ref|XP_023731306.1| uncharacterized protein LOC111879058 [Lactuca sativa] Length = 230 Score = 147 bits (370), Expect = 2e-36 Identities = 76/177 (42%), Positives = 109/177 (61%), Gaps = 3/177 (1%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNFIGT-VRFGNDEFATIDGYGDYKLGDTIITRVYYVE 1809 WY+DSGCSRHM G + +L + G+ +++ ND F I GY G+ I +V YVE Sbjct: 45 WYIDSGCSRHMIGRKEELREFRALKDGSNIKYRNDSFGIIKGYDMIMNGEFFIHKVAYVE 104 Query: 1810 GLKHNLFSVGQ-FCDAGLEVAFRRHTCHI-RNKDMIDLLQGSRSTNLYSISLNNLMAASP 1983 GL+HNL SV Q GL+V+F I K LL+ + +Y ++LN + Sbjct: 105 GLQHNLISVSQSVVGTGLKVSFDEEGSEIIEKKSTTVLLKSQQKGEMYPLNLNLIRGKPA 164 Query: 1984 VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGK 2154 +CL++KA S +SWLWHR+L+HLNF +N+L D VRG P LK+DK++LC +C++GK Sbjct: 165 ICLITKAHSDESWLWHRQLSHLNFKDINKLVLGDHVRGRPLLKFDKDNLCAACEMGK 221 >gb|KYP35691.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] gb|KYP38474.1| Retrovirus-related Pol polyprotein from transposon 
TNT 1-94 [Cajanus cajan] Length = 1482 Score = 159 bits (401), Expect = 2e-36 Identities = 85/195 (43%), Positives = 119/195 (61%), Gaps = 2/195 (1%) Frame = +1 Query: 1630 LWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDY-KLGDTIITRVYYV 1806 +WYLDSGCSRHMTGD+SK I+ + G+V +G++ I G G +T+I V YV Sbjct: 473 MWYLDSGCSRHMTGDKSKFISLQEKEGGSVTYGDNNKGRILGSGSVGNNSNTLIENVLYV 532 Query: 1807 EGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMID-LLQGSRSTNLYSISLNNLMAASP 1983 EGLK+NL S+ Q CD ++F C + +K+ + LL G R N+Y + L + ++S Sbjct: 533 EGLKYNLLSISQLCDKNYNISFNNQCCMVCDKESNEVLLIGKRVNNIYILDLEH--SSSN 590 Query: 1984 VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKK 2163 CL S + T WLWHRR+ H+N LN LA KDLV GLPK+K+ K+ LC +CQ GK + Sbjct: 591 SCLTSHDNVT--WLWHRRIAHINADQLNRLASKDLVSGLPKIKFSKQGLCDACQKGKQTR 648 Query: 2164 SSHPLKTVNTNTEIL 2208 +S K V + + L Sbjct: 649 ASFKSKKVMSTSRPL 663 Score = 87.4 bits (215), Expect = 8e-14 Identities = 60/210 (28%), Positives = 100/210 (47%) Frame = +1 Query: 247 NRPPMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGSTTVPTPR 426 NRPP+ +Y WK+RMK F+ S +H + +W +++ T + + T EG + P Sbjct: 15 NRPPLFCGDNYPFWKVRMKIFMES-VH-RNIWQAVI---TDYKIPTKI--EGGKEIEKPY 67 Query: 427 RKRDDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTETGKEIWDNLELLMKGSGLT 606 D + E + D +A +I+ L F ++ T KE WD +++ +G+ Sbjct: 68 ----DSWDQSEIRRAENDAKALNIIHSALNSDEFFRISACSTAKEAWDLIQVTHEGTPEV 123 Query: 607 EQRKKEELFDEYERFKAIGNETIHDYFVRFHKLANDLKITKIQIPTHQQNTKFLNNLPPY 786 + +K L EYE F+ ET+ D RF + N LK + N K L +L Sbjct: 124 RRARKNTLIQEYETFRMNQGETVMDMQKRFIHIINHLKGLGKDFVEEEVNVKILKSLNRR 183 Query: 787 WAKYVTGVKQNKDISTSTYVELYTYLKAYE 876 W VT + ++K+++ T EL+ L+ YE Sbjct: 184 WQPTVTAITESKNLAQMTSAELFGKLREYE 213 >gb|KYP37030.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] gb|KYP55193.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 1530 Score = 159 bits (401), Expect = 3e-36 Identities = 85/195 (43%), Positives = 119/195 (61%), Gaps = 2/195 (1%) Frame = +1 Query: 1630 
LWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDY-KLGDTIITRVYYV 1806 +WYLDSGCSRHMTGD+SK I+ + G+V +G++ I G G +T+I V YV Sbjct: 521 MWYLDSGCSRHMTGDKSKFISLQEKEGGSVTYGDNNKGRILGSGSVGNNSNTLIENVLYV 580 Query: 1807 EGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMID-LLQGSRSTNLYSISLNNLMAASP 1983 EGLK+NL S+ Q CD ++F C + +K+ + LL G R N+Y + L + ++S Sbjct: 581 EGLKYNLLSISQLCDKNYNISFNNQCCMVCDKESNEVLLIGKRVNNIYILDLEH--SSSN 638 Query: 1984 VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKK 2163 CL S + T WLWHRR+ H+N LN LA KDLV GLPK+K+ K+ LC +CQ GK + Sbjct: 639 SCLTSHDNVT--WLWHRRIAHINADQLNRLASKDLVSGLPKIKFSKQGLCDACQKGKQTR 696 Query: 2164 SSHPLKTVNTNTEIL 2208 +S K V + + L Sbjct: 697 ASFKSKKVMSTSRPL 711 Score = 99.8 bits (247), Expect = 1e-17 Identities = 111/489 (22%), Positives = 190/489 (38%), Gaps = 3/489 (0%) Frame = +1 Query: 247 NRPPMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGSTTVPTPR 426 NRPP+ +Y WK+RMK F+ S +H + +W +++ T + + T EG + P Sbjct: 15 NRPPLFCGDNYPFWKVRMKIFMES-VH-RNIWQAVI---TDYKIPTKI--EGGKEIEKPY 67 Query: 427 RKRDDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTETGKEIWDNLELLMKGSGLT 606 D + E + D +A +I+ L F ++ T KE WD +++ +G+ Sbjct: 68 ----DSWDQSEIRRAENDAKALNIIHSALNSDEFFRISACSTAKEAWDLIQVTHEGTPEV 123 Query: 607 EQRKKEELFDEYERFKAIGNETIHDYFVRFHKLANDLKITKIQIPTHQQNTKFLNNLPPY 786 + +K L EYE F+ ET+ D RF + N LK + N K L +L Sbjct: 124 RRARKNTLIQEYETFRMNQGETVMDMQKRFIHIINHLKGLGKDFVEEEVNVKILKSLNRR 183 Query: 787 WAKYVTGVKQNKDISTSTYVELYTYLKAYEPHALKTLKKQEQTSSIMDPLAYLAHQNSTT 966 W VT + ++K+++ T EL+ L+ YE L + +EQ LA ++S+ Sbjct: 184 WQPTVTAITESKNLAQMTSAELFGKLREYEMD-LSRIADEEQKDKKAKGLALKVKESSSD 242 Query: 967 VASPTSSSFSTQVPQQQALTGTEAMLATMQQLVNLM-SGFQKQFPPTNNQLRTSSNPRSH 1143 ++ S + Q+ +NLM F+K NN R SS +S Sbjct: 243 EEDSSNES-------------------SEQEELNLMVKNFRKFMRRKNNNKRFSSQKKSF 283 Query: 1144 ATVHEGQIITETVQRKAPGNVSYAGTSGNKSYGQMTDRYGKKVICYNCRGEGHVARQCKE 1323 + + K C+ C GH+ C Sbjct: 284 ---------------------------------KKNENSSPKFKCFECGKAGHMRADCPS 310 Query: 1324 
PKRAKDTQYHKDKLMLSEAKDRGVKLDAEAEAFLADVECTEPLDESLALTT--TTAFQVN 1497 K+ ++T K K +K + + E A + E + +L L T + ++V Sbjct: 311 LKKNEET---KQKF---RSKKKKAYIAWEENASTTSSDSDEDQEANLCLMTKHNSDYEVY 364 Query: 1498 HEDAYDSDVDEGPHASAAFMANLSSTTDANGASSSKINEVVQMVLWYLDSGCSRHMTGDR 1677 D+ DE +A A A +N K+ + + + +R + D Sbjct: 365 DSDSSIDSYDELQNAFAELYAEAKKLEKSNNVYKKKMTHMRDKISDLEND--NRKLLSDI 422 Query: 1678 SKLINYVDN 1704 SKL + +N Sbjct: 423 SKLKSPCEN 431 >ref|XP_023745845.1| uncharacterized protein LOC111894012 [Lactuca sativa] Length = 535 Score = 154 bits (390), Expect = 3e-36 Identities = 87/195 (44%), Positives = 117/195 (60%), Gaps = 3/195 (1%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVD-NFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVE 1809 WY+DSGCSRHMTG R + + G V++GN+ + TI GYG D I +V YVE Sbjct: 341 WYIDSGCSRHMTGRREEPREFQALKDGGCVKYGNNLYRTIKGYGMITNEDFSIQKVAYVE 400 Query: 1810 GLKHNLFSVGQFC-DAGLEVAFRRHTCHI-RNKDMIDLLQGSRSTNLYSISLNNLMAASP 1983 GL+HN+ SV Q GL+V++ I K M LL+ +Y ++LN + Sbjct: 401 GLQHNIISVSQLVVGTGLKVSYDDEGSEIIEKKTMSVLLKSHHKDEMYPLNLNPIRGKPV 460 Query: 1984 VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKK 2163 V LL KA S +SWL HRRL HLNF +N+L D V+GLP LK+DKEHLC +C++GK + Sbjct: 461 VSLLMKAHSDESWLGHRRLLHLNFKDINKLVLGDHVQGLPLLKFDKEHLCAACEMGKQSR 520 Query: 2164 SSHPLKTVNTNTEIL 2208 SHP + NT+I+ Sbjct: 521 KSHPTR---INTKII 532 >gb|KYP60954.1| Copia protein, partial [Cajanus cajan] Length = 438 Score = 152 bits (384), Expect = 4e-36 Identities = 83/218 (38%), Positives = 115/218 (52%), Gaps = 10/218 (4%) Frame = +1 Query: 1558 ANLSSTTDANGASSSKINEVVQMVL---------WYLDSGCSRHMTGDRSKLINYVDNFI 1710 AN+ D N +S S E + + WYLDSGCSRHMTG+RS ++ Sbjct: 182 ANICLMADTNSSSESDDEEYLFQICSTRKISVQSWYLDSGCSRHMTGERSMFLDLKSKKG 241 Query: 1711 GTVRFGNDEFATIDGYGDYKLGDTI-ITRVYYVEGLKHNLFSVGQFCDAGLEVAFRRHTC 1887 G + FG + I G + +I I V YV+GL HNL S+ Q CD+G EV+F ++ C Sbjct: 242 GQITFGGGQKGQIMGISKVGINSSISIDNVLYVKGLTHNLLSISQLCDSGYEVSFNKNKC 301 Query: 1888 HIRNKDMIDLLQGSRSTNLYSISLNNLMAASPVCLLSKASSTKSWLWHRRLNHLNFGTLN 2067 I D L +R 
NLY I LN L + CL+S + WLWH++ H + ++ Sbjct: 302 TISQNDSSILFTANRCNNLYKIMLNELENQNVDCLVSYEN---QWLWHKKFGHASLRLIS 358 Query: 2068 ELARKDLVRGLPKLKYDKEHLCPSCQLGKSKKSSHPLK 2181 +L + +L+RGLP L Y +LC +CQ GK KSS K Sbjct: 359 KLIKHNLIRGLPSLVYQTNNLCEACQKGKQVKSSFESK 396 >gb|KYP40629.1| Copia protein, partial [Cajanus cajan] Length = 334 Score = 148 bits (373), Expect = 1e-35 Identities = 84/194 (43%), Positives = 113/194 (58%), Gaps = 2/194 (1%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTI-ITRVYYVE 1809 WY+DSGCS+HMTGD SK I+ G V +G++ I G G +I I V V+ Sbjct: 2 WYIDSGCSKHMTGDASKFIDLTPKRSGHVTYGDNNRGKILGIGKIGTNFSISIENVLLVD 61 Query: 1810 GLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDL-LQGSRSTNLYSISLNNLMAASPV 1986 GLKH+L SV Q CD G +F C I++K+ ++ + G R N+Y I L N S Sbjct: 62 GLKHSLLSVSQLCDKGFSESFDSQKCLIKHKNDKNVKIIGFRVNNVYKIKLENNSNNSQ- 120 Query: 1987 CLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKKS 2166 CL+SK +SWLWH+R+ H+N LN+L KDLV GLPK+K++K LC +CQ GK K Sbjct: 121 CLMSKED--ESWLWHKRMAHINIEHLNKLISKDLVIGLPKIKFEKNKLCDACQKGKQVKV 178 Query: 2167 SHPLKTVNTNTEIL 2208 S K + T + L Sbjct: 179 SFKPKNIVTTSRPL 192 >ref|XP_017228914.1| PREDICTED: uncharacterized protein LOC108204125 [Daucus carota subsp. 
sativus] Length = 748 Score = 153 bits (386), Expect = 6e-35 Identities = 78/188 (41%), Positives = 112/188 (59%), Gaps = 1/188 (0%) Frame = +1 Query: 1627 VLWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYV 1806 V+W L+SG SRHMT DR+ L V+ V FG++ GYG ++ + II + V Sbjct: 393 VMWILNSGSSRHMTRDRALLSKVVEKAGLVVTFGDESKGYTTGYGSLEIENVIIEDISLV 452 Query: 1807 EGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDL-LQGSRSTNLYSISLNNLMAASP 1983 +GL HNL S+ QFCD G +V F++ I N L L G R +L+ N+ Sbjct: 453 DGLMHNLLSISQFCDKGCDVFFKQDKFLITNHKNEKLALNGVRKGDLFVADWNSAEDGQV 512 Query: 1984 VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKK 2163 +C KAS SWLWH++L+HLNF T+N +++LVR LPK+++ E LC +C+ GKS+K Sbjct: 513 MCFYGKASVNDSWLWHKKLSHLNFKTMNSPVKRELVRVLPKMEFSPEGLCEACEKGKSRK 572 Query: 2164 SSHPLKTV 2187 +SH KT+ Sbjct: 573 ASHKKKTI 580 >gb|KZV36891.1| retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Dorcoceras hygrometricum] Length = 774 Score = 153 bits (386), Expect = 7e-35 Identities = 73/180 (40%), Positives = 111/180 (61%) Frame = +1 Query: 1630 LWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVE 1809 +WY DSGC+RHMTG+ L + + + + FG++ + G G G+ I V + Sbjct: 1 VWYFDSGCARHMTGNPKLLTDVIPHKGAKIVFGDNAYGNTVGKGKLIHGNISIVDVLVFD 60 Query: 1810 GLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDLLQGSRSTNLYSISLNNLMAASPVC 1989 LK+NL S+ Q CD G V F++HTC +++ + +L G RS N Y I ++ +C Sbjct: 61 NLKYNLISISQLCDKGYIVKFQQHTCTVQSPSGLTVLVGKRSGNTYIIDRSDQQEPEQLC 120 Query: 1990 LLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKKSS 2169 L A S+KSWLWH+R+NHL+F T+ +L++ DLV GLPK+ + K+ +C +CQLGK KS+ Sbjct: 121 L--AAGSSKSWLWHQRMNHLHFKTIAKLSQHDLVSGLPKISFSKDKICAACQLGKQIKST 178 >gb|KYP40635.1| Copia protein [Cajanus cajan] Length = 520 Score = 149 bits (375), Expect = 2e-34 Identities = 87/207 (42%), Positives = 118/207 (57%), Gaps = 4/207 (1%) Frame = +1 Query: 1600 SKINEVVQMV--LWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKL 1773 SK+N + WY+DSGCS+HMTGD SK I+ G V +G++ I G G Sbjct: 179 
SKLNNLKDFTNQTWYIDSGCSKHMTGDASKFIDLTPKRSGHVTYGDNNRGKILGIGKIGT 238 Query: 1774 GDTI-ITRVYYVEGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDL-LQGSRSTNLY 1947 +I I V V+GLKH+L SV Q CD G +F C I++K+ ++ + G R N+Y Sbjct: 239 NFSISIENVLLVDGLKHSLLSVSQLCDKGFSESFDSQKCLIKHKNDKNVKIIGFRVNNVY 298 Query: 1948 SISLNNLMAASPVCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEH 2127 I L N S CL+SK +SWLWH+R+ H+N LN+L KDLV GLPK+K++K Sbjct: 299 KIKLENNSNNSQ-CLMSKED--ESWLWHKRMAHINIEHLNKLISKDLVIGLPKIKFEKNK 355 Query: 2128 LCPSCQLGKSKKSSHPLKTVNTNTEIL 2208 LC +CQ GK K S K + T + L Sbjct: 356 LCDACQKGKQVKVSFKPKNIVTTSRPL 382 Score = 63.5 bits (153), Expect = 1e-06 Identities = 47/184 (25%), Positives = 84/184 (45%), Gaps = 4/184 (2%) Frame = +1 Query: 247 NRPPMLEESDYE*WKIRMKRFIRSKLHGKALWNSIMEGPTPHPMTTDPVGEGSTTVPTPR 426 NRPP+ Y WK RMK FI + +W+ + G + T VG +P Sbjct: 15 NRPPIFNGEGYHYWKSRMKIFIEAI--DLNIWDVVENGSF---IPTIVVGHEIKDLPK-- 67 Query: 427 RKRDDEFTDEENAKELIDMQAASILSQGLPRHVFNILNQTETGKEIWDNLELLMKGSGLT 606 D+++D++ K +++A +I++ L + ++ + KE+WD L++ +G+ Sbjct: 68 ----DQWSDDDRRKFQYNLKAKNIITSALGIDEYFRISNCKNAKEMWDTLKITHEGTNDV 123 Query: 607 EQRKKEELFDEYERFKAIGNETIHDYFVR----FHKLANDLKITKIQIPTHQQNTKFLNN 774 ++ +K L EYE F+ NE+I D + + + TK I + LNN Sbjct: 124 KRSRKNTLIHEYELFRMNQNESIQDMQLSIDCVLYCVTRLTNCTKKSISNLENKISKLNN 183 Query: 775 LPPY 786 L + Sbjct: 184 LKDF 187 >ref|XP_019194891.1| PREDICTED: uncharacterized protein LOC109188716 [Ipomoea nil] Length = 550 Score = 149 bits (376), Expect = 2e-34 Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 1/173 (0%) Frame = +1 Query: 1627 VLWYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYV 1806 V+WY+DSGCSRHMTGD+S L N+ + V FG + G GD + II V YV Sbjct: 380 VIWYIDSGCSRHMTGDKSNLSNFKEVDGTKVIFGGESGGKTKGVGDIIKNEIIIREVSYV 439 Query: 1807 EGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMID-LLQGSRSTNLYSISLNNLMAASP 1983 EGLK N S QFCD G +V F + C++ + + D +L SR N+Y +S ++ + Sbjct: 440 EGLKFNFLSTSQFCDKGYKVEFSKDKCNVISTENGDIILSTSRKKNMYVVSWK--LSKAN 497 Query: 1984 
VCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSC 2142 VCL++K+++ SW W+ +LNHLN T+ +LA+ +LV GLPK+ Y K+ +C +C Sbjct: 498 VCLMAKSNADLSWEWYSKLNHLNLKTIRKLAKGNLVEGLPKVSYIKDKICDAC 550 >gb|KYP64004.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Cajanus cajan] Length = 1021 Score = 152 bits (384), Expect = 2e-34 Identities = 87/196 (44%), Positives = 115/196 (58%), Gaps = 4/196 (2%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLG---DTIITRVYY 1803 WY++SGCS+HMTGD SK I++ G V +G++ I G G K+G T I V Sbjct: 1 WYINSGCSKHMTGDASKFIDFTPKRSGHVTYGDNNRGKILGIG--KIGTNFSTSIENVLL 58 Query: 1804 VEGLKHNLFSVGQFCDAGLEVAFRRHTCHIRNK-DMIDLLQGSRSTNLYSISLNNLMAAS 1980 V+GLKH+L SV Q CD G V+F C I +K D + G R N+Y I + N S Sbjct: 59 VDGLKHSLLSVSQLCDKGFSVSFDSQKCLIEHKNDKQVKIVGFRINNVYKIKIENNPKHS 118 Query: 1981 PVCLLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSK 2160 CL+SK + +SWLWH+R+ H+N LN+L KDLV GLPK+K++K LC +CQ GK Sbjct: 119 Q-CLMSK--NDESWLWHKRIAHINMEHLNKLISKDLVIGLPKIKFEKNKLCDACQKGKQV 175 Query: 2161 KSSHPLKTVNTNTEIL 2208 K S K + T T L Sbjct: 176 KVSFKPKNIVTTTRPL 191 >gb|ABD32582.1| Integrase, catalytic region; Zinc finger, CCHC-type; Peptidase aspartic, catalytic [Medicago truncatula] Length = 1715 Score = 152 bits (385), Expect = 3e-34 Identities = 76/186 (40%), Positives = 110/186 (59%), Gaps = 1/186 (0%) Frame = +1 Query: 1633 WYLDSGCSRHMTGDRSKLINYVDNFIGTVRFGNDEFATIDGYGDYKLGDTIITRVYYVEG 1812 WYLDSGCSRHMTG+++ + G V+FG ++ I G G I V+ V+G Sbjct: 670 WYLDSGCSRHMTGEKALFLTLTMKDGGEVKFGGNQTGKIIGTGTIGNSSISINNVWLVDG 729 Query: 1813 LKHNLFSVGQFCDAGLEVAFRRHTCHIRNKDMIDL-LQGSRSTNLYSISLNNLMAASPVC 1989 LKHNL S+ QFCD G +V F + C + NKD + +G R N+Y I+ ++L VC Sbjct: 730 LKHNLLSISQFCDNGYDVTFSKTNCTLVNKDDKSITFKGKRVENVYKINFSDLADQKVVC 789 Query: 1990 LLSKASSTKSWLWHRRLNHLNFGTLNELARKDLVRGLPKLKYDKEHLCPSCQLGKSKKSS 2169 LLS + K W+WH+RL H N+ ++++++ LV+GLP + Y + LC +CQ GK KSS Sbjct: 790 LLS--MNDKKWVWHKRLGHANWRLISKISKLQLVKGLPNIDYHSDALCGACQKGKIVKSS 847 Query: 2170 HPLKTV 2187 K 
+ Sbjct: 848 FKSKDI 853 Score = 64.3 bits (155), Expect = 1e-06 Identities = 78/392 (19%), Positives = 151/392 (38%), Gaps = 9/392 (2%) Frame = +1 Query: 496 ILSQGLPRHVFNILNQTETGKEIWDNLELLMKGSGLTEQRKKEELFDEYERFKAIGNETI 675 I+ +PR + ++ T K ++ +L +GS ++ K L +YE F+ +E+I Sbjct: 101 IIVASIPRTEYMKMSDKSTAKAMFASLCANFEGSKKVKEAKALMLVHQYELFRMKDDESI 160 Query: 676 HDYFVRFHKLANDLKITKIQIPTHQQNTKFLNNLPPYWAKYVTGVKQNKDISTSTYVELY 855 + + RF L + L+I K +K L +LP W VT +++ KD++T + +L Sbjct: 161 EEMYSRFQTLVSGLQILKKSYVASDHVSKILRSLPSRWRPKVTAIEEAKDLNTLSVEDLV 220 Query: 856 TYLKAYEPHALKTLKKQEQTSSIMDPLAYLAHQNSTTVASPTSSSFSTQVPQQQALTG-- 1029 + LK +E +L + +++ SI P S S +S ++ ++++ G Sbjct: 221 SSLKVHE-MSLNEHETSKKSKSIALP--------SKGKTSKSSKAYKASESEEESPDGDS 271 Query: 1030 TEAMLATMQQLVNLMSGFQKQFPPTNNQLRTSSNPRSHATVHEGQIITETVQRKAPGNVS 1209 E M L N + E + RK +S Sbjct: 272 DEDQSVKMAMLSNKL---------------------------------EYLARKQKKFLS 298 Query: 1210 YAGTSGNKSYGQMTDRYGKKVICYNCRGEGHVARQCKEPKRAKDTQYHKDKLMLSEAKDR 1389 G+ N + D+ G C+NC+ GH C + ++ K K S + Sbjct: 299 KRGSYKN---FKKEDQKG----CFNCKKPGHFIADCPDLQKEKFKGKSKKSSFNSSKFRK 351 Query: 1390 GVKL-------DAEAEAFLADVECTEPLDESLALTTTTAFQVNHEDAYDSDVDEGPHASA 1548 +K D ++E+ E + ++ L T + + E DS+ + ++ Sbjct: 352 QIKKSLMATWEDLDSESGSDKEEADDDAKAAVGLVATVSSEAVSEAESDSEDENEVYSKI 411 Query: 1549 AFMANLSSTTDANGASSSKINEVVQMVLWYLD 1644 + S + + NE+ + Y+D Sbjct: 412 PRQELVDSLKELLSLFEHRTNELTDLKEKYVD 443