BLASTX nr result
ID: Chrysanthemum21_contig00017632
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Chrysanthemum21_contig00017632 (2388 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_022014041.1| uncharacterized protein LOC110913529 [Helian... 800 0.0 gb|OTG21088.1| putative ribonuclease H-like domain-containing pr... 784 0.0 gb|OTG33444.1| putative ribonuclease H-like domain-containing pr... 780 0.0 ref|XP_022042162.1| uncharacterized protein LOC110944826 [Helian... 760 0.0 gb|OTG37431.1| putative ribonuclease H-like domain-containing pr... 779 0.0 gb|OTG16942.1| putative ribonuclease H-like domain-containing pr... 775 0.0 ref|XP_021986042.1| uncharacterized protein LOC110882290 [Helian... 749 0.0 ref|XP_021980336.1| uncharacterized protein LOC110876473 [Helian... 694 0.0 ref|XP_021971692.1| uncharacterized protein LOC110866849 [Helian... 566 0.0 ref|XP_022014401.1| uncharacterized protein LOC110913892 [Helian... 546 0.0 ref|XP_021991826.1| uncharacterized protein LOC110888615 [Helian... 492 e-162 ref|XP_022023932.1| uncharacterized protein LOC110924205 [Helian... 471 e-154 emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] 478 e-147 gb|PNX93622.1| retrovirus-related Pol polyprotein from transposo... 474 e-146 gb|OMO88216.1| Integrase, catalytic core [Corchorus capsularis] 476 e-145 gb|PNX92904.1| retrovirus-related Pol polyprotein from transposo... 472 e-145 gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposo... 451 e-144 ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotei... 469 e-144 gb|PNX93517.1| retrovirus-related Pol polyprotein from transposo... 468 e-143 ref|XP_017415203.1| PREDICTED: retrovirus-related Pol polyprotei... 459 e-140 >ref|XP_022014041.1| uncharacterized protein LOC110913529 [Helianthus annuus] Length = 784 Score = 800 bits (2067), Expect = 0.0 Identities = 402/795 (50%), Positives = 536/795 (67%), Gaps = 10/795 (1%) Frame = -2 Query: 2357 NNESVINSSEL---KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHN 2187 NN+S++++S K+ GD LYLHPSD++ I+++KL G++NY +W+ AM AL+ N Sbjct: 5 NNDSLVSTSGTLVSKIDAGDPLYLHPSDSANLTIVNIKLKGTDNYNVWANAMNLALQVKN 64 Query: 2186 KLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKE 2007 KLGF+DG+C + T + L QWD C+S+VLTWILNS+S EL+ G +YSK A +W +LKE Sbjct: 65 KLGFIDGSCARSTTDEVLGKQWDRCNSIVLTWILNSVSEELYLGHVYSKLASVVWKELKE 124 Query: 2006 TYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEK 1827 TYDKVDGS VFN+++ INS SQNG +++YY+ LN +WKQ D ++SLPTCTCDA F Sbjct: 125 TYDKVDGSVVFNLYQKINSFSQNGMPVSEYYHKLNCMWKQLDQILSLPTCTCDASKQFND 184 Query: 1826 HNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPN 1647 N LIKLMQFLMGLD Y +R+ +LT++ LPSVK AFS++S EESHRN ++ + I N Sbjct: 185 FNHLIKLMQFLMGLDSVYQSVRTTLLTREVLPSVKEAFSVVSREESHRNSNNFSEKISNN 244 Query: 1646 VSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRC 1467 FA NLKCS+CNK GHT+DRC Sbjct: 245 PVGFAVKTSQSFDSKKKNVRPPNP-------------------NLKCSHCNKTGHTVDRC 285 Query: 1466 YGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND 1287 Y LVGYP+ + TS K E ++S++ L++EQ+ +L+SLLND Sbjct: 286 YELVGYPSWMKSK--TSGNKGGRASNNVVVDAS--ETTSSSTVNGLTNEQIAQLLSLLND 341 Query: 1286 SPAHSNMAGKCFSG----TFFNAS---VKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGAN 1128 + S G F+G FN+ +K F+ + + NF GWI+DSGAN Sbjct: 342 K-SRSETQGNNFAGRSNYVCFNSYADVLKPTCDFKPAYCFS-NFGNNGKKAGWIIDSGAN 399 Query: 1127 QHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSL 948 QHM LIN +DV++ + V HPNGT AL+T+IGD+K+++ +IL+DV V+P+Y V+L Sbjct: 400 QHMITDDTNLINQMDVTEYNIKVKHPNGTSALVTKIGDVKLSDKVILYDVFVIPDYCVNL 459 Query: 947 LSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVS 768 +SVHKLA+D L+VSFDE CYIQD TK+ + G+Q GLY +T + N ++ Sbjct: 460 VSVHKLAKDCNLTVSFDEHNCYIQDSQTKKVLVTGSQLDGLYFCGNSTMSDKVCN-ASLD 518 Query: 767 RNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQ 588 NLWH RLGHP++ VLHVLK L++ K C+TCH+AKQ REPFPLS+HK+ +G Sbjct: 519 VNLWHARLGHPAEPVLHVLKDKLDIKKNV-KLEPCETCHRAKQHREPFPLSDHKTKSLGD 577 Query: 587 LVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFES 408 LVHLDVWGPY++ S+DG++YFLT+VDDY+RAVWVY++KNK +V+ +I F ML QF Sbjct: 578 LVHLDVWGPYRVQSRDGFRYFLTVVDDYTRAVWVYLMKNKDEVFYNIKGFFNMLKTQFNK 637 Query: 407 NIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQ 228 +IK+FRSDNGTEF+N +M+ F + GI HQTSCVYTPQQNGI ERKHRHLLNVAR+L+FQ Sbjct: 638 HIKMFRSDNGTEFINKQMKDFCYNNGIIHQTSCVYTPQQNGIVERKHRHLLNVARALLFQ 697 Query: 227 GELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNN 48 PL W EC+LTA YLINR PSSVL+G+SP+ LV+G P LS +R+ GCLC++T+LNN Sbjct: 698 AGFPLKFWSECILTATYLINRTPSSVLNGRSPYELVFGFAPELSQLRIVGCLCFSTVLNN 757 Query: 47 SDKFYSRSEKSVLIG 3 DKF S +EK VL+G Sbjct: 758 FDKFNSHAEKCVLVG 772 >gb|OTG21088.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1460 Score = 784 bits (2025), Expect = 0.0 Identities = 390/778 (50%), Positives = 524/778 (67%), Gaps = 4/778 (0%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145 KL GD LYLHPSD+S I+SVKL G+ENY +WS AM AL NK GF+DG +K Sbjct: 16 KLDIGDPLYLHPSDSSSLTIVSVKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75 Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965 + LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++ Sbjct: 76 DEILAAQWDRCNSVVLTWLLNSVSEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135 Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785 K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A + + LIKLMQFLMGL Sbjct: 136 KKINCIAQNGSTVAEYYNRLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195 Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605 DD Y P+R+NILT++ PSVK AFSI+S EESHR S G+ S + A +S Sbjct: 196 DDVYQPVRTNILTRESFPSVKVAFSIVSREESHRLSSGGSKSQSVSYVARSSQPNQSSSR 255 Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425 LKC++CN GHT+DRC+ ++GYP G KR+ Sbjct: 256 RNFRGSNSV---------------------LKCTHCNMLGHTVDRCFEIIGYPPGMKKRS 294 Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPA----HSNMAGK 1257 + + + S+ S + + EQ+ +LMSL+ + SNMAG+ Sbjct: 295 VGQSGR---NNVNSRSNQSAAPSSSVASALPFTSEQITKLMSLIGEKSEGEQQKSNMAGE 351 Query: 1256 CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVS 1077 + +F ++F+ W+VDSGANQHM + K +IN VDVS Sbjct: 352 S--------------SYVNNFVSCSSFVNFEHGYRWVVDSGANQHMVNTDKDMINCVDVS 397 Query: 1076 KLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFD 897 + GL VGHPNGT + +IG++K+ NN++L DV VP Y+V+LLSVHKLA+D+ ++V F+ Sbjct: 398 ECGLKVGHPNGTSVNVIKIGELKLINNVVLKDVFFVPGYSVNLLSVHKLAKDNNIAVLFN 457 Query: 896 ESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLH 717 ES C +QD +K+ + IGNQ +GLY + N + N N+WH RLGHPSDQVL Sbjct: 458 ESNCMLQDLKSKKVLVIGNQENGLYYVGRHGNSVNLCYNSVDKSNVWHSRLGHPSDQVLA 517 Query: 716 VLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDG 537 VLK L + KT + C+ CHK+KQ R PFPLS+HKS IG LVHLD+WGPY++TS +G Sbjct: 518 VLKDKLEI-KTVEHDP-CEICHKSKQVRVPFPLSDHKSKGIGDLVHLDLWGPYRVTSYEG 575 Query: 536 YKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNK 357 YKYFLT+VDDY+R+VW Y+L+NK +V++++ +F +++ QF++ +K+FRSDNGTEFVNN+ Sbjct: 576 YKYFLTVVDDYTRSVWCYLLRNKMEVFENLKDFYELILTQFKTKVKVFRSDNGTEFVNNQ 635 Query: 356 MQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAAY 177 M F +KGI HQTSC YTPQQNG+ ERKHRHLLN+AR+LMFQG LPL W +CVLTA Y Sbjct: 636 MNFFMKQKGILHQTSCSYTPQQNGVVERKHRHLLNIARALMFQGGLPLRFWSDCVLTAVY 695 Query: 176 LINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKSVLIG 3 LINRLPSSVL GKSP+ L++G +P+LSH+R FGCLC++T+LN SDKF ++K VLIG Sbjct: 696 LINRLPSSVLGGKSPYELMFGFEPSLSHLRSFGCLCFSTVLNESDKFAYHADKCVLIG 753 >gb|OTG33444.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1427 Score = 780 bits (2014), Expect = 0.0 Identities = 394/784 (50%), Positives = 518/784 (66%) Frame = -2 Query: 2354 NESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGF 2175 NE++++ KL D LYLH SD+S I+++KL GSENY IWS AM AL+ NK+GF Sbjct: 10 NETLVS----KLDASDPLYLHASDSSNLTIVNIKLKGSENYTIWSSAMKLALQVKNKIGF 65 Query: 2174 VDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDK 1995 +DG+C K N LA QWD C+SVV+TWILNS+S EL+ G ++SK A E+W DLKETYDK Sbjct: 66 IDGSCTKSKDNDVLAKQWDRCNSVVITWILNSVSEELYMGQVFSKLASEVWADLKETYDK 125 Query: 1994 VDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQL 1815 +DGS +F +H+ INSLSQNG+S+++YY+ LN++WKQFD MI LP+CTC A F + + Sbjct: 126 IDGSVIFGLHQKINSLSQNGTSVSEYYHKLNTMWKQFDQMIQLPSCTCRASKEFNDFSHM 185 Query: 1814 IKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAF 1635 IKLMQFLMGLDD Y P+R+N+LT + LPSVK+AFSIIS EESHRN + NV Sbjct: 186 IKLMQFLMGLDDVYHPVRTNLLTSETLPSVKTAFSIISREESHRNSKNPLKDQTQNVGFV 245 Query: 1634 ASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLV 1455 + NLKC++CNK GHT+DRCY +V Sbjct: 246 SKTNQSFETKKKFNRGPNP--------------------NLKCTHCNKLGHTVDRCYEIV 285 Query: 1454 GYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPAH 1275 GYP R+ S +E S++++ +L+ +Q+ RL+ LLN+ Sbjct: 286 GYPQNSKSRSNQSTKS----FASNNSVSNKVESSSASTIPALTPDQVSRLLGLLNERTGE 341 Query: 1274 SNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLI 1095 S+ +N ++G DSGANQHM + + + Sbjct: 342 SS---------------------------------QNANVG---DSGANQHMVRTEEGIF 365 Query: 1094 NTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSK 915 + +DVS+ + V HPNG+ A +T+IG K+N+ +IL DV VVPEY V+LLSV+KLA+D+K Sbjct: 366 DAIDVSEFNIKVKHPNGSDATVTKIGKYKLNDKVILTDVFVVPEYYVNLLSVYKLAKDNK 425 Query: 914 LSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHP 735 L V FDE+ CYIQD TK + GNQ GLY + + N + NLWH RLGHP Sbjct: 426 LRVLFDENNCYIQDSHTKNTLVTGNQVDGLYFCGDTSKTMKVCFNSHDTLNLWHSRLGHP 485 Query: 734 SDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYK 555 S+ VL VLK SLNL K T++ C+ CH+AKQ R PFPLS+HK++ +G+L+HLDVWGPY+ Sbjct: 486 SNPVLSVLKDSLNL-KFTNNDIPCEVCHRAKQHRVPFPLSDHKTSSLGELIHLDVWGPYR 544 Query: 554 ITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGT 375 I S++GYKYFL++VDDYSRAVWVY++++K +V+D+I +F M+ QF IK FRSDNGT Sbjct: 545 IQSREGYKYFLSVVDDYSRAVWVYLMEHKNEVFDNIKSFFNMIKTQFGKTIKTFRSDNGT 604 Query: 374 EFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPEC 195 EFVN++ + FF+ GI HQT+C YTPQQNGI ERKHRHLLNVARSL+FQG LPL W EC Sbjct: 605 EFVNHQTKNFFNTNGIIHQTTCPYTPQQNGIVERKHRHLLNVARSLLFQGGLPLRFWSEC 664 Query: 194 VLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKS 15 +LTA YLINR PSS+L+GKSP+ LVYG P L+H+R FGCLC++T+LNN DKF S +EK Sbjct: 665 ILTAVYLINRTPSSILNGKSPYDLVYGFKPFLNHLRNFGCLCFSTVLNNPDKFGSHAEKC 724 Query: 14 VLIG 3 V +G Sbjct: 725 VFLG 728 >ref|XP_022042162.1| uncharacterized protein LOC110944826 [Helianthus annuus] Length = 846 Score = 760 bits (1962), Expect = 0.0 Identities = 387/786 (49%), Positives = 523/786 (66%), Gaps = 12/786 (1%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGF--VDGTCKKD 2151 KL D LYLH SD+SG ++++KL G ENY +WS AM AL NKLG +DG+CK+ Sbjct: 16 KLDASDPLYLHASDSSGLTVVNIKLKGIENYVVWSNAMHLALMTKNKLGQKKIDGSCKRS 75 Query: 2150 TANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFN 1971 T + LA+QWD C+S+VLTWILNS+S EL+ G +YSK A E+W+DLKETY+K+DGS VF Sbjct: 76 TTDDVLASQWDRCNSIVLTWILNSVSDELYVGQVYSKLASEVWDDLKETYNKIDGSVVFG 135 Query: 1970 IHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLM 1791 + + INS+SQNG+S++ YY+ +N++WKQFD+M+ LP+C+C A F + N LIKLMQFLM Sbjct: 136 LFQKINSVSQNGASVSKYYHKINTMWKQFDAMLQLPSCSCQASTKFNEFNHLIKLMQFLM 195 Query: 1790 GLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXX 1611 GLDD Y P+R+N+LT+ PLP+VK+AFSIIS EESHR+ S + S PNV A Sbjct: 196 GLDDVYQPVRTNLLTRYPLPTVKTAFSIISREESHRD--SNSSSKVPNVGFAAKTNQFNE 253 Query: 1610 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGK 1431 NLKC++CNK GH +D+C+ L GYP+ + Sbjct: 254 NKKRFTKVSNP--------------------NLKCTHCNKIGHVVDKCFELHGYPSNFRP 293 Query: 1430 RNFTSN---VKPXXXXXXXXXXXXSL---ECSTSNSPVSLSDEQMVRLMSLLN----DSP 1281 R +N KP + + S SNS SL+ +Q +L+ LLN D+ Sbjct: 294 RPNQNNNQWSKPNISANSSINSTVNHSFNDKSASNSLNSLTSDQFTKLLDLLNEKKTDNG 353 Query: 1280 AHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKP 1101 +N+ GK + N + +++ + N+ +N ++ WI+DS ANQHM +SS+ Sbjct: 354 PKTNVRGK-----YHNVISSLDC-YKRSYCFNSKSWSQN-NMSWIIDSSANQHMIMSSEN 406 Query: 1100 LINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARD 921 + N VDVS +TV HPNGT A +T IG K++N++IL DV VVP+Y V+L+ VHKLA+D Sbjct: 407 MFNKVDVSDYNITVKHPNGTDAKVTIIGCYKLSNSVILRDVFVVPKYCVNLIFVHKLAKD 466 Query: 920 SKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLG 741 ++L V FDE CYIQD K+ + Q+ GLY N N + LWH RLG Sbjct: 467 NQLRVVFDEDTCYIQDLYLKKTLVTSRQTDGLYFCGNYFNSVIACFNKAETIKLWHSRLG 526 Query: 740 HPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGP 561 HP DQ L+VL NL + C+ CHKAKQ R PFPLSEHK++K+G L+HLDVWGP Sbjct: 527 HPVDQALNVL----NLKTDKANIDPCEVCHKAKQHRVPFPLSEHKTSKVGDLIHLDVWGP 582 Query: 560 YKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDN 381 YK++S +G+KYFLT+VDDYSR+VWVY++K+K +V+++I +F ++ QFE NIK FRSDN Sbjct: 583 YKVSSIEGFKYFLTVVDDYSRSVWVYLMKSKVEVFENIQSFYNLVKTQFEVNIKAFRSDN 642 Query: 380 GTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWP 201 GTEFVN++M F + GI HQTSC YTPQQNG+ ERKH HLLNVAR+L+FQ +PL W Sbjct: 643 GTEFVNSQMSNFVNTHGIIHQTSCAYTPQQNGVVERKHGHLLNVARALLFQSGVPLKFWS 702 Query: 200 ECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSE 21 ECVLTA+YLINR PSSVL+GK+P+ L++G +P+LSH+++FGCLC+ T+LNN DK +E Sbjct: 703 ECVLTASYLINRTPSSVLNGKTPYELLFGFEPSLSHLKIFGCLCFFTVLNNPDKLDEEAE 762 Query: 20 KSVLIG 3 K + +G Sbjct: 763 KCIFMG 768 >gb|OTG37431.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1459 Score = 779 bits (2012), Expect = 0.0 Identities = 393/786 (50%), Positives = 525/786 (66%), Gaps = 12/786 (1%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145 KL GD LYLHPSD+S I+S+KL G+ENY +WS AM AL NK GF+DG +K Sbjct: 16 KLDIGDPLYLHPSDSSSLTIVSIKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75 Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965 + LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++ Sbjct: 76 DEILAAQWDRCNSVVLTWLLNSISEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135 Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785 K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A + + LIKLMQFLMGL Sbjct: 136 KKINCIAQNGSTVAEYYNRLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195 Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605 DD Y P+R+NILT++ PSVK AFSI+S EESHR SSG GS NVS + Sbjct: 196 DDVYQPVRTNILTRESFPSVKVAFSIVSREESHR-LSSG-GSKTQNVSFVSKPNQAFDPK 253 Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGY---G 1434 NLKC++CN GHT+DRC+ +VGYP G+ G Sbjct: 254 RRNNRGPNP--------------------NLKCTHCNMIGHTVDRCFEIVGYPPGFRRKG 293 Query: 1433 KRNFTS--NVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND----SPAHS 1272 N T+ N S +S + + EQ+ +L+SL+ + S ++ Sbjct: 294 TNNQTNKTNSSVNNNNSNKSNNVGGSSVSAVSSGLPFTSEQISKLLSLVGEKSGSSAQNT 353 Query: 1271 NMAGKCFSGTFF---NASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKP 1101 ++ G+CF+ + F ++SV FN F WIVDSGA+QHM S K Sbjct: 354 SVGGECFNVSNFVSCSSSVSFNNSFV-----------------WIVDSGASQHMIKSDKY 396 Query: 1100 LINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARD 921 +IN VDVS+ +TVGHPNGT+ + +IGD+K+ +N++L DV VP+Y V+LLSV+KLA+D Sbjct: 397 MINVVDVSEFNITVGHPNGTKVKVLKIGDLKLTDNVVLRDVFYVPDYCVNLLSVYKLAKD 456 Query: 920 SKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLG 741 + +SV F E+ C +QD ++++ + G+Q SGLY + N + N +V WH RLG Sbjct: 457 NHISVIFKENSCVLQDSSSRKVLMNGSQDSGLYFVENYGNSVNVCLNSSVKSFTWHTRLG 516 Query: 740 HPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGP 561 HPSDQVL VLK SL ++ CD CH+AKQ R PFPLSEHKS +G L+HLD+WGP Sbjct: 517 HPSDQVLAVLKGSLKINSNEHGP--CDVCHRAKQVRVPFPLSEHKSKFVGDLIHLDLWGP 574 Query: 560 YKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDN 381 YK++S DG+KYFLT+VDDYSR+VW Y L NK +V++++ NF +++ QF+ IK+FRSDN Sbjct: 575 YKVSSYDGFKYFLTVVDDYSRSVWCYFLTNKTEVFENLKNFYELVVTQFKKRIKVFRSDN 634 Query: 380 GTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWP 201 GTEFVNN+M F KGI HQTSC YTPQQNG+ ERKHRHLLN AR+LMFQG LPL W Sbjct: 635 GTEFVNNQMSMFCKSKGILHQTSCSYTPQQNGVVERKHRHLLNTARALMFQGGLPLRYWS 694 Query: 200 ECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSE 21 +CVLTA YLINRLPSSVL+G+SPF +++G P+LSH+R FGCLC++T+L +SDKF ++ Sbjct: 695 DCVLTAVYLINRLPSSVLNGRSPFEMMFGFSPSLSHLRNFGCLCFSTVLTDSDKFAYHAD 754 Query: 20 KSVLIG 3 K V +G Sbjct: 755 KCVFLG 760 >gb|OTG16942.1| putative ribonuclease H-like domain-containing protein [Helianthus annuus] Length = 1458 Score = 775 bits (2002), Expect = 0.0 Identities = 390/780 (50%), Positives = 521/780 (66%), Gaps = 6/780 (0%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145 KL GD LYLHPSD+S I+SVKL G+ENY +WS AM AL NK GF+DG +K Sbjct: 16 KLDIGDPLYLHPSDSSSLTIVSVKLKGTENYAVWSSAMKLALEAKNKYGFIDGKVEKSKD 75 Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965 + LA QWD C+SVVLTW+LNS+S ELF G ++SK A E+W DLKE++DK+DGS V++++ Sbjct: 76 DEILAAQWDRCNSVVLTWLLNSVSEELFLGQVFSKLASEVWTDLKESFDKIDGSVVYDLY 135 Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785 K IN ++QNGS++A+YYN L ++WKQFD+M+ LP+C+C A + + LIKLMQFLMGL Sbjct: 136 KKINCIAQNGSTVAEYYNKLTTMWKQFDAMLQLPSCSCQAAKDYNDFSALIKLMQFLMGL 195 Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605 DD Y P+R+NILT++ PSVK AFSI+S EESHR SSG+ S + A ++ Sbjct: 196 DDIYQPVRTNILTRETFPSVKVAFSIVSREESHRLSSSGSKSQSVSYVARSNQSNQNTSK 255 Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425 NLKC++CN GHT+DRC+ ++GYP G KR Sbjct: 256 RNFRGPNS---------------------NLKCTHCNMIGHTVDRCFEIIGYPPGMKKRG 294 Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSP----AHSNMAGK 1257 S K S++ S ++ + EQ+ +LMSL+ + P SNM G Sbjct: 295 NMSFGKNNGNNTSRSGMSSG-PSSSAVSALTFTPEQIAKLMSLVGEKPDGDQEKSNMGGM 353 Query: 1256 --CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVD 1083 C SG S ++ F +N W+VDSGANQHM S K + N +D Sbjct: 354 SACMSGFL---SCSSSVCFSHEYN-------------WVVDSGANQHMIKSDKDMFNCID 397 Query: 1082 VSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVS 903 VS+ GL VGHPNGT + +IGD+K+ NN+I+ DV VP Y+V+LLSVHKLA+D+K++V Sbjct: 398 VSECGLKVGHPNGTSVSVLKIGDLKLINNVIIKDVFYVPGYSVNLLSVHKLAKDNKIAVL 457 Query: 902 FDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQV 723 F+E+ C +QD +K+ + IG Q +GLY N N + N +V +LWH RLGHPSDQV Sbjct: 458 FNENNCMLQDLRSKKILVIGRQENGLYFVGRNGNFANLCFNSSVKSDLWHSRLGHPSDQV 517 Query: 722 LHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSK 543 L VLK SL++ + C+ CH++KQ R PFPLS+HKS ++G L+HLD+WGPYK++S Sbjct: 518 LAVLKDSLDVKIVEHNP--CEVCHRSKQVRVPFPLSDHKSKELGDLIHLDLWGPYKVSSY 575 Query: 542 DGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVN 363 +GYKYFLT+VDD++R VW YMLK+K +V++++ F +++ QF+ +K+FRSDNGTEF+N Sbjct: 576 EGYKYFLTVVDDFTRTVWCYMLKSKVEVFENLKYFYELVLTQFKKKVKMFRSDNGTEFIN 635 Query: 362 NKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTA 183 N+M F +KGI HQTSC YTPQQNG+ ERKHRHLLN AR+LMFQ LPL W +CVLTA Sbjct: 636 NQMSTFCKQKGIVHQTSCSYTPQQNGVVERKHRHLLNTARTLMFQSGLPLRFWSDCVLTA 695 Query: 182 AYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNSDKFYSRSEKSVLIG 3 Y+INRLPSSVLSGKSP+ L++G P+LS+ R FGCLC++T LN DKF ++K VLIG Sbjct: 696 VYIINRLPSSVLSGKSPYELMFGFRPSLSYFRNFGCLCFSTNLNEPDKFAYHADKCVLIG 755 >ref|XP_021986042.1| uncharacterized protein LOC110882290 [Helianthus annuus] Length = 851 Score = 749 bits (1933), Expect = 0.0 Identities = 377/746 (50%), Positives = 506/746 (67%), Gaps = 9/746 (1%) Frame = -2 Query: 2213 MTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTA 2034 M AL+ NK+GF+DGTC++ T + L QWD C+S+VLTWILNS+S +L+ G +YSK A Sbjct: 1 MNLALQVKNKIGFIDGTCRRSTTDEVLGRQWDRCNSIVLTWILNSVSEDLYLGHVYSKLA 60 Query: 2033 YEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCT 1854 ++W DLKETYDKVDGS VFN+++ INS +Q+G +++YY+ LN +WKQ D +++LP CT Sbjct: 61 SDVWKDLKETYDKVDGSVVFNLYQKINSFTQSGMPVSEYYHKLNCMWKQMDQLLALPACT 120 Query: 1853 CDAGIHFEKHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFS 1674 CDA F N LIKLMQFLMGLD TY +R+N+LT++ LPSVK AFSIIS EESH N Sbjct: 121 CDASKQFNDFNHLIKLMQFLMGLDSTYQSVRTNLLTREILPSVKDAFSIISREESHLNSK 180 Query: 1673 SGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCN 1494 + + +V FA+ NLKCS+CN Sbjct: 181 NFSDKTHNSVVGFATKTNQLIDTKKKGIRTPNP-------------------NLKCSHCN 221 Query: 1493 KPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECS-TSNSPVS-LSDE 1320 K GHTI++C+ LVGYP+ KP +E S T++S VS L+ + Sbjct: 222 KTGHTIEKCFELVGYPSWMKS-------KPGGNKGSRVSNNSVVENSDTTSSAVSYLTSD 274 Query: 1319 QMVRLMSLLNDSPAH----SNMAGKCFSGTFFNASVKFNLKFEKHFNGN---TNFLKKNT 1161 Q+ +L+SLL+D P + SN AG+C S +F+++V K F +NF+ Sbjct: 275 QIAQLLSLLHDKPKNDPSCSNFAGRCNS-VYFDSNVDVFSKPTSDFKPAYCFSNFINAGK 333 Query: 1160 SLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHD 981 +GWI+DSGANQHM + LIN +DVS+ + V HPNGT AL+T+IGD+K++ +IL+D Sbjct: 334 KVGWIIDSGANQHMVKNDIGLINQMDVSEYNIKVKHPNGTSALVTKIGDIKLSEKVILYD 393 Query: 980 VLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTN 801 V VVP+Y V+L+SVHKLA+D L+VSFDE CYIQD TK+ G+Q GLY + Sbjct: 394 VFVVPDYCVNLVSVHKLAKDCNLTVSFDEHNCYIQDSRTKKVQVTGSQLDGLYFCGGSAL 453 Query: 800 CKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFP 621 + + ++ N WH RLGHP++ VLHVLK L++ K C+TCH+AKQ REPFP Sbjct: 454 SDKVCS-ASLDVNRWHARLGHPAEPVLHVLKNKLDI-KAGIKLEPCETCHRAKQHREPFP 511 Query: 620 LSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVN 441 LSEHK+ + +L+HLDVWGPY++ S++G+++FLT+VDDY+RAVWVY++K+K DV+ +I + Sbjct: 512 LSEHKTKNLSELIHLDVWGPYRVQSREGFRFFLTVVDDYTRAVWVYLMKSKEDVFYNIKD 571 Query: 440 FTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRH 261 F ML QF+ ++K+FRSDNGTEF+N +M++F H GI HQTS V+TPQQNGI ERKHRH Sbjct: 572 FFNMLKTQFDKHVKMFRSDNGTEFINKQMKEFCHNHGIIHQTSGVHTPQQNGIVERKHRH 631 Query: 260 LLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVF 81 LLNVAR+L+FQ PL W EC+LTA YLINR PSSVL+G+SP+ LVYG P LS +RV Sbjct: 632 LLNVARALLFQVGFPLKFWSECILTATYLINRTPSSVLNGRSPYKLVYGFAPVLSQLRVI 691 Query: 80 GCLCYATILNNSDKFYSRSEKSVLIG 3 GCLC++T+LNN+DKF S +EK VLIG Sbjct: 692 GCLCFSTVLNNTDKFNSHAEKCVLIG 717 >ref|XP_021980336.1| uncharacterized protein LOC110876473 [Helianthus annuus] Length = 801 Score = 694 bits (1791), Expect = 0.0 Identities = 371/805 (46%), Positives = 491/805 (60%), Gaps = 20/805 (2%) Frame = -2 Query: 2357 NNESVINSSELKLVF----GDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190 ++ +VINS LV GD L+LHPSD++ I+SVKL GSENY+IWS AM AL+ Sbjct: 4 DDNTVINSPGATLVSKIDAGDPLFLHPSDSANLSIVSVKLKGSENYRIWSNAMYLALQVK 63 Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010 NK+GFVDG+C + + L QWD C+S+VLTWILNS+S EL+ G +YSK A ++W DLK Sbjct: 64 NKIGFVDGSCLRSKTDEVLGRQWDRCNSIVLTWILNSVSEELYLGLVYSKIASDVWKDLK 123 Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830 +TYDK+DGS VFN+++ INS SQNG +++YY+ LN +WKQ D +++LP C+CDA F Sbjct: 124 DTYDKIDGSVVFNMYQKINSFSQNGMPISEYYHKLNCMWKQLDQLLALPACSCDASKQFN 183 Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650 N LIKLMQFLMGLD +Y +R+N+LT++ LPSVK AFS+IS EESH + + Sbjct: 184 DFNHLIKLMQFLMGLDSSYQSVRTNLLTRETLPSVKDAFSVISREESHLHSKNIFDKTPN 243 Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470 N F+ NLKCS+CNK GHTI++ Sbjct: 244 NPVGFSVKTGQTIDSRKRNNRTLNP-------------------NLKCSHCNKTGHTIEK 284 Query: 1469 CYGLVGYPA------GYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPV--SLSDEQM 1314 C+ LVGYP+ G K N SN +T +P SLS++Q+ Sbjct: 285 CFELVGYPSWIKTKPGGNKGNKVSN--------------NVTADTTDTTPAMSSLSNDQI 330 Query: 1313 VRLMSLLNDSPA----HSNMAGKCFS----GTFFNASVKFNLKFEKHFNGNTNFLKKNTS 1158 +L+SLLND P S AG C + +F N S K F+ F + NF+ Sbjct: 331 AQLLSLLNDKPKGDPQSSGFAGMCVNPVCLNSFVNLSAKPICDFKPVFCFS-NFINDGKK 389 Query: 1157 LGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDV 978 +GWIVDSGANQHM ++ + LIN DV++ + V HPNGT AL+T+IGD+K+++ Sbjct: 390 VGWIVDSGANQHMVMTDECLINQKDVTEFNIKVKHPNGTSALVTKIGDIKLSD------- 442 Query: 977 LVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNC 798 +D T++ G+Q GLY Sbjct: 443 ---------------------------------KDSQTQKVQVTGSQFDGLYF------- 462 Query: 797 KAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPL 618 GHP++ VLHVLK +LN+ KT + C+TCHKAKQ REPFPL Sbjct: 463 -----------------CGHPAEPVLHVLKNNLNI-KTGAKLNPCETCHKAKQHREPFPL 504 Query: 617 SEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNF 438 S+HKS +G L+HLDV GPY++ S++G++YFLT+VDDYSRAVWVY++K+K +V+ +I F Sbjct: 505 SDHKSEALGDLIHLDVRGPYRVQSREGFRYFLTMVDDYSRAVWVYLMKSKDEVFYNIKGF 564 Query: 437 TQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHL 258 L QF ++KIFRSDNGTEF N +M F ++ GI HQTSCV+TPQQNGI ERKHRHL Sbjct: 565 YNFLKTQFSKSVKIFRSDNGTEFTNKQMSNFCYENGILHQTSCVHTPQQNGIVERKHRHL 624 Query: 257 LNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFG 78 LNVAR+L+FQG P+ W EC+LTA+YLINR PSS+LSGKSP+ LV+G P L +RV G Sbjct: 625 LNVARTLLFQGGFPIKFWSECILTASYLINRTPSSILSGKSPYELVFGFSPVLGQLRVIG 684 Query: 77 CLCYATILNNSDKFYSRSEKSVLIG 3 CLC+ T+LNNSDKF + +EK VL+G Sbjct: 685 CLCFNTVLNNSDKFTTHAEKCVLVG 709 >ref|XP_021971692.1| uncharacterized protein LOC110866849 [Helianthus annuus] Length = 828 Score = 566 bits (1459), Expect = 0.0 Identities = 294/683 (43%), Positives = 415/683 (60%), Gaps = 3/683 (0%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145 KL D LYLHPSD+S I+SVKL GSENY +WS AM Sbjct: 23 KLDASDPLYLHPSDSSNLTIVSVKLKGSENYTVWSNAM---------------------- 60 Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965 QWD C+S+VLTWILNS+S EL+ G ++SK A ++W+DLKETY+KV+GS VF ++ Sbjct: 61 -----QQWDRCNSIVLTWILNSISEELYMGQVFSKLACDVWSDLKETYNKVEGSVVFYLY 115 Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785 K IN +QNG+++++YY+ LN +W+Q D ++ LP+CTC+A F N +I+LMQFLMGL Sbjct: 116 KKINGFTQNGTNVSEYYHKLNVMWRQLDEILQLPSCTCEAAKEFNNFNHMIELMQFLMGL 175 Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605 DD Y +R+N+L K+ LP+VK AF+I+S EESHRN S S + AF S Sbjct: 176 DDVYQGVRTNLLMKETLPTVKEAFAIVSREESHRN--SSNSSKEGLTMAFVSKVSQPIEF 233 Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425 NLKCS+CNK GH++D+C+ ++GYP Sbjct: 234 KRGNKVANQ--------------------NLKCSHCNKVGHSVDKCFEIIGYP------- 266 Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVS---LSDEQMVRLMSLLNDSPAHSNMAGKC 1254 S +KP + S ++ VS L+ EQ+ RL+SL+ D P+ + + Sbjct: 267 --SWMKPPRGNQVKKAVASNSSTSVESANVSVNSLTSEQITRLLSLIGDKPSGAPQSCSV 324 Query: 1253 FSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSK 1074 F ++V F K ++ K S+GW++DSGANQHM K L +++DVS+ Sbjct: 325 SGSNFLCSNVFF-----KPVICFSSESKDEQSVGWVIDSGANQHMIKDEKVLSHSIDVSE 379 Query: 1073 LGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDE 894 +TV HPNGT AL+T+IG++K+ NN+IL DV +VPEY ++L+ VHKL +D+ L V FDE Sbjct: 380 FKITVKHPNGTNALVTKIGNVKLVNNVILKDVFLVPEYNINLIYVHKLVKDNGLYVGFDE 439 Query: 893 SKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHV 714 +KCY+QD +TK+ + G+Q GLY + C + N + +LWH RLGHPSDQ + V Sbjct: 440 NKCYVQDISTKKVLVTGSQVDGLYFCGSSFMCNKVCFNSSSLNDLWHVRLGHPSDQAIRV 499 Query: 713 LKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGY 534 K L L +D++ C+ C +AKQ REPFPLS H+S+ +G LVHLDVWGPY++TS++G+ Sbjct: 500 FKYKLKLG-NSDTSLPCEVCQRAKQHREPFPLSSHRSSSLGDLVHLDVWGPYRVTSREGH 558 Query: 533 KYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKM 354 +FLTIVDDYSR VWV ++K K +V++++V+F ++ QF +K FRSDNGTEF+N + Sbjct: 559 WFFLTIVDDYSRVVWVCLMKTKQEVFENVVDFVNIIKTQFHKEVKCFRSDNGTEFINQQT 618 Query: 353 QQFFHKKGIFHQTSCVYTPQQNG 285 +F K + H TP G Sbjct: 619 NRFCKIKDVQHSK----TPDDEG 637 >ref|XP_022014401.1| uncharacterized protein LOC110913892 [Helianthus annuus] Length = 583 Score = 546 bits (1407), Expect = 0.0 Identities = 293/614 (47%), Positives = 395/614 (64%), Gaps = 7/614 (1%) Frame = -2 Query: 2369 TESMNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190 TE SV S+L + D LYLH SD+S I+++KL G+ENY +WS AM AL Sbjct: 4 TEKQGESSVTLVSKLDV--SDPLYLHASDSSSLFIVNIKLKGTENYVVWSNAMKLALTAK 61 Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010 NKLGF++GTC K T + LA+QWDMC+SV+LTWILNS+S EL+ G +YS A E+W+DLK Sbjct: 62 NKLGFINGTCTKSTKDDVLASQWDMCNSVILTWILNSVSKELYVGQVYSSLASEVWSDLK 121 Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830 +TYD+VDGS VF +++ INS+SQNG+S+++YY+ LN++WKQFD+++ LP+CTCDA + Sbjct: 122 DTYDRVDGSVVFGLYQKINSVSQNGTSVSEYYHRLNTMWKQFDAIVQLPSCTCDASSKYN 181 Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650 + +QLIKLMQFLMGLDD Y P+R+N+LT+DPLP+VK+AFS+IS EESHR+ S S P Sbjct: 182 EFSQLIKLMQFLMGLDDIYQPVRTNLLTRDPLPTVKTAFSVISREESHRD--SNKSSKTP 239 Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470 NV A NLKC++CNK GH I++ Sbjct: 240 NVGFVAKATQYNDNKKRFNKGPNP--------------------NLKCTHCNKVGHVIEK 279 Query: 1469 CYGLVGYPAGY-GKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPV-SLSDEQMVRLMSL 1296 C+ L GYP+ Y K N ++ ++ +NS + +L+ +Q +L+ L Sbjct: 280 CFKLHGYPSNYRNKSNQNNSQWSKTNLSANNSVANTMNDQPANSSLNALTVDQFSKLLGL 339 Query: 1295 LNDSPAH----SNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGAN 1128 LN++ SNM+GK F+NA ++ + N++F +N L WI+DSGAN Sbjct: 340 LNENKLEDSHKSNMSGK-----FYNAFTSLG-SYKNTYCFNSSFFHQN-KLKWIIDSGAN 392 Query: 1127 QHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSL 948 QHM ++ + N VDVS+ +T+ HPNGT A + EIG K++ ++IL DV VVPEY V+L Sbjct: 393 QHMVTNNDNMFNLVDVSEYDITIKHPNGTDAKVKEIGCFKLSEDVILKDVFVVPEYCVNL 452 Query: 947 LSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAIS-NNCTV 771 +SVHKLA+D+KL V FDE CYIQD + K+N+ IG Q+ GLY F N++ I+ N T Sbjct: 453 ISVHKLAKDNKLKVVFDEHNCYIQDVSLKKNLVIGRQTDGLY-FCGNSSISVIACFNKTE 511 Query: 770 SRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIG 591 + LWH RLGHPSDQVLHV NL + CDTCHKAKQ R PFPLS+HKS ++ Sbjct: 512 TIKLWHSRLGHPSDQVLHV----RNLKNESGQAEPCDTCHKAKQHRIPFPLSDHKSKRVD 567 Query: 590 QLVHLDVWGPYKIT 549 LVHLDVWGPYK T Sbjct: 568 DLVHLDVWGPYKTT 581 >ref|XP_021991826.1| uncharacterized protein LOC110888615 [Helianthus annuus] Length = 555 Score = 492 bits (1266), Expect = e-162 Identities = 264/572 (46%), Positives = 359/572 (62%), Gaps = 4/572 (0%) Frame = -2 Query: 2369 TESMNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNH 2190 TE+ SV S KL D LYLH SD+S I+++KL G+ENY +WS AM AL Sbjct: 4 TENQGESSVTLIS--KLDASDPLYLHASDSSSLTIVNIKLKGTENYVVWSNAMKLALTAK 61 Query: 2189 NKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLK 2010 NKLGF++GTC K T + LA+QWD C+SVVLTWILNS+S EL+ G +YS+ A E+W+DLK Sbjct: 62 NKLGFINGTCTKSTKDDVLASQWDRCNSVVLTWILNSVSEELYVGQVYSRLASEVWSDLK 121 Query: 2009 ETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFE 1830 +TYD VDGS VF +++ INS++QNG+S+++YY+ LN++WKQFD+M+ LP+CTCDA + Sbjct: 122 DTYDMVDGSVVFGLYQKINSVNQNGASVSEYYHKLNTMWKQFDAMVQLPSCTCDASTKYN 181 Query: 1829 KHNQLIKLMQFLMGLDDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKP 1650 + +QLIKL+ FLMGLDD Y P+R+N+LT+DPLP+VK+AFSIIS EESHR+ S S P Sbjct: 182 EFSQLIKLVHFLMGLDDIYQPVRTNLLTRDPLPTVKTAFSIISREESHRD--SNKSSKIP 239 Query: 1649 NVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDR 1470 NV A NLKC++CNK GH I++ Sbjct: 240 NVGFVAKETQFNENKKRFNKGPNP--------------------NLKCTHCNKVGHVIEK 279 Query: 1469 CYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLN 1290 C+ + GYP Y + +N + + + S ++S +L+ +Q +L+ LLN Sbjct: 280 CFEIHGYPLNYRNKPNQNNSQWSKANVSANSSVANNDQSANSSLNALTADQFSKLLGLLN 339 Query: 1289 DS----PAHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQH 1122 ++ A SNM+G+CFS S K F N++F +N L WIVDSGANQH Sbjct: 340 ENKLEDSAKSNMSGECFSAFTPLGSYKNTYCF------NSSFFHQN-KLKWIVDSGANQH 392 Query: 1121 MTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLS 942 M +++ + N VDVS+ +T+ HPNGT A + +IG K++ ++IL DV VVPEY V+L+S Sbjct: 393 MVMNNDNMFNLVDVSEYDITIKHPNGTDAKVKQIGCFKLSEDVILKDVFVVPEYCVNLIS 452 Query: 941 VHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRN 762 VHKLA+D+KL V FDE CYIQD + KRN+ IG Q GLY ++ N T + Sbjct: 453 VHKLAKDNKLKVVFDEHNCYIQDVSLKRNLVIGRQMGGLYFCGNSSKPVIACFNKTETIK 512 Query: 761 LWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHI 666 LWH RLGHP DQVLHVLK + DK + T + Sbjct: 513 LWHSRLGHPEDQVLHVLKLKMKQDKLSRVTRV 544 >ref|XP_022023932.1| uncharacterized protein LOC110924205 [Helianthus annuus] Length = 541 Score = 471 bits (1211), Expect = e-154 Identities = 247/544 (45%), Positives = 340/544 (62%), Gaps = 7/544 (1%) Frame = -2 Query: 2324 KLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTA 2145 K+ GD L+LHPSD + I+++KL G+ENY +W+ +M AL+ NK+GF+DG+C++ T Sbjct: 23 KIDAGDPLFLHPSDCANLSIVTIKLKGTENYTVWANSMNLALQVKNKIGFIDGSCRRSTT 82 Query: 2144 NASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIH 1965 + L QWD C+S+VLTWILNS+S EL+ G +YSK A E+W DLKETYDKVDGS VFN++ Sbjct: 83 DEVLGRQWDRCNSIVLTWILNSVSDELYLGHVYSKLASEVWRDLKETYDKVDGSIVFNLY 142 Query: 1964 KNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGL 1785 + I+S +Q+G +++YY+ LN +WKQ D +++LP+CTCDA F N LIKLMQFLMGL Sbjct: 143 QKIDSFTQSGMPVSEYYHKLNCMWKQLDQLLALPSCTCDASKQFNDFNHLIKLMQFLMGL 202 Query: 1784 DDTYVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXX 1605 D TY +R+N+LT++ LP+VK AFSIIS EESH + + I N FA+ Sbjct: 203 DSTYQSVRTNLLTRETLPTVKDAFSIISREESHLHMKIFSERIPNNTVGFAAKTNQSFES 262 Query: 1604 XXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRN 1425 NLKCS+CNK GHTI++C+ LVG P +N Sbjct: 263 KKRGIRPPNP-------------------NLKCSHCNKTGHTIEKCFELVGNPTWMKSKN 303 Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLND----SPAHSNMAGK 1257 N S S S++ SL+ EQ+ +L+SLLND P S AG+ Sbjct: 304 ---NGNKGSRVSNNVITETSDTVSPSSAMSSLTSEQVAQLLSLLNDKSKNDPQSSGFAGR 360 Query: 1256 CFSGTFFNASVKFNLKFE---KHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTV 1086 FN+ V + K K +NF K +GWI+DSGANQHM ++ IN + Sbjct: 361 SDDSMCFNSFVDMSSKTSCDPKPVYCFSNFFKDGNRVGWIIDSGANQHMVMTDVGFINQI 420 Query: 1085 DVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSV 906 DV++ + V HPNGT AL+T+IGD+K+++ +IL+DV +VP+Y V+L+SVHKLA+D KL+V Sbjct: 421 DVTEYNIKVKHPNGTSALVTKIGDIKLSDKVILYDVFLVPDYCVNLVSVHKLAKDCKLTV 480 Query: 905 SFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQ 726 +FDE+ CYIQD TK+ NQ GLY F +++ K + C V NLWH RLGHP++ Sbjct: 481 TFDENNCYIQDSQTKKIQVTDNQLDGLY-FCGSSSVKVCNAKCDV--NLWHARLGHPAEP 537 Query: 725 VLHV 714 VLHV Sbjct: 538 VLHV 541 >emb|CAN71595.1| hypothetical protein VITISV_010143 [Vitis vinifera] Length = 1523 Score = 478 bits (1229), Expect = e-147 Identities = 285/783 (36%), Positives = 417/783 (53%), Gaps = 12/783 (1%) Frame = -2 Query: 2315 FGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANAS 2136 F L+LH SD G ++S L +NY W +M AL NK GFVDGT + T N + Sbjct: 28 FNHPLFLHHSDQPGAVLVSQPLM-EDNYTTWVQSMDMALTIKNKKGFVDGTLNRPTHNPN 86 Query: 2135 LANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNI 1956 QWD C+ +V TW+L ++S E+ I+ K A MW +L+E + + +FNI I Sbjct: 87 EQQQWDRCNILVKTWLLGAISKEISNSVIHCKDAKTMWLELQERFSHTNTVQLFNIENAI 146 Query: 1955 NSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLDDT 1776 + +Q ++ ++ L LW + D++ P CTC + + + K M+FLMGL D Sbjct: 147 HECAQGTGTVTSFFTKLKGLWDEKDALCGFPPCTCATAAEVKTYMETQKTMKFLMGLGDN 206 Query: 1775 YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXX 1596 Y +RSNI+ DPLP+V A+++ E S+G ++ SAF+ Sbjct: 207 YATVRSNIIGMDPLPTVNKAYAMALRHEKQAEASNGKVAVPNEASAFS------VRKLDQ 260 Query: 1595 XXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCY---GLVGYPAGYGKRN 1425 +LKC+ C GHT D C +G G K N Sbjct: 261 DPNTTEREVKCEKCNMTNHSTKNCRAHLKCTYCGGKGHTYDYCRRRKNTMGGGQGRSKVN 320 Query: 1424 FTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSD-EQMVRLMSLLNDSP-AHSN---MAG 1260 + + +N P+S S+ +QM+ L+S + + +HS+ M Sbjct: 321 HAATLNEGKE-------------DVTNFPLSQSECQQMMGLLSKIKTAATSHSDGHQMLE 367 Query: 1259 KCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTS-LGWIVDSGANQHMTLSSKPLINTVD 1083 + +A++ N+ + +G L ++ WI+DSGA+ H+ S L + Sbjct: 368 MLHATKQASANLVGNVPNYEELSGRVFALSRDIKDTMWILDSGASDHIVCDSSFLTSFQP 427 Query: 1082 VSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVS 903 V V P+GT A ++ IG + + +LH+VL VP + ++L+S+ KLA DS Sbjct: 428 VHNR--IVKLPDGTSAHVSHIGTVSFSAQFVLHNVLCVPLFYLNLISISKLAFDSFYVTI 485 Query: 902 FDESKCYIQDCATKRNVGIGNQSSGLYLFDV--NTNCKAISNNCTVSRNLWHQRLGHPSD 729 F C+IQD + + +G+G +S GLY ++ C ++ T +++LWHQRLGHPS Sbjct: 486 FLRQVCFIQDLQSGKMIGMGTESEGLYCLNLPRKGTCNVVN---TKTQDLWHQRLGHPSS 542 Query: 728 QVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKIT 549 +V VL P L +KT D + C C AK TR PFPLS S L+H+D+WG Y + Sbjct: 543 KV-SVLFPFLQ-NKTLDVS-TCSICPLAKHTRTPFPLSVSSSDSCFDLIHVDIWGGYHVP 599 Query: 548 SKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEF 369 S G +YFLTIVDD+SR+ WVY++ +K + +V+F +++NQF S +KI RSDNG EF Sbjct: 600 SLSGAQYFLTIVDDHSRSTWVYLMHHKSEARSLLVHFVNLVANQFGSQVKIVRSDNGPEF 659 Query: 368 VNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVL 189 K QF+ +GI HQTSC+ TPQQNG+ ERKHRHLLNVAR+L+FQ LP W + +L Sbjct: 660 ---KHTQFYSSRGILHQTSCINTPQQNGVVERKHRHLLNVARALLFQSHLPKPFWGDAIL 716 Query: 188 TAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCY-ATILNNSDKFYSRSEKSV 12 TAAYLINR P+ +L GK+PF ++ P SH+RVFGC C+ +T KF RS +SV Sbjct: 717 TAAYLINRTPTPLLQGKTPFEKLFHKSPNYSHLRVFGCRCFVSTHPLRPSKFDPRSIESV 776 Query: 11 LIG 3 IG Sbjct: 777 FIG 779 >gb|PNX93622.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1454 Score = 474 bits (1219), Expect = e-146 Identities = 273/791 (34%), Positives = 406/791 (51%), Gaps = 26/791 (3%) Frame = -2 Query: 2297 LHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWD 2118 L+ +D GN I V+L G NY W+ AM +LR K GF++GT KK + W Sbjct: 32 LNSNDNPGNLITQVQLRGENNYDEWTRAMKTSLRARRKWGFIEGTVKKPDEGTAEIEDWW 91 Query: 2117 MCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQN 1938 S++++WILN++ P L + Y + A ++W D+KE + +G + + ++ + Q Sbjct: 92 TVQSMLVSWILNTVEPNLRSTMTYMENARDLWEDIKERFSVANGPKIHQLKADLVACKQA 151 Query: 1937 GSSLADYYNNLNSLWKQFDSMISLPTCTCDA-----GIHFEKHNQLIKLMQFLMGLDDT- 1776 G ++A YY L LW + + +P C+C+ EK + ++ QFLMGLDD Sbjct: 152 GMTIAAYYGKLKLLWDELANYEQVPVCSCEGCSCRITTKLEKRREEERVHQFLMGLDDVV 211 Query: 1775 YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXX 1596 Y RSN+L DPLP++ +S++ EE R + + +V A Sbjct: 212 YGTARSNLLASDPLPNLNRIYSVMIQEERVRTIARNKEE-RGDVMGLAVQIGGKNRGRDE 270 Query: 1595 XXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKR---- 1428 KC+NCN+ GH C+ L+GYP +G R Sbjct: 271 FKD-------------------------KCTNCNRDGHVAANCFQLIGYPDWWGDRPRGE 305 Query: 1427 ------NFTSNVKPXXXXXXXXXXXXSLECSTSNSP--------VSLSDEQMVRLMSLLN 1290 + N + + ++S ++ +Q +LM +LN Sbjct: 306 GKSGTRGRSQNRGAGRGKGAAIVRANAAQAGGNSSAREAESHGFPGITSDQWQKLMEILN 365 Query: 1289 DSP-AHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTL 1113 P M GK S WI+DSGA+ HMT Sbjct: 366 IQPDTAERMTGKSQSNE------------------------------WILDSGASNHMTG 395 Query: 1112 SSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHK 933 + + + D+ +G P+G A T+ G + ++ + L++VL VP +L+S+ + Sbjct: 396 TLEIMRELHDIQTC--PIGLPDGKNASATKEGVVLLDEGLKLYNVLYVPNLKCNLISLSQ 453 Query: 932 LARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWH 753 L D V F + C +QD ++ +G G + GLY F +A S LWH Sbjct: 454 LMDDLDCIVHFSDKLCVMQDRTSRMLIGAGKRRDGLYYFRTIQRVQACSVVGVNQLELWH 513 Query: 752 QRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLD 573 +RLGHPS +V ++ + + + CD C +AKQTRE F LSEH + +L+H D Sbjct: 514 RRLGHPSLKVTRLVSGTSKNNDHVELNKNCDVCLRAKQTREKFSLSEHVANDAFELIHCD 573 Query: 572 VWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIF 393 +WGPY+ S G YF+TIVDDYSRAVW+Y++ +K +V +++NF ++ QF+ +KIF Sbjct: 574 LWGPYRTASSCGAFYFVTIVDDYSRAVWIYLIGDKREVSQTLINFFTLIKRQFDKQVKIF 633 Query: 392 RSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPL 213 RSDNGTEFV M+++FH+ GI QTSCV TPQQNG ERKHRH+LNVAR+L FQ LP+ Sbjct: 634 RSDNGTEFV--CMKRYFHENGIIFQTSCVGTPQQNGRVERKHRHILNVARALRFQSNLPI 691 Query: 212 NLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILN-NSDKF 36 + W EC+L A YL+NR PS++L+GK+P+ +++G P+L HIRVFGCLCYA LN DKF Sbjct: 692 DFWGECILAAGYLLNRTPSAILNGKTPYEMLHGQAPSLEHIRVFGCLCYAHNLNRKGDKF 751 Query: 35 YSRSEKSVLIG 3 S+S K + IG Sbjct: 752 ASKSRKCIFIG 762 >gb|OMO88216.1| Integrase, catalytic core [Corchorus capsularis] Length = 1609 Score = 476 bits (1224), Expect = e-145 Identities = 272/771 (35%), Positives = 400/771 (51%), Gaps = 6/771 (0%) Frame = -2 Query: 2297 LHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWD 2118 LH SD G +++ L ENY W AMT AL+ +K GFVDG+ + + + + W Sbjct: 343 LHASDNPGTTLVTCLLK-EENYPTWRRAMTNALQAKSKFGFVDGSVPRPSLGSQEESSWV 401 Query: 2117 MCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQN 1938 C+S+V++WI N+L P L Y TA EMWNDL+E + + + + + + + + Q Sbjct: 402 KCNSMVISWIFNALHPTLHDSVAYCVTAQEMWNDLEERFSQGNAARIHQLKTEMVNTLQQ 461 Query: 1937 GSSLADYYNNLNSLWKQFDSMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLDDTYVPIRS 1758 G S++ YY L +W + + +P CTC + + K+ QFLMGL++ Y + S Sbjct: 462 GMSVSAYYTKLKGIWDELGTYSHIPPCTCGSAKGLAAEREKEKVHQFLMGLNEKYNVVHS 521 Query: 1757 NILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXX 1578 IL DPL S+ A+++++ EE + ++ S P+V A Sbjct: 522 QILNTDPLHSLSRAYALVAQEERQQLVAA---SRLPSVEGAAFMTNNANKSNFNRKPASN 578 Query: 1577 XXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXX 1398 L C +C K HT D C+ L+GYP + K KP Sbjct: 579 RDLS----------------KLFCEHCKKTRHTKDSCFELLGYPEWWDK-----GKKPSK 617 Query: 1397 XXXXXXXXXXSLECSTSNSPVS-LSDEQMVRLMSLLNDSPAH---SNMAGKCFSGTFFNA 1230 +N P++ L+ EQ +L+S+LN +N AGK S Sbjct: 618 TKAANTAQHMETASGNNNVPINGLTSEQYAQLISMLNLDKIQIPTANFAGKATS------ 671 Query: 1229 SVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHP 1050 NT++ WI+DSGA+ HMT + + V + P Sbjct: 672 -------------------LSNTAIEWILDSGASDHMTCHKSAITSHKTVPHFS-PIKIP 711 Query: 1049 NGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDC 870 +G+ GD+ +N+ + L+DVL +P ++ +L+S+ KL + F + C +QD Sbjct: 712 DGSFVPAKSCGDVPLNSLVTLNDVLYIPSFSCNLISISKLTQALNCVAHFFPTFCTLQDL 771 Query: 869 ATKRNVGIGNQSSGLYLFD-VNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNL 693 AT++ +G+G GLY F + A S + LWH+RLGH S L +L L Sbjct: 772 ATRKLIGMGELRDGLYYFQAIKVPIAATSISRDSQLILWHRRLGHLSFDRLSLLN-DLGP 830 Query: 692 DKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIV 513 CD+CH+AKQTR PFP+S K+ + +L+H DVWGPY S YFL+IV Sbjct: 831 FPVKSFNKCCDSCHRAKQTRPPFPISSIKTHEAFELIHCDVWGPYHTPSLSNAHYFLSIV 890 Query: 512 DDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKK 333 DD+SR WVY+LK K +VY +++F M++ QF +K RSDNGTEF N Q F + Sbjct: 891 DDFSRTSWVYLLKTKTEVYTWLLSFIAMVAKQFGKAVKQIRSDNGTEFTNQNFQLFCQQN 950 Query: 332 GIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSS 153 GI Q SCV TPQQNG+ ERKHRH+L VAR+L FQ LP+ W ECVLTA YLIN +P+ Sbjct: 951 GILTQFSCVSTPQQNGVVERKHRHILEVARALRFQANLPIKFWGECVLTATYLINYVPTP 1010 Query: 152 VLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILNNS-DKFYSRSEKSVLIG 3 +LSGKSP+ +++ P+ SH+RVFGCLCY +++ S DKF++R+ + +G Sbjct: 1011 LLSGKSPYEVLFSRKPSYSHLRVFGCLCYTSVIPRSRDKFHARATACLFLG 1061 >gb|PNX92904.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1457 Score = 472 bits (1214), Expect = e-145 Identities = 281/795 (35%), Positives = 405/795 (50%), Gaps = 29/795 (3%) Frame = -2 Query: 2300 YLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQW 2121 YLHPSD G I ++L G +NY W+ A+ ALR KL F+DGT + +++ W Sbjct: 33 YLHPSDNPGMIITPIQLKG-DNYDEWAKAIRNALRAKKKLAFIDGTLTEPKEDSADLEDW 91 Query: 2120 DMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQ 1941 S+++ WILN++ P L + Y + ++W D+++ + +G ++ + ++ + Q Sbjct: 92 WAVSSMLVAWILNTIEPGLRSTITYMENVKDLWEDIRQRFSIGNGPRIYQLKADLAACKQ 151 Query: 1940 NGSSLADYYNNLNSLWKQFDSMISLPTCTC-----DAGIHFEKHNQLIKLMQFLMGLDDT 1776 G ++A+YY + +W + S PTC C + K + K+ QFLMGLDD Sbjct: 152 MGKTVAEYYGKIKVMWDELASYEPAPTCKCGGCKCNISKDLVKKREEEKVYQFLMGLDDV 211 Query: 1775 -YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXX 1599 Y +RSNIL+ DPLP++ ++I EE HR+ + G V Sbjct: 212 VYGTVRSNILSMDPLPNLSRVYAIAVQEERHRDIARGKEERSDAVGFTMQVGAGARAAVV 271 Query: 1598 XXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFT 1419 + C++C K GH + C+ + GYP +G R + Sbjct: 272 RTKEK----------------------GMNCNHCGKTGHDVKGCFEVNGYPEWWGDRPRS 309 Query: 1418 S--------------------NVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMS 1299 + +V+ + P LS EQ L++ Sbjct: 310 TGRHGNRGRGNAGSTGRGRGQSVRANATLVGGVEKQHGAGDEHAGIP-GLSGEQWTTLIN 368 Query: 1298 LLNDSPAHS--NMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQ 1125 LLN A S ++GK N N WI+DSGA+ Sbjct: 369 LLNTHKAGSVDRLSGK-----------------------NNN---------WIIDSGASH 396 Query: 1124 HMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLL 945 HMT + L +++ VG PNG Q + G + + NI L +VL VP +L+ Sbjct: 397 HMTGVIELLSEARNITPR--PVGLPNGKQTDAVKEGTLCLGENIYLQNVLYVPNMNCTLI 454 Query: 944 SVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSR 765 SV KL +D + V+F E+ C +QD ++ +G+G + G+++F A R Sbjct: 455 SVSKLVQDLRCIVTFTENLCVMQDRTSRTLIGVGEECDGIFIFRRAAPMHANKAKVMDVR 514 Query: 764 NLWHQRLGHPSDQVLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQL 585 LWHQRLGHPS QVL L +++ D +D C+TC+KAKQTR+ F S +K L Sbjct: 515 RLWHQRLGHPSKQVLSYLPETISSDLGSDLVDFCETCYKAKQTRDVFQESNNKVDDCFSL 574 Query: 584 VHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESN 405 +H D+WGPYK+ + G YFLTIVDD+SRA+WVY+L K +V +I NF M QFE Sbjct: 575 IHCDLWGPYKVPASCGAFYFLTIVDDFSRAIWVYLLLEKKEVSQTIKNFCAMTERQFEKP 634 Query: 404 IKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQG 225 +KI RSDNGTEF ++ +F KGI HQTSCV TPQQNG ERKHRH+LNVAR+L FQ Sbjct: 635 VKIVRSDNGTEF--TCLKSYFEVKGILHQTSCVGTPQQNGRVERKHRHILNVARALRFQA 692 Query: 224 ELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATIL-NN 48 LP+ W ECVLTA+YLINR PSS+L GKSP+ +++ P + ++VFGCLCY + Sbjct: 693 NLPIQFWGECVLTASYLINRTPSSLLRGKSPYEVLFKKKPIYNQLKVFGCLCYVHHRGRD 752 Query: 47 SDKFYSRSEKSVLIG 3 DKF RS+K V +G Sbjct: 753 KDKFSERSKKCVFLG 767 >gb|KYP42518.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 769 Score = 451 bits (1161), Expect = e-144 Identities = 260/751 (34%), Positives = 390/751 (51%), Gaps = 5/751 (0%) Frame = -2 Query: 2240 ENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELF 2061 +NY+ W+ +M ALR KLGF+D + KK T+ + W+ DS+V+ WI+NS P L Sbjct: 5 DNYRNWARSMRTALRAKTKLGFIDRSIKKPTSTSPDYQHWERADSMVVAWIINSTDPILH 64 Query: 2060 AGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFD 1881 ++ TA ++W DL++ + N+ ++ +YY S+ + Sbjct: 65 GSISHAMTAKDIWLDLRKL-------CLMQQETNV--------TVTEYYTKFKSIIDELR 109 Query: 1880 SMISLPTCTCDAGIHFEKHNQLIKLMQFLMGLD-DTYVPIRSNILTKDPLPSVKSAFSII 1704 + LP CTC A + + + ++ FL GLD D + + IL DPLPS+ F+ + Sbjct: 110 ELQPLPECTCGAAKNLAQREEEHRVHLFLGGLDSDRFAHAKGIILNTDPLPSLLRVFNHV 169 Query: 1703 SGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1524 EE+ G + +AF S Sbjct: 170 LREETRVLTEKGKDHKIESGTAFHSSTFNKKKNRDGPKP--------------------- 208 Query: 1523 NLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSN 1344 +C +C K GH +C+ +VGYPA + R T N ST Sbjct: 209 ----RCDHCGKIGHDKTKCFEIVGYPANWNPRRNTRN-------------------STKR 245 Query: 1343 SPVSLSDEQMVRLMSLLNDSPAHSNMAGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKN 1164 + S +N+A + GT +A H + N +++ N Sbjct: 246 TEHS-----------------GGANLAWENNQGTDGHALSGSQDSGGSHGSKN-DYMSGN 287 Query: 1163 TSLG--WIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINNNII 990 + W++DSGA+ HMT L D S + L + P G L+ + G MK+N NI Sbjct: 288 QMINDVWVLDSGASHHMTSLYSQLDEVQDFS-IPLRITVPIGDVVLVHKKGTMKLNENIK 346 Query: 989 LHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYLFDV 810 L++VL +PE+ +L+S+HKL D V++ +C IQD KR +G G G+Y+F Sbjct: 347 LYNVLFIPEFRCNLISIHKLTHDLNCVVTYSVDECVIQDQTRKRMIGFGRLCDGIYIFTQ 406 Query: 809 NTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLD-KTTDSTHICDTCHKAKQTR 633 ++ + LWH R+GHPSDQVL L ++ + CD CH++KQ R Sbjct: 407 QVGGYSLVASSGDITTLWHARMGHPSDQVLSKLSTIISFSFNANNKMECCDICHRSKQCR 466 Query: 632 EPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGDVYD 453 PF L+ +K +K+ L+H D+WG Y S +G YFLTIVDD++RAVW+Y+LK+K + + Sbjct: 467 LPFSLNYNKVSKVFDLIHCDLWGKYHTASHNGSHYFLTIVDDFTRAVWIYLLKDKTETTN 526 Query: 452 SIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGIAER 273 I+N+ +M+ QF++ +K+ RSDNGT+FVN+K+ FF + GI HQTSCV +PQQNG ER Sbjct: 527 VIINYYRMVQTQFDTKVKVVRSDNGTKFVNSKIHSFFQEVGILHQTSCVSSPQQNGRVER 586 Query: 272 KHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPTLSH 93 KHRH+LNVAR+L FQ LPL W ECVLTA +LINR P+ G +P+ ++YG P+ +H Sbjct: 587 KHRHILNVARALRFQANLPLTFWGECVLTAIHLINRTPTVANQGLTPYEMLYGKQPSYAH 646 Query: 92 IRVFGCLCYA-TILNNSDKFYSRSEKSVLIG 3 IRVFGCLCYA T+ +DKF +++++ + IG Sbjct: 647 IRVFGCLCYAKTLTKKTDKFEAQADRCIFIG 677 >ref|XP_017415202.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform X1 [Vigna angularis] Length = 1472 Score = 469 bits (1207), Expect = e-144 Identities = 278/814 (34%), Positives = 417/814 (51%), Gaps = 28/814 (3%) Frame = -2 Query: 2360 MNNESVINSSELKLVFGDTLYLHPSDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKL 2181 M E + SE+ L SD GN I V+L G ENY+ W+ A+ +LR K Sbjct: 21 MAKEGEKSESEVVKKMSSPYDLSASDNPGNVITQVQLKG-ENYEEWAKAVKISLRARRKW 79 Query: 2180 GFVDGTCKKDTANASLANQWDMCDSVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETY 2001 GF+DGT + + S W S++++WILN++ P L + Y + A ++W+D+KE + Sbjct: 80 GFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNTIEPNLRSTIAYMENAKDLWDDIKERF 139 Query: 2000 DKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNSLWKQFDSMISLPTCTC-----DAGIH 1836 V+G + + + Q G ++ YY L LW + + +P C C + Sbjct: 140 SIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKILWDELANYEQIPQCKCGGCKCNIATK 199 Query: 1835 FEKHNQLIKLMQFLMGLDDT-YVPIRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGS 1659 EK + ++ QFLMGLDD Y RSN+L DPLPS+ ++ + EE R + Sbjct: 200 LEKRREEERVHQFLMGLDDEGYGTTRSNVLATDPLPSLNRVYATMVQEERVRMITRSKEE 259 Query: 1658 IKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHT 1479 V ++ C++C + GH Sbjct: 260 RGMIVGMVVQTETKGKLRNEVKEK-----------------------SIVCTHCGRTGHD 296 Query: 1478 IDRCYGLVGYPAGYGKRNFTSN-----------------VKPXXXXXXXXXXXXSLECST 1350 C+ ++GYP +G+R N V P + T Sbjct: 297 KRNCFEIIGYPDWWGERPRNENKSGGRHQQRTTFFRGKGVTPRVNIAHTSTSSSDSKSDT 356 Query: 1349 SNSPVS-LSDEQMVRLMSLLNDSPAHSN--MAGKCFSGTFFNASVKFNLKFEKHFNGNTN 1179 V+ LS+EQ L ++LN A++ M GK Sbjct: 357 KKPEVAGLSNEQWEILATMLNSHKANTTEKMTGK-------------------------- 390 Query: 1178 FLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVSKLGLTVGHPNGTQALITEIGDMKINN 999 K+ L WIVDSGA+ HMT + L + + G VG PNG L + G + ++ Sbjct: 391 ---KSRDL-WIVDSGASNHMTGTLDNLWESRTLE--GCPVGLPNGELVLADKEGSVFLDG 444 Query: 998 NIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFDESKCYIQDCATKRNVGIGNQSSGLYL 819 + L +VL VP+ +L+SV +L ++K +V F + C +QD + +G G + GLY Sbjct: 445 GLKLENVLYVPKLNCNLISVSQLIDEAKCTVHFTDKFCAMQDHTLRMLIGAGERKDGLYW 504 Query: 818 FDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLHVLKPSLNLDKTTDST-HICDTCHKAK 642 + ++ KA N +WH+R+GHP+ Q++ + P++ + + +T +C+ C K+K Sbjct: 505 YRGVSDVKAHHINTESQLEIWHKRMGHPAYQIVEKI-PNMTITRGDKNTSRVCEVCEKSK 563 Query: 641 QTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKDGYKYFLTIVDDYSRAVWVYMLKNKGD 462 Q+R FPLS+ +++ + L+H D+WGPY+ S G YFLT+VDD SRAVW+Y+L +K Sbjct: 564 QSRNKFPLSDSQASNVFDLIHCDLWGPYRTLSSCGASYFLTLVDDCSRAVWIYLLNSKKG 623 Query: 461 VYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNNKMQQFFHKKGIFHQTSCVYTPQQNGI 282 V +++NF ++ Q++ +K+ RSDNGTEF+ +QQ+F GI HQTSC TPQQNG Sbjct: 624 VSQTLMNFITLIERQYKKQVKMIRSDNGTEFM--CLQQYFQLHGILHQTSCTGTPQQNGR 681 Query: 281 AERKHRHLLNVARSLMFQGELPLNLWPECVLTAAYLINRLPSSVLSGKSPFSLVYGHDPT 102 ERKH+H+LNVAR+L FQG+LPL W EC+LTA YLINR PS++L GK+P+ +V G+ PT Sbjct: 682 VERKHQHILNVARALRFQGQLPLKFWGECILTAGYLINRTPSTILQGKTPYEIVNGNPPT 741 Query: 101 LSHIRVFGCLCYATILN-NSDKFYSRSEKSVLIG 3 H+RVFGCLCYA + N DKF SRS KSV +G Sbjct: 742 YDHLRVFGCLCYAHNQDRNGDKFASRSRKSVFVG 775 >gb|PNX93517.1| retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense] Length = 1465 Score = 468 bits (1204), Expect = e-143 Identities = 279/782 (35%), Positives = 407/782 (52%), Gaps = 20/782 (2%) Frame = -2 Query: 2288 SDTSGNPIISVKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCD 2109 +D G+ I V+L G ENY W+ ++ ALR K GFVDGT + +S W + Sbjct: 38 NDNPGSLITHVQLKG-ENYDEWASSIRTALRARKKFGFVDGTIGRPGEESSDLEDWWTNN 96 Query: 2108 SVVLTWILNSLSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSS 1929 S++++WI+N++ P L + + + A ++WND+KE + +G + + + Q G + Sbjct: 97 SLLVSWIMNTIEPSLRSTMSHMEVAMDLWNDIKERFSIANGPRIQQLKAELVECKQKGLT 156 Query: 1928 LADYYNNLNSLWKQ---FDSMISLPT--CTCDAGIHFEKHNQLIKLMQFLMGLDDT-YVP 1767 + YY L LW++ +D +++ CTC+ G K + K+ QFLMGLDDT Y Sbjct: 157 IVTYYGKLKKLWEELVNYDQILTCKCGLCTCNLGNQITKKREEEKIHQFLMGLDDTLYGT 216 Query: 1766 IRSNILTKDPLPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXX 1587 +RSN+L +DPLP++ ++ + EE R + T + AFA Sbjct: 217 VRSNLLAQDPLPTLNKVYATLVQEERLRMVTRVTEE-RGEAMAFAVHSKFKNKEKEE--- 272 Query: 1586 XXXXXXXXXXXXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYP------------- 1446 CS+CN+ GH + C+ L+GYP Sbjct: 273 -------------------------SCSHCNQVGHNSEGCFQLIGYPEWWGDRRRRPMKG 307 Query: 1445 AGYGKRNFTSNVKPXXXXXXXXXXXXSLECSTSNSPVSLSDEQMVRLMSLLNDSPAHSNM 1266 +G GK ++N + S + L+ +Q+ L SLLN+ S Sbjct: 308 SGRGKPEQSNNRNRGGTAKAHVAQAKEITAEVSAADFGLTSDQLQTLSSLLNNVKLGS-- 365 Query: 1265 AGKCFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTV 1086 EK NG +FL WI+D+GA+ HMT + L N Sbjct: 366 -------------------IEK-LNGKCSFLP------WIIDTGASHHMTGQLECLTNIR 399 Query: 1085 DVSKLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSV 906 ++ + ++G PNG + + T+ G++ +N + L +VL VP +L+SV +L ++S V Sbjct: 400 NIFEC--SIGLPNGEETVATKEGNVVLNERLQLKNVLYVPSLQCNLISVSQLLKNSNYVV 457 Query: 905 SFDESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQ 726 F + C +QD + +G G Q GLY A+ N VS +L HQRLGH S + Sbjct: 458 QFTDKFCLVQDPTLRTPIGAGEQREGLYYLRGMVKAAAMKTNKEVSFDLLHQRLGHASLK 517 Query: 725 VLHVLKPSLNLDKTTDSTHICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITS 546 VL +L K T C+ C +AKQ+R+ FP+SE+K+ L+H D+WGPY+ + Sbjct: 518 VLQMLPNVRPSSKNNSCTQTCEICLRAKQSRDNFPVSENKAATPFHLIHCDLWGPYRNAT 577 Query: 545 KDGYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFV 366 G KYFLTIVDD+SRAVW+Y+L +K +V + F M+ QF + +KI RSDNGTEF Sbjct: 578 FCGAKYFLTIVDDFSRAVWIYLLIDKTEVSKHLYQFLAMVERQFSAQVKIIRSDNGTEF- 636 Query: 365 NNKMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLT 186 M+Q F GI H+TSCV TPQQNG ERKHRH+LNVAR+L FQ +LP+ W EC L Sbjct: 637 -TCMKQNFRDCGIIHETSCVGTPQQNGRVERKHRHILNVARALRFQAQLPIEFWGECALA 695 Query: 185 AAYLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYA-TILNNSDKFYSRSEKSVL 9 A YLINR P+ LSGK+P+ L+YG P+L H+RV GCL YA + DKF +RS K V Sbjct: 696 ACYLINRTPTKTLSGKTPYELLYGKAPSLEHLRVVGCLAYAHNQHHKGDKFATRSRKCVF 755 Query: 8 IG 3 +G Sbjct: 756 VG 757 >ref|XP_017415203.1| PREDICTED: retrovirus-related Pol polyprotein from transposon TNT 1-94 isoform X2 [Vigna angularis] Length = 1435 Score = 459 bits (1180), Expect = e-140 Identities = 268/780 (34%), Positives = 405/780 (51%), Gaps = 28/780 (3%) Frame = -2 Query: 2258 VKLTGSENYKIWSIAMTFALRNHNKLGFVDGTCKKDTANASLANQWDMCDSVVLTWILNS 2079 V+L G ENY+ W+ A+ +LR K GF+DGT + + S W S++++WILN+ Sbjct: 18 VQLKG-ENYEEWAKAVKISLRARRKWGFIDGTHTEPETDTSKIEDWWTIQSMLVSWILNT 76 Query: 2078 LSPELFAGAIYSKTAYEMWNDLKETYDKVDGSAVFNIHKNINSLSQNGSSLADYYNNLNS 1899 + P L + Y + A ++W+D+KE + V+G + + + Q G ++ YY L Sbjct: 77 IEPNLRSTIAYMENAKDLWDDIKERFSIVNGPRIQQLKSKLAECKQQGMTMVAYYGKLKI 136 Query: 1898 LWKQFDSMISLPTCTC-----DAGIHFEKHNQLIKLMQFLMGLDDT-YVPIRSNILTKDP 1737 LW + + +P C C + EK + ++ QFLMGLDD Y RSN+L DP Sbjct: 137 LWDELANYEQIPQCKCGGCKCNIATKLEKRREEERVHQFLMGLDDEGYGTTRSNVLATDP 196 Query: 1736 LPSVKSAFSIISGEESHRNFSSGTGSIKPNVSAFASXXXXXXXXXXXXXXXXXXXXXXXX 1557 LPS+ ++ + EE R + V Sbjct: 197 LPSLNRVYATMVQEERVRMITRSKEERGMIVGMVVQTETKGKLRNEVKEK---------- 246 Query: 1556 XXXXXXXXXXXNLNLKCSNCNKPGHTIDRCYGLVGYPAGYGKRNFTSN------------ 1413 ++ C++C + GH C+ ++GYP +G+R N Sbjct: 247 -------------SIVCTHCGRTGHDKRNCFEIIGYPDWWGERPRNENKSGGRHQQRTTF 293 Query: 1412 -----VKPXXXXXXXXXXXXSLECSTSNSPVS-LSDEQMVRLMSLLNDSPAHSN--MAGK 1257 V P + T V+ LS+EQ L ++LN A++ M GK Sbjct: 294 FRGKGVTPRVNIAHTSTSSSDSKSDTKKPEVAGLSNEQWEILATMLNSHKANTTEKMTGK 353 Query: 1256 CFSGTFFNASVKFNLKFEKHFNGNTNFLKKNTSLGWIVDSGANQHMTLSSKPLINTVDVS 1077 K+ L WIVDSGA+ HMT + L + + Sbjct: 354 -----------------------------KSRDL-WIVDSGASNHMTGTLDNLWESRTLE 383 Query: 1076 KLGLTVGHPNGTQALITEIGDMKINNNIILHDVLVVPEYTVSLLSVHKLARDSKLSVSFD 897 G VG PNG L + G + ++ + L +VL VP+ +L+SV +L ++K +V F Sbjct: 384 --GCPVGLPNGELVLADKEGSVFLDGGLKLENVLYVPKLNCNLISVSQLIDEAKCTVHFT 441 Query: 896 ESKCYIQDCATKRNVGIGNQSSGLYLFDVNTNCKAISNNCTVSRNLWHQRLGHPSDQVLH 717 + C +QD + +G G + GLY + ++ KA N +WH+R+GHP+ Q++ Sbjct: 442 DKFCAMQDHTLRMLIGAGERKDGLYWYRGVSDVKAHHINTESQLEIWHKRMGHPAYQIVE 501 Query: 716 VLKPSLNLDKTTDST-HICDTCHKAKQTREPFPLSEHKSTKIGQLVHLDVWGPYKITSKD 540 + P++ + + +T +C+ C K+KQ+R FPLS+ +++ + L+H D+WGPY+ S Sbjct: 502 KI-PNMTITRGDKNTSRVCEVCEKSKQSRNKFPLSDSQASNVFDLIHCDLWGPYRTLSSC 560 Query: 539 GYKYFLTIVDDYSRAVWVYMLKNKGDVYDSIVNFTQMLSNQFESNIKIFRSDNGTEFVNN 360 G YFLT+VDD SRAVW+Y+L +K V +++NF ++ Q++ +K+ RSDNGTEF+ Sbjct: 561 GASYFLTLVDDCSRAVWIYLLNSKKGVSQTLMNFITLIERQYKKQVKMIRSDNGTEFM-- 618 Query: 359 KMQQFFHKKGIFHQTSCVYTPQQNGIAERKHRHLLNVARSLMFQGELPLNLWPECVLTAA 180 +QQ+F GI HQTSC TPQQNG ERKH+H+LNVAR+L FQG+LPL W EC+LTA Sbjct: 619 CLQQYFQLHGILHQTSCTGTPQQNGRVERKHQHILNVARALRFQGQLPLKFWGECILTAG 678 Query: 179 YLINRLPSSVLSGKSPFSLVYGHDPTLSHIRVFGCLCYATILN-NSDKFYSRSEKSVLIG 3 YLINR PS++L GK+P+ +V G+ PT H+RVFGCLCYA + N DKF SRS KSV +G Sbjct: 679 YLINRTPSTILQGKTPYEIVNGNPPTYDHLRVFGCLCYAHNQDRNGDKFASRSRKSVFVG 738