BLASTX nr result

ID: Astragalus22_contig00038115 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00038115
         (843 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP76185.1| Putative ribonuclease H protein At1g65750 [Cajanu...   254   2e-73
dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subt...   252   1e-72
gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family,...   248   4e-71
gb|KYP72147.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]   239   7e-71
gb|KYP72596.1| Putative ribonuclease H protein At1g65750 family ...   234   6e-69
ref|XP_020210568.1| uncharacterized protein LOC109795461 [Cajanu...   239   4e-68
gb|KYP45885.1| Putative ribonuclease H protein At1g65750 family ...   237   2e-67
gb|KYP65965.1| Putative ribonuclease H protein At1g65750 family ...   233   3e-66
gb|KYP61054.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...   227   3e-66
gb|KYP48048.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]   219   7e-66
gb|KYP70239.1| Putative ribonuclease H protein At1g65750 family ...   231   1e-65
gb|KYP74374.1| Putative ribonuclease H protein At1g65750 family ...   226   2e-63
ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachi...   225   3e-63
gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family,...   223   9e-63
gb|KYP33748.1| Putative ribonuclease H protein At1g65750 family ...   219   4e-61
gb|KYP57513.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...   210   9e-61
gb|KYP69874.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...   212   6e-59
gb|KYP49443.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanu...   199   2e-56
ref|XP_016168765.1| uncharacterized protein LOC107611342 [Arachi...   202   2e-55
ref|XP_016206284.1| uncharacterized protein LOC107646622 [Arachi...   200   1e-54

>gb|KYP76185.1| Putative ribonuclease H protein At1g65750 [Cajanus cajan]
          Length = 1354

 Score =  254 bits (649), Expect = 2e-73
 Identities = 127/279 (45%), Positives = 174/279 (62%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+WNF+RDTL D+G P  +  L+WHCISSP +Q+LWNGEAL  F  
Sbjct: 579  WMAIKIDLEKAYDRLNWNFVRDTLVDIGLPQKLIELIWHCISSPSMQVLWNGEALEEFVP 638

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +  +  L   +    K P             F 
Sbjct: 639  SRGIRQGDPISPYIFVLCMERLFHLIKIAEDHHLWKPIKLSKKGPPLSHLAFADDLILFS 698

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 QA  IK  L  FC S   KVS +KTRIF+S N+ + ++ +I   LGFQ T DLGK
Sbjct: 699  EASLDQAEIIKACLDNFCHSSGMKVSTEKTRIFFSKNIGWSVKNEISSSLGFQRTDDLGK 758

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            Y+GI + H+ V+K S + ++D + +RLS+WK ++LS A RLTLTKSVL A+PSY +Q+ +
Sbjct: 759  YIGIKLHHERVSKRSLQSVMDHIKRRLSSWKTKTLSFAGRLTLTKSVLAAIPSYTMQTVL 818

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +PK +C DIDK CRSFIWG+++G+R+ H ++W  +C PK
Sbjct: 819  LPKQLCYDIDKSCRSFIWGQDSGKRRVHALAWETLCKPK 857


>dbj|GAU26239.1| hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score =  252 bits (643), Expect = 1e-72
 Identities = 137/288 (47%), Positives = 183/288 (63%), Gaps = 8/288 (2%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+W F+++TLED+G P  + NL+W CIS+  +++LWNGEAL  F  
Sbjct: 454  WMAIKIDLEKAYDRLNWEFVKETLEDIGVPRRMVNLIWSCISTSKMRVLWNGEALEEFSP 513

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*S---- 494
               + +  P L  YLF          I +  Q + L +  +   P  L+    + S    
Sbjct: 514  SRGIRQGDP-LSPYLFVL-------CIERLFQSINLAVDQNKLSPIKLSRGGPKISHLAY 565

Query: 493  ----FAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGF 326
                  F      QA+ IK +L TFC S  QKVS +KT+IF+S NV +H+R ++ E  GF
Sbjct: 566  ADDLLLFGEATVSQAQNIKVILDTFCISSGQKVSPEKTKIFFSKNVGWHVRQEVSERCGF 625

Query: 325  QSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALP 146
              T +LGKYLG+ I H   ++ +F+FI+DKV QRLS WKA++LS A R+TL KSV+QALP
Sbjct: 626  GWTDNLGKYLGVPILHNKASRATFQFIMDKVGQRLSNWKAKNLSFAGRVTLAKSVIQALP 685

Query: 145  SYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2
             Y +QS ++PK +CD+IDK CRSFIWG+    RK H ISW K+C PKK
Sbjct: 686  VYTMQSTLLPKSICDEIDKKCRSFIWGDTEESRKIHLISWDKICSPKK 733


>gb|KYP40438.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
            cajan]
          Length = 1356

 Score =  248 bits (632), Expect = 4e-71
 Identities = 134/280 (47%), Positives = 173/280 (61%), Gaps = 1/280 (0%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+WNFIRDTL D+G P N   LVW CIS+P  ++LWNGEAL  F  
Sbjct: 565  WMAIKIDLEKAYDRLNWNFIRDTLTDIGLPQNFVELVWACISTPSSRVLWNGEALQEFHP 624

Query: 661  LEELGRVIPCLLIYLFY-AWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAF 485
               + +  P L  YLF    +  F  + +   Q+L   +    + P             F
Sbjct: 625  SRGIRQGDP-LSPYLFVLCMERLFHIIEVAVAQKLWKPICLSKQGPPLSHLAFADDLILF 683

Query: 484  CRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLG 305
                  Q   IK  L  FC S  QKVSL+KTRIF+S NV + +R +I   LGFQ T +LG
Sbjct: 684  SEASLDQVEVIKACLELFCKSSGQKVSLEKTRIFFSKNVGWSVREEISSALGFQRTDNLG 743

Query: 304  KYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSC 125
            KYLG+ I H  V +  +  II+KVNQRLS+WKA++LS A RLTLTK VL  LP Y +Q+ 
Sbjct: 744  KYLGVPIQHDRVNRRLYSSIINKVNQRLSSWKAKTLSFAGRLTLTKFVLVTLPMYTMQTA 803

Query: 124  IIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
             +P+ +CDDIDK CRSF+WG +  +++ H ++WS +C PK
Sbjct: 804  FLPRKICDDIDKECRSFLWGHKGEQQRIHAVAWSVICKPK 843


>gb|KYP72147.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 628

 Score =  239 bits (609), Expect = 7e-71
 Identities = 120/279 (43%), Positives = 170/279 (60%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+W FI++TL  +G P N+  L+WHCISS  +Q+LWNGE L  F+ 
Sbjct: 181  WMAIKIDLEKAYDRLNWTFIKETLTMIGIPLNLVELIWHCISSSSMQVLWNGETLPEFKP 240

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +   Q L   +    K P             F 
Sbjct: 241  TRGIRQGDPLSPYIFVLCMERLFHLIEVAVCQELWKPIKQSKKGPAISHLAFADNLILFA 300

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 QA  IK  L +FC S   KVS + TR+F+S NV ++++ +I   LGFQ T +LGK
Sbjct: 301  EASLDQAEIIKSCLDSFCLSSGMKVSEENTRVFFSKNVGWNVKSEISSSLGFQRTDNLGK 360

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLG+ + H  V++ SF+ +++ +N+R+S+WKA++LS A RLTLTKSVL ALPSY +Q+  
Sbjct: 361  YLGVQLHHTRVSRNSFQSVMNSINRRISSWKAKTLSFAGRLTLTKSVLAALPSYTMQTVF 420

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +P+ +CD+IDK  RSF+WG+    R+ H I+W  +C PK
Sbjct: 421  LPRQLCDEIDKASRSFLWGDSRAHRRVHAIAWETICKPK 459


>gb|KYP72596.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 646

 Score =  234 bits (597), Expect = 6e-69
 Identities = 127/291 (43%), Positives = 178/291 (61%), Gaps = 12/291 (4%)
 Frame = -3

Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668
           WMA+K+DLEKAYDRL W+FI+DTLED+GFPS   NLV  CI++P +++LWNGE L  F  
Sbjct: 10  WMALKVDLEKAYDRLEWSFIQDTLEDIGFPSTFINLVMACITTPKMRMLWNGEILDEFSP 69

Query: 667 -RFLEELGRVIPCLLIY----LFY-----AWKGFFT*LILKSEQRLGLLLSSH*KMP*DL 518
            R + +   + P + +     LF+       KGF++ + L      G  LS H     DL
Sbjct: 70  SRGIRQGDPISPYIFVLCIERLFHIIECAVEKGFWSPIQLSKR---GPKLS-HLGFADDL 125

Query: 517 TSCLCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRE 338
                     F   +  Q   I+  L  FC S  QKV+ +KT++F+S NV + +R ++  
Sbjct: 126 V--------LFAEANVEQVEVIQTCLDLFCKSSGQKVNKEKTKVFFSKNVSWTVRNQLSS 177

Query: 337 CLGFQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVL 158
            LG Q T DLGKYLG+ + HK VT  ++  I+DKV  R+S WK  SLS+A R+T  KSVL
Sbjct: 178 SLGVQRTEDLGKYLGVPLHHKRVTTNTYSNILDKVRNRMSCWKRNSLSMAGRVTFAKSVL 237

Query: 157 QALPSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            ALP+Y +Q+ ++PK +C+++DK+ R FIWGE +  RK H ISW+ +C PK
Sbjct: 238 NALPTYTMQTSLLPKTICEELDKLTRKFIWGENDHDRKIHTISWNTICQPK 288


>ref|XP_020210568.1| uncharacterized protein LOC109795461 [Cajanus cajan]
          Length = 1200

 Score =  239 bits (609), Expect = 4e-68
 Identities = 120/279 (43%), Positives = 170/279 (60%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+W FI++TL  +G P N+  L+WHCISS  +Q+LWNGE L  F+ 
Sbjct: 434  WMAIKIDLEKAYDRLNWTFIKETLTMIGIPLNLVELIWHCISSSSMQVLWNGETLPEFKP 493

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +   Q L   +    K P             F 
Sbjct: 494  TRGIRQGDPLSPYIFVLCMERLFHLIEVAVCQELWKPIKQSKKGPAISHLAFADNLILFA 553

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 QA  IK  L +FC S   KVS + TR+F+S NV ++++ +I   LGFQ T +LGK
Sbjct: 554  EASLDQAEIIKSCLDSFCLSSGMKVSEENTRVFFSKNVGWNVKSEISSSLGFQRTDNLGK 613

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLG+ + H  V++ SF+ +++ +N+R+S+WKA++LS A RLTLTKSVL ALPSY +Q+  
Sbjct: 614  YLGVQLHHTRVSRNSFQSVMNSINRRISSWKAKTLSFAGRLTLTKSVLAALPSYTMQTVF 673

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +P+ +CD+IDK  RSF+WG+    R+ H I+W  +C PK
Sbjct: 674  LPRQLCDEIDKASRSFLWGDSRAHRRVHAIAWETICKPK 712


>gb|KYP45885.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1192

 Score =  237 bits (604), Expect = 2e-67
 Identities = 124/279 (44%), Positives = 169/279 (60%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+WNFI++TLED+GFP  I  L+W+CIS+   ++LWNGE L +F  
Sbjct: 659  WMAIKIDLEKAYDRLNWNFIKETLEDIGFPLKIIELIWNCISTAKFRMLWNGEMLESFSP 718

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +   Q+L   +      P             F 
Sbjct: 719  SRGIRQGDPISPYLFVLCMERLFHLINISVTQKLWKPIRLSRSGPELSHLAFADDLILFA 778

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 Q   I+  L+ FC+S  QK+S +KTRIF+S NV +++R +I    GFQ   +LGK
Sbjct: 779  EARLDQVEIIQACLNLFCTSSGQKISQEKTRIFFSKNVNWNVRNEISSSFGFQRAENLGK 838

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLGI + H  V + +   II+KV QRL+ WKA+SLS A RLTLTKSVL ALPSY +Q+  
Sbjct: 839  YLGIPLHHSRVNRATHSGIIEKVTQRLNNWKAKSLSFAGRLTLTKSVLTALPSYTMQTVW 898

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +P+ +CDDIDK  R F+WG+ +  +K H +SWS +C PK
Sbjct: 899  LPRNICDDIDKKNRQFLWGDTSHNKKVHTVSWSVICQPK 937


>gb|KYP65965.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1043

 Score =  233 bits (593), Expect = 3e-66
 Identities = 123/278 (44%), Positives = 165/278 (59%)
 Frame = -3

Query: 838  MAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRFL 659
            +AIKIDLEKAYDRL+W FI+DTLED+G PS   +LVW CIS+  +Q+LWNGE L  F   
Sbjct: 244  LAIKIDLEKAYDRLNWLFIKDTLEDIGLPSKFIDLVWSCISTASLQVLWNGEVLEAFSPS 303

Query: 658  EELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFCR 479
              + +  P          +  F  + +   Q+L   +      P             F  
Sbjct: 304  RGIRQGDPISPYLFVLCMERLFHLIDITVTQQLWKPIRLSRGGPSLTHLAFADDLILFAE 363

Query: 478  CHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGKY 299
             +  Q   I+  L+ FCSS  QK+S +KTRIF+S NV   +R +I    GFQ   +LGKY
Sbjct: 364  ANMNQVEIIQSCLNHFCSSSGQKISQEKTRIFFSKNVARTVREEISSAFGFQRAENLGKY 423

Query: 298  LGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCII 119
            LGI + H  V ++++  I+DK+ QRLS WKA++LS A RLTLTKSVL ALPSY +Q   +
Sbjct: 424  LGIPLHHSRVNRDTYHGIMDKITQRLSNWKAKNLSFAGRLTLTKSVLAALPSYTMQMVRL 483

Query: 118  PKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            P+ +CD++DK CR F+WG+    RK H I WS +C PK
Sbjct: 484  PRSICDEVDKKCRQFLWGDSEDCRKIHTIGWSMLCLPK 521


>gb|KYP61054.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 636

 Score =  227 bits (578), Expect = 3e-66
 Identities = 122/282 (43%), Positives = 169/282 (59%), Gaps = 3/282 (1%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668
            WMAIKIDLEKAYDRL+W FI++TL D+G P+N   LVW CISS  ++++WNGEAL  F  
Sbjct: 218  WMAIKIDLEKAYDRLNWKFIKETLIDIGLPNNFVELVWACISSGKLRMMWNGEALEEFLP 277

Query: 667  -RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SF 491
             R + +   + P L +      +  F  + +  + RL   +      P            
Sbjct: 278  SRGVRQGDPISPYLFVLCM---ERLFQLINMTIDHRLWKPIQLSRNGPMISHLAFADDIV 334

Query: 490  AFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSD 311
             F      Q   I+  L+ FC S  QKVS +KTRIF+S NV + +R +I    GFQ T +
Sbjct: 335  LFAEASLDQVEVIQGCLNVFCDSAGQKVSNEKTRIFFSKNVGHVVRSEISNAFGFQRTEN 394

Query: 310  LGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQ 131
            LG YLG+   H  V+  +++ IIDKVN RLS WKA++LS A R+TLTKSVL+ALPSY +Q
Sbjct: 395  LGNYLGVPTHHSRVSHATYQSIIDKVNNRLSGWKAKNLSFAGRITLTKSVLEALPSYIMQ 454

Query: 130  SCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +  +PK VCD ++K  R F+WG+ +   + H I+W+ +C PK
Sbjct: 455  TVSLPKTVCDALEKSSRGFLWGDNSEHHRPHAINWNTICLPK 496


>gb|KYP48048.1| LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 364

 Score =  219 bits (557), Expect = 7e-66
 Identities = 121/289 (41%), Positives = 164/289 (56%), Gaps = 9/289 (3%)
 Frame = -3

Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668
           WMAIKIDLEKAYDRL WNF++DTL+D+G P    NL+W  I SP ++++WNGEAL  F  
Sbjct: 66  WMAIKIDLEKAYDRLKWNFVKDTLQDIGLPQIFVNLIWASILSPRLRMVWNGEALEEFTP 125

Query: 667 -RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLS------SH*KMP*DLTSC 509
            R + + G + P L +          +   + S+Q   + LS      SH     DL   
Sbjct: 126 SRGIRQGGPISPYLFVLCMERLFQLIS-AAVTSDQWKPIKLSRDGRPLSHLAFADDLV-- 182

Query: 508 LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329
                  F      Q   IK  L  FC+S  QKVSL+KTRI++S NV + +R +I    G
Sbjct: 183 ------LFAEASINQVEIIKTCLDLFCASSGQKVSLEKTRIYFSKNVNHSIREEISSTFG 236

Query: 328 FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149
           +Q   +LGKYLGI   H  V    ++ +I+ V++R   WK  +L    RLTL KSVL  +
Sbjct: 237 YQCIDNLGKYLGIPAHHSRVCHRDYQGLIEHVSRR--GWKTSALLFMGRLTLCKSVLSTI 294

Query: 148 PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2
           PSY +QS  +P+  CD+ID+ICR F+WG     R+FH I W+KVC  K+
Sbjct: 295 PSYTMQSVYLPRSTCDEIDRICRDFLWGGSRNNRRFHAIGWNKVCMAKE 343


>gb|KYP70239.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1157

 Score =  231 bits (590), Expect = 1e-65
 Identities = 121/279 (43%), Positives = 167/279 (59%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL+WNFI++TLED+GFP  I  L+W+CIS+   ++LWNGE L +F  
Sbjct: 674  WMAIKIDLEKAYDRLNWNFIKETLEDIGFPLKIIELIWNCISTAKFRMLWNGEMLESFSP 733

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +   Q+L   +      P             F 
Sbjct: 734  SRGIRQGDPISPYLFVLCMERLFHLINISVTQKLWKPIRLSRSGPELSHLAFADDLILFA 793

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 Q   I+  L+ FC+S  QK+S +KTRIF+S NV +++  +I     FQ   +LGK
Sbjct: 794  EARLDQVEIIQACLNLFCTSSGQKISQEKTRIFFSKNVNWNVINEISSSFSFQQAENLGK 853

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLGI + H  V + ++  II+KV QRL+ WKA+SLS A RLTLTKS L ALPSY +Q+  
Sbjct: 854  YLGIPLHHSRVNRATYSGIIEKVTQRLNNWKAKSLSFAGRLTLTKSFLTALPSYTMQTVW 913

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +P+ +CDDIDK  R F+WG+ +  +K H +SWS +C PK
Sbjct: 914  LPRNICDDIDKKNRQFLWGDTSHNKKVHTVSWSVICQPK 952


>gb|KYP74374.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1421

 Score =  226 bits (575), Expect = 2e-63
 Identities = 118/279 (42%), Positives = 165/279 (59%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WM IKIDLEKAYDRL+WNF++DTL D+GFP N  +L+W CISS  +++LWNGEAL  F  
Sbjct: 772  WMIIKIDLEKAYDRLNWNFVKDTLLDIGFPENFISLIWSCISSSKMRVLWNGEALEEFLP 831

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               + +  P          +  F  + +    +L   +      P             F 
Sbjct: 832  SRGVRQGDPISPYIFVLCMERLFHLIEIAVNHQLWKPIRISRGGPKIAHLAFADDLLLFA 891

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
                 Q   I+  L  FCSS  QKVS DKTRI +S NV + +R +I    GF  T +LGK
Sbjct: 892  EASVDQVEIIQTCLDLFCSSSGQKVSQDKTRIHFSKNVSWRVREEISNKFGFLRTDNLGK 951

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLG+ I H+ V +  F+ +++KVNQRLS+WKA++LS A R+TLT+SVL ALPSY +QS  
Sbjct: 952  YLGVPIHHRRVNRVLFKGVVEKVNQRLSSWKAKTLSFAGRVTLTQSVLSALPSYLMQSVY 1011

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +P+ VCD++DK  R F+W ++  + + H +SW  +  P+
Sbjct: 1012 LPRQVCDELDKHYRRFLWDDKENKHRLHAVSWEVISKPR 1050


>ref|XP_016164673.1| uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score =  225 bits (574), Expect = 3e-63
 Identities = 122/288 (42%), Positives = 167/288 (57%), Gaps = 8/288 (2%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WM IKIDLEKAYDRL+WNFI++TL D+GFP N  NL   CIS+  +++ WNGE L  F  
Sbjct: 1109 WMTIKIDLEKAYDRLNWNFIKETLMDIGFPQNFINLTLSCISTARMRVFWNGEELEEFSP 1168

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTS--------CL 506
               + +  P +  Y+F          I K  Q +   +      P  L          C 
Sbjct: 1169 TRGIRQGDP-ISPYIFVL-------CIEKLSQLISAAVEHDFWKPIRLKKDGPPISHLCF 1220

Query: 505  CR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGF 326
                  F   +  QA  I + L  FC S  QKVS DKTR+F+S NV +++R +I   + F
Sbjct: 1221 ADDIILFAEANVDQANIINKCLEAFCKSSGQKVSKDKTRVFFSRNVGHNVRTEISNVMQF 1280

Query: 325  QSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALP 146
              T DL KYLG+ I H  VTK +F  II+K++ RL++WKA SLSLA R TL KSVL ++P
Sbjct: 1281 TRTDDLRKYLGVPILHSKVTKHTFEGIINKLHVRLNSWKASSLSLAGRTTLVKSVLSSMP 1340

Query: 145  SYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2
             Y + S ++P   C+ ID+ICR+FIWG+ +  +K H ++W K+C PK+
Sbjct: 1341 IYNMHSALLPTATCNSIDRICRNFIWGDTDQNKKVHLLNWKKICEPKQ 1388


>gb|KYP53058.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
            cajan]
          Length = 1039

 Score =  223 bits (568), Expect = 9e-63
 Identities = 124/284 (43%), Positives = 165/284 (58%), Gaps = 9/284 (3%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL WNF++DTL+D+G P    NL+W  ISSP ++++WNGEAL  F  
Sbjct: 491  WMAIKIDLEKAYDRLKWNFVKDTLQDIGLPQTFVNLIWASISSPRLRMVWNGEALEEFTP 550

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LI---LKSEQRLGLLLS------SH*KMP*DLTSC 509
              E+ +  P +  YLF         LI     S Q   + LS      SH     DL   
Sbjct: 551  SREIRQGDP-ISPYLFVLCMERLFQLISAAANSNQWKPIKLSRDGPPLSHLAFADDLV-- 607

Query: 508  LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329
                   F      Q   IK  L  FC S  QK SL+KT+I++S NV + +R +I    G
Sbjct: 608  ------LFAEASINQVEIIKTCLDLFCVSSGQKASLEKTKIYFSKNVNHSIREEISSAFG 661

Query: 328  FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149
            +Q T +LGK+LGI   H  V    ++ +I++V++RLS WK  +LS A RLTL K+VL A+
Sbjct: 662  YQRTDNLGKFLGIPANHSRVCHRDYQGLIERVSRRLSGWKTSALSFAGRLTLCKTVLSAI 721

Query: 148  PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKV 17
            PSY +QS  +P+  CD+ID+I R F+WG     R+FH I W+KV
Sbjct: 722  PSYTMQSVYLPRRTCDEIDRISRDFLWGGSRNNRRFHAIGWNKV 765


>gb|KYP33748.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1133

 Score =  219 bits (557), Expect = 4e-61
 Identities = 130/289 (44%), Positives = 169/289 (58%), Gaps = 9/289 (3%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL W+F++DTL D+G PS   NLVW  I+SP  ++LWNGEAL  F  
Sbjct: 533  WMAIKIDLEKAYDRLKWSFVKDTLLDIGLPSQFVNLVWASITSPKFRMLWNGEALEEFSP 592

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LI---LKSEQRLGLLLS------SH*KMP*DLTSC 509
               + +  P +  YLF         LI   ++S+Q   + LS      SH     DL   
Sbjct: 593  SHGIRQGDP-ISPYLFVLCMERLFQLITSTVESQQWRPIKLSRDGPLLSHLAFADDL--- 648

Query: 508  LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329
                   F      Q   I+  L   C S  QKVS++KTRIF+S NV + +R +I    G
Sbjct: 649  -----ILFAEATSDQVEVIQSCLDQLCGSSGQKVSIEKTRIFFSKNVSHVIRNEISTTFG 703

Query: 328  FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149
            FQ TS+LGKYLGI   H  V +  ++ II++VN+RLS WK  +LS A RLTL KSVL A+
Sbjct: 704  FQCTSNLGKYLGIPAHHSRVCQRDYQEIIERVNKRLSGWKTSTLSFAGRLTLCKSVLSAI 763

Query: 148  PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PKK 2
            PSY +QS          +D++CRSF+ GE N +R++H I WS VC PK+
Sbjct: 764  PSYTMQS----------VDRLCRSFLSGESNNQRRYHAIGWSTVCQPKE 802


>gb|KYP57513.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 520

 Score =  210 bits (534), Expect = 9e-61
 Identities = 116/266 (43%), Positives = 161/266 (60%), Gaps = 9/266 (3%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668
            WMAIKIDLEKAYDRL W+F++DTL D+G P+   NLVW  ISSP +++LWNGEAL  F  
Sbjct: 264  WMAIKIDLEKAYDRLKWSFVKDTLLDIGLPNQFVNLVWVSISSPKLRMLWNGEALEEFVP 323

Query: 667  -RFLEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLS------SH*KMP*DLTSC 509
             R + +   + P L +          T   + S+Q   + LS      SH     DL   
Sbjct: 324  SRGIRQGDPISPYLFVLCMERLFHLIT-TTVDSQQWKPIRLSRDGPLLSHLAFADDL--- 379

Query: 508  LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329
                   F      Q   I+  L+ FC+S  QKVS++KTRI++S NV + +R ++    G
Sbjct: 380  -----ILFAEATLDQVEVIQSCLNHFCASSGQKVSIEKTRIYFSKNVSHIVRNEVSSAFG 434

Query: 328  FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149
            FQ T +LGKYLGI   H  V +  ++ II++VN+RLS WK+ +LS A RLTL KSVL A+
Sbjct: 435  FQRTDNLGKYLGIPAHHSRVCRRDYQGIIERVNKRLSGWKSSTLSFAGRLTLCKSVLSAI 494

Query: 148  PSYAVQSCIIPKGVCDDIDKICRSFI 71
            PSY +QS  +P+ VCD++D++C +F+
Sbjct: 495  PSYTMQSVFLPRSVCDEVDRLCSNFL 520


>gb|KYP69874.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 956

 Score =  212 bits (539), Expect = 6e-59
 Identities = 123/288 (42%), Positives = 167/288 (57%), Gaps = 9/288 (3%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMA KIDLEKAYDRL W+F++DTL D+G P+ + N++W CISSP +++LWNGE L  F  
Sbjct: 656  WMAFKIDLEKAYDRLKWDFVKDTLLDIGLPAQLVNIIWACISSPRMRMLWNGETLDEFLP 715

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILK---SEQRLGLLLS------SH*KMP*DLTSC 509
              ++ +  P +  YLF         LI K   +++   + L+      SH     DL   
Sbjct: 716  SRDVRQGDP-ISPYLFVLCIERLFQLITKEVEAKRWKPIRLAKDGPPLSHLAFADDL--- 771

Query: 508  LCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLG 329
                   F      QA  I+  L  FC+S  QKVSL+KT+IF+S NV + +R  I   LG
Sbjct: 772  -----ILFSEASMNQAEIIRDCLDRFCASSGQKVSLEKTKIFFSKNVAHTVRDDISSGLG 826

Query: 328  FQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQAL 149
            FQ T++LGKYLGI   H  V +  ++ +I+ VN+RLS WKA +LS A RLTL KSV++A+
Sbjct: 827  FQRTNNLGKYLGIPAHHSRVCRRDYQNVINCVNKRLSGWKASTLSFAGRLTLCKSVIEAI 886

Query: 148  PSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            PSY  +            DK+C SF+WG+    RK H ISW  +C PK
Sbjct: 887  PSYTSK----------QFDKLCMSFLWGDSPTSRKIHAISWKTICMPK 924


>gb|KYP49443.1| Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 548

 Score =  199 bits (506), Expect = 2e-56
 Identities = 124/285 (43%), Positives = 162/285 (56%), Gaps = 6/285 (2%)
 Frame = -3

Query: 841 WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
           WMAIKIDLEKAYDRL W FI+DTLED+G P     +VW CIS+P + +LWNGE L +F  
Sbjct: 105 WMAIKIDLEKAYDRLKWKFIKDTLEDIGLPQQFVEMVWACISTPSMSMLWNGEKLEDFTP 164

Query: 661 LEELGRVIPCLLIYLFY-AWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFA- 488
            + + +  P L  YLF    +  F  + +    +L   +      P    SCL   +FA 
Sbjct: 165 SKGIRQGDP-LSPYLFVLCMERVFHLIEIAVIHKLWKPIKLSKGGP--PLSCL---AFAD 218

Query: 487 ----FCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQS 320
               F      Q   I+Q L  FC S  QKVSL+KTRIF+S NV + ++ +I    GFQ 
Sbjct: 219 DLILFSEASMDQVEIIQQCLDIFCGSLGQKVSLEKTRIFFSKNVGWAVKNEISNAFGFQR 278

Query: 319 TSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSY 140
           T +LGKYLG+ I H  V +   R + DKVNQRL++WK R+LS+  RLTLTKSVL A+PSY
Sbjct: 279 TDNLGKYLGVSIHHDRVNRRLLRSVKDKVNQRLNSWKTRNLSVTGRLTLTKSVLAAIPSY 338

Query: 139 AVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +Q+   P            SF+  +  G RK H  +W K+  PK
Sbjct: 339 TMQTVFFPD-----------SFVM-KLTGLRKVHVKAWQKIYKPK 371


>ref|XP_016168765.1| uncharacterized protein LOC107611342 [Arachis ipaensis]
          Length = 917

 Score =  202 bits (513), Expect = 2e-55
 Identities = 113/279 (40%), Positives = 153/279 (54%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNFRF 662
            WMAIKIDLEKAYDRL   FI++TL D+G P N  NL+  CI +  +++LWNGE L  F  
Sbjct: 315  WMAIKIDLEKAYDRLKECFIKETLADIGLPQNFVNLILSCILTARMRVLWNGEELEEFTP 374

Query: 661  LEELGRVIPCLLIYLFYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DLTSCLCR*SFAFC 482
               +                GF+  + LK +             P     C       F 
Sbjct: 375  SRAVDH--------------GFWKPIRLKKDG------------PPISHLCFADDIILFA 408

Query: 481  RCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRECLGFQSTSDLGK 302
              +  QA  I + L  FC S  Q VS +KTR+ +S NV + +R ++   L F  T DLGK
Sbjct: 409  EANLEQANVINKCLEAFCDSSGQSVSKEKTRVIFSKNVGHTVRAELSNILQFSRTDDLGK 468

Query: 301  YLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVLQALPSYAVQSCI 122
            YLGI I H  V+K +F  II+K++ RL++WKA SLSLA R+TL K VL ++P Y +Q  +
Sbjct: 469  YLGIPILHSRVSKHAFEGIINKLHARLNSWKASSLSLAGRVTLVKYVLSSMPLYNMQYAV 528

Query: 121  IPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
            +    C+ ID ICR+F+WG     +K H +SW +VC PK
Sbjct: 529  LSSTTCNTIDCICRNFLWGNTEQTKKIHLLSWKRVCEPK 567


>ref|XP_016206284.1| uncharacterized protein LOC107646622 [Arachis ipaensis]
          Length = 1460

 Score =  200 bits (509), Expect = 1e-54
 Identities = 112/291 (38%), Positives = 168/291 (57%), Gaps = 12/291 (4%)
 Frame = -3

Query: 841  WMAIKIDLEKAYDRLSWNFIRDTLEDMGFPSNISNLVWHCISSPFIQLLWNGEALGNF-- 668
            +MAIKIDLEKAYD L+W FIRDTL +   P N+ +L+ HC SS  +++LWNG    +F  
Sbjct: 322  YMAIKIDLEKAYDLLNWKFIRDTLIEARLPENLVDLISHCYSSAEMKVLWNGIPSNSFTP 381

Query: 667  -RFLEELGRVIPCLLIYL---------FYAWKGFFT*LILKSEQRLGLLLSSH*KMP*DL 518
             R + +   + P L +           F   + F+  ++L    R G  LS         
Sbjct: 382  SRGIRQGDPMSPYLFVLCIERLSQIISFAVNQNFWEPMVLN---RGGPKLSH-------- 430

Query: 517  TSCLCR*SFAFCRCHC*QARFIKQVLHTFCSSYDQKVSLDKTRIFYSDNVPYHLRVKIRE 338
              C       F +    Q   ++ +L  FC    QKV+  K  +++SDN+ +  + ++ +
Sbjct: 431  -LCFADDIVLFGKASMEQVEVVRGILDLFCKCSGQKVNYFKFCVYFSDNMCFARKKELSD 489

Query: 337  CLGFQSTSDLGKYLGILIFHKNVTKESFRFIIDKVNQRLSAWKARSLSLADRLTLTKSVL 158
             LG + T+++GKYLG+ + H    KE F+FI+D++  RLS+WKA +LSLA R+TLT+S L
Sbjct: 490  ALGMRLTNNMGKYLGVPLLHGRSKKEDFQFILDRMANRLSSWKATNLSLAGRVTLTQSAL 549

Query: 157  QALPSYAVQSCIIPKGVCDDIDKICRSFIWGEENGRRKFHPISWSKVC*PK 5
             ++PSY +Q+  +P  +CD IDKICR+F+WG  +  RK H +SW KVC PK
Sbjct: 550  ASIPSYVMQTMKLPLSICDSIDKICRNFLWGSVSSGRKPHLMSWEKVCLPK 600


Top