BLASTX nr result

ID: Astragalus23_contig00027405 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00027405
         (1280 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU48433.1| hypothetical protein TSUD_135600 [Trifolium subt...   261   9e-81
gb|PNY00212.1| base excision DNA repair HhH-GPD family protein, ...   252   4e-78
ref|XP_004512816.1| PREDICTED: uncharacterized protein LOC101490...   263   4e-77
ref|XP_014522127.1| methyl-CpG-binding domain protein 4-like pro...   235   8e-69
ref|XP_013443136.1| base excision DNA repair protein, HhH-GPD fa...   224   6e-67
ref|XP_013443133.1| base excision DNA repair protein, HhH-GPD fa...   224   7e-67
ref|XP_020536364.1| methyl-CpG-binding domain protein 4-like pro...   230   4e-66
ref|XP_023898843.1| methyl-CpG-binding domain protein 4-like pro...   219   4e-66
gb|KYP57120.1| Methyl-CpG-binding domain protein 4 [Cajanus cajan]    224   5e-66
ref|XP_022847820.1| methyl-CpG-binding domain protein 4-like pro...   230   2e-65
gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidops...   224   3e-65
ref|XP_016198400.1| uncharacterized protein LOC107639420 isoform...   234   4e-65
ref|XP_020976903.1| uncharacterized protein LOC107639420 isoform...   234   4e-65
ref|XP_002882558.2| methyl-CpG-binding domain protein 4-like pro...   224   8e-65
ref|NP_001327391.1| DNA glycosylase superfamily protein [Arabido...   219   1e-64
gb|OVA19139.1| HhH-GPD domain [Macleaya cordata]                      221   1e-64
ref|XP_016668086.1| PREDICTED: methyl-CpG-binding domain protein...   219   2e-64
ref|XP_020976904.1| uncharacterized protein LOC107639420 isoform...   231   5e-64
ref|XP_020996176.1| uncharacterized protein LOC107484588 isoform...   231   5e-64
ref|XP_015960626.1| uncharacterized protein LOC107484588 isoform...   231   7e-64

>dbj|GAU48433.1| hypothetical protein TSUD_135600 [Trifolium subterraneum]
          Length = 317

 Score =  261 bits (666), Expect = 9e-81
 Identities = 152/313 (48%), Positives = 187/313 (59%)
 Frame = -3

Query: 1197 SREISNYFPKKPIDEEAIVEVNPSLGNVTKHFHKTAQNEDPNVRLSPYFQNNYANKTSHF 1018
            SR+ S YF    + E  + +V+P   NV        Q   PN + SPYF+N         
Sbjct: 69   SRKASPYFQN--VQESKLRKVSPYFRNV--------QESKPN-KASPYFRNV-------- 109

Query: 1017 SIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGS 838
                    +ESK  KV  S YF KNS V E+                             
Sbjct: 110  --------QESKPRKV--SPYFQKNSGVTESLK--------------------------- 132

Query: 837  LGPPNVSKYFKKRLKVKESKIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCS 658
                  + + ++R KV++ K K  + RK   + KP  K E   +AY+RKT DNNW PP S
Sbjct: 133  ------ADHSEERPKVEKPKRKFKNGRK---KTKPFPKAERRKEAYKRKTPDNNWLPPRS 183

Query: 657  PWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVI 478
             W+L QEDH HDPWRVLVIC+LLNRTTG QTKK+L +FF+LCPDAE+C QVP EEI+ +I
Sbjct: 184  YWNLIQEDHFHDPWRVLVICMLLNRTTGDQTKKILANFFELCPDAETCMQVPREEIQDLI 243

Query: 477  RSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHML 298
            RSLGL  KRS MLQR SREYLAE WT+ TEL  VG+YAADAYAIFCTGKWDEVIP+DHML
Sbjct: 244  RSLGLHAKRSKMLQRLSREYLAETWTYVTELHSVGRYAADAYAIFCTGKWDEVIPDDHML 303

Query: 297  LKYWEYIRIINNV 259
             KYW+++R I ++
Sbjct: 304  NKYWDFLRTIKHM 316


>gb|PNY00212.1| base excision DNA repair HhH-GPD family protein, partial [Trifolium
            pratense]
          Length = 270

 Score =  252 bits (644), Expect = 4e-78
 Identities = 145/279 (51%), Positives = 178/279 (63%), Gaps = 3/279 (1%)
 Frame = -3

Query: 1116 VTKHFHKTAQNEDPNVRLSPYFQNNYANKTSHFSIGASQTHEESKNLKVGDSKYFLKNSI 937
            V+ +FHK  +++    +LSPYFQ    +K    S    +  EESK  K+  S YF K   
Sbjct: 3    VSPYFHKVEESKPK--KLSPYFQKVEESKPKKLSPYFPKV-EESKPKKL--SPYFPK--- 54

Query: 936  VEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGSLGPP---NVSKYFKKRLKVKESKIKSN 766
            VEE+K +                     + N S+      + SKY + RL V++ K KS 
Sbjct: 55   VEESKPKKLSPYFPKVEESKPKKLSPYFQKNSSITESLKADDSKYSQTRLIVEKRKRKSK 114

Query: 765  SRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLN 586
            +  K   + KPL+K E + +AY+RKT DNNW PP S W+L QEDH HDPWRVLVIC+LLN
Sbjct: 115  NSGK---KTKPLTKAERFKEAYKRKTPDNNWLPPRSHWNLIQEDHFHDPWRVLVICMLLN 171

Query: 585  RTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAEN 406
             TTG+Q KK+L +FF+LCPDAE+C QVP EEI+ +IRSLGL   RS  LQR SREYLAE 
Sbjct: 172  VTTGNQAKKILANFFELCPDAETCIQVPREEIQEIIRSLGLHANRSKSLQRLSREYLAET 231

Query: 405  WTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKY 289
            WTH TEL GVG+YAADAYAIFCTGKWDEV P DHML  Y
Sbjct: 232  WTHVTELHGVGRYAADAYAIFCTGKWDEVRPHDHMLNNY 270


>ref|XP_004512816.1| PREDICTED: uncharacterized protein LOC101490359 isoform X1 [Cicer
            arietinum]
          Length = 741

 Score =  263 bits (673), Expect = 4e-77
 Identities = 164/355 (46%), Positives = 203/355 (57%), Gaps = 29/355 (8%)
 Frame = -3

Query: 1239 TKNLKVRDSCCRCCSREISNYFPKKPIDEEAIVEVNPSL----GNVTKHFHKTAQNEDPN 1072
            T+NLKV DS       +IS Y  K    +EA ++    L     N  +  HK    E+  
Sbjct: 414  TENLKVLDSSY---FGKISEYIAKV---QEAKIKSESLLQKLSNNSVREGHKV---EELK 464

Query: 1071 VRLSPYFQNN--------------YANKTSHFSIGASQTHEE----SKNLKVGDSKYFLK 946
            V    + Q+N              Y    S F    + +  E    SKNLKV DS  F+ 
Sbjct: 465  VESPSHVQDNCMKNFDDFVSQFQYYGGSVSIFKEYVTNSEYEGIGKSKNLKVEDSSCFVG 524

Query: 945  NSIVEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGSLG---PPNVS----KYFKKRLKVK 787
                  AKVE                K+ G  +   L    P  ++       +KRLKV+
Sbjct: 525  MISEYRAKVE--------------EVKIKGESLLQKLSNLLPEGLNVEEDSSSQKRLKVE 570

Query: 786  ESKIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVL 607
            + K+KS        + KP  K E Y +AY+RKT +NNW PP S W+L QEDH  DPWRVL
Sbjct: 571  KPKMKSKR------KTKPFPKVERYKEAYKRKTPNNNWLPPRSHWNLIQEDHFQDPWRVL 624

Query: 606  VICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFS 427
            VIC+LLNRTTGSQ KK+L++FFKLCP+AESC QVP EEI+ VIR+LGL  KRS MLQR S
Sbjct: 625  VICMLLNRTTGSQAKKILIEFFKLCPNAESCMQVPREEIQEVIRTLGLQGKRSEMLQRLS 684

Query: 426  REYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKYWEYIRIINN 262
            REYL+  WT+ TELPGVGKYAADAYAIFCTGKWDEV+PEDHML KYW+++  I +
Sbjct: 685  REYLSAPWTYVTELPGVGKYAADAYAIFCTGKWDEVVPEDHMLNKYWDFLHTIKH 739


>ref|XP_014522127.1| methyl-CpG-binding domain protein 4-like protein isoform X1 [Vigna
            radiata var. radiata]
 ref|XP_022632001.1| methyl-CpG-binding domain protein 4-like protein isoform X1 [Vigna
            radiata var. radiata]
          Length = 492

 Score =  235 bits (600), Expect = 8e-69
 Identities = 140/332 (42%), Positives = 184/332 (55%), Gaps = 28/332 (8%)
 Frame = -3

Query: 1188 ISNYFPKKPIDEEAIVEVNPSLGN-------VTKHFHKTAQNEDPNVRLSPYFQNNYANK 1030
            +S YF K     ++  ++N +L N       V+ +FH    +    + +SPYFQN+   K
Sbjct: 171  VSPYFHK-----DSGKKINTNLDNKPLGSRYVSPYFH---DDSGKKIVVSPYFQNDSGKK 222

Query: 1029 T---------SHFSIGASQTHEESKNLKVGDSKYFLKNS---IV--------EEAKVEIX 910
            T         S   I  S   +     K+ +S YF  +S   IV         E K+++ 
Sbjct: 223  TVVSPYFQNDSRKKIVVSPYFQNDSGKKIVNSPYFQNDSGKKIVVSPYFHNDSEKKIDVK 282

Query: 909  XXXXXXXXXXXXXPKVGGPEINGSLGPPNVSKYFKKRLKVKESKIKSNSRRKNPVRVKP- 733
                           V   +  G        K     L   E+ ++ N   +  + +K  
Sbjct: 283  AEPLVQKNVTHAIRYVSPLDEGG--------KMESIALHAAENFVEENKSSEKSIEIKKN 334

Query: 732  LSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVL 553
            LS  E + +AY RKT DN WKPP S   L QEDHAHDPWRVLVIC+LLNRT+G+QTKK++
Sbjct: 335  LSASEKWNEAYRRKTPDNTWKPPRSATVLIQEDHAHDPWRVLVICMLLNRTSGTQTKKIV 394

Query: 552  LDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVG 373
            LDFFKLCPDA+SCT+VP +EI + IR+LG   KR+ M+QR S EYL E+WTH T+L GVG
Sbjct: 395  LDFFKLCPDAKSCTEVPRKEIEKTIRTLGFQHKRAKMVQRLSEEYLDESWTHVTQLHGVG 454

Query: 372  KYAADAYAIFCTGKWDEVIPEDHMLLKYWEYI 277
            KYAADAYAIF  G WD+V P DHML  YWEY+
Sbjct: 455  KYAADAYAIFVNGVWDKVRPADHMLNYYWEYL 486


>ref|XP_013443136.1| base excision DNA repair protein, HhH-GPD family protein [Medicago
           truncatula]
 gb|KEH17161.1| base excision DNA repair protein, HhH-GPD family protein [Medicago
           truncatula]
          Length = 280

 Score =  224 bits (570), Expect = 6e-67
 Identities = 110/187 (58%), Positives = 135/187 (72%), Gaps = 2/187 (1%)
 Frame = -3

Query: 831 PPNVSKYFKKRLKVKESKIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPW 652
           PP + K  +KR KV++        RK+  + KP  K +   +AY+RKT DNNW PP S +
Sbjct: 96  PPKIPKDSRKRPKVEKP-------RKSKRKTKPFLKADRCREAYKRKTLDNNWVPPPSGF 148

Query: 651 H--LFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVI 478
              L QE H HDPWRV+VIC+LLNRT G QTK+VL +FF+LCPDAE+C QV  EEI  VI
Sbjct: 149 EFPLLQEHHFHDPWRVIVICMLLNRTLGKQTKQVLDNFFELCPDAETCMQVKREEIEEVI 208

Query: 477 RSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHML 298
           ++LG   KRS  LQRFSREYL E WT+ TEL GVGKYAADAYAIFCTGKWDEV+P+D+ L
Sbjct: 209 KTLGFQVKRSRSLQRFSREYLTETWTYVTELHGVGKYAADAYAIFCTGKWDEVVPDDYKL 268

Query: 297 LKYWEYI 277
            +YW ++
Sbjct: 269 NEYWNFL 275


>ref|XP_013443133.1| base excision DNA repair protein, HhH-GPD family protein [Medicago
           truncatula]
 gb|KEH17158.1| base excision DNA repair protein, HhH-GPD family protein [Medicago
           truncatula]
          Length = 282

 Score =  224 bits (570), Expect = 7e-67
 Identities = 112/195 (57%), Positives = 136/195 (69%), Gaps = 5/195 (2%)
 Frame = -3

Query: 831 PPNVSKYFKKRLKVKESKIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSP- 655
           PP + K  +KR KV++        RK+  + KP  K +   +AY+RKT DNNW PP S  
Sbjct: 93  PPKIPKDSRKRPKVEKP-------RKSKRKTKPFLKADRCREAYKRKTLDNNWVPPRSTP 145

Query: 654 ----WHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIR 487
                 L QEDH HDPWRV+VIC+LLNRT G Q +KVL +FFKLCP+AE+C QVP  EI+
Sbjct: 146 PLVEKPLLQEDHFHDPWRVIVICMLLNRTKGQQAEKVLANFFKLCPNAETCMQVPKVEIQ 205

Query: 486 RVIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPED 307
            VI++LGL  KRS  LQR SREYLA  WT+ TEL  VGKYAADAYAIFCTGKWDEV+P+D
Sbjct: 206 EVIKTLGLQVKRSESLQRLSREYLAGTWTYVTELHSVGKYAADAYAIFCTGKWDEVVPDD 265

Query: 306 HMLLKYWEYIRIINN 262
           H L KYW ++  I +
Sbjct: 266 HKLNKYWNFLHSIKD 280


>ref|XP_020536364.1| methyl-CpG-binding domain protein 4-like protein isoform X2 [Jatropha
            curcas]
 gb|KDP47061.1| hypothetical protein JCGZ_10788 [Jatropha curcas]
          Length = 573

 Score =  230 bits (587), Expect = 4e-66
 Identities = 148/362 (40%), Positives = 191/362 (52%), Gaps = 27/362 (7%)
 Frame = -3

Query: 1266 IGANQTHEETKNLKVRDSCCRC--CSREISNYFPKKPIDEE------------------- 1150
            +G N+   E   +K    C R    ++ IS YF K P +EE                   
Sbjct: 227  VGINE--RELMKIKPLKPCGRAGRAAKNISPYFQKVPKEEEDVDNRTDNEYRPKKSSKKC 284

Query: 1149 --AIVEVNPSLGNVTKHFHKTAQNEDPNVRLSPYFQNNYAN-KTSHFSIGASQTHEESKN 979
              A V  +P++G V+ +FHK  + E+          NN+   KTS       +T    +N
Sbjct: 285  KNASVGADPTVGYVSPYFHKIPRKEEA-------IDNNHEQRKTSR----KRKTGATIQN 333

Query: 978  LKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGSLGPPNVSKYFKKR 799
            +    S YF K S  +EA+                  K    EI G     NVS YF K 
Sbjct: 334  V----SPYFKKVSNEQEAEASSLIDGKRKRKKSSKKNKEEPCEIAGPT-VRNVSPYFHKE 388

Query: 798  LKVKES---KIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHA 628
                 +   K  S  R+++      L+  E   +AY RKT DN WKPP S   L QE+HA
Sbjct: 389  EAADSNNGQKQSSKGRKRSARTSIVLTASEKRSEAYLRKTPDNTWKPPQSEHGLLQENHA 448

Query: 627  HDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRS 448
            HDPWRVLVIC+LLN TTG+Q ++V+ D F LCP AE+   V  EEI R+I  LGL +KR+
Sbjct: 449  HDPWRVLVICMLLNCTTGTQVRRVIEDLFTLCPSAEAAINVMKEEIERIIEPLGLQKKRA 508

Query: 447  AMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKYWEYIRII 268
             M+QR S+EYL ++WTH T+L GVGKYAADAYAIFCTGKWD+V P DHML  YWE++  I
Sbjct: 509  VMIQRMSQEYLEDHWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPADHMLNYYWEFLGRI 568

Query: 267  NN 262
            NN
Sbjct: 569  NN 570


>ref|XP_023898843.1| methyl-CpG-binding domain protein 4-like protein [Quercus suber]
          Length = 210

 Score =  219 bits (558), Expect = 4e-66
 Identities = 105/188 (55%), Positives = 139/188 (73%), Gaps = 5/188 (2%)
 Frame = -3

Query: 822 VSKYFKKRLKVKESK----IKSNSRRKNPVRVKP-LSKDEMYLDAYERKTSDNNWKPPCS 658
           VS YF+K  K +E++    ++ ++  K PV +K  LS  E   DAY RK+ DN WKPP +
Sbjct: 17  VSPYFQKISKEEENEDGRLLEGSNGYKKPVAIKTVLSSSEKLDDAYRRKSPDNMWKPPRT 76

Query: 657 PWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVI 478
              L QE HAHDPWRVLVIC+LLNRTTG Q ++V+ + F LCPDA++ T+V  EEI ++I
Sbjct: 77  TPGLLQERHAHDPWRVLVICMLLNRTTGFQARRVISNLFTLCPDAKTATEVAKEEIEKII 136

Query: 477 RSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHML 298
           ++LGL +KR+ M+QR S+EY+ E+WTH T+L GVGKYAADAYAIFCTGKWD+V P DHML
Sbjct: 137 KTLGLQKKRALMIQRLSQEYMGESWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHML 196

Query: 297 LKYWEYIR 274
             YW+ ++
Sbjct: 197 NHYWKSLK 204


>gb|KYP57120.1| Methyl-CpG-binding domain protein 4 [Cajanus cajan]
          Length = 347

 Score =  224 bits (570), Expect = 5e-66
 Identities = 107/172 (62%), Positives = 130/172 (75%), Gaps = 1/172 (0%)
 Frame = -3

Query: 780 KIKSNSRRKNPVRVK-PLSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLV 604
           ++ ++S  +  + +K  LS  E++ +AY+R+T DN WKPP S   L QEDH HDPWRVLV
Sbjct: 172 EVNTSSCSEESIEIKRKLSALEIWDEAYKRRTPDNTWKPPRSATGLIQEDHIHDPWRVLV 231

Query: 603 ICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSR 424
           IC+LLNRTTG Q KK++ D FKLCPDA+SCTQV  EEI + I+SLGL  KR+AMLQRFS 
Sbjct: 232 ICMLLNRTTGRQAKKIVSDLFKLCPDAKSCTQVAREEIEKTIQSLGLQHKRAAMLQRFSE 291

Query: 423 EYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKYWEYIRII 268
           EYL E+WTH T+L GVGKYAADAYAIF TG WD V P DHML  YWE++  I
Sbjct: 292 EYLDESWTHVTQLHGVGKYAADAYAIFITGMWDRVKPTDHMLNYYWEFLHRI 343


>ref|XP_022847820.1| methyl-CpG-binding domain protein 4-like protein [Olea europaea var.
            sylvestris]
          Length = 636

 Score =  230 bits (587), Expect = 2e-65
 Identities = 149/346 (43%), Positives = 191/346 (55%), Gaps = 40/346 (11%)
 Frame = -3

Query: 1194 REISNYFPKKPIDEEAI-----VEVNPSLGN-------VTKHFHKTAQNEDPNV------ 1069
            R +S YF  K   EEA      +E+  S          V+ +FH   Q ED  V      
Sbjct: 300  RVVSPYFRNKETGEEAETNDGKIELQKSQAKNILTARKVSPYFHHIKQEEDNAVTSLLDG 359

Query: 1068 ----RLSPYFQNNYANKTSHFSIGASQT----HEES-KNLKVGDSKYFLKNSIVEEAKVE 916
                ++ P    N   + +  SI    T    H+ S KN++V  S YF      EEA+  
Sbjct: 360  TTKSKVRPRKVKNTVTENATISIQTPSTGKRGHKGSYKNVRVV-SPYFRNKETGEEAETN 418

Query: 915  IXXXXXXXXXXXXXXPKVGGPEINGSLGPPNVSKYFKKRLKVKE-----------SKIKS 769
                            ++   +    L    VS YF   +K +E           +K K 
Sbjct: 419  ------------DVKIELLKSQAKNVLTARKVSPYFH-HVKQEEDNAVTSLLDGTTKSKV 465

Query: 768  NSRR-KNPVRVKP-LSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICL 595
              R+ KN V  K  LS DE + +AY+R+T DN WKPP SP++L QEDH  DPWRVLVIC+
Sbjct: 466  RPRKVKNKVTAKSVLSADEKWDEAYKRRTPDNMWKPPRSPYNLLQEDHVFDPWRVLVICM 525

Query: 594  LLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYL 415
            LLN TTG QT KV+ +FF LCP+A+S T+V  E+I +VI+SLGL+RKR+  +Q FSR YL
Sbjct: 526  LLNVTTGRQTGKVISEFFTLCPNAKSATEVAKEDIEKVIQSLGLYRKRAEGIQHFSRMYL 585

Query: 414  AENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKYWEYI 277
             E+WTH TELPG+GKYAADAYAIFCTGKWD V P DHML KYWE++
Sbjct: 586  EESWTHVTELPGIGKYAADAYAIFCTGKWDRVRPLDHMLTKYWEFL 631


>gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 435

 Score =  224 bits (572), Expect = 3e-65
 Identities = 129/311 (41%), Positives = 177/311 (56%), Gaps = 3/311 (0%)
 Frame = -3

Query: 1194 REISNYFPKKPIDEEAIVEVNPSLGNVTKHFHKTAQNEDPNVRL-SPYFQNNYANKTSHF 1018
            R +S YF    + +++  E +    +V     +        V + SPYFQ++  ++    
Sbjct: 134  RRVSPYFQGSTVSQQSKEECDSD--SVCSQSGRNCSKVQAKVPIVSPYFQSSTISQCGSD 191

Query: 1017 SIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGS 838
             + +SQ+    KN + G SK         +AKV                 +    ++   
Sbjct: 192  IVSSSQS---GKNYRRGSSK--------RQAKVRRDSPYFQESTVSEQPSQAPPRDLRQY 240

Query: 837  LGPPNVSKYFKKR-LKVKES-KIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPP 664
                 VS+YF    ++V ES K KS   RK PV    LS  +   +AY+RKT D  W PP
Sbjct: 241  FKVVKVSRYFHADGIQVNESQKEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPP 300

Query: 663  CSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRR 484
             SP +L QE H HDPWRVLVIC+LLN+T+G+QT+ V+ D F LCPDA++ T+V   EI  
Sbjct: 301  RSPCNLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIES 360

Query: 483  VIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDH 304
            +I+ LGL +KR+ M+QRFS EYL E+WTH T+L G+GKYAADAYAIFC G WD V P+DH
Sbjct: 361  LIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDH 420

Query: 303  MLLKYWEYIRI 271
            ML  YWE++RI
Sbjct: 421  MLNYYWEFLRI 431


>ref|XP_016198400.1| uncharacterized protein LOC107639420 isoform X2 [Arachis ipaensis]
          Length = 938

 Score =  234 bits (597), Expect = 4e-65
 Identities = 137/323 (42%), Positives = 183/323 (56%), Gaps = 18/323 (5%)
 Frame = -3

Query: 1191 EISNYFPKKPIDEEAIVEVNPSLGNVTKHFH---------KTAQNEDPNVRLSPYFQNNY 1039
            EISN   KK + +   + VN ++  V+ +FH         KT  +E  +  ++ +   N 
Sbjct: 597  EISNINYKKKVPKVRKILVNGAVRYVSPYFHNASGKKNNVKTLNDEGKSEVIALHTSQNI 656

Query: 1038 ANKTSHFSIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVG 859
             +  S       +TH   K+ + G  +  L +  +  A   +               K+ 
Sbjct: 657  MDDISQ------ETHNMRKHKQRGKHESHLGSVALTAASGNLFEAGLQESKNEVGTVKIL 710

Query: 858  GPEINGSLGPP-------NVSKYFKKRLKVKESKIKS--NSRRKNPVRVKPLSKDEMYLD 706
             P+   S  P         VS YF+ +  V   ++      R+K  +  K LS +E   +
Sbjct: 711  -PKKRKSKRPKALQDADIKVSPYFQNQWLVGCEQVDKAKKGRKKCNIINKQLSAEEKKDE 769

Query: 705  AYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPD 526
            AY+++T DN WKPP SP++L QE HA+DPWRVLVIC+LLNRTTG Q   V+LD F LCPD
Sbjct: 770  AYKKRTPDNTWKPPRSPFNLLQEPHAYDPWRVLVICMLLNRTTGGQAGPVILDLFNLCPD 829

Query: 525  AESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAI 346
            A+SCTQV  E+I  +I+SLGL +KRS MLQRFS EYL  NWTH T+L GVGKYAADAYAI
Sbjct: 830  AKSCTQVEQEKIEEIIKSLGLQKKRSRMLQRFSEEYLNGNWTHVTQLHGVGKYAADAYAI 889

Query: 345  FCTGKWDEVIPEDHMLLKYWEYI 277
            FCTGKWD V P DHML  YWE++
Sbjct: 890  FCTGKWDRVTPTDHMLNHYWEFL 912


>ref|XP_020976903.1| uncharacterized protein LOC107639420 isoform X1 [Arachis ipaensis]
          Length = 942

 Score =  234 bits (597), Expect = 4e-65
 Identities = 137/323 (42%), Positives = 183/323 (56%), Gaps = 18/323 (5%)
 Frame = -3

Query: 1191 EISNYFPKKPIDEEAIVEVNPSLGNVTKHFH---------KTAQNEDPNVRLSPYFQNNY 1039
            EISN   KK + +   + VN ++  V+ +FH         KT  +E  +  ++ +   N 
Sbjct: 601  EISNINYKKKVPKVRKILVNGAVRYVSPYFHNASGKKNNVKTLNDEGKSEVIALHTSQNI 660

Query: 1038 ANKTSHFSIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVG 859
             +  S       +TH   K+ + G  +  L +  +  A   +               K+ 
Sbjct: 661  MDDISQ------ETHNMRKHKQRGKHESHLGSVALTAASGNLFEAGLQESKNEVGTVKIL 714

Query: 858  GPEINGSLGPP-------NVSKYFKKRLKVKESKIKS--NSRRKNPVRVKPLSKDEMYLD 706
             P+   S  P         VS YF+ +  V   ++      R+K  +  K LS +E   +
Sbjct: 715  -PKKRKSKRPKALQDADIKVSPYFQNQWLVGCEQVDKAKKGRKKCNIINKQLSAEEKKDE 773

Query: 705  AYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPD 526
            AY+++T DN WKPP SP++L QE HA+DPWRVLVIC+LLNRTTG Q   V+LD F LCPD
Sbjct: 774  AYKKRTPDNTWKPPRSPFNLLQEPHAYDPWRVLVICMLLNRTTGGQAGPVILDLFNLCPD 833

Query: 525  AESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAI 346
            A+SCTQV  E+I  +I+SLGL +KRS MLQRFS EYL  NWTH T+L GVGKYAADAYAI
Sbjct: 834  AKSCTQVEQEKIEEIIKSLGLQKKRSRMLQRFSEEYLNGNWTHVTQLHGVGKYAADAYAI 893

Query: 345  FCTGKWDEVIPEDHMLLKYWEYI 277
            FCTGKWD V P DHML  YWE++
Sbjct: 894  FCTGKWDRVTPTDHMLNHYWEFL 916


>ref|XP_002882558.2| methyl-CpG-binding domain protein 4-like protein [Arabidopsis lyrata
            subsp. lyrata]
          Length = 479

 Score =  224 bits (572), Expect = 8e-65
 Identities = 129/311 (41%), Positives = 177/311 (56%), Gaps = 3/311 (0%)
 Frame = -3

Query: 1194 REISNYFPKKPIDEEAIVEVNPSLGNVTKHFHKTAQNEDPNVRL-SPYFQNNYANKTSHF 1018
            R +S YF    + +++  E +    +V     +        V + SPYFQ++  ++    
Sbjct: 178  RRVSPYFQGSTVSQQSKEECDSD--SVCSQSGRNCSKVQAKVPIVSPYFQSSTISQCGSD 235

Query: 1017 SIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXXXXXXXXPKVGGPEINGS 838
             + +SQ+    KN + G SK         +AKV                 +    ++   
Sbjct: 236  IVSSSQS---GKNYRRGSSK--------RQAKVRRDSPYFQESTVSEQPSQAPPRDLRQY 284

Query: 837  LGPPNVSKYFKKR-LKVKES-KIKSNSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPP 664
                 VS+YF    ++V ES K KS   RK PV    LS  +   +AY+RKT D  W PP
Sbjct: 285  FKVVKVSRYFHADGIQVNESQKEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPP 344

Query: 663  CSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRR 484
             SP +L QE H HDPWRVLVIC+LLN+T+G+QT+ V+ D F LCPDA++ T+V   EI  
Sbjct: 345  RSPCNLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIES 404

Query: 483  VIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDH 304
            +I+ LGL +KR+ M+QRFS EYL E+WTH T+L G+GKYAADAYAIFC G WD V P+DH
Sbjct: 405  LIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDH 464

Query: 303  MLLKYWEYIRI 271
            ML  YWE++RI
Sbjct: 465  MLNYYWEFLRI 475


>ref|NP_001327391.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
 gb|ANM65422.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
          Length = 329

 Score =  219 bits (559), Expect = 1e-64
 Identities = 124/279 (44%), Positives = 165/279 (59%), Gaps = 13/279 (4%)
 Frame = -3

Query: 1068 RLSPYFQNNYANKTSHFSIGASQTHEESKNLKVGDSK----------YFLKNSIVEEAKV 919
            R+SPYFQ +  ++     + +SQ+    +N + G SK          YF ++++ E+   
Sbjct: 70   RVSPYFQASTISQCDSDIVSSSQS---GRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQ 126

Query: 918  EIXXXXXXXXXXXXXXPKVGGPE-INGSLGPPNVSKYFKKR-LKVKES-KIKSNSRRKNP 748
                                 P+ +        VS+YF    ++V ES K KS + RK P
Sbjct: 127  --------------------APKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTP 166

Query: 747  VRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQ 568
            +    LS  +   D Y RKT DN W PP SP +L QEDH HDPWRVLVIC+LLN+T+G+Q
Sbjct: 167  IVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQ 226

Query: 567  TKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAENWTHATE 388
            T+ V+ D F LC DA++ T+V  EEI  +I+ LGL +KR+ M+QR S EYL E+WTH T+
Sbjct: 227  TRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQ 286

Query: 387  LPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKYWEYIRI 271
            L GVGKYAADAYAIFC G WD V P DHML  YW+Y+RI
Sbjct: 287  LHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLRI 325


>gb|OVA19139.1| HhH-GPD domain [Macleaya cordata]
          Length = 366

 Score =  221 bits (562), Expect = 1e-64
 Identities = 112/188 (59%), Positives = 137/188 (72%), Gaps = 6/188 (3%)
 Frame = -3

Query: 822 VSKYFKKR-LKVKESKIKSNSRRKNPVR----VKP-LSKDEMYLDAYERKTSDNNWKPPC 661
           VS+YF+K   K  E+K +S   +K  V     V P LS  E   +AY+RKT DN WKPP 
Sbjct: 173 VSRYFRKNESKDGENKQQSELLKKKKVGRRKVVSPTLSAAEKLDEAYKRKTLDNTWKPPP 232

Query: 660 SPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRV 481
           S + L QE+H  DPWRV+VIC+LLNRTTG Q ++VL D FKLCPDA++ T+V  EEI +V
Sbjct: 233 SHFTLIQEEHFEDPWRVIVICMLLNRTTGRQARRVLSDLFKLCPDAKTTTEVAIEEIEKV 292

Query: 480 IRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHM 301
           I+ LGL +KR+ M+QR S EYL + WTH T+L GVGKYAADAYAIFCTGKWD V PEDHM
Sbjct: 293 IQVLGLHKKRAKMIQRMSSEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPEDHM 352

Query: 300 LLKYWEYI 277
           L KYWE++
Sbjct: 353 LNKYWEFL 360


>ref|XP_016668086.1| PREDICTED: methyl-CpG-binding domain protein 4-like protein
            [Gossypium hirsutum]
          Length = 339

 Score =  219 bits (559), Expect = 2e-64
 Identities = 123/267 (46%), Positives = 159/267 (59%), Gaps = 3/267 (1%)
 Frame = -3

Query: 1068 RLSPYFQNNYANKTSHFSIGASQTHEESKNLKVGDSKYFLKNSIVEEAKVEIXXXXXXXX 889
            ++SPYFQ N   +    +    +     K LK G++ +  +N        E+        
Sbjct: 76   KVSPYFQGNCERQLKSITQVVYKGCSNEKLLKEGEN-FSKQNRKQRRTDAEVVKVSPYFQ 134

Query: 888  XXXXXXPKVGGPEINGSLGPPNV--SKYFKKRLKVKESKIKSNSRRKNPVRVKPL-SKDE 718
                   K  G   N  + P  +  S YF+K  +      K++        VKPL S  +
Sbjct: 135  SCEEKQKKTSG---NRKIKPRVLKQSPYFQKNNESLRKPRKTDE-------VKPLLSASQ 184

Query: 717  MYLDAYERKTSDNNWKPPCSPWHLFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFK 538
               +AY+RKT DN W PP S   L QEDH HDPWRVLVIC+LLNRTTG+QT+KVL DFF 
Sbjct: 185  KRDEAYQRKTVDNTWIPPRSDAPLLQEDHTHDPWRVLVICMLLNRTTGNQTRKVLSDFFT 244

Query: 537  LCPDAESCTQVPTEEIRRVIRSLGLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAAD 358
            +CPDA++ T+V TEEI + I++LGL RKR+ M+QR S+EYL + WTH TEL GVGKYAAD
Sbjct: 245  VCPDAKTATEVATEEIEKAIKTLGLQRKRAEMIQRMSQEYLWKEWTHVTELHGVGKYAAD 304

Query: 357  AYAIFCTGKWDEVIPEDHMLLKYWEYI 277
            AYAIFCTGK D V+P DHML  YW ++
Sbjct: 305  AYAIFCTGKGDRVMPTDHMLNHYWNFL 331


>ref|XP_020976904.1| uncharacterized protein LOC107639420 isoform X3 [Arachis ipaensis]
          Length = 889

 Score =  231 bits (588), Expect = 5e-64
 Identities = 111/184 (60%), Positives = 135/184 (73%), Gaps = 2/184 (1%)
 Frame = -3

Query: 822  VSKYFKKRLKVKESKIKS--NSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWH 649
            VS YF+ +  V   ++      R+K  +  K LS +E   +AY+++T DN WKPP SP++
Sbjct: 680  VSPYFQNQWLVGCEQVDKAKKGRKKCNIINKQLSAEEKKDEAYKKRTPDNTWKPPRSPFN 739

Query: 648  LFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSL 469
            L QE HA+DPWRVLVIC+LLNRTTG Q   V+LD F LCPDA+SCTQV  E+I  +I+SL
Sbjct: 740  LLQEPHAYDPWRVLVICMLLNRTTGGQAGPVILDLFNLCPDAKSCTQVEQEKIEEIIKSL 799

Query: 468  GLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKY 289
            GL +KRS MLQRFS EYL  NWTH T+L GVGKYAADAYAIFCTGKWD V P DHML  Y
Sbjct: 800  GLQKKRSRMLQRFSEEYLNGNWTHVTQLHGVGKYAADAYAIFCTGKWDRVTPTDHMLNHY 859

Query: 288  WEYI 277
            WE++
Sbjct: 860  WEFL 863


>ref|XP_020996176.1| uncharacterized protein LOC107484588 isoform X3 [Arachis duranensis]
          Length = 890

 Score =  231 bits (588), Expect = 5e-64
 Identities = 111/184 (60%), Positives = 135/184 (73%), Gaps = 2/184 (1%)
 Frame = -3

Query: 822  VSKYFKKRLKVKESKIKS--NSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWH 649
            VS YF+ +  V   ++      R+K  +  K LS +E   +AY+++T DN WKPP SP++
Sbjct: 681  VSPYFQNQWLVGCEQVDKAKKGRKKCNIINKQLSAEEKKDEAYKKRTPDNTWKPPRSPFN 740

Query: 648  LFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSL 469
            L QE HA+DPWRVLVIC+LLNRTTG Q   V+LD F LCPDA+SCTQV  E+I  +I+SL
Sbjct: 741  LLQEPHAYDPWRVLVICMLLNRTTGGQAGPVILDLFNLCPDAKSCTQVEQEKIEEIIKSL 800

Query: 468  GLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKY 289
            GL +KRS MLQRFS EYL  NWTH T+L GVGKYAADAYAIFCTGKWD V P DHML  Y
Sbjct: 801  GLQKKRSRMLQRFSEEYLNGNWTHVTQLHGVGKYAADAYAIFCTGKWDRVTPTDHMLNHY 860

Query: 288  WEYI 277
            WE++
Sbjct: 861  WEFL 864


>ref|XP_015960626.1| uncharacterized protein LOC107484588 isoform X2 [Arachis duranensis]
          Length = 939

 Score =  231 bits (588), Expect = 7e-64
 Identities = 111/184 (60%), Positives = 135/184 (73%), Gaps = 2/184 (1%)
 Frame = -3

Query: 822  VSKYFKKRLKVKESKIKS--NSRRKNPVRVKPLSKDEMYLDAYERKTSDNNWKPPCSPWH 649
            VS YF+ +  V   ++      R+K  +  K LS +E   +AY+++T DN WKPP SP++
Sbjct: 730  VSPYFQNQWLVGCEQVDKAKKGRKKCNIINKQLSAEEKKDEAYKKRTPDNTWKPPRSPFN 789

Query: 648  LFQEDHAHDPWRVLVICLLLNRTTGSQTKKVLLDFFKLCPDAESCTQVPTEEIRRVIRSL 469
            L QE HA+DPWRVLVIC+LLNRTTG Q   V+LD F LCPDA+SCTQV  E+I  +I+SL
Sbjct: 790  LLQEPHAYDPWRVLVICMLLNRTTGGQAGPVILDLFNLCPDAKSCTQVEQEKIEEIIKSL 849

Query: 468  GLWRKRSAMLQRFSREYLAENWTHATELPGVGKYAADAYAIFCTGKWDEVIPEDHMLLKY 289
            GL +KRS MLQRFS EYL  NWTH T+L GVGKYAADAYAIFCTGKWD V P DHML  Y
Sbjct: 850  GLQKKRSRMLQRFSEEYLNGNWTHVTQLHGVGKYAADAYAIFCTGKWDRVTPTDHMLNHY 909

Query: 288  WEYI 277
            WE++
Sbjct: 910  WEFL 913

