BLASTX nr result

ID: Wisteria21_contig00002716 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Wisteria21_contig00002716
         (1760 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   613   e-172
ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc...   563   e-157
ref|XP_013462123.1| HhH-GPD base excision DNA repair family prot...   560   e-156
ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phas...   560   e-156
ref|XP_014501651.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   553   e-154
ref|XP_013462124.1| HhH-GPD base excision DNA repair family prot...   545   e-152
gb|KOM51587.1| hypothetical protein LR48_Vigan09g024600 [Vigna a...   543   e-151
ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   529   e-147
gb|ACU22727.1| unknown [Glycine max]                                  528   e-147
ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro...   510   e-141
ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus not...   499   e-138
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   499   e-138
ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr...   490   e-135
ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glyc...   489   e-135
gb|KDO84582.1| hypothetical protein CISIN_1g039604mg [Citrus sin...   489   e-135
ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   489   e-135
ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633...   487   e-134
ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   484   e-133
ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   483   e-133
ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glyc...   483   e-133

>ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cicer arietinum]
          Length = 384

 Score =  613 bits (1580), Expect = e-172
 Identities = 317/384 (82%), Positives = 338/384 (88%), Gaps = 9/384 (2%)
 Frame = -1

Query: 1439 MGEQTRTQAQ--SLTRTQIEPLPQ--PQGASSVAPTTTTLAASIVPVESELSNVPPQTNS 1272
            MGE+T+ Q Q  +L  T+IEP PQ  PQ ASS      T A +I+PVESELSNVPP  NS
Sbjct: 1    MGEETQIQPQPQTLIGTEIEPQPQSQPQEASSNNTVAATAAGAIIPVESELSNVPPHINS 60

Query: 1271 PASKIPLRPRKIRKVSPDPTT-SESQTETPKASTSTGGKTCGRN-NKTVQQQRALVVPRM 1098
            PA+KIPLRPRKIRKVSPDPTT SESQ+ETPK++TST GK+CGR+ NK+VQQQRAL+VPR+
Sbjct: 61   PATKIPLRPRKIRKVSPDPTTTSESQSETPKSATSTAGKSCGRHSNKSVQQQRALIVPRI 120

Query: 1097 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKA 918
            VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KA
Sbjct: 121  VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKA 180

Query: 917  GTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDST 738
            GTSIYTRFIALCGGEA VVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDS 
Sbjct: 181  GTSIYTRFIALCGGEAGVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSA 240

Query: 737  IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPR 558
            IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNLEDLPR
Sbjct: 241  IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLEDLPR 300

Query: 557  PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL---XXXXXXXXXXXXX 387
            PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L                
Sbjct: 301  PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQHQLEQHQQQQQQQQ 360

Query: 386  XXXXQLLDPINSMFNLGAACAWGQ 315
                QL+DP+NSMFN+GAACAWGQ
Sbjct: 361  HSQQQLMDPMNSMFNIGAACAWGQ 384


>ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine
            max] gi|947112855|gb|KRH61157.1| hypothetical protein
            GLYMA_04G031600 [Glycine max]
          Length = 374

 Score =  563 bits (1450), Expect = e-157
 Identities = 302/390 (77%), Positives = 314/390 (80%), Gaps = 15/390 (3%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT  QAQSL    IEP P P  +S+  P   T       V+SEL+NVP  T SPA+K
Sbjct: 1    MGEQTLGQAQSL----IEPQPLPAPSSTAVPDGAT-------VDSELNNVPRPTTSPATK 49

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083
            IPLRPRKIRKVSPDP+TSESQTETPK +     KT GRN       RAL VVPR+VARSL
Sbjct: 50   IPLRPRKIRKVSPDPSTSESQTETPKPA-----KTGGRNTTKAAPPRALTVVPRIVARSL 104

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            SC+GEVEIALRYLRNADP+LSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY
Sbjct: 105  SCDGEVEIALRYLRNADPVLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 164

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD
Sbjct: 165  TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 224

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD
Sbjct: 225  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 284

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL--------------XXXXXXX 405
            QLC+KWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L                     
Sbjct: 285  QLCDKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQQHHQHHHQHQQQEQQQQQ 344

Query: 404  XXXXXXXXXXQLLDPINSMFNLGAACAWGQ 315
                      QLLDPINSMFNLGAACAWGQ
Sbjct: 345  QQQQQHPPQPQLLDPINSMFNLGAACAWGQ 374


>ref|XP_013462123.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula]
            gi|657396011|gb|KEH36158.1| HhH-GPD base excision DNA
            repair family protein [Medicago truncatula]
          Length = 377

 Score =  560 bits (1444), Expect = e-156
 Identities = 287/373 (76%), Positives = 314/373 (84%), Gaps = 12/373 (3%)
 Frame = -1

Query: 1397 TQIEPLPQPQGASSVAPTTTTLAA----SIVPVESELSNVPPQTNSPASKIPLRPRKIRK 1230
            TQ +P PQPQG SS    + T A     +I+PVESELSNVPP T +PA+K+PLRPRKIRK
Sbjct: 5    TQQQPQPQPQGVSSDNTISATTAVDSVQTIIPVESELSNVPPHTKAPATKMPLRPRKIRK 64

Query: 1229 VSPDPTTSESQTETPKASTSTG-GKTCGRNNKTVQ--QQRALVVPRMVARSLSCEGEVEI 1059
            VSPDPTTSESQ+ET K   ST  GK+ GRNNKTVQ  QQR L VP++V RSLSCEGEVEI
Sbjct: 65   VSPDPTTSESQSETLKPPNSTAAGKSNGRNNKTVQPPQQRTLAVPKIVPRSLSCEGEVEI 124

Query: 1058 ALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCG 879
            A+RYLR+ADPLLSPLIDIHQPP+FDNF TPFLALTRSILYQQLA+KAGTSIYTRFIALCG
Sbjct: 125  AIRYLRSADPLLSPLIDIHQPPTFDNFQTPFLALTRSILYQQLAFKAGTSIYTRFIALCG 184

Query: 878  GEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMDDKSLFTML 699
            GEA VVP+ VLAL  QQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMDDKSLFTML
Sbjct: 185  GEAGVVPDNVLALTAQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTML 244

Query: 698  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRP 519
            TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNL+DLPRPSQMDQLCEKW+P
Sbjct: 245  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLDDLPRPSQMDQLCEKWKP 304

Query: 518  YRSVASWYMWRFVEAKGTPSSAVAVATG-----ASLXXXXXXXXXXXXXXXXXQLLDPIN 354
            YRSVASWY+WRFVEAKG+PS+AVAVATG       L                  ++DP+N
Sbjct: 305  YRSVASWYLWRFVEAKGSPSTAVAVATGNGLQQHELDHHQQQQQQQQQQHSQQPIMDPMN 364

Query: 353  SMFNLGAACAWGQ 315
            +MFN+GAACAWGQ
Sbjct: 365  NMFNMGAACAWGQ 377


>ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris]
            gi|561009684|gb|ESW08591.1| hypothetical protein
            PHAVU_009G058200g [Phaseolus vulgaris]
          Length = 366

 Score =  560 bits (1444), Expect = e-156
 Identities = 300/381 (78%), Positives = 311/381 (81%), Gaps = 6/381 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGE T  QAQSL    IEP P P  +SS A      A      +SEL+NV P  NSPASK
Sbjct: 1    MGEHTLGQAQSL----IEPQPHPVPSSSAA------APDGAQADSELNNVLPHANSPASK 50

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083
            IPLRPRKIRKVSPDP+TSESQTE PK      GK+ GR+ K V   R + V+PR+VARSL
Sbjct: 51   IPLRPRKIRKVSPDPSTSESQTEPPKP-----GKSGGRSTKHVPPSRGMSVLPRLVARSL 105

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            SCEGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY
Sbjct: 106  SCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD
Sbjct: 166  TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 225

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD
Sbjct: 226  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXXX 378
             LCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L                     
Sbjct: 286  HLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHHQHQQHEQQQQQHPPQ 345

Query: 377  XQLLDPINSMFNLGAACAWGQ 315
             QLLDPINSMFNLGAACAWGQ
Sbjct: 346  PQLLDPINSMFNLGAACAWGQ 366


>ref|XP_014501651.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vigna radiata var.
            radiata]
          Length = 367

 Score =  553 bits (1425), Expect = e-154
 Identities = 297/382 (77%), Positives = 309/382 (80%), Gaps = 7/382 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGE T  QAQSL    IEP P P      AP+++         +SEL+NV P  NSPASK
Sbjct: 1    MGEHTLGQAQSL----IEPQPHP------APSSSAAGPDGAQADSELNNVLPHVNSPASK 50

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083
            IPLRPRKIRKVSPDP+TSES TE  K      GK  GR+ K V   RA+ VVPR+VARSL
Sbjct: 51   IPLRPRKIRKVSPDPSTSESLTEPSKP-----GKNGGRSTKHVPPSRAMTVVPRLVARSL 105

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            S EGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY
Sbjct: 106  SYEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD+ IVNMD
Sbjct: 166  TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDAAIVNMD 225

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD
Sbjct: 226  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL------XXXXXXXXXXXXXXX 381
            QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L                     
Sbjct: 286  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHHQHQQHEQQQQQQHPP 345

Query: 380  XXQLLDPINSMFNLGAACAWGQ 315
              QLLDPINSMFNLGAACAWGQ
Sbjct: 346  QPQLLDPINSMFNLGAACAWGQ 367


>ref|XP_013462124.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula]
            gi|657396012|gb|KEH36159.1| HhH-GPD base excision DNA
            repair family protein [Medicago truncatula]
          Length = 375

 Score =  545 bits (1405), Expect = e-152
 Identities = 281/368 (76%), Positives = 308/368 (83%), Gaps = 12/368 (3%)
 Frame = -1

Query: 1397 TQIEPLPQPQGASSVAPTTTTLAA----SIVPVESELSNVPPQTNSPASKIPLRPRKIRK 1230
            TQ +P PQPQG SS    + T A     +I+PVESELSNVPP T +PA+K+PLRPRKIRK
Sbjct: 5    TQQQPQPQPQGVSSDNTISATTAVDSVQTIIPVESELSNVPPHTKAPATKMPLRPRKIRK 64

Query: 1229 VSPDPTTSESQTETPKASTSTG-GKTCGRNNKTVQ--QQRALVVPRMVARSLSCEGEVEI 1059
            VSPDPTTSESQ+ET K   ST  GK+ GRNNKTVQ  QQR L VP++V RSLSCEGEVEI
Sbjct: 65   VSPDPTTSESQSETLKPPNSTAAGKSNGRNNKTVQPPQQRTLAVPKIVPRSLSCEGEVEI 124

Query: 1058 ALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCG 879
            A+RYLR+ADPLLSPLIDIHQPP+FDNF TPFLALTRSILYQQLA+KAGTSIYTRFIALCG
Sbjct: 125  AIRYLRSADPLLSPLIDIHQPPTFDNFQTPFLALTRSILYQQLAFKAGTSIYTRFIALCG 184

Query: 878  GEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMDDKSLFTML 699
            GEA VVP+ VLAL  QQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMDDKSLFTML
Sbjct: 185  GEAGVVPDNVLALTAQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTML 244

Query: 698  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRP 519
            TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNL+DLPRPSQMDQLCEKW+P
Sbjct: 245  TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLDDLPRPSQMDQLCEKWKP 304

Query: 518  YRSVASWYMWRFVEAKGTPSSAVAVATG-----ASLXXXXXXXXXXXXXXXXXQLLDPIN 354
            YRSVASWY+WRFVEAKG+PS+AVAVATG       L                  ++DP+N
Sbjct: 305  YRSVASWYLWRFVEAKGSPSTAVAVATGNGLQQHELDHHQQQQQQQQQQHSQQPIMDPMN 364

Query: 353  SMFNLGAA 330
            +MFN+G A
Sbjct: 365  NMFNMGCA 372


>gb|KOM51587.1| hypothetical protein LR48_Vigan09g024600 [Vigna angularis]
          Length = 362

 Score =  543 bits (1400), Expect = e-151
 Identities = 293/377 (77%), Positives = 305/377 (80%), Gaps = 9/377 (2%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGE T  QAQSL    IEP P P      AP+++         +SEL+NV P  NSPASK
Sbjct: 1    MGEHTLGQAQSL----IEPQPHP------APSSSAAGPDGAQADSELNNVLPHVNSPASK 50

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083
            IPLRPRKIRKVSPDP+TSES TE  K      GK+ GR+ K V   RA+ VVPR+VARSL
Sbjct: 51   IPLRPRKIRKVSPDPSTSESLTEPSKP-----GKSGGRSTKHVPPSRAMAVVPRLVARSL 105

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            SCEGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY
Sbjct: 106  SCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD
Sbjct: 166  TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 225

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD
Sbjct: 226  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL--------XXXXXXXXXXXXX 387
            QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L                     
Sbjct: 286  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQHQHHHQHQQHEQQQQQQQQH 345

Query: 386  XXXXQLLDPINSMFNLG 336
                QLLDPINSMFNLG
Sbjct: 346  PPQPQLLDPINSMFNLG 362


>ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max]
            gi|947103464|gb|KRH51847.1| hypothetical protein
            GLYMA_06G031700 [Glycine max]
          Length = 351

 Score =  529 bits (1363), Expect = e-147
 Identities = 283/382 (74%), Positives = 298/382 (78%), Gaps = 7/382 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT  QAQ        PLP P  A++                SEL+NVP  T SPA+K
Sbjct: 1    MGEQTLGQAQ--------PLPAPDAATA---------------HSELNNVPQPTTSPATK 37

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRN--NKTVQQQRALVVPRMVARS 1086
            IPLRPRKIRKVSPDP+TSE+  +          K  GRN  +K    +   VVPR+VARS
Sbjct: 38   IPLRPRKIRKVSPDPSTSEAPIKP--------AKPVGRNTTSKAAPPRALTVVPRIVARS 89

Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906
            LSC+GEVEI+LRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KAGTSI
Sbjct: 90   LSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSI 149

Query: 905  YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726
            YTRFI LCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNM
Sbjct: 150  YTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNM 209

Query: 725  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546
            DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM
Sbjct: 210  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 269

Query: 545  DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXX 381
            DQLC+KWRPYRSVASWYMWRFVEAKGTPSSAV VATGA L                    
Sbjct: 270  DQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQQEQQQQQHAP 329

Query: 380  XXQLLDPINSMFNLGAACAWGQ 315
              QLLDPINSMFNLGAACAWGQ
Sbjct: 330  QPQLLDPINSMFNLGAACAWGQ 351


>gb|ACU22727.1| unknown [Glycine max]
          Length = 351

 Score =  528 bits (1359), Expect = e-147
 Identities = 282/382 (73%), Positives = 297/382 (77%), Gaps = 7/382 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT  QAQ        PLP P  A++                SEL+NVP  T SPA+K
Sbjct: 1    MGEQTLGQAQ--------PLPAPDAATA---------------HSELNNVPQPTTSPATK 37

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRN--NKTVQQQRALVVPRMVARS 1086
            IPLRPRKIRKVSPDP+TSE+  +          K  GRN  +K    +   VVPR+VARS
Sbjct: 38   IPLRPRKIRKVSPDPSTSEAPIKP--------AKPVGRNTTSKAAPPRALTVVPRIVARS 89

Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906
            LSC+GEVEI+LRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KAGTSI
Sbjct: 90   LSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSI 149

Query: 905  YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726
            YTRFI LCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNM
Sbjct: 150  YTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNM 209

Query: 725  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546
            DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM
Sbjct: 210  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 269

Query: 545  DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXX 381
            DQLC+KWRPYRSVASWYMWRFVEAKGTPSSAV VATGA L                    
Sbjct: 270  DQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQQEQQQQQHAP 329

Query: 380  XXQLLDPINSMFNLGAACAWGQ 315
              QLLDPINSMFNLGA CAWGQ
Sbjct: 330  QPQLLDPINSMFNLGAVCAWGQ 351


>ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao]
            gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily
            protein [Theobroma cacao]
          Length = 397

 Score =  510 bits (1314), Expect = e-141
 Identities = 274/377 (72%), Positives = 301/377 (79%), Gaps = 6/377 (1%)
 Frame = -1

Query: 1427 TRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASKIPLR 1248
            T TQ QS  +TQ +    P   S+ A +TT  +A +    +EL+NVPPQT+SP SKIP R
Sbjct: 25   TPTQEQSQGQTQTQ---NPNNTSNAAVSTTVTSAVVTSAPTELTNVPPQTSSPPSKIPFR 81

Query: 1247 PRKIRKVSPDPT--TSESQTETPKASTSTGG-KTCGRNNKT-VQQQRAL-VVPRMVARSL 1083
            PRKIRK+SPDP   T+ SQ  T  A+++T   KT  +  KT + Q RAL VVPR++ARSL
Sbjct: 82   PRKIRKLSPDPNSDTNASQQATTSATSATEPPKTVAKTPKTKLTQHRALAVVPRIMARSL 141

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            SCEGEVE A+R+LRNADPLL+ LIDIH PP+FD FHTPFLALTRSILYQQLA+KAGTSIY
Sbjct: 142  SCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIY 201

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
             RFIALCGGE  VVPETVL+L  QQLRQIGVSGRKASYLHDLARKYQ GILSDS IVNMD
Sbjct: 202  NRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMD 261

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE+LPRPSQMD
Sbjct: 262  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMD 321

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-XXXXXXXXXXXXXXXXXQLL 366
            QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GASL                  QLL
Sbjct: 322  QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGASLPPPQQEEQQQHQQHQQQPQLL 381

Query: 365  DPINSMFNLGAACAWGQ 315
            DPINS+ NLG ACAWGQ
Sbjct: 382  DPINSILNLG-ACAWGQ 397


>ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
            gi|587903719|gb|EXB91937.1| DNA-3-methyladenine
            glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  499 bits (1285), Expect = e-138
 Identities = 257/344 (74%), Positives = 278/344 (80%), Gaps = 6/344 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT+TQ Q+    Q     Q   +S V   +TT  A      +ELSN P QT+SP SK
Sbjct: 1    MGEQTQTQTQTQQPQQHHGQTQESSSSMVTSISTTTIAPSSTAPTELSNAPSQTSSPPSK 60

Query: 1259 IPLRPRKIRKVSPDPTTSESQT-----ETPKASTSTGGKTCGRNNKTVQQQR-ALVVPRM 1098
            IPLRPRKIRK+SPD + S+S       E PK S +          K VQQ+  A+  PR+
Sbjct: 61   IPLRPRKIRKLSPDDSDSKSSQVVAVPENPKPSPTAAAAAKPAKAKIVQQRALAIAAPRI 120

Query: 1097 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKA 918
            VARSLSCEGEVE+ALR+LR ADPLL+PLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKA
Sbjct: 121  VARSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKA 180

Query: 917  GTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDST 738
            GTSIYTRFIALCGGE  VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS 
Sbjct: 181  GTSIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSA 240

Query: 737  IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPR 558
            IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE+LPR
Sbjct: 241  IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPR 300

Query: 557  PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL 426
            PSQMDQLCEKWRPYRSVA+WYMWRFVE KG P +A  VA GA+L
Sbjct: 301  PSQMDQLCEKWRPYRSVAAWYMWRFVEQKGAPPNAATVAVGANL 344


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  499 bits (1285), Expect = e-138
 Identities = 265/383 (69%), Positives = 297/383 (77%), Gaps = 8/383 (2%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPP 1284
            M EQT++Q Q+    Q EP  QP        +TTTLA  ++PV++E +N        V P
Sbjct: 1    MVEQTQSQTQNQPEPQPEPETQPPPNQD---STTTLA--VIPVQTETANNATITHANVTP 55

Query: 1283 QTNSPASKIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVP 1104
            QT+SP SKIPLRPRKIRK+SPD    ++ +  P  S+        ++    QQQ+ L VP
Sbjct: 56   QTSSPPSKIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVP 115

Query: 1103 RMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAY 924
            R++AR LS EGEVE A+R+LRNAD  L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+
Sbjct: 116  RIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAF 175

Query: 923  KAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSD 744
            KAGTSIYTRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD
Sbjct: 176  KAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSD 235

Query: 743  STIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDL 564
            S IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+L
Sbjct: 236  SAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEEL 295

Query: 563  PRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXX 384
            PRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L              
Sbjct: 296  PRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQ 349

Query: 383  XXXQLLDPINSMFNLGAACAWGQ 315
               QLLD INS+ N+G ACAWGQ
Sbjct: 350  QQPQLLDQINSLINIG-ACAWGQ 371


>ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina]
            gi|557537126|gb|ESR48244.1| hypothetical protein
            CICLE_v10001539mg [Citrus clementina]
          Length = 373

 Score =  490 bits (1261), Expect = e-135
 Identities = 256/369 (69%), Positives = 288/369 (78%), Gaps = 8/369 (2%)
 Frame = -1

Query: 1418 QAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPPQTNSPAS 1263
            Q QS T+ Q EP P+P+        +TT A +++PV+SE +N        V PQT+SP S
Sbjct: 4    QTQSQTQNQPEPQPEPETQPPPNQDSTT-ALAVIPVQSETANNATITHANVTPQTSSPPS 62

Query: 1262 KIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMVARSL 1083
            KIPLRPRKIRK+SPD    ++ +  P  S+        ++    QQQ+ L VPR++AR L
Sbjct: 63   KIPLRPRKIRKLSPDNGVDQTSSSQPTESSKATSAKSTKSRAIQQQQQTLTVPRIIARPL 122

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            S EGEVE A+R+LRNAD  L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+KAGTSIY
Sbjct: 123  SSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIY 182

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD
Sbjct: 183  TRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 242

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+LPRPSQMD
Sbjct: 243  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMD 302

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQLLD 363
            QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L                 QLLD
Sbjct: 303  QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQQQPQLLD 356

Query: 362  PINSMFNLG 336
             INS+ N+G
Sbjct: 357  QINSLINIG 365


>ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Gossypium raimondii] gi|763791263|gb|KJB58259.1|
            hypothetical protein B456_009G201500 [Gossypium
            raimondii]
          Length = 395

 Score =  489 bits (1258), Expect = e-135
 Identities = 268/397 (67%), Positives = 298/397 (75%), Gaps = 22/397 (5%)
 Frame = -1

Query: 1439 MGEQTRTQAQ------------SLTRTQIEPLPQPQGASSVAP--TTTTLAASIVPV-ES 1305
            MGEQT +Q Q            + T+ Q++        SS AP  T TT   +IV    +
Sbjct: 1    MGEQTPSQPQPQVQSQPPNDSSTTTQAQVQTQSGDPNNSSTAPVSTVTTACTAIVACGPT 60

Query: 1304 ELSNVPPQTNSPASKIPLRPRKIRKVSPD----PTTSESQTETPKASTSTGGKTCGRNNK 1137
            EL NVP  T SP SKIP RPRKIRK+SPD    P  S+  T +   S +   KT GR +K
Sbjct: 61   ELVNVPLSTLSPPSKIPSRPRKIRKLSPDLSFDPNASQQATTSSSTSLTEQRKTVGRTSK 120

Query: 1136 T-VQQQRALVV--PRMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPF 966
            T + Q RAL V  PR+++RSLSCEGEVE A+ +LR+ADPLL+ LID+H PP+FD FH PF
Sbjct: 121  TKLSQHRALAVVAPRIISRSLSCEGEVENAIHHLRDADPLLASLIDLHPPPTFDTFHAPF 180

Query: 965  LALTRSILYQQLAYKAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYL 786
            LALTRSILYQQLA+KAGTSIYTRFI+LCGGE  VVPETVL+L  QQLRQIGVSGRKASYL
Sbjct: 181  LALTRSILYQQLAFKAGTSIYTRFISLCGGENGVVPETVLSLTSQQLRQIGVSGRKASYL 240

Query: 785  HDLARKYQNGILSDSTIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 606
            HDLARKYQ GILSDS IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG
Sbjct: 241  HDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 300

Query: 605  VRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL 426
            VRKGVQLLYNLE+LPRPSQMDQLCEKWRPYRSVASWY+WR+VEAKG PSSA AVA GASL
Sbjct: 301  VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRYVEAKGAPSSAAAVAAGASL 360

Query: 425  XXXXXXXXXXXXXXXXXQLLDPINSMFNLGAACAWGQ 315
                             QL+DPINS+ NLG ACAWGQ
Sbjct: 361  -PPLQQQEEPQQHQQQPQLMDPINSILNLG-ACAWGQ 395


>gb|KDO84582.1| hypothetical protein CISIN_1g039604mg [Citrus sinensis]
          Length = 373

 Score =  489 bits (1258), Expect = e-135
 Identities = 255/369 (69%), Positives = 288/369 (78%), Gaps = 8/369 (2%)
 Frame = -1

Query: 1418 QAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPPQTNSPAS 1263
            Q QS T+ Q EP P+P+        +TT A +++PV++E +N        V PQT+SP S
Sbjct: 4    QTQSQTQNQPEPQPEPETQPPPNQDSTT-ALAVIPVQTETANNATITHANVTPQTSSPPS 62

Query: 1262 KIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMVARSL 1083
            KIPLRPRKIRK+SPD    ++ +  P  S+        ++    QQQ+ L VPR++AR L
Sbjct: 63   KIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVPRIIARPL 122

Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903
            S EGEVE A+R+LRNAD  L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+KAGTSIY
Sbjct: 123  SSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIY 182

Query: 902  TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723
            TRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD
Sbjct: 183  TRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 242

Query: 722  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543
            DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+LPRPSQMD
Sbjct: 243  DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMD 302

Query: 542  QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQLLD 363
            QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L                 QLLD
Sbjct: 303  QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQQQPQLLD 356

Query: 362  PINSMFNLG 336
             INS+ N+G
Sbjct: 357  QINSLINIG 365


>ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus
            sinensis]
          Length = 373

 Score =  489 bits (1258), Expect = e-135
 Identities = 259/376 (68%), Positives = 291/376 (77%), Gaps = 8/376 (2%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPP 1284
            M EQT++Q Q+    Q EP  QP        +TTTLA  ++PV++E +N        V P
Sbjct: 1    MVEQTQSQTQNQPEPQPEPETQPPPNQD---STTTLA--VIPVQTETANNATITHANVTP 55

Query: 1283 QTNSPASKIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVP 1104
            QT+SP SKIPLRPRKIRK+SPD    ++ +  P  S+        ++    QQQ+ L VP
Sbjct: 56   QTSSPPSKIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVP 115

Query: 1103 RMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAY 924
            R++AR LS EGEVE A+R+LRNAD  L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+
Sbjct: 116  RIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAF 175

Query: 923  KAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSD 744
            KAGTSIYTRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD
Sbjct: 176  KAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSD 235

Query: 743  STIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDL 564
            S IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+L
Sbjct: 236  SAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEEL 295

Query: 563  PRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXX 384
            PRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L              
Sbjct: 296  PRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQ 349

Query: 383  XXXQLLDPINSMFNLG 336
               QLLD INS+ N+G
Sbjct: 350  QQPQLLDQINSLINIG 365


>ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633802 [Jatropha curcas]
            gi|643731174|gb|KDP38512.1| hypothetical protein
            JCGZ_04437 [Jatropha curcas]
          Length = 406

 Score =  487 bits (1253), Expect = e-134
 Identities = 260/385 (67%), Positives = 302/385 (78%), Gaps = 7/385 (1%)
 Frame = -1

Query: 1448 HTHMGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSP 1269
            H+H   Q+++  ++  +TQ +P P     ++   TTT          +EL+ +P QT SP
Sbjct: 40   HSHSQPQSQSPIKAQVQTQTQPQPLHDSTTTSTITTT----------NELTTIPQQTVSP 89

Query: 1268 ASKIP-LRPRKIRKVSPDPT-TSESQTETPKASTSTGG--KTCGRNNKT-VQQQRALVV- 1107
             +KIP  RPRKIRK+SPD T T+ +   + + +T+T    KT  ++ KT + Q +A+VV 
Sbjct: 90   PAKIPPSRPRKIRKLSPDDTATTATDPNSSQLTTTTNEPPKTTAKSAKTRIAQTKAIVVA 149

Query: 1106 -PRMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQL 930
             PR++ RSLSCEGEVE A+R+LR+ADPLL+ LID+H PP+FD FHTPFLALTRSILYQQL
Sbjct: 150  PPRIIPRSLSCEGEVENAIRHLRDADPLLASLIDLHPPPTFDTFHTPFLALTRSILYQQL 209

Query: 929  AYKAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGIL 750
            A+KAGTSIYTRFIALCGGEA V+P TVL+L PQQLRQIGVSGRKASYLHDLARKY NGIL
Sbjct: 210  AFKAGTSIYTRFIALCGGEAGVLPGTVLSLTPQQLRQIGVSGRKASYLHDLARKYHNGIL 269

Query: 749  SDSTIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE 570
            SD+ IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE
Sbjct: 270  SDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE 329

Query: 569  DLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXX 390
            DLPRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG+PSSAVAVATGA +            
Sbjct: 330  DLPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGSPSSAVAVATGAGM-------TQQQQ 382

Query: 389  XXXXXQLLDPINSMFNLGAACAWGQ 315
                 QLLDPINS+ NLG ACAWGQ
Sbjct: 383  EEQQPQLLDPINSILNLG-ACAWGQ 406


>ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]
          Length = 379

 Score =  484 bits (1245), Expect = e-133
 Identities = 257/380 (67%), Positives = 283/380 (74%), Gaps = 5/380 (1%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT+ Q Q+ T++Q +P  Q Q     +  +TT  A    + SE+ N P Q +SP SK
Sbjct: 1    MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMV--ARS 1086
            +PLRPRKIRK+SP+ +   S              T   N     QQRA      V  ARS
Sbjct: 61   MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 120

Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906
            LSCEGEVEIALR+LRNADPLL+ LID+HQ P+FD+F TPFLALTRSILYQQLAYKAGTSI
Sbjct: 121  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 905  YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726
            YTRFIALCGGEA V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNM
Sbjct: 181  YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 725  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546
            DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL VRKGVQLLYNLE+LPRPSQM
Sbjct: 241  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 545  DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL---XXXXXXXXXXXXXXXXX 375
            DQLCEKWRPYRSV SWYMWR  EAKG  SSA AVA GASL                    
Sbjct: 301  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQDHHQEHQHPQHPQQP 360

Query: 374  QLLDPINSMFNLGAACAWGQ 315
            QLLDP+N + NLG ACAWGQ
Sbjct: 361  QLLDPLNGILNLG-ACAWGQ 379


>ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis sativus]
            gi|700204833|gb|KGN59966.1| hypothetical protein
            Csa_3G857070 [Cucumis sativus]
          Length = 382

 Score =  483 bits (1244), Expect = e-133
 Identities = 258/383 (67%), Positives = 283/383 (73%), Gaps = 8/383 (2%)
 Frame = -1

Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260
            MGEQT+ Q Q+ T++Q +P  Q Q     +  +TT  A    + SE+ N P Q +SP SK
Sbjct: 1    MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMV--ARS 1086
            +PLRPRKIRK+SP+ +   S              T   N      QRA      V  ARS
Sbjct: 61   MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVPPARS 120

Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906
            LSCEGEVEIALR+LRNADPLL+ LID+HQ P+FD+F TPFLALTRSILYQQLAYKAGTSI
Sbjct: 121  LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 905  YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726
            YTRFIALCGGEA V+PETVLALNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD  IVNM
Sbjct: 181  YTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 725  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546
            DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL VRKGVQLLYNLE+LPRPSQM
Sbjct: 241  DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 545  DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL------XXXXXXXXXXXXXX 384
            DQLCEKWRPYRSV SWYMWR  EAKG  SSA AVA GASL                    
Sbjct: 301  DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHP 360

Query: 383  XXXQLLDPINSMFNLGAACAWGQ 315
               QLLDP+NS+ NLG ACAWGQ
Sbjct: 361  QQPQLLDPLNSILNLG-ACAWGQ 382


>ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Gossypium
            raimondii] gi|763792804|gb|KJB59800.1| hypothetical
            protein B456_009G273100 [Gossypium raimondii]
          Length = 396

 Score =  483 bits (1243), Expect = e-133
 Identities = 262/379 (69%), Positives = 293/379 (77%), Gaps = 7/379 (1%)
 Frame = -1

Query: 1430 QTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASKIPL 1251
            QT+ QAQ+LT+T+          ++ APT TT  A +V   +EL++  P T+SP SKIP 
Sbjct: 28   QTQRQAQTLTKTE----NSNDAFAAAAPTVTT--ALVVSASTELTDGSPLTSSPPSKIPS 81

Query: 1250 RPRKIRKVSPDPTTS-----ESQTETPKASTSTGGKTCGRNNKT-VQQQRALVV-PRMVA 1092
            RPRKIRK+SPD  +      ++ T T   S +   KT  R  K  + Q RALVV P+  A
Sbjct: 82   RPRKIRKLSPDSNSEPNASQQATTSTTSTSVAVPLKTVPRAPKAKLSQHRALVVAPQFFA 141

Query: 1091 RSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGT 912
            RSLSCEGEVE A+R+LRNADPLL+ LID+H PP+FD F TPFLALTRSILYQQLA+KAGT
Sbjct: 142  RSLSCEGEVETAVRHLRNADPLLASLIDLHPPPTFDTFQTPFLALTRSILYQQLAFKAGT 201

Query: 911  SIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIV 732
            SIYTRFIALCGGE  VVPETVL+L PQQLRQIGVSGRKASYLHDLARKYQ GILSDS IV
Sbjct: 202  SIYTRFIALCGGENGVVPETVLSLTPQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIV 261

Query: 731  NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 552
            NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG+RKGVQLLY+LE+LPRPS
Sbjct: 262  NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGIRKGVQLLYSLEELPRPS 321

Query: 551  QMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQ 372
            QMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GASL                 Q
Sbjct: 322  QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGASL---QPLPQEEHQHQQQPQ 378

Query: 371  LLDPINSMFNLGAACAWGQ 315
            LLD INS+ +LG AC WGQ
Sbjct: 379  LLDSINSILDLG-ACTWGQ 396

