BLASTX nr result

ID: Cornus23_contig00006985

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00006985
         (1867 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282344.1| PREDICTED: probable DNA-3-methyladenine glyc...   459   e-126
ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro...   442   e-121
ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   441   e-120
ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   437   e-119
ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601...   437   e-119
ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus not...   435   e-119
ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc...   433   e-118
ref|XP_010060954.1| PREDICTED: probable DNA-3-methyladenine glyc...   432   e-118
ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   432   e-118
ref|XP_008388056.1| PREDICTED: probable DNA-3-methyladenine glyc...   432   e-118
ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   431   e-117
gb|ACU22727.1| unknown [Glycine max]                                  431   e-117
ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633...   431   e-117
ref|XP_013462123.1| HhH-GPD base excision DNA repair family prot...   428   e-117
ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [R...   428   e-117
ref|XP_009364961.1| PREDICTED: probable DNA-3-methyladenine glyc...   428   e-117
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   428   e-117
ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phas...   427   e-116
ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glyc...   427   e-116
ref|XP_008388055.1| PREDICTED: DNA-3-methyladenine glycosylase i...   427   e-116

>ref|XP_002282344.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Vitis
            vinifera]
          Length = 363

 Score =  459 bits (1181), Expect = e-126
 Identities = 240/328 (73%), Positives = 257/328 (78%), Gaps = 2/328 (0%)
 Frame = -3

Query: 1388 APPQKPPS-TKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQPK 1212
            APP+   S + IPFRPRKIRK+SPD ++   S     SKT       + +KNK+V  +  
Sbjct: 44   APPENQSSASNIPFRPRKIRKISPDNSE---SKPAGDSKT-----AGKGAKNKLVPQRVP 95

Query: 1211 RIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQQ 1032
             +  +VAR+LSCEGEIEIALRHLRNADP LAPLIDLH PPTFDSFH PFLALTKSILYQQ
Sbjct: 96   AVPNMVARALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQ 155

Query: 1031 LAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNGI 852
            LAYKAGTSIYTRFV LCGGEAGV+P+TVLALTPHQLRQIGVSGRKASYLHDLARKYQNGI
Sbjct: 156  LAYKAGTSIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGI 215

Query: 851  LSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGL 672
            LSD+ I+ MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGL
Sbjct: 216  LSDTGIITMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGL 275

Query: 671  EELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK-XXXXXXXXXXXXXXSLXXXXXXXXX 495
            EELPRPSQMEQLCEKWRPYRSVASWY+WRFVE K                          
Sbjct: 276  EELPRPSQMEQLCEKWRPYRSVASWYIWRFVEGKGAPSSAAAVAGGPSLQQQQQQQEQQQ 335

Query: 494  XXXXXXXXXQFLDPINGILNLGACAWGQ 411
                     QFLDPINGILNLGACAWGQ
Sbjct: 336  QHQQQQHQQQFLDPINGILNLGACAWGQ 363


>ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao]
            gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily
            protein [Theobroma cacao]
          Length = 397

 Score =  442 bits (1137), Expect = e-121
 Identities = 231/337 (68%), Positives = 259/337 (76%), Gaps = 10/337 (2%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQK------DASKTPKT---TITTRAS 1242
            N PPQ   P +KIPFRPRKIRK+SPD   D  + Q+       A++ PKT   T  T+ +
Sbjct: 66   NVPPQTSSPPSKIPFRPRKIRKLSPDPNSDTNASQQATTSATSATEPPKTVAKTPKTKLT 125

Query: 1241 KNKIVASQPKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFL 1062
            +++ +A  P+    I+ARSLSCEGE+E A+RHLRNADPLLA LID+H PPTFD+FH PFL
Sbjct: 126  QHRALAVVPR----IMARSLSCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFL 181

Query: 1061 ALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLH 882
            ALT+SILYQQLA+KAGTSIY RF++LCGGE GV+P+TVL+LT  QLRQIGVSGRKASYLH
Sbjct: 182  ALTRSILYQQLAFKAGTSIYNRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLH 241

Query: 881  DLARKYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGV 702
            DLARKYQ GILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGV
Sbjct: 242  DLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGV 301

Query: 701  RKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSL 522
            RKGVQLLY LEELPRPSQM+QLCEKWRPYRSVASWY+WRFVE K              SL
Sbjct: 302  RKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRFVEAK-GAPSSAAAVAAGASL 360

Query: 521  XXXXXXXXXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                              Q LDPIN ILNLGACAWGQ
Sbjct: 361  PPPQQEEQQQHQQHQQQPQLLDPINSILNLGACAWGQ 397


>ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cicer arietinum]
          Length = 384

 Score =  441 bits (1133), Expect = e-120
 Identities = 228/336 (67%), Positives = 255/336 (75%), Gaps = 9/336 (2%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITT------RASKNK 1233
            N PP    P+TKIP RPRKIRKVSPD T   +S     S+TPK+  +T      R S   
Sbjct: 53   NVPPHINSPATKIPLRPRKIRKVSPDPTTTSESQ----SETPKSATSTAGKSCGRHSNKS 108

Query: 1232 IVASQPKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALT 1053
            +   +   +  IVARSLSCEGE+EIALR+LRNADPLL+PLID+H PPTFD+FH PFLALT
Sbjct: 109  VQQQRALIVPRIVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALT 168

Query: 1052 KSILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLA 873
            +SILYQQLA+KAGTSIYTRF++LCGGEAGV+P+TVLAL P QLRQIGVSGRKASYLHDLA
Sbjct: 169  RSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALNPQQLRQIGVSGRKASYLHDLA 228

Query: 872  RKYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKG 693
            RKYQNGILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKG
Sbjct: 229  RKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKG 288

Query: 692  VQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK-XXXXXXXXXXXXXXSLXX 516
            VQ+LY LE+LPRPSQM+QLCEKWRPYRSVASWYMWRFVE K                   
Sbjct: 289  VQILYNLEDLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQHQ 348

Query: 515  XXXXXXXXXXXXXXXXQFLDPINGILNLG-ACAWGQ 411
                            Q +DP+N + N+G ACAWGQ
Sbjct: 349  LEQHQQQQQQQQHSQQQLMDPMNSMFNIGAACAWGQ 384


>ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo]
          Length = 379

 Score =  437 bits (1124), Expect = e-119
 Identities = 225/332 (67%), Positives = 248/332 (74%), Gaps = 4/332 (1%)
 Frame = -3

Query: 1394 VNAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQ 1218
            +NAP Q   P +K+P RPRKIRK+SP+ +D   SH       PK   T +++K+K    +
Sbjct: 48   MNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQR 107

Query: 1217 PKRIRTIV--ARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSI 1044
                   V  ARSLSCEGE+EIALRHLRNADPLLA LIDLH  PTFDSF  PFLALT+SI
Sbjct: 108  AAFASATVPLARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSI 167

Query: 1043 LYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKY 864
            LYQQLAYKAGTSIYTRF++LCGGEAGV+P+TVL+L P QLRQIG+SGRK+SYLHDLARKY
Sbjct: 168  LYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKY 227

Query: 863  QNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQL 684
            QNGILSD AIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDL VRKGVQL
Sbjct: 228  QNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQL 287

Query: 683  LYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK-XXXXXXXXXXXXXXSLXXXXX 507
            LY LEELPRPSQM+QLCEKWRPYRSV SWYMWR  E K                L     
Sbjct: 288  LYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQDH 347

Query: 506  XXXXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                         Q LDP+NGILNLGACAWGQ
Sbjct: 348  HQEHQHPQHPQQPQLLDPLNGILNLGACAWGQ 379


>ref|XP_010263642.1| PREDICTED: uncharacterized protein LOC104601852 [Nelumbo nucifera]
          Length = 425

 Score =  437 bits (1123), Expect = e-119
 Identities = 225/340 (66%), Positives = 252/340 (74%), Gaps = 13/340 (3%)
 Frame = -3

Query: 1391 NAPPQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQ------------KDASKTPKTTITTR 1248
            +AP     STKIPFRPRKIRK S D + D   ++             D      T +TT 
Sbjct: 87   SAPQNSASSTKIPFRPRKIRKTSSDVSSDNSDNKIVDGECKTTATNGDHKTNNNTALTTT 146

Query: 1247 ASK-NKIVASQPKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHA 1071
            ++K ++IVA Q + +  +VAR+LSCEGE+ +AL+HLRN+DP LA LID+H PPTFDSFH 
Sbjct: 147  SNKKSRIVAKQVRVVPRVVARTLSCEGEVALALQHLRNSDPQLARLIDIHQPPTFDSFHP 206

Query: 1070 PFLALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKAS 891
            PFLALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGV+P+ VLAL+P QLRQIGVSGRKAS
Sbjct: 207  PFLALTKSILYQQLAYKAGTSIYTRFVSLCGGEAGVVPEAVLALSPQQLRQIGVSGRKAS 266

Query: 890  YLHDLARKYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVND 711
            YLHDLA KY+NGILSD++IV+MDDKSLFTMLTMV GIGSWSVHMFMIFSLHRPDVLPV D
Sbjct: 267  YLHDLANKYRNGILSDASIVDMDDKSLFTMLTMVKGIGSWSVHMFMIFSLHRPDVLPVGD 326

Query: 710  LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXX 531
            LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRF E K             
Sbjct: 327  LGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFAEAKGAPASAAAVAVGV 386

Query: 530  XSLXXXXXXXXXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                                 Q +DP+NGI NLGAC WGQ
Sbjct: 387  SQ-QQQLPPPPQQQQQPPPPPQLIDPMNGIANLGACTWGQ 425


>ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
            gi|587903719|gb|EXB91937.1| DNA-3-methyladenine
            glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  435 bits (1119), Expect = e-119
 Identities = 219/281 (77%), Positives = 238/281 (84%), Gaps = 7/281 (2%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRAS----KNKIV 1227
            NAP Q   P +KIP RPRKIRK+SPD +D   S      + PK + T  A+    K KIV
Sbjct: 49   NAPSQTSSPPSKIPLRPRKIRKLSPDDSDSKSSQVVAVPENPKPSPTAAAAAKPAKAKIV 108

Query: 1226 ASQPKRIRT--IVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALT 1053
              +   I    IVARSLSCEGE+E+ALRHLR ADPLLAPLID+H PPTFD+FH PFLALT
Sbjct: 109  QQRALAIAAPRIVARSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALT 168

Query: 1052 KSILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLA 873
            +SILYQQLAYKAGTSIYTRF++LCGGE GV+P+TVLALTP QLRQIGVSGRKASYLHDLA
Sbjct: 169  RSILYQQLAYKAGTSIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLA 228

Query: 872  RKYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKG 693
            RKYQNGILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKG
Sbjct: 229  RKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKG 288

Query: 692  VQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK 570
            VQLLY LEELPRPSQM+QLCEKWRPYRSVA+WYMWRFVE K
Sbjct: 289  VQLLYNLEELPRPSQMDQLCEKWRPYRSVAAWYMWRFVEQK 329


>ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine
            max] gi|947112855|gb|KRH61157.1| hypothetical protein
            GLYMA_04G031600 [Glycine max]
          Length = 374

 Score =  433 bits (1113), Expect = e-118
 Identities = 229/344 (66%), Positives = 254/344 (73%), Gaps = 17/344 (4%)
 Frame = -3

Query: 1391 NAP-PQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQP 1215
            N P P   P+TKIP RPRKIRKVSPD      S  +  ++TPK   T    +N   A+ P
Sbjct: 38   NVPRPTTSPATKIPLRPRKIRKVSPDP-----STSESQTETPKPAKT--GGRNTTKAAPP 90

Query: 1214 KRIRT---IVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSI 1044
            + +     IVARSLSC+GE+EIALR+LRNADP+L+PLID+H PPTFD+FH PFLALT+SI
Sbjct: 91   RALTVVPRIVARSLSCDGEVEIALRYLRNADPVLSPLIDIHQPPTFDNFHTPFLALTRSI 150

Query: 1043 LYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKY 864
            LYQQLAYKAGTSIYTRF++LCGGE GV+P+TVLALTP QLRQIGVSGRKASYLHDLARKY
Sbjct: 151  LYQQLAYKAGTSIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKY 210

Query: 863  QNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQL 684
            QNGILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQL
Sbjct: 211  QNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQL 270

Query: 683  LYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK------------XXXXXXXXXX 540
            LY LE+LPRPSQM+QLC+KWRPYRSVASWYMWRFVE K                      
Sbjct: 271  LYNLEDLPRPSQMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQQHHQ 330

Query: 539  XXXXSLXXXXXXXXXXXXXXXXXXQFLDPINGILNLG-ACAWGQ 411
                                    Q LDPIN + NLG ACAWGQ
Sbjct: 331  HHHQHQQQEQQQQQQQQQQHPPQPQLLDPINSMFNLGAACAWGQ 374


>ref|XP_010060954.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Eucalyptus
            grandis] gi|629102382|gb|KCW67851.1| hypothetical protein
            EUGRSUZ_F01572 [Eucalyptus grandis]
          Length = 380

 Score =  432 bits (1111), Expect = e-118
 Identities = 229/335 (68%), Positives = 257/335 (76%), Gaps = 6/335 (1%)
 Frame = -3

Query: 1397 PVNAPPQKP---PSTKIPFRPRKIRKVSPDT-TDDGKSHQKDASKTPKTT--ITTRASKN 1236
            P   PPQ+    P +KIP RP+KIRK+SP++ T D K     A    KT    +++ASKN
Sbjct: 47   PPPPPPQQQSASPPSKIPVRPQKIRKLSPESSTPDPKPSAAGAGPKSKTANASSSKASKN 106

Query: 1235 KIVASQPKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLAL 1056
            +IVAS+   +  +VARSLSCEGE+E A+RHLR+ADPLL PLIDL+  PTFD F  PF AL
Sbjct: 107  RIVASRALAVPRVVARSLSCEGEVEAAVRHLRDADPLLGPLIDLYPLPTFDIFLTPFHAL 166

Query: 1055 TKSILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDL 876
            TKSILYQQLA+KAGTSIYTRF++LCG +AGV+P+TVLAL PHQLRQIGVS RKASYLHDL
Sbjct: 167  TKSILYQQLAFKAGTSIYTRFLALCGSDAGVLPETVLALDPHQLRQIGVSARKASYLHDL 226

Query: 875  ARKYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRK 696
            ARKYQNGILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRK
Sbjct: 227  ARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRK 286

Query: 695  GVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXX 516
            GVQLLYGLEELPRPSQM+ +C+KWRPYRSVASWYMWRFVE+K              SL  
Sbjct: 287  GVQLLYGLEELPRPSQMDHMCDKWRPYRSVASWYMWRFVESK-GAPTSAAAVAVSASLQQ 345

Query: 515  XXXXXXXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                            Q LDPIN ILNLGA AWGQ
Sbjct: 346  QQQQVEEQQQHHPQQPQLLDPINSILNLGAYAWGQ 380


>ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis sativus]
            gi|700204833|gb|KGN59966.1| hypothetical protein
            Csa_3G857070 [Cucumis sativus]
          Length = 382

 Score =  432 bits (1111), Expect = e-118
 Identities = 224/335 (66%), Positives = 246/335 (73%), Gaps = 7/335 (2%)
 Frame = -3

Query: 1394 VNAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQ 1218
            +NAP Q   P +K+P RPRKIRK+SP+ +D   SH       PK   T +++K+K    +
Sbjct: 48   MNAPSQISSPPSKMPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQR 107

Query: 1217 PKRIRTIV--ARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSI 1044
                   V  ARSLSCEGE+EIALRHLRNADPLLA LIDLH  PTFDSF  PFLALT+SI
Sbjct: 108  AAFASATVPPARSLSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSI 167

Query: 1043 LYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKY 864
            LYQQLAYKAGTSIYTRF++LCGGEAGV+P+TVLAL P QLRQIG+SGRK+SYLHDLARKY
Sbjct: 168  LYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKY 227

Query: 863  QNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQL 684
            QNGILSD AIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDL VRKGVQL
Sbjct: 228  QNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQL 287

Query: 683  LYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK----XXXXXXXXXXXXXXSLXX 516
            LY LEELPRPSQM+QLCEKWRPYRSV SWYMWR  E K                      
Sbjct: 288  LYNLEELPRPSQMDQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDH 347

Query: 515  XXXXXXXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                            Q LDP+N ILNLGACAWGQ
Sbjct: 348  HQEHQHPQHPQHPQQPQLLDPLNSILNLGACAWGQ 382


>ref|XP_008388056.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Malus domestica]
          Length = 378

 Score =  432 bits (1110), Expect = e-118
 Identities = 224/327 (68%), Positives = 249/327 (76%), Gaps = 2/327 (0%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDA-SKTPKTTITTRASKNKIVASQ 1218
            NAP +   P +KIPFRPRKIRK+SPDT D   SHQ  A S+TPK    T+ASK K V  +
Sbjct: 51   NAPSKTSSPPSKIPFRPRKIRKLSPDTADPNSSHQIVAVSETPKPVAATKASKIKTVPQR 110

Query: 1217 PKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILY 1038
                  IVAR LSCEGEIE A+R+LRNADPLLAPLID H  PTFD+FH PFLALT+SILY
Sbjct: 111  AVAAPKIVARPLSCEGEIETAIRYLRNADPLLAPLIDRHPRPTFDNFHTPFLALTRSILY 170

Query: 1037 QQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQN 858
            QQLAYKAGTSIYTRF+ LCGGEA V+P+TVLA TP QLRQIG+SGRKASYLHDLARKYQN
Sbjct: 171  QQLAYKAGTSIYTRFIGLCGGEACVVPETVLAQTPQQLRQIGISGRKASYLHDLARKYQN 230

Query: 857  GILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLY 678
            GILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDLG+RKGVQLLY
Sbjct: 231  GILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPINDLGIRKGVQLLY 290

Query: 677  GLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXXX 498
             LEELPRPSQMEQLCEKWRPYRSVA+ YMWRF E+                L        
Sbjct: 291  NLEELPRPSQMEQLCEKWRPYRSVATMYMWRFSESN-GAPSSAAAVAAGACLRPQQLQQQ 349

Query: 497  XXXXXXXXXXQFLDPINGILNLGACAW 417
                      Q +D ++ ++N+GAC+W
Sbjct: 350  QQHSQHPQQQQLMDSLSSLINIGACSW 376


>ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max]
            gi|947103464|gb|KRH51847.1| hypothetical protein
            GLYMA_06G031700 [Glycine max]
          Length = 351

 Score =  431 bits (1108), Expect = e-117
 Identities = 226/332 (68%), Positives = 254/332 (76%), Gaps = 5/332 (1%)
 Frame = -3

Query: 1391 NAP-PQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQP 1215
            N P P   P+TKIP RPRKIRKVSPD +   ++  K A    + T T++A+  + +   P
Sbjct: 26   NVPQPTTSPATKIPLRPRKIRKVSPDPSTS-EAPIKPAKPVGRNT-TSKAAPPRALTVVP 83

Query: 1214 KRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQ 1035
            +    IVARSLSC+GE+EI+LR+LRNADPLL+PLID+H PPTFD+FH PFLALT+SILYQ
Sbjct: 84   R----IVARSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQ 139

Query: 1034 QLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNG 855
            QLA+KAGTSIYTRF+ LCGGE GV+P+TVLALTP QLRQIGVSGRKASYLHDLARKYQNG
Sbjct: 140  QLAFKAGTSIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNG 199

Query: 854  ILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYG 675
            ILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLLY 
Sbjct: 200  ILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYN 259

Query: 674  LEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK---XXXXXXXXXXXXXXSLXXXXXX 504
            LE+LPRPSQM+QLC+KWRPYRSVASWYMWRFVE K                         
Sbjct: 260  LEDLPRPSQMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQ 319

Query: 503  XXXXXXXXXXXXQFLDPINGILNLG-ACAWGQ 411
                        Q LDPIN + NLG ACAWGQ
Sbjct: 320  QEQQQQQHAPQPQLLDPINSMFNLGAACAWGQ 351


>gb|ACU22727.1| unknown [Glycine max]
          Length = 351

 Score =  431 bits (1108), Expect = e-117
 Identities = 226/332 (68%), Positives = 254/332 (76%), Gaps = 5/332 (1%)
 Frame = -3

Query: 1391 NAP-PQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQP 1215
            N P P   P+TKIP RPRKIRKVSPD +   ++  K A    + T T++A+  + +   P
Sbjct: 26   NVPQPTTSPATKIPLRPRKIRKVSPDPSTS-EAPIKPAKPVGRNT-TSKAAPPRALTVVP 83

Query: 1214 KRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQ 1035
            +    IVARSLSC+GE+EI+LR+LRNADPLL+PLID+H PPTFD+FH PFLALT+SILYQ
Sbjct: 84   R----IVARSLSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQ 139

Query: 1034 QLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNG 855
            QLA+KAGTSIYTRF+ LCGGE GV+P+TVLALTP QLRQIGVSGRKASYLHDLARKYQNG
Sbjct: 140  QLAFKAGTSIYTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNG 199

Query: 854  ILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYG 675
            ILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLLY 
Sbjct: 200  ILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYN 259

Query: 674  LEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETK---XXXXXXXXXXXXXXSLXXXXXX 504
            LE+LPRPSQM+QLC+KWRPYRSVASWYMWRFVE K                         
Sbjct: 260  LEDLPRPSQMDQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQ 319

Query: 503  XXXXXXXXXXXXQFLDPINGILNLGA-CAWGQ 411
                        Q LDPIN + NLGA CAWGQ
Sbjct: 320  QEQQQQQHAPQPQLLDPINSMFNLGAVCAWGQ 351


>ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633802 [Jatropha curcas]
            gi|643731174|gb|KDP38512.1| hypothetical protein
            JCGZ_04437 [Jatropha curcas]
          Length = 406

 Score =  431 bits (1107), Expect = e-117
 Identities = 226/330 (68%), Positives = 251/330 (76%), Gaps = 9/330 (2%)
 Frame = -3

Query: 1373 PPSTKIPFRPRKIRKVSPD----TTDDGKSHQ--KDASKTPKTTIT---TRASKNKIVAS 1221
            PP+   P RPRKIRK+SPD    T  D  S Q     ++ PKTT     TR ++ K +  
Sbjct: 89   PPAKIPPSRPRKIRKLSPDDTATTATDPNSSQLTTTTNEPPKTTAKSAKTRIAQTKAIVV 148

Query: 1220 QPKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSIL 1041
             P RI   + RSLSCEGE+E A+RHLR+ADPLLA LIDLH PPTFD+FH PFLALT+SIL
Sbjct: 149  APPRI---IPRSLSCEGEVENAIRHLRDADPLLASLIDLHPPPTFDTFHTPFLALTRSIL 205

Query: 1040 YQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQ 861
            YQQLA+KAGTSIYTRF++LCGGEAGV+P TVL+LTP QLRQIGVSGRKASYLHDLARKY 
Sbjct: 206  YQQLAFKAGTSIYTRFIALCGGEAGVLPGTVLSLTPQQLRQIGVSGRKASYLHDLARKYH 265

Query: 860  NGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLL 681
            NGILSD+AIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLL
Sbjct: 266  NGILSDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLL 325

Query: 680  YGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXX 501
            Y LE+LPRPSQM+QLCEKWRPYRSVASWY+WRFVE K              ++       
Sbjct: 326  YNLEDLPRPSQMDQLCEKWRPYRSVASWYLWRFVEAK---------GSPSSAVAVATGAG 376

Query: 500  XXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                       Q LDPIN ILNLGACAWGQ
Sbjct: 377  MTQQQQEEQQPQLLDPINSILNLGACAWGQ 406


>ref|XP_013462123.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula]
            gi|657396011|gb|KEH36158.1| HhH-GPD base excision DNA
            repair family protein [Medicago truncatula]
          Length = 377

 Score =  428 bits (1101), Expect = e-117
 Identities = 220/337 (65%), Positives = 251/337 (74%), Gaps = 10/337 (2%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQP 1215
            N PP  K P+TK+P RPRKIRKVSPD T      Q +  K P +T   +++       QP
Sbjct: 43   NVPPHTKAPATKMPLRPRKIRKVSPDPTTS--ESQSETLKPPNSTAAGKSNGRNNKTVQP 100

Query: 1214 KRIRT-----IVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTK 1050
             + RT     IV RSLSCEGE+EIA+R+LR+ADPLL+PLID+H PPTFD+F  PFLALT+
Sbjct: 101  PQQRTLAVPKIVPRSLSCEGEVEIAIRYLRSADPLLSPLIDIHQPPTFDNFQTPFLALTR 160

Query: 1049 SILYQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLAR 870
            SILYQQLA+KAGTSIYTRF++LCGGEAGV+PD VLALT  QLRQIGVSGRKASYLHDLAR
Sbjct: 161  SILYQQLAFKAGTSIYTRFIALCGGEAGVVPDNVLALTAQQLRQIGVSGRKASYLHDLAR 220

Query: 869  KYQNGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGV 690
            KYQNGILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGV
Sbjct: 221  KYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGV 280

Query: 689  QLLYGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSL---X 519
            Q+LY L++LPRPSQM+QLCEKW+PYRSVASWY+WRFVE K                    
Sbjct: 281  QILYNLDDLPRPSQMDQLCEKWKPYRSVASWYLWRFVEAKGSPSTAVAVATGNGLQQHEL 340

Query: 518  XXXXXXXXXXXXXXXXXQFLDPINGILNLG-ACAWGQ 411
                               +DP+N + N+G ACAWGQ
Sbjct: 341  DHHQQQQQQQQQQHSQQPIMDPMNNMFNMGAACAWGQ 377


>ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223551097|gb|EEF52583.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 369

 Score =  428 bits (1101), Expect = e-117
 Identities = 223/325 (68%), Positives = 247/325 (76%)
 Frame = -3

Query: 1385 PPQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQPKRI 1206
            P   PP+   P RPRK+RK+SP++          A+K+ KT    +  + + +A  P RI
Sbjct: 70   PTATPPAKIPPSRPRKLRKLSPES----------AAKSTKT----KTPQPRALAVAPPRI 115

Query: 1205 RTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQQLA 1026
               +ARSLSCEGE+E A+RHLR ADPLL+ LIDLH PPTFD+FH PFLALT+SILYQQLA
Sbjct: 116  ---IARSLSCEGEVENAIRHLREADPLLSSLIDLHPPPTFDTFHTPFLALTRSILYQQLA 172

Query: 1025 YKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILS 846
            +KAGTSIYTRF+SLCGGEAGV+PDTVLALTP QLRQIGVSGRKASYLHDLARKY NGILS
Sbjct: 173  FKAGTSIYTRFISLCGGEAGVVPDTVLALTPQQLRQIGVSGRKASYLHDLARKYHNGILS 232

Query: 845  DSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEE 666
            DSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLLY LE+
Sbjct: 233  DSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLED 292

Query: 665  LPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXXXXXXX 486
            LPRPSQM+QLCEKWRPYRSVASWY+WRFVE K                            
Sbjct: 293  LPRPSQMDQLCEKWRPYRSVASWYLWRFVEAK--------GSPSSAVAVATGAALTQQHQ 344

Query: 485  XXXXXXQFLDPINGILNLGACAWGQ 411
                  Q LDPIN ILNLGACAWGQ
Sbjct: 345  EDHQQPQLLDPINSILNLGACAWGQ 369


>ref|XP_009364961.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Pyrus x bretschneideri]
          Length = 374

 Score =  428 bits (1100), Expect = e-117
 Identities = 222/327 (67%), Positives = 249/327 (76%), Gaps = 2/327 (0%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDA-SKTPKTTITTRASKNKIVASQ 1218
            NAP +   P +KIPFRPRKIRK+SPDT +   SHQ  A S+TPK    T+ASK K V  +
Sbjct: 47   NAPSKTSSPPSKIPFRPRKIRKLSPDTANPNSSHQIVAVSETPKPVAATKASKIKTVPQR 106

Query: 1217 PKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILY 1038
                  IVAR LSCEGEIE A+R+LRNADPLLAPLID H  PTFD+FH PFLALT+SILY
Sbjct: 107  AVAAPKIVARPLSCEGEIETAIRYLRNADPLLAPLIDRHPRPTFDNFHTPFLALTRSILY 166

Query: 1037 QQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQN 858
            QQLAYKAGTSIYTRF++LCGGEA V+P+ VLA TP QLRQIG+SGRKASYLHDLARKYQN
Sbjct: 167  QQLAYKAGTSIYTRFIALCGGEACVVPEIVLAQTPQQLRQIGISGRKASYLHDLARKYQN 226

Query: 857  GILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLY 678
            GILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDLG+RKGVQLLY
Sbjct: 227  GILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPINDLGIRKGVQLLY 286

Query: 677  GLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXXX 498
             LEELPRPSQMEQLCEKWRPYRSVA+ YMWRF E+                L        
Sbjct: 287  NLEELPRPSQMEQLCEKWRPYRSVATMYMWRFSESN-GAPSSAAAVAAGVCLGSQQPQQQ 345

Query: 497  XXXXXXXXXXQFLDPINGILNLGACAW 417
                      Q +D ++ ++N+GAC+W
Sbjct: 346  QQHSQHPQQQQLMDSLSSLINIGACSW 372


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  428 bits (1100), Expect = e-117
 Identities = 218/325 (67%), Positives = 243/325 (74%)
 Frame = -3

Query: 1385 PPQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQPKRI 1206
            P    P +KIP RPRKIRK+SPD   D  S  +    +  T+  +  S+      Q   +
Sbjct: 55   PQTSSPPSKIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTV 114

Query: 1205 RTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQQLA 1026
              I+AR LS EGE+E A+RHLRNAD  LA LID+H PPTFDSFH PFLALT+SILYQQLA
Sbjct: 115  PRIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLA 174

Query: 1025 YKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILS 846
            +KAGTSIYTRF++LCGGEAGV+P+TVLALTP QLRQIGVSGRKASYLHDLARKYQNGILS
Sbjct: 175  FKAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILS 234

Query: 845  DSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEE 666
            DSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLLY LEE
Sbjct: 235  DSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEE 294

Query: 665  LPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXXXXXXX 486
            LPRPSQM+QLCEKWRPYRSVASWY+WRFVE K              +             
Sbjct: 295  LPRPSQMDQLCEKWRPYRSVASWYLWRFVEAK--------GAPSSAAAVAAGAALPQPQQ 346

Query: 485  XXXXXXQFLDPINGILNLGACAWGQ 411
                  Q LD IN ++N+GACAWGQ
Sbjct: 347  EEQQQPQLLDQINSLINIGACAWGQ 371


>ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris]
            gi|561009684|gb|ESW08591.1| hypothetical protein
            PHAVU_009G058200g [Phaseolus vulgaris]
          Length = 366

 Score =  427 bits (1099), Expect = e-116
 Identities = 224/330 (67%), Positives = 247/330 (74%), Gaps = 5/330 (1%)
 Frame = -3

Query: 1385 PPQKPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITTRASKNKIVASQPKRI 1206
            P    P++KIP RPRKIRKVSPD      S  +  ++ PK   +   S   +  S+   +
Sbjct: 42   PHANSPASKIPLRPRKIRKVSPDP-----STSESQTEPPKPGKSGGRSTKHVPPSRGMSV 96

Query: 1205 RT-IVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILYQQL 1029
               +VARSLSCEGE+EIALR LRNADPLL+PLID+H PPTFD+FH PFLALT+SILYQQL
Sbjct: 97   LPRLVARSLSCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQL 156

Query: 1028 AYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQNGIL 849
            AYKAGTSIYTRF++LCGGE GV+P+TVLALTP QLRQIGVSGRKASYLHDLARKYQNGIL
Sbjct: 157  AYKAGTSIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGIL 216

Query: 848  SDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLE 669
            SDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLLY LE
Sbjct: 217  SDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE 276

Query: 668  ELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSL---XXXXXXXX 498
            +LPRPSQM+ LCEKWRPYRSVASWYMWRFVE K                           
Sbjct: 277  DLPRPSQMDHLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHHQHQQHE 336

Query: 497  XXXXXXXXXXQFLDPINGILNLG-ACAWGQ 411
                      Q LDPIN + NLG ACAWGQ
Sbjct: 337  QQQQQHPPQPQLLDPINSMFNLGAACAWGQ 366


>ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2
            [Gossypium raimondii] gi|763791263|gb|KJB58259.1|
            hypothetical protein B456_009G201500 [Gossypium
            raimondii]
          Length = 395

 Score =  427 bits (1098), Expect = e-116
 Identities = 224/330 (67%), Positives = 253/330 (76%), Gaps = 10/330 (3%)
 Frame = -3

Query: 1370 PSTKIPFRPRKIRKVSPDTTDDGKSHQKDASKTPKTTITT------RASKNKIVASQPKR 1209
            P +KIP RPRKIRK+SPD + D  + Q+ A+ +  T++T       R SK K+  SQ + 
Sbjct: 72   PPSKIPSRPRKIRKLSPDLSFDPNASQQ-ATTSSSTSLTEQRKTVGRTSKTKL--SQHRA 128

Query: 1208 IRT----IVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSIL 1041
            +      I++RSLSCEGE+E A+ HLR+ADPLLA LIDLH PPTFD+FHAPFLALT+SIL
Sbjct: 129  LAVVAPRIISRSLSCEGEVENAIHHLRDADPLLASLIDLHPPPTFDTFHAPFLALTRSIL 188

Query: 1040 YQQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQ 861
            YQQLA+KAGTSIYTRF+SLCGGE GV+P+TVL+LT  QLRQIGVSGRKASYLHDLARKYQ
Sbjct: 189  YQQLAFKAGTSIYTRFISLCGGENGVVPETVLSLTSQQLRQIGVSGRKASYLHDLARKYQ 248

Query: 860  NGILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLL 681
             GILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLGVRKGVQLL
Sbjct: 249  TGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLL 308

Query: 680  YGLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVETKXXXXXXXXXXXXXXSLXXXXXXX 501
            Y LEELPRPSQM+QLCEKWRPYRSVASWY+WR+VE K                       
Sbjct: 309  YNLEELPRPSQMDQLCEKWRPYRSVASWYLWRYVEAK---GAPSSAAAVAAGASLPPLQQ 365

Query: 500  XXXXXXXXXXXQFLDPINGILNLGACAWGQ 411
                       Q +DPIN ILNLGACAWGQ
Sbjct: 366  QEEPQQHQQQPQLMDPINSILNLGACAWGQ 395


>ref|XP_008388055.1| PREDICTED: DNA-3-methyladenine glycosylase isoform X1 [Malus
            domestica]
          Length = 401

 Score =  427 bits (1098), Expect = e-116
 Identities = 216/275 (78%), Positives = 234/275 (85%), Gaps = 2/275 (0%)
 Frame = -3

Query: 1391 NAPPQ-KPPSTKIPFRPRKIRKVSPDTTDDGKSHQKDA-SKTPKTTITTRASKNKIVASQ 1218
            NAP +   P +KIPFRPRKIRK+SPDT D   SHQ  A S+TPK    T+ASK K V  +
Sbjct: 51   NAPSKTSSPPSKIPFRPRKIRKLSPDTADPNSSHQIVAVSETPKPVAATKASKIKTVPQR 110

Query: 1217 PKRIRTIVARSLSCEGEIEIALRHLRNADPLLAPLIDLHHPPTFDSFHAPFLALTKSILY 1038
                  IVAR LSCEGEIE A+R+LRNADPLLAPLID H  PTFD+FH PFLALT+SILY
Sbjct: 111  AVAAPKIVARPLSCEGEIETAIRYLRNADPLLAPLIDRHPRPTFDNFHTPFLALTRSILY 170

Query: 1037 QQLAYKAGTSIYTRFVSLCGGEAGVIPDTVLALTPHQLRQIGVSGRKASYLHDLARKYQN 858
            QQLAYKAGTSIYTRF+ LCGGEA V+P+TVLA TP QLRQIG+SGRKASYLHDLARKYQN
Sbjct: 171  QQLAYKAGTSIYTRFIGLCGGEACVVPETVLAQTPQQLRQIGISGRKASYLHDLARKYQN 230

Query: 857  GILSDSAIVEMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLY 678
            GILSDSAIV MDDKSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDLG+RKGVQLLY
Sbjct: 231  GILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPINDLGIRKGVQLLY 290

Query: 677  GLEELPRPSQMEQLCEKWRPYRSVASWYMWRFVET 573
             LEELPRPSQMEQLCEKWRPYRSVA+ YMWRF E+
Sbjct: 291  NLEELPRPSQMEQLCEKWRPYRSVATMYMWRFSES 325

