BLASTX nr result

ID: Rheum21_contig00003862 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00003862
         (2179 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma ca...   407   e-110
ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   406   e-110
ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   406   e-110
gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]    404   e-110
ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr...   404   e-110
ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   398   e-108
ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [R...   398   e-108
ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc...   396   e-107
ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   395   e-107
ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1...   395   e-107
gb|ACU22727.1| unknown [Glycine max]                                  395   e-107
ref|XP_002302029.1| predicted protein [Populus trichocarpa]           393   e-106
gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus...   392   e-106
ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202...   391   e-106
ref|XP_002887579.1| hypothetical protein ARALYDRAFT_476667 [Arab...   385   e-104
gb|EMJ24474.1| hypothetical protein PRUPE_ppa007252mg [Prunus pe...   384   e-104
ref|XP_002306870.2| hypothetical protein POPTR_0005s24930g [Popu...   384   e-104
ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutr...   382   e-103
gb|AAG12687.1|AC025814_11 3-methyladenine DNA glycosylase, putat...   381   e-103
ref|NP_974147.1| DNA glycosylase superfamily protein [Arabidopsi...   381   e-103

>gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma cacao]
          Length = 397

 Score =  407 bits (1045), Expect = e-110
 Identities = 219/370 (59%), Positives = 261/370 (70%), Gaps = 4/370 (1%)
 Frame = +1

Query: 409  QIQAQTPPQSQLLSHPDPLVQSQLHLKPE----IEDSVASITVSSVDITAAATTIELPNA 576
            Q QAQ+ PQ+   S      QSQ   + +      ++  S TV+S  +T+A T  EL N 
Sbjct: 10   QPQAQSQPQNDSSSSTPTQEQSQGQTQTQNPNNTSNAAVSTTVTSAVVTSAPT--ELTNV 67

Query: 577  LSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVI 756
              Q  SP +K +P RPRKIRKLS D  S   +    A+  A S +E   + A TP + + 
Sbjct: 68   PPQTSSPPSK-IPFRPRKIRKLSPDPNSDT-NASQQATTSATSATEPPKTVAKTPKTKLT 125

Query: 757  VPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVD 936
                                       Q R +   PR++ARSLS  GE+E A++HL++ D
Sbjct: 126  ---------------------------QHRALAVVPRIMARSLSCEGEVETAIRHLRNAD 158

Query: 937  PHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDT 1116
            P LA LI++H PPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF+ALCGGE GV+P+T
Sbjct: 159  PLLASLIDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIYNRFIALCGGENGVVPET 218

Query: 1117 VLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSW 1296
            VL+LT QQLRQIGVSGRKASYLHDLARKY  GILSD +I+NMDDKSLFTMLTMVNGIGSW
Sbjct: 219  VLSLTAQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSW 278

Query: 1297 SVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYL 1476
            SVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LEELPRPSQM+ +C+KWRPYRSVASWYL
Sbjct: 279  SVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYL 338

Query: 1477 WRFVEAKGTP 1506
            WRFVEAKG P
Sbjct: 339  WRFVEAKGAP 348


>ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus
            sinensis]
          Length = 371

 Score =  406 bits (1043), Expect = e-110
 Identities = 222/370 (60%), Positives = 258/370 (69%), Gaps = 1/370 (0%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDI-TAAATTIELPNA 576
            M +Q Q+QT  Q+Q    P+P  Q      P  +DS  ++ V  V   TA   TI   N 
Sbjct: 1    MVEQTQSQT--QNQPEPQPEPETQP-----PPNQDSTTTLAVIPVQTETANNATITHANV 53

Query: 577  LSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVI 756
              Q  SP +K +PLRPRKIRKLS D      +G   AS+   + S  ATS   T S  + 
Sbjct: 54   TPQTSSPPSK-IPLRPRKIRKLSPD------NGVDQASSSQPTESSKATSAKSTKSRAI- 105

Query: 757  VPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVD 936
                                       Q +     PR++AR LS  GE+E A++HL++ D
Sbjct: 106  --------------------------QQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNAD 139

Query: 937  PHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDT 1116
              LA LI++H PPTFDSF TPFLAL++SILYQQLA+KAGTSIY RF+ALCGGE GV+P+T
Sbjct: 140  RQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPET 199

Query: 1117 VLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSW 1296
            VLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSW
Sbjct: 200  VLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSW 259

Query: 1297 SVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYL 1476
            SVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LEELPRPSQM+ +C+KWRPYRSVASWYL
Sbjct: 260  SVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYL 319

Query: 1477 WRFVEAKGTP 1506
            WRFVEAKG P
Sbjct: 320  WRFVEAKGAP 329


>ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus
            sinensis]
          Length = 373

 Score =  406 bits (1043), Expect = e-110
 Identities = 222/370 (60%), Positives = 258/370 (69%), Gaps = 1/370 (0%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDI-TAAATTIELPNA 576
            M +Q Q+QT  Q+Q    P+P  Q      P  +DS  ++ V  V   TA   TI   N 
Sbjct: 1    MVEQTQSQT--QNQPEPQPEPETQP-----PPNQDSTTTLAVIPVQTETANNATITHANV 53

Query: 577  LSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVI 756
              Q  SP +K +PLRPRKIRKLS D      +G   AS+   + S  ATS   T S  + 
Sbjct: 54   TPQTSSPPSK-IPLRPRKIRKLSPD------NGVDQASSSQPTESSKATSAKSTKSRAI- 105

Query: 757  VPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVD 936
                                       Q +     PR++AR LS  GE+E A++HL++ D
Sbjct: 106  --------------------------QQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNAD 139

Query: 937  PHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDT 1116
              LA LI++H PPTFDSF TPFLAL++SILYQQLA+KAGTSIY RF+ALCGGE GV+P+T
Sbjct: 140  RQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPET 199

Query: 1117 VLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSW 1296
            VLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSW
Sbjct: 200  VLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSW 259

Query: 1297 SVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYL 1476
            SVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LEELPRPSQM+ +C+KWRPYRSVASWYL
Sbjct: 260  SVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYL 319

Query: 1477 WRFVEAKGTP 1506
            WRFVEAKG P
Sbjct: 320  WRFVEAKGAP 329


>gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis]
          Length = 451

 Score =  404 bits (1039), Expect = e-110
 Identities = 224/375 (59%), Positives = 255/375 (68%), Gaps = 6/375 (1%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTI--ELPN 573
            M +Q Q QT  Q           Q Q H     E S + +T  S    A ++T   EL N
Sbjct: 1    MGEQTQTQTQTQ-----------QPQQHHGQTQESSSSMVTSISTTTIAPSSTAPTELSN 49

Query: 574  ALSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATS----TALTP 741
            A SQ  SP +K +PLRPRKIRKLS D      D D  +S   A       S     A  P
Sbjct: 50   APSQTSSPPSK-IPLRPRKIRKLSPD------DSDSKSSQVVAVPENPKPSPTAAAAAKP 102

Query: 742  SSMVIVPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQH 921
            +   IV                          Q  +    PR+VARSLS  GE+E AL+H
Sbjct: 103  AKAKIVQ-------------------------QRALAIAAPRIVARSLSCEGEVEVALRH 137

Query: 922  LQSVDPHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEG 1101
            L+  DP LA LI++HQPPTFD+F TPFLAL++SILYQQLAYKAGTSIY RF+ALCGGE G
Sbjct: 138  LRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGETG 197

Query: 1102 VLPDTVLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVN 1281
            V+P+TVLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVN
Sbjct: 198  VVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVN 257

Query: 1282 GIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSV 1461
            GIGSWSVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LEELPRPSQM+ +C+KWRPYRSV
Sbjct: 258  GIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSV 317

Query: 1462 ASWYLWRFVEAKGTP 1506
            A+WY+WRFVE KG P
Sbjct: 318  AAWYMWRFVEQKGAP 332


>ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina]
            gi|557537126|gb|ESR48244.1| hypothetical protein
            CICLE_v10001539mg [Citrus clementina]
          Length = 373

 Score =  404 bits (1038), Expect = e-110
 Identities = 221/370 (59%), Positives = 257/370 (69%), Gaps = 1/370 (0%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDI-TAAATTIELPNA 576
            M +Q Q+QT  Q+Q    P+P  Q      P  +DS  ++ V  V   TA   TI   N 
Sbjct: 1    MVEQTQSQT--QNQPEPQPEPETQP-----PPNQDSTTALAVIPVQSETANNATITHANV 53

Query: 577  LSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVI 756
              Q  SP +K +PLRPRKIRKLS D      +G    S+   + S  ATS   T S  + 
Sbjct: 54   TPQTSSPPSK-IPLRPRKIRKLSPD------NGVDQTSSSQPTESSKATSAKSTKSRAI- 105

Query: 757  VPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVD 936
                                       Q +     PR++AR LS  GE+E A++HL++ D
Sbjct: 106  --------------------------QQQQQTLTVPRIIARPLSSEGEVEAAIRHLRNAD 139

Query: 937  PHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDT 1116
              LA LI++H PPTFDSF TPFLAL++SILYQQLA+KAGTSIY RF+ALCGGE GV+P+T
Sbjct: 140  RQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPET 199

Query: 1117 VLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSW 1296
            VLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSW
Sbjct: 200  VLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSW 259

Query: 1297 SVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYL 1476
            SVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LEELPRPSQM+ +C+KWRPYRSVASWYL
Sbjct: 260  SVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMDQLCEKWRPYRSVASWYL 319

Query: 1477 WRFVEAKGTP 1506
            WRFVEAKG P
Sbjct: 320  WRFVEAKGAP 329


>ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Cicer
            arietinum]
          Length = 384

 Score =  398 bits (1023), Expect = e-108
 Identities = 211/362 (58%), Positives = 258/362 (71%), Gaps = 4/362 (1%)
 Frame = +1

Query: 433  QSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALSQNP----SPS 600
            ++Q+   P  L+ +++  +P+ +   AS   ++V  TAA   I + + LS  P    SP+
Sbjct: 4    ETQIQPQPQTLIGTEIEPQPQSQPQEASSN-NTVAATAAGAIIPVESELSNVPPHINSPA 62

Query: 601  TKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSK 780
            TK +PLRPRKIRK+S D  +         S   +   + ATSTA                
Sbjct: 63   TK-IPLRPRKIRKVSPDPTTT--------SESQSETPKSATSTA---------------- 97

Query: 781  PXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLIN 960
                               Q +     PR+VARSLS  GE+E AL++L++ DP L+ LI+
Sbjct: 98   -------GKSCGRHSNKSVQQQRALIVPRIVARSLSCEGEVEIALRYLRNADPLLSPLID 150

Query: 961  LHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQ 1140
            +HQPPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF+ALCGGE GV+P+TVLAL PQQ
Sbjct: 151  IHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSIYTRFIALCGGEAGVVPETVLALNPQQ 210

Query: 1141 LRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIF 1320
            LRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSWSVHMFMIF
Sbjct: 211  LRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIF 270

Query: 1321 SLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKG 1500
            SLHRPDVLP+NDLG+RKGVQ+LY LE+LPRPSQM+ +C+KWRPYRSVASWY+WRFVEAKG
Sbjct: 271  SLHRPDVLPINDLGVRKGVQILYNLEDLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKG 330

Query: 1501 TP 1506
            TP
Sbjct: 331  TP 332


>ref|XP_002510396.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223551097|gb|EEF52583.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 369

 Score =  398 bits (1023), Expect = e-108
 Identities = 207/366 (56%), Positives = 248/366 (67%)
 Frame = +1

Query: 409  QIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALSQN 588
            Q Q QT  Q+Q    P    +SQ H + +I   V +   S    T+   T EL       
Sbjct: 12   QPQPQTQTQTQSQPQPQSQSESQSHPQSQIRTQVLNHQPSHDSATSTIATSELICIPQPT 71

Query: 589  PSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIF 768
             +P  K+ P RPRK+RKLS +                 S ++   +    P ++ + P  
Sbjct: 72   ATPPAKIPPSRPRKLRKLSPE-----------------SAAKSTKTKTPQPRALAVAP-- 112

Query: 769  SDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLA 948
                                           PR++ARSLS  GE+E A++HL+  DP L+
Sbjct: 113  -------------------------------PRIIARSLSCEGEVENAIRHLREADPLLS 141

Query: 949  QLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLAL 1128
             LI+LH PPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF++LCGGE GV+PDTVLAL
Sbjct: 142  SLIDLHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIYTRFISLCGGEAGVVPDTVLAL 201

Query: 1129 TPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHM 1308
            TPQQLRQIGVSGRKASYLHDLARKY+NGILSD +I+NMDDKSLFTMLTMVNGIGSWSVHM
Sbjct: 202  TPQQLRQIGVSGRKASYLHDLARKYHNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHM 261

Query: 1309 FMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFV 1488
            FMIFSLHRPDVLP+NDLG+RKGVQLLY LE+LPRPSQM+ +C+KWRPYRSVASWYLWRFV
Sbjct: 262  FMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVASWYLWRFV 321

Query: 1489 EAKGTP 1506
            EAKG+P
Sbjct: 322  EAKGSP 327


>ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine
            max]
          Length = 374

 Score =  396 bits (1017), Expect = e-107
 Identities = 212/362 (58%), Positives = 248/362 (68%)
 Frame = +1

Query: 421  QTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALSQNPSPS 600
            QT  Q+Q L  P PL        P+                 A    EL N      SP+
Sbjct: 4    QTLGQAQSLIEPQPLPAPSSTAVPD----------------GATVDSELNNVPRPTTSPA 47

Query: 601  TKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSK 780
            TK +PLRPRKIRK+S D  ++ +  +       A      T+ A  P ++ +VP      
Sbjct: 48   TK-IPLRPRKIRKVSPDPSTSESQTETPKP---AKTGGRNTTKAAPPRALTVVP------ 97

Query: 781  PXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLIN 960
                                        R+VARSLS  GE+E AL++L++ DP L+ LI+
Sbjct: 98   ----------------------------RIVARSLSCDGEVEIALRYLRNADPVLSPLID 129

Query: 961  LHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQ 1140
            +HQPPTFD+F TPFLAL++SILYQQLAYKAGTSIY RF+ALCGGE GV+P+TVLALTPQQ
Sbjct: 130  IHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGENGVVPETVLALTPQQ 189

Query: 1141 LRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIF 1320
            LRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSWSVHMFMIF
Sbjct: 190  LRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIF 249

Query: 1321 SLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKG 1500
            SLHRPDVLP+NDLG+RKGVQLLY LE+LPRPSQM+ +CDKWRPYRSVASWY+WRFVEAKG
Sbjct: 250  SLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCDKWRPYRSVASWYMWRFVEAKG 309

Query: 1501 TP 1506
            TP
Sbjct: 310  TP 311


>ref|XP_002282344.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Vitis vinifera]
          Length = 363

 Score =  395 bits (1016), Expect = e-107
 Identities = 213/344 (61%), Positives = 236/344 (68%), Gaps = 3/344 (0%)
 Frame = +1

Query: 484  LKPEIEDSVASITVSSVDITA---AATTIELPNALSQNPSPSTKLVPLRPRKIRKLSSDT 654
            L+P+ E + A  T ++ D TA    +T+ EL          S   +P RPRKIRK+S D 
Sbjct: 12   LQPDNESATA--TSNAADTTAIQIVSTSTELATIAPPENQSSASNIPFRPRKIRKISPDN 69

Query: 655  VSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSKPXXXXXXXXXXXXXXXXX 834
              +   GD   +   A          L P  +  VP                        
Sbjct: 70   SESKPAGDSKTAGKGAK-------NKLVPQRVPAVP------------------------ 98

Query: 835  XQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLINLHQPPTFDSFQTPFLALS 1014
                       MVAR+LS  GEIE AL+HL++ DPHLA LI+LH PPTFDSF TPFLAL+
Sbjct: 99   ----------NMVARALSCEGEIEIALRHLRNADPHLAPLIDLHPPPTFDSFHTPFLALT 148

Query: 1015 KSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQLRQIGVSGRKASYLHDLA 1194
            KSILYQQLAYKAGTSIY RFV LCGGE GVLP+TVLALTP QLRQIGVSGRKASYLHDLA
Sbjct: 149  KSILYQQLAYKAGTSIYTRFVGLCGGEAGVLPETVLALTPHQLRQIGVSGRKASYLHDLA 208

Query: 1195 RKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKG 1374
            RKY NGILSD  II MDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLG+RKG
Sbjct: 209  RKYQNGILSDTGIITMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKG 268

Query: 1375 VQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKGTP 1506
            VQLLYGLEELPRPSQME +C+KWRPYRSVASWY+WRFVE KG P
Sbjct: 269  VQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYIWRFVEGKGAP 312


>ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max]
          Length = 351

 Score =  395 bits (1015), Expect = e-107
 Identities = 204/321 (63%), Positives = 238/321 (74%)
 Frame = +1

Query: 544  AAATTIELPNALSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDAT 723
            AA    EL N      SP+TK +PLRPRKIRK+S D  ++ A         A  V  + T
Sbjct: 17   AATAHSELNNVPQPTTSPATK-IPLRPRKIRKVSPDPSTSEAP-----IKPAKPVGRNTT 70

Query: 724  STALTPSSMVIVPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEI 903
            S A  P ++ +VP                                  R+VARSLS  GE+
Sbjct: 71   SKAAPPRALTVVP----------------------------------RIVARSLSCDGEV 96

Query: 904  ERALQHLQSVDPHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVAL 1083
            E +L++L++ DP L+ LI++HQPPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF+ L
Sbjct: 97   EISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSIYTRFIGL 156

Query: 1084 CGGEEGVLPDTVLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFT 1263
            CGGE GV+P+TVLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFT
Sbjct: 157  CGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFT 216

Query: 1264 MLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKW 1443
            MLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LE+LPRPSQM+ +CDKW
Sbjct: 217  MLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCDKW 276

Query: 1444 RPYRSVASWYLWRFVEAKGTP 1506
            RPYRSVASWY+WRFVEAKGTP
Sbjct: 277  RPYRSVASWYMWRFVEAKGTP 297


>gb|ACU22727.1| unknown [Glycine max]
          Length = 351

 Score =  395 bits (1015), Expect = e-107
 Identities = 204/321 (63%), Positives = 238/321 (74%)
 Frame = +1

Query: 544  AAATTIELPNALSQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDAT 723
            AA    EL N      SP+TK +PLRPRKIRK+S D  ++ A         A  V  + T
Sbjct: 17   AATAHSELNNVPQPTTSPATK-IPLRPRKIRKVSPDPSTSEAP-----IKPAKPVGRNTT 70

Query: 724  STALTPSSMVIVPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEI 903
            S A  P ++ +VP                                  R+VARSLS  GE+
Sbjct: 71   SKAAPPRALTVVP----------------------------------RIVARSLSCDGEV 96

Query: 904  ERALQHLQSVDPHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVAL 1083
            E +L++L++ DP L+ LI++HQPPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF+ L
Sbjct: 97   EISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSIYTRFIGL 156

Query: 1084 CGGEEGVLPDTVLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFT 1263
            CGGE GV+P+TVLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFT
Sbjct: 157  CGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFT 216

Query: 1264 MLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKW 1443
            MLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLG+RKGVQLLY LE+LPRPSQM+ +CDKW
Sbjct: 217  MLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCDKW 276

Query: 1444 RPYRSVASWYLWRFVEAKGTP 1506
            RPYRSVASWY+WRFVEAKGTP
Sbjct: 277  RPYRSVASWYMWRFVEAKGTP 297


>ref|XP_002302029.1| predicted protein [Populus trichocarpa]
          Length = 381

 Score =  393 bits (1009), Expect = e-106
 Identities = 207/372 (55%), Positives = 256/372 (68%), Gaps = 3/372 (0%)
 Frame = +1

Query: 400  MSDQIQAQTPPQ-SQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNA 576
            M +Q Q Q  PQ SQ  S P P +++Q + +P  + +   IT ++ + T    +I  P A
Sbjct: 1    MGEQTQIQPQPQPSQSESQPLPQIEAQAYTQPLNDPTTTVITTTTTESTTIPPSITSPPA 60

Query: 577  LSQNPSPSTKLVPLRPRKIRKLSSDT--VSAVADGDLDASNFAASVSEDATSTALTPSSM 750
                       +P RPRKIRKLS D   V+ V D +   ++   +     T+   TP + 
Sbjct: 61   K----------IPSRPRKIRKLSPDAAVVTTVNDPNSTQTSIKNTTEPPRTTATKTPRTK 110

Query: 751  VIVPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQS 930
                                            +V   PR++ARSL+  GE+E A++HL++
Sbjct: 111  TA--------------------------QHRAIVALAPRIMARSLTCEGELEIAIRHLRN 144

Query: 931  VDPHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLP 1110
             DP LA LI+++ PPTFD+F TPFLAL++SILYQQLA+KAGTSIY RF++LCGGE GVLP
Sbjct: 145  ADPLLASLIDIYPPPTFDTFPTPFLALARSILYQQLAFKAGTSIYTRFISLCGGEAGVLP 204

Query: 1111 DTVLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIG 1290
            +TVLALTPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIG
Sbjct: 205  ETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIG 264

Query: 1291 SWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASW 1470
            SWSVHMFMIFSLHRPDVLP+NDL +RKG+Q+LY L ELPRPSQM+H+C+KWRPYRSVASW
Sbjct: 265  SWSVHMFMIFSLHRPDVLPINDLQVRKGLQVLYNLPELPRPSQMDHLCEKWRPYRSVASW 324

Query: 1471 YLWRFVEAKGTP 1506
            YLWRF E KG+P
Sbjct: 325  YLWRFQEVKGSP 336


>gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris]
          Length = 366

 Score =  392 bits (1006), Expect = e-106
 Identities = 212/348 (60%), Positives = 247/348 (70%)
 Frame = +1

Query: 463  LVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALSQNPSPSTKLVPLRPRKIRKL 642
            L Q+Q  ++P+    V S + ++ D   A    EL N L    SP++K +PLRPRKIRK+
Sbjct: 6    LGQAQSLIEPQ-PHPVPSSSAAAPD--GAQADSELNNVLPHANSPASK-IPLRPRKIRKV 61

Query: 643  SSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSKPXXXXXXXXXXXXX 822
            S D                 S SE  T       S                         
Sbjct: 62   SPDP----------------STSESQTEPPKPGKS---------------------GGRS 84

Query: 823  XXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLINLHQPPTFDSFQTPF 1002
                  SR +   PR+VARSLS  GE+E AL+ L++ DP L+ LI++HQPPTFD+F TPF
Sbjct: 85   TKHVPPSRGMSVLPRLVARSLSCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPF 144

Query: 1003 LALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQLRQIGVSGRKASYL 1182
            LAL++SILYQQLAYKAGTSIY RF+ALCGGE GV+P+TVLALTPQQLRQIGVSGRKASYL
Sbjct: 145  LALTRSILYQQLAYKAGTSIYTRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYL 204

Query: 1183 HDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLG 1362
            HDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLP+NDLG
Sbjct: 205  HDLARKYQNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 264

Query: 1363 IRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKGTP 1506
            +RKGVQLLY LE+LPRPSQM+H+C+KWRPYRSVASWY+WRFVEAKGTP
Sbjct: 265  VRKGVQLLYNLEDLPRPSQMDHLCEKWRPYRSVASWYMWRFVEAKGTP 312


>ref|XP_004147864.1| PREDICTED: uncharacterized protein LOC101202943 [Cucumis sativus]
            gi|449476816|ref|XP_004154842.1| PREDICTED:
            uncharacterized LOC101202943 [Cucumis sativus]
          Length = 382

 Score =  391 bits (1005), Expect = e-106
 Identities = 217/374 (58%), Positives = 253/374 (67%), Gaps = 7/374 (1%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNAL 579
            M +Q Q Q   Q+Q    P    Q+  H   E  +S   I  ++V ++      E+ NA 
Sbjct: 1    MGEQTQVQVQTQTQSQPQPQSQAQNTFH---ESSNSTTPIAQATVMLS------EVMNAP 51

Query: 580  SQNPSPSTKLVPLRPRKIRKLS-------SDTVSAVADGDLDASNFAASVSEDATSTALT 738
            SQ  SP +K+ PLRPRKIRKLS       S  V A+ DG    +   ++ S+ A   A  
Sbjct: 52   SQISSPPSKM-PLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAF 110

Query: 739  PSSMVIVPIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQ 918
             S+ V                                   PP   ARSLS  GE+E AL+
Sbjct: 111  ASATV-----------------------------------PP---ARSLSCEGEVEIALR 132

Query: 919  HLQSVDPHLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEE 1098
            HL++ DP LAQLI+LHQ PTFDSFQTPFLAL++SILYQQLAYKAGTSIY RF+ALCGGE 
Sbjct: 133  HLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEA 192

Query: 1099 GVLPDTVLALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMV 1278
            GVLP+TVLAL PQQLRQIG+SGRK+SYLHDLARKY NGILSD +I+NMDDKSLFTMLTMV
Sbjct: 193  GVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMV 252

Query: 1279 NGIGSWSVHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRS 1458
            NGIGSWSVHMFMIFSLHRPDVLP+NDL +RKGVQLLY LEELPRPSQM+ +C+KWRPYRS
Sbjct: 253  NGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDQLCEKWRPYRS 312

Query: 1459 VASWYLWRFVEAKG 1500
            V SWY+WR  EAKG
Sbjct: 313  VGSWYMWRLAEAKG 326


>ref|XP_002887579.1| hypothetical protein ARALYDRAFT_476667 [Arabidopsis lyrata subsp.
            lyrata] gi|297333420|gb|EFH63838.1| hypothetical protein
            ARALYDRAFT_476667 [Arabidopsis lyrata subsp. lyrata]
          Length = 389

 Score =  385 bits (990), Expect = e-104
 Identities = 214/369 (57%), Positives = 252/369 (68%), Gaps = 3/369 (0%)
 Frame = +1

Query: 409  QIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNA--LS 582
            Q  +QT PQ+Q  S P+P   + +      +DS +S  VS   ++   TTIE P    L 
Sbjct: 8    QPSSQTHPQNQPES-PNPETPNPIPPGTNDDDSASSAGVSGSIVSL--TTIEAPRVTELG 64

Query: 583  QNPSPSTKLVPLRPRKIRKLS-SDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIV 759
               SP +K +PLRPRKIRKLS  D  S   DG     N  A+ S+ A  + L+ S  V V
Sbjct: 65   NVSSPPSK-IPLRPRKIRKLSPDDDASGNGDGFNPEHNLLATTSKPAVKSKLSQSRCVTV 123

Query: 760  PIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDP 939
            P                                  R+ ARSL+  GE+E AL HL+SVDP
Sbjct: 124  P----------------------------------RIHARSLTCEGELEAALHHLRSVDP 149

Query: 940  HLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTV 1119
             LA LI++H PPTF++F TPFLAL +SILYQQLA KAG SIY RFVALCGGE GV+P+ V
Sbjct: 150  LLASLIDIHPPPTFETFHTPFLALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENV 209

Query: 1120 LALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWS 1299
            L LTPQQLRQIGVSGRKASYLHDLARKY NGILSD  I+NMD+KSLFTMLTMVNGIGSWS
Sbjct: 210  LPLTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWS 269

Query: 1300 VHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLW 1479
            VHMFMI SLHRPDVLPVNDLG+RKGVQ+L  +E+LPRPS+ME +C+KWRPYRSVASWY+W
Sbjct: 270  VHMFMINSLHRPDVLPVNDLGVRKGVQMLNAMEDLPRPSKMEQLCEKWRPYRSVASWYMW 329

Query: 1480 RFVEAKGTP 1506
            R +E+KGTP
Sbjct: 330  RLIESKGTP 338


>gb|EMJ24474.1| hypothetical protein PRUPE_ppa007252mg [Prunus persica]
          Length = 376

 Score =  384 bits (987), Expect = e-104
 Identities = 212/369 (57%), Positives = 250/369 (67%)
 Frame = +1

Query: 400  MSDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNAL 579
            M +Q Q QT  Q+Q         Q+    +P   DS+   T++ V    + T  +L NA 
Sbjct: 1    MGEQTQVQTQTQTQ--------PQTPTPTQPPHHDSIDDTTIAEV----SPTPTQLTNAP 48

Query: 580  SQNPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIV 759
            SQ  SP +K +P RPRKIRKLS DT                             SS  IV
Sbjct: 49   SQTSSPPSK-IPFRPRKIRKLSPDTSDP-------------------------NSSQQIV 82

Query: 760  PIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDP 939
             +  + KP                  Q R +   P++ AR LS  GE+E A++HL++ DP
Sbjct: 83   ALPDNPKPLPAAAKSAKSKAV-----QQRALS-APKIAARPLSCEGEVEAAIRHLRNADP 136

Query: 940  HLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTV 1119
             LA LI+LHQ PTFD+FQTPFLAL++SILYQQLAYKAG SIY RFV+LCGGE  V+P+TV
Sbjct: 137  LLAPLIDLHQRPTFDTFQTPFLALTRSILYQQLAYKAGNSIYTRFVSLCGGEACVVPETV 196

Query: 1120 LALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWS 1299
            LA TPQQLRQIGVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSWS
Sbjct: 197  LAQTPQQLRQIGVSGRKASYLHDLARKYQNGILSDAAIVNMDDKSLFTMLTMVNGIGSWS 256

Query: 1300 VHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLW 1479
            VHMFMIFSLHRPDVLP+NDL +RKGVQLLY L+ELPRPSQMEH+C+KWRPYRSVA+ Y+W
Sbjct: 257  VHMFMIFSLHRPDVLPINDLSMRKGVQLLYNLDELPRPSQMEHLCEKWRPYRSVAACYMW 316

Query: 1480 RFVEAKGTP 1506
            RF E+KG P
Sbjct: 317  RFSESKGAP 325


>ref|XP_002306870.2| hypothetical protein POPTR_0005s24930g [Populus trichocarpa]
            gi|550339688|gb|EEE93866.2| hypothetical protein
            POPTR_0005s24930g [Populus trichocarpa]
          Length = 375

 Score =  384 bits (986), Expect = e-104
 Identities = 207/364 (56%), Positives = 254/364 (69%)
 Frame = +1

Query: 415  QAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALSQNPS 594
            Q +  PQSQ        VQS+    P+IE  V + ++S    T   TT EL        S
Sbjct: 4    QTKIQPQSQP-------VQSESQALPQIEAQVQTQSLSQPFNTTTTTTSELTTVPPPITS 56

Query: 595  PSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSD 774
            P  K +P RPRKIRK+S +  +  A+      N + + +   T T  TP+  +  P    
Sbjct: 57   PPAK-IPSRPRKIRKVSPNAAATTANDP----NSSPTSTTTTTETPKTPA--IKTPRTKT 109

Query: 775  SKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQL 954
            S+                     ++V   PR+VARSL+  GE+E A+ +L++ DP LA L
Sbjct: 110  SQ---------------------QLVIATPRIVARSLTCEGELEYAIHYLRNADPLLASL 148

Query: 955  INLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTP 1134
            I+++QPP+FD+F TPFLAL++SILYQQLA+KAG+SIY RF++LCGGE GVLP+TVLALTP
Sbjct: 149  IDIYQPPSFDTFPTPFLALARSILYQQLAFKAGSSIYTRFISLCGGEAGVLPETVLALTP 208

Query: 1135 QQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFM 1314
            QQLRQ GVSGRKASYLHDLARKY NGILSD +I+NMDDKSLFTMLTMVNGIGSWSVHMFM
Sbjct: 209  QQLRQFGVSGRKASYLHDLARKYRNGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFM 268

Query: 1315 IFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEA 1494
            IFSLHRPDVLP+NDL +RKGVQLLY L ELPRPSQM+ +C+KWRPYRSVASWYLWR  E+
Sbjct: 269  IFSLHRPDVLPINDLQVRKGVQLLYNLPELPRPSQMDQLCEKWRPYRSVASWYLWRLQES 328

Query: 1495 KGTP 1506
            KG+P
Sbjct: 329  KGSP 332


>ref|XP_006390331.1| hypothetical protein EUTSA_v10018654mg [Eutrema salsugineum]
            gi|557086765|gb|ESQ27617.1| hypothetical protein
            EUTSA_v10018654mg [Eutrema salsugineum]
          Length = 403

 Score =  382 bits (982), Expect = e-103
 Identities = 207/369 (56%), Positives = 252/369 (68%), Gaps = 1/369 (0%)
 Frame = +1

Query: 403  SDQIQAQTPPQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNALS 582
            S   Q Q  PQS    +P+P+        PE  D+ ++ +  +     ++TTIE P    
Sbjct: 10   SSHTQTQNQPQSPKPENPNPI-------PPETNDNDSASSAGAPGSIVSSTTIEAPRVTE 62

Query: 583  Q-NPSPSTKLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIV 759
              N S +   +PLRPRKIRKLS D          +  + A + + D  +   +PS M+  
Sbjct: 63   LGNVSSTPSKIPLRPRKIRKLSPD----------EDDSGAVNANSDGFNPDHSPSQMM-T 111

Query: 760  PIFSDSKPXXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDP 939
            P+ + +KP                  QSR +   PR+ ARSL+  GE+E A+ HL+SVDP
Sbjct: 112  PLATAAKPASKGKLT-----------QSRALT-VPRIHARSLTCEGELEAAICHLRSVDP 159

Query: 940  HLAQLINLHQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTV 1119
             L  LI++H PPT++SF +PFLAL +SILYQQLA KAG SIY RFVALCGGE  V+P+TV
Sbjct: 160  LLGSLIDIHPPPTYESFHSPFLALIRSILYQQLAAKAGNSIYTRFVALCGGENAVVPETV 219

Query: 1120 LALTPQQLRQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWS 1299
            L LTPQQLRQIGVSGRKASYL+DLARKY NGILSD  I+NMD+KSLFTMLTMVNGIGSWS
Sbjct: 220  LPLTPQQLRQIGVSGRKASYLNDLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWS 279

Query: 1300 VHMFMIFSLHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLW 1479
            VHMFMI SLHRPDVLPVNDLG+RKGVQ+LY L ELPRPSQME +C+KWRPYRSV SWY+W
Sbjct: 280  VHMFMINSLHRPDVLPVNDLGVRKGVQMLYNLPELPRPSQMEQLCEKWRPYRSVGSWYMW 339

Query: 1480 RFVEAKGTP 1506
            R +EAK TP
Sbjct: 340  RLIEAKSTP 348


>gb|AAG12687.1|AC025814_11 3-methyladenine DNA glycosylase, putative; 31680-30045 [Arabidopsis
            thaliana]
          Length = 428

 Score =  381 bits (978), Expect = e-103
 Identities = 210/361 (58%), Positives = 246/361 (68%), Gaps = 2/361 (0%)
 Frame = +1

Query: 430  PQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNA--LSQNPSPST 603
            P+S     P+P+        PE  D  ++ +        ++TTIE P    L    SP T
Sbjct: 19   PESPNHETPNPI-------PPETNDDDSASSAGVSGSIVSSTTIEAPQVTELGNVSSPPT 71

Query: 604  KLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSKP 783
            K +PLRPRKIRKLS D      D   D  N   ++S+  T+   T S +           
Sbjct: 72   K-IPLRPRKIRKLSPD------DDASDGFNPEHNLSQMTTTKPATKSKL----------- 113

Query: 784  XXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLINL 963
                              QSR V   PR+ ARSL+  GE+E AL HL+SVDP LA LI++
Sbjct: 114  -----------------SQSRTVT-VPRIQARSLTCEGELEAALHHLRSVDPLLASLIDI 155

Query: 964  HQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQL 1143
            H PPTF++FQTPFLAL +SILYQQLA KAG SIY RFVALCGGE GV+P+ VL LTPQQL
Sbjct: 156  HPPPTFETFQTPFLALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQL 215

Query: 1144 RQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIFS 1323
            RQIGVSGRKASYLHDLARKY NGILSD  I+NMD+KSLFTMLTMVNGIGSWSVHMFMI S
Sbjct: 216  RQIGVSGRKASYLHDLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINS 275

Query: 1324 LHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKGT 1503
            LHRPDVLPVNDLG+RKGVQ+L G+E+LPRPS+ME +C+KWRPYRSVASWYLWR +E+K T
Sbjct: 276  LHRPDVLPVNDLGVRKGVQMLNGMEDLPRPSKMEQLCEKWRPYRSVASWYLWRLIESKNT 335

Query: 1504 P 1506
            P
Sbjct: 336  P 336


>ref|NP_974147.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
            gi|332197569|gb|AEE35690.1| DNA glycosylase superfamily
            protein [Arabidopsis thaliana]
          Length = 394

 Score =  381 bits (978), Expect = e-103
 Identities = 210/361 (58%), Positives = 246/361 (68%), Gaps = 2/361 (0%)
 Frame = +1

Query: 430  PQSQLLSHPDPLVQSQLHLKPEIEDSVASITVSSVDITAAATTIELPNA--LSQNPSPST 603
            P+S     P+P+        PE  D  ++ +        ++TTIE P    L    SP T
Sbjct: 19   PESPNHETPNPI-------PPETNDDDSASSAGVSGSIVSSTTIEAPQVTELGNVSSPPT 71

Query: 604  KLVPLRPRKIRKLSSDTVSAVADGDLDASNFAASVSEDATSTALTPSSMVIVPIFSDSKP 783
            K +PLRPRKIRKLS D      D   D  N   ++S+  T+   T S +           
Sbjct: 72   K-IPLRPRKIRKLSPD------DDASDGFNPEHNLSQMTTTKPATKSKL----------- 113

Query: 784  XXXXXXXXXXXXXXXXXXQSRVVHHPPRMVARSLSYHGEIERALQHLQSVDPHLAQLINL 963
                              QSR V   PR+ ARSL+  GE+E AL HL+SVDP LA LI++
Sbjct: 114  -----------------SQSRTVT-VPRIQARSLTCEGELEAALHHLRSVDPLLASLIDI 155

Query: 964  HQPPTFDSFQTPFLALSKSILYQQLAYKAGTSIYARFVALCGGEEGVLPDTVLALTPQQL 1143
            H PPTF++FQTPFLAL +SILYQQLA KAG SIY RFVALCGGE GV+P+ VL LTPQQL
Sbjct: 156  HPPPTFETFQTPFLALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQL 215

Query: 1144 RQIGVSGRKASYLHDLARKYNNGILSDESIINMDDKSLFTMLTMVNGIGSWSVHMFMIFS 1323
            RQIGVSGRKASYLHDLARKY NGILSD  I+NMD+KSLFTMLTMVNGIGSWSVHMFMI S
Sbjct: 216  RQIGVSGRKASYLHDLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINS 275

Query: 1324 LHRPDVLPVNDLGIRKGVQLLYGLEELPRPSQMEHMCDKWRPYRSVASWYLWRFVEAKGT 1503
            LHRPDVLPVNDLG+RKGVQ+L G+E+LPRPS+ME +C+KWRPYRSVASWYLWR +E+K T
Sbjct: 276  LHRPDVLPVNDLGVRKGVQMLNGMEDLPRPSKMEQLCEKWRPYRSVASWYLWRLIESKNT 335

Query: 1504 P 1506
            P
Sbjct: 336  P 336

