BLASTX nr result

ID: Achyranthes22_contig00018903 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00018903
         (1156 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe...   396   e-107
ref|XP_002312220.1| methyladenine glycosylase family protein [Po...   393   e-107
ref|XP_002515807.1| DNA-3-methyladenine glycosylase, putative [R...   389   e-106
ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793...   388   e-105
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...   388   e-105
ref|XP_004509996.1| PREDICTED: uncharacterized protein LOC101508...   387   e-105
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...   385   e-104
ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793...   385   e-104
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...   385   e-104
ref|XP_004139917.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   384   e-104
ref|XP_002315089.2| methyladenine glycosylase family protein [Po...   384   e-104
ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298...   383   e-104
ref|XP_004173920.1| PREDICTED: uncharacterized protein LOC101226...   383   e-104
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...   382   e-103
ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago tr...   380   e-103
gb|ESW05455.1| hypothetical protein PHAVU_011G180500g [Phaseolus...   379   e-102
gb|AFK37052.1| unknown [Medicago truncatula]                          379   e-102
gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Th...   377   e-102
gb|EXB51223.1| Putative Glutamine amidotransferase [Morus notabi...   375   e-101
ref|XP_006422312.1| hypothetical protein CICLE_v10005040mg [Citr...   374   e-101

>gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica]
          Length = 426

 Score =  396 bits (1017), Expect = e-107
 Identities = 197/252 (78%), Positives = 221/252 (87%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TE+PGSIAAVRREQ+ALQHAQRKM+IAHYGRSKSA FE  V   V++ GNI+   A    
Sbjct: 172 TEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVP--VDASGNIEAKGAEE-- 227

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ IT NSDP++VAYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDFR 
Sbjct: 228 EKRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRN 287

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDAEIVAN T+KQ+ +I ++Y IDISRVRGVVDN+ RI+EIKKEFGSF+KYIWGFV
Sbjct: 288 AFSDFDAEIVANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFV 347

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N KPISPQYK   KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQA+GLTNDHLITC+
Sbjct: 348 NQKPISPQYKLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCH 407

Query: 721 RHLQCTALASVR 756
           RHLQCT LA+ R
Sbjct: 408 RHLQCTLLAARR 419


>ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa]
           gi|118486806|gb|ABK95238.1| unknown [Populus
           trichocarpa] gi|222852040|gb|EEE89587.1| methyladenine
           glycosylase family protein [Populus trichocarpa]
          Length = 403

 Score =  393 bits (1010), Expect = e-107
 Identities = 192/251 (76%), Positives = 220/251 (87%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPILE 183
           E+PGSIAAVRREQ+ALQHAQRKM+IAHYGRSKSA+FE +V P   S+     +D     E
Sbjct: 147 EAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSARFEDQVVPNDSSISMATKTDQEE--E 204

Query: 184 RRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFREA 363
           +RC+ IT NSDP++VAYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDFR+A
Sbjct: 205 KRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDA 264

Query: 364 FSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFVN 543
           FSGFDAEIVAN++EKQI +I A+Y ID+SRVRGVVDN+ RI+EIKKEFGSF++YIW FVN
Sbjct: 265 FSGFDAEIVANISEKQIMSISAEYGIDMSRVRGVVDNSNRILEIKKEFGSFDRYIWTFVN 324

Query: 544 NKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCYR 723
           NKPIS  YKF  KIPVKTSKSE+ISKDMVRRGFR VGPT VHSFMQAAGLTNDHLITC+R
Sbjct: 325 NKPISTSYKFGHKIPVKTSKSETISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHR 384

Query: 724 HLQCTALASVR 756
           HL CT +A+ R
Sbjct: 385 HLPCTLMAAAR 395


>ref|XP_002515807.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
           gi|223545076|gb|EEF46588.1| DNA-3-methyladenine
           glycosylase, putative [Ricinus communis]
          Length = 404

 Score =  389 bits (1000), Expect = e-106
 Identities = 191/249 (76%), Positives = 215/249 (86%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPILE 183
           ESPGSIAAVRREQ+A QHAQRKM+IAHYGRSKSAKFE+     ++SL NI T       E
Sbjct: 157 ESPGSIAAVRREQMAFQHAQRKMRIAHYGRSKSAKFEANNVFPIDSLTNISTKSDEE--E 214

Query: 184 RRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFREA 363
           +RC  ITPNSDP++VAYHDEEWGVPV DDK+LFELL+LSGAQVGSDWTSIL++RQDFR+A
Sbjct: 215 KRCNFITPNSDPIYVAYHDEEWGVPVRDDKLLFELLVLSGAQVGSDWTSILKKRQDFRDA 274

Query: 364 FSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFVN 543
           FSGFDAEIVA+ TEK + +I  +Y IDI+RVRGVVDN+ R++EIKKEFGSF+KYIW FVN
Sbjct: 275 FSGFDAEIVADFTEKHMISISTEYGIDINRVRGVVDNSNRVLEIKKEFGSFSKYIWAFVN 334

Query: 544 NKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCYR 723
           NKPIS QYKF  KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQAAGLTNDHLITC+R
Sbjct: 335 NKPISTQYKFGHKIPVKTSKSESISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHR 394

Query: 724 HLQCTALAS 750
           HL CT L +
Sbjct: 395 HLPCTLLTA 403


>ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793991 [Glycine max]
          Length = 400

 Score =  388 bits (997), Expect = e-105
 Identities = 195/257 (75%), Positives = 221/257 (85%), Gaps = 1/257 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLG-NIKTSDASPI 177
           TESPGSIAAVRREQ+ALQHAQRKMKIAHYGRSKSAKFE +V PL  S     KTS+    
Sbjct: 145 TESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKFE-RVVPLDPSSNLTSKTSEE--- 200

Query: 178 LERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFR 357
            E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDFR
Sbjct: 201 -EKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFR 259

Query: 358 EAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGF 537
            AFS FD   +ANLT+KQ+ +I  +Y IDIS+VRGVVDNA RI+EI K+FGSF+KYIWGF
Sbjct: 260 AAFSEFDVATLANLTDKQMVSISLEYGIDISQVRGVVDNANRILEINKDFGSFDKYIWGF 319

Query: 538 VNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITC 717
           VN+KPIS QYKF  KIPVKTSKSESISKDM+RRGFR VGPT +HSFMQAAGLTNDHLITC
Sbjct: 320 VNHKPISTQYKFGHKIPVKTSKSESISKDMIRRGFRCVGPTVLHSFMQAAGLTNDHLITC 379

Query: 718 YRHLQCTALASVRHLSS 768
           +RHLQCT LAS  H ++
Sbjct: 380 HRHLQCTLLASSPHCTT 396


>gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris]
          Length = 405

 Score =  388 bits (996), Expect = e-105
 Identities = 193/253 (76%), Positives = 219/253 (86%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           T+SPGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFE +V PL  S   + +       
Sbjct: 150 TDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RVVPLDPSTTTLTSKPTEE-- 206

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTS L++RQDFR 
Sbjct: 207 EKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRQDFRA 266

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDAE VANLT+KQ+ +I ++Y IDISRVRGVVDNA +I+EIKK+FGSF+KYIWGFV
Sbjct: 267 AFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFV 326

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N+KPIS QYKF  KIPVKTSKSESISKDMVRRG+R VGPT VHSFMQAAGLTNDHLITC+
Sbjct: 327 NHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVGPTVVHSFMQAAGLTNDHLITCH 386

Query: 721 RHLQCTALASVRH 759
           RHLQCT LA+  H
Sbjct: 387 RHLQCTLLAARPH 399


>ref|XP_004509996.1| PREDICTED: uncharacterized protein LOC101508282 isoform X1 [Cicer
           arietinum]
          Length = 388

 Score =  387 bits (995), Expect = e-105
 Identities = 189/249 (75%), Positives = 218/249 (87%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TESPGSIAAVRREQ+ALQ+AQRKMKIAHYGRSKSAKFE+ V   +++  N+ +       
Sbjct: 143 TESPGSIAAVRREQMALQNAQRKMKIAHYGRSKSAKFETVVP--IDTSNNLSSKTTEE-- 198

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ ITPNSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDFR 
Sbjct: 199 EKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRI 258

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS F+A  +ANLT+KQ+ +I  +Y IDISRVRGVVDNA RI+E+ K+FGSFNKY+WGFV
Sbjct: 259 AFSEFNASTLANLTDKQMMSISLEYGIDISRVRGVVDNANRILEVNKDFGSFNKYVWGFV 318

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N+KPIS QYKF  KIPVKTSKSESISKDM+RRGFR VGPT VHSFMQAAGLTNDHLITC+
Sbjct: 319 NHKPISTQYKFGHKIPVKTSKSESISKDMIRRGFRFVGPTVVHSFMQAAGLTNDHLITCH 378

Query: 721 RHLQCTALA 747
           RHLQCT LA
Sbjct: 379 RHLQCTLLA 387


>gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
          Length = 409

 Score =  385 bits (990), Expect = e-104
 Identities = 190/249 (76%), Positives = 215/249 (86%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPILE 183
           E+PGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFESKV PL  S    K  +     E
Sbjct: 153 EAPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEE----E 208

Query: 184 RRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFREA 363
           +RC+ ITPNSDP++VAYHDEEWGVPVHDD MLFELL+LSGAQVGSDW SIL++RQDFR+A
Sbjct: 209 KRCSFITPNSDPVYVAYHDEEWGVPVHDDSMLFELLVLSGAQVGSDWISILKKRQDFRDA 268

Query: 364 FSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFVN 543
           FSGFDAE VA  T+K++TTI ++Y IDISRV GVVDN+ RI+E+K +FGSF+KYIWGFVN
Sbjct: 269 FSGFDAETVAKFTDKEMTTISSEYGIDISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVN 328

Query: 544 NKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCYR 723
           +K IS QYKF  KIPVKTSKSESISKDM+RRGFR VGPT VHSFMQAAGLTNDHLITC+R
Sbjct: 329 HKAISTQYKFGHKIPVKTSKSESISKDMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHR 388

Query: 724 HLQCTALAS 750
           HL CT LA+
Sbjct: 389 HLPCTLLAA 397


>ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793449 [Glycine max]
          Length = 398

 Score =  385 bits (990), Expect = e-104
 Identities = 194/257 (75%), Positives = 220/257 (85%), Gaps = 1/257 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLG-NIKTSDASPI 177
           TESPGSIAAVRREQ+ALQHAQRKMKIAHYGRSKSAKF ++V PL  S     KTS+    
Sbjct: 144 TESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKF-ARVIPLEPSTNLTSKTSE---- 198

Query: 178 LERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFR 357
            E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDFR
Sbjct: 199 -EKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFR 257

Query: 358 EAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGF 537
            AFS FDA  +ANLT+KQ+ +I  +Y IDISRVRGVVDNA RI+ I K+FGSF+KYIW F
Sbjct: 258 TAFSEFDAATLANLTDKQMVSISMEYDIDISRVRGVVDNANRILAINKDFGSFDKYIWDF 317

Query: 538 VNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITC 717
           VN+KPIS QYKF  KIPVKTSKSESISKDM+RRGFR VGPT +HSFMQAAGLTNDHLITC
Sbjct: 318 VNHKPISTQYKFGHKIPVKTSKSESISKDMIRRGFRCVGPTVLHSFMQAAGLTNDHLITC 377

Query: 718 YRHLQCTALASVRHLSS 768
           +RHLQCT LAS  H ++
Sbjct: 378 HRHLQCTLLASTPHCTT 394


>ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max]
          Length = 400

 Score =  385 bits (988), Expect = e-104
 Identities = 191/250 (76%), Positives = 218/250 (87%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           T+SPGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFE +V PL  S  ++ +       
Sbjct: 144 TDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEE-- 200

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ ITPNSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTS L++R DFR 
Sbjct: 201 EKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRA 260

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDAE VANLT+KQ+ +I ++Y IDISRVRGVVDNA +I+EIKK+FGSF+KYIWGFV
Sbjct: 261 AFSEFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFV 320

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N+KPIS QYKF  KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQ +GLTNDHLITC+
Sbjct: 321 NHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCH 380

Query: 721 RHLQCTALAS 750
           RHLQCT LA+
Sbjct: 381 RHLQCTLLAA 390


>ref|XP_004139917.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101218536 [Cucumis sativus]
          Length = 400

 Score =  384 bits (987), Expect = e-104
 Identities = 190/252 (75%), Positives = 218/252 (86%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TESPGSIAAVRREQ+ALQ AQRKM+IAHYGRSKSA+FE K+ PL      IK +    + 
Sbjct: 132 TESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE-KIVPLDXLDSKIKPA----VE 186

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           +RRC+ ITPNSDP++VAYHDEEWGVPVHDDKMLFELL+LS AQVGSDWTSIL++RQDFR 
Sbjct: 187 DRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRN 246

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FD+EIVAN ++KQ+ +I  +Y IDI+RVRGVVDNA RI++IKKEFGSF+KYIWGFV
Sbjct: 247 AFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFV 306

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           NNKP SPQYK   KIPVKTSKSE+ISKDMVRRGFR+VGPT VHSFMQAAGLTNDHL TC+
Sbjct: 307 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCH 366

Query: 721 RHLQCTALASVR 756
           RHL CT +A+ R
Sbjct: 367 RHLHCTLIAAGR 378


>ref|XP_002315089.2| methyladenine glycosylase family protein [Populus trichocarpa]
           gi|550330066|gb|EEF01260.2| methyladenine glycosylase
           family protein [Populus trichocarpa]
          Length = 411

 Score =  384 bits (986), Expect = e-104
 Identities = 190/258 (73%), Positives = 222/258 (86%), Gaps = 7/258 (2%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPILE 183
           E+PGSIAAVRREQ+ALQHAQRKM+IAHYGRSKS++FE+KV P+  S+     +D     E
Sbjct: 149 EAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSSRFEAKVVPVDSSINVTTKTDEE---E 205

Query: 184 RRCTSITPNS-------DPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRR 342
           +RC+ IT NS       +P++VAYHD+EWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++
Sbjct: 206 KRCSFITANSGKEKYEMNPIYVAYHDKEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKK 265

Query: 343 RQDFREAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNK 522
           RQDFR+AFSGFDAEIVAN+TEKQ+ +I A+Y I+ISRVRGVVDN+KRI+EIKKEFGSF++
Sbjct: 266 RQDFRDAFSGFDAEIVANITEKQMMSISAEYGIEISRVRGVVDNSKRILEIKKEFGSFDR 325

Query: 523 YIWGFVNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTND 702
           YIW FVNNKP S QYKF  KIPVKTSKSE+ISKDMVRRGFR VGPT VHSFMQA GLTND
Sbjct: 326 YIWTFVNNKPFSNQYKFGHKIPVKTSKSETISKDMVRRGFRFVGPTMVHSFMQAVGLTND 385

Query: 703 HLITCYRHLQCTALASVR 756
           HLITC+RHL CT +A+ R
Sbjct: 386 HLITCHRHLPCTLMAARR 403


>ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca
           subsp. vesca]
          Length = 410

 Score =  383 bits (983), Expect = e-104
 Identities = 188/252 (74%), Positives = 221/252 (87%), Gaps = 2/252 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPL--VESLGNIKTSDASP 174
           TE+PG+IAA RREQ+ALQHAQRKM+IAHYGRS SA FE +V+P+  +E+ G  +      
Sbjct: 160 TEAPGTIAAGRREQMALQHAQRKMRIAHYGRSNSANFE-RVAPIDTMEAKGGEED----- 213

Query: 175 ILERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDF 354
              +RC+ IT NSDP++VAYHD+EWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQDF
Sbjct: 214 --HKRCSFITANSDPIYVAYHDQEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDF 271

Query: 355 REAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWG 534
           R+AFSGFDAE VANLT+KQ+ +IC++Y IDISRVRGVVDN+ RI+E+K+EFGSF+KYIWG
Sbjct: 272 RDAFSGFDAEAVANLTDKQMISICSEYGIDISRVRGVVDNSNRILEVKREFGSFHKYIWG 331

Query: 535 FVNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLIT 714
           FVN+KPISPQYK   KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQA+GLTNDHL T
Sbjct: 332 FVNHKPISPQYKQGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTT 391

Query: 715 CYRHLQCTALAS 750
           C+RHLQCT LA+
Sbjct: 392 CHRHLQCTLLAA 403


>ref|XP_004173920.1| PREDICTED: uncharacterized protein LOC101226717 [Cucumis sativus]
          Length = 397

 Score =  383 bits (983), Expect = e-104
 Identities = 191/253 (75%), Positives = 217/253 (85%), Gaps = 1/253 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TESPGSIAAVRREQ+ALQ AQRKM+IAHYGRSKSA+FE K+ PL         S   P +
Sbjct: 132 TESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE-KIVPL--------DSKIKPAV 182

Query: 181 E-RRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFR 357
           E RRC+ ITPNSDP++VAYHDEEWGVPVHDDKMLFELL+LS AQVGSDWTSIL++RQDFR
Sbjct: 183 EDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFR 242

Query: 358 EAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGF 537
            AFS FD+EIVAN ++KQ+ +I  +Y IDI+RVRGVVDNA RI++IKKEFGSF+KYIWGF
Sbjct: 243 NAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGF 302

Query: 538 VNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITC 717
           VNNKP SPQYK   KIPVKTSKSE+ISKDMVRRGFR+VGPT VHSFMQAAGLTNDHL TC
Sbjct: 303 VNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 362

Query: 718 YRHLQCTALASVR 756
           +RHL CT +A+ R
Sbjct: 363 HRHLHCTLIAAGR 375


>ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max]
          Length = 400

 Score =  382 bits (982), Expect = e-103
 Identities = 190/250 (76%), Positives = 218/250 (87%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           T+SPGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFE +V PL  S  ++ +       
Sbjct: 149 TDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEE-- 205

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTS L++R DFR 
Sbjct: 206 EKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRA 265

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDAE VANLT+KQ+ +I ++Y IDISRVRGVVDNA +I+EIKK+FGSF+KYIWGFV
Sbjct: 266 AFSEFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFV 325

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N+KP+S QYKF  KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQA+GLTNDHLITC+
Sbjct: 326 NHKPLSTQYKFGHKIPVKTSKSESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCH 385

Query: 721 RHLQCTALAS 750
           RHLQCT LA+
Sbjct: 386 RHLQCTLLAA 395


>ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago truncatula]
           gi|355484972|gb|AES66175.1| DNA-3-methyladenine
           glycosylase [Medicago truncatula]
          Length = 390

 Score =  380 bits (976), Expect = e-103
 Identities = 189/252 (75%), Positives = 221/252 (87%), Gaps = 1/252 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLG-NIKTSDASPI 177
           T+SPGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFE +V P+  S   + KT++    
Sbjct: 143 TDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAKFE-RVFPIDPSSALDSKTTNQE-- 199

Query: 178 LERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFR 357
            E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTS L++R DFR
Sbjct: 200 -EKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDFR 258

Query: 358 EAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGF 537
            AFS FDAEIVANLT+KQ+ +I ++Y IDIS+VRGVVDNA +I++++K FGSF+KYIWGF
Sbjct: 259 AAFSEFDAEIVANLTDKQMMSISSEYGIDISKVRGVVDNANQILQVRKGFGSFDKYIWGF 318

Query: 538 VNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITC 717
           VN+KPIS QYKF  KIPVKTSKSESISKDM++RGFR VGPT VHSFMQAAGLTNDHLITC
Sbjct: 319 VNHKPISNQYKFGHKIPVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHLITC 378

Query: 718 YRHLQCTALASV 753
           +RHLQCT LA++
Sbjct: 379 HRHLQCTLLAAI 390


>gb|ESW05455.1| hypothetical protein PHAVU_011G180500g [Phaseolus vulgaris]
          Length = 392

 Score =  379 bits (973), Expect = e-102
 Identities = 191/253 (75%), Positives = 218/253 (86%), Gaps = 2/253 (0%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TESPGSIAAVRREQ+ALQHAQRKMKIAHYGRSKSAKFE KV PL     +I T+  S   
Sbjct: 146 TESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKFE-KVVPL-----DISTNLTSKTC 199

Query: 181 E--RRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDF 354
           E  +RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELL++SGAQVGSDWTSIL++RQDF
Sbjct: 200 EEEKRCSFITANSDPVYIAYHDEEWGVPVHDDKMLFELLVVSGAQVGSDWTSILKKRQDF 259

Query: 355 REAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWG 534
           R AFS FDA  +AN+T+KQ+ +I  +Y IDISRVRGVVDNA RI EI K+FGSF+KYIWG
Sbjct: 260 RTAFSEFDAATLANMTDKQMVSISLEYGIDISRVRGVVDNANRISEINKDFGSFDKYIWG 319

Query: 535 FVNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLIT 714
           FVN+KPIS QYKF  KIPVKTSKS+SISKDM+RRGFR VGPT +HSFMQAAGLTNDHLIT
Sbjct: 320 FVNHKPISTQYKFGHKIPVKTSKSDSISKDMLRRGFRFVGPTVLHSFMQAAGLTNDHLIT 379

Query: 715 CYRHLQCTALASV 753
           C+RHLQCT  +S+
Sbjct: 380 CHRHLQCTRESSL 392


>gb|AFK37052.1| unknown [Medicago truncatula]
          Length = 390

 Score =  379 bits (973), Expect = e-102
 Identities = 188/251 (74%), Positives = 219/251 (87%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           T+SPGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFE +V P+  S  +   S  +   
Sbjct: 143 TDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAKFE-RVFPIDPS--SALDSKITNQE 199

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ IT NSDP+++AYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTS L++R DFR 
Sbjct: 200 EKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDFRA 259

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDAEIVANLT+KQ+ +I ++Y IDIS+VRGVVDNA +I++++K FGSF+KYIWGFV
Sbjct: 260 AFSEFDAEIVANLTDKQMMSISSEYGIDISKVRGVVDNANQILQVRKGFGSFDKYIWGFV 319

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N+KPIS QYKF  KIPVKTSKSESISKDM++RGFR VGPT VHSFMQAAGLTNDHLITC+
Sbjct: 320 NHKPISNQYKFGHKIPVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHLITCH 379

Query: 721 RHLQCTALASV 753
           RHLQCT LA++
Sbjct: 380 RHLQCTLLAAI 390


>gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
          Length = 413

 Score =  377 bits (968), Expect = e-102
 Identities = 189/253 (74%), Positives = 214/253 (84%), Gaps = 4/253 (1%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPILE 183
           E+PGSIAAVRREQ+ALQ AQRKMKIAHYGRSKSAKFESKV PL  S    K  +     E
Sbjct: 153 EAPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEE----E 208

Query: 184 RRCTSITPNSD----PLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQD 351
           +RC+ ITPNS     P++VAYHDEEWGVPVHDD MLFELL+LSGAQVGSDW SIL++RQD
Sbjct: 209 KRCSFITPNSGIAIYPVYVAYHDEEWGVPVHDDSMLFELLVLSGAQVGSDWISILKKRQD 268

Query: 352 FREAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIW 531
           FR+AFSGFDAE VA  T+K++TTI ++Y IDISRV GVVDN+ RI+E+K +FGSF+KYIW
Sbjct: 269 FRDAFSGFDAETVAKFTDKEMTTISSEYGIDISRVLGVVDNSNRILEVKGQFGSFDKYIW 328

Query: 532 GFVNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLI 711
           GFVN+K IS QYKF  KIPVKTSKSESISKDM+RRGFR VGPT VHSFMQAAGLTNDHLI
Sbjct: 329 GFVNHKAISTQYKFGHKIPVKTSKSESISKDMLRRGFRCVGPTVVHSFMQAAGLTNDHLI 388

Query: 712 TCYRHLQCTALAS 750
           TC+RHL CT LA+
Sbjct: 389 TCHRHLPCTLLAA 401


>gb|EXB51223.1| Putative Glutamine amidotransferase [Morus notabilis]
          Length = 458

 Score =  375 bits (962), Expect = e-101
 Identities = 185/252 (73%), Positives = 217/252 (86%)
 Frame = +1

Query: 1   TESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL 180
           TESPGSIAAVRREQ+ALQ AQRKM+IAHYGRSKSAKFE +V P+  +      ++ +   
Sbjct: 187 TESPGSIAAVRREQMALQQAQRKMRIAHYGRSKSAKFE-RVVPIDNNSSLDLMANKTAEE 245

Query: 181 ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQDFRE 360
           E+RC+ IT NSDP++VAYHDEEWGVPVHDDKMLFELL+LSGAQVGSDWTSIL++RQ+FR+
Sbjct: 246 EKRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQEFRK 305

Query: 361 AFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYIWGFV 540
           AFS FDA+IVA+ T+KQ+ +I +++  DISRVRGVVDN+ RI+EIKKE GS  KY+WGFV
Sbjct: 306 AFSEFDAQIVASFTDKQMISISSEFGFDISRVRGVVDNSNRILEIKKELGSLEKYVWGFV 365

Query: 541 NNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHLITCY 720
           N KPIS QYK  Q+IPVKTSKSE+ISKD+VRRGFR VGPT VHSFMQAAGLTNDHLITC+
Sbjct: 366 NQKPISTQYKSGQRIPVKTSKSETISKDLVRRGFRFVGPTVVHSFMQAAGLTNDHLITCH 425

Query: 721 RHLQCTALASVR 756
           RHLQCT LAS R
Sbjct: 426 RHLQCTLLASRR 437


>ref|XP_006422312.1| hypothetical protein CICLE_v10005040mg [Citrus clementina]
           gi|568881790|ref|XP_006493734.1| PREDICTED:
           uncharacterized protein LOC102621461 [Citrus sinensis]
           gi|557524185|gb|ESR35552.1| hypothetical protein
           CICLE_v10005040mg [Citrus clementina]
          Length = 420

 Score =  374 bits (961), Expect = e-101
 Identities = 186/254 (73%), Positives = 216/254 (85%), Gaps = 5/254 (1%)
 Frame = +1

Query: 4   ESPGSIAAVRREQIALQHAQRKMKIAHYGRSKSAKFESKVSPLVESLGNIKTSDASPIL- 180
           +SPGSIAAVRREQ+ALQHAQRKM+IAHYGRSKSAKFESKV PL +   N   + ++P   
Sbjct: 146 DSPGSIAAVRREQMALQHAQRKMRIAHYGRSKSAKFESKVVPLFDPNNN-NAARSTPTTG 204

Query: 181 ----ERRCTSITPNSDPLFVAYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSILRRRQ 348
               E+RC+ IT  SDP+FVAYHDEEWGVPV +D MLFELL+LSGAQVGSDWTSIL++RQ
Sbjct: 205 EQQEEKRCSFITAYSDPIFVAYHDEEWGVPVRNDNMLFELLVLSGAQVGSDWTSILKKRQ 264

Query: 349 DFREAFSGFDAEIVANLTEKQITTICAQYAIDISRVRGVVDNAKRIIEIKKEFGSFNKYI 528
            FR+AFSGF+AE VA L++KQ+ +I  +Y+ID+SRVRGVVDN+ RI+E+K+ FGSF KYI
Sbjct: 265 GFRDAFSGFEAETVAKLSDKQMMSISTEYSIDMSRVRGVVDNSNRILEVKRVFGSFEKYI 324

Query: 529 WGFVNNKPISPQYKFAQKIPVKTSKSESISKDMVRRGFRTVGPTAVHSFMQAAGLTNDHL 708
           WGFVN+KPIS QYKF  KIPVKTSKSESISKDMVRRGFR VGPT VHSFMQAAGLTNDHL
Sbjct: 325 WGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQAAGLTNDHL 384

Query: 709 ITCYRHLQCTALAS 750
           I C+RHL CT LA+
Sbjct: 385 IICHRHLPCTLLAA 398

