BLASTX nr result

ID: Akebia27_contig00013481 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00013481
         (1087 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007045734.1| DNA glycosylase superfamily protein isoform ...   368   3e-99
ref|XP_002267123.1| PREDICTED: probable GMP synthase [glutamine-...   364   4e-98
ref|XP_002314913.2| hypothetical protein POPTR_0010s14710g [Popu...   362   2e-97
ref|XP_006484265.1| PREDICTED: uncharacterized protein LOC102627...   357   4e-96
ref|XP_006437842.1| hypothetical protein CICLE_v10032151mg [Citr...   357   6e-96
ref|XP_007223145.1| hypothetical protein PRUPE_ppa009020mg [Prun...   350   6e-94
ref|XP_006379720.1| hypothetical protein POPTR_0008s11150g [Popu...   346   9e-93
ref|XP_004297192.1| PREDICTED: probable GMP synthase [glutamine-...   344   4e-92
ref|XP_002514580.1| DNA-3-methyladenine glycosylase, putative [R...   340   6e-91
ref|XP_004135425.1| PREDICTED: probable GMP synthase [glutamine-...   338   2e-90
ref|XP_003516830.1| PREDICTED: uncharacterized protein LOC100810...   335   2e-89
ref|NP_001240008.1| uncharacterized protein LOC100813637 [Glycin...   335   3e-89
ref|XP_002890014.1| methyladenine glycosylase family protein [Ar...   334   4e-89
ref|XP_006304034.1| hypothetical protein CARUB_v10009802mg [Caps...   333   7e-89
ref|XP_004509952.1| PREDICTED: probable GMP synthase [glutamine-...   333   1e-88
ref|NP_973818.1| putative 3-methyladenine glycosylase I [Arabido...   331   4e-88
gb|AFK34294.1| unknown [Lotus japonicus]                              330   8e-88
ref|XP_007153437.1| hypothetical protein PHAVU_003G035100g [Phas...   328   3e-87
ref|XP_006417105.1| hypothetical protein EUTSA_v10008250mg [Eutr...   325   2e-86
ref|XP_007045735.1| DNA glycosylase superfamily protein isoform ...   317   7e-84

>ref|XP_007045734.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
           gi|508709669|gb|EOY01566.1| DNA glycosylase superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 323

 Score =  368 bits (944), Expect = 3e-99
 Identities = 187/326 (57%), Positives = 240/326 (73%), Gaps = 14/326 (4%)
 Frame = +3

Query: 9   LFSSMSKGNVRKQVLERSKTLREKEKPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXX 188
           +FSSMSK NVR+ +LE++++ +EKEKP+QS +SKHLKK+YP+G+Q+              
Sbjct: 2   IFSSMSKANVRRHILEKNRSPKEKEKPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSL 61

Query: 189 XXXXPNGPLSLVNRRSTGT----TLSLRQLGPP-ERKEKSVSVVNVVQD---------CL 326
                +   SL +  ST      +L+L  + P  ER+E  V VV  VQ            
Sbjct: 62  SQNSNDS--SLTDHSSTPLEQKISLALSLIAPHHERREFVVPVVKSVQHHHHQQQQQPSQ 119

Query: 327 DSDDGSFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKK 506
           D   G  +RC+W+TK SD+VYV FHDE WGVPVYDDNQLFELLA SGMLM + WTEILK+
Sbjct: 120 DPGSGELRRCNWVTKNSDKVYVSFHDEQWGVPVYDDNQLFELLALSGMLMDYNWTEILKR 179

Query: 507 KEQYREAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSF 686
           KE YREAF+ FDP +VAKM +KEI EI ++K + LAES+VRCIVDNAKCILKI++E+GSF
Sbjct: 180 KELYREAFSGFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSF 239

Query: 687 SNYLWGYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMT 866
           S+++WGY+NYKP INRY+YP++VPLR+PKAEAIS+DL +RG RFVGPVIV +FMQAAG+T
Sbjct: 240 SSFMWGYVNYKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLT 299

Query: 867 MDHLVDCFRFSECVSMADKSMGAWFH 944
           +DHLVDCFR+SECV +A++    W H
Sbjct: 300 IDHLVDCFRYSECVGLAER---PWRH 322


>ref|XP_002267123.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like
           [Vitis vinifera]
          Length = 318

 Score =  364 bits (934), Expect = 4e-98
 Identities = 179/303 (59%), Positives = 229/303 (75%), Gaps = 2/303 (0%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXX 200
           MSKGNVR+  LE++++++E+EKP+Q F+S++L+K+YPL +QK                  
Sbjct: 1   MSKGNVRRLFLEKNRSIKEQEKPNQGFLSRNLRKIYPLSLQKSTSSLSLSSLSLSLSQNS 60

Query: 201 PNGPLS-LVNRRSTGTTLSLRQLGPPERKEKSVSVVNVVQD-CLDSDDGSFKRCHWITKK 374
            +  L   +        LSLR +GPPER+E  V++ NV Q    D  DG  KRC+WITK 
Sbjct: 61  NDSSLKDYITPLDRQIALSLRLIGPPERREVPVAITNVPQQPSPDVGDGELKRCNWITKN 120

Query: 375 SDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVV 554
           SD+VYV FHDE WGVPVY+DNQLFELLA SGMLM + WTEILK+KE  R+AF+ FDPN V
Sbjct: 121 SDKVYVQFHDECWGVPVYEDNQLFELLAMSGMLMDYNWTEILKRKELLRDAFSGFDPNTV 180

Query: 555 AKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINR 734
           A+M EKEI E  +NK L LAES+VRCIVDNAKCI KI+++FGSFS+Y+WGY+N+KPMI R
Sbjct: 181 AQMGEKEITETASNKALMLAESRVRCIVDNAKCIQKIVRQFGSFSSYIWGYVNHKPMIIR 240

Query: 735 YRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSM 914
            RYP+SVPLR+PK+EAIS+DL +RG R VGPVIVY+FMQAAGMT DHL+DCFR+ EC+++
Sbjct: 241 CRYPRSVPLRTPKSEAISRDLIKRGFRLVGPVIVYSFMQAAGMTNDHLIDCFRYRECLNL 300

Query: 915 ADK 923
           A +
Sbjct: 301 AHR 303


>ref|XP_002314913.2| hypothetical protein POPTR_0010s14710g [Populus trichocarpa]
           gi|550329819|gb|EEF01084.2| hypothetical protein
           POPTR_0010s14710g [Populus trichocarpa]
          Length = 317

 Score =  362 bits (928), Expect = 2e-97
 Identities = 189/319 (59%), Positives = 230/319 (72%), Gaps = 11/319 (3%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTL-REKEKP---SQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXX 188
           M K NVRKQ+LE++  L +EKEKP   +Q   SKHLK+VYP+G+ +              
Sbjct: 1   MYKANVRKQILEKNNILIKEKEKPISNTQGLFSKHLKRVYPIGLHRSTSSLSLSSVSLSL 60

Query: 189 XXXXPNGPL--SLVNRRSTGTTLSLRQLGPPERKEKSVS-----VVNVVQDCLDSDDGSF 347
                +  L  S         +L+LR + P ER+E  V+          Q   DS+DG  
Sbjct: 61  SQNSNDSSLTDSSAVPLEQKISLALRLISPLERREVPVARNFQPQQQQQQQNQDSNDGEV 120

Query: 348 KRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREA 527
           KRC+WITK SD+VYV FHDE WGVPVYDDNQLFELLA SGMLM + WTEILK+KE +REA
Sbjct: 121 KRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREA 180

Query: 528 FAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGY 707
           F  FDPN+VAKM EKEI EI +NK + LAES+VRCIVDN+KCILKI +EFGSFSNY+WG 
Sbjct: 181 FEGFDPNIVAKMGEKEIMEIASNKAIMLAESRVRCIVDNSKCILKIAREFGSFSNYMWGN 240

Query: 708 MNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDC 887
           +N+KP INRY+YP++VPLRSPKAEAISKDL +RG RF GPVIVY+FMQAAG+T+DHLVDC
Sbjct: 241 VNFKPTINRYKYPRNVPLRSPKAEAISKDLLKRGFRFAGPVIVYSFMQAAGLTIDHLVDC 300

Query: 888 FRFSECVSMADKSMGAWFH 944
           FR+SECVS+A++    W H
Sbjct: 301 FRYSECVSLAER---PWRH 316


>ref|XP_006484265.1| PREDICTED: uncharacterized protein LOC102627575 isoform X1 [Citrus
           sinensis]
          Length = 317

 Score =  357 bits (917), Expect = 4e-96
 Identities = 188/323 (58%), Positives = 233/323 (72%), Gaps = 15/323 (4%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXX 197
           MSK NVR+ +LE++++ +EKE KP+QS +SKHLKKVYP+G+ +                 
Sbjct: 1   MSKANVRRHILEKNRSPKEKEPKPTQSLLSKHLKKVYPIGLHR---SSSSLSLSSLSLSL 57

Query: 198 XPNGPLSLVNRRSTG-----TTLSLRQLGPPERKEKSVSVVNV---------VQDCLDSD 335
             N   S V   S        +L+LR + PPER+E +V+  NV          Q   DS 
Sbjct: 58  SQNSNDSSVTDNSNSPLEQRISLALRLITPPERREVTVA-KNVQPQQQQQQQQQQSQDSC 116

Query: 336 DGSFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQ 515
            G  KRC+WITK SD VYV FHDE WGVPVYDDNQLFELLA SGMLM + WTEILK+KE 
Sbjct: 117 CGELKRCNWITKNSDRVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKEL 176

Query: 516 YREAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNY 695
           +REAF  FDP  VAKM EKEI EI +N  + LAE +VRCIVDNAKCI+KI+ EFGSFS++
Sbjct: 177 FREAFGGFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSF 236

Query: 696 LWGYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDH 875
           +WGY+N+KPMIN++RYP++VPLRSPKAEAIS+DL +RG R VGPVIVY+FMQAAG+T+DH
Sbjct: 237 MWGYVNFKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDH 296

Query: 876 LVDCFRFSECVSMADKSMGAWFH 944
           LVDCFR+SECVS+A++    W H
Sbjct: 297 LVDCFRYSECVSLAER---PWRH 316


>ref|XP_006437842.1| hypothetical protein CICLE_v10032151mg [Citrus clementina]
           gi|557540038|gb|ESR51082.1| hypothetical protein
           CICLE_v10032151mg [Citrus clementina]
          Length = 317

 Score =  357 bits (915), Expect = 6e-96
 Identities = 184/320 (57%), Positives = 233/320 (72%), Gaps = 12/320 (3%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXX 197
           MSK NVR+ +LE++++ +EKE KP+QS +SKHLKKVYP+G+ +                 
Sbjct: 1   MSKANVRRHILEKNRSPKEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQN 60

Query: 198 XPNGPL--SLVNRRSTGTTLSLRQLGPPERKEKSVSVVNV---------VQDCLDSDDGS 344
             +  +  +  +      +L+LR + PPER+E +V+  NV          Q   DS  G 
Sbjct: 61  SNDSSVTDNYNSPLEQRISLALRLITPPERREVTVA-KNVQPQQQQQQQQQQSQDSCCGE 119

Query: 345 FKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYRE 524
            KRC+WITK SD VYV FHDE WGVPVYDDNQLFELLA SGMLM + WTEILK+KE +RE
Sbjct: 120 LKRCNWITKNSDRVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFRE 179

Query: 525 AFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWG 704
           AF  FDP  VAKM EKEI EI +N  + LAE +VRCIVDNAKCI+KI+ EFGSFS+++WG
Sbjct: 180 AFGGFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIMKILNEFGSFSSFMWG 239

Query: 705 YMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVD 884
           Y+N+KPMIN++RYP++VPLRSPKAEAIS+DL +RG R VGPVIVY+FMQAAG+T+DHLVD
Sbjct: 240 YVNFKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDHLVD 299

Query: 885 CFRFSECVSMADKSMGAWFH 944
           CFR+SECVS+A++    W H
Sbjct: 300 CFRYSECVSLAER---PWRH 316


>ref|XP_007223145.1| hypothetical protein PRUPE_ppa009020mg [Prunus persica]
           gi|462420081|gb|EMJ24344.1| hypothetical protein
           PRUPE_ppa009020mg [Prunus persica]
          Length = 310

 Score =  350 bits (898), Expect = 6e-94
 Identities = 178/315 (56%), Positives = 228/315 (72%), Gaps = 7/315 (2%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXX 200
           MS+ NVR+ VL  +K L+E+EK S     KHLK++YP+G+ K                  
Sbjct: 1   MSRANVRRHVLLENKVLKEREKTSSP---KHLKRIYPIGLHKSTSSLSLSLSSSLSLSLS 57

Query: 201 PNGPLSLVNRRST---GTTLSLRQLGPPERKEKSVSVVNVVQDCL----DSDDGSFKRCH 359
            N   S +   ST     + +LR + P +R+E +  V  VVQ  +    D++D   KRC+
Sbjct: 58  ENSYDSSLTDSSTLDQKISAALRFIAPTQRREYNSPVAKVVQQQISQAQDTNDEELKRCN 117

Query: 360 WITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKF 539
           WITK SD+VYV FHDE WGVP YDDNQLFELLA SGMLM H WTEI+K++E +REAF  F
Sbjct: 118 WITKNSDKVYVAFHDECWGVPAYDDNQLFELLALSGMLMDHNWTEIVKRRELFREAFFGF 177

Query: 540 DPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYK 719
           DPN VAKM EKEI EI +NK + LAE +VRCI+DNAKCILKI++E GSFS+Y+WG +N+K
Sbjct: 178 DPNKVAKMGEKEIAEIASNKAIMLAECKVRCIIDNAKCILKIVRECGSFSSYMWGSVNHK 237

Query: 720 PMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFS 899
           P+INR+RYP++VPLRSPKAEA+SKDL +RG R+VGPVIVY+FMQAAG+T+DHLVDC+R+S
Sbjct: 238 PVINRFRYPRNVPLRSPKAEAMSKDLIKRGFRYVGPVIVYSFMQAAGLTIDHLVDCYRYS 297

Query: 900 ECVSMADKSMGAWFH 944
           ECVS+A++    W H
Sbjct: 298 ECVSLAER---PWRH 309


>ref|XP_006379720.1| hypothetical protein POPTR_0008s11150g [Populus trichocarpa]
           gi|550332834|gb|ERP57517.1| hypothetical protein
           POPTR_0008s11150g [Populus trichocarpa]
          Length = 320

 Score =  346 bits (888), Expect = 9e-93
 Identities = 180/316 (56%), Positives = 227/316 (71%), Gaps = 15/316 (4%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKT-LREKEKP--SQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXX 191
           MSK NVRKQ+LE++   ++EKEKP  SQ   +KHLK+VYP+G+ +               
Sbjct: 1   MSKANVRKQILEKNSIFIKEKEKPLSSQGLFTKHLKRVYPIGLHRSSSSLSLSSVSLSLS 60

Query: 192 XXXPNGPLSLVNRR--STGTTLSLRQLGPPERKEKSVS----------VVNVVQDCLDSD 335
               +  L+  +        +L+LR + P ER+E  V+               QD   S+
Sbjct: 61  QNSNDSSLTDCSATPLEQKISLALRLISPSERREVPVARNFQTRQQRQQQQQKQD-QGSN 119

Query: 336 DGSFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQ 515
           DG  KRC+WITK SD+VYV FHDE WGVPVYDD QLFELLA SGMLM + WTEILK+KE 
Sbjct: 120 DGELKRCNWITKNSDKVYVAFHDEFWGVPVYDDIQLFELLALSGMLMDYNWTEILKRKEL 179

Query: 516 YREAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNY 695
           +REAF  F+PN+VAK  EKEI EI +NK + LAES+VRCIVDNA+C+LKI +EFGSFSNY
Sbjct: 180 FREAFDGFNPNIVAKKGEKEIMEIASNKAIMLAESRVRCIVDNARCLLKIAREFGSFSNY 239

Query: 696 LWGYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDH 875
           +WG +N+KP INRY+YP++V LRSPKAEAISKDL +RG RFVGPVIVY+FMQAAG+T+DH
Sbjct: 240 MWGNVNFKPTINRYKYPRNVQLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDH 299

Query: 876 LVDCFRFSECVSMADK 923
           LVDC+R+ ECVS+A++
Sbjct: 300 LVDCYRYGECVSLAER 315


>ref|XP_004297192.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like
           [Fragaria vesca subsp. vesca]
          Length = 319

 Score =  344 bits (882), Expect = 4e-92
 Identities = 175/321 (54%), Positives = 234/321 (72%), Gaps = 13/321 (4%)
 Frame = +3

Query: 21  MSKGNVRKQV--LERSKTLREKEKPSQSFIS----KHLKKVYPLGIQKXXXXXXXXXXXX 182
           MSK NVR+Q+  LE++K  +EK   + +F      KHLK++YP+G+ +            
Sbjct: 1   MSKANVRRQLVLLEKNKVPKEKSTTTTAFSPIFSYKHLKRIYPIGLHRSSSSSSSLSLSS 60

Query: 183 XXXXXXPNG--PLSLVNRRST---GTTLSLRQLGPPERKEKSVSVV--NVVQDCLDSDDG 341
                  N     S+++  S      +L+LR + PP+R+E  V  V     Q   D+D+G
Sbjct: 61  LSLSLSENSIDSSSIIDSASPLEQKISLALRLIAPPQRRESPVPKVVQQQSQTFQDTDNG 120

Query: 342 SFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYR 521
             +RC+WITK SD+VYV FHDE WGVPVYDDNQLFELLA SGMLM H WTEI+K++E +R
Sbjct: 121 ELRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDHNWTEIVKRRELFR 180

Query: 522 EAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLW 701
           EAF+ FDPN+VAKM E+EI+EI +NK L L + +VRCIV+NAKCILKI++E GSFS+Y+W
Sbjct: 181 EAFSGFDPNIVAKMGEEEIEEIASNKALMLPDCKVRCIVENAKCILKIVRECGSFSSYMW 240

Query: 702 GYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLV 881
           G +N+KP+INR+RYP++VPLRSPKAEA+SKDL +RG R+VGPVIVY+FMQAAG+T+DHLV
Sbjct: 241 GSVNHKPVINRFRYPRNVPLRSPKAEAMSKDLIKRGFRYVGPVIVYSFMQAAGLTIDHLV 300

Query: 882 DCFRFSECVSMADKSMGAWFH 944
           DC+R++ECVS+A++    W H
Sbjct: 301 DCYRYNECVSLAER---PWRH 318


>ref|XP_002514580.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
           gi|223546184|gb|EEF47686.1| DNA-3-methyladenine
           glycosylase, putative [Ricinus communis]
          Length = 319

 Score =  340 bits (872), Expect = 6e-91
 Identities = 177/321 (55%), Positives = 225/321 (70%), Gaps = 13/321 (4%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTL-REKEKPSQS---FISKHLKKVYPLGIQKXXXXXXXXXXXXXX 188
           MSK  VRKQVLE+      EKE+ + +   F SK+LKKVYP+G+ +              
Sbjct: 1   MSKATVRKQVLEKKSIFTNEKERTTSNQLGFFSKNLKKVYPIGLHRSNSSLSLSSVSLSL 60

Query: 189 XXXXPNGPLSLVNRRSTGT--TLSLRQLGPPERKEKSVSVVNVVQD-------CLDSDDG 341
                +  L+  +        +L+LR + P ER+E      NV Q          +S+ G
Sbjct: 61  SENSNDSSLTDYSNTPLDQKISLALRLITPLERREVPALSRNVQQQQQQQQQQSQESNGG 120

Query: 342 SFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYR 521
             +RC+WITK SD+VYV FHDE WGVPVYDDNQLFELLA SGMLM + WTEILK+K+ +R
Sbjct: 121 EIRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKQLFR 180

Query: 522 EAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLW 701
           EAFA FDPN+VA M EKEI +I +NK + LA+S+VRCIVDNAKCI KI +EFGSFS+++W
Sbjct: 181 EAFAGFDPNIVANMGEKEILDIASNKAIMLADSRVRCIVDNAKCIAKIAREFGSFSSFMW 240

Query: 702 GYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLV 881
           G++NYKP IN+Y+YP++VPLR+PKAEAISKDL +RG RFVGPVIVY+FMQAAG+T+DHLV
Sbjct: 241 GHVNYKPTINKYKYPRNVPLRTPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLV 300

Query: 882 DCFRFSECVSMADKSMGAWFH 944
           DCFR  ECV +A++    W H
Sbjct: 301 DCFRHGECVGLAER---PWRH 318


>ref|XP_004135425.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like
           [Cucumis sativus] gi|449531521|ref|XP_004172734.1|
           PREDICTED: probable GMP synthase
           [glutamine-hydrolyzing]-like [Cucumis sativus]
          Length = 308

 Score =  338 bits (867), Expect = 2e-90
 Identities = 162/308 (52%), Positives = 222/308 (72%), Gaps = 1/308 (0%)
 Frame = +3

Query: 24  SKGNVRKQVLERSKTLREKEKPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXXP 203
           SK  VR+ +LER    +EK++ SQ+ +SKHLKK+YP+G+Q+                   
Sbjct: 3   SKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQNSN 62

Query: 204 NGPLSLVN-RRSTGTTLSLRQLGPPERKEKSVSVVNVVQDCLDSDDGSFKRCHWITKKSD 380
           +  L+  + +     + ++R + PP  + +     ++ Q   +  DG  +RC+WIT  SD
Sbjct: 63  DSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHTSD 122

Query: 381 EVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVVAK 560
           + YV FHDE WGVPVYDDN+LFELLA SGMLM + WTEI+K++E +REAFA F+P+VVA 
Sbjct: 123 KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVVAN 182

Query: 561 MSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINRYR 740
           M EKEI ++ ++K + L ES+VRCIVDNAKCILKI ++FGSFSNY+W Y+N+KP INR+R
Sbjct: 183 MGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRFR 242

Query: 741 YPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSMAD 920
           +P++VPLRSPKAEAISKD+ +RG RFVGPVIVY+FMQAAG+T+DHL+DCFR  ECV++A+
Sbjct: 243 HPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNLAE 302

Query: 921 KSMGAWFH 944
           +    W H
Sbjct: 303 R---PWRH 307


>ref|XP_003516830.1| PREDICTED: uncharacterized protein LOC100810677 [Glycine max]
          Length = 314

 Score =  335 bits (860), Expect = 2e-89
 Identities = 168/316 (53%), Positives = 224/316 (70%), Gaps = 8/316 (2%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKP-SQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXX 197
           MSK NVR+  LE+ ++++E +K  + +F +++LKKVYP+G+QK                 
Sbjct: 1   MSKTNVRRHALEKCRSVKETQKVLNHNFFTRNLKKVYPIGLQKSTSSLSLSSISLSLSQN 60

Query: 198 X-PNGPLSLVNRRSTGTTLSLRQLGPPERKEKSVSVVNVVQDCLD------SDDGSFKRC 356
              +     +       +L+LR + P ER+E +++    +Q          ++ G  KRC
Sbjct: 61  SNDSSQADSLTPLDEKISLALRLISPRERREPTIATSKPLQQQQPPSPPPTTEPGELKRC 120

Query: 357 HWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAK 536
           +WITK SD+ Y+ FHDE WGVP YDDN+LFELLA SG+LM + WTEILK+KE  RE FA 
Sbjct: 121 NWITKSSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILKRKETLREVFAG 180

Query: 537 FDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNY 716
           FD N VAKM EKEI E  +NK L LA+S+V C+VDNAKCI+KI+KE GSFS+Y+WGY+N+
Sbjct: 181 FDANTVAKMEEKEIMETASNKALSLADSRVMCVVDNAKCIMKIVKECGSFSSYIWGYVNH 240

Query: 717 KPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRF 896
           KP+INRYRYP++VPLRSPKAEA+SKDL +RG RFVGPVIV++FMQAAG+T+DHLVDC+R 
Sbjct: 241 KPIINRYRYPRNVPLRSPKAEALSKDLVKRGFRFVGPVIVHSFMQAAGLTIDHLVDCYRH 300

Query: 897 SECVSMADKSMGAWFH 944
           SECVS+A++    W H
Sbjct: 301 SECVSLAER---PWRH 313


>ref|NP_001240008.1| uncharacterized protein LOC100813637 [Glycine max]
           gi|255645793|gb|ACU23388.1| unknown [Glycine max]
          Length = 314

 Score =  335 bits (858), Expect = 3e-89
 Identities = 169/316 (53%), Positives = 224/316 (70%), Gaps = 8/316 (2%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKP-SQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXX 197
           MSK NVR+  LE+ ++++E +K  + SF +++LKKVYP+G+QK                 
Sbjct: 1   MSKTNVRRHALEKCRSVKETQKILNHSFFTRNLKKVYPIGLQKSTSSLSLSSISLSLSQN 60

Query: 198 X-PNGPLSLVNRRSTGTTLSLRQLGPPERKEKSVSVVNVV------QDCLDSDDGSFKRC 356
              +     +       +L+LR + P ER+E +++  N        Q    ++ G  KRC
Sbjct: 61  SNDSSQADSLTPLDEKISLALRLISPRERREPTIAASNKPLQQQHQQPPHTTEPGELKRC 120

Query: 357 HWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAK 536
           +WITK  D+ Y+ FHDE WGVP YDDN+LFELLA SG+LM + WTEILK+KE  RE FA 
Sbjct: 121 NWITKSCDKAYIEFHDECWGVPAYDDNKLFELLAMSGLLMDYNWTEILKRKETLREVFAG 180

Query: 537 FDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNY 716
           FD N VAKM EKEI EI +NK L LA+S+V CIVDNAKC++KI+KE GSFS+Y+WGY+N+
Sbjct: 181 FDANTVAKMKEKEIMEIASNKALSLADSRVMCIVDNAKCVMKIVKECGSFSSYIWGYVNH 240

Query: 717 KPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRF 896
           KP+I+RYRYP++VPLRSPKAEA+SKDL +RG RFVGPVIV++FMQAAG+T+DHLVDC+R 
Sbjct: 241 KPIISRYRYPRNVPLRSPKAEALSKDLVKRGFRFVGPVIVHSFMQAAGLTIDHLVDCYRH 300

Query: 897 SECVSMADKSMGAWFH 944
           SECVS+A++    W H
Sbjct: 301 SECVSLAER---PWRH 313


>ref|XP_002890014.1| methyladenine glycosylase family protein [Arabidopsis lyrata subsp.
           lyrata] gi|297335856|gb|EFH66273.1| methyladenine
           glycosylase family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 310

 Score =  334 bits (856), Expect = 4e-89
 Identities = 164/305 (53%), Positives = 220/305 (72%), Gaps = 3/305 (0%)
 Frame = +3

Query: 39  RKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXXPNGPL 215
           R++++E+SK +REKE K + +F +KHLK++YP+ +Q+                      +
Sbjct: 8   RREIVEKSKNVREKETKQNSNFFAKHLKRIYPITLQRSTSSSFSISSISLSLSQNSTDSV 67

Query: 216 SLVNRRSTGTTLSLRQ--LGPPERKEKSVSVVNVVQDCLDSDDGSFKRCHWITKKSDEVY 389
           S  +  +    +SL    +  P R+E  V      Q C D +    KRC+WITKKSDEVY
Sbjct: 68  STDSNSTLEQKISLALGLISSPYRRETFVPKSIPQQLCQDFNSDEPKRCNWITKKSDEVY 127

Query: 390 VVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVVAKMSE 569
           V FHD+ WGVP YDDN LFELLA SGMLM + WTEI+K+KE +REAF +FDPN+VAKM E
Sbjct: 128 VTFHDQQWGVPAYDDNLLFELLAMSGMLMDYNWTEIIKRKELFREAFCEFDPNLVAKMGE 187

Query: 570 KEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINRYRYPK 749
           K+I EI +NK + L ES+VRCIVDNAKCI K++KEFGSFS+++WG+M+YKP+IN+++Y +
Sbjct: 188 KDITEIASNKAIMLQESRVRCIVDNAKCITKVVKEFGSFSSFIWGFMDYKPIINKFKYSR 247

Query: 750 SVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSMADKSM 929
           +VPLRSPKAE ISKD+ +RG RFVGPVIV++FMQAAG+T+DHLVDCFR  +CVS+A++  
Sbjct: 248 NVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAER-- 305

Query: 930 GAWFH 944
             W H
Sbjct: 306 -PWRH 309


>ref|XP_006304034.1| hypothetical protein CARUB_v10009802mg [Capsella rubella]
           gi|482572745|gb|EOA36932.1| hypothetical protein
           CARUB_v10009802mg [Capsella rubella]
          Length = 314

 Score =  333 bits (854), Expect = 7e-89
 Identities = 164/309 (53%), Positives = 222/309 (71%), Gaps = 7/309 (2%)
 Frame = +3

Query: 39  RKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXXPNGPL 215
           R++++E+SK++REKE K + +F +KHLK++YP+ +Q+                      +
Sbjct: 8   RREIVEKSKSVREKETKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 216 SLVNRRSTGTTLSLRQ--LGPPERKE----KSVSVVNVVQDCLDSDDGSFKRCHWITKKS 377
           S  +  +    +SL    +  P R+E    KS+      + C D +    KRC+WITKKS
Sbjct: 68  STDSNSTLEQKISLALGLISSPRRRETFVPKSIPQQLEQELCQDFNSDEPKRCNWITKKS 127

Query: 378 DEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVVA 557
           DEVYV FHD+ WGVPVYDDN LFE LA SGMLM + WTEILK+KE +RE F +FDPNVVA
Sbjct: 128 DEVYVKFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKELFREVFCEFDPNVVA 187

Query: 558 KMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINRY 737
            M EKEI EI +NK + L ES+VRC+VDNAKCI+K++ EFGSFS+++WG+M+YKP+IN++
Sbjct: 188 NMGEKEITEIASNKAIMLQESRVRCVVDNAKCIIKVVNEFGSFSSFMWGFMDYKPIINKF 247

Query: 738 RYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSMA 917
           +YP++VPLRSPKAE ISKD+ +RG RFVGPVIV++FMQAAG+T+DHLVDCFR  +CVS+A
Sbjct: 248 KYPRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLA 307

Query: 918 DKSMGAWFH 944
           ++    W H
Sbjct: 308 ER---PWRH 313


>ref|XP_004509952.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like
           [Cicer arietinum]
          Length = 317

 Score =  333 bits (853), Expect = 1e-88
 Identities = 171/319 (53%), Positives = 225/319 (70%), Gaps = 11/319 (3%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKP-SQSFI-SKHLKKVYPLGIQKXXXXXXXXXXXXXXXX 194
           MSK NVR+Q LER+ + ++ +K  +QSF  +K LKKVYP+G+QK                
Sbjct: 1   MSKPNVRRQALERNTSFKDTQKILNQSFFQNKSLKKVYPIGLQKSTSSSSLSLSSISLSL 60

Query: 195 XXPNGPLSLVNR-----RSTGTTLSLRQLGPPERKEKSVSVVNVVQD----CLDSDDGSF 347
              +   S  +       +    L L    P ER+E  V+   + Q      ++++ G F
Sbjct: 61  SQNSNDSSQADSLTPLDENISLALRLISASPHERREHGVANKTIHQQQPSLLVNTEPGEF 120

Query: 348 KRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREA 527
           KRC+WITK SD+VY+ FHDE WGVP YDDN+LFELLA SG+LM + WTEI+K+KE  RE 
Sbjct: 121 KRCNWITKNSDKVYIEFHDECWGVPAYDDNKLFELLAMSGLLMDYNWTEIIKRKETLREV 180

Query: 528 FAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGY 707
           FA FDP  VAKM EKEI EI +NK L LA+S+V CIVDNAKCI+KI++E GSFS+Y+WG+
Sbjct: 181 FAGFDPYTVAKMEEKEIIEIASNKALSLADSRVMCIVDNAKCIMKIVRECGSFSSYIWGF 240

Query: 708 MNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDC 887
           +N+KP+IN+Y+YP+SVPLRSPKAEA+SKD+ +RG RFVGPVIV++FMQAAG+T+DHLVDC
Sbjct: 241 VNHKPIINKYKYPRSVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 300

Query: 888 FRFSECVSMADKSMGAWFH 944
           +R  ECVS+A++    W H
Sbjct: 301 YRHCECVSLAER---PWRH 316


>ref|NP_973818.1| putative 3-methyladenine glycosylase I [Arabidopsis thaliana]
           gi|334182561|ref|NP_001184988.1| putative
           3-methyladenine glycosylase I [Arabidopsis thaliana]
           gi|332190930|gb|AEE29051.1| putative 3-methyladenine
           glycosylase I [Arabidopsis thaliana]
           gi|332190931|gb|AEE29052.1| putative 3-methyladenine
           glycosylase I [Arabidopsis thaliana]
          Length = 311

 Score =  331 bits (848), Expect = 4e-88
 Identities = 166/306 (54%), Positives = 220/306 (71%), Gaps = 4/306 (1%)
 Frame = +3

Query: 39  RKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXXPNGPL 215
           RK+++E+SK++REKE K + +F +KHLK++YP+ +Q+                      +
Sbjct: 8   RKEIVEKSKSVREKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 216 SLVNRRSTGTTLSLRQ--LGPPERKEKSVSVVNVVQDCLDSDDGSF-KRCHWITKKSDEV 386
           S  +  +    +SL    +  P R+E  V      Q C D +     KRC+WITKKSDEV
Sbjct: 68  STDSNSTLEQKISLALGLISSPHRREIFVPKSIPQQLCQDFNSSDEPKRCNWITKKSDEV 127

Query: 387 YVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVVAKMS 566
           YV+FHD+ WGVPVYDDN LFE LA SGMLM + WTEILK+KE +REAF +FDPN VAKM 
Sbjct: 128 YVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKMG 187

Query: 567 EKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINRYRYP 746
           EKEI EI +NK + L ES+VRCIVDNAKCI K++ EFGSFS+++WG+M+YKP+IN+++Y 
Sbjct: 188 EKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFKYS 247

Query: 747 KSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSMADKS 926
           ++VPLRSPKAE ISKD+ +RG RFVGPVIV++FMQAAG+T+DHLVDCFR  +CVS+A++ 
Sbjct: 248 RNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAER- 306

Query: 927 MGAWFH 944
              W H
Sbjct: 307 --PWRH 310


>gb|AFK34294.1| unknown [Lotus japonicus]
          Length = 308

 Score =  330 bits (845), Expect = 8e-88
 Identities = 164/311 (52%), Positives = 218/311 (70%), Gaps = 3/311 (0%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKP-SQSFISKHLKKVYPLGIQK--XXXXXXXXXXXXXXX 191
           MSK NVR+  LE+  TL++ +K  +QSF  K LKKVYP+G+QK                 
Sbjct: 1   MSKSNVRRHALEKGMTLKDAQKILNQSFFPKSLKKVYPVGLQKSTSSLSLSSLSLSLSQN 60

Query: 192 XXXPNGPLSLVNRRSTGTTLSLRQLGPPERKEKSVSVVNVVQDCLDSDDGSFKRCHWITK 371
               +     +       +L+LR +    R+ +  +     Q  L+++ G  KRC+W TK
Sbjct: 61  SNDSSSQADSLTPLDEDISLALRLISVSPRQRREPTAAKTAQQ-LNTEPGELKRCNWATK 119

Query: 372 KSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNV 551
            SD+ Y+ FHDE WGVP YDDN+LFELLA SG+LM + WTEIL++KE  RE FA+FDP  
Sbjct: 120 NSDKAYIEFHDECWGVPAYDDNKLFELLALSGLLMDYNWTEILRRKETLREVFAEFDPYT 179

Query: 552 VAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMIN 731
           VAKM EKEI EI +NK L LAES+V CI DNAKCI+KI++E GSFS+Y+WG++N+KP+IN
Sbjct: 180 VAKMEEKEIMEIASNKALSLAESRVMCIADNAKCIMKIIRECGSFSSYIWGFVNHKPIIN 239

Query: 732 RYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVS 911
           RY+YP++VPLRSPKAEA+SKD+ +RG RFVGPVIV++F+QAAG+T+DHLVDC+R  ECVS
Sbjct: 240 RYKYPRNVPLRSPKAEALSKDMVKRGFRFVGPVIVHSFIQAAGLTIDHLVDCYRHDECVS 299

Query: 912 MADKSMGAWFH 944
           +A++    W H
Sbjct: 300 LAER---PWRH 307


>ref|XP_007153437.1| hypothetical protein PHAVU_003G035100g [Phaseolus vulgaris]
           gi|561026791|gb|ESW25431.1| hypothetical protein
           PHAVU_003G035100g [Phaseolus vulgaris]
          Length = 314

 Score =  328 bits (840), Expect = 3e-87
 Identities = 167/316 (52%), Positives = 220/316 (69%), Gaps = 8/316 (2%)
 Frame = +3

Query: 21  MSKGNVRKQVLERSKTLREKEKP-SQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXX 197
           MSK NVR+  LE+ +T++E +K  + +F +++LKKVYP+G+ K                 
Sbjct: 1   MSKTNVRRHALEKCRTVKETQKTVNHNFFTRNLKKVYPIGLHKSTSSSSLSLSSISLSLS 60

Query: 198 XPNGPLSLVNRRST---GTTLSLRQLGPPERKEKSVSVVNVVQD----CLDSDDGSFKRC 356
             +   S  +  +      +L+LR + P   +E S +   +          ++ G FKRC
Sbjct: 61  QNSNDSSQADSLTPLDDKISLALRFISPRHAREPSTASKPLHHHQPPTSPPTEPGEFKRC 120

Query: 357 HWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAK 536
           +WITK SD  Y+ FHDE WG+P YDDN+LFELLA SG+LM + WTEILK+KE  RE FA 
Sbjct: 121 NWITKNSDNAYIEFHDECWGIPAYDDNKLFELLAMSGLLMDYNWTEILKRKETLREVFAG 180

Query: 537 FDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNY 716
           FD N VAKM EKEI EI +NK L LA+S+V CIVDNAKCI KI+KE GSFS+Y+WGY+N+
Sbjct: 181 FDANTVAKMEEKEIVEIASNKALSLADSRVMCIVDNAKCITKIVKECGSFSSYIWGYVNH 240

Query: 717 KPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRF 896
           KP+INRY+YP++VPLRSPKAE +SKDL +RG RFVGPVIV++FMQAAG+T+DHLVDC+R 
Sbjct: 241 KPIINRYKYPRNVPLRSPKAEILSKDLVKRGFRFVGPVIVHSFMQAAGLTIDHLVDCYRH 300

Query: 897 SECVSMADKSMGAWFH 944
           SECVS+A++    W H
Sbjct: 301 SECVSLAER---PWRH 313


>ref|XP_006417105.1| hypothetical protein EUTSA_v10008250mg [Eutrema salsugineum]
           gi|557094876|gb|ESQ35458.1| hypothetical protein
           EUTSA_v10008250mg [Eutrema salsugineum]
          Length = 314

 Score =  325 bits (833), Expect = 2e-86
 Identities = 163/309 (52%), Positives = 218/309 (70%), Gaps = 7/309 (2%)
 Frame = +3

Query: 39  RKQVLERSKTLREKE-KPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXXXXXXPNGPL 215
           R+ +LE+S ++REKE K + +F +KHLK++YP+ +Q+                       
Sbjct: 8   RRVILEKSTSVREKETKQNSNFFAKHLKRIYPIALQRSNSSSFSLSSISLSLSQNSTDSF 67

Query: 216 SL--VNRRSTGTTLSLRQLGPPERKE----KSVSVVNVVQDCLDSDDGSFKRCHWITKKS 377
           +    +      +L+L  +  P R+E    KS+      Q   D +    KRC+WITKKS
Sbjct: 68  ATDSTSPLEQRISLALGLISSPRRRETFVPKSIPQQQEQQLYQDFNSDEPKRCNWITKKS 127

Query: 378 DEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKKKEQYREAFAKFDPNVVA 557
           DEVYV FHD+ WGVPVYDDN LFE LA SGMLM + WTEILK+KE +REAF +FDPN+VA
Sbjct: 128 DEVYVTFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKELFREAFCEFDPNLVA 187

Query: 558 KMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSFSNYLWGYMNYKPMINRY 737
           KM EKEI EI +NK + L ES+VRCIV+NAKC +K++KEFGSFS+++WG+M+YKP+IN++
Sbjct: 188 KMGEKEITEIASNKAIMLQESRVRCIVENAKCTMKVVKEFGSFSSFIWGFMDYKPIINKF 247

Query: 738 RYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMTMDHLVDCFRFSECVSMA 917
           +Y ++VPLRSPKAE ISKD+ +RG RFVGPVIV++FMQAAG+T DHLVDCFR  +CVS+A
Sbjct: 248 KYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTNDHLVDCFRHGDCVSLA 307

Query: 918 DKSMGAWFH 944
           ++    W H
Sbjct: 308 ER---PWRH 313


>ref|XP_007045735.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
           gi|508709670|gb|EOY01567.1| DNA glycosylase superfamily
           protein isoform 2 [Theobroma cacao]
          Length = 303

 Score =  317 bits (811), Expect = 7e-84
 Identities = 170/326 (52%), Positives = 222/326 (68%), Gaps = 14/326 (4%)
 Frame = +3

Query: 9   LFSSMSKGNVRKQVLERSKTLREKEKPSQSFISKHLKKVYPLGIQKXXXXXXXXXXXXXX 188
           +FSSMSK NVR+ +LE++++ +EKEKP+QS +SKHLKK+YP+G+Q+              
Sbjct: 2   IFSSMSKANVRRHILEKNRSPKEKEKPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSL 61

Query: 189 XXXXPNGPLSLVNRRSTGT----TLSLRQLGPP-ERKEKSVSVVNVVQD---------CL 326
                +   SL +  ST      +L+L  + P  ER+E  V VV  VQ            
Sbjct: 62  SQNSNDS--SLTDHSSTPLEQKISLALSLIAPHHERREFVVPVVKSVQHHHHQQQQQPSQ 119

Query: 327 DSDDGSFKRCHWITKKSDEVYVVFHDEHWGVPVYDDNQLFELLAFSGMLMYHLWTEILKK 506
           D   G  +RC+W+TK S                    QLFELLA SGMLM + WTEILK+
Sbjct: 120 DPGSGELRRCNWVTKNS--------------------QLFELLALSGMLMDYNWTEILKR 159

Query: 507 KEQYREAFAKFDPNVVAKMSEKEIKEIITNKTLGLAESQVRCIVDNAKCILKIMKEFGSF 686
           KE YREAF+ FDP +VAKM +KEI EI ++K + LAES+VRCIVDNAKCILKI++E+GSF
Sbjct: 160 KELYREAFSGFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSF 219

Query: 687 SNYLWGYMNYKPMINRYRYPKSVPLRSPKAEAISKDLQRRGCRFVGPVIVYAFMQAAGMT 866
           S+++WGY+NYKP INRY+YP++VPLR+PKAEAIS+DL +RG RFVGPVIV +FMQAAG+T
Sbjct: 220 SSFMWGYVNYKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLT 279

Query: 867 MDHLVDCFRFSECVSMADKSMGAWFH 944
           +DHLVDCFR+SECV +A++    W H
Sbjct: 280 IDHLVDCFRYSECVGLAER---PWRH 302

