BLASTX nr result

ID: Lithospermum23_contig00047433

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00047433
         (1149 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KZM86605.1 hypothetical protein DCAR_023739 [Daucus carota subsp...   243   1e-74
XP_010091642.1 Putative Glutamine amidotransferase [Morus notabi...   236   5e-72
OMO72221.1 Methyladenine glycosylase [Corchorus olitorius]            232   3e-70
KDO70260.1 hypothetical protein CISIN_1g021082mg [Citrus sinensis]    230   1e-69
XP_006484265.1 PREDICTED: DNA-3-methyladenine glycosylase isofor...   229   5e-69
KDO70261.1 hypothetical protein CISIN_1g021082mg [Citrus sinensis]    222   2e-66
KZM85888.1 hypothetical protein DCAR_026690 [Daucus carota subsp...   219   1e-65
AAF81290.1 Contains similarity to a putative DNA-3-methyladenine...   206   1e-60
OAY47429.1 hypothetical protein MANES_06G078800 [Manihot esculenta]   200   5e-58
XP_018858991.1 PREDICTED: uncharacterized protein LOC109020914 [...   199   9e-58
XP_007045734.1 PREDICTED: DNA-3-methyladenine glycosylase [Theob...   199   2e-57
KHN19385.1 Putative GMP synthase [glutamine-hydrolyzing] [Glycin...   197   7e-57
XP_019149923.1 PREDICTED: uncharacterized protein LOC109146729 [...   194   2e-55
XP_012080476.1 PREDICTED: uncharacterized protein LOC105640691 [...   192   6e-55
OMO93631.1 Methyladenine glycosylase [Corchorus capsularis]           192   8e-55
XP_006379720.1 hypothetical protein POPTR_0008s11150g [Populus t...   186   1e-52
XP_006484266.1 PREDICTED: DNA-3-methyladenine glycosylase isofor...   184   6e-52
KVH97537.1 DNA glycosylase [Cynara cardunculus var. scolymus]         184   1e-51
XP_011021090.1 PREDICTED: uncharacterized protein LOC105123272 [...   183   2e-51
XP_012847546.1 PREDICTED: uncharacterized protein LOC105967493 [...   182   3e-51

>KZM86605.1 hypothetical protein DCAR_023739 [Daucus carota subsp. sativus]
          Length = 296

 Score =  243 bits (619), Expect = 1e-74
 Identities = 138/286 (48%), Positives = 164/286 (57%), Gaps = 45/286 (15%)
 Frame = -3

Query: 724 PKRNLSEKNIKSPSREKDN-KPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXX 548
           P+R + E   +SPS+EKD  KP  +LLS+HLKKVYPV + K                   
Sbjct: 6   PRRPVKEN--RSPSKEKDGQKPNYNLLSKHLKKVYPVGVYKASTSPLSLSSLSLSLSQNS 63

Query: 547 XXXXXXSPRTLEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFK 368
                 S  TL+QKI  A+RLI   E+++                   +    H+ +G  
Sbjct: 64  SDSLTDSSSTLDQKIAAAIRLIIPREKRDVSTVARY----------MQKLSPSHNGEGLN 113

Query: 367 RCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELF 188
           RCNWIT NSDK+YVQFHDECWG+PVYDD QLFELLA+SGMLMDFNWTEILKRK+  RE F
Sbjct: 114 RCNWITANSDKVYVQFHDECWGIPVYDDNQLFELLAMSGMLMDFNWTEILKRKDLFREAF 173

Query: 187 AGFNPAT--------------------------------------------VINKFRHPR 140
            GF+P+T                                            VIN+FR+PR
Sbjct: 174 VGFDPSTVAKMGEKEIMEISSNKAIMLAESRIVKEYGSFSGYIWGYVDYKPVINRFRYPR 233

Query: 139 NVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           NVPLRS KAEA+SKD+ K GFRFVGPVIVYSFMQAAG+TIDHLVDC
Sbjct: 234 NVPLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGMTIDHLVDC 279


>XP_010091642.1 Putative Glutamine amidotransferase [Morus notabilis] EXB44908.1
           Putative Glutamine amidotransferase [Morus notabilis]
          Length = 298

 Score =  236 bits (601), Expect = 5e-72
 Identities = 129/284 (45%), Positives = 165/284 (58%), Gaps = 44/284 (15%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R + E+N      EK +K    LLS+HLK++YP+ LQK+                    
Sbjct: 7   RRPVLERNGSLKENEKKDKTSPGLLSKHLKRIYPIGLQKSNSSPSLSSLSLSLSENSNDS 66

Query: 541 XXXXSPRTLEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKRC 362
                   L+ KI+LALRL+    RKE+P   +         +++ +S   ++ +  +RC
Sbjct: 67  SLADFGSPLDHKISLALRLVAPPRRKESPAPKN---------VQQQQSQDANNPEELRRC 117

Query: 361 NWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFAG 182
           NWITKNSDK+YV FHDECWGVPVYDD QLFELLA+SGMLMD+NWTEILKR+E  RE F+G
Sbjct: 118 NWITKNSDKVYVAFHDECWGVPVYDDNQLFELLAMSGMLMDYNWTEILKRRELFREAFSG 177

Query: 181 FNPA--------------------------------------------TVINKFRHPRNV 134
           F+P+                                             VIN++R+PRNV
Sbjct: 178 FDPSKVAKMGEKEITEISSNKAIMLAESRVVREFGSFSNYMWSYVDHKPVINRYRYPRNV 237

Query: 133 PLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           PLRS KAEA+SKD+ K GFRFVGPVIV+SFMQAAGLTIDHLV+C
Sbjct: 238 PLRSPKAEAISKDLLKRGFRFVGPVIVHSFMQAAGLTIDHLVNC 281


>OMO72221.1 Methyladenine glycosylase [Corchorus olitorius]
          Length = 314

 Score =  232 bits (591), Expect = 3e-70
 Identities = 131/287 (45%), Positives = 170/287 (59%), Gaps = 47/287 (16%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EK  +SP++EK+     ++LS+HLKK+YP+ LQ++                    
Sbjct: 12  RRHIMEKK-RSPTKEKEKPAAQNVLSKHLKKIYPIGLQRSSSSFSLSSLSLSLSQNSNDS 70

Query: 541 XXXXSPRT-LEQKITLALRLIKSNE-RKENPNSPS-KEDLGKNNWIKKNRSDIDHDEQGF 371
                  T LEQKI+LAL LI  +  R++   +P  K  +  ++  ++ +   D      
Sbjct: 71  SLTDHSTTPLEQKISLALSLISPHHVRRDFAAAPVVKSHVHHHHQQQQQQQSQDPGNGEV 130

Query: 370 KRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLREL 191
           +RCNW+TKNSDKLY+ FHDE WGVPVYDD QLFELLALSG+LMD+NWTEILKRKEQ RE 
Sbjct: 131 RRCNWVTKNSDKLYISFHDEQWGVPVYDDNQLFELLALSGLLMDYNWTEILKRKEQYREA 190

Query: 190 FAGFNPATV--------------------------------------------INKFRHP 143
           F+GF+P  V                                            INK+++P
Sbjct: 191 FSGFDPEIVAKMGDKEINEISSNKAIMLPESRIVREYGSFSSFMWGYVNYKPTINKYKYP 250

Query: 142 RNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           RNVPLR+ KAEA+S+D+ K GFRFVGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 251 RNVPLRTPKAEAISRDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 297


>KDO70260.1 hypothetical protein CISIN_1g021082mg [Citrus sinensis]
          Length = 317

 Score =  230 bits (587), Expect = 1e-69
 Identities = 139/298 (46%), Positives = 170/298 (57%), Gaps = 58/298 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP +EK+ KP  SLLS+HLKKVYP+ L ++                    
Sbjct: 7   RRHILEKN-RSP-KEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQNSNDS 64

Query: 541 XXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
               +  + LEQ+I+LALRLI   ER+E   + + +   +    ++   D    E   KR
Sbjct: 65  SVTDNSNSPLEQRISLALRLITPPERREVTVAKNAQPQQQQQQQQQQSQDSCCGE--LKR 122

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFA 185
           CNWITKNSDK+YV FHDECWGVPVYDD QLFELLALSGMLMD+NWTEILKRKE  RE F 
Sbjct: 123 CNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAFG 182

Query: 184 GFNPATV-------------------------------------INKF------------ 152
           GF+P +V                                     +N+F            
Sbjct: 183 GFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSFMWGYVN 242

Query: 151 --------RHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
                   R+PRNVPLRS KAEA+S+D+ K GFR VGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 243 FKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDHLVDC 300


>XP_006484265.1 PREDICTED: DNA-3-methyladenine glycosylase isoform X1 [Citrus
           sinensis]
          Length = 317

 Score =  229 bits (583), Expect = 5e-69
 Identities = 138/298 (46%), Positives = 170/298 (57%), Gaps = 58/298 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP +EK+ KP  SLLS+HLKKVYP+ L ++                    
Sbjct: 7   RRHILEKN-RSP-KEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQNSNDS 64

Query: 541 XXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
               +  + LEQ+I+LALRLI   ER+E   + + +   +    ++   D    E   KR
Sbjct: 65  SVTDNSNSPLEQRISLALRLITPPERREVTVAKNVQPQQQQQQQQQQSQDSCCGE--LKR 122

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFA 185
           CNWITKNSD++YV FHDECWGVPVYDD QLFELLALSGMLMD+NWTEILKRKE  RE F 
Sbjct: 123 CNWITKNSDRVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAFG 182

Query: 184 GFNPATV-------------------------------------INKF------------ 152
           GF+P +V                                     +N+F            
Sbjct: 183 GFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSFMWGYVN 242

Query: 151 --------RHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
                   R+PRNVPLRS KAEA+S+D+ K GFR VGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 243 FKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDHLVDC 300


>KDO70261.1 hypothetical protein CISIN_1g021082mg [Citrus sinensis]
          Length = 316

 Score =  222 bits (566), Expect = 2e-66
 Identities = 137/298 (45%), Positives = 169/298 (56%), Gaps = 58/298 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP +EK+ KP  SLLS+HLKKVYP+ L ++                    
Sbjct: 7   RRHILEKN-RSP-KEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQNSNDS 64

Query: 541 XXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
               +  + LEQ+I+LALRLI   ER+E   + + +   +    ++   D    E   KR
Sbjct: 65  SVTDNSNSPLEQRISLALRLITPPERREVTVAKNAQPQQQQQQQQQQSQDSCCGE--LKR 122

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFA 185
           CNWITKN+ K+YV FHDECWGVPVYDD QLFELLALSGMLMD+NWTEILKRKE  RE F 
Sbjct: 123 CNWITKNN-KVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAFG 181

Query: 184 GFNPATV-------------------------------------INKF------------ 152
           GF+P +V                                     +N+F            
Sbjct: 182 GFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSFMWGYVN 241

Query: 151 --------RHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
                   R+PRNVPLRS KAEA+S+D+ K GFR VGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 242 FKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDHLVDC 299


>KZM85888.1 hypothetical protein DCAR_026690 [Daucus carota subsp. sativus]
          Length = 298

 Score =  219 bits (559), Expect = 1e-65
 Identities = 131/290 (45%), Positives = 164/290 (56%), Gaps = 53/290 (18%)
 Frame = -3

Query: 712 LSEKNIKSPSREK--------DNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXX 557
           +S+++++ P+R++        D KP  +LLS+HLKKVYP  +                  
Sbjct: 1   MSKEDVRRPARKESSCLKEKDDQKPNYNLLSKHLKKVYPDKVYNASTSPLSLSSLSLSLS 60

Query: 556 XXXXXXXXXSPRTLEQKITLALRLIKSNERKENPN-SPSKEDLGKNNWIKKNRSDIDHDE 380
                    SP TL QKI  A+RLI   E++E P  +   ++L   +    N  D +   
Sbjct: 61  QNSSDSTDSSP-TLNQKIAAAVRLIAPREKRELPTVARYMQNLSPGH----NTGDCE--- 112

Query: 379 QGFKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQL 200
            G  RCNWIT NSD++YVQFHDE WG+P YDD +LFELLA+SGMLMDFNWTEIL+RKE  
Sbjct: 113 -GLNRCNWITANSDRVYVQFHDERWGLPEYDDNKLFELLAMSGMLMDFNWTEILRRKELF 171

Query: 199 RELFAGFNPAT--------------------------------------------VINKF 152
           RE FAGF+P T                                            VIN+F
Sbjct: 172 REAFAGFDPQTVARMGEKEIMEISTNKAIMLAESRIDTEYGSFGGYLWGYMDYKPVINRF 231

Query: 151 RHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           R+ RNVPLRS KAEA+SKD+ K GFRFVGPVIVYSFMQAAG++IDHLVDC
Sbjct: 232 RYARNVPLRSPKAEAISKDLVKRGFRFVGPVIVYSFMQAAGMSIDHLVDC 281


>AAF81290.1 Contains similarity to a putative DNA-3-methyladenine glycosylase I
           F9E10.6 gi|6646756 from Arabidopsis thaliana BAC F9E10
           gb|AC013258 [Arabidopsis thaliana]
          Length = 298

 Score =  206 bits (525), Expect = 1e-60
 Identities = 131/288 (45%), Positives = 157/288 (54%), Gaps = 47/288 (16%)
 Frame = -3

Query: 724 PKRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQK-TCXXXXXXXXXXXXXXXXX 548
           PKR    +  KS  REK+ K  ++  ++HLK++YP++LQ+ T                  
Sbjct: 6   PKRKEIVEKSKSV-REKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNST 64

Query: 547 XXXXXXSPRTLEQKITLALRLIKSNERKEN--PNSPSKEDLGKNNWIKKNRSDIDHDEQG 374
                 S  TLEQKI+LAL LI S  R+E   P S  ++     N           DE  
Sbjct: 65  DSVSTDSNSTLEQKISLALGLISSPHRREIFVPKSIPQQLCQDFN---------SSDEP- 114

Query: 373 FKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRE 194
            KRCNWITK SD++YV FHD+ WGVPVYDD  LFE LA+SGMLMD+NWTEILKRKE  RE
Sbjct: 115 -KRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFRE 173

Query: 193 LFAGFNP--------------------------------------------ATVINKFRH 146
            F  F+P                                              +INKF++
Sbjct: 174 AFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVVNEFGSFSSFVWGFMDYKPIINKFKY 233

Query: 145 PRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
            RNVPLRS KAE +SKDM K GFRFVGPVIV+SFMQAAGLTIDHLVDC
Sbjct: 234 SRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 281


>OAY47429.1 hypothetical protein MANES_06G078800 [Manihot esculenta]
          Length = 315

 Score =  200 bits (509), Expect = 5e-58
 Identities = 121/299 (40%), Positives = 155/299 (51%), Gaps = 59/299 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           ++ ++EKNI    +EK +       S+HLKK+YP+ L ++                    
Sbjct: 8   RKQVAEKNIFMNEKEKPSS--QGFFSKHLKKIYPIGLHRSNSSLSLSSVSLSLSQNSNDS 65

Query: 541 XXXXSPRTLEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQ--GFK 368
                   LE KI LALRLI   ER+ENP       + KN  I++ +   + +      K
Sbjct: 66  SLTDYSTPLEHKIALALRLITPLERRENPV------VSKNVQIQQQQQQSNQENTCGELK 119

Query: 367 RCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNW-------------- 230
           RCNWITKNSDK+YV+FHDECWGVPVYDD +LFELLA++GMLMD+NW              
Sbjct: 120 RCNWITKNSDKVYVEFHDECWGVPVYDDNKLFELLAMAGMLMDYNWTEIVKRKQVFREAF 179

Query: 229 ----------------TEILKRKE-------------------QLRELFAGF-------- 179
                           TEI   K                    ++   F  F        
Sbjct: 180 AGFDPNDVAKMGEKEITEIASNKAIMLAESRVRCIIDNAKCIGKIEREFGSFSSYMWGYV 239

Query: 178 NPATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           N    IN+++HPR VPLRS KAEA+S+D+ + GFRFVGPVIV+SFMQAAGLTIDHLVDC
Sbjct: 240 NYKPTINRYKHPRQVPLRSPKAEAISRDLVRRGFRFVGPVIVHSFMQAAGLTIDHLVDC 298


>XP_018858991.1 PREDICTED: uncharacterized protein LOC109020914 [Juglans regia]
          Length = 311

 Score =  199 bits (507), Expect = 9e-58
 Identities = 124/297 (41%), Positives = 156/297 (52%), Gaps = 57/297 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN     +EK   P  S LS++LK++YP+ LQ++                    
Sbjct: 12  RRHVLEKNKVLKEKEK---PAQSSLSKNLKRIYPIGLQRSSSSLSLSSLSLSWSQNSNDS 68

Query: 541 XXXXSPRTLEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKRC 362
               S   L+QKI+LALRLI   ER+E P   + +   +           D      KRC
Sbjct: 69  SLTDSTTPLDQKISLALRLIAPPERREAPVDKNAQQQSQ-----------DTGTGELKRC 117

Query: 361 NWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNW---------------- 230
           NW+TKNSDK+YV FHDECWGVPVYDD QLFELLA+SGMLMD+NW                
Sbjct: 118 NWVTKNSDKVYVAFHDECWGVPVYDDSQLFELLAMSGMLMDYNWTEILKRRELFREAFSG 177

Query: 229 --------------TEILKRK--------------------EQLREL--FAGF-----NP 173
                         T+I   K                    + +RE   F+ F     N 
Sbjct: 178 FDPNVVAKMGEKEITDIASNKAIMLAEGRVRCIVDNSKCILKIVREFGSFSSFMWGYVNH 237

Query: 172 ATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
             VIN++R+PRN+PLR+ KAE +SKD+ KHGFR VGPVIV SFMQAAGLTIDHLVDC
Sbjct: 238 KPVINRYRYPRNIPLRTPKAETLSKDLIKHGFRLVGPVIVCSFMQAAGLTIDHLVDC 294


>XP_007045734.1 PREDICTED: DNA-3-methyladenine glycosylase [Theobroma cacao]
           EOY01566.1 DNA glycosylase superfamily protein isoform 1
           [Theobroma cacao]
          Length = 323

 Score =  199 bits (506), Expect = 2e-57
 Identities = 124/298 (41%), Positives = 160/298 (53%), Gaps = 58/298 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP +EK+ KP  S+LS+HLKK+YP+ LQ++                    
Sbjct: 12  RRHILEKN-RSP-KEKE-KPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSLSQNSNDS 68

Query: 541 XXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
                  T LEQKI+LAL LI  +  +     P  + +  ++  ++ +   D      +R
Sbjct: 69  SLTDHSSTPLEQKISLALSLIAPHHERREFVVPVVKSVQHHHHQQQQQPSQDPGSGELRR 128

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFA 185
           CNW+TKNSDK+YV FHDE WGVPVYDD QLFELLALSGMLMD+NWTEILKRKE  RE F+
Sbjct: 129 CNWVTKNSDKVYVSFHDEQWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELYREAFS 188

Query: 184 GFNPATV-------INKFRHPRNVPLRSAK------------------------------ 116
           GF+P  V       IN+    + + L  ++                              
Sbjct: 189 GFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSFSSFMWGYVN 248

Query: 115 --------------------AEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
                               AEA+S+D+ K GFRFVGPVIV SFMQAAGLTIDHLVDC
Sbjct: 249 YKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLTIDHLVDC 306


>KHN19385.1 Putative GMP synthase [glutamine-hydrolyzing] [Glycine soja]
          Length = 311

 Score =  197 bits (501), Expect = 7e-57
 Identities = 114/273 (41%), Positives = 153/273 (56%), Gaps = 54/273 (19%)
 Frame = -3

Query: 658 NSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXXXXXXSPRTLEQKITLALRLIK 479
           +S  +R+LKKVYP+ LQK+                        S   L++KI+LALRLI 
Sbjct: 26  HSFFTRNLKKVYPIGLQKSTSSLSLSSISLSLSQNSNDSSQADSLTPLDEKISLALRLIS 85

Query: 478 SNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKRCNWITKNSDKLYVQFHDECWGV 299
             ER+E   + S + L + +    + ++        KRCNWITK+ DK Y++FHDECWGV
Sbjct: 86  PRERREPTIAASNKPLQQQHQQPPHTTEPGE----LKRCNWITKSCDKAYIEFHDECWGV 141

Query: 298 PVYDDQQLFELLALSGMLMDFNWTEILK--------------------RKEQLREL---- 191
           P YDD +LFELLA+SG+LMD+NWTEILK                    +++++ E+    
Sbjct: 142 PAYDDNKLFELLAMSGLLMDYNWTEILKRKETLREVFAGFDANTVAKMKEKEIMEIASNK 201

Query: 190 -------------------------FAGF-----NPATVINKFRHPRNVPLRSAKAEAMS 101
                                    F+ +     N   +I+++R+PRNVPLRS KAEA+S
Sbjct: 202 ALSLADSRVMCIVDNAKCIVKECGSFSSYIWGYVNHKPIISRYRYPRNVPLRSPKAEALS 261

Query: 100 KDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           KD+ K GFRFVGPVIV+SFMQAAGLTIDHLVDC
Sbjct: 262 KDLVKRGFRFVGPVIVHSFMQAAGLTIDHLVDC 294


>XP_019149923.1 PREDICTED: uncharacterized protein LOC109146729 [Ipomoea nil]
          Length = 325

 Score =  194 bits (492), Expect = 2e-55
 Identities = 128/300 (42%), Positives = 152/300 (50%), Gaps = 63/300 (21%)
 Frame = -3

Query: 712 LSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXXXXX 533
           L ++   SPSR+++    NS     LKK+YP+ L KTC                      
Sbjct: 13  LEKRTNTSPSRDREKPNNNSC--NFLKKIYPIGLHKTCSPLSLSSLSLSLSQTSNDSSIT 70

Query: 532 XSPRT-LEQKITLALRLIKSNERKE---NPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
            S  T L+QKI  ALRLI   ER+E   N N+  +     +       S    D +   R
Sbjct: 71  DSSVTPLDQKIAFALRLIAPPERREALANRNAARQLQPSPSPTPSPASSVPSDDNEEVNR 130

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMD------------------ 239
           CNWITKNSDK+YVQFHDECWGVPVYDD QLFELLALSGMLMD                  
Sbjct: 131 CNWITKNSDKVYVQFHDECWGVPVYDDHQLFELLALSGMLMDFNWTEILKRRDLFREAFG 190

Query: 238 -FNWTEILKRKEQ-------------------------------LREL---------FAG 182
            FN   + K  E+                               +RE          +  
Sbjct: 191 GFNVNSVAKMGEKEIEEIASNESLMLAEGRVRHIVDNAKCTVKVVREFGSFSSYMWNYVS 250

Query: 181 FNPATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           + P  VIN+FR+PRNVPLRS KAE +SKD+ K GFRFVGPVIVYSFMQAAG+TIDHLVDC
Sbjct: 251 YKP--VINRFRYPRNVPLRSPKAEIISKDLVKRGFRFVGPVIVYSFMQAAGMTIDHLVDC 308


>XP_012080476.1 PREDICTED: uncharacterized protein LOC105640691 [Jatropha curcas]
          Length = 322

 Score =  192 bits (489), Expect = 6e-55
 Identities = 120/299 (40%), Positives = 156/299 (52%), Gaps = 59/299 (19%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNS-LLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXX 545
           K+ + +KNI    +EK+ KP +    S+HLKKVYP+ L ++                   
Sbjct: 9   KQVVEKKNIFM--KEKEIKPSSQGFFSKHLKKVYPIGLNRSNSSLSLSSLSLSLSQNSND 66

Query: 544 XXXXXSPRTLEQKITLALRLIKSNE-RKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFK 368
                    LEQKI+LALRLI     R+E P  P  +++ +    +++    + +     
Sbjct: 67  SSLTDYSTPLEQKISLALRLISPPPARREAPPPPVSKNVQQQQQQQQSMQSQESNGGELT 126

Query: 367 RCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNW-------------- 230
           RCNWITKNSD++YV FHDECWGVPVYDD +LFE+L LSGMLMD+NW              
Sbjct: 127 RCNWITKNSDEVYVAFHDECWGVPVYDDNKLFEVLTLSGMLMDYNWTEILKRRELFREAF 186

Query: 229 ----------------TEILKRKE-------------------QLRELFAGF-------- 179
                           TEI   K                    ++   F  F        
Sbjct: 187 AGFDPKIVAKMGEKEITEIASDKTIMLAETRVRCIADNAKCIVKIEREFGSFSSYMWGYV 246

Query: 178 NPATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
           N   +IN++++PRNVPLR+ KAE +SKD+ K GFRFVGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 247 NYKPMINRYKYPRNVPLRTPKAEIISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 305


>OMO93631.1 Methyladenine glycosylase [Corchorus capsularis]
          Length = 323

 Score =  192 bits (488), Expect = 8e-55
 Identities = 120/301 (39%), Positives = 163/301 (54%), Gaps = 61/301 (20%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP++EK+     ++LS+HLKK+YP+ LQ++                    
Sbjct: 7   RRHIMEKN-RSPTKEKEKPASQNVLSKHLKKIYPIGLQRSSSSFSLSSLSLSLSQNSNDS 65

Query: 541 XXXXSPRT-LEQKITLALRLIKSNE-RKENPNSPSKEDLGKNNWIKKNRSDIDHDEQG-- 374
                  T LEQKI+LAL LI  +  R++   +P  +    ++  ++ +     D     
Sbjct: 66  SLTDHSTTPLEQKISLALSLISPHHVRRDFAAAPVVKSHVHHHQQQQQQQQQSQDPGNGE 125

Query: 373 FKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRE 194
            +RCNW+TKNSDKLY+ FHDE WGVPVYDD QLFELLALSG+LMD+NWTEILKRKEQ RE
Sbjct: 126 VRRCNWVTKNSDKLYISFHDEQWGVPVYDDNQLFELLALSGLLMDYNWTEILKRKEQYRE 185

Query: 193 LFAGFNPATV-------INKFRHPRNVPLRSAK-------AEAMSKDMQKHG-------- 80
            F+GF+P TV       IN+    + + L  ++       A+ + K ++++G        
Sbjct: 186 AFSGFDPETVAKMGDKEINEISSNKAIMLPESRVRCIVDNAKCILKIVREYGSFSSFMWG 245

Query: 79  -----------------------------------FRFVGPVIVYSFMQAAGLTIDHLVD 5
                                              FRFVGPVIVYSFMQAAGLTIDHLVD
Sbjct: 246 YVNYKPTINKYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVD 305

Query: 4   C 2
           C
Sbjct: 306 C 306


>XP_006379720.1 hypothetical protein POPTR_0008s11150g [Populus trichocarpa]
           ERP57517.1 hypothetical protein POPTR_0008s11150g
           [Populus trichocarpa]
          Length = 320

 Score =  186 bits (473), Expect = 1e-52
 Identities = 123/304 (40%), Positives = 160/304 (52%), Gaps = 67/304 (22%)
 Frame = -3

Query: 712 LSEKNIKSPSREKDN-------KPCNS--LLSRHLKKVYPVSLQKTCXXXXXXXXXXXXX 560
           +S+ N++    EK++       KP +S  L ++HLK+VYP+ L ++              
Sbjct: 1   MSKANVRKQILEKNSIFIKEKEKPLSSQGLFTKHLKRVYPIGLHRSSSSLSLSSVSLSLS 60

Query: 559 XXXXXXXXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHD 383
                        T LEQKI+LALRLI  +ER+E P + + +   +    ++ + D   +
Sbjct: 61  QNSNDSSLTDCSATPLEQKISLALRLISPSERREVPVARNFQTRQQRQQ-QQQKQDQGSN 119

Query: 382 EQGFKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMD------------ 239
           +   KRCNWITKNSDK+YV FHDE WGVPVYDD QLFELLALSGMLMD            
Sbjct: 120 DGELKRCNWITKNSDKVYVAFHDEFWGVPVYDDIQLFELLALSGMLMDYNWTEILKRKEL 179

Query: 238 -------FNWTEILKRKEQ-------------------------------LRE------- 194
                  FN   + K+ E+                                RE       
Sbjct: 180 FREAFDGFNPNIVAKKGEKEIMEIASNKAIMLAESRVRCIVDNARCLLKIAREFGSFSNY 239

Query: 193 LFAGFNPATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDH 14
           ++   N    IN++++PRNV LRS KAEA+SKD+ K GFRFVGPVIVYSFMQAAGLTIDH
Sbjct: 240 MWGNVNFKPTINRYKYPRNVQLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDH 299

Query: 13  LVDC 2
           LVDC
Sbjct: 300 LVDC 303


>XP_006484266.1 PREDICTED: DNA-3-methyladenine glycosylase isoform X2 [Citrus
           sinensis]
          Length = 291

 Score =  184 bits (466), Expect = 6e-52
 Identities = 115/275 (41%), Positives = 148/275 (53%), Gaps = 58/275 (21%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXXXX 542
           +R++ EKN +SP +EK+ KP  SLLS+HLKKVYP+ L ++                    
Sbjct: 7   RRHILEKN-RSP-KEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQNSNDS 64

Query: 541 XXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGFKR 365
               +  + LEQ+I+LALRLI   ER+E   + + +   +    ++   D    E   KR
Sbjct: 65  SVTDNSNSPLEQRISLALRLITPPERREVTVAKNVQPQQQQQQQQQQSQDSCCGE--LKR 122

Query: 364 CNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLRELFA 185
           CNWITKNSD++YV FHDECWGVPVYDD QLFELLALSGMLMD+NWTEILKRKE  RE F 
Sbjct: 123 CNWITKNSDRVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAFG 182

Query: 184 GFNPATV-------------------------------------INKF------------ 152
           GF+P +V                                     +N+F            
Sbjct: 183 GFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSFMWGYVN 242

Query: 151 --------RHPRNVPLRSAKAEAMSKDMQKHGFRF 71
                   R+PRNVPLRS KAEA+S+D+ K GFR+
Sbjct: 243 FKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRY 277


>KVH97537.1 DNA glycosylase [Cynara cardunculus var. scolymus]
          Length = 324

 Score =  184 bits (467), Expect = 1e-51
 Identities = 122/303 (40%), Positives = 154/303 (50%), Gaps = 63/303 (20%)
 Frame = -3

Query: 721 KRNLSEK-----NIKSPSR-EKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXX 560
           +RN+ EK     +  SP+R  K+     SLLS+HLKKVYPV +QKT              
Sbjct: 8   RRNVIEKKNSSSSSSSPARGNKEKLSTQSLLSKHLKKVYPVGIQKTSSLLSLSSLSLTLS 67

Query: 559 XXXXXXXXXXSPRTLEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDE 380
                     S  TLEQ I+ AL LI     +  P  P  +    +  + +   D  + E
Sbjct: 68  HNSSGSFTDSSS-TLEQTISSALHLIAPTPARREP--PVAKTSAVHAPVPQPSLDPTNCE 124

Query: 379 QGFKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQL 200
           +G +RCNWITK+SDK+YVQFHDECWGVPVYDD QLFELL+L GMLMD+NWTEILKRK+  
Sbjct: 125 EGLRRCNWITKSSDKVYVQFHDECWGVPVYDDNQLFELLSLCGMLMDYNWTEILKRKDLF 184

Query: 199 RELFAGFNPATV-------INKFRHPRNVPLRSAK-------AEAMSKDMQKHG------ 80
           RE FAGF P  V       I +    +++ L  ++       A+ + K  + HG      
Sbjct: 185 REAFAGFEPNIVAKMGENDIMEIASNKDIMLAESRVRSIVENAKCILKIAKAHGSFSGYM 244

Query: 79  -------------------------------------FRFVGPVIVYSFMQAAGLTIDHL 11
                                                FR VGPVIVYSFMQAAG++IDHL
Sbjct: 245 WGSVNYKPTINRCRHPRNVPLRTPKAEAISKDLLKHGFRLVGPVIVYSFMQAAGMSIDHL 304

Query: 10  VDC 2
           VDC
Sbjct: 305 VDC 307


>XP_011021090.1 PREDICTED: uncharacterized protein LOC105123272 [Populus
           euphratica]
          Length = 318

 Score =  183 bits (465), Expect = 2e-51
 Identities = 123/300 (41%), Positives = 158/300 (52%), Gaps = 60/300 (20%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSREKDNKPCNS--LLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXX 548
           +R + EKN  S   ++  KP +S  L ++HLK+VYP+ L ++                  
Sbjct: 7   RRQILEKN--SIFIKEKEKPLSSQGLFTKHLKRVYPIGLHRSSSSLSLSSVSLSLSQNSN 64

Query: 547 XXXXXXSPRT-LEQKITLALRLIKSNERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQGF 371
                    T LEQKI+LALRLI  +ER+  P + + +   +    K+++   D +    
Sbjct: 65  DSSLTDCSATPLEQKISLALRLISPSERRGVPVARNFQTRQQQQQQKQDQGSNDGE---L 121

Query: 370 KRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMD---------------- 239
           KRCNWIT+NSDK+YV FHDE WGVPVYDD QLFELLALSGMLMD                
Sbjct: 122 KRCNWITQNSDKVYVAFHDEFWGVPVYDDIQLFELLALSGMLMDYNWTEILKRKELFREA 181

Query: 238 ---FNWTEILKRKEQ-------------------------------LRE-------LFAG 182
              FN   + K+ E+                                RE       ++  
Sbjct: 182 FNGFNPNIVAKKGEKEIMEIASNKAIMLAESRVRCIVDNARCLLKIAREFGSFSNYMWGN 241

Query: 181 FNPATVINKFRHPRNVPLRSAKAEAMSKDMQKHGFRFVGPVIVYSFMQAAGLTIDHLVDC 2
            N    IN++++PRNV LRS KAEA+SKD+ K GFRFVGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 242 VNFKPTINRYKYPRNVQLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 301


>XP_012847546.1 PREDICTED: uncharacterized protein LOC105967493 [Erythranthe
           guttata] EYU28897.1 hypothetical protein
           MIMGU_mgv1a010390mg [Erythranthe guttata]
          Length = 313

 Score =  182 bits (463), Expect = 3e-51
 Identities = 119/302 (39%), Positives = 152/302 (50%), Gaps = 62/302 (20%)
 Frame = -3

Query: 721 KRNLSEKNIKSPSR--EKDNKPCNSLLSRHLKKVYPVSLQKTCXXXXXXXXXXXXXXXXX 548
           K+   EK  K+ S    +  K  +++ SRHLKK+YP+ L +TC                 
Sbjct: 7   KKQALEKTNKNSSIIISRPEKNSSNIFSRHLKKIYPIGLHRTCSPLSLSSLSLSLSQNST 66

Query: 547 XXXXXXSPRT-LEQKITLALRLIKSN--ERKENPNSPSKEDLGKNNWIKKNRSDIDHDEQ 377
                 S    L+QKI+LALRLI S   +R+ +P             + K    +  +E 
Sbjct: 67  DSSLTDSSSAPLDQKISLALRLITSPPIKRRVDPT------------VSKGLDVLGDEEV 114

Query: 376 GFKRCNWITKNSDKLYVQFHDECWGVPVYDDQQLFELLALSGMLMDFNWTEILKRKEQLR 197
             +RCNWITKNSDK+YVQFHDECWGVPVYDD QLFELLA+ GMLMDFNWTEILKR++ LR
Sbjct: 115 TTRRCNWITKNSDKVYVQFHDECWGVPVYDDNQLFELLAMCGMLMDFNWTEILKRRQLLR 174

Query: 196 ELFAGFNPATV-------INKFRHPRNVPLRSAK-------AEAMSKDMQKHG------- 80
           E F GF+P  V       IN     + + L   +       A+ ++K  +++G       
Sbjct: 175 EAFVGFDPNNVEKMGEKEINDIASNKELSLAENRVRCIVDNAKCITKVAEEYGSFSSYLW 234

Query: 79  ------------------------------------FRFVGPVIVYSFMQAAGLTIDHLV 8
                                               FR VGPVIVYSFMQA+GLTIDHLV
Sbjct: 235 DNMSYKPVINKFRHPRNVPLRSPKAEVMSKDLVRRGFRLVGPVIVYSFMQASGLTIDHLV 294

Query: 7   DC 2
           DC
Sbjct: 295 DC 296

