BLASTX nr result

ID: Zingiber25_contig00030538 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00030538
         (707 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004970256.1| PREDICTED: uncharacterized protein LOC101752...   162   1e-37
ref|XP_006644844.1| PREDICTED: uncharacterized protein LOC102709...   155   9e-36
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...   154   2e-35
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...   154   3e-35
ref|NP_001150401.1| DNA-3-methyladenine glycosylase I [Zea mays]...   153   6e-35
ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...   153   6e-35
gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabi...   152   1e-34
ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr...   152   1e-34
tpg|DAA57252.1| TPA: hypothetical protein ZEAMMB73_557706 [Zea m...   151   2e-34
gb|ACF86573.1| unknown [Zea mays] gi|195657211|gb|ACG48073.1| DN...   151   2e-34
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]   149   7e-34
ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...   148   1e-33
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...   148   1e-33
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...   147   3e-33
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...   147   4e-33
gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Th...   145   1e-32
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...   144   4e-32
gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Th...   143   5e-32
ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Popu...   142   1e-31
ref|XP_003564403.1| PREDICTED: probable GMP synthase [glutamine-...   142   1e-31

>ref|XP_004970256.1| PREDICTED: uncharacterized protein LOC101752873 [Setaria italica]
          Length = 373

 Score =  162 bits (410), Expect = 1e-37
 Identities = 101/233 (43%), Positives = 131/233 (56%), Gaps = 3/233 (1%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC+S ++S+  A   IDGR VLQP  NR++P E  RP+K +L KS S+P SF N AAA  
Sbjct: 1   MCNSNVKSAGVAQ--IDGRPVLQPAGNRVAPPEGARPLKKSLHKSLSMPASFDNNAAAAA 58

Query: 198 FSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXXXX 377
                P+P   +T+    + L     P  + +     R  KA+  +              
Sbjct: 59  A---RPAPE--NTRAAAAASLLPPATPASVTA-----RATKAAAVAAEKSRVKARKPGAV 108

Query: 378 XXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTPA--KIEELA 551
             V     L  F         AGS+AAAQREHAA  QAQRK+RIAHYGRT +  ++E   
Sbjct: 109 LPVVTFAALEAFEP-------AGSIAAAQREHAAQAQAQRKMRIAHYGRTASFSRVEGRV 161

Query: 552 GSIECPDIAMSAS-QEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
           G+     +  S +  +EK+CSFITP SDP+YVAYHDEEWGVPVHDD +LFE+L
Sbjct: 162 GATAAEPVPASPTGNDEKRCSFITPYSDPLYVAYHDEEWGVPVHDDELLFEML 214


>ref|XP_006644844.1| PREDICTED: uncharacterized protein LOC102709508 [Oryza brachyantha]
          Length = 387

 Score =  155 bits (393), Expect = 9e-36
 Identities = 92/236 (38%), Positives = 127/236 (53%), Gaps = 6/236 (2%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC+S ++S+   +  IDGR VLQP  NR++  E  RP+K +LQKS S+P S  N AA   
Sbjct: 1   MCNSNVKSAGGVAQ-IDGRPVLQPAGNRVAAPEGARPLKKSLQKSLSMPASLDNAAAPPT 59

Query: 198 FSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTW----SXXXXXXXXXX 365
            +    +           + L     P  +++   ++   K ++     +          
Sbjct: 60  CTATPENTRASDFARAAAAALLPPPTPASVNAKATRVAGAKVASARAAATAAAMGSLDRS 119

Query: 366 XXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTPA--KI 539
                   G +      +GL     AGS+AAAQREHAAL QAQRK+RIAHYGRT +  ++
Sbjct: 120 RKPAKKAGGAVLPVVAFAGLEAYEPAGSIAAAQREHAALAQAQRKMRIAHYGRTASFSRV 179

Query: 540 EELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
           E    +       +    +EK+CSFITP SDP+YVAYHDEEWGVPVHDD +LFE+L
Sbjct: 180 EGKVSATATGTAELVTGHDEKRCSFITPYSDPLYVAYHDEEWGVPVHDDGLLFEML 235


>ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis]
          Length = 375

 Score =  154 bits (390), Expect = 2e-35
 Identities = 96/232 (41%), Positives = 128/232 (55%), Gaps = 2/232 (0%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC SK +   A    I+GR VLQP SN++  LE    +K T    + + T   N+ +   
Sbjct: 1   MCSSKSKLHSATQ--INGRPVLQPTSNQVPSLEKRNSIKKTGSPKSPITTDNVNSKS--- 55

Query: 198 FSIDHPSPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXXX 374
           F+    SPP+ P  K    + +KR  +P  L++S EK+ TPK                  
Sbjct: 56  FTKSLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIMTPKKLA--------------S 101

Query: 375 XXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTP-AKIEELA 551
                  + +A      L     GS+AAA+REH A++Q QRKLRIAHYGRT  AK E   
Sbjct: 102 LVKKPKNVGVAPCYDSSLIVEAPGSIAAARREHVAIMQEQRKLRIAHYGRTKSAKFEGKV 161

Query: 552 GSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
             ++      +  +EEK+CSFITP+SDP+YVAYHDEEWGVPVHDD++LFELL
Sbjct: 162 PGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDKLLFELL 213


>gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
          Length = 409

 Score =  154 bits (388), Expect = 3e-35
 Identities = 99/244 (40%), Positives = 132/244 (54%), Gaps = 14/244 (5%)
 Frame = +3

Query: 18  MCHS--KIRSSDAASPA---IDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANT 182
           MC S  K+ +    +PA   I+GR VLQP  NR+  L+    +K     S   P S A+T
Sbjct: 1   MCSSNAKVTAGVEITPAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSLAST 60

Query: 183 AAAGVFSIDHP-------SPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSX 338
             A   ++ +        +PPI P +K    + +KR  +P  L++S+EK+ TP+  T + 
Sbjct: 61  LPATSATVGNGGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEKVMTPRNITKTL 120

Query: 339 XXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHY 518
                          +S  I  +   S  L     GS+AA +RE  AL QAQRK++IAHY
Sbjct: 121 ERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVRREQMALQQAQRKMKIAHY 180

Query: 519 GRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRML 695
           GR+  AK E     +          +EEK+CSFITP+SDPVYVAYHDEEWGVPVHDD ML
Sbjct: 181 GRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWGVPVHDDSML 240

Query: 696 FELL 707
           FELL
Sbjct: 241 FELL 244


>ref|NP_001150401.1| DNA-3-methyladenine glycosylase I [Zea mays]
           gi|195638964|gb|ACG38950.1| DNA-3-methyladenine
           glycosylase I [Zea mays]
          Length = 373

 Score =  153 bits (386), Expect = 6e-35
 Identities = 101/241 (41%), Positives = 134/241 (55%), Gaps = 11/241 (4%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLE--SPRPVKHTLQKSTSLPTSFANTAAA 191
           MC+S ++S+  A   IDGR VLQP  NR++  E  + RP+K +LQKS S+P  + + A A
Sbjct: 1   MCNSNVKSAGVAQ--IDGRPVLQPAGNRVAAPEPDASRPLKKSLQKSLSMPAYYDSNATA 58

Query: 192 GVFSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXX 371
           G       + P P+   T  +            +++  L   KA+T +            
Sbjct: 59  G-------ARPAPAENTTRAA------------ANSSPLPPAKAATKAAGAFPAEKSGRS 99

Query: 372 XXXXVSGEIRLADFSSGLLRN-RVAGSVAAAQREHAALVQAQRKLRIAHYGRTPAKIEEL 548
                 G +     +   L     AGS+AAAQREHAA  QAQRKLRIAHYGRT A    +
Sbjct: 100 KAARRPGAVPPPVVAFAALDALEPAGSIAAAQREHAAQAQAQRKLRIAHYGRT-ASFSRV 158

Query: 549 AGSI-----ECPDIAMSAS---QEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFEL 704
            G +       P+ A++AS   Q+EK+CSFITP SDP+YVAYHDEEWGVPVHDD +LFE+
Sbjct: 159 EGRVVGAAAAAPERAVTASPAGQDEKRCSFITPYSDPLYVAYHDEEWGVPVHDDELLFEM 218

Query: 705 L 707
           L
Sbjct: 219 L 219


>ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera]
           gi|297738175|emb|CBI27376.3| unnamed protein product
           [Vitis vinifera]
          Length = 398

 Score =  153 bits (386), Expect = 6e-35
 Identities = 105/247 (42%), Positives = 131/247 (53%), Gaps = 17/247 (6%)
 Frame = +3

Query: 18  MCHSKIRSSDA-----ASPAIDGRQVLQPPSNRISPLE--------SPRPVKHTLQKSTS 158
           MC SK +         +   I+GR  LQP  NRI  LE        SP+     L  S  
Sbjct: 1   MCSSKSKLHQGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPASPP 60

Query: 159 LPTSFANTAAAGVFSIDHPS---PPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKAST 329
            PT+  NT          PS   P  P+ K      LKR  +P GL+SS EK+ TP+ +T
Sbjct: 61  PPTTIINTTKT------KPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVLTPRGTT 114

Query: 330 WSXXXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRI 509
            S                 S    L ++SS L+     GS+AAA+RE  A++Q QRK+RI
Sbjct: 115 KSSSSPKKTKKCSAGLAPSSDTSSL-NYSSSLIVE-APGSIAAARREQMAIMQVQRKMRI 172

Query: 510 AHYGRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDD 686
           AHYGRT  AK EE  G +   D  +  ++EEK+CSFITP+SDP YV YHDEEWGVPVHDD
Sbjct: 173 AHYGRTKSAKYEEKIGPV---DPLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDD 229

Query: 687 RMLFELL 707
           + LFELL
Sbjct: 230 KRLFELL 236


>gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabilis]
          Length = 394

 Score =  152 bits (384), Expect = 1e-34
 Identities = 99/239 (41%), Positives = 131/239 (54%), Gaps = 9/239 (3%)
 Frame = +3

Query: 18  MCHSKIRSS------DAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKS-TSLPTSFA 176
           MC SK ++        +A P I+GR VLQP  NR+S LE    +K T  KS TS P +  
Sbjct: 1   MCSSKPKTLLGTNTITSAEPKINGRPVLQPTCNRVSSLERRMSLKKTTPKSPTSPPLALP 60

Query: 177 NTAAAGVFSIDHPSPPI-PSTKITETSLLKRRGEP-IGLDSSTEKLRTPKASTWSXXXXX 350
               A        SPP+ P         +KR  +P   L+SS EK+ TP+    S     
Sbjct: 61  IQNGACKTKPSTLSPPVSPKLPSPRPPAIKRGKDPNYELNSSAEKVLTPRCIIKSTSSIK 120

Query: 351 XXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTP 530
                       +G +     +S  L     GS+AAA+RE  A++Q QRK+RIAHYGRT 
Sbjct: 121 KSKKCGG-----AGVVAETLKNSSSLIVEAPGSIAAARREQVAIMQEQRKIRIAHYGRT- 174

Query: 531 AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
            K  +  G +  P +  S  +E+K+CS+ITP+SDP+YVAYHDEEWGVPVHDD++LFELL
Sbjct: 175 -KSAKFEGKVVAPMLDSSVGKEQKRCSYITPNSDPIYVAYHDEEWGVPVHDDKLLFELL 232


>ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina]
           gi|557551187|gb|ESR61816.1| hypothetical protein
           CICLE_v10015639mg [Citrus clementina]
          Length = 375

 Score =  152 bits (384), Expect = 1e-34
 Identities = 97/232 (41%), Positives = 131/232 (56%), Gaps = 2/232 (0%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC SK +   A    I+GR VLQP SN++  LE    +K T    + + T+  N+ +   
Sbjct: 1   MCSSKSKLHSATQ--INGRPVLQPTSNQVPSLEKRSSIKKTGSPKSPITTNNVNSKS--- 55

Query: 198 FSIDHPSPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXXX 374
           F+    SPP+ P  K    + +KR  +P  L++S EK+ TPK                  
Sbjct: 56  FTKSLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIMTPKKLA------------SFV 103

Query: 375 XXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTP-AKIEELA 551
               + E+     SS ++     GS+AAA+REH A++Q QRKLRIAHYGRT  AK E   
Sbjct: 104 KKPKNAEVAPCYDSSLIVE--APGSIAAARREHVAIMQEQRKLRIAHYGRTKSAKFEGKV 161

Query: 552 GSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
             ++      +  +EEK+CSFITP+SDP YVAYHDEEWGVPVHDD++LFELL
Sbjct: 162 PGLDSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGVPVHDDKLLFELL 213


>tpg|DAA57252.1| TPA: hypothetical protein ZEAMMB73_557706 [Zea mays]
          Length = 293

 Score =  151 bits (381), Expect = 2e-34
 Identities = 95/236 (40%), Positives = 127/236 (53%), Gaps = 6/236 (2%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC+S ++S+  A   IDGR VLQP  NR++  ++ RP+K +L KS S+P S+ N A    
Sbjct: 1   MCNSNVKSAGVAQ--IDGRPVLQPAGNRVAAPDAARPLKKSLHKSFSMPPSYDNNATV-- 56

Query: 198 FSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXXXX 377
                P+ P P+         +    P  L   T     P A+  +              
Sbjct: 57  -----PARPAPAENT------RAAPAPPSLLPPTTPAPAPAAARATKAAGAVPAEKPRSK 105

Query: 378 XXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTPA--KIEELA 551
               G +      +       AGS+AAA+REHAA  QAQRK RIAHYGRT +  ++E   
Sbjct: 106 ARKPGAVLPVATFAAPEAFEPAGSIAAARREHAAQAQAQRKSRIAHYGRTASFSRVEGRV 165

Query: 552 GSIECPDIAMSASQ----EEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
           G+    + A+ AS     +EK+CSFITP SDP+YVAYHDEEWGVPVHDD +LFE+L
Sbjct: 166 GATATAEPAVPASPTTGLDEKRCSFITPYSDPLYVAYHDEEWGVPVHDDELLFEML 221


>gb|ACF86573.1| unknown [Zea mays] gi|195657211|gb|ACG48073.1| DNA-3-methyladenine
           glycosylase I [Zea mays] gi|414880122|tpg|DAA57253.1|
           TPA: DNA-3-methyladenine glycosylase I [Zea mays]
          Length = 377

 Score =  151 bits (381), Expect = 2e-34
 Identities = 95/236 (40%), Positives = 127/236 (53%), Gaps = 6/236 (2%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAAGV 197
           MC+S ++S+  A   IDGR VLQP  NR++  ++ RP+K +L KS S+P S+ N A    
Sbjct: 1   MCNSNVKSAGVAQ--IDGRPVLQPAGNRVAAPDAARPLKKSLHKSFSMPPSYDNNATV-- 56

Query: 198 FSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXXXXXXXXXX 377
                P+ P P+         +    P  L   T     P A+  +              
Sbjct: 57  -----PARPAPAENT------RAAPAPPSLLPPTTPAPAPAAARATKAAGAVPAEKPRSK 105

Query: 378 XXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTPA--KIEELA 551
               G +      +       AGS+AAA+REHAA  QAQRK RIAHYGRT +  ++E   
Sbjct: 106 ARKPGAVLPVATFAAPEAFEPAGSIAAARREHAAQAQAQRKSRIAHYGRTASFSRVEGRV 165

Query: 552 GSIECPDIAMSASQ----EEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
           G+    + A+ AS     +EK+CSFITP SDP+YVAYHDEEWGVPVHDD +LFE+L
Sbjct: 166 GATATAEPAVPASPTTGLDEKRCSFITPYSDPLYVAYHDEEWGVPVHDDELLFEML 221


>emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]
          Length = 398

 Score =  149 bits (377), Expect = 7e-34
 Identities = 103/247 (41%), Positives = 128/247 (51%), Gaps = 17/247 (6%)
 Frame = +3

Query: 18  MCHSKIRSSDA-----ASPAIDGRQVLQPPSNRISPLE--------SPRPVKHTLQKSTS 158
           MC SK +         +   I+GR  LQP  NRI  LE        SP+     L  S  
Sbjct: 1   MCSSKSKLHQGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPASLP 60

Query: 159 LPTSFANTAAAGVFSIDHPS---PPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKAST 329
            PT+  NT          PS   P  P+ K      LKR  +P GL+SS EK+ TP+ +T
Sbjct: 61  PPTTIINTTKT------KPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVLTPRGTT 114

Query: 330 WSXXXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRI 509
            S                 S    L   SS ++     GS+AAA+RE  A++Q QRK+RI
Sbjct: 115 KSSSSPKKTKKCSAGLAPSSDTSSLNYSSSFIVE--APGSIAAARREQMAIMQVQRKMRI 172

Query: 510 AHYGRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDD 686
           AHYGRT  AK EE    +   D  +  ++EEK+CSFITP+SDP YV YHDEEWGVPVHDD
Sbjct: 173 AHYGRTKSAKYEEKISPV---DPLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDD 229

Query: 687 RMLFELL 707
           + LFELL
Sbjct: 230 KRLFELL 236


>ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa]
           gi|550343248|gb|EEE78698.2| hypothetical protein
           POPTR_0003s15520g [Populus trichocarpa]
          Length = 420

 Score =  148 bits (374), Expect = 1e-33
 Identities = 102/253 (40%), Positives = 130/253 (51%), Gaps = 23/253 (9%)
 Frame = +3

Query: 18  MCHSKIRSSDAAS------PAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFAN 179
           MC SK R + + S        I+GR VLQP SN++  LE     +H   K  S P S   
Sbjct: 1   MCSSKSRLNQSTSNIATTIAKINGRPVLQPKSNQVPSLE-----RHNSLKKNSPPKSPTR 55

Query: 180 TAAAGVFSIDHP---------------SPPI-PSTKITETSLLKRRGEPIGLDSSTEKLR 311
             A     +  P               SPPI P  K      +KR  EP GL++S EK+ 
Sbjct: 56  EPAGPPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVL 115

Query: 312 TPKASTWSXXXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQA 491
           TP+++T                         A   S  L     GS+AAA+RE  A++Q 
Sbjct: 116 TPRSTTKVTTSTVKKSKKSSTAGVPHSVDTFAMKYSSSLLVEAPGSIAAARREQVAVMQE 175

Query: 492 QRKLRIAHYGRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWG 668
           QRK+RIAHYGRT  AK +        P  + + ++EEK+CSFITP+SDPVYVAYHDEEWG
Sbjct: 176 QRKMRIAHYGRTKSAKYQGKIVPANSPATS-TITREEKRCSFITPNSDPVYVAYHDEEWG 234

Query: 669 VPVHDDRMLFELL 707
           VPVHDD++LFELL
Sbjct: 235 VPVHDDKLLFELL 247


>ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa]
           gi|550343247|gb|EEE78699.2| hypothetical protein
           POPTR_0003s15520g [Populus trichocarpa]
          Length = 417

 Score =  148 bits (374), Expect = 1e-33
 Identities = 102/253 (40%), Positives = 130/253 (51%), Gaps = 23/253 (9%)
 Frame = +3

Query: 18  MCHSKIRSSDAAS------PAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFAN 179
           MC SK R + + S        I+GR VLQP SN++  LE     +H   K  S P S   
Sbjct: 1   MCSSKSRLNQSTSNIATTIAKINGRPVLQPKSNQVPSLE-----RHNSLKKNSPPKSPTR 55

Query: 180 TAAAGVFSIDHP---------------SPPI-PSTKITETSLLKRRGEPIGLDSSTEKLR 311
             A     +  P               SPPI P  K      +KR  EP GL++S EK+ 
Sbjct: 56  EPAGPPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVL 115

Query: 312 TPKASTWSXXXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQA 491
           TP+++T                         A   S  L     GS+AAA+RE  A++Q 
Sbjct: 116 TPRSTTKVTTSTVKKSKKSSTAGVPHSVDTFAMKYSSSLLVEAPGSIAAARREQVAVMQE 175

Query: 492 QRKLRIAHYGRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWG 668
           QRK+RIAHYGRT  AK +        P  + + ++EEK+CSFITP+SDPVYVAYHDEEWG
Sbjct: 176 QRKMRIAHYGRTKSAKYQGKIVPANSPATS-TITREEKRCSFITPNSDPVYVAYHDEEWG 234

Query: 669 VPVHDDRMLFELL 707
           VPVHDD++LFELL
Sbjct: 235 VPVHDDKLLFELL 247


>ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max]
          Length = 400

 Score =  147 bits (371), Expect = 3e-33
 Identities = 98/244 (40%), Positives = 133/244 (54%), Gaps = 14/244 (5%)
 Frame = +3

Query: 18  MCHSKIRSS-------DAASPA---IDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPT 167
           MC SK + +        AA P+   I+GR VLQP  NR+  LE    +K      +  P 
Sbjct: 1   MCSSKTKVTVGLEAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKSLSPP 60

Query: 168 SFANTAAAGVFSIDHPSPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXX 344
           S    +   +      +PP+ P  K       KR  +  GL+SS EK+  P++ST +   
Sbjct: 61  SPPLPSKTSL------TPPVSPKLKSPRLPATKRGNDNNGLNSSYEKIVIPRSSTKTPTL 114

Query: 345 XXXXXXXXXXXXXVSGEIRLA-DFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYG 521
                        VS  I  +  +SS L+ +   GS+AA +RE  AL QAQRK++IAHYG
Sbjct: 115 ERKKSKSFKEGSCVSASIEASLSYSSSLITDS-PGSIAAVRREQMALQQAQRKMKIAHYG 173

Query: 522 RTP-AKIEELAG-SIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRML 695
           R+  AK E +         +A   ++EEK+CSFITP+SDP+Y+AYHDEEWGVPVHDD+ML
Sbjct: 174 RSKSAKFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKML 233

Query: 696 FELL 707
           FELL
Sbjct: 234 FELL 237


>ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum]
          Length = 395

 Score =  147 bits (370), Expect = 4e-33
 Identities = 100/241 (41%), Positives = 135/241 (56%), Gaps = 11/241 (4%)
 Frame = +3

Query: 18  MCHSK--IRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAA 191
           MC+SK  ++SS      I+GR VLQP SN I PL   R   ++L+K+T+   S     + 
Sbjct: 1   MCNSKTKLQSSPQTLSQINGRPVLQPHSN-IVPLYERR---NSLKKTTNTAASVTANGST 56

Query: 192 GVFSIDHPSPPI-PSTKITETSLLKRRG--EPIGLDSSTEKLRTPKASTWSXXXXXXXXX 362
            V +    +PP+ P  K      +KR    +P GL SS EK+ TPK +            
Sbjct: 57  KVKTSSSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIVTPKGTANKAPILLKKPK 116

Query: 363 XXXXXXXVSGEIRLAD--FSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTP-A 533
                      +  +   +SS L+     GS+AAA+RE  A+ Q QRK++IAHYGRT  A
Sbjct: 117 KSSGGLASPPYVENSSLKYSSSLIVE-APGSIAAARREQVAIAQVQRKMKIAHYGRTKSA 175

Query: 534 KIEELAGSIECPDIAMSA---SQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFEL 704
           K E    S++ P  A +     +EEK+CSFITP+SDP+Y+AYHDEEWGVPVHDD +LFEL
Sbjct: 176 KYEGKVSSLD-PSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEWGVPVHDDNLLFEL 234

Query: 705 L 707
           L
Sbjct: 235 L 235


>gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao]
          Length = 413

 Score =  145 bits (366), Expect = 1e-32
 Identities = 98/248 (39%), Positives = 131/248 (52%), Gaps = 18/248 (7%)
 Frame = +3

Query: 18  MCHS--KIRSSDAASPA---IDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANT 182
           MC S  K+ +    +PA   I+GR VLQP  NR+  L+    +K     S   P S A+T
Sbjct: 1   MCSSNAKVTAGVEITPAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSLAST 60

Query: 183 AAAGVFSIDHP-------SPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSX 338
             A   ++ +        +PPI P +K    + +KR  +P  L++S+EK+ TP+  T + 
Sbjct: 61  LPATSATVGNGGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEKVMTPRNITKTL 120

Query: 339 XXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHY 518
                          +S  I  +   S  L     GS+AA +RE  AL QAQRK++IAHY
Sbjct: 121 ERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVRREQMALQQAQRKMKIAHY 180

Query: 519 GRTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSD----PVYVAYHDEEWGVPVHD 683
           GR+  AK E     +          +EEK+CSFITP+S     PVYVAYHDEEWGVPVHD
Sbjct: 181 GRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSGIAIYPVYVAYHDEEWGVPVHD 240

Query: 684 DRMLFELL 707
           D MLFELL
Sbjct: 241 DSMLFELL 248


>ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum
           lycopersicum]
          Length = 395

 Score =  144 bits (362), Expect = 4e-32
 Identities = 101/241 (41%), Positives = 133/241 (55%), Gaps = 11/241 (4%)
 Frame = +3

Query: 18  MCHSK--IRSSDAASPAIDGRQVLQPPSNRISPLESPRPVKHTLQKSTSLPTSFANTAAA 191
           MC+SK  ++SS      I+GR VLQP SN I PL   R   ++L+K+T          + 
Sbjct: 1   MCNSKTKLQSSAQTLSQINGRPVLQPHSN-IVPLYERR---NSLKKTTHTAAPVTANGST 56

Query: 192 GVFSIDHPSPPI-PSTKITETSLLKRRG--EPIGLDSSTEKLRTPK--ASTWSXXXXXXX 356
            V      +PP+ P  K      +KR    +P GL SS EK+ TPK  A+          
Sbjct: 57  KVKMSSSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIVTPKGTANKAPILLKKPK 116

Query: 357 XXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTP-A 533
                     S E     +SS L+     GS+AAA+RE  A+ Q QRK++IAHYGRT  A
Sbjct: 117 KSSGGLASPSSVENSSLKYSSSLIVE-APGSIAAARREQVAIAQVQRKMKIAHYGRTKSA 175

Query: 534 KIEELAGSIECPDIAMSA---SQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFEL 704
           K E    S++ P  A +     +E+K+CSFITP+SDP+Y+AYHDEEWGVPVHDD +LFEL
Sbjct: 176 KYEGKVSSLD-PSFASAVIPNPREDKRCSFITPNSDPLYIAYHDEEWGVPVHDDNLLFEL 234

Query: 705 L 707
           L
Sbjct: 235 L 235


>gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Theobroma cacao]
          Length = 398

 Score =  143 bits (361), Expect = 5e-32
 Identities = 97/243 (39%), Positives = 135/243 (55%), Gaps = 13/243 (5%)
 Frame = +3

Query: 18  MCHSKIR---SSDAASPA--IDGRQVLQPPSNRISPLESPRPVKHTLQKSTSL--PTSFA 176
           MC SK +    S+ AS    I+GR VLQPPSN+I+  +    +K     S +L  P   +
Sbjct: 1   MCCSKFKLHKDSNIASTVAEINGRPVLQPPSNQITSSDKRNSLKKISSNSPALSAPLQLS 60

Query: 177 NTAAAGV-FSIDHPSPPIPSTKITETSLLKRRGEPIGLDSSTEKLRTPKASTWSXXXXXX 353
           N+ A  V  ++   SPPI S K    + LKR  +   L+SS+EK+  P+ +         
Sbjct: 61  NSRARAVKATMPSLSPPI-SPKSPRPTALKRGKDSNELNSSSEKVIAPRCNV------KL 113

Query: 354 XXXXXXXXXXVSGEIRL----ADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYG 521
                       G + L    A +SS  +     GS+AAA+RE  A++Q QRK+RIAHYG
Sbjct: 114 DSKVKKPKNASGGGVALTSVDAKYSSSFMVLEAPGSIAAARREQVAMIQEQRKMRIAHYG 173

Query: 522 RTP-AKIEELAGSIECPDIAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLF 698
           RT  AK E     ++      +A Q++++CSFIT +SDPVY AYHDEEWGV VHDD++LF
Sbjct: 174 RTKSAKYERKMVGLDSSAARTAARQDQRRCSFITVNSDPVYAAYHDEEWGVAVHDDKLLF 233

Query: 699 ELL 707
           EL+
Sbjct: 234 ELV 236


>ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Populus trichocarpa]
           gi|550347083|gb|EEE84187.2| hypothetical protein
           POPTR_0001s12320g [Populus trichocarpa]
          Length = 373

 Score =  142 bits (357), Expect = 1e-31
 Identities = 99/250 (39%), Positives = 127/250 (50%), Gaps = 20/250 (8%)
 Frame = +3

Query: 18  MCHSKIR----SSDAASPA--IDGRQVLQPPSNRISPLESPR------PVKHTLQKSTSL 161
           MC  K R    +++ A+P   I+GR VLQP SN++  LE         P K   Q+  ++
Sbjct: 1   MCSFKFRLHRSANNIATPIAKINGRPVLQPKSNQVPSLERRNSLKKNSPAKSPTQEPAAV 60

Query: 162 PT-----SFANTAAAGVFSIDHPSPPI-PSTKITETSLLKRRGEPIGLDSSTEKLRTPKA 323
           P         N A          SPPI P  K      +KR  +P GL++S EK+ TP  
Sbjct: 61  PPIPLMQPAGNAAGTKTKQPSGLSPPISPKLKSPVLPAVKRGNDPDGLNTSAEKVWTPLE 120

Query: 324 STWSXXXXXXXXXXXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKL 503
           S                                       GS+AAA+REH A++Q QRK+
Sbjct: 121 SP--------------------------------------GSIAAARREHVAVMQEQRKM 142

Query: 504 RIAHYGRTPAKIEELAGSIECPD--IAMSASQEEKKCSFITPSSDPVYVAYHDEEWGVPV 677
           RIAHYGRT  K  +  G +   D     + S+EEK+CSFITP+SDP+YVAYHDEEWGVPV
Sbjct: 143 RIAHYGRT--KSAKYHGKVVPADSPATNTISREEKRCSFITPNSDPIYVAYHDEEWGVPV 200

Query: 678 HDDRMLFELL 707
           HDD+MLFELL
Sbjct: 201 HDDKMLFELL 210


>ref|XP_003564403.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like
           [Brachypodium distachyon]
          Length = 380

 Score =  142 bits (357), Expect = 1e-31
 Identities = 93/240 (38%), Positives = 126/240 (52%), Gaps = 10/240 (4%)
 Frame = +3

Query: 18  MCHSKIRSSDAASPAIDGRQVLQPPSNRISPLESPRP-VKHTLQKSTSLPTSFANTAAAG 194
           MC+S ++S+ A    IDGR VLQP  NR++  E+ RP +K +LQKS S+P S+ N  A  
Sbjct: 1   MCNSNVKSAVAQ---IDGRPVLQPAGNRVAAPEAARPPLKKSLQKSLSMPASYDNNNA-- 55

Query: 195 VFSIDHPSPPIPSTKITETSLLKRRG----EPIGLDSSTEKLRTPKASTWSXXXXXXXXX 362
                      P+T    +S L R       P     +    +   A+  +         
Sbjct: 56  -----------PTTATKNSSELARAALHLLPPTAPAKAAGVSKAGAAADKNRKGAKKSGA 104

Query: 363 XXXXXXXVSGEIRLADFSSGLLRNRVAGSVAAAQREHAALVQAQRKLRIAHYGRTPA--K 536
                      +   + +        AGS+AAAQREH    QAQRK+RIAHYGRT +  +
Sbjct: 105 AVLPPVVTFASLEAFELAGAGAGPGPAGSIAAAQREHVTQAQAQRKMRIAHYGRTASFSR 164

Query: 537 IEELAGSIECPDIA---MSASQEEKKCSFITPSSDPVYVAYHDEEWGVPVHDDRMLFELL 707
           +E   G+      A   + A+ +EK+CSFITP SDPVYVAYHDEEWG+PVHDD +LFE+L
Sbjct: 165 VEGRVGATATATPAGPAVVAAPDEKRCSFITPYSDPVYVAYHDEEWGMPVHDDELLFEML 224


Top