BLASTX nr result

ID: Rauwolfia21_contig00004988 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00004988
         (2214 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...   482   e-133
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]   479   e-132
ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...   477   e-132
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...   477   e-132
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...   467   e-129
gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus pe...   466   e-128
gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabi...   465   e-128
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...   461   e-127
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...   454   e-125
gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Th...   450   e-123
gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe...   442   e-121
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...   441   e-121
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...   440   e-120
ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr...   439   e-120
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...   438   e-120
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...   437   e-120
ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298...   435   e-119
ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [R...   435   e-119
ref|XP_002315089.2| methyladenine glycosylase family protein [Po...   434   e-119
ref|XP_002312220.1| methyladenine glycosylase family protein [Po...   432   e-118

>ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera]
            gi|297738175|emb|CBI27376.3| unnamed protein product
            [Vitis vinifera]
          Length = 398

 Score =  482 bits (1240), Expect = e-133
 Identities = 263/407 (64%), Positives = 298/407 (73%), Gaps = 1/407 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850
            MCSSKSK        D+  + + INGRP LQP CNR P LER +                
Sbjct: 1    MCSSKSKLH---QGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPA 57

Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKT 1670
               P                      KSPR PA KRGNDPNGLNSS+EKV LTP+ + K+
Sbjct: 58   SPPPPTTIINTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKV-LTPRGTTKS 116

Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490
             +  KK+K  C  G     S D S L N SSSLIVEAPGSIAAARREQ+AIMQVQRKMRI
Sbjct: 117  SSSPKKTKK-CSAGL--APSSDTSSL-NYSSSLIVEAPGSIAAARREQMAIMQVQRKMRI 172

Query: 1489 AHYGRTKSAKYEGKIVPLDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310
            AHYGRTKSAKYE KI P+D       +EE+RC FIT NSDP Y+ YHDEEWGVPVH+DK 
Sbjct: 173  AHYGRTKSAKYEEKIGPVDP-LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKR 231

Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130
            LFELLV+TGAQVGSDW TVLKKRQ++RDA + +DAEIV K+SEKK+ +I   YGI+LSQV
Sbjct: 232  LFELLVMTGAQVGSDWTTVLKKRQEYRDALSGYDAEIVGKFSEKKITSISAYYGIDLSQV 291

Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950
            RGVVDN++RILEIKREFGSF KY+W FVN+KPI TQYKSC KIPVKTSKSE+ISK MVRR
Sbjct: 292  RGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQYKSCHKIPVKTSKSESISKDMVRR 351

Query: 949  GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            GFRLVGPTVI+SFM+AAGLTNDHLI+CPRHL+C AL+SH PAVAPAL
Sbjct: 352  GFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSHQPAVAPAL 398


>emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]
          Length = 398

 Score =  479 bits (1234), Expect = e-132
 Identities = 262/407 (64%), Positives = 297/407 (72%), Gaps = 1/407 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850
            MCSSKSK        D+  + + INGRP LQP CNR P LER +                
Sbjct: 1    MCSSKSKLH---QGIDITPSKAQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPA 57

Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKT 1670
               P                      KSPR PA KRGNDPNGLNSS+EKV LTP+ + K+
Sbjct: 58   SLPPPTTIINTTKTKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKV-LTPRGTTKS 116

Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490
             +  KK+K  C  G     S D S L N SSS IVEAPGSIAAARREQ+AIMQVQRKMRI
Sbjct: 117  SSSPKKTKK-CSAGL--APSSDTSSL-NYSSSFIVEAPGSIAAARREQMAIMQVQRKMRI 172

Query: 1489 AHYGRTKSAKYEGKIVPLDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310
            AHYGRTKSAKYE KI P+D       +EE+RC FIT NSDP Y+ YHDEEWGVPVH+DK 
Sbjct: 173  AHYGRTKSAKYEEKISPVDP-LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKR 231

Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130
            LFELLV+TGAQVGSDW TVLKKRQ++RDAF+ +DAEIV K+SEKK+ +I   YGI+LSQV
Sbjct: 232  LFELLVMTGAQVGSDWTTVLKKRQEYRDAFSGYDAEIVGKFSEKKITSISAYYGIDLSQV 291

Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950
            RGVVDN++RILEIKREFGSF KY+W FVN+KPI TQ KSC KIPVKTSKSE+ISK MVRR
Sbjct: 292  RGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQXKSCHKIPVKTSKSESISKDMVRR 351

Query: 949  GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            GFRLVGPTVI+SFM+AAGLTNDHLI+CPRHL+C AL+SH PAVAPAL
Sbjct: 352  GFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSHQPAVAPAL 398


>ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa]
            gi|550343248|gb|EEE78698.2| hypothetical protein
            POPTR_0003s15520g [Populus trichocarpa]
          Length = 420

 Score =  477 bits (1228), Expect = e-132
 Identities = 256/413 (61%), Positives = 308/413 (74%), Gaps = 9/413 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ--- 1856
            MCSSKS+   ST+      ++ INGRPVLQP  N+ P LER N             +   
Sbjct: 1    MCSSKSRLNQSTSNI-ATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAG 59

Query: 1855 --VPLSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPK 1685
              VPL  P+                       SPRPPA KRGN+P GLN+S EKV LTP+
Sbjct: 60   PPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKV-LTPR 118

Query: 1684 CSNK-TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508
             + K T + VKKSK S   G  +  S+D   +K  SSSL+VEAPGSIAAARREQVA+MQ 
Sbjct: 119  STTKVTTSTVKKSKKSSTAGVPH--SVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQE 175

Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGV 1331
            QRKMRIAHYGRTKSAKY+GKIVP +S AT+ + +EE+RC FIT NSDP+Y+AYHDEEWGV
Sbjct: 176  QRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSDPVYVAYHDEEWGV 235

Query: 1330 PVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDY 1151
            PVH+DK+LFELL LTGAQVGS+W +VLKKR+ FR+AF+ FDAEIV+K++EKK+ +I  +Y
Sbjct: 236  PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295

Query: 1150 GIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETI 971
            G+++SQVRGVVDN++RILE+KREFGSFD+YLW +VN+KPI+TQYKSC KIPVKTSKSETI
Sbjct: 296  GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355

Query: 970  SKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSP-AVAP 815
            SK MV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHL+C ALAS  P  VAP
Sbjct: 356  SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408


>ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa]
            gi|550343247|gb|EEE78699.2| hypothetical protein
            POPTR_0003s15520g [Populus trichocarpa]
          Length = 417

 Score =  477 bits (1228), Expect = e-132
 Identities = 256/413 (61%), Positives = 308/413 (74%), Gaps = 9/413 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ--- 1856
            MCSSKS+   ST+      ++ INGRPVLQP  N+ P LER N             +   
Sbjct: 1    MCSSKSRLNQSTSNI-ATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAG 59

Query: 1855 --VPLSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPK 1685
              VPL  P+                       SPRPPA KRGN+P GLN+S EKV LTP+
Sbjct: 60   PPVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKV-LTPR 118

Query: 1684 CSNK-TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508
             + K T + VKKSK S   G  +  S+D   +K  SSSL+VEAPGSIAAARREQVA+MQ 
Sbjct: 119  STTKVTTSTVKKSKKSSTAGVPH--SVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQE 175

Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGV 1331
            QRKMRIAHYGRTKSAKY+GKIVP +S AT+ + +EE+RC FIT NSDP+Y+AYHDEEWGV
Sbjct: 176  QRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSDPVYVAYHDEEWGV 235

Query: 1330 PVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDY 1151
            PVH+DK+LFELL LTGAQVGS+W +VLKKR+ FR+AF+ FDAEIV+K++EKK+ +I  +Y
Sbjct: 236  PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295

Query: 1150 GIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETI 971
            G+++SQVRGVVDN++RILE+KREFGSFD+YLW +VN+KPI+TQYKSC KIPVKTSKSETI
Sbjct: 296  GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355

Query: 970  SKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSP-AVAP 815
            SK MV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHL+C ALAS  P  VAP
Sbjct: 356  SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408


>ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum
            lycopersicum]
          Length = 395

 Score =  467 bits (1202), Expect = e-129
 Identities = 263/417 (63%), Positives = 302/417 (72%), Gaps = 11/417 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXL---- 1859
            MC+SK+K Q S        +S INGRPVLQP+ N  PL ERRN                 
Sbjct: 1    MCNSKTKLQSSAQT-----LSQINGRPVLQPHSNIVPLYERRNSLKKTTHTAAPVTANGS 55

Query: 1858 -QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGN--DPNGLNSSVEKVVLTP 1688
             +V +SS +                      SPR PA KRGN  DPNGL+SS EK+V   
Sbjct: 56   TKVKMSSSTTPPVSPKMK-------------SPRLPAIKRGNNIDPNGLSSSAEKIVTPK 102

Query: 1687 KCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQV 1508
              +NK    +KK K S G G  + +S++NS LK  SSSLIVEAPGSIAAARREQVAI QV
Sbjct: 103  GTANKAPILLKKPKKSSG-GLASPSSVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQV 160

Query: 1507 QRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEE 1340
            QRKM+IAHYGRTKSAKYEGK+  LD S  +AV    +E++RC FIT NSDP+YIAYHDEE
Sbjct: 161  QRKMKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREDKRCSFITPNSDPLYIAYHDEE 220

Query: 1339 WGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTIC 1160
            WGVPVH+D +LFELLVLTGAQVGSDW +VLKKRQ+FRDAF+ FD EIVSKY+EKK+ +  
Sbjct: 221  WGVPVHDDNLLFELLVLTGAQVGSDWTSVLKKRQEFRDAFSGFDPEIVSKYNEKKITSTS 280

Query: 1159 NDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKS 980
             +YGIELSQ+RG VDN++RILEIK+ FGSFDKYLW FVN KPIATQYK+C KIPVKTSKS
Sbjct: 281  VEYGIELSQIRGAVDNSTRILEIKKTFGSFDKYLWGFVNNKPIATQYKACNKIPVKTSKS 340

Query: 979  ETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            ETISK MV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHL C ALA+  PA  PAL
Sbjct: 341  ETISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLPCVALAT-QPA-PPAL 395


>gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica]
          Length = 397

 Score =  466 bits (1199), Expect = e-128
 Identities = 255/407 (62%), Positives = 296/407 (72%), Gaps = 2/407 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MCSSK K Q +T+     +   +N RPVLQP  N+ P LE+R               +P 
Sbjct: 1    MCSSKPKLQRTTSVPP--STPKMNRRPVLQPTGNQFPSLEQRKSLKKSSQEPLAPTPLPS 58

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667
              PS                      SPRPPA KRG DPN LNSS EKVV TP+C+ K  
Sbjct: 59   PLPSAKTKASLSPPISPKLP------SPRPPAFKRGKDPNELNSSAEKVV-TPRCTTKFT 111

Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487
            + VKKSK S G  ++  A    S LKNISS LIVEAPGSIAAARREQVA MQ QRKMRIA
Sbjct: 112  SSVKKSKKSSG--SVAAAPSAESILKNISS-LIVEAPGSIAAARREQVATMQEQRKMRIA 168

Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV-KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDKM 1310
            HYGRTKSAK EGK+VPLD+S T    +++RRC FIT NSDPIY+AYHDEEWGVPVH+D +
Sbjct: 169  HYGRTKSAKNEGKVVPLDASPTTDFGRDQRRCTFITPNSDPIYVAYHDEEWGVPVHDDNL 228

Query: 1309 LFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQV 1130
            L ELLVLTGAQVGSDW +VL+KRQ  R++F+ FDA+ V+K+SE+K+ ++ +D GI++S V
Sbjct: 229  LLELLVLTGAQVGSDWTSVLRKRQALRESFSGFDADGVAKFSERKITSVSSDSGIDISLV 288

Query: 1129 RGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVRR 950
            RG VDNA RIL+IKRE GSFDKYLW FVN+KPI+TQYKSC KIPVK SKSE+ISK MVRR
Sbjct: 289  RGAVDNAKRILQIKREVGSFDKYLWGFVNHKPISTQYKSCHKIPVKNSKSESISKDMVRR 348

Query: 949  GFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAA-LASHSPAVAPA 812
            GFRLVGPTVIHSFM+AAGLTNDHLI CPRHL+CAA LAS  P  APA
Sbjct: 349  GFRLVGPTVIHSFMQAAGLTNDHLITCPRHLQCAASLASSPPVAAPA 395


>gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabilis]
          Length = 394

 Score =  465 bits (1196), Expect = e-128
 Identities = 262/409 (64%), Positives = 295/409 (72%), Gaps = 3/409 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MCSSK K    T      A   INGRPVLQP CNR   LERR                PL
Sbjct: 1    MCSSKPKTLLGTNTI-TSAEPKINGRPVLQPTCNRVSSLERR--MSLKKTTPKSPTSPPL 57

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPN-GLNSSVEKVVLTPKCSNKT 1670
            + P                       SPRPPA KRG DPN  LNSS EKV LTP+C  K+
Sbjct: 58   ALP-IQNGACKTKPSTLSPPVSPKLPSPRPPAIKRGKDPNYELNSSAEKV-LTPRCIIKS 115

Query: 1669 VAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 1490
             + +KKSK  CG   +   +L NS      SSLIVEAPGSIAAARREQVAIMQ QRK+RI
Sbjct: 116  TSSIKKSKK-CGGAGVVAETLKNS------SSLIVEAPGSIAAARREQVAIMQEQRKIRI 168

Query: 1489 AHYGRTKSAKYEGKIVP--LDSSATAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHED 1316
            AHYGRTKSAK+EGK+V   LDSS     KE++RC +IT NSDPIY+AYHDEEWGVPVH+D
Sbjct: 169  AHYGRTKSAKFEGKVVAPMLDSSVG---KEQKRCSYITPNSDPIYVAYHDEEWGVPVHDD 225

Query: 1315 KMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELS 1136
            K+LFELLVLTGAQVGSDW +VLKKR+ FR+AF+ FDAE VSKY+EKK+ +I  DYGIELS
Sbjct: 226  KLLFELLVLTGAQVGSDWTSVLKKREIFRNAFSGFDAEAVSKYNEKKITSIGADYGIELS 285

Query: 1135 QVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMV 956
             +RG VDNA+RILEIK+EFGS +KYLW FVN K I+TQYKSC KIPVKTSKSE+ISK MV
Sbjct: 286  LIRGAVDNANRILEIKKEFGSLNKYLWGFVNNKLISTQYKSCQKIPVKTSKSESISKDMV 345

Query: 955  RRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            RRGFR VGPTVI+SFM+AAGLTNDHLI CPRHL+C ALAS  P+VAPAL
Sbjct: 346  RRGFRFVGPTVIYSFMQAAGLTNDHLITCPRHLQCLALASQLPSVAPAL 394


>ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum]
          Length = 395

 Score =  461 bits (1185), Expect = e-127
 Identities = 259/412 (62%), Positives = 298/412 (72%), Gaps = 6/412 (1%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MC+SK+K Q S        +S INGRPVLQP+ N  PL ERRN                 
Sbjct: 1    MCNSKTKLQSSPQT-----LSQINGRPVLQPHSNIVPLYERRNSLKKTTNTA-------- 47

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGN--DPNGLNSSVEKVVLTPKCSNK 1673
            +S +                     KSPR PA KRGN  DPNGL+SS EK+V     +NK
Sbjct: 48   ASVTANGSTKVKTSSSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIVTPKGTANK 107

Query: 1672 TVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMR 1493
                +KK K S G G  +   ++NS LK  SSSLIVEAPGSIAAARREQVAI QVQRKM+
Sbjct: 108  APILLKKPKKSSG-GLASPPYVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQVQRKMK 165

Query: 1492 IAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEEWGVPV 1325
            IAHYGRTKSAKYEGK+  LD S  +AV    +EE+RC FIT NSDP+YIAYHDEEWGVPV
Sbjct: 166  IAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEWGVPV 225

Query: 1324 HEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGI 1145
            H+D +LFELLVLTGAQVGSDW +VL+KRQ+FRDAF+ FD EIVSKY+EKK+ +   +YGI
Sbjct: 226  HDDNLLFELLVLTGAQVGSDWTSVLRKRQEFRDAFSGFDPEIVSKYNEKKITSTSVEYGI 285

Query: 1144 ELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISK 965
            ELSQ+RG VDN++RILEIK+ F SF+KYLW FVN KPIATQYK+C KIPVKTSKSETISK
Sbjct: 286  ELSQIRGAVDNSTRILEIKKTFDSFNKYLWGFVNNKPIATQYKACNKIPVKTSKSETISK 345

Query: 964  GMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
             MV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHL+C ALA+  PA  PAL
Sbjct: 346  DMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLQCMALAT-QPA-PPAL 395


>ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis]
          Length = 375

 Score =  454 bits (1167), Expect = e-125
 Identities = 249/408 (61%), Positives = 289/408 (70%), Gaps = 2/408 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MCSSKSK   +T          INGRPVLQP  N+ P LE+RN                +
Sbjct: 1    MCSSKSKLHSAT---------QINGRPVLQPTSNQVPSLEKRNSIKKTGSPKSPITTDNV 51

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667
            +S S                      SPRP A KRGNDPN LN+S EK+ +TPK   K  
Sbjct: 52   NSKSFTKSLLSPPVSPKLK-------SPRPAAVKRGNDPNVLNTSAEKI-MTPK---KLA 100

Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487
            + VKK KN      + VA           SSLIVEAPGSIAAARRE VAIMQ QRK+RIA
Sbjct: 101  SLVKKPKN------VGVAPC-------YDSSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147

Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV--KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313
            HYGRTKSAK+EGK+  LDS A      +EE+RC FIT NSDPIY+AYHDEEWGVPVH+DK
Sbjct: 148  HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDK 207

Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133
            +LFELLVLT AQVGSDW +VLKKRQ FR+AF+ FDAE+V+K++EKKM ++  +Y I+LSQ
Sbjct: 208  LLFELLVLTAAQVGSDWTSVLKKRQAFREAFSGFDAEVVAKFTEKKMTSLSANYAIDLSQ 267

Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953
            VRG+VDN+ RILE+K++FGSFDKYLW FVN+KPI TQY+S  KIPVKTSKSE ISK MV+
Sbjct: 268  VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKPINTQYRSSQKIPVKTSKSEAISKDMVK 327

Query: 952  RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            +GFR VGPTVIHSFM+AAGLTNDHLI C RHL+C ALASH PAVAPAL
Sbjct: 328  KGFRFVGPTVIHSFMQAAGLTNDHLITCTRHLQCTALASHQPAVAPAL 375


>gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Theobroma cacao]
          Length = 398

 Score =  450 bits (1158), Expect = e-123
 Identities = 245/408 (60%), Positives = 292/408 (71%), Gaps = 2/408 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MC SK K    +  A  VA   INGRPVLQP  N+    ++RN               PL
Sbjct: 1    MCCSKFKLHKDSNIASTVA--EINGRPVLQPPSNQITSSDKRNSLKKISSNSPAL-SAPL 57

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667
               +                      SPRP A KRG D N LNSS EKV+  P+C+ K  
Sbjct: 58   QLSNSRARAVKATMPSLSPPISPK--SPRPTALKRGKDSNELNSSSEKVI-APRCNVKLD 114

Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487
            + VKK KN+ G G + + S+D    K  SS +++EAPGSIAAARREQVA++Q QRKMRIA
Sbjct: 115  SKVKKPKNASG-GGVALTSVD---AKYSSSFMVLEAPGSIAAARREQVAMIQEQRKMRIA 170

Query: 1486 HYGRTKSAKYEGKIVPLDSSA--TAAVKEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313
            HYGRTKSAKYE K+V LDSSA  TAA +++RRC FIT+NSDP+Y AYHDEEWGV VH+DK
Sbjct: 171  HYGRTKSAKYERKMVGLDSSAARTAARQDQRRCSFITVNSDPVYAAYHDEEWGVAVHDDK 230

Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133
            +LFEL+VL GAQVGSDW +VLKKRQDFR+AF+ FDAE+++ +SEK + +I +DYGI++SQ
Sbjct: 231  LLFELVVLIGAQVGSDWTSVLKKRQDFREAFSGFDAEVIAGFSEKNILSISSDYGIDVSQ 290

Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953
            VR  VDNA+RILE+++EFGSF+ YLW FVN+KPI TQYKSC KIPVKTSKSE ISK MVR
Sbjct: 291  VRAAVDNANRILEVRKEFGSFNNYLWGFVNHKPIVTQYKSCHKIPVKTSKSEAISKDMVR 350

Query: 952  RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            RGFR VGPTVIHS M+AAGLTNDHL  CPRHL+C ALAS  P VAPAL
Sbjct: 351  RGFRFVGPTVIHSLMQAAGLTNDHLSTCPRHLQCIALASQFPTVAPAL 398


>gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica]
          Length = 426

 Score =  442 bits (1138), Expect = e-121
 Identities = 242/429 (56%), Positives = 293/429 (68%), Gaps = 23/429 (5%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ-VP 1850
            MCSSK+K         +VA   INGRPVLQP CNR P L+RRN               +P
Sbjct: 1    MCSSKAKVTIGVEVTPMVA--RINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 58

Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXK-SPRPPATKRGNDPNGLNSSVEKVVLTPKCSNK 1673
             SS S                       SPRPPA KRGNDPNGLNSS EKVV     +  
Sbjct: 59   TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGTTRA 118

Query: 1672 TVAPVKKSKN----SCGVGNINV--------------ASLDNSPLKNISSSLIVEAPGSI 1547
             +   KKSK+    S GV   +               +SL+     + SSSLI EAPGSI
Sbjct: 119  KILERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPGSI 178

Query: 1546 AAARREQVAIMQVQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAVK---EERRCHFITLN 1376
            AA RREQ+A+   QRKMRIAHYGR+KSA +E ++VP+D+S     K   EE+RC FIT N
Sbjct: 179  AAVRREQMALQHAQRKMRIAHYGRSKSANFE-RVVPVDASGNIEAKGAEEEKRCSFITAN 237

Query: 1375 SDPIYIAYHDEEWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIV 1196
            SDPIY+AYHDEEWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFR+AF++FDAEIV
Sbjct: 238  SDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEIV 297

Query: 1195 SKYSEKKMNTICNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYK 1016
            + +++K+M +I ++YGI++S+VRGVVDN++RILEIK+EFGSFDKY+W FVN KPI+ QYK
Sbjct: 298  ANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQYK 357

Query: 1015 SCLKIPVKTSKSETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836
               KIPVKTSKSE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHL+C  LA+
Sbjct: 358  LGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLAA 417

Query: 835  HSPAVAPAL 809
              P +   L
Sbjct: 418  RRPTLEEVL 426


>ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max]
          Length = 400

 Score =  441 bits (1135), Expect = e-121
 Identities = 241/406 (59%), Positives = 292/406 (71%), Gaps = 7/406 (1%)
 Frame = -3

Query: 2026 MCSSKSKPQ---GSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQ 1856
            MCSSK+K      +  AA   +V+ INGRPVLQP CNR P LERRN              
Sbjct: 1    MCSSKTKVTVGLEAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKS---- 56

Query: 1855 VPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSN 1676
              LS PS                      SPR PATKRGND NGLNSS EK+V+ P+ S 
Sbjct: 57   --LSPPSPPLPSKTSLTPPVSPKLK----SPRLPATKRGNDNNGLNSSYEKIVI-PRSST 109

Query: 1675 KTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 1496
            KT    +K   S   G+   AS++ S   + SSSLI ++PGSIAA RREQ+A+ Q QRKM
Sbjct: 110  KTPTLERKKSKSFKEGSCVSASIEAS--LSYSSSLITDSPGSIAAVRREQMALQQAQRKM 167

Query: 1495 RIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDEEWGVP 1328
            +IAHYGR+KSAK+E ++VPLD S T+      +EE+RC FIT NSDPIYIAYHDEEWGVP
Sbjct: 168  KIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVP 226

Query: 1327 VHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYG 1148
            VH+DKMLFELLVL+GAQVGSDW + LKKR DFR AF+EFDAE V+  ++K+M +I ++YG
Sbjct: 227  VHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYG 286

Query: 1147 IELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETIS 968
            I++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KPI+TQYK   KIPVKTSKSE+IS
Sbjct: 287  IDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESIS 346

Query: 967  KGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830
            K MVRRGFR VGPTV+HSFM+ +GLTNDHLI C RHL+C  LA+ S
Sbjct: 347  KDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLLAARS 392


>gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao]
          Length = 409

 Score =  440 bits (1131), Expect = e-120
 Identities = 237/405 (58%), Positives = 293/405 (72%), Gaps = 6/405 (1%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRN---XXXXXXXXXXXXL 1859
            MCSS +K    TA  ++  AV+ INGRPVLQP CNR P L+RRN               L
Sbjct: 1    MCSSNAKV---TAGVEITPAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSL 57

Query: 1858 QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCS 1679
               L + S                     KSPRP A KRG+DPN LN+S EK V+TP+  
Sbjct: 58   ASTLPATSATVGNGGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEK-VMTPRNI 116

Query: 1678 NKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRK 1499
             KT+   K      G+GN   + ++  P  + SSSLIVEAPGSIAA RREQ+A+ Q QRK
Sbjct: 117  TKTLERKKSKSFKEGMGNGLSSWIE--PSLSYSSSLIVEAPGSIAAVRREQMALQQAQRK 174

Query: 1498 MRIAHYGRTKSAKYEGKIVPLDSSA--TAAVKEERRCHFITLNSDPIYIAYHDEEWGVPV 1325
            M+IAHYGR+KSAK+E K+VPL++S+  T   +EE+RC FIT NSDP+Y+AYHDEEWGVPV
Sbjct: 175  MKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWGVPV 234

Query: 1324 HEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGI 1145
            H+D MLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAE V+K+++K+M TI ++YGI
Sbjct: 235  HDDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTDKEMTTISSEYGI 294

Query: 1144 ELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISK 965
            ++S+V GVVDN++RILE+K +FGSFDKY+W FVN+K I+TQYK   KIPVKTSKSE+ISK
Sbjct: 295  DISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKIPVKTSKSESISK 354

Query: 964  GMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830
             M+RRGFR VGPTV+HSFM+AAGLTNDHLI C RHL C  LA+ S
Sbjct: 355  DMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTLLAASS 399


>ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina]
            gi|557551187|gb|ESR61816.1| hypothetical protein
            CICLE_v10015639mg [Citrus clementina]
          Length = 375

 Score =  439 bits (1128), Expect = e-120
 Identities = 241/408 (59%), Positives = 288/408 (70%), Gaps = 2/408 (0%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MCSSKSK   +T          INGRPVLQP  N+ P LE+R+                +
Sbjct: 1    MCSSKSKLHSAT---------QINGRPVLQPTSNQVPSLEKRSSIKKTGSPKSPITTNNV 51

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667
            +S S                      SPRP A KRGNDPN LN+S EK+ +TPK   K  
Sbjct: 52   NSKSFTKSLLSPPVSPKLK-------SPRPAAVKRGNDPNVLNTSAEKI-MTPK---KLA 100

Query: 1666 APVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 1487
            + VKK KN+           + +P  +  SSLIVEAPGSIAAARRE VAIMQ QRK+RIA
Sbjct: 101  SFVKKPKNA-----------EVAPCYD--SSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147

Query: 1486 HYGRTKSAKYEGKIVPLDSSATAAV--KEERRCHFITLNSDPIYIAYHDEEWGVPVHEDK 1313
            HYGRTKSAK+EGK+  LDS A      +EE+RC FIT NSDP Y+AYHDEEWGVPVH+DK
Sbjct: 148  HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGVPVHDDK 207

Query: 1312 MLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICNDYGIELSQ 1133
            +LFELLVLT AQVGSDW +VLKKR+ FR+AF+ FDAE+V+K++EKK+ ++  +Y I+LSQ
Sbjct: 208  LLFELLVLTAAQVGSDWTSVLKKRRAFREAFSGFDAEVVAKFTEKKITSLSANYAIDLSQ 267

Query: 1132 VRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSETISKGMVR 953
            VRG+VDN+ RILE+K++FGSFDKYLW FVN+K I TQY+S  KIP KTSKSE ISK MV+
Sbjct: 268  VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKTINTQYRSSQKIPAKTSKSEAISKDMVK 327

Query: 952  RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPAL 809
            +GFR VGPTVIHSFM+AAGL+NDHLI C RHL+C ALASH PAVAPAL
Sbjct: 328  KGFRFVGPTVIHSFMQAAGLSNDHLITCTRHLQCTALASHQPAVAPAL 375


>ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max]
          Length = 400

 Score =  438 bits (1126), Expect = e-120
 Identities = 239/411 (58%), Positives = 291/411 (70%), Gaps = 12/411 (2%)
 Frame = -3

Query: 2026 MCSSKSK--------PQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXX 1871
            MC SK+K           +T      +V+ INGRPVLQP CNR P LERRN         
Sbjct: 1    MCGSKTKVTIGLEVIAAAATTTTAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPAK 60

Query: 1870 XXXLQVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLT 1691
                   LS PS                      SPR PATKRGND NGLNSS EK+V+ 
Sbjct: 61   S------LSPPSPPLPSKTSLTPPVSPKSK----SPRLPATKRGNDNNGLNSSYEKIVI- 109

Query: 1690 PKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQ 1511
            P+ S KT    +K   S   G+   AS++ S   + SSSLI ++PGSIAA RREQ+A+ Q
Sbjct: 110  PRSSIKTPTLERKKSKSFKEGSCVSASIEAS--LSYSSSLITDSPGSIAAVRREQMALQQ 167

Query: 1510 VQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYHDE 1343
             QRKM+IAHYGR+KSAK+E ++VPLD S T+      +EE+RC FIT NSDPIYIAYHDE
Sbjct: 168  AQRKMKIAHYGRSKSAKFE-RVVPLDPSNTSLASKPTEEEKRCSFITANSDPIYIAYHDE 226

Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163
            EWGVPVH+DKMLFELLVL+GAQVGSDW + LKKR DFR AF+EFDAE V+  ++K+M +I
Sbjct: 227  EWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSI 286

Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983
             ++YGI++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KP++TQYK   KIPVKTSK
Sbjct: 287  SSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPLSTQYKFGHKIPVKTSK 346

Query: 982  SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHS 830
            SE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHL+C  LA+ S
Sbjct: 347  SESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLAARS 397


>gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris]
          Length = 405

 Score =  437 bits (1124), Expect = e-120
 Identities = 239/411 (58%), Positives = 292/411 (71%), Gaps = 14/411 (3%)
 Frame = -3

Query: 2026 MCSSKSKP----QGSTAAADVVA-----VSHINGRPVLQPNCNRSPLLERRNXXXXXXXX 1874
            MCSSK+K     +G  AAA   +     V+ INGRPVLQP CNR P LERRN        
Sbjct: 1    MCSSKAKVTVGIEGVVAAATTTSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPP 60

Query: 1873 XXXXL-QVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVV 1697
                    PLSS +                      SPR PA KRGND NGLN+S EK+ 
Sbjct: 61   KSLSPPSPPLSSKTSLTPPVSPKSK-----------SPRLPAVKRGNDNNGLNTSYEKIA 109

Query: 1696 LTPKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAI 1517
            + PK S+K     +K   S   G+   AS + S   + +SSLI ++PGSIAA RREQ+A+
Sbjct: 110  I-PKSSSKAPTLERKKSKSFKEGSCAPASTEAS--FSYASSLITDSPGSIAAVRREQMAL 166

Query: 1516 MQVQRKMRIAHYGRTKSAKYEGKIVPLDSSATAAV----KEERRCHFITLNSDPIYIAYH 1349
             Q QRKM+IAHYGR+KSAK+E ++VPLD S T       +EE+RC FIT NSDPIYIAYH
Sbjct: 167  QQAQRKMKIAHYGRSKSAKFE-RVVPLDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYH 225

Query: 1348 DEEWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMN 1169
            DEEWGVPVH+DKMLFELLVL+GAQVGSDW + LKKRQDFR AF++FDAE V+  ++K+M 
Sbjct: 226  DEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVANLTDKQMM 285

Query: 1168 TICNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKT 989
            +I ++YGI++S+VRGVVDNA++ILEIK++FGSFDKY+W FVN+KPI+TQYK   KIPVKT
Sbjct: 286  SISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKT 345

Query: 988  SKSETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836
            SKSE+ISK MVRRG+R VGPTV+HSFM+AAGLTNDHLI C RHL+C  LA+
Sbjct: 346  SKSESISKDMVRRGYRFVGPTVVHSFMQAAGLTNDHLITCHRHLQCTLLAA 396


>ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca
            subsp. vesca]
          Length = 410

 Score =  435 bits (1119), Expect = e-119
 Identities = 234/410 (57%), Positives = 289/410 (70%), Gaps = 12/410 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVA-VSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850
            MCSSK+K    T   ++   VS INGRPVLQP CNR P L+RRN            L + 
Sbjct: 1    MCSSKAKV---TMGIEITPLVSRINGRPVLQPTCNRVPSLDRRNSLKKLSTPPPPPLPLS 57

Query: 1849 LSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKR-GNDPNGLNSSVEKVVLTPKCSNK 1673
             +S +                     KSPRPPA KR GNDPNGLNSS EKVV     +  
Sbjct: 58   NASSTSTSPRISTKASLTTPPVSPKSKSPRPPAIKRSGNDPNGLNSSSEKVVTPGGTTRA 117

Query: 1672 TVAPVKKSK---------NSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVA 1520
             V   KKSK         N+   G ++ AS++ S   + SSSLI EAPG+IAA RREQ+A
Sbjct: 118  KVLERKKSKSFKLGVGADNAHDHGRLSSASIEAS--LSYSSSLITEAPGTIAAGRREQMA 175

Query: 1519 IMQVQRKMRIAHYGRTKSAKYEGKIVPLDS-SATAAVKEERRCHFITLNSDPIYIAYHDE 1343
            +   QRKMRIAHYGR+ SA +E ++ P+D+  A    ++ +RC FIT NSDPIY+AYHD+
Sbjct: 176  LQHAQRKMRIAHYGRSNSANFE-RVAPIDTMEAKGGEEDHKRCSFITANSDPIYVAYHDQ 234

Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163
            EWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAE V+  ++K+M +I
Sbjct: 235  EWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEAVANLTDKQMISI 294

Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983
            C++YGI++S+VRGVVDN++RILE+KREFGSF KY+W FVN+KPI+ QYK   KIPVKTSK
Sbjct: 295  CSEYGIDISRVRGVVDNSNRILEVKREFGSFHKYIWGFVNHKPISPQYKQGYKIPVKTSK 354

Query: 982  SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASH 833
            SE+ISK MVRRGFR VGPTV+HSFM+A+GLTNDHL  C RHL+C  LA+H
Sbjct: 355  SESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTTCHRHLQCTLLAAH 404


>ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis]
            gi|223530365|gb|EEF32255.1| DNA-3-methyladenine
            glycosylase, putative [Ricinus communis]
          Length = 403

 Score =  435 bits (1119), Expect = e-119
 Identities = 241/409 (58%), Positives = 287/409 (70%), Gaps = 12/409 (2%)
 Frame = -3

Query: 2026 MCSSKSK--PQGSTAAAD----VVAVSHINGRPVLQPNCNRSPLLERRN--XXXXXXXXX 1871
            MCSSKSK    G+ AAA+       ++ INGRPVLQP  ++ P LERRN           
Sbjct: 1    MCSSKSKLHHHGAAAAANHHIPASTIAKINGRPVLQPKSDQVPTLERRNSLKKNSPKSPI 60

Query: 1870 XXXLQVPLSSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLT 1691
                  PL                         KSPRPPA KRGND N LNSS EK +  
Sbjct: 61   IQPPAAPLPLLPTTTTIKPKQPSSLSPPISPKLKSPRPPALKRGNDLNTLNSSAEKFLTP 120

Query: 1690 PKCSNKTVAPVKKSKNSCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQ 1511
             K  + T+   KKS  +  V        +   + N SSSLIVEAPGSIAAARRE VA MQ
Sbjct: 121  RKAVSTTLKKSKKSSPATPV------VAETCTVLNYSSSLIVEAPGSIAAARREHVATMQ 174

Query: 1510 VQRKMRIAHYGRTKS---AKYEGKIVPLDS-SATAAVKEERRCHFITLNSDPIYIAYHDE 1343
             QRK+R AHYGR  S   +K + KIVP+DS +ATA  +EERRC FIT +SDPIY+AYHD+
Sbjct: 175  EQRKLRTAHYGRVNSGSKSKRDAKIVPVDSPAATAVPQEERRCSFITPSSDPIYVAYHDQ 234

Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163
            EWGVPVH+DKMLFELLVLTGAQ+GSDW +VLKKR+ FR+AF+ FDAEIV+K+SEKK  +I
Sbjct: 235  EWGVPVHDDKMLFELLVLTGAQIGSDWTSVLKKREAFREAFSGFDAEIVAKFSEKKTTSI 294

Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983
              +YG+E+SQVRGVVDN++RIL++K+EFGSFDKYLW FVN+KPI TQY+S  KIPVKTSK
Sbjct: 295  SAEYGMEISQVRGVVDNSNRILQVKKEFGSFDKYLWGFVNHKPITTQYRSSNKIPVKTSK 354

Query: 982  SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836
            SETISK MV+RGFR VGPTV+HSFM+AAGL+NDHLI+C RH +C ALAS
Sbjct: 355  SETISKDMVKRGFRYVGPTVMHSFMQAAGLSNDHLISCSRHHQCLALAS 403


>ref|XP_002315089.2| methyladenine glycosylase family protein [Populus trichocarpa]
            gi|550330066|gb|EEF01260.2| methyladenine glycosylase
            family protein [Populus trichocarpa]
          Length = 411

 Score =  434 bits (1117), Expect = e-119
 Identities = 236/417 (56%), Positives = 285/417 (68%), Gaps = 12/417 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVVAVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVPL 1847
            MCSS +K   +T      AV+ INGRPVLQP CNR P LER N               PL
Sbjct: 1    MCSSNAKV--TTGVEITPAVARINGRPVLQPTCNRVPTLERHNSLKKTAPKSPPPPPPPL 58

Query: 1846 SSPSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSNKTV 1667
              P+                      SPR PA KRG+D N LNSS +KVV+    +   +
Sbjct: 59   PPPTSANKTNKASPPLSPKSK-----SPRLPAIKRGSDANSLNSSSDKVVIPRSTAKTPI 113

Query: 1666 APVKKSKN--SCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMR 1493
               KKSK+     VG+  ++S   + L + SSSLIVEAPGSIAA RREQ+A+   QRKMR
Sbjct: 114  LERKKSKSFKETSVGSGALSSSIEASL-SYSSSLIVEAPGSIAAVRREQMALQHAQRKMR 172

Query: 1492 IAHYGRTKSAKYEGKIVPLDSSATAAVK---EERRCHFITLNS-------DPIYIAYHDE 1343
            IAHYGR+KS+++E K+VP+DSS     K   EE+RC FIT NS       +PIY+AYHD+
Sbjct: 173  IAHYGRSKSSRFEAKVVPVDSSINVTTKTDEEEKRCSFITANSGKEKYEMNPIYVAYHDK 232

Query: 1342 EWGVPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTI 1163
            EWGVPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAEIV+  +EK+M +I
Sbjct: 233  EWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVANITEKQMMSI 292

Query: 1162 CNDYGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSK 983
              +YGIE+S+VRGVVDN+ RILEIK+EFGSFD+Y+W FVN KP + QYK   KIPVKTSK
Sbjct: 293  SAEYGIEISRVRGVVDNSKRILEIKKEFGSFDRYIWTFVNNKPFSNQYKFGHKIPVKTSK 352

Query: 982  SETISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALASHSPAVAPA 812
            SETISK MVRRGFR VGPT++HSFM+A GLTNDHLI C RHL C  +A+  P  A A
Sbjct: 353  SETISKDMVRRGFRFVGPTMVHSFMQAVGLTNDHLITCHRHLPCTLMAARRPTEAQA 409


>ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa]
            gi|118486806|gb|ABK95238.1| unknown [Populus trichocarpa]
            gi|222852040|gb|EEE89587.1| methyladenine glycosylase
            family protein [Populus trichocarpa]
          Length = 403

 Score =  432 bits (1112), Expect = e-118
 Identities = 240/406 (59%), Positives = 284/406 (69%), Gaps = 9/406 (2%)
 Frame = -3

Query: 2026 MCSSKSKPQGSTAAADVV-AVSHINGRPVLQPNCNRSPLLERRNXXXXXXXXXXXXLQVP 1850
            MCS K+K    T   D+  AV+ INGRPVLQP CN    LERRN               P
Sbjct: 1    MCSFKAKV---TTGVDITPAVARINGRPVLQPTCNLVSTLERRN---------SLKKTAP 48

Query: 1849 LSS--PSXXXXXXXXXXXXXXXXXXXXXKSPRPPATKRGNDPNGLNSSVEKVVLTPKCSN 1676
             SS  P                      KSPR PA KRG+D N LNSS EKVV+    + 
Sbjct: 49   KSSPPPPPPPPTFSNKTNKASPPLSPMSKSPRLPAIKRGSDANSLNSSSEKVVIPRNTTK 108

Query: 1675 KTVAPVKKSKN--SCGVGNINVASLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQR 1502
                  KKSK+     VG    +S   + L + SSSLIVEAPGSIAA RREQ+A+   QR
Sbjct: 109  TPTLERKKSKSFKESSVGRGVHSSFIEASL-SYSSSLIVEAPGSIAAVRREQMALQHAQR 167

Query: 1501 KMRIAHYGRTKSAKYEGKIVPLDSSATAAVK----EERRCHFITLNSDPIYIAYHDEEWG 1334
            KMRIAHYGR+KSA++E ++VP DSS + A K    EE+RC FIT NSDPIY+AYHDEEWG
Sbjct: 168  KMRIAHYGRSKSARFEDQVVPNDSSISMATKTDQEEEKRCSFITANSDPIYVAYHDEEWG 227

Query: 1333 VPVHEDKMLFELLVLTGAQVGSDWATVLKKRQDFRDAFAEFDAEIVSKYSEKKMNTICND 1154
            VPVH+DKMLFELLVL+GAQVGSDW ++LKKRQDFRDAF+ FDAEIV+  SEK++ +I  +
Sbjct: 228  VPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVANISEKQIMSISAE 287

Query: 1153 YGIELSQVRGVVDNASRILEIKREFGSFDKYLWAFVNYKPIATQYKSCLKIPVKTSKSET 974
            YGI++S+VRGVVDN++RILEIK+EFGSFD+Y+W FVN KPI+T YK   KIPVKTSKSET
Sbjct: 288  YGIDMSRVRGVVDNSNRILEIKKEFGSFDRYIWTFVNNKPISTSYKFGHKIPVKTSKSET 347

Query: 973  ISKGMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLECAALAS 836
            ISK MVRRGFR VGPT++HSFM+AAGLTNDHLI C RHL C  +A+
Sbjct: 348  ISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHRHLPCTLMAA 393

