BLASTX nr result

ID: Paeonia25_contig00004803 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00004803
         (1840 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...   258   5e-66
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...   251   6e-64
ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   242   5e-61
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...   241   6e-61
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...   239   2e-60
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...   239   4e-60
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 238   7e-60
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...   238   9e-60
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...   238   9e-60
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   237   1e-59
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...   236   2e-59
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   236   2e-59
ref|XP_007032156.1| DNA glycosylase superfamily protein, putativ...   234   8e-59
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...   232   4e-58
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...   232   5e-58
ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partia...   230   2e-57
ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseo...   230   2e-57
emb|CBI29440.3| unnamed protein product [Vitis vinifera]              229   3e-57
ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244...   229   3e-57
gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]       222   5e-55

>ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina]
            gi|568883956|ref|XP_006494704.1| PREDICTED:
            transcriptional regulator ATRX homolog isoform X2 [Citrus
            sinensis] gi|557525860|gb|ESR37166.1| hypothetical
            protein CICLE_v10028470mg [Citrus clementina]
          Length = 439

 Score =  258 bits (660), Expect = 5e-66
 Identities = 135/249 (54%), Positives = 159/249 (63%), Gaps = 4/249 (1%)
 Frame = +1

Query: 865  FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 1032
            + +R +  N++ KN       + R VSPYF N   T                        
Sbjct: 213  YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257

Query: 1033 XHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 1212
                HN  +   +  +      +K K+     LTA+QK  EAYERK  DNTW PP SP  
Sbjct: 258  ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308

Query: 1213 LLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 1392
            LLQ +H HDPWRV+VIC+LLNRTTGLQA RVI DLFTLCP+AK ATEV  EEIE++I +L
Sbjct: 309  LLQHEHVHDPWRVIVICMLLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTL 368

Query: 1393 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 1572
            GLQKKRA  I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P DHMLNYY
Sbjct: 369  GLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYY 428

Query: 1573 WEFLISFYG 1599
            WEFL+S  G
Sbjct: 429  WEFLVSTKG 437


>ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Citrus
            sinensis]
          Length = 446

 Score =  251 bits (642), Expect = 6e-64
 Identities = 135/256 (52%), Positives = 159/256 (62%), Gaps = 11/256 (4%)
 Frame = +1

Query: 865  FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 1032
            + +R +  N++ KN       + R VSPYF N   T                        
Sbjct: 213  YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257

Query: 1033 XHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 1212
                HN  +   +  +      +K K+     LTA+QK  EAYERK  DNTW PP SP  
Sbjct: 258  ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308

Query: 1213 LLQEKHAHDPWRVLVICLLLNRTTGLQ-------ARRVIWDLFTLCPNAKIATEVGTEEI 1371
            LLQ +H HDPWRV+VIC+LLNRTTGLQ       A RVI DLFTLCP+AK ATEV  EEI
Sbjct: 309  LLQHEHVHDPWRVIVICMLLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEI 368

Query: 1372 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 1551
            E++I +LGLQKKRA  I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P 
Sbjct: 369  EKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPT 428

Query: 1552 DHMLNYYWEFLISFYG 1599
            DHMLNYYWEFL+S  G
Sbjct: 429  DHMLNYYWEFLVSTKG 444


>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  242 bits (617), Expect = 5e-61
 Identities = 116/162 (71%), Positives = 131/162 (80%), Gaps = 1/162 (0%)
 Frame = +1

Query: 1102 KSKKVNKHV-LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICLLLNR 1278
            K +   K + L+A++K  EAY RKT DNTWKPP S F LLQE HA DPWRVLVIC+LLN 
Sbjct: 440  KKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNC 499

Query: 1279 TTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESW 1458
            TTG Q R VI D FTLCP+AK ATE  TEEIE++I  LGLQKKRAV IQRLS+EYL + W
Sbjct: 500  TTGKQVRGVISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDW 559

Query: 1459 THVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            THVTQLHG+GKYAADAYAIFCTGKW+ V P+DHMLNYYW+FL
Sbjct: 560  THVTQLHGVGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601


>ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial [Solanum
            tuberosum]
          Length = 222

 Score =  241 bits (616), Expect = 6e-61
 Identities = 129/226 (57%), Positives = 148/226 (65%), Gaps = 3/226 (1%)
 Frame = +1

Query: 916  KVRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXXHYFHNVPKVNGDDFLGG 1086
            KVRVVSPYF N TV  E   GKD                     YF N  + N       
Sbjct: 4    KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSP-----YFQNAYRENKKSR--- 55

Query: 1087 EIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICL 1266
                 K  K  K  L+A QK  EAY R++ DNTW PP S F LLQE HAHDPWRVLVIC+
Sbjct: 56   -----KGSKRQKPCLSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWRVLVICM 110

Query: 1267 LLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYL 1446
            LLN TTG+Q +RV+ + FTLCPNA  ATEV  E+IE++++ LGL  KR++AI RLS+EYL
Sbjct: 111  LLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPRLSQEYL 170

Query: 1447 EESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
             E+WTHVTQLHGIGKYAADAYAIFCTGKW+ V P DHML  YWEFL
Sbjct: 171  GETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFL 216


>ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum]
            gi|557108926|gb|ESQ49233.1| hypothetical protein
            EUTSA_v10020704mg [Eutrema salsugineum]
          Length = 456

 Score =  239 bits (611), Expect = 2e-60
 Identities = 133/251 (52%), Positives = 158/251 (62%), Gaps = 8/251 (3%)
 Frame = +1

Query: 856  DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVT---NEGKDMXXXXXXXXXXXXXXXX 1026
            + V S+ GR+   +      KV  VSPYF  STV+   N  +D+                
Sbjct: 212  ESVASQSGRKYRKESSKLQAKVPRVSPYFQGSTVSEQPNPSRDLRQYFKVVKVS------ 265

Query: 1027 XXXHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWK 1191
                YFH++P    D     E    +S+++ K       L+  QK  EAY RK  DNTW 
Sbjct: 266  ---RYFHDMP---ADGTQVNEPQKERSRRMRKTPVVSPSLSQCQKTDEAYLRKMPDNTWV 319

Query: 1192 PPGSPFYLLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEI 1371
            PP SP  LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LCP+AK ATEV  +EI
Sbjct: 320  PPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKEI 379

Query: 1372 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 1551
            E +I+ LGLQKKRA  IQR S EYL+ESWTHVTQL+G+GKYAADAYAIFC GKW+ V P 
Sbjct: 380  ESLIKPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRPA 439

Query: 1552 DHMLNYYWEFL 1584
            DHMLNYYWEFL
Sbjct: 440  DHMLNYYWEFL 450


>gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis]
          Length = 418

 Score =  239 bits (609), Expect = 4e-60
 Identities = 115/165 (69%), Positives = 132/165 (80%), Gaps = 1/165 (0%)
 Frame = +1

Query: 1099 IKSKKVNKH-VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICLLLN 1275
            ++ KK+ K  VL A++K  EAY+RKT DN W PP S   L+Q+ H HDPWRVLVIC+LLN
Sbjct: 250  VRRKKIEKSKVLNAAEKRDEAYKRKTDDNKWNPPPSEIRLIQQDHLHDPWRVLVICMLLN 309

Query: 1276 RTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEES 1455
            RTTG QA RVI D F+LCPNAK ATEV  EEI ++I +LGL  KRA  IQR SREYLEES
Sbjct: 310  RTTGAQATRVISDFFSLCPNAKAATEVSPEEIVKIIHTLGLH-KRAQMIQRFSREYLEES 368

Query: 1456 WTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 1590
            WTHVTQLHG+GKYAADAYAIFCTGKW+ V+P DHMLNYYW+FL S
Sbjct: 369  WTHVTQLHGVGKYAADAYAIFCTGKWDRVKPADHMLNYYWKFLHS 413


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  238 bits (607), Expect = 7e-60
 Identities = 167/418 (39%), Positives = 211/418 (50%), Gaps = 22/418 (5%)
 Frame = +1

Query: 397  KDEQNEVNEVVLLPDNFNDLDEKKSTNLSPYFNKVNEQVGEEVVLVESNTKNNLFLQFIY 576
            +D+ + V      PD+  D  E    N S    K +++   ++ LV+  + N      + 
Sbjct: 19   RDDDSSVMMTRRRPDS--DFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTN-----LVL 71

Query: 577  NASGDGNRSTSDSNYGKNNNELEG-FSHCFHKAARXXXXXXXXXNKKSLEFGYIN-KNII 750
                DG     D     N+N L+  FS   +K  R          +K  +FG I   N++
Sbjct: 72   QCHDDGCSLEKD-----NSNSLDDLFSGFVYKGVR---------RRKRDDFGSITTSNLV 117

Query: 751  SSVILENEGEENLVSSPSL----CGDLLDGKVPR-----NGVVVDQV------FSKRGRE 885
            S  I +++ +   VS   +    C  +   KVPR         + Q        S+ GR 
Sbjct: 118  SPQIADDDDDS--VSDSHIERQECSKV-QAKVPRVSPYFQASTISQCDSDIVSSSQSGRN 174

Query: 886  NNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXXHYFHNVPKVN 1065
                      K R VSPYF  STV+ +                        YFH      
Sbjct: 175  YRKGSSKRQVKARRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH------ 223

Query: 1066 GDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKH 1230
             D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  LLQE H
Sbjct: 224  ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDH 283

Query: 1231 AHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKR 1410
             HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ LGLQKKR
Sbjct: 284  WHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKR 343

Query: 1411 AVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
               IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYYW++L
Sbjct: 344  TKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 401


>ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella]
            gi|482566361|gb|EOA30550.1| hypothetical protein
            CARUB_v10013672mg [Capsella rubella]
          Length = 456

 Score =  238 bits (606), Expect = 9e-60
 Identities = 135/248 (54%), Positives = 151/248 (60%), Gaps = 5/248 (2%)
 Frame = +1

Query: 856  DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXX 1035
            D V S+ G            KVR VS YF  S       D                    
Sbjct: 212  DIVSSQSGGSYRRDSSKHQAKVRRVSRYFQASA------DSEQPNPPRDLRKYFKVVKVS 265

Query: 1036 HYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPG 1200
             YFH+V   + D     +    KS++V K       L+ SQK  EAY RKT DNTW PP 
Sbjct: 266  RYFHDV---SADGIQVADSQKEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPPR 322

Query: 1201 SPFYLLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERV 1380
            SP  LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLFTLCP+AK ATEV  +EIE +
Sbjct: 323  SPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIESL 382

Query: 1381 IQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHM 1560
            I+ LGLQKKRA  IQR S EYL ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P DHM
Sbjct: 383  IKPLGLQKKRAKMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDHM 442

Query: 1561 LNYYWEFL 1584
            LNYYWEFL
Sbjct: 443  LNYYWEFL 450


>ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum
            lycopersicum]
          Length = 544

 Score =  238 bits (606), Expect = 9e-60
 Identities = 138/291 (47%), Positives = 170/291 (58%), Gaps = 9/291 (3%)
 Frame = +1

Query: 739  KNIISSVILENEGEENLVSSPS--LCGDLLDGKVPRNGVVVDQVFSKRGRENNIQYKNAP 912
            K +    + +N+  E ++   +  +C   L+    RNG    +   K+GR      K   
Sbjct: 266  KTVFEPCLSQNQINEKMIEQKARAVCPYFLNS---RNG----ETEMKKGRSVECVKKRND 318

Query: 913  KK----VRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXXHYFHNVPKVNGD 1071
            KK    VRVVSPYF N  V  E   GKD                     YF N  +    
Sbjct: 319  KKLRTKVRVVSPYFANLKVGEEIKVGKDSSNASKNCLNGRKVSP-----YFQNAYREKKK 373

Query: 1072 DFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRV 1251
              +G         K  K  L+ASQK  EAY R++ DN W PP S F LLQE HAHDPWRV
Sbjct: 374  STIGS--------KRQKPCLSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRV 425

Query: 1252 LVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRL 1431
            LVIC+LLN TTG+Q RRV+ + FTLCPNA  ATEV  E+IE++++ LGL  KR+++I RL
Sbjct: 426  LVICMLLNCTTGVQVRRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRL 485

Query: 1432 SREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            S+EYL ++WTHVTQLHGIGKYAADAYAIFCTG W+ V P DHML  YWEFL
Sbjct: 486  SQEYLGKNWTHVTQLHGIGKYAADAYAIFCTGNWDQVHPNDHMLTKYWEFL 536


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  237 bits (605), Expect = 1e-59
 Identities = 130/228 (57%), Positives = 145/228 (63%), Gaps = 5/228 (2%)
 Frame = +1

Query: 916  KVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXXHYFHNVPKVNGDDFLGGEIG 1095
            KVR  SPYF  STV+ +                        YFH       D     E  
Sbjct: 212  KVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVS----RYFH------ADGIQVNESQ 261

Query: 1096 CIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVI 1260
              KS +V K       L+ SQK  EAY+RKT D TW PP SP  LLQE H HDPWRVLVI
Sbjct: 262  KEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVI 321

Query: 1261 CLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSRE 1440
            C+LLN+T+G Q R VI DLF LCP+AK ATEV   EIE +I+ LGLQKKRA  IQR S E
Sbjct: 322  CMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLE 381

Query: 1441 YLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            YL+ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P+DHMLNYYWEFL
Sbjct: 382  YLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429


>ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
            gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
            thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase
            superfamily protein [Arabidopsis thaliana]
          Length = 445

 Score =  236 bits (603), Expect = 2e-59
 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%)
 Frame = +1

Query: 868  SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXXHYFH 1047
            S+ GR           KVR VSPYF  STV+ +                        YFH
Sbjct: 207  SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 261

Query: 1048 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 1212
                   D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  
Sbjct: 262  ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 315

Query: 1213 LLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 1392
            LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ L
Sbjct: 316  LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 375

Query: 1393 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 1572
            GLQKKR   IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY
Sbjct: 376  GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 435

Query: 1573 WEFL 1584
            W++L
Sbjct: 436  WDYL 439


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  236 bits (603), Expect = 2e-59
 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%)
 Frame = +1

Query: 868  SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXXHYFH 1047
            S+ GR           KVR VSPYF  STV+ +                        YFH
Sbjct: 181  SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 235

Query: 1048 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 1212
                   D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  
Sbjct: 236  ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 289

Query: 1213 LLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 1392
            LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ L
Sbjct: 290  LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 349

Query: 1393 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 1572
            GLQKKR   IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY
Sbjct: 350  GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 409

Query: 1573 WEFL 1584
            W++L
Sbjct: 410  WDYL 413


>ref|XP_007032156.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|590648404|ref|XP_007032157.1| DNA glycosylase
            superfamily protein, putative isoform 1 [Theobroma cacao]
            gi|508711185|gb|EOY03082.1| DNA glycosylase superfamily
            protein, putative isoform 1 [Theobroma cacao]
            gi|508711186|gb|EOY03083.1| DNA glycosylase superfamily
            protein, putative isoform 1 [Theobroma cacao]
          Length = 382

 Score =  234 bits (598), Expect = 8e-59
 Identities = 116/174 (66%), Positives = 136/174 (78%)
 Frame = +1

Query: 1063 NGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDP 1242
            N D+ LGG    +K   V K VL+ASQK  EAY+RKT +NTW PP S   LLQE H HDP
Sbjct: 203  NKDNILGGMKKAMKPAGV-KPVLSASQKRDEAYQRKTPNNTWIPPRSNAPLLQEDHTHDP 261

Query: 1243 WRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAI 1422
            WRVL+IC+LLN+T+G QAR V+ DLFTLCP+AK ATEV T EIE+ I+ LGLQ+KRA  I
Sbjct: 262  WRVLLICMLLNKTSGNQARNVLSDLFTLCPDAKTATEVATGEIEKAIKPLGLQRKRAEMI 321

Query: 1423 QRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            QR+S+EYL + WTHVT+LHG+GKYAADAYAIFCTGK + V P DHMLNYYW FL
Sbjct: 322  QRMSQEYLWKEWTHVTELHGVGKYAADAYAIFCTGKGDRVTPSDHMLNYYWNFL 375


>ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa]
            gi|550326306|gb|EEE95947.2| hypothetical protein
            POPTR_0012s03470g [Populus trichocarpa]
          Length = 229

 Score =  232 bits (592), Expect = 4e-58
 Identities = 119/177 (67%), Positives = 131/177 (74%), Gaps = 10/177 (5%)
 Frame = +1

Query: 1090 IGCIKSKKVNK-------HVLTAS---QKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHD 1239
            IG  K KK  K       H  T S    K  EAYERKT++NTWKPP S F  L   HAHD
Sbjct: 48   IGRSKKKKKKKEGTKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQSEFGFLHN-HAHD 106

Query: 1240 PWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVA 1419
            PWRVLVIC+LLNRT G +A RV+ DLFTLCP+AK AT V TEEIER I+SLGLQK+RA  
Sbjct: 107  PWRVLVICMLLNRTAGTRAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKM 166

Query: 1420 IQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 1590
            +QRLS +YLEE WTHVTQL G+GKYAADAYAIFCTGKWE V P DHMLN YWE+L S
Sbjct: 167  VQRLSEDYLEEDWTHVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYLCS 223


>ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max]
          Length = 1424

 Score =  232 bits (591), Expect = 5e-58
 Identities = 132/272 (48%), Positives = 156/272 (57%), Gaps = 16/272 (5%)
 Frame = +1

Query: 823  DGKVPRNGV--VVDQVFSKRGRENN---IQYKNAPKK-----------VRVVSPYFLNST 954
            DG V  NG+  V  +V S + +EN       K  PKK           +R VSPYF N  
Sbjct: 1158 DGNVTENGMINVKRKVISNKLQENGNNATTSKVKPKKKKPLVQKNGHGIRYVSPYFCN-- 1215

Query: 955  VTNEGKDMXXXXXXXXXXXXXXXXXXXHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLT 1134
              N GK +                      H       D     +  C       K    
Sbjct: 1216 --NSGKKVNVKPFDKGSTSESIA------LHTCKNFVEDKLEENKSNCSNKSIEIKRFPP 1267

Query: 1135 ASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWD 1314
            AS+K  EAY+RKT DNTWKPP S   L+QE H HDPWRVLVIC+LLNRT G Q ++V+ +
Sbjct: 1268 ASEKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPWRVLVICMLLNRTAGGQTKKVVSN 1327

Query: 1315 LFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKY 1494
             F LCP+AK  T+V  EEIE+ I++LG Q KRA  +QRLS EYL+ESWTHVTQLHG+GKY
Sbjct: 1328 FFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYLDESWTHVTQLHGVGKY 1387

Query: 1495 AADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 1590
            AADAYAIF TG W+ V P DHMLNYYWEFL S
Sbjct: 1388 AADAYAIFVTGMWDRVTPTDHMLNYYWEFLHS 1419


>ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris]
            gi|561039879|gb|ESW35973.1| hypothetical protein
            PHAVU_L0004001g, partial [Phaseolus vulgaris]
          Length = 715

 Score =  230 bits (586), Expect = 2e-57
 Identities = 122/233 (52%), Positives = 147/233 (63%), Gaps = 1/233 (0%)
 Frame = +1

Query: 901  KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXX-HYFHNVPKVNGDDF 1077
            KN    +R VSPYF N +    GK++                    +Y  + P+ N    
Sbjct: 492  KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 543

Query: 1078 LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLV 1257
               +  C +     K  L+ASQK  EAY+RKT D TWKPP S   L+QE HAHDPWRVLV
Sbjct: 544  ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 600

Query: 1258 ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 1437
            IC+LLNRT+G Q + ++ D F LCP+AK  TEV  EEIE  I++LG Q KRA  ++RLS 
Sbjct: 601  ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 660

Query: 1438 EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 1596
            EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL   Y
Sbjct: 661  EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 713


>ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris]
            gi|561039878|gb|ESW35972.1| hypothetical protein
            PHAVU_L0004001g [Phaseolus vulgaris]
          Length = 726

 Score =  230 bits (586), Expect = 2e-57
 Identities = 122/233 (52%), Positives = 147/233 (63%), Gaps = 1/233 (0%)
 Frame = +1

Query: 901  KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXX-HYFHNVPKVNGDDF 1077
            KN    +R VSPYF N +    GK++                    +Y  + P+ N    
Sbjct: 503  KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 554

Query: 1078 LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLV 1257
               +  C +     K  L+ASQK  EAY+RKT D TWKPP S   L+QE HAHDPWRVLV
Sbjct: 555  ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 611

Query: 1258 ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 1437
            IC+LLNRT+G Q + ++ D F LCP+AK  TEV  EEIE  I++LG Q KRA  ++RLS 
Sbjct: 612  ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 671

Query: 1438 EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 1596
            EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL   Y
Sbjct: 672  EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 724


>emb|CBI29440.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score =  229 bits (584), Expect = 3e-57
 Identities = 108/147 (73%), Positives = 124/147 (84%)
 Frame = +1

Query: 1144 KLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFT 1323
            KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT
Sbjct: 441  KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 500

Query: 1324 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 1503
            LCP+AK AT+V TE IE+VI++LGLQKKRA  IQR SREYL++SWTHVTQLHGIGKYAAD
Sbjct: 501  LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 560

Query: 1504 AYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            AYAIFC+G W LV P DHML  YW++L
Sbjct: 561  AYAIFCSGDWGLVVPNDHMLVKYWKYL 587


>ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera]
          Length = 536

 Score =  229 bits (584), Expect = 3e-57
 Identities = 108/147 (73%), Positives = 124/147 (84%)
 Frame = +1

Query: 1144 KLHEAYERKTSDNTWKPPGSPFYLLQEKHAHDPWRVLVICLLLNRTTGLQARRVIWDLFT 1323
            KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT
Sbjct: 378  KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 437

Query: 1324 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 1503
            LCP+AK AT+V TE IE+VI++LGLQKKRA  IQR SREYL++SWTHVTQLHGIGKYAAD
Sbjct: 438  LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 497

Query: 1504 AYAIFCTGKWELVEPEDHMLNYYWEFL 1584
            AYAIFC+G W LV P DHML  YW++L
Sbjct: 498  AYAIFCSGDWGLVVPNDHMLVKYWKYL 524


>gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]
          Length = 369

 Score =  222 bits (565), Expect = 5e-55
 Identities = 123/243 (50%), Positives = 148/243 (60%), Gaps = 2/243 (0%)
 Frame = +1

Query: 874  RGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXXHYFHNV 1053
            R ++N +      KKV V+ PYF         +DM                    YF + 
Sbjct: 146  RKKKNIVTEDGCDKKVVVLDPYF--------AEDMSRKKVSP-------------YFQSP 184

Query: 1054 PKVNGDDFLGGEI--GCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEK 1227
             K +G D    E+     +  K  K VL++ QK  EAYER+T DN W PP SPF LLQE 
Sbjct: 185  RKTSGSDRGISEVVEESPERSKRWKPVLSSVQKRDEAYERRTPDNEWTPPRSPFNLLQED 244

Query: 1228 HAHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKK 1407
            H  DPWRVLVIC+LLN+TTG QA RV+  LF LCP AK ATEV  ++IE  I+ LGLQ+K
Sbjct: 245  HMFDPWRVLVICMLLNQTTGRQAFRVLSKLFELCPTAKAATEVARDDIEDAIRCLGLQRK 304

Query: 1408 RAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLI 1587
            RA  IQR S EY+ E WTHVT+L GIGKYAADAYAIFCTG+W+ V P DHML  YWE+L 
Sbjct: 305  RAEMIQRFSEEYMSEEWTHVTELPGIGKYAADAYAIFCTGRWQRVRPADHMLVKYWEWLN 364

Query: 1588 SFY 1596
             F+
Sbjct: 365  EFF 367

