BLASTX nr result

ID: Paeonia23_contig00001425 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00001425
         (1950 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...   258   9e-66
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...   251   1e-63
ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   248   9e-63
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...   240   1e-60
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...   239   3e-60
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...   238   6e-60
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 238   7e-60
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...   238   1e-59
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   237   1e-59
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...   236   2e-59
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...   236   2e-59
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   236   2e-59
ref|XP_007032156.1| DNA glycosylase superfamily protein, putativ...   235   6e-59
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...   231   7e-58
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...   231   9e-58
emb|CBI29440.3| unnamed protein product [Vitis vinifera]              229   3e-57
ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244...   229   3e-57
ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partia...   229   4e-57
ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseo...   229   4e-57
gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]       222   5e-55

>ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina]
           gi|568883956|ref|XP_006494704.1| PREDICTED:
           transcriptional regulator ATRX homolog isoform X2
           [Citrus sinensis] gi|557525860|gb|ESR37166.1|
           hypothetical protein CICLE_v10028470mg [Citrus
           clementina]
          Length = 439

 Score =  258 bits (658), Expect = 9e-66
 Identities = 135/249 (54%), Positives = 159/249 (63%), Gaps = 4/249 (1%)
 Frame = -1

Query: 978 FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 811
           + +R +  N++ KN       + R VSPYF N   T                        
Sbjct: 213 YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257

Query: 810 SHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 631
               HN  +   +  +      +K K+     LTA+QK  EAYERK  DNTW PP SP  
Sbjct: 258 ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308

Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451
           LLQ +H HDPWRV+VIC+LLNRTTGLQA RVI DLFTLCP+AK ATEV  EEIE++I +L
Sbjct: 309 LLQHEHVHDPWRVIVICMLLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTL 368

Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271
           GLQKKRA  I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P DHMLNYY
Sbjct: 369 GLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYY 428

Query: 270 WEFLISFYG 244
           WEFL+S  G
Sbjct: 429 WEFLVSTKG 437


>ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1
           [Citrus sinensis]
          Length = 446

 Score =  251 bits (640), Expect = 1e-63
 Identities = 135/256 (52%), Positives = 159/256 (62%), Gaps = 11/256 (4%)
 Frame = -1

Query: 978 FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 811
           + +R +  N++ KN       + R VSPYF N   T                        
Sbjct: 213 YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257

Query: 810 SHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 631
               HN  +   +  +      +K K+     LTA+QK  EAYERK  DNTW PP SP  
Sbjct: 258 ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308

Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQ-------ARRVIWDLFTLCPNAKIATEVGTEEI 472
           LLQ +H HDPWRV+VIC+LLNRTTGLQ       A RVI DLFTLCP+AK ATEV  EEI
Sbjct: 309 LLQHEHVHDPWRVIVICMLLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEI 368

Query: 471 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 292
           E++I +LGLQKKRA  I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P 
Sbjct: 369 EKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPT 428

Query: 291 DHMLNYYWEFLISFYG 244
           DHMLNYYWEFL+S  G
Sbjct: 429 DHMLNYYWEFLVSTKG 444


>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  248 bits (632), Expect = 9e-63
 Identities = 200/597 (33%), Positives = 273/597 (45%), Gaps = 33/597 (5%)
 Frame = -1

Query: 1950 EASKREKRRVISPYFXXXXXXXXXXXXXXXRAFSVDGGYDGDGKMWKREGGTMQSRKKER 1771
            +  K++K+RV+SPYF                   VD     D    + +    + RKK++
Sbjct: 50   KTKKKKKKRVVSPYFERVESTIS----------KVDNNLSFDSHDHESKQKKKKKRKKKK 99

Query: 1770 NQSHMDMSLMVSQVIEDESNAMVFKKKETEPSSSINLQINSTIAVEDDDG-RLSKEVYQG 1594
                     +VS   E  +  M+ K +  +     NL  +S   VE+    R+S  + Q 
Sbjct: 100  G--------VVSPYFE-RAECMISKDEPVDN----NLTFDSYDPVEEKKNKRVSPFLAQA 146

Query: 1593 NGMEKRTWDDTYCCNKSIKMNSLHQKAANKAEEHSFSVQAEINSSVLVDKDEQNEVNEVV 1414
                 +  D+    N ++  ++  +K   K+   + +++ E   + +V + +  E     
Sbjct: 147  ESRISK--DENVDNNLTLHGHAREKKKKKKSGTFTLNLEEEQGGANVVSRGDGKE----- 199

Query: 1413 LLPDNFNDLDEKKSTNLSPYFNKVNEQVGEEVVLVESNTKNNLFLQFIYNASGDGNRSTS 1234
                  N    KK+   + Y NK  + V  +  + +      +      N + DGN +T 
Sbjct: 200  ----KANKRKRKKNDG-AIYPNKTRDTVSSDAQMRDIVKLTEI------NVASDGNMATD 248

Query: 1233 DSNYGKNN---------NELEGFSHCFHKAARXXXXXXXXENKKSL---EFGYINKNIIS 1090
            D      N         N    F     K A          +KK L    +  + KNI  
Sbjct: 249  DCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYPMVVKNIEK 308

Query: 1089 SVILEN----EGEENLVSSPSLCGDLLDGKVPRNGVVVDQVFS---KRGRENNIQYKNAP 931
                EN    E E  L  + +    L       +G  + +V +    R  EN        
Sbjct: 309  YEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIENEKPNSRVH 368

Query: 930  KKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEI 751
             +VR VSP F  S    E   M                  S YF  VPK   ++    + 
Sbjct: 369  IQVRKVSPNFNLSIGQQEC--MKIKPLKPCERVGLTVRNVSPYFQKVPKQEEEE--AADS 424

Query: 750  GCIKSKKVNKHV-------------LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHS 610
              I +K   K +             L+A++K  EAY RKT DNTWKPP S F LLQE H+
Sbjct: 425  NMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSDFGLLQEDHA 484

Query: 609  HDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRA 430
             DPWRVLVIC+LLN TTG Q R VI D FTLCP+AK ATE  TEEIE++I  LGLQKKRA
Sbjct: 485  SDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRA 544

Query: 429  VAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
            V IQRLS+EYL + WTHVTQLHG+GKYAADAYAIFCTGKW+ V P+DHMLNYYW+FL
Sbjct: 545  VMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601


>ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial
           [Solanum tuberosum]
          Length = 222

 Score =  240 bits (613), Expect = 1e-60
 Identities = 128/226 (56%), Positives = 148/226 (65%), Gaps = 3/226 (1%)
 Frame = -1

Query: 927 KVRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGG 757
           KVRVVSPYF N TV  E   GKD                     YF N  + N       
Sbjct: 4   KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSP-----YFQNAYRENKKSR--- 55

Query: 756 EIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICL 577
                K  K  K  L+A QK  EAY R++ DNTW PP S F LLQE H+HDPWRVLVIC+
Sbjct: 56  -----KGSKRQKPCLSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWRVLVICM 110

Query: 576 LLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYL 397
           LLN TTG+Q +RV+ + FTLCPNA  ATEV  E+IE++++ LGL  KR++AI RLS+EYL
Sbjct: 111 LLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPRLSQEYL 170

Query: 396 EESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
            E+WTHVTQLHGIGKYAADAYAIFCTGKW+ V P DHML  YWEFL
Sbjct: 171 GETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFL 216


>ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum]
           gi|557108926|gb|ESQ49233.1| hypothetical protein
           EUTSA_v10020704mg [Eutrema salsugineum]
          Length = 456

 Score =  239 bits (611), Expect = 3e-60
 Identities = 133/251 (52%), Positives = 158/251 (62%), Gaps = 8/251 (3%)
 Frame = -1

Query: 987 DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVT---NEGKDMXXXXXXXXXXXXXXXX 817
           + V S+ GR+   +      KV  VSPYF  STV+   N  +D+                
Sbjct: 212 ESVASQSGRKYRKESSKLQAKVPRVSPYFQGSTVSEQPNPSRDLRQYFKVVKVS------ 265

Query: 816 XXSHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWK 652
               YFH++P    D     E    +S+++ K       L+  QK  EAY RK  DNTW 
Sbjct: 266 ---RYFHDMP---ADGTQVNEPQKERSRRMRKTPVVSPSLSQCQKTDEAYLRKMPDNTWV 319

Query: 651 PPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEI 472
           PP SP  LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LCP+AK ATEV  +EI
Sbjct: 320 PPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKEI 379

Query: 471 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 292
           E +I+ LGLQKKRA  IQR S EYL+ESWTHVTQL+G+GKYAADAYAIFC GKW+ V P 
Sbjct: 380 ESLIKPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRPA 439

Query: 291 DHMLNYYWEFL 259
           DHMLNYYWEFL
Sbjct: 440 DHMLNYYWEFL 450


>gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis]
          Length = 418

 Score =  238 bits (608), Expect = 6e-60
 Identities = 115/165 (69%), Positives = 132/165 (80%), Gaps = 1/165 (0%)
 Frame = -1

Query: 744 IKSKKVNKH-VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLN 568
           ++ KK+ K  VL A++K  EAY+RKT DN W PP S   L+Q+ H HDPWRVLVIC+LLN
Sbjct: 250 VRRKKIEKSKVLNAAEKRDEAYKRKTDDNKWNPPPSEIRLIQQDHLHDPWRVLVICMLLN 309

Query: 567 RTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEES 388
           RTTG QA RVI D F+LCPNAK ATEV  EEI ++I +LGL  KRA  IQR SREYLEES
Sbjct: 310 RTTGAQATRVISDFFSLCPNAKAATEVSPEEIVKIIHTLGLH-KRAQMIQRFSREYLEES 368

Query: 387 WTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 253
           WTHVTQLHG+GKYAADAYAIFCTGKW+ V+P DHMLNYYW+FL S
Sbjct: 369 WTHVTQLHGVGKYAADAYAIFCTGKWDRVKPADHMLNYYWKFLHS 413


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  238 bits (607), Expect = 7e-60
 Identities = 167/418 (39%), Positives = 211/418 (50%), Gaps = 22/418 (5%)
 Frame = -1

Query: 1446 KDEQNEVNEVVLLPDNFNDLDEKKSTNLSPYFNKVNEQVGEEVVLVESNTKNNLFLQFIY 1267
            +D+ + V      PD+  D  E    N S    K +++   ++ LV+  + N      + 
Sbjct: 19   RDDDSSVMMTRRRPDS--DFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTN-----LVL 71

Query: 1266 NASGDGNRSTSDSNYGKNNNELEG-FSHCFHKAARXXXXXXXXENKKSLEFGYIN-KNII 1093
                DG     D     N+N L+  FS   +K  R          +K  +FG I   N++
Sbjct: 72   QCHDDGCSLEKD-----NSNSLDDLFSGFVYKGVR---------RRKRDDFGSITTSNLV 117

Query: 1092 SSVILENEGEENLVSSPSL----CGDLLDGKVPR-----NGVVVDQV------FSKRGRE 958
            S  I +++ +   VS   +    C  +   KVPR         + Q        S+ GR 
Sbjct: 118  SPQIADDDDDS--VSDSHIERQECSKV-QAKVPRVSPYFQASTISQCDSDIVSSSQSGRN 174

Query: 957  NNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVN 778
                      K R VSPYF  STV+ +                        YFH      
Sbjct: 175  YRKGSSKRQVKARRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH------ 223

Query: 777  GDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKH 613
             D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  LLQE H
Sbjct: 224  ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDH 283

Query: 612  SHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKR 433
             HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ LGLQKKR
Sbjct: 284  WHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKR 343

Query: 432  AVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
               IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYYW++L
Sbjct: 344  TKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 401


>ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella]
           gi|482566361|gb|EOA30550.1| hypothetical protein
           CARUB_v10013672mg [Capsella rubella]
          Length = 456

 Score =  238 bits (606), Expect = 1e-59
 Identities = 136/248 (54%), Positives = 152/248 (61%), Gaps = 5/248 (2%)
 Frame = -1

Query: 987 DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS 808
           D V S+ G            KVR VS YF  S       D                   S
Sbjct: 212 DIVSSQSGGSYRRDSSKHQAKVRRVSRYFQASA------DSEQPNPPRDLRKYFKVVKVS 265

Query: 807 HYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPG 643
            YFH+V   + D     +    KS++V K       L+ SQK  EAY RKT DNTW PP 
Sbjct: 266 RYFHDV---SADGIQVADSQKEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPPR 322

Query: 642 SPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERV 463
           SP  LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLFTLCP+AK ATEV  +EIE +
Sbjct: 323 SPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIESL 382

Query: 462 IQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHM 283
           I+ LGLQKKRA  IQR S EYL ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P DHM
Sbjct: 383 IKPLGLQKKRAKMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDHM 442

Query: 282 LNYYWEFL 259
           LNYYWEFL
Sbjct: 443 LNYYWEFL 450


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
           lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
           ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  237 bits (605), Expect = 1e-59
 Identities = 130/228 (57%), Positives = 145/228 (63%), Gaps = 5/228 (2%)
 Frame = -1

Query: 927 KVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEIG 748
           KVR  SPYF  STV+ +                        YFH       D     E  
Sbjct: 212 KVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVS----RYFH------ADGIQVNESQ 261

Query: 747 CIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVI 583
             KS +V K       L+ SQK  EAY+RKT D TW PP SP  LLQE H HDPWRVLVI
Sbjct: 262 KEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVI 321

Query: 582 CLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSRE 403
           C+LLN+T+G Q R VI DLF LCP+AK ATEV   EIE +I+ LGLQKKRA  IQR S E
Sbjct: 322 CMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLE 381

Query: 402 YLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
           YL+ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P+DHMLNYYWEFL
Sbjct: 382 YLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429


>ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum
            lycopersicum]
          Length = 544

 Score =  236 bits (603), Expect = 2e-59
 Identities = 137/291 (47%), Positives = 170/291 (58%), Gaps = 9/291 (3%)
 Frame = -1

Query: 1104 KNIISSVILENEGEENLVSSPS--LCGDLLDGKVPRNGVVVDQVFSKRGRENNIQYKNAP 931
            K +    + +N+  E ++   +  +C   L+    RNG    +   K+GR      K   
Sbjct: 266  KTVFEPCLSQNQINEKMIEQKARAVCPYFLNS---RNG----ETEMKKGRSVECVKKRND 318

Query: 930  KK----VRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGD 772
            KK    VRVVSPYF N  V  E   GKD                     YF N  +    
Sbjct: 319  KKLRTKVRVVSPYFANLKVGEEIKVGKDSSNASKNCLNGRKVSP-----YFQNAYREKKK 373

Query: 771  DFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRV 592
              +G         K  K  L+ASQK  EAY R++ DN W PP S F LLQE H+HDPWRV
Sbjct: 374  STIGS--------KRQKPCLSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRV 425

Query: 591  LVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRL 412
            LVIC+LLN TTG+Q RRV+ + FTLCPNA  ATEV  E+IE++++ LGL  KR+++I RL
Sbjct: 426  LVICMLLNCTTGVQVRRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRL 485

Query: 411  SREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
            S+EYL ++WTHVTQLHGIGKYAADAYAIFCTG W+ V P DHML  YWEFL
Sbjct: 486  SQEYLGKNWTHVTQLHGIGKYAADAYAIFCTGNWDQVHPNDHMLTKYWEFL 536


>ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
           gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
           thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase
           superfamily protein [Arabidopsis thaliana]
          Length = 445

 Score =  236 bits (603), Expect = 2e-59
 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%)
 Frame = -1

Query: 975 SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFH 796
           S+ GR           KVR VSPYF  STV+ +                        YFH
Sbjct: 207 SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 261

Query: 795 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 631
                  D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  
Sbjct: 262 ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 315

Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451
           LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ L
Sbjct: 316 LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 375

Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271
           GLQKKR   IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY
Sbjct: 376 GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 435

Query: 270 WEFL 259
           W++L
Sbjct: 436 WDYL 439


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  236 bits (603), Expect = 2e-59
 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%)
 Frame = -1

Query: 975 SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFH 796
           S+ GR           KVR VSPYF  STV+ +                        YFH
Sbjct: 181 SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 235

Query: 795 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 631
                  D     E    KS+ V K      VL+ SQK  + Y RKT DNTW PP SP  
Sbjct: 236 ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 289

Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451
           LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV  EEIE +I+ L
Sbjct: 290 LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 349

Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271
           GLQKKR   IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY
Sbjct: 350 GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 409

Query: 270 WEFL 259
           W++L
Sbjct: 410 WDYL 413


>ref|XP_007032156.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|590648404|ref|XP_007032157.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508711185|gb|EOY03082.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao]
          Length = 382

 Score =  235 bits (599), Expect = 6e-59
 Identities = 116/174 (66%), Positives = 137/174 (78%)
 Frame = -1

Query: 780 NGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDP 601
           N D+ LGG    +K   V K VL+ASQK  EAY+RKT +NTW PP S   LLQE H+HDP
Sbjct: 203 NKDNILGGMKKAMKPAGV-KPVLSASQKRDEAYQRKTPNNTWIPPRSNAPLLQEDHTHDP 261

Query: 600 WRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAI 421
           WRVL+IC+LLN+T+G QAR V+ DLFTLCP+AK ATEV T EIE+ I+ LGLQ+KRA  I
Sbjct: 262 WRVLLICMLLNKTSGNQARNVLSDLFTLCPDAKTATEVATGEIEKAIKPLGLQRKRAEMI 321

Query: 420 QRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259
           QR+S+EYL + WTHVT+LHG+GKYAADAYAIFCTGK + V P DHMLNYYW FL
Sbjct: 322 QRMSQEYLWKEWTHVTELHGVGKYAADAYAIFCTGKGDRVTPSDHMLNYYWNFL 375


>ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max]
          Length = 1424

 Score =  231 bits (590), Expect = 7e-58
 Identities = 132/272 (48%), Positives = 156/272 (57%), Gaps = 16/272 (5%)
 Frame = -1

Query: 1020 DGKVPRNGV--VVDQVFSKRGRENN---IQYKNAPKK-----------VRVVSPYFLNST 889
            DG V  NG+  V  +V S + +EN       K  PKK           +R VSPYF N  
Sbjct: 1158 DGNVTENGMINVKRKVISNKLQENGNNATTSKVKPKKKKPLVQKNGHGIRYVSPYFCN-- 1215

Query: 888  VTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLT 709
              N GK +                      H       D     +  C       K    
Sbjct: 1216 --NSGKKVNVKPFDKGSTSESIA------LHTCKNFVEDKLEENKSNCSNKSIEIKRFPP 1267

Query: 708  ASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWD 529
            AS+K  EAY+RKT DNTWKPP S   L+QE H HDPWRVLVIC+LLNRT G Q ++V+ +
Sbjct: 1268 ASEKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPWRVLVICMLLNRTAGGQTKKVVSN 1327

Query: 528  LFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKY 349
             F LCP+AK  T+V  EEIE+ I++LG Q KRA  +QRLS EYL+ESWTHVTQLHG+GKY
Sbjct: 1328 FFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYLDESWTHVTQLHGVGKY 1387

Query: 348  AADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 253
            AADAYAIF TG W+ V P DHMLNYYWEFL S
Sbjct: 1388 AADAYAIFVTGMWDRVTPTDHMLNYYWEFLHS 1419


>ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa]
           gi|550326306|gb|EEE95947.2| hypothetical protein
           POPTR_0012s03470g [Populus trichocarpa]
          Length = 229

 Score =  231 bits (589), Expect = 9e-58
 Identities = 118/177 (66%), Positives = 131/177 (74%), Gaps = 10/177 (5%)
 Frame = -1

Query: 753 IGCIKSKKVNK-------HVLTAS---QKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHD 604
           IG  K KK  K       H  T S    K  EAYERKT++NTWKPP S F  L   H+HD
Sbjct: 48  IGRSKKKKKKKEGTKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQSEFGFLHN-HAHD 106

Query: 603 PWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVA 424
           PWRVLVIC+LLNRT G +A RV+ DLFTLCP+AK AT V TEEIER I+SLGLQK+RA  
Sbjct: 107 PWRVLVICMLLNRTAGTRAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKM 166

Query: 423 IQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 253
           +QRLS +YLEE WTHVTQL G+GKYAADAYAIFCTGKWE V P DHMLN YWE+L S
Sbjct: 167 VQRLSEDYLEEDWTHVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYLCS 223


>emb|CBI29440.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score =  229 bits (584), Expect = 3e-57
 Identities = 108/147 (73%), Positives = 124/147 (84%)
 Frame = -1

Query: 699 KLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFT 520
           KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT
Sbjct: 441 KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 500

Query: 519 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 340
           LCP+AK AT+V TE IE+VI++LGLQKKRA  IQR SREYL++SWTHVTQLHGIGKYAAD
Sbjct: 501 LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 560

Query: 339 AYAIFCTGKWELVEPEDHMLNYYWEFL 259
           AYAIFC+G W LV P DHML  YW++L
Sbjct: 561 AYAIFCSGDWGLVVPNDHMLVKYWKYL 587


>ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera]
          Length = 536

 Score =  229 bits (584), Expect = 3e-57
 Identities = 108/147 (73%), Positives = 124/147 (84%)
 Frame = -1

Query: 699 KLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFT 520
           KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT
Sbjct: 378 KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 437

Query: 519 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 340
           LCP+AK AT+V TE IE+VI++LGLQKKRA  IQR SREYL++SWTHVTQLHGIGKYAAD
Sbjct: 438 LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 497

Query: 339 AYAIFCTGKWELVEPEDHMLNYYWEFL 259
           AYAIFC+G W LV P DHML  YW++L
Sbjct: 498 AYAIFCSGDWGLVVPNDHMLVKYWKYL 524


>ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris]
            gi|561039879|gb|ESW35973.1| hypothetical protein
            PHAVU_L0004001g, partial [Phaseolus vulgaris]
          Length = 715

 Score =  229 bits (583), Expect = 4e-57
 Identities = 121/233 (51%), Positives = 148/233 (63%), Gaps = 1/233 (0%)
 Frame = -1

Query: 942  KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS-HYFHNVPKVNGDDF 766
            KN    +R VSPYF N +    GK++                  + +Y  + P+ N    
Sbjct: 492  KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 543

Query: 765  LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLV 586
               +  C +     K  L+ASQK  EAY+RKT D TWKPP S   L+QE H+HDPWRVLV
Sbjct: 544  ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 600

Query: 585  ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 406
            IC+LLNRT+G Q + ++ D F LCP+AK  TEV  EEIE  I++LG Q KRA  ++RLS 
Sbjct: 601  ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 660

Query: 405  EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 247
            EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL   Y
Sbjct: 661  EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 713


>ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris]
            gi|561039878|gb|ESW35972.1| hypothetical protein
            PHAVU_L0004001g [Phaseolus vulgaris]
          Length = 726

 Score =  229 bits (583), Expect = 4e-57
 Identities = 121/233 (51%), Positives = 148/233 (63%), Gaps = 1/233 (0%)
 Frame = -1

Query: 942  KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS-HYFHNVPKVNGDDF 766
            KN    +R VSPYF N +    GK++                  + +Y  + P+ N    
Sbjct: 503  KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 554

Query: 765  LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLV 586
               +  C +     K  L+ASQK  EAY+RKT D TWKPP S   L+QE H+HDPWRVLV
Sbjct: 555  ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 611

Query: 585  ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 406
            IC+LLNRT+G Q + ++ D F LCP+AK  TEV  EEIE  I++LG Q KRA  ++RLS 
Sbjct: 612  ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 671

Query: 405  EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 247
            EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL   Y
Sbjct: 672  EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 724


>gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]
          Length = 369

 Score =  222 bits (565), Expect = 5e-55
 Identities = 123/243 (50%), Positives = 148/243 (60%), Gaps = 2/243 (0%)
 Frame = -1

Query: 969 RGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNV 790
           R ++N +      KKV V+ PYF         +DM                    YF + 
Sbjct: 146 RKKKNIVTEDGCDKKVVVLDPYF--------AEDMSRKKVSP-------------YFQSP 184

Query: 789 PKVNGDDFLGGEI--GCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEK 616
            K +G D    E+     +  K  K VL++ QK  EAYER+T DN W PP SPF LLQE 
Sbjct: 185 RKTSGSDRGISEVVEESPERSKRWKPVLSSVQKRDEAYERRTPDNEWTPPRSPFNLLQED 244

Query: 615 HSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKK 436
           H  DPWRVLVIC+LLN+TTG QA RV+  LF LCP AK ATEV  ++IE  I+ LGLQ+K
Sbjct: 245 HMFDPWRVLVICMLLNQTTGRQAFRVLSKLFELCPTAKAATEVARDDIEDAIRCLGLQRK 304

Query: 435 RAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLI 256
           RA  IQR S EY+ E WTHVT+L GIGKYAADAYAIFCTG+W+ V P DHML  YWE+L 
Sbjct: 305 RAEMIQRFSEEYMSEEWTHVTELPGIGKYAADAYAIFCTGRWQRVRPADHMLVKYWEWLN 364

Query: 255 SFY 247
            F+
Sbjct: 365 EFF 367

