BLASTX nr result

ID: Rehmannia23_contig00021471 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00021471
         (866 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EMJ21917.1| hypothetical protein PRUPE_ppa022673mg [Prunus pe...   165   2e-38
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    157   5e-36
ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...   148   2e-33
gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [...   147   5e-33
gb|EPS67647.1| hypothetical protein M569_07127, partial [Genlise...   146   1e-32
gb|EPS67646.1| hypothetical protein M569_07128 [Genlisea aurea]       145   2e-32
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...   141   3e-31
ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [A...   141   4e-31
gb|EMJ11440.1| hypothetical protein PRUPE_ppa014973mg, partial [...   138   3e-30
emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]   138   3e-30
ref|XP_004154145.1| PREDICTED: uncharacterized protein LOC101207...   137   4e-30
gb|EMJ02309.1| hypothetical protein PRUPE_ppa024392mg [Prunus pe...   135   1e-29
ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210...   132   2e-28
gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus pe...   132   2e-28
gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]         129   1e-27
gb|EOY26421.1| DNA/RNA polymerases superfamily protein [Theobrom...   129   1e-27
gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus pe...   129   2e-27
gb|EOY16854.1| DNA/RNA polymerases superfamily protein [Theobrom...   128   2e-27
gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobrom...   126   1e-26
emb|CAN83518.1| hypothetical protein VITISV_035077 [Vitis vinifera]   125   2e-26
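
Note: the rows of the summary table above can be read programmatically. The following is a minimal sketch, assuming each row has the layout shown here (identifier, free-text description, bit score, E value separated by whitespace). The names `ROW` and `parse_hit_row` are hypothetical helpers introduced for illustration and are not part of BLAST.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# identifier, description (may be truncated with "..."), bit score, E value.
# Assumption: the last two whitespace-separated fields are always numeric.
ROW = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+"
    r"(\d+(?:\.\d+)?(?:e[-+]?\d+)?|e[-+]?\d+)\s*$"
)


def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    identifier, description, bits, evalue = m.groups()
    # Legacy BLAST can print very small E values without a mantissa ("e-100").
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return {
        "id": identifier,
        "description": description,
        "bits": float(bits),
        "evalue": float(evalue),
    }


row = parse_hit_row(
    "gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    157   5e-36"
)
print(row["id"], row["bits"], row["evalue"])
```

Truncated descriptions are kept exactly as printed; the full title is only available in the corresponding alignment header further down.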

>gb|EMJ21917.1| hypothetical protein PRUPE_ppa022673mg [Prunus persica]
          Length = 1506

 Score =  165 bits (417), Expect = 2e-38
 Identities = 90/263 (34%), Positives = 136/263 (51%), Gaps = 31/263 (11%)
 Frame = -3

Query: 702  SAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKDRRE------------ 559
            S   ++P C +C R H G C  G   C+ C +PGH   +CPL    RE            
Sbjct: 335  SGRRSRPQCARCGRYHSGPCQQGTTGCYYCGQPGHFQKDCPLFPQTRETTDAPTPGTASS 394

Query: 558  ---------------MRGQGG----TARMYAMTQEEADQNPGTMSGMLTISGIPALVLFD 436
                            RG+GG    T R+Y M+Q+EA  +P  ++G+L + GIPA VL D
Sbjct: 395  SGGAQTSVASHGSSQQRGRGGRSRATGRVYNMSQQEAHASPEVITGILPVFGIPARVLID 454

Query: 435  TGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVY 256
             GATHSF++  F      + + +   L IS+P+G+      +  +  V +G    EAD+ 
Sbjct: 455  PGATHSFVTPSFAHNANVRLSALQTELAISVPTGEIFRVGTVYRDSTVLVGNVFFEADLI 514

Query: 255  IIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKAR 76
             + M D DVILGM+WL++++AS+ C  KE+  + PG  E+TFYG        +ISAM A+
Sbjct: 515  PLGMVDLDVILGMDWLARHRASVDCFRKEVVFRSPGRPEVTFYGKRRVLPSYLISAMTAK 574

Query: 75   KMLAKENCQGYLVSLVEAPTTDL 7
            ++L K  C GY+  +++    +L
Sbjct: 575  RLLRK-GCSGYIAHVIDTRDNEL 596
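
Note: the "Frame = -3" line and the descending query coordinates (702 down to 7) both indicate a match on the reverse strand of the 866-letter query. The sketch below shows one way to relate the two, assuming the usual convention that frame -1 starts at the last nucleotide of the query and that coordinates are 1-based. The function names are hypothetical; the formula is consistent with the HSPs in this report but is given as an assumption, not as BLAST's own code.

```python
def blastx_frame(query_len, q_start, q_end):
    """Reading frame implied by the nucleotide coordinates of a BLASTX HSP.

    q_start and q_end are the first and last query coordinates as printed
    on the "Query:" lines (q_start > q_end for reverse-strand matches).
    """
    if q_start <= q_end:
        # Forward strand: frames +1, +2, +3 start at bases 1, 2, 3.
        return (q_start - 1) % 3 + 1
    # Reverse strand: count the offset from the 3' end of the query.
    return -((query_len - q_start) % 3 + 1)


def aligned_codons(q_start, q_end):
    """Number of query codons spanned by the HSP (gap columns not counted)."""
    return (abs(q_start - q_end) + 1) // 3


# First HSP of this report: Query 702 .. 7 on an 866-letter query.
print(blastx_frame(866, 702, 7))
print(aligned_codons(702, 7))
```

A quick check of this kind can help catch coordinate or strand mix-ups when HSPs are copied into other tools.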


>gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]
          Length = 871

 Score =  157 bits (397), Expect = 5e-36
 Identities = 92/241 (38%), Positives = 136/241 (56%), Gaps = 9/241 (3%)
 Frame = -3

Query: 699  APNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKDRREMRGQGGTA----R 532
            A   KP C  C ++H G CL G   CF C++ GH A  CPL +     + QG  A    R
Sbjct: 553  AARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPL-RPTGIAQNQGAGAPLQGR 611

Query: 531  MYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVP-- 358
            ++A  + EA++    ++G L + G  ALVLFD+G++HSFIS+ F     H  AR++V   
Sbjct: 612  VFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAF---VSH--ARLEVEPL 666

Query: 357  ---LEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASI 187
               L +S PSG+ +++        + I G  +E  + +++M DFDVILGM+WL+   ASI
Sbjct: 667  HHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASI 726

Query: 186  KCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYLVSLVEAPTTDL 7
             C  KE+T   P      F G  SKS+P+VISA++A K+L+ +   G L S+V+    D+
Sbjct: 727  DCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLS-QGTWGILASVVDTREADV 785

Query: 6    S 4
            S
Sbjct: 786  S 786


>ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus]
          Length = 655

 Score =  148 bits (374), Expect = 2e-33
 Identities = 81/245 (33%), Positives = 126/245 (51%), Gaps = 4/245 (1%)
 Frame = -3

Query: 747  GNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKD 568
            G+  +  +            +P C  C + H G CL G   C+ CK+ GH A  CPL   
Sbjct: 325  GDSFRSFQQSSGGAGDTTQERPVCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRST 384

Query: 567  RREMRGQGGT----ARMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKF 400
                  QG        ++A  + EA++    ++G L + G  AL LFD+G++HSFIS+ F
Sbjct: 385  GAGSSSQGERPPQRGTIFATNRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLF 444

Query: 399  HDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILG 220
                  +   +D  L +S PSG+ +++       ++ I G+ L+  + +++M+DFDVILG
Sbjct: 445  VTHACLEVEPLDYVLSVSTPSGEIMLSKEKIKACEIEIAGRVLDVTLLVLDMRDFDVILG 504

Query: 219  MNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYL 40
            M+WL+   ASI C  KE+    P      F GV +  +P+VISAMKA K+L  +     L
Sbjct: 505  MDWLATNHASIDCSRKEVVFSPPTASSFKFKGVGTVVLPKVISAMKASKLL-NQGTWSIL 563

Query: 39   VSLVE 25
             S+V+
Sbjct: 564  ASVVD 568


>gb|EMJ16022.1| hypothetical protein PRUPE_ppa023432mg, partial [Prunus persica]
          Length = 590

 Score =  147 bits (371), Expect = 5e-33
 Identities = 97/317 (30%), Positives = 147/317 (46%), Gaps = 56/317 (17%)
 Frame = -3

Query: 789  SEKRKFDGTKP--SKMGNQNKQ-----------IRIDGNSRSSAPNNK--PTCPKCNRNH 655
            S  R F G +P  S  G  N+            +R  G    SA   +  P C  C R H
Sbjct: 232  SRGRSFRGFRPGISSSGGSNRSGSFGSRLVGNAVRGSGRQSPSAVGGRRNPQCTVCGRYH 291

Query: 654  YGECLAGQGKCFMCKKPGHDASNCPLL--------------------------------- 574
             G C  G   CF C +PGH    CP+L                                 
Sbjct: 292  TGTCRQGTTGCFHCGQPGHFLRECPVLLQGGEATVTMPTEIGTEGKTQFRGASSSGGIQT 351

Query: 573  ----KDRREMRGQGG----TARMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHS 418
                +   + +G+GG    T R+Y M+Q++A  +P  ++GML++ G PA VL D+GATHS
Sbjct: 352  SVASRGGSQQQGRGGRARATGRVYHMSQQQAQPSPDVVTGMLSVFGTPARVLIDSGATHS 411

Query: 417  FISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKD 238
            F++         + + +   L IS+P+G+      + ++  + +    LEAD+  +EM  
Sbjct: 412  FVTPSVARNADVRQSALRDELAISVPTGEIFYVGTVYSDSAILVRDVCLEADLIPLEMVG 471

Query: 237  FDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKE 58
             DVILGM+WL K+ A++ C  KE+ L+  G  E+TFYG        +IS M A ++L K 
Sbjct: 472  LDVILGMDWLVKHHAAVDCFRKEVILRSLGRPEVTFYGERRVLPSSLISVMMATRLLRK- 530

Query: 57   NCQGYLVSLVEAPTTDL 7
             C GY+  +V++ T +L
Sbjct: 531  GCSGYVAYIVDSRTQEL 547


>gb|EPS67647.1| hypothetical protein M569_07127, partial [Genlisea aurea]
          Length = 503

 Score =  146 bits (368), Expect = 1e-32
 Identities = 93/253 (36%), Positives = 136/253 (53%), Gaps = 9/253 (3%)
 Frame = -3

Query: 861 MVRRAHEVEAGLAGDDDDRKPAVVSEKRKFDGTKPSKMGNQNKQIRIDG---NSRSSAPN 691
           +V  A EVE  L     D++  VV++KR F+G +  +MG +    R +    N+  + P 
Sbjct: 138 VVNFAIEVEEEL-----DKRVDVVTKKRSFEGYRVEQMGKRQFSSRFEHLPKNATRAPPP 192

Query: 690 NKPTCPKCNRNHYGECLAGQGK-CFMCKKPGHDASNCPLLKDR-----REMRGQGGTARM 529
           NK  C KC++ H G C  GQ   CF C K GH A NC   K R     +E     G AR+
Sbjct: 193 NKTICVKCHKAHQGICRLGQPLICFYCGKEGHYARNCLAKKAREAEMSKEKDVPRGKARV 252

Query: 528 YAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEI 349
           + ++Q+EA  +   +SG + ++GIPA  LFDTGAT SF+S  F      +  RI     +
Sbjct: 253 FTISQDEATTSTDLISGTIDVNGIPAYTLFDTGATDSFVSKSFAKTLRIEPERIS--FTV 310

Query: 348 SLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKE 169
           +   G  + +  +  + +V +G   L AD+  + M DFDVILG+ WLSK+ ASIKC+EK+
Sbjct: 311 TTAGGMKLGSKFIYMDCEVMVGRHKLSADLKELPMNDFDVILGLEWLSKHGASIKCREKQ 370

Query: 168 ITLKLPGDEEITF 130
           I      ++ I F
Sbjct: 371 ICFNEFEEQPIDF 383


>gb|EPS67646.1| hypothetical protein M569_07128 [Genlisea aurea]
          Length = 503

 Score =  145 bits (365), Expect = 2e-32
 Identities = 93/255 (36%), Positives = 129/255 (50%), Gaps = 11/255 (4%)
 Frame = -3

Query: 861 MVRRAHEVEAGLAGDDDDRKPAVVSEKRKFDGTKPSKMGNQNKQIRIDGNSRSSA----- 697
           +V  A EVE  L     DR     + KR F+G +  ++  +    R +   R        
Sbjct: 149 VVNFALEVEEEL-----DRTARTENNKRPFEGYRVERVVKRQFPRRFEPIPRRDMRPMRP 203

Query: 696 PNNKPT-CPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKDR-----REMRGQGGTA 535
           P   PT C KC+R H GEC  G   CF C KPGH A +CP  + R     +E     G A
Sbjct: 204 PAPNPTVCEKCHRIHQGECRMGPLTCFYCGKPGHYARDCPTKRAREGEGAKEKNVPRGKA 263

Query: 534 RMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPL 355
           R++ +TQ+EA  +   +SG + ++GIPA  LFDTGAT SF+S  F    G +  RI    
Sbjct: 264 RVFTITQDEATTSTDLISGTIDVNGIPAFTLFDTGATDSFVSKSFAKSLGVEPGRIS--F 321

Query: 354 EISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQE 175
            +    GK + +  +  N +V IG     AD+  + M DFDVILG  WL+K  ASI+C+E
Sbjct: 322 TVFTAGGKKLGSRFIYRNCEVVIGSHRFSADLKELYMIDFDVILGFEWLTKQGASIRCKE 381

Query: 174 KEITLKLPGDEEITF 130
           ++I     G E + F
Sbjct: 382 RQICFDETGKEPVDF 396


>ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis
            sativus]
          Length = 804

 Score =  141 bits (356), Expect = 3e-31
 Identities = 80/252 (31%), Positives = 128/252 (50%), Gaps = 4/252 (1%)
 Frame = -3

Query: 747  GNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKD 568
            G+  +  +            KP C  C + H G CL G   C+ CK+ GH A  C L   
Sbjct: 324  GDPFRNFQQSSGGAGDTTREKPLCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCRLRST 383

Query: 567  RREMRGQGG----TARMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKF 400
                  QG        ++A ++ EA++    ++G L + G  AL LFD+G++HSFIS+ F
Sbjct: 384  GAGQSSQGAGPPQRGTIFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLF 443

Query: 399  HDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILG 220
                  +   +D  L +S PSG+ +++        + I G+ L+  + +++++DFDVILG
Sbjct: 444  VTHACLEVKPLDYVLSVSTPSGEIMLSKEKIKACKIEIAGRVLDVTLLVLDIRDFDVILG 503

Query: 219  MNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYL 40
            M+ L+   ASI C  KE+    P +    F GV +  +P+VISAMKA K+L+ +     L
Sbjct: 504  MDLLATNHASIDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLS-QGTWSIL 562

Query: 39   VSLVEAPTTDLS 4
             S+V+    + S
Sbjct: 563  ASVVDTREDETS 574


>ref|XP_006849815.1| hypothetical protein AMTR_s01849p00006620 [Amborella trichopoda]
           gi|548853397|gb|ERN11396.1| hypothetical protein
           AMTR_s01849p00006620 [Amborella trichopoda]
          Length = 383

 Score =  141 bits (355), Expect = 4e-31
 Identities = 84/237 (35%), Positives = 125/237 (52%), Gaps = 7/237 (2%)
 Frame = -3

Query: 813 DDRKPAVVSEKRKFDGTKPSKMGNQNKQIRIDGNSR--SSAPNNKPTCPKCNRNHYGECL 640
           + +K    S   K  G   S   +Q+K+ + D + R   S+  N P CPKC + H GEC 
Sbjct: 149 ESKKGGANSNDHKKRGQDQSGQPSQDKRYKSDNDQRFNGSSGRNIPECPKCTKRHLGECR 208

Query: 639 AGQGKCFMCKKPGHDASNCPL---LKDRREMRGQGG--TARMYAMTQEEADQNPGTMSGM 475
           A    C+ C K GH   NCPL     +R E +       AR++A+TQ EA+ +P  +SG 
Sbjct: 209 AKA--CYKCGKEGHIKRNCPLWGQTGNRAEPKKDDKYVPARVFAITQAEAEASPSVVSGQ 266

Query: 474 LTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNID 295
           + ++     VLFD+GATHSFI++   +          V     LPSG+ +I+      I 
Sbjct: 267 IPMANTTCKVLFDSGATHSFIASTIVNHINAPSELFTVGFGTMLPSGEVVISRNWLRGIP 326

Query: 294 VNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYG 124
           V I G+ L  D+ ++++ DFDVILGM++L+KY ASI C++K++       E   F G
Sbjct: 327 VRIDGRELFVDLIVLDLFDFDVILGMDFLTKYGASIDCKQKKVVFTPEDGETFEFRG 383


>gb|EMJ11440.1| hypothetical protein PRUPE_ppa014973mg, partial [Prunus persica]
          Length = 747

 Score =  138 bits (347), Expect = 3e-30
 Identities = 88/263 (33%), Positives = 136/263 (51%), Gaps = 8/263 (3%)
 Frame = -3

Query: 789 SEKRKFDGTKP--SKMGNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCF- 619
           S    + G +P  S  G  N+    +  S  SA           R    + L+G G+   
Sbjct: 77  SNGHSYGGYRPGFSSSGGSNQSSSFENRSGGSA-----------RGVGRQLLSGSGRRSR 125

Query: 618 -MCKKPGHDASNCPLLKDRREMRGQGG----TARMYAMTQEEADQNPGTMSGMLTISGIP 454
             C + G  AS         + RG+GG    T R+Y M+Q+EA  +P  ++G+L + GIP
Sbjct: 126 PQCARCGSVASG-----GSSQQRGRGGRSRATGRVYNMSQQEAHASPDVITGILPVFGIP 180

Query: 453 ALVLFDTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKH 274
           A VL D GATHSF++  F      + + +   L IS+P+G+      +  +  V +G   
Sbjct: 181 ARVLIDPGATHSFVTPSFAHNANVRLSALQTELAISVPTGEIFRIGTVYRDSTVMVGNVF 240

Query: 273 LEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVI 94
           LEAD+  + M D DVILGM+WL++++AS+ C  KE+  + PG  E+TFYG        +I
Sbjct: 241 LEADLIPLGMVDLDVILGMDWLARHRASVDCFRKEVVFRSPGRHEVTFYGERRVLPSCLI 300

Query: 93  SAMKARKMLAKENCQGYLVSLVE 25
           SAM A+++L K  C GY+  +++
Sbjct: 301 SAMTAKRLLRK-GCSGYIAHVID 322


>emb|CAN62233.1| hypothetical protein VITISV_010121 [Vitis vinifera]
          Length = 1797

 Score =  138 bits (347), Expect = 3e-30
 Identities = 75/221 (33%), Positives = 118/221 (53%), Gaps = 14/221 (6%)
 Frame = -3

Query: 678  CPKCNRNHYGE-CLAGQGKCFMCKKPGHDASNCPL-------------LKDRREMRGQGG 541
            CP C + H G  C    G CF C + GH   +CP               KD+++ + QG 
Sbjct: 715  CPTCGKKHGGRPCYREIGACFCCGEQGHLIRDCPENRKFITGKPKEENKKDKQKPKAQGW 774

Query: 540  TARMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDV 361
               ++AMT  +A      ++G L I  + A VL D G+THSF+S  F  + G   A +D 
Sbjct: 775  ---VFAMTHRDAQATSDVVTGTLRIHTLFARVLIDLGSTHSFVSVSFAGLLGLPVASMDF 831

Query: 360  PLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKC 181
             L ++ P G +++   M  N  V IG + +  D+ +++++DFDVILGM+WL+ Y AS+ C
Sbjct: 832  DLIVATPVGDSVVASRMLRNCIVMIGYREMPIDLVLLDLQDFDVILGMDWLASYHASVDC 891

Query: 180  QEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKE 58
             EK +T  +PG  + +F G +     R+IS ++A  +L K+
Sbjct: 892  FEKRVTFSIPGQPKFSFEGKHVDRPLRMISTLRASSLLKKD 932



 Score =  116 bits (290), Expect = 1e-23
 Identities = 63/167 (37%), Positives = 99/167 (59%)
 Frame = -3

Query: 528 YAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEI 349
           Y   Q + +++ G   G L I  + A VL D G+THSF+S  F  + G   A +D  L +
Sbjct: 71  YREQQRKRNRSDGA-HGTLRIHTLFARVLIDPGSTHSFVSVSFAGLLGLPVASMDFDLIV 129

Query: 348 SLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKE 169
           + P G  ++   M  N  V IG + +  D+ +++++DFDVILGM+WL+ Y ASI C EK 
Sbjct: 130 ATPVGDFVVASRMLRNCIVMIGYREMLVDLVLLDLQDFDVILGMDWLTSYHASIDCFEKR 189

Query: 168 ITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYLVSLV 28
           +T  +PG  + +F G +     R+ISA++A  +L K+ CQG+L S++
Sbjct: 190 VTFSIPGQPKFSFEGKHVDRPLRMISALRASSLL-KKGCQGFLASVM 235


>ref|XP_004154145.1| PREDICTED: uncharacterized protein LOC101207632 [Cucumis sativus]
          Length = 446

 Score =  137 bits (346), Expect = 4e-30
 Identities = 82/255 (32%), Positives = 128/255 (50%), Gaps = 7/255 (2%)
 Frame = -3

Query: 747 GNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKD 568
           G+  +  +   +        +P C  C + H G CL G   C+ CK+ GH A  CPL   
Sbjct: 100 GDSFRSFQQSSSGAGDTTQERPVCNTCGKRHLGRCLMGTRVCYKCKQEGHMADRCPLRST 159

Query: 567 RREMRGQGGT----ARMYAMTQEEADQNPGTMSGMLTIS-GIPALVLFDTGATHSFISAK 403
                 QG        ++A  + EA++    ++G L+ + G  AL LFD+G++HSFIS+ 
Sbjct: 160 GAGSSSQGERPPQRGTIFATNRSEAEKVGTIVTGTLSRARGHFALTLFDSGSSHSFISSL 219

Query: 402 FHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVIL 223
           F      +   +D  L +S PSG+ ++        ++ I G+ L+  + +++M DFDVIL
Sbjct: 220 FVTHACLEVEPLDYVLSVSTPSGEIMLYKEKIKACEIEIAGRVLDVTLLVLDMCDFDVIL 279

Query: 222 GMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGY 43
           GM+WL+   ASI C  KE+    P      + GV +  +P+VISAMK  K+L  +     
Sbjct: 280 GMDWLATNHASIDCSRKEVVFSPPTASSFKYKGVGTVVLPKVISAMKVSKLL-NQGTWSI 338

Query: 42  LVSLVE--APTTDLS 4
           L S+V+   P   LS
Sbjct: 339 LASVVDTRGPKVSLS 353


>gb|EMJ02309.1| hypothetical protein PRUPE_ppa024392mg [Prunus persica]
          Length = 1363

 Score =  135 bits (341), Expect = 1e-29
 Identities = 97/338 (28%), Positives = 145/338 (42%), Gaps = 63/338 (18%)
 Frame = -3

Query: 852  RAHEVEAGLAGDDDDRKPAVVSEK---RKFDGTKP--SKMGNQNKQIRIDGNSRSSAPN- 691
            R H+ E G       ++ +  S     R + G +P  S  G  N+       S  SA   
Sbjct: 145  RRHQFEIGDPSQGSSKRGSYSSGSSSGRSYGGYRPRFSSSGGSNQSGSSGNRSGGSAARG 204

Query: 690  ------------NKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKDRRE---- 559
                        ++P C K  R H G C  G   C+ C + GH   +CPLL   RE    
Sbjct: 205  IGRQLLSVSGMRSRPQCAKYGRYHSGTCQRGTTGCYYCGQSGHFRKDCPLLLQNRETTTA 264

Query: 558  -------------------------------------MRGQGG----TARMYAMTQEEAD 502
                                                  RG+GG    T R+Y M+Q+EA 
Sbjct: 265  STPGTGIQGRSQRTPQFVEASSSGGAQTSVANRGSSQQRGRGGRSRATGRVYNMSQQEAH 324

Query: 501  QNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTII 322
             +P  ++GML + GIPA ++ D GATHSF++  F      + + +   L IS+P+G+   
Sbjct: 325  TSPHVITGMLPVFGIPARIMIDPGATHSFVTPSFAHNANVRLSTLQNELAISVPTGEIFK 384

Query: 321  TDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDE 142
               +  +  V +G   LEAD+  + M D DVIL M+WL+K+ AS+ C  KE+    P   
Sbjct: 385  VGTVYRDSIVMVGDVFLEADLIPLGMVDLDVILWMDWLAKHCASVNCFRKEVVFSSPRRP 444

Query: 141  EITFYGVNSKSVPRVISAMKARKMLAKENCQGYLVSLV 28
            E+TFY          +  M A+++L K  C GY+  ++
Sbjct: 445  EVTFY----------VEPMTAKQLLIKW-CSGYIAHVI 471


>ref|XP_004148918.1| PREDICTED: uncharacterized protein LOC101210300 [Cucumis sativus]
          Length = 623

 Score =  132 bits (332), Expect = 2e-28
 Identities = 75/228 (32%), Positives = 117/228 (51%)
 Frame = -3

Query: 687 KPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLLKDRREMRGQGGTARMYAMTQEE 508
           KP C  C ++H G CL G   C+ C++ GH A                         + E
Sbjct: 331 KPVCNTCGKHHLGRCLMGTRVCYKCRQEGHMAD------------------------RSE 366

Query: 507 ADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKT 328
           A++    ++G L + G  AL LFD+G++HSFIS+ F      +   +D  L +S PSG+ 
Sbjct: 367 AEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEPLDYVLSVSTPSGEI 426

Query: 327 IITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPG 148
           +++       ++ I G+ L+  + +++M+DFDVILGM+WL+   ASI C  KE+    P 
Sbjct: 427 MLSKEKIKACEIEIAGRVLDVTLLVLDMRDFDVILGMDWLATNHASIDCSRKEVVFSPPT 486

Query: 147 DEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYLVSLVEAPTTDLS 4
           +    F GV    +P+VISAMKA K+L  +     L S+V+    + S
Sbjct: 487 ESSFKFKGVGRVVLPKVISAMKASKLL-NQGTWSILASVVDTREDETS 533


>gb|EMJ28581.1| hypothetical protein PRUPE_ppb016096mg [Prunus persica]
          Length = 505

 Score =  132 bits (331), Expect = 2e-28
 Identities = 77/252 (30%), Positives = 128/252 (50%), Gaps = 34/252 (13%)
 Frame = -3

Query: 678 CPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLL------------------------- 574
           CP   +   G+  +    C+ C + GH   +CP++                         
Sbjct: 190 CPSYTQGG-GQSQSSSLTCYFCGQVGHTKRSCPIILQSDAAIQRTGAQQGQAGSSNSRAL 248

Query: 573 -----KDRREMRGQGGTA----RMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATH 421
                +  R+ RGQ G +    R+++MTQ+EA   P  ++GM+ I G  A VL D GATH
Sbjct: 249 SSSRGRSGRQSRGQPGRSTTQGRVFSMTQQEAHATPDVITGMIPIFGYLARVLIDPGATH 308

Query: 420 SFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMK 241
           SF++  F      +   +     ISLP+G+ +  D +  N  V +    LEA++  +++ 
Sbjct: 309 SFVAHNFAPYINVRPTPMIGSFSISLPTGEVLYVDRVFRNCFVQVDDAWLEANLTPLDLV 368

Query: 240 DFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAK 61
           D D+ILGM+WL K+ AS+ C  K++TL+ PG  ++TF G        +ISA+ A+++L K
Sbjct: 369 DLDIILGMDWLEKHHASVDCFRKKVTLRSPGQPKVTFGGERRVLPTCLISAITAKRLL-K 427

Query: 60  ENCQGYLVSLVE 25
           + C+GYL  +++
Sbjct: 428 KGCEGYLAHIID 439


>gb|EOY08404.1| Retrotransposon-like protein [Theobroma cacao]
          Length = 654

 Score =  129 bits (325), Expect = 1e-27
 Identities = 84/283 (29%), Positives = 131/283 (46%), Gaps = 45/283 (15%)
 Frame = -3

Query: 753  KMGNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPL- 577
            ++G +    R   +SR S+   + +C  C R H G C      C+ C +PGH   +CP+ 
Sbjct: 196  RVGQRTFNSRRQQDSRQSSQVIR-SCDTCGRRHSGRCFLTTKTCYGCGQPGHIRRDCPMA 254

Query: 576  ---------------------LKDRREM---RGQG--------------------GTARM 529
                                 +   RE+   RG+G                    G AR+
Sbjct: 255  HQSPDSARGSTQPASSAPSVAVSSGREVSGSRGRGAGTSSQGKPSGSGHQSSIGRGQARV 314

Query: 528  YAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEI 349
            +A+TQ+EA  +   +SG+L++  + A VLFD GATHSFIS  F    G    R +  L +
Sbjct: 315  FALTQQEAQTSNAVVSGILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQLVV 374

Query: 348  SLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKE 169
            S P  +  + +    +  V +  K    ++ +++  DFDVILGMNWLS   AS+ C  K 
Sbjct: 375  STPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKL 434

Query: 168  ITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYL 40
            +    PG+   +  G  S +   +IS + AR++L ++ C GYL
Sbjct: 435  VRFDFPGEPSFSIQGDRSNAPTNLISVISARRLL-RQGCIGYL 476


>gb|EOY26421.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 334

 Score =  129 bits (324), Expect = 1e-27
 Identities = 80/262 (30%), Positives = 121/262 (46%), Gaps = 45/262 (17%)
 Frame = -3

Query: 681 TCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPL----------------------LKD 568
           +C  C R H G C      C+ C +PGH   +CP+                      +  
Sbjct: 15  SCDTCGRRHSGRCFLTTKTCYGCGQPGHIRRDCPMAHQSPDFARGSTQPASSALSVVVSS 74

Query: 567 RREM---RGQG--------------------GTARMYAMTQEEADQNPGTMSGMLTISGI 457
            RE+   RG+G                    G AR++A+TQ+EA  +   +SG+L++  +
Sbjct: 75  GREVSGSRGKGAGTSSQGRPSGSGHQSSIGRGQARVFALTQQEAQTSNAVVSGILSVCNM 134

Query: 456 PALVLFDTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGK 277
            A VLFD GATHSFIS  F    G    R +  L +S P  +  + +    +  V +  K
Sbjct: 135 NARVLFDPGATHSFISPCFASRLGRGRVRREEQLVVSTPLKEIFVAEWEYESCVVRVKDK 194

Query: 276 HLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRV 97
               ++ +++  DFDVILGMNWLS   AS+ C  K +    PG+   +  G  S +   +
Sbjct: 195 DTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKLVRFDFPGEPSFSIQGDRSNAPTNL 254

Query: 96  ISAMKARKMLAKENCQGYLVSL 31
           IS + AR++L ++ C GYL  L
Sbjct: 255 ISVISARRLL-RQGCIGYLAVL 275


>gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score =  129 bits (323), Expect = 2e-27
 Identities = 71/184 (38%), Positives = 109/184 (59%), Gaps = 4/184 (2%)
 Frame = -3

Query: 564 REMRGQGGT----ARMYAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFH 397
           R+ RGQ G     AR+++MTQ+EA   P  ++GM+ I G  A VL D GATHSF++  F 
Sbjct: 335 RQSRGQPGRSTTQARVFSMTQQEAYATPDVITGMIPIFGYLARVLIDPGATHSFVAHNFI 394

Query: 396 DITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGM 217
                +   I     ISLP+G+ +  D +  N  V +    LEA++  +++ D D+ILGM
Sbjct: 395 PYISIRPTPITGSFSISLPTGEVLYADRVFRNCFVQVDDAWLEANLIPLDLVDLDIILGM 454

Query: 216 NWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYLV 37
           +WL K+ AS+ C  KE+TL+ PG  ++TF G        +ISA+ A+K+L K+  +GYL 
Sbjct: 455 DWLEKHHASVDCFRKEVTLRSPGQPKVTFRGERRVLPTCLISAITAKKLL-KKGYEGYLA 513

Query: 36  SLVE 25
            +++
Sbjct: 514 HIID 517


>gb|EOY16854.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 737

 Score =  128 bits (322), Expect = 2e-27
 Identities = 85/284 (29%), Positives = 131/284 (46%), Gaps = 45/284 (15%)
 Frame = -3

Query: 753  KMGNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPL- 577
            ++G +    R   +SR S+   + +C    R H G C      C+ C +PGH   +CP+ 
Sbjct: 356  RVGQRTFSSRRQQDSRQSSQVIR-SCDTYGRRHSGRCFLTTKTCYRCGQPGHIRRDCPMA 414

Query: 576  ---------------------LKDRREM---RGQG--------------------GTARM 529
                                 +   RE+   RG+G                    G AR+
Sbjct: 415  HQSPDSARGSTQPASSAPSVTVSSGREVSGSRGRGAGTSSQGRPSGSGHQSSIGRGQARV 474

Query: 528  YAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEI 349
            +A+TQ+EA  +   +SG+L++  I A VLFD GATHSFIS  F    G    R +  L +
Sbjct: 475  FALTQQEAQTSNAVVSGILSVCNINARVLFDPGATHSFISPCFASRLGRGRVRREEQLVV 534

Query: 348  SLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKE 169
            S P  +  + +    +  V +  K    ++ +++  DFDVILGMNWLS   AS+ C  K 
Sbjct: 535  STPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTIDFDVILGMNWLSPCHASVDCYHKL 594

Query: 168  ITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYLV 37
            +    PG+   +  G  S +   +IS + AR++L ++ C GYLV
Sbjct: 595  VRFDFPGEPSFSIQGDRSNAPTNLISVISARRLL-RQGCIGYLV 637


>gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1400

 Score =  126 bits (316), Expect = 1e-26
 Identities = 81/283 (28%), Positives = 127/283 (44%), Gaps = 45/283 (15%)
 Frame = -3

Query: 753  KMGNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCFMCKKPGHDASNCPLL 574
            ++G +    R   +SR S+   + +C  C R H G C      C+ C +PGH   +CP+ 
Sbjct: 293  RVGQRTFSSRRQQDSRQSSQVIR-SCDTCGRRHSGRCFLTTKTCYGCGQPGHIRRDCPMA 351

Query: 573  KDRREM-------------------------RGQG--------------------GTARM 529
                +                          RG+G                    G AR+
Sbjct: 352  HQSPDSARGSTQPASSAPSVAVSSGQEVSGSRGRGAGTSSQGRPSGSGHQSSIGRGQARV 411

Query: 528  YAMTQEEADQNPGTMSGMLTISGIPALVLFDTGATHSFISAKFHDITGHKDARIDVPLEI 349
            +A+TQ+EA  +   +S +L++  + A VLFD GATHSFIS  F    G    R +  L +
Sbjct: 412  FALTQQEAQTSNAVVSSILSVCNMNARVLFDPGATHSFISPCFASRLGRGRVRREEQLVV 471

Query: 348  SLPSGKTIITDVMDNNIDVNIGGKHLEADVYIIEMKDFDVILGMNWLSKYQASIKCQEKE 169
            S P  +  + +    +  V +  K    ++ +++  DFDVILGMNWLS   AS+ C  K 
Sbjct: 472  STPLKEIFVAEWEYESCVVRVKDKDTSVNLVVLDTLDFDVILGMNWLSPCHASVDCYHKL 531

Query: 168  ITLKLPGDEEITFYGVNSKSVPRVISAMKARKMLAKENCQGYL 40
            +    PG+   +  G  S +   +IS + AR++L ++ C GYL
Sbjct: 532  VRFDFPGEPSFSIQGDRSNAPTNLISVISARRLL-RQGCIGYL 573


>emb|CAN83518.1| hypothetical protein VITISV_035077 [Vitis vinifera]
          Length = 1194

 Score =  125 bits (315), Expect = 2e-26
 Identities = 79/253 (31%), Positives = 134/253 (52%)
 Frame = -3

Query: 798 AVVSEKRKFDGTKPSKMGNQNKQIRIDGNSRSSAPNNKPTCPKCNRNHYGECLAGQGKCF 619
           A++ EK   D  +  +   Q ++  I+  +  +    KP  P+     YG  L+ + K F
Sbjct: 114 ALIVEK---DNEELHQYREQQRKRNINDGAHGNQAQKKPA-PRT----YGSGLSKENKKF 165

Query: 618 MCKKPGHDASNCPLLKDRREMRGQGGTARMYAMTQEEADQNPGTMSGMLTISGIPALVLF 439
           + +KP  +       +DR+++R QG   R++AMT  +A      + G L I  + A  L 
Sbjct: 166 VFRKPKEENK-----EDRQKLRAQG---RVFAMTHRDAQTTFDVVIGTLQIHTLFARALI 217

Query: 438 DTGATHSFISAKFHDITGHKDARIDVPLEISLPSGKTIITDVMDNNIDVNIGGKHLEADV 259
           D G+THSF+S  F  + G     +D  L +++P G +++ + +  +  V IG + +  D+
Sbjct: 218 DPGSTHSFVSVSFAGLLGMSIDNMDFDLFVAIPLGDSVVVNKILRDCIVMIGYREMTVDL 277

Query: 258 YIIEMKDFDVILGMNWLSKYQASIKCQEKEITLKLPGDEEITFYGVNSKSVPRVISAMKA 79
            +++++DFDVILGMNWL+ Y ASI C  K +T  +P   +  F G +      +ISA++A
Sbjct: 278 VLLDLQDFDVILGMNWLASYHASIDCFGKIVTFNIPSRPDFGFEGKHVDKPLHMISALQA 337

Query: 78  RKMLAKENCQGYL 40
             +L K  CQG+L
Sbjct: 338 SSLLRK-GCQGFL 349

