BLASTX nr result

ID: Catharanthus22_contig00021457

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00021457
         (978 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275316.2| PREDICTED: uncharacterized protein LOC100253...   243   8e-62
emb|CBI30031.3| unnamed protein product [Vitis vinifera]              243   8e-62
ref|XP_006344034.1| PREDICTED: uncharacterized protein LOC102591...   241   4e-61
ref|XP_006344033.1| PREDICTED: uncharacterized protein LOC102591...   241   4e-61
ref|XP_006344035.1| PREDICTED: uncharacterized protein LOC102591...   238   3e-60
ref|XP_002273408.2| PREDICTED: uncharacterized protein LOC100263...   232   2e-58
ref|XP_004234821.1| PREDICTED: uncharacterized protein LOC101267...   231   3e-58
emb|CBI32175.3| unnamed protein product [Vitis vinifera]              226   7e-57
emb|CAN75880.1| hypothetical protein VITISV_024453 [Vitis vinifera]   226   8e-57
ref|XP_002512158.1| conserved hypothetical protein [Ricinus comm...   220   6e-55
gb|EXC42163.1| hypothetical protein L484_002413 [Morus notabilis]     219   9e-55
gb|EOY32472.1| Emsy N Terminus/ plant Tudor-like domains-contain...   219   1e-54
gb|EOY32471.1| Emsy N Terminus/ plant Tudor-like domains-contain...   219   1e-54
gb|EOY32469.1| Emsy N Terminus/ plant Tudor-like domains-contain...   219   1e-54
gb|EOY32468.1| Emsy N Terminus/ plant Tudor-like domains-contain...   219   1e-54
gb|EOY32466.1| Emsy N Terminus/ plant Tudor-like domains-contain...   219   1e-54
ref|XP_002285615.1| PREDICTED: uncharacterized protein LOC100257...   217   6e-54
ref|XP_006850018.1| hypothetical protein AMTR_s00022p00184660 [A...   214   3e-53
ref|XP_002330852.1| predicted protein [Populus trichocarpa] gi|5...   212   2e-52
ref|XP_006439036.1| hypothetical protein CICLE_v10031573mg [Citr...   212   2e-52

>ref|XP_002275316.2| PREDICTED: uncharacterized protein LOC100253804 [Vitis vinifera]
          Length = 431

 Score =  243 bits (620), Expect = 8e-62
 Identities = 142/241 (58%), Positives = 172/241 (71%), Gaps = 19/241 (7%)
 Frame = -2

Query: 668 SAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHR 489
           SA    +HSDME QIH LEQEAY +VLRAFKAQSDA+TW+KEGLITELRKELRVSDDEHR
Sbjct: 39  SAPYPRLHSDMETQIHHLEQEAYSSVLRAFKAQSDAITWDKEGLITELRKELRVSDDEHR 98

Query: 488 ELLTKVNADGLIHRIREWRKAGGSLGTTI-MPQTGHDQLHSPTVSASRKKQKTTHSVP-- 318
           ELL +VNAD +I RIREWR+AGG     + M Q  HDQ+ SPTVSASRKKQK + SVP  
Sbjct: 99  ELLARVNADNIIQRIREWRQAGGHQTAMLSMSQHAHDQIPSPTVSASRKKQKLSQSVPSL 158

Query: 317 -FGTASQALHPQSI-PATTQ--PLSAKRAP-LGIGGRRFQPVQQV--LSSPPTMPYQ--- 168
            FG +SQALHPQS+ PA+ Q  P + KR P LG  G++F+P Q    LSS  +M Y    
Sbjct: 159 SFGVSSQALHPQSVAPASVQPSPSTMKRGPTLGGRGKKFKPGQPFPGLSSVKSMQYHSSI 218

Query: 167 -----QPNQGTGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHAL 6
                Q    +G L  NE ++  T + L+GRKVMT + +D+NF EA+ITDY+P+EG HAL
Sbjct: 219 TAGRGQFRTSSGTLTTNEPAEPGTYDPLIGRKVMTLWPEDNNFYEAVITDYNPLEGLHAL 278

Query: 5   I 3
           +
Sbjct: 279 V 279


>emb|CBI30031.3| unnamed protein product [Vitis vinifera]
          Length = 483

 Score =  243 bits (620), Expect = 8e-62
 Identities = 142/241 (58%), Positives = 172/241 (71%), Gaps = 19/241 (7%)
 Frame = -2

Query: 668 SAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHR 489
           SA    +HSDME QIH LEQEAY +VLRAFKAQSDA+TW+KEGLITELRKELRVSDDEHR
Sbjct: 39  SAPYPRLHSDMETQIHHLEQEAYSSVLRAFKAQSDAITWDKEGLITELRKELRVSDDEHR 98

Query: 488 ELLTKVNADGLIHRIREWRKAGGSLGTTI-MPQTGHDQLHSPTVSASRKKQKTTHSVP-- 318
           ELL +VNAD +I RIREWR+AGG     + M Q  HDQ+ SPTVSASRKKQK + SVP  
Sbjct: 99  ELLARVNADNIIQRIREWRQAGGHQTAMLSMSQHAHDQIPSPTVSASRKKQKLSQSVPSL 158

Query: 317 -FGTASQALHPQSI-PATTQ--PLSAKRAP-LGIGGRRFQPVQQV--LSSPPTMPYQ--- 168
            FG +SQALHPQS+ PA+ Q  P + KR P LG  G++F+P Q    LSS  +M Y    
Sbjct: 159 SFGVSSQALHPQSVAPASVQPSPSTMKRGPTLGGRGKKFKPGQPFPGLSSVKSMQYHSSI 218

Query: 167 -----QPNQGTGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHAL 6
                Q    +G L  NE ++  T + L+GRKVMT + +D+NF EA+ITDY+P+EG HAL
Sbjct: 219 TAGRGQFRTSSGTLTTNEPAEPGTYDPLIGRKVMTLWPEDNNFYEAVITDYNPLEGLHAL 278

Query: 5   I 3
           +
Sbjct: 279 V 279


>ref|XP_006344034.1| PREDICTED: uncharacterized protein LOC102591642 isoform X2 [Solanum
           tuberosum]
          Length = 398

 Score =  241 bits (614), Expect = 4e-61
 Identities = 138/230 (60%), Positives = 165/230 (71%), Gaps = 6/230 (2%)
 Frame = -2

Query: 674 LDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDE 495
           + SA  + M+ DME +IH LEQ+AYGA+LRAFKAQSDALTWEKE LITELRKELRVSDDE
Sbjct: 37  ISSAPYQRMYVDMEVEIHNLEQDAYGAILRAFKAQSDALTWEKESLITELRKELRVSDDE 96

Query: 494 HRELLTKVNADGLIHRIREWRKAGGSLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV-- 321
           HR+LLTKVNAD  IH IREWRK  G+       Q  HDQL SPTVS SRK+ K + SV  
Sbjct: 97  HRDLLTKVNADNRIHSIREWRKTNGN-------QPVHDQLPSPTVSGSRKRPKMSQSVIM 149

Query: 320 PFGTASQALHPQSIPATTQPLS--AK-RAPLGIGGRRFQPVQQVLSSPPTMPYQQPNQG- 153
           P GT  ++LH Q+I A+TQP +  AK  A  G GG R +P QQV SS   + YQQ   G 
Sbjct: 150 PLGTPLESLHHQTIAASTQPTTPGAKWGAAPGNGGFRSRPGQQVFSSSRPVHYQQAAPGS 209

Query: 152 TGALVINELSQRTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           + AL   EL++R  + L+GR+VMTR+ DD+NF EAIITDY  V+GRHAL+
Sbjct: 210 SSALRSGELAERPRDPLIGRRVMTRWPDDNNFYEAIITDYSAVDGRHALV 259


>ref|XP_006344033.1| PREDICTED: uncharacterized protein LOC102591642 isoform X1 [Solanum
           tuberosum]
          Length = 440

 Score =  241 bits (614), Expect = 4e-61
 Identities = 138/230 (60%), Positives = 165/230 (71%), Gaps = 6/230 (2%)
 Frame = -2

Query: 674 LDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDE 495
           + SA  + M+ DME +IH LEQ+AYGA+LRAFKAQSDALTWEKE LITELRKELRVSDDE
Sbjct: 37  ISSAPYQRMYVDMEVEIHNLEQDAYGAILRAFKAQSDALTWEKESLITELRKELRVSDDE 96

Query: 494 HRELLTKVNADGLIHRIREWRKAGGSLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV-- 321
           HR+LLTKVNAD  IH IREWRK  G+       Q  HDQL SPTVS SRK+ K + SV  
Sbjct: 97  HRDLLTKVNADNRIHSIREWRKTNGN-------QPVHDQLPSPTVSGSRKRPKMSQSVIM 149

Query: 320 PFGTASQALHPQSIPATTQPLS--AK-RAPLGIGGRRFQPVQQVLSSPPTMPYQQPNQG- 153
           P GT  ++LH Q+I A+TQP +  AK  A  G GG R +P QQV SS   + YQQ   G 
Sbjct: 150 PLGTPLESLHHQTIAASTQPTTPGAKWGAAPGNGGFRSRPGQQVFSSSRPVHYQQAAPGS 209

Query: 152 TGALVINELSQRTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           + AL   EL++R  + L+GR+VMTR+ DD+NF EAIITDY  V+GRHAL+
Sbjct: 210 SSALRSGELAERPRDPLIGRRVMTRWPDDNNFYEAIITDYSAVDGRHALV 259


>ref|XP_006344035.1| PREDICTED: uncharacterized protein LOC102591642 isoform X3 [Solanum
           tuberosum]
          Length = 396

 Score =  238 bits (606), Expect = 3e-60
 Identities = 136/222 (61%), Positives = 161/222 (72%), Gaps = 6/222 (2%)
 Frame = -2

Query: 650 MHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKV 471
           M+ DME +IH LEQ+AYGA+LRAFKAQSDALTWEKE LITELRKELRVSDDEHR+LLTKV
Sbjct: 1   MYVDMEVEIHNLEQDAYGAILRAFKAQSDALTWEKESLITELRKELRVSDDEHRDLLTKV 60

Query: 470 NADGLIHRIREWRKAGGSLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV--PFGTASQA 297
           NAD  IH IREWRK  G+       Q  HDQL SPTVS SRK+ K + SV  P GT  ++
Sbjct: 61  NADNRIHSIREWRKTNGN-------QPVHDQLPSPTVSGSRKRPKMSQSVIMPLGTPLES 113

Query: 296 LHPQSIPATTQPLS--AK-RAPLGIGGRRFQPVQQVLSSPPTMPYQQPNQG-TGALVINE 129
           LH Q+I A+TQP +  AK  A  G GG R +P QQV SS   + YQQ   G + AL   E
Sbjct: 114 LHHQTIAASTQPTTPGAKWGAAPGNGGFRSRPGQQVFSSSRPVHYQQAAPGSSSALRSGE 173

Query: 128 LSQRTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           L++R  + L+GR+VMTR+ DD+NF EAIITDY  V+GRHAL+
Sbjct: 174 LAERPRDPLIGRRVMTRWPDDNNFYEAIITDYSAVDGRHALV 215


>ref|XP_002273408.2| PREDICTED: uncharacterized protein LOC100263217 [Vitis vinifera]
          Length = 424

 Score =  232 bits (591), Expect(2) = 2e-58
 Identities = 132/222 (59%), Positives = 161/222 (72%), Gaps = 8/222 (3%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 50  TDMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNA 109

Query: 464 DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSVP---FGTASQA 297
           D +I RIREWR+AGG   G     Q  HD + SPTVSASRKKQK T S+P   FG  SQ+
Sbjct: 110 DDVIRRIREWRQAGGLQPGMLTTGQAVHDPIPSPTVSASRKKQKITQSIPSQSFGGPSQS 169

Query: 296 LHPQSIPATTQPLS--AKRAP-LGIGGRRFQPVQQVLSSPPTMPYQQPNQGTGALVINEL 126
            HPQ+I A+ QP S  AKR P LG  G++ +   Q  SS PT   Q  N+G     +NE 
Sbjct: 170 FHPQAIAASNQPSSSAAKRGPILGPKGKKHKSSMQYASSGPTGRGQVANRG-----VNEP 224

Query: 125 SQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           ++  T + L+GRKV TR+ DD+NF EA+ITDY+PVEGRHAL+
Sbjct: 225 AEAATFDPLIGRKVRTRWPDDNNFYEAVITDYNPVEGRHALV 266



 Score = 21.9 bits (45), Expect(2) = 2e-58
 Identities = 8/15 (53%), Positives = 10/15 (66%)
 Frame = -3

Query: 721 NSVPTGGLALGSGRS 677
           N +P GG   G+GRS
Sbjct: 21  NRIPRGGRVAGNGRS 35


>ref|XP_004234821.1| PREDICTED: uncharacterized protein LOC101267780 [Solanum
           lycopersicum]
          Length = 392

 Score =  231 bits (589), Expect = 3e-58
 Identities = 132/221 (59%), Positives = 160/221 (72%), Gaps = 5/221 (2%)
 Frame = -2

Query: 650 MHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKV 471
           M+ DME +IH LEQ+AYGA+LRAFKAQSDALTWEKE LITELRKELRVSDDEHR+LLTKV
Sbjct: 1   MYVDMEVEIHNLEQDAYGAILRAFKAQSDALTWEKESLITELRKELRVSDDEHRDLLTKV 60

Query: 470 NADGLIHRIREWRKAGGSLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV-PFGTASQAL 294
           NAD  IH IREWRK  G+       Q  HDQL SPTVS SRK+ K+   + P G   ++L
Sbjct: 61  NADNRIHSIREWRKTNGN-------QPIHDQLPSPTVSGSRKRAKSQSVIMPLGAPLESL 113

Query: 293 HPQSIPATTQPLS--AK-RAPLGIGGRRFQPVQQVLSSPPTMPYQQPNQGTGALV-INEL 126
           H Q+I A TQP +  AK  A  G GG R +P QQV SS P + YQQ   G+ +++   EL
Sbjct: 114 HHQTIAANTQPTTPGAKWGAAPGNGGFRSRPGQQVFSSRP-VHYQQAGPGSSSVLRSGEL 172

Query: 125 SQRTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           ++R  + L+GR+VMTR+ DD+NF EAIITDY  V+GRHAL+
Sbjct: 173 AERPRDPLIGRRVMTRWPDDNNFYEAIITDYSAVDGRHALV 213


>emb|CBI32175.3| unnamed protein product [Vitis vinifera]
          Length = 433

 Score =  226 bits (577), Expect(2) = 7e-57
 Identities = 133/231 (57%), Positives = 162/231 (70%), Gaps = 17/231 (7%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 50  TDMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNA 109

Query: 464 DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSVP---FGTASQA 297
           D +I RIREWR+AGG   G     Q  HD + SPTVSASRKKQK T S+P   FG  SQ+
Sbjct: 110 DDVIRRIREWRQAGGLQPGMLTTGQAVHDPIPSPTVSASRKKQKITQSIPSQSFGGPSQS 169

Query: 296 LHPQSIPATTQPLS--AKRAP-LGIGGRRFQPV---------QQVLSSPPTMPYQQPNQG 153
            HPQ+I A+ QP S  AKR P LG  G++ + V          Q  SS PT   Q  N+G
Sbjct: 170 FHPQAIAASNQPSSSAAKRGPILGPKGKKHKSVLPGASSMKSMQYASSGPTGRGQVANRG 229

Query: 152 TGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
                +NE ++  T + L+GRKV TR+ DD+NF EA+ITDY+PVEGRHAL+
Sbjct: 230 -----VNEPAEAATFDPLIGRKVRTRWPDDNNFYEAVITDYNPVEGRHALV 275



 Score = 21.9 bits (45), Expect(2) = 7e-57
 Identities = 8/15 (53%), Positives = 10/15 (66%)
 Frame = -3

Query: 721 NSVPTGGLALGSGRS 677
           N +P GG   G+GRS
Sbjct: 21  NRIPRGGRVAGNGRS 35


>emb|CAN75880.1| hypothetical protein VITISV_024453 [Vitis vinifera]
          Length = 1348

 Score =  226 bits (577), Expect = 8e-57
 Identities = 133/231 (57%), Positives = 162/231 (70%), Gaps = 17/231 (7%)
 Frame = -2

Query: 644  SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
            +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 441  TDMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNA 500

Query: 464  DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSVP---FGTASQA 297
            D +I RIREWR+AGG   G     Q  HD + SPTVSASRKKQK T S+P   FG  SQ+
Sbjct: 501  DDVIRRIREWRQAGGLQPGMLTTGQAVHDPIPSPTVSASRKKQKITQSIPSQSFGGPSQS 560

Query: 296  LHPQSIPATTQPLS--AKRAP-LGIGGRRFQPV---------QQVLSSPPTMPYQQPNQG 153
             HPQ+I A+ QP S  AKR P LG  G++ + V          Q  SS PT   Q  N+G
Sbjct: 561  FHPQAIAASNQPSSSAAKRGPILGPKGKKHKSVLPGASSMKSMQYASSGPTGRGQVANRG 620

Query: 152  TGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
                 +NE ++  T + L+GRKV TR+ DD+NF EA+ITDY+PVEGRHAL+
Sbjct: 621  -----VNEPAEAATFDPLIGRKVRTRWPDDNNFYEAVITDYNPVEGRHALV 666


>ref|XP_002512158.1| conserved hypothetical protein [Ricinus communis]
           gi|223548702|gb|EEF50192.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 430

 Score =  220 bits (561), Expect = 6e-55
 Identities = 126/226 (55%), Positives = 154/226 (68%), Gaps = 12/226 (5%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 49  TDMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGRVNA 108

Query: 464 DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSVP---FGTASQA 297
           D +I RIREWR+ GG   G     Q  HD + SPTVSASRKKQK T SVP   FG  S +
Sbjct: 109 DDVIRRIREWRQTGGLQSGMLGTGQAVHDPIPSPTVSASRKKQKITPSVPSQSFGGPSPS 168

Query: 296 LHPQSIPATTQP-LSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQPNQGTGALVINELS 123
            HPQ++ A+ QP  SAKR P+ G   ++ +P     SS  ++PY          V N +S
Sbjct: 169 FHPQAVSASHQPSSSAKRGPVSGSKSKKQKPALPGASSMKSIPYPSSGPSGRGQVANRIS 228

Query: 122 QRTC------ESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
                     ESLVG+KV TR+ DD+NF EA+IT Y+PVEGRHAL+
Sbjct: 229 SGAAPEGAIPESLVGKKVKTRWPDDNNFYEAVITQYNPVEGRHALV 274


>gb|EXC42163.1| hypothetical protein L484_002413 [Morus notabilis]
          Length = 424

 Score =  219 bits (559), Expect(2) = 9e-55
 Identities = 130/232 (56%), Positives = 163/232 (70%), Gaps = 18/232 (7%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+D +TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 49  TDMEAQIHQLEQEAYSSVLRAFKAQADLITWEKESLITELRKELRLSNEEHRELLGRVNA 108

Query: 464 DGLIHRIREWRKAGGS----LGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV---PFGTA 306
           D +I RIREWR+AGG+    LGT+   Q  HD + SPTVSASRKKQK + SV    FG  
Sbjct: 109 DDVIRRIREWRQAGGAQPGMLGTS---QAVHDPIPSPTVSASRKKQKGSQSVASQSFGAP 165

Query: 305 SQALHPQSIPATTQPLS--AKRAPL-GIGGRRFQP-------VQQVLSSPPTMPYQQPNQ 156
           S   HPQ+I A+ QP S  AKR  + G  G++ +P       V+Q  SS PT   Q  N+
Sbjct: 166 SPPFHPQAIAASHQPPSSTAKRGYVPGAKGKKHKPVLPGGSSVKQYPSSGPTGRGQAVNR 225

Query: 155 GTGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
            + + V NE ++    +SL+G+KV TR+ DD+NF EA+ITDYDP EGRHAL+
Sbjct: 226 VSSSAVANEPAEGAAIDSLIGKKVRTRWPDDNNFYEAVITDYDPAEGRHALV 277



 Score = 21.9 bits (45), Expect(2) = 9e-55
 Identities = 8/15 (53%), Positives = 10/15 (66%)
 Frame = -3

Query: 721 NSVPTGGLALGSGRS 677
           N +P GG   G+GRS
Sbjct: 21  NRIPRGGRVAGNGRS 35


>gb|EOY32472.1| Emsy N Terminus/ plant Tudor-like domains-containing protein
           isoform 7 [Theobroma cacao]
          Length = 366

 Score =  219 bits (558), Expect = 1e-54
 Identities = 127/235 (54%), Positives = 160/235 (68%), Gaps = 7/235 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   + SA L  MHSDME QIH++EQEAY +VLRAFKAQSDALTWEKE LITELRKELRV
Sbjct: 35  RSAVVGSAPLPRMHSDMETQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRV 94

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD ++ RIREWR A G   G     Q  HD + SPTVS SRKKQKT+
Sbjct: 95  SDEEHRELLLRVNADDILRRIREWRTASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTS 154

Query: 329 HSV---PFGTASQALHPQSIPATTQPLSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQP 162
            SV     G  S ALHP   P+++   + +R PL G   ++ +   Q  S+   +  Q P
Sbjct: 155 QSVASLSMGAPSPALHPSMQPSSS---ALRRGPLPGAKSKKSKSSTQYPSTGLPVRPQAP 211

Query: 161 NQ-GTGALVINELSQRT-CESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           N+  +GA   NE ++    + L+GRKV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 212 NRTSSGAFATNEPAEAAPYDPLIGRKVWTRWPEDNHFYEAVITDYNPVEGRHALV 266


>gb|EOY32471.1| Emsy N Terminus/ plant Tudor-like domains-containing protein
           isoform 6 [Theobroma cacao]
          Length = 393

 Score =  219 bits (558), Expect = 1e-54
 Identities = 127/235 (54%), Positives = 160/235 (68%), Gaps = 7/235 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   + SA L  MHSDME QIH++EQEAY +VLRAFKAQSDALTWEKE LITELRKELRV
Sbjct: 35  RSAVVGSAPLPRMHSDMETQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRV 94

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD ++ RIREWR A G   G     Q  HD + SPTVS SRKKQKT+
Sbjct: 95  SDEEHRELLLRVNADDILRRIREWRTASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTS 154

Query: 329 HSV---PFGTASQALHPQSIPATTQPLSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQP 162
            SV     G  S ALHP   P+++   + +R PL G   ++ +   Q  S+   +  Q P
Sbjct: 155 QSVASLSMGAPSPALHPSMQPSSS---ALRRGPLPGAKSKKSKSSTQYPSTGLPVRPQAP 211

Query: 161 NQ-GTGALVINELSQRT-CESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           N+  +GA   NE ++    + L+GRKV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 212 NRTSSGAFATNEPAEAAPYDPLIGRKVWTRWPEDNHFYEAVITDYNPVEGRHALV 266


>gb|EOY32469.1| Emsy N Terminus/ plant Tudor-like domains-containing protein
           isoform 4 [Theobroma cacao] gi|508785214|gb|EOY32470.1|
           Emsy N Terminus/ plant Tudor-like domains-containing
           protein isoform 4 [Theobroma cacao]
          Length = 412

 Score =  219 bits (558), Expect = 1e-54
 Identities = 127/235 (54%), Positives = 160/235 (68%), Gaps = 7/235 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   + SA L  MHSDME QIH++EQEAY +VLRAFKAQSDALTWEKE LITELRKELRV
Sbjct: 35  RSAVVGSAPLPRMHSDMETQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRV 94

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD ++ RIREWR A G   G     Q  HD + SPTVS SRKKQKT+
Sbjct: 95  SDEEHRELLLRVNADDILRRIREWRTASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTS 154

Query: 329 HSV---PFGTASQALHPQSIPATTQPLSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQP 162
            SV     G  S ALHP   P+++   + +R PL G   ++ +   Q  S+   +  Q P
Sbjct: 155 QSVASLSMGAPSPALHPSMQPSSS---ALRRGPLPGAKSKKSKSSTQYPSTGLPVRPQAP 211

Query: 161 NQ-GTGALVINELSQRT-CESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           N+  +GA   NE ++    + L+GRKV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 212 NRTSSGAFATNEPAEAAPYDPLIGRKVWTRWPEDNHFYEAVITDYNPVEGRHALV 266


>gb|EOY32468.1| Emsy N Terminus/ plant Tudor-like domains-containing protein
           isoform 3 [Theobroma cacao]
          Length = 452

 Score =  219 bits (558), Expect = 1e-54
 Identities = 127/235 (54%), Positives = 160/235 (68%), Gaps = 7/235 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   + SA L  MHSDME QIH++EQEAY +VLRAFKAQSDALTWEKE LITELRKELRV
Sbjct: 35  RSAVVGSAPLPRMHSDMETQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRV 94

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD ++ RIREWR A G   G     Q  HD + SPTVS SRKKQKT+
Sbjct: 95  SDEEHRELLLRVNADDILRRIREWRTASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTS 154

Query: 329 HSV---PFGTASQALHPQSIPATTQPLSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQP 162
            SV     G  S ALHP   P+++   + +R PL G   ++ +   Q  S+   +  Q P
Sbjct: 155 QSVASLSMGAPSPALHPSMQPSSS---ALRRGPLPGAKSKKSKSSTQYPSTGLPVRPQAP 211

Query: 161 NQ-GTGALVINELSQRT-CESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           N+  +GA   NE ++    + L+GRKV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 212 NRTSSGAFATNEPAEAAPYDPLIGRKVWTRWPEDNHFYEAVITDYNPVEGRHALV 266


>gb|EOY32466.1| Emsy N Terminus/ plant Tudor-like domains-containing protein
           isoform 1 [Theobroma cacao] gi|508785211|gb|EOY32467.1|
           Emsy N Terminus/ plant Tudor-like domains-containing
           protein isoform 1 [Theobroma cacao]
          Length = 453

 Score =  219 bits (558), Expect = 1e-54
 Identities = 127/235 (54%), Positives = 160/235 (68%), Gaps = 7/235 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   + SA L  MHSDME QIH++EQEAY +VLRAFKAQSDALTWEKE LITELRKELRV
Sbjct: 35  RSAVVGSAPLPRMHSDMETQIHLIEQEAYSSVLRAFKAQSDALTWEKESLITELRKELRV 94

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD ++ RIREWR A G   G     Q  HD + SPTVS SRKKQKT+
Sbjct: 95  SDEEHRELLLRVNADDILRRIREWRTASGLQPGMLSTSQPVHDTVPSPTVSGSRKKQKTS 154

Query: 329 HSV---PFGTASQALHPQSIPATTQPLSAKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQP 162
            SV     G  S ALHP   P+++   + +R PL G   ++ +   Q  S+   +  Q P
Sbjct: 155 QSVASLSMGAPSPALHPSMQPSSS---ALRRGPLPGAKSKKSKSSTQYPSTGLPVRPQAP 211

Query: 161 NQ-GTGALVINELSQRT-CESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           N+  +GA   NE ++    + L+GRKV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 212 NRTSSGAFATNEPAEAAPYDPLIGRKVWTRWPEDNHFYEAVITDYNPVEGRHALV 266


>ref|XP_002285615.1| PREDICTED: uncharacterized protein LOC100257061 [Vitis vinifera]
          Length = 449

 Score =  217 bits (552), Expect = 6e-54
 Identities = 127/234 (54%), Positives = 161/234 (68%), Gaps = 6/234 (2%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R   L SA    MH DME QIH +EQEAY +VLRAFKAQSDA+TWEKE LITELRKELRV
Sbjct: 34  RSGVLGSAPFPRMHGDMEAQIHHIEQEAYSSVLRAFKAQSDAITWEKESLITELRKELRV 93

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGGSLGTTIMPQTGHDQLHSPTVSASRKKQKTTH 327
           SD+EHRELL++VNAD +I  IREWRK GG L   +  Q  H+ + SPTVSASRKKQKT+ 
Sbjct: 94  SDEEHRELLSRVNADDIIRSIREWRK-GGGLQHGMAAQPVHESIPSPTVSASRKKQKTSQ 152

Query: 326 SV---PFGTASQALHPQSIPATTQPLSAKRAPLGIG-GRRFQPVQQVLSSPPTMPYQQPN 159
           S+     G  S ALHP   P+++   + K AP      ++ +P  Q  S+  T   Q  N
Sbjct: 153 SIASLSLGAPSPALHPTMQPSSS---ALKHAPTPRSKSKKTKPSMQYPSAGLTGRPQLAN 209

Query: 158 QG-TGALVINELSQ-RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           +G +GA V NE ++  T + L+G+KV TR+ +D++F EA+ITDY+PVEGRHAL+
Sbjct: 210 RGSSGAFVGNEAAEGTTYDPLIGKKVWTRWPEDNHFYEAVITDYNPVEGRHALV 263


>ref|XP_006850018.1| hypothetical protein AMTR_s00022p00184660 [Amborella trichopoda]
           gi|548853616|gb|ERN11599.1| hypothetical protein
           AMTR_s00022p00184660 [Amborella trichopoda]
          Length = 421

 Score =  214 bits (546), Expect = 3e-53
 Identities = 130/248 (52%), Positives = 168/248 (67%), Gaps = 20/248 (8%)
 Frame = -2

Query: 686 RKVFLDSAALRMMHSDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRV 507
           R V + S     + +DME +IH +EQEAYG+VLRAFKAQ+DA+TWEKEGL++ELRKELRV
Sbjct: 29  RAVPVGSLPYSRVQNDMETEIHWIEQEAYGSVLRAFKAQADAITWEKEGLMSELRKELRV 88

Query: 506 SDDEHRELLTKVNADGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTT 330
           SD+EHRELL +VNAD  I RIREWRK+GG   G     Q GHD   SPTVSASRKKQKT+
Sbjct: 89  SDEEHRELLARVNADETIRRIREWRKSGGLQPGLLGAAQPGHDPSPSPTVSASRKKQKTS 148

Query: 329 HSVP---FGTASQALHPQSIPATTQPLS--AKR-APLGIGGRRFQPVQQV--LSSPPTMP 174
           H +P       S  + PQ++ A+  P S  AKR A +G  G++ +  Q V  +SS   + 
Sbjct: 149 HQMPSLSLNAPSPNIPPQTVAASMHPSSSAAKRGAVVGARGKKPKSGQTVPGVSSMKPLQ 208

Query: 173 Y---------QQPNQG-TGALVINELSQRTC-ESLVGRKVMTRFSDDSNFCEAIITDYDP 27
           Y         Q  N+G +GALV +E ++    + L+GRKVMTR+ +D+NF EA+ITDY+P
Sbjct: 209 YSSTPLAGRGQVANRGSSGALVASEPAEAAAFDPLIGRKVMTRWPEDNNFYEAVITDYNP 268

Query: 26  VEGRHALI 3
            EGRHAL+
Sbjct: 269 KEGRHALV 276


>ref|XP_002330852.1| predicted protein [Populus trichocarpa]
           gi|566150592|ref|XP_006369456.1| emsy N terminus
           domain-containing family protein [Populus trichocarpa]
           gi|550348005|gb|ERP66025.1| emsy N terminus
           domain-containing family protein [Populus trichocarpa]
          Length = 433

 Score =  212 bits (540), Expect = 2e-52
 Identities = 119/225 (52%), Positives = 152/225 (67%), Gaps = 11/225 (4%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 49  TDMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLARVNA 108

Query: 464 DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSV---PFGTASQA 297
           D +I RIREWR+AGG   G     Q  HD + SPTVSASRKKQK T S+    F   S +
Sbjct: 109 DDVIRRIREWRQAGGHQSGMLTTGQAVHDPIPSPTVSASRKKQKMTSSILSQSFAGPSPS 168

Query: 296 LHPQSIPATTQPLS--AKRAPL-GIGGRRFQPVQQVLSSPPTMPYQQPNQGTGALVINEL 126
            HPQ + A+ QP S  AKR P+ G  G++ +P     SS  ++PY          V N L
Sbjct: 169 FHPQPVSASQQPSSSAAKRGPVTGPKGKKQKPGLPGASSMKSIPYPSSGPSGRGQVANRL 228

Query: 125 SQ----RTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           S        +  +G++V TR+ DD++F EA+ITD++P+EGRHAL+
Sbjct: 229 SSGAVPEGADQYIGKRVKTRWPDDNHFYEAVITDFNPIEGRHALV 273


>ref|XP_006439036.1| hypothetical protein CICLE_v10031573mg [Citrus clementina]
           gi|557541232|gb|ESR52276.1| hypothetical protein
           CICLE_v10031573mg [Citrus clementina]
          Length = 432

 Score =  212 bits (539), Expect = 2e-52
 Identities = 126/232 (54%), Positives = 154/232 (66%), Gaps = 18/232 (7%)
 Frame = -2

Query: 644 SDMEYQIHVLEQEAYGAVLRAFKAQSDALTWEKEGLITELRKELRVSDDEHRELLTKVNA 465
           +DME QIH LEQEAY +VLRAFKAQ+DA+TWEKE LITELRKELR+S++EHRELL +VNA
Sbjct: 49  ADMETQIHQLEQEAYSSVLRAFKAQADAITWEKESLITELRKELRLSNEEHRELLGQVNA 108

Query: 464 DGLIHRIREWRKAGG-SLGTTIMPQTGHDQLHSPTVSASRKKQKTTHSVP---FGTASQA 297
           D  I RIREWR+AGG   G   + Q  HD + SPTVSAS K+QK T SVP   FG  S +
Sbjct: 109 DDTIRRIREWRQAGGLQPGMHSIGQGVHDPIPSPTVSASHKRQKITQSVPSQSFGGPSPS 168

Query: 296 LHPQSIPATTQPLS--AKRAP-LGIGGRRFQP---------VQQVLSSPPTMPYQQPNQG 153
            HPQ +  + QP S  AKR P  G  G++ +P           Q  SS P    Q  N+ 
Sbjct: 169 FHPQPVTTSHQPSSSAAKRGPATGSKGKKHKPGLPGVPSMKSMQYPSSGPAGRGQVANRA 228

Query: 152 T--GALVINELSQRTCESLVGRKVMTRFSDDSNFCEAIITDYDPVEGRHALI 3
           T   ALV       T + L+G++V TR+ DD+NF EA+ITDY+PVEGRHAL+
Sbjct: 229 TSGAALVSEPPDGATFDPLIGKRVRTRWPDDNNFYEAVITDYNPVEGRHALV 280

