BLASTX nr result

ID: Catharanthus22_contig00046577

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00046577
         (345 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]    84   3e-14
gb|EOX99837.1| Uncharacterized protein TCM_008802 [Theobroma cacao]    78   1e-12
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]    77   3e-12
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]    77   3e-12
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]    75   9e-12
gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao]    74   2e-11
gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]    74   2e-11
gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]    74   2e-11
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    73   3e-11
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]    73   4e-11
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]    72   8e-11
gb|EOY22164.1| Uncharacterized protein TCM_014381 [Theobroma cacao]    70   2e-10
gb|EOY20267.1| Uncharacterized protein TCM_045622 [Theobroma cacao]    70   2e-10
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...    70   2e-10
ref|XP_004305774.1| PREDICTED: uncharacterized protein LOC101293...    70   2e-10
gb|EOY31736.1| Uncharacterized protein TCM_038852 [Theobroma cacao]    69   8e-10
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]    66   1e-09
gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptas...    68   1e-09
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]    67   2e-09
ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250...    67   2e-09

>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 83.6 bits (205), Expect = 3e-14
 Identities = 35/77 (45%), Positives = 53/77 (68%)
 Frame = -1

Query: 231  TPCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTV 52
            TP + + ++N  L   PSL+EV+EAVF++ +DS  GPDGF   F+ HCWDII +D+ + V
Sbjct: 1262 TPRIISTTDNEFLCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAV 1321

Query: 51   CDFWVGLSIPKSFSSTS 1
             DF+ G  +P+  +ST+
Sbjct: 1322 LDFFKGSPLPRGITSTT 1338


>gb|EOX99837.1| Uncharacterized protein TCM_008802 [Theobroma cacao]
          Length = 372

 Score = 77.8 bits (190), Expect = 1e-12
 Identities = 39/107 (36%), Positives = 61/107 (57%)
 Frame = -1

Query: 321 IKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVREAVFSLG 142
           +KS  V  FS L   +        + L+   P +   +EN  L  +PS+EE+ EAVF++ 
Sbjct: 197 VKSSVVDFFSSLMKKEPCDMSRFDTSLI---PAIIFENENLSLCAVPSMEELEEAVFNID 253

Query: 141 RDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSSTS 1
           +D+  GPDGFF  F+  CWDI++ D    V DF+ G+ +P+  +ST+
Sbjct: 254 KDNVVGPDGFFSYFYQQCWDIVANDFLDAVVDFFHGIDLPRGITSTT 300


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 76.6 bits (187), Expect = 3e-12
 Identities = 41/113 (36%), Positives = 65/113 (57%)
 Frame = -1

Query: 339  LDSQYHIKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVRE 160
            ++ Q  +K   +  FS L   +        S L+   P + + SEN +L   PSL+EV++
Sbjct: 1263 IEDQEQLKHSAIEYFSSLLKVEPCYDSRFQSSLI---PSIISNSENELLCAEPSLQEVKD 1319

Query: 159  AVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSSTS 1
            AVF +  +SA GPDGF   F+  CW+II++D+   V DF+ G +IP+  +ST+
Sbjct: 1320 AVFGINSESAAGPDGFSSYFYQQCWNIIAQDLLDAVRDFFHGANIPRGVTSTT 1372


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 76.6 bits (187), Expect = 3e-12
 Identities = 32/76 (42%), Positives = 49/76 (64%)
 Frame = -1

Query: 228  PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
            P + + ++N  L   P L+E++EAVF++ +DS  GPDGF   F+ HCWDII  D+   V 
Sbjct: 1176 PRIISSADNEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVL 1235

Query: 48   DFWVGLSIPKSFSSTS 1
            DF+ G  +P+  +ST+
Sbjct: 1236 DFFRGSPLPRGVTSTT 1251


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 75.1 bits (183), Expect = 9e-12
 Identities = 32/76 (42%), Positives = 50/76 (65%)
 Frame = -1

Query: 228  PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
            P   + ++N  L   PSL+E++E VF++ +DS  GPDGF   F+ HCWDII +D+ + V 
Sbjct: 1002 PRTISITDNEFLCAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVL 1061

Query: 48   DFWVGLSIPKSFSSTS 1
            DF+ G  +P+  +ST+
Sbjct: 1062 DFFNGTPMPQGVTSTT 1077


>gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao]
          Length = 1245

 Score = 74.3 bits (181), Expect = 2e-11
 Identities = 29/66 (43%), Positives = 47/66 (71%)
 Frame = -1

Query: 198 MLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPK 19
           ++D  PSL+E+++ VF++ +DS  GPDGF   F+ HCWDII +D+ + V DF+ G  +P+
Sbjct: 761 LIDAAPSLKEIKDVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFKGTPMPR 820

Query: 18  SFSSTS 1
             +ST+
Sbjct: 821 GVTSTT 826


>gb|EOY02242.1| Uncharacterized protein TCM_016767 [Theobroma cacao]
          Length = 1707

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 32/76 (42%), Positives = 48/76 (63%)
 Frame = -1

Query: 228  PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
            P + + ++N  L   PSL+EV+E VF++ +DS  G DGF   F+ HCWDII  D+   V 
Sbjct: 1133 PRIISSADNEFLCAAPSLQEVKETVFNINKDSVAGSDGFSSLFYQHCWDIIKHDLLDAVL 1192

Query: 48   DFWVGLSIPKSFSSTS 1
            DF+ G  +P+  +ST+
Sbjct: 1193 DFFRGSPLPRGVTSTT 1208


>gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao]
          Length = 1659

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 32/76 (42%), Positives = 49/76 (64%)
 Frame = -1

Query: 228  PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
            P   + ++N  L   PSL+E+ E VF++ +DS  GPDGF   F+ HCWDII +D+ + V 
Sbjct: 899  PRTISITDNEFLCAAPSLKEINEVVFNIDKDSVVGPDGFSSLFYQHCWDIIKQDLLEAVL 958

Query: 48   DFWVGLSIPKSFSSTS 1
            DF+ G  +P+  +ST+
Sbjct: 959  DFFNGAPMPQGVTSTT 974


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 39/113 (34%), Positives = 63/113 (55%)
 Frame = -1

Query: 339  LDSQYHIKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVRE 160
            ++ Q  +K   +  FS L   +          L+   P + + SEN +L   P+L+EV++
Sbjct: 1435 IEDQEQLKQSAIKYFSSLLKFEPCDDSRFQRSLI---PSIISNSENELLCAEPNLQEVKD 1491

Query: 159  AVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSSTS 1
            AVF +  +SA GPDGF   F+  CW+II+ D+   V DF+ G +IP+  +ST+
Sbjct: 1492 AVFGIDPESAAGPDGFSSYFYQQCWNIIAHDLLDAVRDFFHGANIPRGVTSTT 1544


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 72.8 bits (177), Expect = 4e-11
 Identities = 32/76 (42%), Positives = 50/76 (65%)
 Frame = -1

Query: 228  PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
            P + + ++N  L   P+L+EV+EAVF +  +SA GPDGF   F+  CWDII+ D+ + V 
Sbjct: 1262 PSIISDTDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAHDLFEAVK 1321

Query: 48   DFWVGLSIPKSFSSTS 1
            +F+ G  IP+  +ST+
Sbjct: 1322 EFFHGADIPQGMTSTT 1337


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 38/113 (33%), Positives = 63/113 (55%)
 Frame = -1

Query: 339  LDSQYHIKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVRE 160
            ++ Q  +K   +  FS L   +        + L+   P + + SEN +L   P+L+EV++
Sbjct: 1265 IEDQEQLKQSAIEYFSSLLKAEPCDISRFQNSLI---PSIISNSENELLCAEPNLQEVKD 1321

Query: 159  AVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSSTS 1
            AVF +  +SA GPDGF   F+  CW+ I+ D+   V DF+ G +IP+  +ST+
Sbjct: 1322 AVFDIDPESAAGPDGFSSYFYQQCWNTIAHDLLDAVRDFFHGANIPRGVTSTT 1374


>gb|EOY22164.1| Uncharacterized protein TCM_014381 [Theobroma cacao]
          Length = 250

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 30/69 (43%), Positives = 46/69 (66%)
 Frame = -1

Query: 207 ENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLS 28
           +N +   +PSL EV+EA+F + ++S  GPDG    F+ HCW IISKD+ + V  F+ G +
Sbjct: 123 DNDVPCAIPSLHEVQEAIFDIAKNSVVGPDGLSSFFYKHCWSIISKDLLEVVTGFFQGAT 182

Query: 27  IPKSFSSTS 1
           +PK  +ST+
Sbjct: 183 LPKGMTSTT 191


>gb|EOY20267.1| Uncharacterized protein TCM_045622 [Theobroma cacao]
          Length = 1232

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 28/61 (45%), Positives = 43/61 (70%)
 Frame = -1

Query: 183 PSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSST 4
           PSL +++E VF++ +DS   PDGF   F+ HCWDII +D+ + V DF+ G S+P+  +ST
Sbjct: 630 PSLSKIKEVVFNINKDSVASPDGFSSLFYQHCWDIIKQDLLEAVLDFFKGSSLPRGVTST 689

Query: 3   S 1
           +
Sbjct: 690 T 690


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 30/79 (37%), Positives = 52/79 (65%)
 Frame = -1

Query: 237 DYTPCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQ 58
           ++ P + + ++N++L   P L+EV++AVF++ +DS  GPDGF   F+  CW II++D+  
Sbjct: 380 EFIPQMLSDADNNLLCAEPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLA 439

Query: 57  TVCDFWVGLSIPKSFSSTS 1
            V DF+ G   P+  +ST+
Sbjct: 440 AVRDFFKGAVFPRGVTSTT 458


>ref|XP_004305774.1| PREDICTED: uncharacterized protein LOC101293221 [Fragaria vesca
           subsp. vesca]
          Length = 461

 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 39/102 (38%), Positives = 55/102 (53%)
 Frame = -1

Query: 342 ILDSQYHIKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVR 163
           IL     I +  V  +  L+S  +T  P +L  +    P L T +EN  L  +PS EE++
Sbjct: 255 ILFEPSDIVAHVVGFYQNLYSSSST--PRNLDEVCSVIPSLVTNAENDWLTVIPSTEEIK 312

Query: 162 EAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWV 37
            AVF++   SAPGPDGF   F+  CWDI+  DV   V  F++
Sbjct: 313 NAVFAMDASSAPGPDGFPGCFYQSCWDIVGSDVVACVRQFFM 354


>gb|EOY31736.1| Uncharacterized protein TCM_038852 [Theobroma cacao]
          Length = 456

 Score = 68.6 bits (166), Expect = 8e-10
 Identities = 36/107 (33%), Positives = 60/107 (56%)
 Frame = -1

Query: 321 IKSEPVWDFSMLFSDDTTSRPDSLSPLLDYTPCLFTPSENSMLDRLPSLEEVREAVFSLG 142
           IK+  V  FS L   +        S ++     L + ++N+ L    +++EV+EAVF++ 
Sbjct: 41  IKASTVEFFSSLMKKEQCDLTRFNSSIIS---TLVSATDNNFLCAALTIQEVKEAVFAID 97

Query: 141 RDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSSTS 1
           +DS   PDGF   F+ HCWDI++ D+   V DF+ G  +P+  +ST+
Sbjct: 98  KDSIAEPDGFSSFFYQHCWDILANDLIAAVLDFFQGTYLPRGITSTT 144


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 66.2 bits (160), Expect(2) = 1e-09
 Identities = 29/61 (47%), Positives = 41/61 (67%)
 Frame = -1

Query: 183  PSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDFWVGLSIPKSFSST 4
            P+L+EV+E VF +  +SA GPDGF   F+  CWDII+ D+   V DF+ G  IP+  +ST
Sbjct: 2728 PNLQEVKEVVFGMDPESAAGPDGFSSHFYQQCWDIIAYDLFDAVKDFFQGADIPQGVTST 2787

Query: 3    S 1
            +
Sbjct: 2788 T 2788



 Score = 21.9 bits (45), Expect(2) = 1e-09
 Identities = 10/25 (40%), Positives = 13/25 (52%)
 Frame = -2

Query: 266  PGLIRFLLCWTTLHVFLLPLKIPCL 192
            P   RFL  WT  H F + ++ P L
Sbjct: 2706 PSSFRFLHAWTLHHNFNMSVEEPNL 2730


>gb|ABD28670.2| RNA-directed DNA polymerase (Reverse transcriptase) [Medicago
           truncatula]
          Length = 642

 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 33/76 (43%), Positives = 46/76 (60%)
 Frame = -1

Query: 228 PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
           P L   + N +L  LP+ EEV+ AVF L  D APGPD F   FF   W+I+ KDV++ V 
Sbjct: 162 PKLVDATTNRLLTMLPTKEEVKNAVFDLNSDDAPGPDVFGACFFQIYWNIVKKDVYEAVL 221

Query: 48  DFWVGLSIPKSFSSTS 1
           DF+    +P +F++ S
Sbjct: 222 DFFKNGWLPNNFNANS 237


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 29/70 (41%), Positives = 44/70 (62%)
 Frame = -1

Query: 228 PCLFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVC 49
           P   + ++N  L   PSL+E++E VF+  +DS   PDGF   F+ HCWDII +D+ + V 
Sbjct: 658 PRTISITDNDFLYAAPSLKEIKEVVFNNDKDSVASPDGFSSLFYQHCWDIIKQDLLEAVL 717

Query: 48  DFWVGLSIPK 19
           DF+ G  +P+
Sbjct: 718 DFFKGTPMPQ 727


>ref|XP_004253407.1| PREDICTED: uncharacterized protein LOC101250876, partial [Solanum
           lycopersicum]
          Length = 445

 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 30/68 (44%), Positives = 44/68 (64%)
 Frame = -1

Query: 222 LFTPSENSMLDRLPSLEEVREAVFSLGRDSAPGPDGFFRTFFTHCWDIISKDVHQTVCDF 43
           + T  +N  LDRLP ++E+R  + S+   SAPGPDGF   F+  C+DII KD+   V  F
Sbjct: 132 MITQEQNDGLDRLPDMDELRRIIMSMNPHSAPGPDGFGGKFYQVCFDIIKKDLLDAVNHF 191

Query: 42  WVGLSIPK 19
           ++G S+P+
Sbjct: 192 YIGNSMPR 199

