BLASTX nr result

ID: Catharanthus23_contig00007319 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00007319
         (763 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC30167.1| hypothetical protein L484_008498 [Morus notabilis]      99   2e-18
ref|XP_002528390.1| conserved hypothetical protein [Ricinus comm...    97   5e-18
ref|XP_006284792.1| hypothetical protein CARUB_v10006060mg [Caps...    94   5e-17
ref|NP_001078473.1| uncharacterized protein [Arabidopsis thalian...    94   5e-17
ref|XP_004242630.1| PREDICTED: uncharacterized protein LOC101253...    93   1e-16
ref|XP_006381874.1| hypothetical protein POPTR_0006s19390g [Popu...    91   3e-16
ref|XP_002336580.1| predicted protein [Populus trichocarpa]            91   3e-16
ref|XP_002274551.1| PREDICTED: uncharacterized protein LOC100267...    91   6e-16
ref|XP_006412660.1| hypothetical protein EUTSA_v10026581mg [Eutr...    88   4e-15
gb|EOY31515.1| DNA-directed RNA polymerase subunit beta' [Theobr...    87   5e-15
ref|XP_006474056.1| PREDICTED: uncharacterized protein LOC102623...    86   1e-14
ref|XP_006453550.1| hypothetical protein CICLE_v10009916mg [Citr...    86   1e-14
ref|XP_004507768.1| PREDICTED: uncharacterized protein LOC101497...    86   1e-14
ref|XP_004309956.1| PREDICTED: uncharacterized protein LOC101291...    86   1e-14
gb|ESW26849.1| hypothetical protein PHAVU_003G153300g [Phaseolus...    86   2e-14
gb|EPS64943.1| hypothetical protein M569_09846, partial [Genlise...    84   5e-14
ref|XP_003610259.1| hypothetical protein MTR_4g130250 [Medicago ...    84   7e-14
ref|XP_006848730.1| hypothetical protein AMTR_s00177p00062320 [A...    81   4e-13
ref|XP_003549529.1| PREDICTED: uncharacterized protein LOC100778...    81   4e-13
ref|XP_004146667.1| PREDICTED: uncharacterized protein LOC101206...    79   2e-12

>gb|EXC30167.1| hypothetical protein L484_008498 [Morus notabilis]
          Length = 116

 Score = 98.6 bits (244), Expect = 2e-18
 Identities = 53/89 (59%), Positives = 61/89 (68%)
 Frame = -1

Query: 532 VTFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVIS 353
           +T AKK+D  E  TSK Q             L QS IAVF LGF DAGYSGDWSRIGVIS
Sbjct: 28  ITLAKKKDLPE--TSKTQQRSILPLRISSTFLGQSGIAVFGLGFIDAGYSGDWSRIGVIS 85

Query: 352 KENEDLLKSAAFLVVPLCFFLIVSFNYKK 266
           KE+EDL+K AAF+VVPLC FLIV  + ++
Sbjct: 86  KESEDLIKVAAFIVVPLCLFLIVKLSKER 114


>ref|XP_002528390.1| conserved hypothetical protein [Ricinus communis]
           gi|223532178|gb|EEF33983.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 127

 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 52/86 (60%), Positives = 60/86 (69%)
 Frame = -1

Query: 532 VTFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVIS 353
           +T AKK+D  ENS ++ Q              A+SAIAVF LGF DAGYSGDWSRIGVIS
Sbjct: 38  ITVAKKKDSPENSRTQQQNSILPLKISYRAL-AKSAIAVFGLGFIDAGYSGDWSRIGVIS 96

Query: 352 KENEDLLKSAAFLVVPLCFFLIVSFN 275
           KE+EDLLK AAF V+PLC FLI S +
Sbjct: 97  KESEDLLKLAAFAVIPLCIFLIFSIS 122


>ref|XP_006284792.1| hypothetical protein CARUB_v10006060mg [Capsella rubella]
           gi|482553497|gb|EOA17690.1| hypothetical protein
           CARUB_v10006060mg [Capsella rubella]
          Length = 113

 Score = 94.0 bits (232), Expect = 5e-17
 Identities = 50/83 (60%), Positives = 57/83 (68%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           +KKR+FAE S  K                A+SAIAV  LGF DAGYSGDWSRIGV+SKE 
Sbjct: 29  SKKREFAEKSNEKRTVLRIKVPNTIL---ARSAIAVLGLGFIDAGYSGDWSRIGVVSKET 85

Query: 343 EDLLKSAAFLVVPLCFFLIVSFN 275
           E+LLK AAFLVVPLC FL +SF+
Sbjct: 86  EELLKIAAFLVVPLCIFLTLSFS 108


>ref|NP_001078473.1| uncharacterized protein [Arabidopsis thaliana]
           gi|332660418|gb|AEE85818.1| uncharacterized protein
           AT4G30845 [Arabidopsis thaliana]
          Length = 114

 Score = 94.0 bits (232), Expect = 5e-17
 Identities = 50/83 (60%), Positives = 60/83 (72%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           +K+RDF+E S   N+             +A+SAIAV +LGF DAGYSGDWSRIGVISKE 
Sbjct: 30  SKRRDFSEKS---NEERPILRIKVPNTIVARSAIAVLSLGFIDAGYSGDWSRIGVISKET 86

Query: 343 EDLLKSAAFLVVPLCFFLIVSFN 275
           E+LLK AAFLVVPLC FL +SF+
Sbjct: 87  EELLKIAAFLVVPLCIFLALSFS 109


>ref|XP_004242630.1| PREDICTED: uncharacterized protein LOC101253911 [Solanum
           lycopersicum]
          Length = 124

 Score = 92.8 bits (229), Expect = 1e-16
 Identities = 57/124 (45%), Positives = 71/124 (57%), Gaps = 2/124 (1%)
 Frame = -1

Query: 634 MLTTQKTI--VCISNTFQYDHVQITGXXXXXXXSPTVTFAKKRDFAENSTSKNQTXXXXX 461
           ML TQ  I     SN F Y  ++            T++ AK ++F+E S  K +      
Sbjct: 1   MLITQNFINYYSSSNNFSYIFLRYR-KKRLNTKHSTISLAKNKEFSEKS--KVEENSISS 57

Query: 460 XXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKENEDLLKSAAFLVVPLCFFLIVS 281
                  L Q+ + VFALGF DAGYSGDWSRIGVISK+NEDLLK  AF +VPLC F+I S
Sbjct: 58  LKIPRNFLIQALVGVFALGFIDAGYSGDWSRIGVISKDNEDLLKITAFFIVPLCLFVIFS 117

Query: 280 FNYK 269
           F+ K
Sbjct: 118 FSKK 121


>ref|XP_006381874.1| hypothetical protein POPTR_0006s19390g [Populus trichocarpa]
           gi|550336652|gb|ERP59671.1| hypothetical protein
           POPTR_0006s19390g [Populus trichocarpa]
          Length = 166

 Score = 91.3 bits (225), Expect = 3e-16
 Identities = 49/81 (60%), Positives = 58/81 (71%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           AK+++ +ENS S+ Q              +QSA AVF LGF DAGYSGDWSRIGVISKE+
Sbjct: 83  AKRKEPSENSRSQEQPVFPLRVPKNIL--SQSAAAVFGLGFIDAGYSGDWSRIGVISKES 140

Query: 343 EDLLKSAAFLVVPLCFFLIVS 281
           EDLLK AAF+V+PLC FLI S
Sbjct: 141 EDLLKFAAFVVIPLCVFLIFS 161


>ref|XP_002336580.1| predicted protein [Populus trichocarpa]
          Length = 166

 Score = 91.3 bits (225), Expect = 3e-16
 Identities = 49/81 (60%), Positives = 58/81 (71%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           AK+++ +ENS S+ Q              +QSA AVF LGF DAGYSGDWSRIGVISKE+
Sbjct: 83  AKRKEPSENSRSQEQPVFPLRVPKNIL--SQSAAAVFGLGFIDAGYSGDWSRIGVISKES 140

Query: 343 EDLLKSAAFLVVPLCFFLIVS 281
           EDLLK AAF+V+PLC FLI S
Sbjct: 141 EDLLKVAAFVVIPLCVFLIFS 161


>ref|XP_002274551.1| PREDICTED: uncharacterized protein LOC100267275 [Vitis vinifera]
           gi|297743698|emb|CBI36581.3| unnamed protein product
           [Vitis vinifera]
          Length = 117

 Score = 90.5 bits (223), Expect = 6e-16
 Identities = 49/85 (57%), Positives = 58/85 (68%)
 Frame = -1

Query: 529 TFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISK 350
           T A+KRD ++NS+++ Q              A+SAIAV  LGF DAGYSGDWSRIGVISK
Sbjct: 29  TLAQKRDASKNSSTQQQQSILPLRVSNTIL-ARSAIAVLGLGFIDAGYSGDWSRIGVISK 87

Query: 349 ENEDLLKSAAFLVVPLCFFLIVSFN 275
           E ED LK +A LVVPLC FLI S +
Sbjct: 88  ETEDFLKLSALLVVPLCLFLIFSIS 112


>ref|XP_006412660.1| hypothetical protein EUTSA_v10026581mg [Eutrema salsugineum]
           gi|312281503|dbj|BAJ33617.1| unnamed protein product
           [Thellungiella halophila] gi|557113830|gb|ESQ54113.1|
           hypothetical protein EUTSA_v10026581mg [Eutrema
           salsugineum]
          Length = 121

 Score = 87.8 bits (216), Expect = 4e-15
 Identities = 49/83 (59%), Positives = 54/83 (65%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           +KKRDF E S  K                A+SAIAV  LGF DAGYSGDWSRIG ISKE 
Sbjct: 37  SKKRDFPEKSNGKRPVLQIKVPNTIL---ARSAIAVLGLGFIDAGYSGDWSRIGSISKET 93

Query: 343 EDLLKSAAFLVVPLCFFLIVSFN 275
           E+LLK AAFLVVPL  FL +SF+
Sbjct: 94  EELLKIAAFLVVPLSIFLALSFS 116


>gb|EOY31515.1| DNA-directed RNA polymerase subunit beta' [Theobroma cacao]
          Length = 117

 Score = 87.4 bits (215), Expect = 5e-15
 Identities = 47/82 (57%), Positives = 57/82 (69%)
 Frame = -1

Query: 520 KKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKENE 341
           K++D  E+S ++ Q              A+S +AV  LGF DAGYSGDWSRIGVISKE E
Sbjct: 32  KRKDPPESSRTQQQPIFPQRVSNTIL--ARSVVAVVGLGFIDAGYSGDWSRIGVISKEVE 89

Query: 340 DLLKSAAFLVVPLCFFLIVSFN 275
           DLLK AAF+V+PLCFFLI SF+
Sbjct: 90  DLLKIAAFVVLPLCFFLIFSFS 111


>ref|XP_006474056.1| PREDICTED: uncharacterized protein LOC102623329 isoform X1 [Citrus
           sinensis]
          Length = 123

 Score = 86.3 bits (212), Expect = 1e-14
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
 Frame = -1

Query: 532 VTFAKKRDFAENSTSKNQTXXXXXXXXXXXXL-AQSAIAVFALGFTDAGYSGDWSRIGVI 356
           +T AKK+D  E S+++ Q             + +++A+AV  LGF DAGYSGDWSRIGVI
Sbjct: 32  ITLAKKKDLPETSSAQQQNEKPIFPIKVSNLILSRAAVAVLGLGFIDAGYSGDWSRIGVI 91

Query: 355 SKENEDLLKSAAFLVVPLCFFLIVSF 278
           S+E E LLK AAF VVPLC F I SF
Sbjct: 92  SEETEALLKVAAFGVVPLCIFFIFSF 117


>ref|XP_006453550.1| hypothetical protein CICLE_v10009916mg [Citrus clementina]
           gi|557556776|gb|ESR66790.1| hypothetical protein
           CICLE_v10009916mg [Citrus clementina]
          Length = 123

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 46/86 (53%), Positives = 57/86 (66%), Gaps = 1/86 (1%)
 Frame = -1

Query: 532 VTFAKKRDFAENSTSKNQTXXXXXXXXXXXXL-AQSAIAVFALGFTDAGYSGDWSRIGVI 356
           +T AKK+D  E S+++ Q             + +++A+AV  LGF DAGYSGDWSRIGVI
Sbjct: 32  ITLAKKKDLPETSSAQQQNEKPIFPIKVSNLILSRAAVAVLGLGFFDAGYSGDWSRIGVI 91

Query: 355 SKENEDLLKSAAFLVVPLCFFLIVSF 278
           S+E E LLK AAF VVPLC F I SF
Sbjct: 92  SEETEALLKVAAFGVVPLCIFFIFSF 117


>ref|XP_004507768.1| PREDICTED: uncharacterized protein LOC101497228 [Cicer arietinum]
          Length = 115

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 43/79 (54%), Positives = 53/79 (67%)
 Frame = -1

Query: 523 AKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKEN 344
           AKK+DF  N+    Q             LA++AI VFALGF DAGYSGDWSRIGVI+ ++
Sbjct: 28  AKKKDFQNNNNESQQKPFLLPLRVSNSNLARAAIGVFALGFIDAGYSGDWSRIGVITSQS 87

Query: 343 EDLLKSAAFLVVPLCFFLI 287
           E+LL+ AAFLVVP+C   I
Sbjct: 88  EELLRLAAFLVVPICVLFI 106


>ref|XP_004309956.1| PREDICTED: uncharacterized protein LOC101291784 [Fragaria vesca
           subsp. vesca]
          Length = 119

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 47/84 (55%), Positives = 56/84 (66%), Gaps = 1/84 (1%)
 Frame = -1

Query: 523 AKKRDF-AENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISKE 347
           AK+RD   E S+++               +A+S +AV  LGF DAGYSGDWSR GVISKE
Sbjct: 31  AKRRDLPTEISSTQEGGFVFPRLRVSNTIVARSVVAVLGLGFIDAGYSGDWSRFGVISKE 90

Query: 346 NEDLLKSAAFLVVPLCFFLIVSFN 275
           +EDLLK AAFLVVPLC FLI S +
Sbjct: 91  SEDLLKVAAFLVVPLCLFLIFSIS 114


>gb|ESW26849.1| hypothetical protein PHAVU_003G153300g [Phaseolus vulgaris]
          Length = 112

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 44/81 (54%), Positives = 55/81 (67%)
 Frame = -1

Query: 529 TFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISK 350
           T AK++DF EN   K Q             +++ AI +F LGF DAGYSGDWSRIGVI+ 
Sbjct: 25  TLAKRKDFEENG--KPQQRPFFPLRISKSIISRGAIGMFGLGFIDAGYSGDWSRIGVITP 82

Query: 349 ENEDLLKSAAFLVVPLCFFLI 287
           ++E+LLK AAFLVVPLC FL+
Sbjct: 83  QSEELLKVAAFLVVPLCIFLV 103


>gb|EPS64943.1| hypothetical protein M569_09846, partial [Genlisea aurea]
          Length = 55

 Score = 84.0 bits (206), Expect = 5e-14
 Identities = 39/55 (70%), Positives = 45/55 (81%)
 Frame = -1

Query: 424 IAVFALGFTDAGYSGDWSRIGVISKENEDLLKSAAFLVVPLCFFLIVSFNYKKFE 260
           I +FALGF DAGYSGDWSRIGVIS E ED LK+AAF+VVPLC F I+S + K+ E
Sbjct: 1   IGIFALGFIDAGYSGDWSRIGVISTETEDFLKAAAFVVVPLCIFAIISLSLKRGE 55


>ref|XP_003610259.1| hypothetical protein MTR_4g130250 [Medicago truncatula]
           gi|355511314|gb|AES92456.1| hypothetical protein
           MTR_4g130250 [Medicago truncatula]
          Length = 113

 Score = 83.6 bits (205), Expect = 7e-14
 Identities = 44/83 (53%), Positives = 55/83 (66%)
 Frame = -1

Query: 535 TVTFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVI 356
           T    KK D  +N+ S+ Q+             A++AI VF LGF DAGYSGDWSRIGVI
Sbjct: 25  TALAKKKDDVQDNNESQQQSFLPLKVSKSNL--ARAAIGVFGLGFIDAGYSGDWSRIGVI 82

Query: 355 SKENEDLLKSAAFLVVPLCFFLI 287
           +++NE+LLK AAFLVVP+C F I
Sbjct: 83  TQQNEELLKLAAFLVVPICVFFI 105


>ref|XP_006848730.1| hypothetical protein AMTR_s00177p00062320 [Amborella trichopoda]
           gi|548852141|gb|ERN10311.1| hypothetical protein
           AMTR_s00177p00062320 [Amborella trichopoda]
          Length = 117

 Score = 80.9 bits (198), Expect = 4e-13
 Identities = 40/57 (70%), Positives = 45/57 (78%)
 Frame = -1

Query: 436 AQSAIAVFALGFTDAGYSGDWSRIGVISKENEDLLKSAAFLVVPLCFFLIVSFNYKK 266
           A++AIAVFALGF DAGYSGDWSRIG ISKE E+LLK AA+LV PLC  LI     +K
Sbjct: 60  ARTAIAVFALGFIDAGYSGDWSRIGAISKETEELLKVAAYLVTPLCLSLIFLIREEK 116


>ref|XP_003549529.1| PREDICTED: uncharacterized protein LOC100778234 [Glycine max]
          Length = 113

 Score = 80.9 bits (198), Expect = 4e-13
 Identities = 44/81 (54%), Positives = 51/81 (62%)
 Frame = -1

Query: 529 TFAKKRDFAENSTSKNQTXXXXXXXXXXXXLAQSAIAVFALGFTDAGYSGDWSRIGVISK 350
           T AK+ +  E S   + T             A+ AI +F LGF DAGYSGDWSRIGVI+ 
Sbjct: 29  TLAKRNESQEKSKGPSFTLRVSKSTI-----ARGAIGLFGLGFVDAGYSGDWSRIGVITP 83

Query: 349 ENEDLLKSAAFLVVPLCFFLI 287
           + EDLLK AAFLVVPLC FLI
Sbjct: 84  QTEDLLKLAAFLVVPLCIFLI 104


>ref|XP_004146667.1| PREDICTED: uncharacterized protein LOC101206545 [Cucumis sativus]
           gi|449503149|ref|XP_004161858.1| PREDICTED:
           uncharacterized LOC101206545 [Cucumis sativus]
          Length = 126

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 36/54 (66%), Positives = 42/54 (77%)
 Frame = -1

Query: 436 AQSAIAVFALGFTDAGYSGDWSRIGVISKENEDLLKSAAFLVVPLCFFLIVSFN 275
           A+S ++V  LGF DAGYSGDWSRIG I+KE EDLLK  A LVVP C FL+ SF+
Sbjct: 66  ARSVVSVLGLGFVDAGYSGDWSRIGAITKETEDLLKIGALLVVPFCVFLVFSFS 119