BLASTX nr result

ID: Catharanthus23_contig00026531 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00026531
         (805 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280026.1| PREDICTED: uncharacterized protein LOC100244...   102   1e-19
gb|EOY03678.1| Hydroxyproline-rich glycoprotein family protein [...    89   1e-15
ref|XP_006300032.1| hypothetical protein CARUB_v10016256mg [Caps...    88   3e-15
ref|NP_566161.1| hydroxyproline-rich glycoprotein-like protein [...    87   7e-15
gb|ACZ74665.1| hydroxyproline-rich protein [Phaseolus vulgaris]        87   7e-15
gb|ESW23326.1| hypothetical protein PHAVU_004G037300g [Phaseolus...    86   2e-14
ref|XP_002323900.1| hypothetical protein POPTR_0017s12970g [Popu...    86   2e-14
gb|EXB38104.1| hypothetical protein L484_021026 [Morus notabilis]      84   6e-14
ref|XP_002527615.1| conserved hypothetical protein [Ricinus comm...    84   8e-14
ref|XP_002305356.1| hypothetical protein POPTR_0004s11940g [Popu...    84   8e-14
ref|XP_002882207.1| predicted protein [Arabidopsis lyrata subsp....    83   1e-13
ref|XP_006338366.1| PREDICTED: uncharacterized protein LOC102585...    82   3e-13
gb|EMJ17403.1| hypothetical protein PRUPE_ppa016098mg [Prunus pe...    82   3e-13
ref|XP_003554873.1| PREDICTED: uncharacterized protein LOC100815...    82   3e-13
ref|NP_001235757.1| uncharacterized protein LOC100305464 [Glycin...    82   3e-13
ref|XP_004232163.1| PREDICTED: uncharacterized protein LOC101260...    80   8e-13
gb|AAF14825.1|AC011664_7 hypothetical protein [Arabidopsis thali...    79   1e-12
ref|XP_006603909.1| PREDICTED: uncharacterized protein LOC100815...    78   3e-12
ref|XP_003516839.1| PREDICTED: uncharacterized protein LOC100816...    78   3e-12
gb|AFK40257.1| unknown [Lotus japonicus]                               78   4e-12

>ref|XP_002280026.1| PREDICTED: uncharacterized protein LOC100244709 isoform 1 [Vitis
           vinifera] gi|296082203|emb|CBI21208.3| unnamed protein
           product [Vitis vinifera]
          Length = 112

 Score =  102 bits (255), Expect = 1e-19
 Identities = 51/66 (77%), Positives = 57/66 (86%)
 Frame = +3

Query: 276 STPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQV 455
           +TPDRLKVPK FKYPERY SPTDLMISPVSKGLLAR+RK    +LLPP+KIQPK+Q  +V
Sbjct: 47  ATPDRLKVPKAFKYPERYRSPTDLMISPVSKGLLARSRKT--GSLLPPAKIQPKVQDLRV 104

Query: 456 QEAGLF 473
           QE GLF
Sbjct: 105 QEVGLF 110


>gb|EOY03678.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 108

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 45/68 (66%), Positives = 52/68 (76%)
 Frame = +3

Query: 267 KPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQS 446
           K   TPDRLKVPK FKYPERY SPTD M+SPV+KGLLAR RK  G++LLPPS  Q K+  
Sbjct: 39  KKTCTPDRLKVPKAFKYPERYRSPTDSMMSPVTKGLLARNRK-GGASLLPPSINQTKIHE 97

Query: 447 FQVQEAGL 470
            +VQ+ GL
Sbjct: 98  LRVQDVGL 105


>ref|XP_006300032.1| hypothetical protein CARUB_v10016256mg [Capsella rubella]
           gi|482568741|gb|EOA32930.1| hypothetical protein
           CARUB_v10016256mg [Capsella rubella]
          Length = 124

 Score = 88.2 bits (217), Expect = 3e-15
 Identities = 47/93 (50%), Positives = 65/93 (69%)
 Frame = +3

Query: 192 EAKISEEEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKG 371
           E+ ++ E  +R ++S   P ++I      TPDRL+VP  FK+PERY SPTD M+SPV+KG
Sbjct: 33  ESCLNHESPRRRVSSTNEPMKKI-----GTPDRLRVPIAFKHPERYRSPTDAMMSPVTKG 87

Query: 372 LLARTRKPNGSNLLPPSKIQPKLQSFQVQEAGL 470
           LLAR+RK +GS L+PPS  Q K+Q  +  E+GL
Sbjct: 88  LLARSRKASGS-LIPPSFNQTKIQELRKPESGL 119


>ref|NP_566161.1| hydroxyproline-rich glycoprotein-like protein [Arabidopsis
           thaliana] gi|21593915|gb|AAM65880.1| unknown
           [Arabidopsis thaliana] gi|26452456|dbj|BAC43313.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827236|gb|AAO50462.1| unknown protein [Arabidopsis
           thaliana] gi|332640244|gb|AEE73765.1|
           hydroxyproline-rich glycoprotein-like protein
           [Arabidopsis thaliana]
          Length = 126

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 47/81 (58%), Positives = 57/81 (70%), Gaps = 3/81 (3%)
 Frame = +3

Query: 237 DQRPPREICCKPAS---TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSN 407
           D  PPR    +P     TP+RL+VP  FKYPERY SPTD M+SPV+KGLLARTRK +GS 
Sbjct: 42  DSPPPRASTNEPMKKIGTPERLRVPIAFKYPERYRSPTDAMMSPVTKGLLARTRKSSGS- 100

Query: 408 LLPPSKIQPKLQSFQVQEAGL 470
           L+PPS  Q K+Q  +  E+GL
Sbjct: 101 LIPPSFNQTKIQELRKPESGL 121


>gb|ACZ74665.1| hydroxyproline-rich protein [Phaseolus vulgaris]
          Length = 133

 Score = 87.0 bits (214), Expect = 7e-15
 Identities = 44/95 (46%), Positives = 58/95 (61%), Gaps = 1/95 (1%)
 Frame = +3

Query: 192 EAKISEEEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKG 371
           E K      Q+    D+    E+      TPD L+VPK FKYPERY SPTDLM+SP++KG
Sbjct: 23  ECKTPIPAQQQHQNKDRNSSNELRKPVTVTPDHLRVPKAFKYPERYTSPTDLMMSPITKG 82

Query: 372 LLARTRKPNGSN-LLPPSKIQPKLQSFQVQEAGLF 473
           LLART++  G   +LPP K QPK+    +++ G F
Sbjct: 83  LLARTKRGGGGGAMLPPGKNQPKILDMPLKDVGTF 117


>gb|ESW23326.1| hypothetical protein PHAVU_004G037300g [Phaseolus vulgaris]
          Length = 134

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 44/96 (45%), Positives = 58/96 (60%), Gaps = 2/96 (2%)
 Frame = +3

Query: 192 EAKISEEEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKG 371
           E K      Q+    D+    E+      TPD L+VPK FKYPERY SPTDLM+SP++KG
Sbjct: 23  ECKTPIPAQQQHQIKDRNSSNELRKPVTVTPDHLRVPKAFKYPERYTSPTDLMMSPITKG 82

Query: 372 LLARTRKPN--GSNLLPPSKIQPKLQSFQVQEAGLF 473
           LLART++    G  +LPP K QPK+    +++ G F
Sbjct: 83  LLARTKRGGGVGGAMLPPGKNQPKILDMPLKDVGTF 118


>ref|XP_002323900.1| hypothetical protein POPTR_0017s12970g [Populus trichocarpa]
           gi|118481606|gb|ABK92745.1| unknown [Populus
           trichocarpa] gi|222866902|gb|EEF04033.1| hypothetical
           protein POPTR_0017s12970g [Populus trichocarpa]
          Length = 121

 Score = 85.5 bits (210), Expect = 2e-14
 Identities = 44/83 (53%), Positives = 57/83 (68%)
 Frame = +3

Query: 213 EDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRK 392
           EDQ      +   +++  + + TPD LKVPK FKYPERY SPTDLMISP++KG+LAR +K
Sbjct: 35  EDQEIDKKSENSSKDL--RKSGTPDPLKVPKAFKYPERYRSPTDLMISPITKGILARNKK 92

Query: 393 PNGSNLLPPSKIQPKLQSFQVQE 461
             G  LLPPS  QPK+Q  + Q+
Sbjct: 93  --GGALLPPSWNQPKVQDVETQD 113


>gb|EXB38104.1| hypothetical protein L484_021026 [Morus notabilis]
          Length = 118

 Score = 84.0 bits (206), Expect = 6e-14
 Identities = 45/97 (46%), Positives = 59/97 (60%), Gaps = 15/97 (15%)
 Frame = +3

Query: 222 RPITSDQRPPREIC---------------CKPASTPDRLKVPKPFKYPERYMSPTDLMIS 356
           +P++ D + P  I                 +  +TPD LKVPK FKYPERY SPTD ++S
Sbjct: 17  KPVSEDHKTPTPIAQTNKSLQNSPNSGTDLRKPTTPDLLKVPKAFKYPERYRSPTDSLMS 76

Query: 357 PVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAG 467
           PV+KGLLAR+RK  G  LLPPSK   K+Q  ++Q+ G
Sbjct: 77  PVTKGLLARSRK--GGALLPPSKNHHKIQDLRLQDVG 111


>ref|XP_002527615.1| conserved hypothetical protein [Ricinus communis]
           gi|223532989|gb|EEF34754.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 112

 Score = 83.6 bits (205), Expect = 8e-14
 Identities = 41/58 (70%), Positives = 48/58 (82%)
 Frame = +3

Query: 267 KPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKL 440
           + +STPDRLKVPK FKYPERY SPTDLM+SP++KGLLAR RK  G+ LLPPS  Q K+
Sbjct: 43  RKSSTPDRLKVPKAFKYPERYRSPTDLMVSPITKGLLARNRK--GAALLPPSMNQAKV 98


>ref|XP_002305356.1| hypothetical protein POPTR_0004s11940g [Populus trichocarpa]
           gi|222848320|gb|EEE85867.1| hypothetical protein
           POPTR_0004s11940g [Populus trichocarpa]
          Length = 153

 Score = 83.6 bits (205), Expect = 8e-14
 Identities = 45/86 (52%), Positives = 57/86 (66%)
 Frame = +3

Query: 210 EEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTR 389
           +EDQ      +    ++  + +S P  L+VPK FK+PERY SPTDLMISP++KGLLAR R
Sbjct: 69  KEDQEMDQESENSGNDL--RKSSAPYHLQVPKAFKFPERYRSPTDLMISPITKGLLARNR 126

Query: 390 KPNGSNLLPPSKIQPKLQSFQVQEAG 467
           K  G  LLPPS  QPK+Q  +VQ  G
Sbjct: 127 K--GGALLPPSLNQPKVQDVEVQGGG 150


>ref|XP_002882207.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297328047|gb|EFH58466.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 470

 Score = 82.8 bits (203), Expect = 1e-13
 Identities = 45/79 (56%), Positives = 55/79 (69%), Gaps = 3/79 (3%)
 Frame = +3

Query: 237 DQRPPREICCKPAS---TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSN 407
           D  PPR    +P     TPDRL+VP  FK+PERY SPTD M+SPV+KGLLARTRK +GS 
Sbjct: 41  DSPPPRASTNEPMKKIGTPDRLRVPIAFKHPERYRSPTDAMMSPVTKGLLARTRKASGS- 99

Query: 408 LLPPSKIQPKLQSFQVQEA 464
           L+PPS  Q K+Q  +  E+
Sbjct: 100 LIPPSFNQTKIQELRKPES 118


>ref|XP_006338366.1| PREDICTED: uncharacterized protein LOC102585644 [Solanum tuberosum]
          Length = 129

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 46/66 (69%), Positives = 51/66 (77%), Gaps = 1/66 (1%)
 Frame = +3

Query: 276 STPDRLKVPKPFKYPERYMSPTDLMISPVSKGLL-ARTRKPNGSNLLPPSKIQPKLQSFQ 452
           +TPDRLKVPKPFKYPERY SPTD M+SPVSK LL  R+RK   S LLPPSK +P L    
Sbjct: 61  TTPDRLKVPKPFKYPERYTSPTDQMMSPVSKRLLIGRSRK--ASTLLPPSKNRP-LHQHM 117

Query: 453 VQEAGL 470
           VQE+GL
Sbjct: 118 VQESGL 123


>gb|EMJ17403.1| hypothetical protein PRUPE_ppa016098mg [Prunus persica]
          Length = 118

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 46/82 (56%), Positives = 55/82 (67%)
 Frame = +3

Query: 207 EEEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLART 386
           ++E+ +   +D R P        +TPDRLKVPK FKYPERY SPTDLM+SPV+KGLLAR 
Sbjct: 36  KDENSQNSGNDLRKP--------TTPDRLKVPKAFKYPERYTSPTDLMMSPVTKGLLARN 87

Query: 387 RKPNGSNLLPPSKIQPKLQSFQ 452
           RK  G  LLPPSK   K Q  +
Sbjct: 88  RK--GGALLPPSKNLHKPQGIE 107


>ref|XP_003554873.1| PREDICTED: uncharacterized protein LOC100815031 isoform X1 [Glycine
           max]
          Length = 129

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 38/63 (60%), Positives = 49/63 (77%)
 Frame = +3

Query: 279 TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQ 458
           TP+RL+VPK FKYPERY SPTDL++SPV+KGLLARTR+  G+ L P  K QPK+    ++
Sbjct: 50  TPNRLRVPKAFKYPERYTSPTDLIMSPVTKGLLARTRRGGGAVLPPGGKNQPKILDMPLK 109

Query: 459 EAG 467
           + G
Sbjct: 110 DVG 112


>ref|NP_001235757.1| uncharacterized protein LOC100305464 [Glycine max]
           gi|255625585|gb|ACU13137.1| unknown [Glycine max]
          Length = 128

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 38/63 (60%), Positives = 48/63 (76%)
 Frame = +3

Query: 279 TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQ 458
           TPDRL+VPK FKYPERY SPTDLM+ PV+KGLLARTR+  G+ L P  K +PK+    ++
Sbjct: 49  TPDRLRVPKAFKYPERYTSPTDLMMPPVTKGLLARTRRGGGAVLPPGGKNRPKILDMPLK 108

Query: 459 EAG 467
           + G
Sbjct: 109 DVG 111


>ref|XP_004232163.1| PREDICTED: uncharacterized protein LOC101260290 [Solanum
           lycopersicum]
          Length = 113

 Score = 80.1 bits (196), Expect = 8e-13
 Identities = 49/89 (55%), Positives = 58/89 (65%), Gaps = 1/89 (1%)
 Frame = +3

Query: 201 ISEEEDQRPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLL- 377
           ++EE    P+    RP   +   P +TPDRLKVPKPFKYPERY SPTD M+SPVSK LL 
Sbjct: 32  VAEEPKTPPLN---RPMIVLPNSPINTPDRLKVPKPFKYPERYTSPTDQMMSPVSKRLLI 88

Query: 378 ARTRKPNGSNLLPPSKIQPKLQSFQVQEA 464
            R+RK   S LLPPSK     Q  Q+QE+
Sbjct: 89  GRSRK--SSTLLPPSK---NRQGLQLQES 112


>gb|AAF14825.1|AC011664_7 hypothetical protein [Arabidopsis thaliana]
          Length = 480

 Score = 79.3 bits (194), Expect = 1e-12
 Identities = 43/73 (58%), Positives = 52/73 (71%), Gaps = 3/73 (4%)
 Frame = +3

Query: 237 DQRPPREICCKPAS---TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSN 407
           D  PPR    +P     TP+RL+VP  FKYPERY SPTD M+SPV+KGLLARTRK +GS 
Sbjct: 42  DSPPPRASTNEPMKKIGTPERLRVPIAFKYPERYRSPTDAMMSPVTKGLLARTRKSSGS- 100

Query: 408 LLPPSKIQPKLQS 446
           L+PPS  Q K ++
Sbjct: 101 LIPPSFNQTKTKT 113


>ref|XP_006603909.1| PREDICTED: uncharacterized protein LOC100815031 isoform X2 [Glycine
           max]
          Length = 126

 Score = 78.2 bits (191), Expect = 3e-12
 Identities = 37/53 (69%), Positives = 44/53 (83%)
 Frame = +3

Query: 279 TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPK 437
           TP+RL+VPK FKYPERY SPTDL++SPV+KGLLARTR+  G+ L P  K QPK
Sbjct: 50  TPNRLRVPKAFKYPERYTSPTDLIMSPVTKGLLARTRRGGGAVLPPGGKNQPK 102


>ref|XP_003516839.1| PREDICTED: uncharacterized protein LOC100816026 isoform X1 [Glycine
           max]
          Length = 127

 Score = 78.2 bits (191), Expect = 3e-12
 Identities = 36/63 (57%), Positives = 47/63 (74%)
 Frame = +3

Query: 279 TPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQ 458
           TPDRL+VPK FKY ERY SPTDLM+SPV+KGL A+TR+  G+ L P  K +PK+    ++
Sbjct: 48  TPDRLRVPKAFKYAERYTSPTDLMMSPVTKGLFAKTRRDGGAVLPPGGKNRPKILDLPLK 107

Query: 459 EAG 467
           + G
Sbjct: 108 DVG 110


>gb|AFK40257.1| unknown [Lotus japonicus]
          Length = 129

 Score = 77.8 bits (190), Expect = 4e-12
 Identities = 47/102 (46%), Positives = 59/102 (57%), Gaps = 8/102 (7%)
 Frame = +3

Query: 192 EAKISEEEDQRPITSDQRPPREICCKPAST--------PDRLKVPKPFKYPERYMSPTDL 347
           EA+ +E E + P    Q P  +    P ST        PD L+VPK FK+PERY SPTD 
Sbjct: 15  EAQSTEPECKTPTPIPQPPQND---DPNSTDELRKSLIPDPLRVPKAFKFPERYTSPTDS 71

Query: 348 MISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAGLF 473
           ++SPV+KGLLAR +K  G   LPP K  PK+    +QE G F
Sbjct: 72  IMSPVTKGLLARGKK--GVAKLPPGKYHPKIPDMSLQEVGPF 111

