BLASTX nr result

ID: Catharanthus22_contig00018157 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00018157
         (932 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280026.1| PREDICTED: uncharacterized protein LOC100244...   106   1e-20
gb|EOY03678.1| Hydroxyproline-rich glycoprotein family protein [...    93   2e-16
ref|NP_566161.1| hydroxyproline-rich glycoprotein-like protein [...    91   8e-16
ref|XP_006338366.1| PREDICTED: uncharacterized protein LOC102585...    88   4e-15
gb|ACZ74665.1| hydroxyproline-rich protein [Phaseolus vulgaris]        87   1e-14
gb|EMJ17403.1| hypothetical protein PRUPE_ppa016098mg [Prunus pe...    86   2e-14
gb|EXB38104.1| hypothetical protein L484_021026 [Morus notabilis]      86   2e-14
ref|NP_001235757.1| uncharacterized protein LOC100305464 [Glycin...    86   3e-14
ref|XP_004232163.1| PREDICTED: uncharacterized protein LOC101260...    85   3e-14
ref|XP_003554873.1| PREDICTED: uncharacterized protein LOC100815...    85   3e-14
gb|ESW23326.1| hypothetical protein PHAVU_004G037300g [Phaseolus...    85   4e-14
ref|XP_006300032.1| hypothetical protein CARUB_v10016256mg [Caps...    85   4e-14
ref|XP_002323900.1| hypothetical protein POPTR_0017s12970g [Popu...    85   4e-14
ref|XP_002527615.1| conserved hypothetical protein [Ricinus comm...    84   8e-14
ref|XP_002882207.1| predicted protein [Arabidopsis lyrata subsp....    84   1e-13
ref|XP_002305356.1| hypothetical protein POPTR_0004s11940g [Popu...    83   1e-13
gb|AAF14825.1|AC011664_7 hypothetical protein [Arabidopsis thali...    83   2e-13
ref|XP_006603909.1| PREDICTED: uncharacterized protein LOC100815...    82   4e-13
ref|XP_003516839.1| PREDICTED: uncharacterized protein LOC100816...    81   5e-13
gb|AFK40257.1| unknown [Lotus japonicus]                               79   2e-12

>ref|XP_002280026.1| PREDICTED: uncharacterized protein LOC100244709 isoform 1 [Vitis
           vinifera] gi|296082203|emb|CBI21208.3| unnamed protein
           product [Vitis vinifera]
          Length = 112

 Score =  106 bits (265), Expect = 1e-20
 Identities = 56/90 (62%), Positives = 63/90 (70%)
 Frame = -1

Query: 674 KTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLAR 495
           KTPP N       +          +TPDRLKVPK FKYPERY SPTDLMISPVSKGLLAR
Sbjct: 23  KTPPPNQETAQKIQNSANDSGNKTATPDRLKVPKAFKYPERYRSPTDLMISPVSKGLLAR 82

Query: 494 TRKPNGSNLLPPSKIQPKLQSFQVQEAGLF 405
           +RK    +LLPP+KIQPK+Q  +VQE GLF
Sbjct: 83  SRKT--GSLLPPAKIQPKVQDLRVQEVGLF 110


>gb|EOY03678.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
          Length = 108

 Score = 92.8 bits (229), Expect = 2e-16
 Identities = 50/91 (54%), Positives = 60/91 (65%)
 Frame = -1

Query: 680 QQKTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLL 501
           + K PP N  I  + +       K   TPDRLKVPK FKYPERY SPTD M+SPV+KGLL
Sbjct: 17  ENKAPPQNQQIDQNSQDSSNDL-KKTCTPDRLKVPKAFKYPERYRSPTDSMMSPVTKGLL 75

Query: 500 ARTRKPNGSNLLPPSKIQPKLQSFQVQEAGL 408
           AR RK  G++LLPPS  Q K+   +VQ+ GL
Sbjct: 76  ARNRK-GGASLLPPSINQTKIHELRVQDVGL 105


>ref|NP_566161.1| hydroxyproline-rich glycoprotein-like protein [Arabidopsis
           thaliana] gi|21593915|gb|AAM65880.1| unknown
           [Arabidopsis thaliana] gi|26452456|dbj|BAC43313.1|
           unknown protein [Arabidopsis thaliana]
           gi|28827236|gb|AAO50462.1| unknown protein [Arabidopsis
           thaliana] gi|332640244|gb|AEE73765.1|
           hydroxyproline-rich glycoprotein-like protein
           [Arabidopsis thaliana]
          Length = 126

 Score = 90.5 bits (223), Expect = 8e-16
 Identities = 55/114 (48%), Positives = 70/114 (61%), Gaps = 14/114 (12%)
 Frame = -1

Query: 707 ETKISEEEDQQK--------TPPINHPITS---DQRPPREICCKPAS---TPDRLKVPKP 570
           ET +  + D +K        +PP   P +S   D  PPR    +P     TP+RL+VP  
Sbjct: 9   ETPLKTQHDHRKITTSNPESSPPRPFPESSRKHDSPPPRASTNEPMKKIGTPERLRVPIA 68

Query: 569 FKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAGL 408
           FKYPERY SPTD M+SPV+KGLLARTRK +GS L+PPS  Q K+Q  +  E+GL
Sbjct: 69  FKYPERYRSPTDAMMSPVTKGLLARTRKSSGS-LIPPSFNQTKIQELRKPESGL 121


>ref|XP_006338366.1| PREDICTED: uncharacterized protein LOC102585644 [Solanum tuberosum]
          Length = 129

 Score = 88.2 bits (217), Expect = 4e-15
 Identities = 53/92 (57%), Positives = 60/92 (65%), Gaps = 1/92 (1%)
 Frame = -1

Query: 680 QQKTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLL 501
           + KTPP+N P       P        +TPDRLKVPKPFKYPERY SPTD M+SPVSK LL
Sbjct: 41  EPKTPPLNRPTIVLPNSPIN------TTPDRLKVPKPFKYPERYTSPTDQMMSPVSKRLL 94

Query: 500 -ARTRKPNGSNLLPPSKIQPKLQSFQVQEAGL 408
             R+RK   S LLPPSK +P L    VQE+GL
Sbjct: 95  IGRSRK--ASTLLPPSKNRP-LHQHMVQESGL 123


>gb|ACZ74665.1| hydroxyproline-rich protein [Phaseolus vulgaris]
          Length = 133

 Score = 86.7 bits (213), Expect = 1e-14
 Identities = 47/97 (48%), Positives = 61/97 (62%), Gaps = 3/97 (3%)
 Frame = -1

Query: 686 EDQQKTP-PINHPITSDQRPPREICCKPAS-TPDRLKVPKPFKYPERYMSPTDLMISPVS 513
           E + KTP P      +  R       KP + TPD L+VPK FKYPERY SPTDLM+SP++
Sbjct: 21  EPECKTPIPAQQQHQNKDRNSSNELRKPVTVTPDHLRVPKAFKYPERYTSPTDLMMSPIT 80

Query: 512 KGLLARTRKPNGSN-LLPPSKIQPKLQSFQVQEAGLF 405
           KGLLART++  G   +LPP K QPK+    +++ G F
Sbjct: 81  KGLLARTKRGGGGGAMLPPGKNQPKILDMPLKDVGTF 117


>gb|EMJ17403.1| hypothetical protein PRUPE_ppa016098mg [Prunus persica]
          Length = 118

 Score = 86.3 bits (212), Expect = 2e-14
 Identities = 46/81 (56%), Positives = 55/81 (67%)
 Frame = -1

Query: 668 PPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTR 489
           PP+ H   + Q    ++  +  +TPDRLKVPK FKYPERY SPTDLM+SPV+KGLLAR R
Sbjct: 31  PPLQHKDENSQNSGNDL--RKPTTPDRLKVPKAFKYPERYTSPTDLMMSPVTKGLLARNR 88

Query: 488 KPNGSNLLPPSKIQPKLQSFQ 426
           K  G  LLPPSK   K Q  +
Sbjct: 89  K--GGALLPPSKNLHKPQGIE 107


>gb|EXB38104.1| hypothetical protein L484_021026 [Morus notabilis]
          Length = 118

 Score = 85.9 bits (211), Expect = 2e-14
 Identities = 48/93 (51%), Positives = 59/93 (63%), Gaps = 1/93 (1%)
 Frame = -1

Query: 686 EDQQKTPPINHPITSDQRPPRE-ICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSK 510
           ED +   PI     S Q  P      +  +TPD LKVPK FKYPERY SPTD ++SPV+K
Sbjct: 21  EDHKTPTPIAQTNKSLQNSPNSGTDLRKPTTPDLLKVPKAFKYPERYRSPTDSLMSPVTK 80

Query: 509 GLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAG 411
           GLLAR+RK  G  LLPPSK   K+Q  ++Q+ G
Sbjct: 81  GLLARSRK--GGALLPPSKNHHKIQDLRLQDVG 111


>ref|NP_001235757.1| uncharacterized protein LOC100305464 [Glycine max]
           gi|255625585|gb|ACU13137.1| unknown [Glycine max]
          Length = 128

 Score = 85.5 bits (210), Expect = 3e-14
 Identities = 47/93 (50%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
 Frame = -1

Query: 686 EDQQKTP-PINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSK 510
           E + KTP P+     +D     E+  KP  TPDRL+VPK FKYPERY SPTDLM+ PV+K
Sbjct: 21  EPECKTPTPVQQQDPNDHNSSNELR-KPV-TPDRLRVPKAFKYPERYTSPTDLMMPPVTK 78

Query: 509 GLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAG 411
           GLLARTR+  G+ L P  K +PK+    +++ G
Sbjct: 79  GLLARTRRGGGAVLPPGGKNRPKILDMPLKDVG 111


>ref|XP_004232163.1| PREDICTED: uncharacterized protein LOC101260290 [Solanum
           lycopersicum]
          Length = 113

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 50/91 (54%), Positives = 60/91 (65%), Gaps = 1/91 (1%)
 Frame = -1

Query: 683 DQQKTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGL 504
           ++ KTPP+N P+         +   P +TPDRLKVPKPFKYPERY SPTD M+SPVSK L
Sbjct: 34  EEPKTPPLNRPMIV-------LPNSPINTPDRLKVPKPFKYPERYTSPTDQMMSPVSKRL 86

Query: 503 L-ARTRKPNGSNLLPPSKIQPKLQSFQVQEA 414
           L  R+RK   S LLPPSK     Q  Q+QE+
Sbjct: 87  LIGRSRK--SSTLLPPSK---NRQGLQLQES 112


>ref|XP_003554873.1| PREDICTED: uncharacterized protein LOC100815031 isoform X1 [Glycine
           max]
          Length = 129

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 1/93 (1%)
 Frame = -1

Query: 686 EDQQKTP-PINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSK 510
           E + KTP P+     +D         KP  TP+RL+VPK FKYPERY SPTDL++SPV+K
Sbjct: 21  EPECKTPAPVQQQDPNDHNNSSNELHKPV-TPNRLRVPKAFKYPERYTSPTDLIMSPVTK 79

Query: 509 GLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAG 411
           GLLARTR+  G+ L P  K QPK+    +++ G
Sbjct: 80  GLLARTRRGGGAVLPPGGKNQPKILDMPLKDVG 112


>gb|ESW23326.1| hypothetical protein PHAVU_004G037300g [Phaseolus vulgaris]
          Length = 134

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 47/99 (47%), Positives = 61/99 (61%), Gaps = 5/99 (5%)
 Frame = -1

Query: 686 EDQQKTP---PINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPV 516
           E + KTP      H I  D+    E+      TPD L+VPK FKYPERY SPTDLM+SP+
Sbjct: 21  EPECKTPIPAQQQHQI-KDRNSSNELRKPVTVTPDHLRVPKAFKYPERYTSPTDLMMSPI 79

Query: 515 SKGLLARTRKPN--GSNLLPPSKIQPKLQSFQVQEAGLF 405
           +KGLLART++    G  +LPP K QPK+    +++ G F
Sbjct: 80  TKGLLARTKRGGGVGGAMLPPGKNQPKILDMPLKDVGTF 118


>ref|XP_006300032.1| hypothetical protein CARUB_v10016256mg [Capsella rubella]
           gi|482568741|gb|EOA32930.1| hypothetical protein
           CARUB_v10016256mg [Capsella rubella]
          Length = 124

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 50/106 (47%), Positives = 65/106 (61%), Gaps = 5/106 (4%)
 Frame = -1

Query: 710 QETKISEEEDQQKTPPINHP-----ITSDQRPPREICCKPASTPDRLKVPKPFKYPERYM 546
           QET     E       +NH      ++S   P ++I      TPDRL+VP  FK+PERY 
Sbjct: 20  QETTALSPESPPLESCLNHESPRRRVSSTNEPMKKI-----GTPDRLRVPIAFKHPERYR 74

Query: 545 SPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAGL 408
           SPTD M+SPV+KGLLAR+RK +GS L+PPS  Q K+Q  +  E+GL
Sbjct: 75  SPTDAMMSPVTKGLLARSRKASGS-LIPPSFNQTKIQELRKPESGL 119


>ref|XP_002323900.1| hypothetical protein POPTR_0017s12970g [Populus trichocarpa]
           gi|118481606|gb|ABK92745.1| unknown [Populus
           trichocarpa] gi|222866902|gb|EEF04033.1| hypothetical
           protein POPTR_0017s12970g [Populus trichocarpa]
          Length = 121

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 41/65 (63%), Positives = 50/65 (76%)
 Frame = -1

Query: 611 KPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQS 432
           + + TPD LKVPK FKYPERY SPTDLMISP++KG+LAR +K  G  LLPPS  QPK+Q 
Sbjct: 51  RKSGTPDPLKVPKAFKYPERYRSPTDLMISPITKGILARNKK--GGALLPPSWNQPKVQD 108

Query: 431 FQVQE 417
            + Q+
Sbjct: 109 VETQD 113


>ref|XP_002527615.1| conserved hypothetical protein [Ricinus communis]
           gi|223532989|gb|EEF34754.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 112

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 47/79 (59%), Positives = 55/79 (69%)
 Frame = -1

Query: 674 KTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLAR 495
           KTPP +  + S          K +STPDRLKVPK FKYPERY SPTDLM+SP++KGLLAR
Sbjct: 23  KTPPQDQKMDSKSLNSSGDLRK-SSTPDRLKVPKAFKYPERYRSPTDLMVSPITKGLLAR 81

Query: 494 TRKPNGSNLLPPSKIQPKL 438
            RK  G+ LLPPS  Q K+
Sbjct: 82  NRK--GAALLPPSMNQAKV 98


>ref|XP_002882207.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297328047|gb|EFH58466.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 470

 Score = 83.6 bits (205), Expect = 1e-13
 Identities = 51/111 (45%), Positives = 66/111 (59%), Gaps = 13/111 (11%)
 Frame = -1

Query: 707 ETKISEEEDQQKTPPIN-----HPITS-----DQRPPREICCKPAS---TPDRLKVPKPF 567
           ET +  + D Q+   +N      P+       D  PPR    +P     TPDRL+VP  F
Sbjct: 9   ETPLKIQPDHQEITTLNPLSPPQPLPESCRNHDSPPPRASTNEPMKKIGTPDRLRVPIAF 68

Query: 566 KYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEA 414
           K+PERY SPTD M+SPV+KGLLARTRK +GS L+PPS  Q K+Q  +  E+
Sbjct: 69  KHPERYRSPTDAMMSPVTKGLLARTRKASGS-LIPPSFNQTKIQELRKPES 118


>ref|XP_002305356.1| hypothetical protein POPTR_0004s11940g [Populus trichocarpa]
           gi|222848320|gb|EEE85867.1| hypothetical protein
           POPTR_0004s11940g [Populus trichocarpa]
          Length = 153

 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 42/67 (62%), Positives = 50/67 (74%)
 Frame = -1

Query: 611 KPASTPDRLKVPKPFKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQS 432
           + +S P  L+VPK FK+PERY SPTDLMISP++KGLLAR RK  G  LLPPS  QPK+Q 
Sbjct: 86  RKSSAPYHLQVPKAFKFPERYRSPTDLMISPITKGLLARNRK--GGALLPPSLNQPKVQD 143

Query: 431 FQVQEAG 411
            +VQ  G
Sbjct: 144 VEVQGGG 150


>gb|AAF14825.1|AC011664_7 hypothetical protein [Arabidopsis thaliana]
          Length = 480

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 51/106 (48%), Positives = 65/106 (61%), Gaps = 14/106 (13%)
 Frame = -1

Query: 707 ETKISEEEDQQK--------TPPINHPITS---DQRPPREICCKPAS---TPDRLKVPKP 570
           ET +  + D +K        +PP   P +S   D  PPR    +P     TP+RL+VP  
Sbjct: 9   ETPLKTQHDHRKITTSNPESSPPRPFPESSRKHDSPPPRASTNEPMKKIGTPERLRVPIA 68

Query: 569 FKYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQS 432
           FKYPERY SPTD M+SPV+KGLLARTRK +GS L+PPS  Q K ++
Sbjct: 69  FKYPERYRSPTDAMMSPVTKGLLARTRKSSGS-LIPPSFNQTKTKT 113


>ref|XP_006603909.1| PREDICTED: uncharacterized protein LOC100815031 isoform X2 [Glycine
           max]
          Length = 126

 Score = 81.6 bits (200), Expect = 4e-13
 Identities = 45/83 (54%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
 Frame = -1

Query: 686 EDQQKTP-PINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLMISPVSK 510
           E + KTP P+     +D         KP  TP+RL+VPK FKYPERY SPTDL++SPV+K
Sbjct: 21  EPECKTPAPVQQQDPNDHNNSSNELHKPV-TPNRLRVPKAFKYPERYTSPTDLIMSPVTK 79

Query: 509 GLLARTRKPNGSNLLPPSKIQPK 441
           GLLARTR+  G+ L P  K QPK
Sbjct: 80  GLLARTRRGGGAVLPPGGKNQPK 102


>ref|XP_003516839.1| PREDICTED: uncharacterized protein LOC100816026 isoform X1 [Glycine
           max]
          Length = 127

 Score = 81.3 bits (199), Expect = 5e-13
 Identities = 46/112 (41%), Positives = 60/112 (53%), Gaps = 18/112 (16%)
 Frame = -1

Query: 692 EEEDQQKTPPINHPITSDQRPPREICCKPAS------------------TPDRLKVPKPF 567
           E+E+  KTPP    +    R P   C  P                    TPDRL+VPK F
Sbjct: 2   EKENMLKTPP---KVPIQDRTPEPECKTPTPLQQDPNDHNSSNELRKPVTPDRLRVPKAF 58

Query: 566 KYPERYMSPTDLMISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAG 411
           KY ERY SPTDLM+SPV+KGL A+TR+  G+ L P  K +PK+    +++ G
Sbjct: 59  KYAERYTSPTDLMMSPVTKGLFAKTRRDGGAVLPPGGKNRPKILDLPLKDVG 110


>gb|AFK40257.1| unknown [Lotus japonicus]
          Length = 129

 Score = 79.3 bits (194), Expect = 2e-12
 Identities = 45/101 (44%), Positives = 59/101 (58%)
 Frame = -1

Query: 707 ETKISEEEDQQKTPPINHPITSDQRPPREICCKPASTPDRLKVPKPFKYPERYMSPTDLM 528
           E + +E E +  TP    P   D     E+  + +  PD L+VPK FK+PERY SPTD +
Sbjct: 15  EAQSTEPECKTPTPIPQPPQNDDPNSTDEL--RKSLIPDPLRVPKAFKFPERYTSPTDSI 72

Query: 527 ISPVSKGLLARTRKPNGSNLLPPSKIQPKLQSFQVQEAGLF 405
           +SPV+KGLLAR +K  G   LPP K  PK+    +QE G F
Sbjct: 73  MSPVTKGLLARGKK--GVAKLPPGKYHPKIPDMSLQEVGPF 111

