BLASTX nr result

ID: Rauwolfia21_contig00002398 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00002398
         (902 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241...   190   7e-46
gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus pe...   183   7e-44
ref|XP_002509872.1| conserved hypothetical protein [Ricinus comm...   182   1e-43
gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich g...   178   3e-42
ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578...   176   1e-41
gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]     174   5e-41
ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620...   169   1e-39
ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citr...   169   1e-39
gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich g...   165   2e-38
gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g...   162   1e-37
gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich g...   155   2e-35
ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297...   155   2e-35
gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g...   152   2e-34
gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g...   147   5e-33
gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g...   145   2e-32
ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306...   145   3e-32
gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich g...   144   6e-32
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   144   6e-32
ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, part...   143   8e-32
gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich g...   143   8e-32

>ref|XP_002272642.1| PREDICTED: uncharacterized protein LOC100241699 [Vitis vinifera]
          Length = 213

 Score =  190 bits (482), Expect = 7e-46
 Identities = 100/212 (47%), Positives = 141/212 (66%), Gaps = 1/212 (0%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRTYPRSDAESVITKPRNDRR-ERSSKCFVFILAFVVLHCIALLVFAL 275
           M ED+   PLAP R + +SD E  + KPR  +   RSSKC V++LA +V      LVFAL
Sbjct: 1   MPEDNQFQPLAPARLHGKSDEEFGVFKPRASKPPRRSSKCPVYVLAGLVTLAAIALVFAL 60

Query: 276 VVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYG 455
            VLR+ +P V+++ V +KNL + T+PS S N+T+ AEV+++N NFG F F+N + TV Y 
Sbjct: 61  AVLRVEAPDVELKSVAVKNLTHGTSPSPSFNVTLTAEVSVQNKNFGAFNFENGTATVLYE 120

Query: 456 NTTVGSRSISGGRVKARETTRMNVTVQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRGEI 635
              VG    S   V++R+T RMNVT+ VR+     + +LSS+I SG V L++YA++ G++
Sbjct: 121 GMVVGDEEFSKAHVESRKTKRMNVTLDVRSDRLWNDKNLSSDISSGSVNLTTYAQVTGKV 180

Query: 636 RVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           RV+K + RR TA MNC+MTL L+S  IQDL C
Sbjct: 181 RVMKVVRRRTTARMNCSMTLNLTSSSIQDLVC 212


>gb|EMJ11860.1| hypothetical protein PRUPE_ppa022176mg [Prunus persica]
          Length = 213

 Score =  183 bits (465), Expect = 7e-44
 Identities = 98/214 (45%), Positives = 142/214 (66%), Gaps = 3/214 (1%)
 Frame = +3

Query: 99  MAEDDHAA-PLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFAL 275
           MAE +    PLAP R + RSD E+     R  RRERS+KCFV++ A +VL  I +LVFAL
Sbjct: 1   MAEQESQVWPLAPSRLHRRSDEENPTF--RAIRRERSNKCFVYVFAAIVLQSIFILVFAL 58

Query: 276 VVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYG 455
           VVLR+ SP   +  V +K+L++TT+P++SLN T+V E+ IKN NFG +KF+ SS ++ YG
Sbjct: 59  VVLRVKSPGFNLSSVSVKSLKHTTSPTSSLNATLVTELAIKNKNFGEYKFEGSSASLWYG 118

Query: 456 NTTVGSRSISGGRVKARETTRMNVTVQVRASDF--EGNGDLSSEIGSGLVKLSSYAKLRG 629
              VG   I  GRVKAR T R+++++ VR++    E       E+ SG +K+SSYAKL G
Sbjct: 119 GFKVGEAKIGKGRVKARGTRRVSLSIDVRSNRLPQEAKNGFEGEMNSGYLKISSYAKLTG 178

Query: 630 EIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           ++ ++K + +R+T   NCTM + L S+ ++DL C
Sbjct: 179 KVNLMKIMKKRKTIDTNCTMVVVLKSRTVKDLFC 212


>ref|XP_002509872.1| conserved hypothetical protein [Ricinus communis]
           gi|223549771|gb|EEF51259.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 221

 Score =  182 bits (462), Expect = 1e-43
 Identities = 98/220 (44%), Positives = 138/220 (62%), Gaps = 9/220 (4%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFALV 278
           M ED+   PLAP  T PRSD E    KP    +ERSSKC V++LA +V+    +LVFALV
Sbjct: 1   MVEDNQIVPLAPAETNPRSDEEFAAVKPNLRLQERSSKCLVYVLAGIVILSAVILVFALV 60

Query: 279 VLRINSPRVKVELVEIKNLRYTTAPSA--------SLNMTMVAEVTIKNANFGRFKFQNS 434
           VLR  +P  ++  V +K+L Y              + NMT+ +E+ I+N+NFG FK+ N+
Sbjct: 61  VLRPVNPNAELSFVRLKDLNYAAGSGGNGNNVSLPAFNMTLESELKIENSNFGEFKYDNT 120

Query: 435 STTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRASDFEGNG-DLSSEIGSGLVKLSS 611
           S  V YG   VG   +  GRV AR+T RMNV V+VR+  +  NG DL+S+I SG++KL+S
Sbjct: 121 SARVFYGGMAVGEAILREGRVSARDTLRMNVKVEVRSHKYIYNGTDLTSDINSGILKLNS 180

Query: 612 YAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           +AK  G + +L+   +RR+A M+C+ +L L S+ IQDL C
Sbjct: 181 HAKFSGRVNLLQIAKKRRSASMDCSFSLDLRSRSIQDLVC 220


>gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 213

 Score =  178 bits (451), Expect = 3e-42
 Identities = 93/212 (43%), Positives = 135/212 (63%), Gaps = 1/212 (0%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRTYPRSDAESVITKPR-NDRRERSSKCFVFILAFVVLHCIALLVFAL 275
           M ED  A PLAP   YPRSD E    KP  + R+E+SSKC V++L  +V+    LL+FA 
Sbjct: 1   MQEDPQAKPLAPVEYYPRSDMEFGGIKPTASQRKEKSSKCLVYVLVGMVIQGAVLLIFAS 60

Query: 276 VVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYG 455
           +VLR  +P V++  V ++NL+Y  + + S N+T+V EVT++N+NFG FKF+N++ TV  G
Sbjct: 61  IVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVWCG 120

Query: 456 NTTVGSRSISGGRVKARETTRMNVTVQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRGEI 635
           +  VG   I  GR +AR T R+NV+V V +       ++S  I SGL++L+S+ KL G++
Sbjct: 121 SVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSGKV 180

Query: 636 RVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
            ++  + RRR   MNC MTL L+ Q  QD  C
Sbjct: 181 SIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPC 212


>ref|XP_006343917.1| PREDICTED: uncharacterized protein LOC102578735 [Solanum tuberosum]
          Length = 223

 Score =  176 bits (445), Expect = 1e-41
 Identities = 89/223 (39%), Positives = 152/223 (68%), Gaps = 12/223 (5%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRTYPRSDAESVITKP-------RNDRRERSSKCFVFILAFVVLHCIA 257
           MA+D H  PLAPPR YP+SD    ++K         N++  +S KCFV+ L+ +V+  I 
Sbjct: 1   MAQDSHIIPLAPPRAYPKSDQGLNLSKSINYYNNHNNNKNRKSGKCFVYFLSTIVILSII 60

Query: 258 LLVFALVVLRINSPRVKVELVEIKNLRYTTAP-SASLNMTMVAEVTIKNANFGRFKFQNS 434
           +L+F++V  R  SP  +++ + ++NLR++ +  S+S NM M  E+ + N NFG+  +Q+S
Sbjct: 61  MLIFSMVFFRFKSPSFELDHINVQNLRFSNSTNSSSFNMNMGGEIIVDNDNFGQINYQDS 120

Query: 435 STTVS-YGNTTVGSRSISGGRVKARETTRMNVTVQVRASDFEGN---GDLSSEIGSGLVK 602
           S +V  Y N T+G  +++ GRV+AR++ R+ +++Q+R ++++ N   G+LSS+I S ++K
Sbjct: 121 SMSVFLYDNVTIGIANVNVGRVEARKSKRIGISLQLR-TNYQLNYSYGNLSSDINSRMLK 179

Query: 603 LSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           L+S+ + RG+++ +K I++ +T+IMNCTM L L+SQ IQDL C
Sbjct: 180 LTSFGEFRGKVKAMKIISKHKTSIMNCTMNLNLTSQAIQDLLC 222


>gb|EXC34335.1| hypothetical protein L484_006690 [Morus notabilis]
          Length = 213

 Score =  174 bits (440), Expect = 5e-41
 Identities = 91/211 (43%), Positives = 138/211 (65%), Gaps = 2/211 (0%)
 Frame = +3

Query: 105 EDDHAAPLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFALVVL 284
           ++  + PLAP R + RSD E+   K    R+ER++KCFV+I A +V+    LL+FAL+VL
Sbjct: 4   QESQSWPLAPMRVHQRSDEENPAFKAL--RKERTNKCFVYIFAGIVILGAILLIFALIVL 61

Query: 285 RINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKF-QNSSTTVSYGNT 461
           R  SP +K++ V +K+L Y+T+P  SLN T++A V IKN NFG ++F  N+S    YG  
Sbjct: 62  RSKSPEIKLKSVTVKSLDYSTSPWPSLNATLIATVAIKNPNFGPYRFGSNNSAVFLYGGG 121

Query: 462 TVGSRSISGGRVKARETTRMNVTVQVRASDF-EGNGDLSSEIGSGLVKLSSYAKLRGEIR 638
            +G + I  G+  A+ T R+NVTV++R S   +G+ +L  ++ SG+V LSSY K  G + 
Sbjct: 122 KLGEQRIRQGKATAKATKRVNVTVEIRTSRLPQGSNNLGGDLSSGMVNLSSYCKFTGRVH 181

Query: 639 VLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           ++K    R+TA MNC MTL L ++ I++L+C
Sbjct: 182 LIKIFENRKTAEMNCAMTLVLKTKMIKNLRC 212


>ref|XP_006477513.1| PREDICTED: uncharacterized protein LOC102620163 [Citrus sinensis]
          Length = 214

 Score =  169 bits (428), Expect = 1e-39
 Identities = 94/214 (43%), Positives = 137/214 (64%), Gaps = 3/214 (1%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRT-YPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFAL 275
           MAE++   PLAPPR  YPRSD E       + R+  SSKC V++L  +V    ALL+ A 
Sbjct: 1   MAEENPKFPLAPPRNEYPRSDQEYAPAVIESQRK--SSKCLVYVLVTIVTVSAALLISAS 58

Query: 276 VVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYG 455
           + LR N+P V++E V +KNL +    S S N+T+V E+TI N N+G F+++N S +V YG
Sbjct: 59  IFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYG 118

Query: 456 NTTVGSRSISGGRVKARETTRMNVT--VQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRG 629
           + TVG   I  GRV+ARE  R+NVT  V VR++    N +L S+I SG+VKL+SYAKL G
Sbjct: 119 SVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLRSDINSGIVKLNSYAKLHG 178

Query: 630 EIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
            + +   + + +T  ++C+M L L+ + ++DL C
Sbjct: 179 NVSLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>ref|XP_006439452.1| hypothetical protein CICLE_v10023929mg [Citrus clementina]
           gi|557541714|gb|ESR52692.1| hypothetical protein
           CICLE_v10023929mg [Citrus clementina]
          Length = 214

 Score =  169 bits (428), Expect = 1e-39
 Identities = 94/214 (43%), Positives = 137/214 (64%), Gaps = 3/214 (1%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRT-YPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFAL 275
           MAE++   PLAPPR  YPRSD E       + R+  SSKC V++L  +V    ALL+ A 
Sbjct: 1   MAEENPKIPLAPPRNEYPRSDQEYAPAVIESQRK--SSKCLVYVLVTIVTVSAALLISAS 58

Query: 276 VVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYG 455
           + LR N+P V++E V +KNL +    S S N+T+V E+TI N N+G F+++N S +V YG
Sbjct: 59  IFLRPNTPEVQLESVTVKNLSHGNGTSPSFNVTLVTELTIDNENYGYFEYKNCSGSVFYG 118

Query: 456 NTTVGSRSISGGRVKARETTRMNVT--VQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRG 629
           + TVG   I  GRV+ARE  R+NVT  V VR++    N +LSS+  SG+VKL+SYAKL G
Sbjct: 119 SVTVGDVKIRDGRVEAREVKRINVTVDVDVRSNGNLDNQNLSSDRNSGIVKLNSYAKLHG 178

Query: 630 EIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
            + +   + + +T  ++C+M L L+ + ++DL C
Sbjct: 179 NVNLFNVLKKTKTPELDCSMNLVLARRAVEDLVC 212


>gb|EOY23364.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 191

 Score =  165 bits (418), Expect = 2e-38
 Identities = 82/181 (45%), Positives = 125/181 (69%), Gaps = 1/181 (0%)
 Frame = +3

Query: 192 RRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSA-SLN 368
           RR+R+ KC  +I+A V+   I +L+F ++V+RI +P+V++  V ++NL   ++ S+ S +
Sbjct: 10  RRKRNIKCLAYIVAGVIAQTIIILLFVMLVMRIRNPKVRLGGVTVENLNLNSSSSSPSFS 69

Query: 369 MTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRAS 548
           M + A+VT+KN NFG FKFQNS+ T+SY  T VG  +I   R +AR TT++NVTV V + 
Sbjct: 70  MNLNAQVTVKNTNFGHFKFQNSTLTISYRGTPVGEATIVKARARARSTTKLNVTVSVSSD 129

Query: 549 DFEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQ 728
               N  LSS++GSG + LSS+AKL G+I + K   ++++A MNCTM +  SS+QIQ+L 
Sbjct: 130 KMSRNSALSSDVGSGTINLSSHAKLDGKIHLFKVFKKKKSAEMNCTMEVTTSSKQIQNLM 189

Query: 729 C 731
           C
Sbjct: 190 C 190


>gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 201

 Score =  162 bits (411), Expect = 1e-37
 Identities = 75/181 (41%), Positives = 123/181 (67%), Gaps = 1/181 (0%)
 Frame = +3

Query: 192 RRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSA-SLN 368
           +R++  K F +  AFVV   I +LVF+L V+RI +P+ +V  + ++++ YT+ P+  S N
Sbjct: 20  KRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAYTSTPNPPSFN 79

Query: 369 MTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRAS 548
           M   AEV +KN NFG FKF N++ +  YG   VG   ++ GR KAR T +MNVTV + ++
Sbjct: 80  MKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKKMNVTVDLNSN 139

Query: 549 DFEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQ 728
           +   N +L+S+I SG + L+++ KL G++ ++K I ++++A MNCTMT+ L+S+ IQD++
Sbjct: 140 NIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVNLASRAIQDIK 199

Query: 729 C 731
           C
Sbjct: 200 C 200


>gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 188

 Score =  155 bits (392), Expect = 2e-35
 Identities = 72/180 (40%), Positives = 117/180 (65%)
 Frame = +3

Query: 192 RRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSASLNM 371
           +R ++ KC+ +I+A VV   I +LVFAL V+RI +P  ++  V +++L Y  +     NM
Sbjct: 8   KRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPHFNM 67

Query: 372 TMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRASD 551
            ++ E+ +KN NFG F+F N++  V++G+  VG   I   R +AR+T RMNVTV V +S 
Sbjct: 68  RLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVSSSA 127

Query: 552 FEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
                +L +++ SG + L+  A+LRG++ ++K + +R+TA MNCTMT+ L+S  +QDL C
Sbjct: 128 VSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQDLDC 187


>ref|XP_004300835.1| PREDICTED: uncharacterized protein LOC101297644 [Fragaria vesca
           subsp. vesca]
          Length = 210

 Score =  155 bits (392), Expect = 2e-35
 Identities = 78/204 (38%), Positives = 133/204 (65%), Gaps = 1/204 (0%)
 Frame = +3

Query: 123 PLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPR 302
           PLAP + + RS+        +  RRERS+KCFV++ + +V  C+ +LVFAL+VLR+ SP 
Sbjct: 10  PLAPGKLHQRSEENPTF---KAIRRERSNKCFVYVFSGIVFFCVTVLVFALLVLRVKSPE 66

Query: 303 VKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSI 482
           +++  V +K+L+YT++P  S N+++  ++++KN NFG ++F  ++ +  Y    VGS  +
Sbjct: 67  IRLRSVTVKSLKYTSSP-PSFNVSLSGQMSVKNPNFGDYEFVPTTVSFLYSRGAVGSTKV 125

Query: 483 SGGRVKARETTRMNVTVQVRASDF-EGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITR 659
           + G  K ++T R++  V +R++   EG   L S+I SG++KL+   K+ G++ + K I +
Sbjct: 126 AKGLAKVKKTERLSFGVDLRSNKLPEGANTLKSDINSGMLKLTGTGKVSGKVTLWKIINK 185

Query: 660 RRTAIMNCTMTLKLSSQQIQDLQC 731
           R+T  M+CTMTL L S+ I+DL C
Sbjct: 186 RKTGKMDCTMTLVLKSKTIKDLVC 209


>gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 215

 Score =  152 bits (384), Expect = 2e-34
 Identities = 77/214 (35%), Positives = 132/214 (61%), Gaps = 3/214 (1%)
 Frame = +3

Query: 99  MAEDDHAA-PLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFAL 275
           MAE D    PLAP   +PRSD ES   + +  +R++  K  V+I AF V   + +L+FAL
Sbjct: 1   MAEKDQQVHPLAPANGHPRSDEESASLQSKELKRKKRIKYAVYIAAFAVFQTVVILIFAL 60

Query: 276 VVLRINSPRVKVELVEIKNLRYT-TAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSY 452
            V+R+ +P+V++  V ++ +  + T  +AS N+  + +VT+KN NFG +KF N++ +  Y
Sbjct: 61  TVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDNATMSFLY 120

Query: 453 GNTTVGSRSISGGRVKARETTRMNVTVQVRASDFEG-NGDLSSEIGSGLVKLSSYAKLRG 629
               VG   I   R +AR T +++VTV+V +S        L SE+ S ++ L+S AKL+G
Sbjct: 121 DGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLNSQAKLKG 180

Query: 630 EIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           ++ ++K + ++++  MNCT+   +S++ +QDL+C
Sbjct: 181 KVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKC 214


>gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao]
          Length = 185

 Score =  147 bits (371), Expect = 5e-33
 Identities = 65/183 (35%), Positives = 118/183 (64%)
 Frame = +3

Query: 183 RNDRRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSAS 362
           + + +  ++KC  ++  FVV     +L+FAL V+RI +P+V+   V ++N     + S  
Sbjct: 2   KGEGKRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTVENFSTGNSSSPF 61

Query: 363 LNMTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVR 542
            +M ++A+VT+KN NFG FK++NSS  + YG   VG  +I   R +AR+T + +VT+ + 
Sbjct: 62  FDMRLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDIS 121

Query: 543 ASDFEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQD 722
           +S    N +L ++I SG++ LSS AKL G++ ++K I +++++ M+CTM + + ++ +QD
Sbjct: 122 SSKLSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQD 181

Query: 723 LQC 731
           L+C
Sbjct: 182 LKC 184


>gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 259

 Score =  145 bits (367), Expect = 2e-32
 Identities = 68/195 (34%), Positives = 124/195 (63%), Gaps = 2/195 (1%)
 Frame = +3

Query: 153 SDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKN 332
           SD    + + +  +R++  KC  ++ AFV+     +LVFAL V+RI +P+ ++  V + +
Sbjct: 4   SDVAFPMEQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDD 63

Query: 333 LRYTTAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKAR-- 506
           L +  + S S NM  +A+VT+KN NFG +KF+NS+ T +Y  + VG   ++ GR +AR  
Sbjct: 64  LTFNNS-SPSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARAR 122

Query: 507 ETTRMNVTVQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCT 686
            T +MNVT+ + ++    + DL S++ SG + L+S + L G++ ++K I ++++  MNCT
Sbjct: 123 STKKMNVTMDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCT 182

Query: 687 MTLKLSSQQIQDLQC 731
           MT+ L+ + ++D++C
Sbjct: 183 MTVNLAQKLVRDIKC 197


>ref|XP_004306727.1| PREDICTED: uncharacterized protein LOC101306460 [Fragaria vesca
           subsp. vesca]
          Length = 219

 Score =  145 bits (365), Expect = 3e-32
 Identities = 73/213 (34%), Positives = 117/213 (54%), Gaps = 2/213 (0%)
 Frame = +3

Query: 99  MAEDDHAAPLAPPRTYPRSDAESVITKPRNDRRERSSKCFVFILAFVVLHCIALLVFALV 278
           M E + A PLAP    P SD        +  RR++   C   I A V++  + +++ A  
Sbjct: 1   MVEKEQARPLAPAGYRPSSDDNEAALHMKIARRKKFINCCGCITAIVLIQAVVIIILAFT 60

Query: 279 VLRINSPRVKVELVEIKNLRYT--TAPSASLNMTMVAEVTIKNANFGRFKFQNSSTTVSY 452
           V R+  P++ +  V +  L     T P    N+++ A+V++KN N   FK+ N++TT+ Y
Sbjct: 61  VFRVKEPKIMMNKVTVTKLELVNGTTPKPGTNISLTADVSVKNPNVASFKYSNTTTTLYY 120

Query: 453 GNTTVGSRSISGGRVKARETTRMNVTVQVRASDFEGNGDLSSEIGSGLVKLSSYAKLRGE 632
             T VG      GR KAR T RMN+TV +       N +L +++GSGL+ +SSY+++ G 
Sbjct: 121 HGTVVGEARGPPGRAKARRTMRMNITVDIITDILTTNPNLKTDVGSGLLTMSSYSRIPGR 180

Query: 633 IRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
           + +L  + +     MNCTMT+ +SSQ IQ+ +C
Sbjct: 181 VNMLNIVKKHVVVKMNCTMTVNISSQAIQEQKC 213


>gb|EOY24869.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao]
          Length = 226

 Score =  144 bits (362), Expect = 6e-32
 Identities = 67/174 (38%), Positives = 115/174 (66%), Gaps = 1/174 (0%)
 Frame = +3

Query: 192 RRERSS-KCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSASLN 368
           RRE S+ KC  ++ AFVV     +L+FAL V+RI SP+V+   V +++     + S S +
Sbjct: 4   RREGSNAKCLAYVAAFVVFQTAIILLFALTVMRIRSPKVRFGAVTVESFSTVNSSSPSFD 63

Query: 369 MTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRAS 548
           M ++A+V +KN NFG FK++NS+ T+ YG   VG  +I  GR +AR+T + N+ V + +S
Sbjct: 64  MKLMAQVAVKNTNFGHFKYENSTVTILYGGMPVGEAAIFKGRARARQTKKFNINVDISSS 123

Query: 549 DFEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQ 710
               N +L ++I +G++ LSS AKL+G++ ++K I ++++  M+CTM + L+++
Sbjct: 124 RLSSNSNLGNDINAGVLPLSSQAKLKGKVHLMKVIKKKKSGEMSCTMGINLATR 177


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  144 bits (362), Expect = 6e-32
 Identities = 70/180 (38%), Positives = 113/180 (62%)
 Frame = +3

Query: 192 RRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSASLNM 371
           RR++S KC  ++ AFVV     +L+F L+VL+I  P+V++  + ++N  ++T    S +M
Sbjct: 9   RRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTN---SFSM 65

Query: 372 TMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRASD 551
            + A VT+KN NFG FKF NS+ T+SY  T VG  +I   R ++R T R N+TV + +S 
Sbjct: 66  DLKARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSK 125

Query: 552 FEGNGDLSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
              +  L  ++ SG++ LSS AKL G+I + K   ++++A M+CTM L  ++  I++L C
Sbjct: 126 VNNHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSC 185


>ref|XP_006368771.1| hypothetical protein POPTR_0001s09980g, partial [Populus
           trichocarpa] gi|550346930|gb|ERP65340.1| hypothetical
           protein POPTR_0001s09980g, partial [Populus trichocarpa]
          Length = 173

 Score =  143 bits (361), Expect = 8e-32
 Identities = 76/169 (44%), Positives = 115/169 (68%), Gaps = 1/169 (0%)
 Frame = +3

Query: 228 LAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSASLNMTMVAEVTIKNAN 407
           LA +V+    +LVFA++V +  +PRVK+  V +++L Y   P  S NMT+ AEV++KN+N
Sbjct: 7   LALIVILSAIILVFAIIV-KPRTPRVKLSSVAVEHLSYGNNPIPSFNMTLAAEVSVKNSN 65

Query: 408 FGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQVRA-SDFEGNGDLSSEI 584
           F RFKF+N+S++  Y    VG   +  GRV AR+T RMN+ V++ +        +LSS+I
Sbjct: 66  FVRFKFENTSSSALYKGMVVGEAKLRSGRVGARKTRRMNIVVKIGSPGSLSEAKNLSSDI 125

Query: 585 GSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQIQDLQC 731
            SG++K++SYA L+G++R L  I + RTA+M+C M L LSS+ IQDL+C
Sbjct: 126 NSGMLKMNSYATLKGDVR-LFGIVKNRTAVMSCGMNLNLSSRSIQDLEC 173


>gb|EOY13742.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 192

 Score =  143 bits (361), Expect = 8e-32
 Identities = 76/186 (40%), Positives = 124/186 (66%), Gaps = 3/186 (1%)
 Frame = +3

Query: 183 RNDRRERSSKCFVFILAFVVLHCIALLVFALVVLRINSPRVKVELVEIKNLRYTTAPSA- 359
           +  R +R+ KC+  ++A V+   I +L+F L+V+RI +P+V++  V ++NLR +++ S+ 
Sbjct: 6   QTSRGKRNIKCWAIVVAGVIAKTIIILLFVLIVMRIRNPKVRLGGVTVENLRASSSSSSP 65

Query: 360 SLNMTMVAEVTIKNANFGRFKFQNSSTTVSYGNTTVGSRSISGGRVKARETTRMNVTVQV 539
           S +  + A+V++KN NFG FKF+NS+ T+SY  + VG  +I  G  +AR T + NVT+ V
Sbjct: 66  SFSTKLNAQVSVKNTNFGHFKFKNSTLTISYNGSPVGKATIVEGLARARSTKKFNVTILV 125

Query: 540 RASD-FEGNGD-LSSEIGSGLVKLSSYAKLRGEIRVLKKITRRRTAIMNCTMTLKLSSQQ 713
            +++    N D LSS+I SG + LSS+AKL G+I + K   ++++A MNCTM +  S +Q
Sbjct: 126 SSNNKISRNSDQLSSDIESGTINLSSHAKLEGKIHLFKIFKKKKSAEMNCTMDVNTSLKQ 185

Query: 714 IQDLQC 731
           IQ L C
Sbjct: 186 IQKLTC 191


Top