BLASTX nr result

ID: Catharanthus23_contig00010032 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00010032
         (1146 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006341680.1| PREDICTED: uncharacterized protein LOC102589...   221   3e-55
ref|XP_004235710.1| PREDICTED: uncharacterized protein LOC101250...   221   4e-55
gb|EOY26757.1| Late embryogenesis abundant hydroxyproline-rich g...   199   2e-48
gb|EOY26756.1| Late embryogenesis abundant hydroxyproline-rich g...   199   2e-48
gb|EOY26755.1| Late embryogenesis abundant hydroxyproline-rich g...   199   2e-48
ref|XP_002284574.1| PREDICTED: uncharacterized protein LOC100254...   196   2e-47
emb|CBI28084.3| unnamed protein product [Vitis vinifera]              196   2e-47
ref|XP_002299644.2| hypothetical protein POPTR_0001s18090g [Popu...   195   3e-47
gb|EPS63390.1| hypothetical protein M569_11397, partial [Genlise...   189   2e-45
ref|XP_002279706.1| PREDICTED: uncharacterized protein LOC100258...   185   3e-44
ref|XP_006427015.1| hypothetical protein CICLE_v10026444mg [Citr...   184   4e-44
ref|XP_002331158.1| predicted protein [Populus trichocarpa] gi|5...   184   4e-44
ref|XP_004148717.1| PREDICTED: uncharacterized protein LOC101219...   183   1e-43
gb|EOY30818.1| Late embryogenesis abundant (LEA) hydroxyproline-...   182   2e-43
gb|EXB37026.1| hypothetical protein L484_020812 [Morus notabilis]     182   2e-43
ref|XP_003534194.1| PREDICTED: uncharacterized protein LOC100793...   181   5e-43
ref|XP_006465533.1| PREDICTED: uncharacterized protein LOC102615...   180   8e-43
ref|XP_006451136.1| hypothetical protein CICLE_v10009328mg [Citr...   180   1e-42
ref|XP_006451135.1| hypothetical protein CICLE_v10009328mg [Citr...   180   1e-42
gb|EOY26762.1| Late embryogenesis abundant hydroxyproline-rich g...   180   1e-42

>ref|XP_006341680.1| PREDICTED: uncharacterized protein LOC102589613 [Solanum tuberosum]
          Length = 221

 Score =  221 bits (564), Expect = 3e-55
 Identities = 105/195 (53%), Positives = 141/195 (72%), Gaps = 1/195 (0%)
 Frame = -1

Query: 966 QPQYVILLPHYY-DPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRF 790
           QPQY+I+LP YY  P ++  R  RR + C             LWPSDP++S+ RL+L   
Sbjct: 27  QPQYIIVLPQYYRTPRQFLRRPTRRYVYCAAVFILLSAALFFLWPSDPELSIARLKLRHL 86

Query: 789 HIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSS 610
            + +FP I++D+ LD+T K+RNK+FYS+++  +VISIGYRGKQLG+V S+ G  KAR SS
Sbjct: 87  KVHSFPKIAIDVTLDVTAKIRNKDFYSVNFRYVVISIGYRGKQLGHVISDYGRIKARASS 146

Query: 609 YVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINV 430
           YVNATL L+ + I +D+IPL+ED+ARG I FDTVTQI G+LGL  F++P++ KV CEI V
Sbjct: 147 YVNATLELTDISIFSDLIPLIEDLARGSITFDTVTQIGGELGLVLFDIPIKGKVVCEIVV 206

Query: 429 DIHNQTIEHQNCYPQ 385
           D  N+TI HQNCYP+
Sbjct: 207 DTRNETISHQNCYPE 221


>ref|XP_004235710.1| PREDICTED: uncharacterized protein LOC101250488 [Solanum
           lycopersicum]
          Length = 221

 Score =  221 bits (563), Expect = 4e-55
 Identities = 105/195 (53%), Positives = 140/195 (71%), Gaps = 1/195 (0%)
 Frame = -1

Query: 966 QPQYVILLPHYY-DPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRF 790
           QPQY+I+LP YY  P ++  R  RR + C             +WPSDP++S+ RL+L   
Sbjct: 27  QPQYIIVLPQYYRTPRQFLRRPTRRYVCCAAVFILLSAALFLIWPSDPELSIARLKLRHL 86

Query: 789 HIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSS 610
            + +FP I++D+ LD+T K+RNK+FYS+ +  +VISIGYRGKQLG+V S+ G  KAR SS
Sbjct: 87  KVHSFPKIAIDVTLDVTAKIRNKDFYSVGFRYVVISIGYRGKQLGHVISDYGRIKARASS 146

Query: 609 YVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINV 430
           YVNATL L+ V I +D+IPL+ED+ARG I FDTVTQI G+LGL  F++P++ KV CEI V
Sbjct: 147 YVNATLELTDVSIFSDLIPLIEDLARGSITFDTVTQIGGELGLVLFDIPIKGKVVCEIVV 206

Query: 429 DIHNQTIEHQNCYPQ 385
           D  N+TI HQNCYP+
Sbjct: 207 DTRNETISHQNCYPE 221


>gb|EOY26757.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           isoform 3 [Theobroma cacao]
          Length = 222

 Score =  199 bits (506), Expect = 2e-48
 Identities = 100/195 (51%), Positives = 135/195 (69%)
 Frame = -1

Query: 972 EPQPQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHR 793
           +P  Q  ++LP YY PT  +C     RI+C              WPSDP+V +VR+ + R
Sbjct: 28  QPPDQNYLVLP-YYRPTLRWCGC---RILCTASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 792 FHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGS 613
             + T PII+LDI+L +T+KVRN + YS+D+ SL +++GYRGK LG+V S  GH +A GS
Sbjct: 84  MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 612 SYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEIN 433
           SYV A L L+GVE+L+DV+ +LED+ARG + FDTVT++ G LGLS F+ PL+A+VSCEI 
Sbjct: 144 SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 432 VDIHNQTIEHQNCYP 388
           V+  NQTI  QNCYP
Sbjct: 204 VNRTNQTIIRQNCYP 218


>gb|EOY26756.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           isoform 2 [Theobroma cacao]
          Length = 249

 Score =  199 bits (506), Expect = 2e-48
 Identities = 100/195 (51%), Positives = 135/195 (69%)
 Frame = -1

Query: 972 EPQPQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHR 793
           +P  Q  ++LP YY PT  +C     RI+C              WPSDP+V +VR+ + R
Sbjct: 28  QPPDQNYLVLP-YYRPTLRWCGC---RILCTASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 792 FHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGS 613
             + T PII+LDI+L +T+KVRN + YS+D+ SL +++GYRGK LG+V S  GH +A GS
Sbjct: 84  MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 612 SYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEIN 433
           SYV A L L+GVE+L+DV+ +LED+ARG + FDTVT++ G LGLS F+ PL+A+VSCEI 
Sbjct: 144 SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 432 VDIHNQTIEHQNCYP 388
           V+  NQTI  QNCYP
Sbjct: 204 VNRTNQTIIRQNCYP 218


>gb|EOY26755.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           isoform 1 [Theobroma cacao]
          Length = 220

 Score =  199 bits (506), Expect = 2e-48
 Identities = 100/195 (51%), Positives = 135/195 (69%)
 Frame = -1

Query: 972 EPQPQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHR 793
           +P  Q  ++LP YY PT  +C     RI+C              WPSDP+V +VR+ + R
Sbjct: 28  QPPDQNYLVLP-YYRPTLRWCGC---RILCTASLVLLATSVYIFWPSDPEVKIVRMHVDR 83

Query: 792 FHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGS 613
             + T PII+LDI+L +T+KVRN + YS+D+ SL +++GYRGK LG+V S  GH +A GS
Sbjct: 84  MQLHTIPIIALDISLLVTLKVRNSDVYSVDFTSLDVAVGYRGKMLGHVTSEHGHVRAWGS 143

Query: 612 SYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEIN 433
           SYV A L L+GVE+L+DV+ +LED+ARG + FDTVT++ G LGLS F+ PL+A+VSCEI 
Sbjct: 144 SYVQAELELNGVEVLSDVVYMLEDLARGTVPFDTVTEVAGWLGLSLFKFPLKARVSCEIV 203

Query: 432 VDIHNQTIEHQNCYP 388
           V+  NQTI  QNCYP
Sbjct: 204 VNRTNQTIIRQNCYP 218


>ref|XP_002284574.1| PREDICTED: uncharacterized protein LOC100254347 [Vitis vinifera]
          Length = 212

 Score =  196 bits (497), Expect = 2e-47
 Identities = 97/189 (51%), Positives = 131/189 (69%)
 Frame = -1

Query: 954 VILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHIRTF 775
           V+LLP YY   R          +              L+PSDP V VV L L+   + T 
Sbjct: 23  VVLLPVYYPRRRRLLYRLCNAFLACAVFLSISAAVYLLYPSDPTVQVVGLHLNSVQVHTS 82

Query: 774 PIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYVNAT 595
           P+ISLD++LDLT++VRN++F+S  Y SL  S+GYRG++LG+V S+ G+ +ARGSSY+NAT
Sbjct: 83  PVISLDLSLDLTIRVRNRDFFSFSYTSLTASVGYRGRRLGFVNSSGGYLRARGSSYINAT 142

Query: 594 LALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDIHNQ 415
           L L G+E+L DV  LLED+ARG I FDTV+++ G+LGL FFE+PL+A+VSCE+ V+  NQ
Sbjct: 143 LDLDGIEVLHDVFYLLEDLARGSIPFDTVSEVRGKLGLFFFEIPLKARVSCEVYVNTSNQ 202

Query: 414 TIEHQNCYP 388
           TI HQ+CYP
Sbjct: 203 TIIHQDCYP 211


>emb|CBI28084.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score =  196 bits (497), Expect = 2e-47
 Identities = 97/189 (51%), Positives = 131/189 (69%)
 Frame = -1

Query: 954 VILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHIRTF 775
           V+LLP YY   R          +              L+PSDP V VV L L+   + T 
Sbjct: 23  VVLLPVYYPRRRRLLYRLCNAFLACAVFLSISAAVYLLYPSDPTVQVVGLHLNSVQVHTS 82

Query: 774 PIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYVNAT 595
           P+ISLD++LDLT++VRN++F+S  Y SL  S+GYRG++LG+V S+ G+ +ARGSSY+NAT
Sbjct: 83  PVISLDLSLDLTIRVRNRDFFSFSYTSLTASVGYRGRRLGFVNSSGGYLRARGSSYINAT 142

Query: 594 LALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDIHNQ 415
           L L G+E+L DV  LLED+ARG I FDTV+++ G+LGL FFE+PL+A+VSCE+ V+  NQ
Sbjct: 143 LDLDGIEVLHDVFYLLEDLARGSIPFDTVSEVRGKLGLFFFEIPLKARVSCEVYVNTSNQ 202

Query: 414 TIEHQNCYP 388
           TI HQ+CYP
Sbjct: 203 TIIHQDCYP 211


>ref|XP_002299644.2| hypothetical protein POPTR_0001s18090g [Populus trichocarpa]
           gi|550347583|gb|EEE84449.2| hypothetical protein
           POPTR_0001s18090g [Populus trichocarpa]
          Length = 220

 Score =  195 bits (495), Expect = 3e-47
 Identities = 94/189 (49%), Positives = 133/189 (70%)
 Frame = -1

Query: 951 ILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHIRTFP 772
           ++LP Y  PT   CR W   II               WPSDP V VVRLRL++ HI T P
Sbjct: 32  VVLPFYRHPTTQDCRRWPM-IIAIIFLLLLSTLVYVFWPSDPTVKVVRLRLNKIHIHTLP 90

Query: 771 IISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYVNATL 592
           II++DI+L +++KVRN + YS+D+ SL +++ YRGK+LG+VRS+ GH +A GSSYV+A +
Sbjct: 91  IINIDISLYVSLKVRNVDVYSMDFRSLDVAVKYRGKRLGHVRSDHGHVRALGSSYVHAGV 150

Query: 591 ALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDIHNQT 412
             SG+ +L+DV+ LL+D+ARG + FDTVT++ G+LGL FF  P++AK+ C + V+I+NQT
Sbjct: 151 DFSGISVLSDVVSLLDDLARGTVPFDTVTEVSGRLGLLFFGFPMKAKLFCAVLVNINNQT 210

Query: 411 IEHQNCYPQ 385
           I  Q CYP+
Sbjct: 211 IVRQTCYPE 219


>gb|EPS63390.1| hypothetical protein M569_11397, partial [Genlisea aurea]
          Length = 201

 Score =  189 bits (479), Expect = 2e-45
 Identities = 95/197 (48%), Positives = 130/197 (65%), Gaps = 4/197 (2%)
 Frame = -1

Query: 963 PQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXL----WPSDPDVSVVRLRLH 796
           PQYV++LP Y    R + R   RR +C                  WPSDP++ +  L+L 
Sbjct: 4   PQYVVVLPPYRPAGRRFFRASTRRRLCWFACVVLFLAAASAAYVFWPSDPELLISDLKLD 63

Query: 795 RFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARG 616
           R    T P+ISLD+ LDLT++VRN++FYSI+Y+SLV++I YRG++LG+  S  G  +ARG
Sbjct: 64  RLGFHTKPVISLDVTLDLTIQVRNRDFYSIEYDSLVVAIEYRGRRLGFATSGGGRIRARG 123

Query: 615 SSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEI 436
           SSYVNATL L  VE+L D + LLED ARG I FDTV++I G++ L   +LP++AKVSCE 
Sbjct: 124 SSYVNATLDLDAVEMLTDAVSLLEDFARGAIEFDTVSEIHGKIALFALQLPVKAKVSCEA 183

Query: 435 NVDIHNQTIEHQNCYPQ 385
            V+  +Q +  QNCYP+
Sbjct: 184 TVNTKSQIVTSQNCYPE 200


>ref|XP_002279706.1| PREDICTED: uncharacterized protein LOC100258307 [Vitis vinifera]
          Length = 237

 Score =  185 bits (470), Expect = 3e-44
 Identities = 93/189 (49%), Positives = 129/189 (68%)
 Frame = -1

Query: 951 ILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHIRTFP 772
           ++LP Y       C   R R++              LWPSDPDVS+VRLRL R  + TFP
Sbjct: 51  VVLPLYIPLLHRRCN--RFRLLTAASVLLLVASTFVLWPSDPDVSIVRLRLRRIAVHTFP 108

Query: 771 IISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYVNATL 592
            +SLD+++ L VKVRN + YS++Y SL ++I YRGK+LG V S +GH +ARGSS V+A+L
Sbjct: 109 RLSLDVSMSLMVKVRNVDLYSMNYRSLHVAIEYRGKELGNVTSEEGHVRARGSSLVDASL 168

Query: 591 ALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDIHNQT 412
            L+GV +L+DVI +LED+A+G I  DTVT++ G +G  FF+LPL  KVSC++ V+ + Q 
Sbjct: 169 ELNGVAVLSDVIFVLEDLAKGTIPIDTVTEVRGSMGFLFFQLPLRTKVSCQVYVNTNTQK 228

Query: 411 IEHQNCYPQ 385
           + HQNCYP+
Sbjct: 229 VLHQNCYPE 237


>ref|XP_006427015.1| hypothetical protein CICLE_v10026444mg [Citrus clementina]
           gi|557529005|gb|ESR40255.1| hypothetical protein
           CICLE_v10026444mg [Citrus clementina]
          Length = 219

 Score =  184 bits (468), Expect = 4e-44
 Identities = 89/198 (44%), Positives = 134/198 (67%), Gaps = 2/198 (1%)
 Frame = -1

Query: 972 EPQPQYVILLPHYY--DPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRL 799
           +PQ +   +LP+YY  +P R +C      +I               WPS+P++ + +L L
Sbjct: 27  QPQDENYTILPYYYLANPRRNWCATIAISLILLAALLYVF------WPSEPELKIEKLHL 80

Query: 798 HRFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKAR 619
             FH+R  P I +DI+L++T+KV N++ YS++Y SL +S+GYRG++LG+V+SN G  KA 
Sbjct: 81  AHFHVRMKPAICIDISLNVTLKVHNRDVYSVNYKSLDVSVGYRGRKLGHVKSNHGRVKAL 140

Query: 618 GSSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCE 439
            SSY++A L L  V++L+DV+ LLED+ARG + FDT+T++ G LGL F E PLEA+VSCE
Sbjct: 141 ASSYIDAELQLKCVKVLSDVVYLLEDLARGTVPFDTITKVTGHLGLFFLEFPLEARVSCE 200

Query: 438 INVDIHNQTIEHQNCYPQ 385
           + ++  +QTI  QNCYP+
Sbjct: 201 VLINTTSQTIARQNCYPK 218


>ref|XP_002331158.1| predicted protein [Populus trichocarpa]
           gi|566176693|ref|XP_006381711.1| hypothetical protein
           POPTR_0006s16220g [Populus trichocarpa]
           gi|550336462|gb|ERP59508.1| hypothetical protein
           POPTR_0006s16220g [Populus trichocarpa]
          Length = 210

 Score =  184 bits (468), Expect = 4e-44
 Identities = 86/193 (44%), Positives = 136/193 (70%)
 Frame = -1

Query: 963 PQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHI 784
           PQ VI+L +Y+ P  +   + RR ++              L+PSDP + + R++L+   +
Sbjct: 21  PQNVIVLSYYHRPPNH---ILRRCLLFTTAILLLSAAAYLLYPSDPAIQLSRIKLNHIRV 77

Query: 783 RTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYV 604
            + P ++LD++  LT+KV N++F+S+DY+SLV+S+GYRG++LG+V S  G  +AR SSYV
Sbjct: 78  NSSPELTLDVSFSLTIKVENRDFFSLDYDSLVVSVGYRGRELGFVNSKGGKIRARRSSYV 137

Query: 603 NATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDI 424
           +A L L+G+E++ DV  L++D+ARG IIFDT TQ+ G LGL  F++P+  +VSC++ V+ 
Sbjct: 138 DARLDLNGLEVIKDVFYLIQDLARGVIIFDTDTQVKGDLGLLLFKIPINGRVSCQVFVNT 197

Query: 423 HNQTIEHQNCYPQ 385
           +NQT+EHQ+CYPQ
Sbjct: 198 NNQTVEHQDCYPQ 210


>ref|XP_004148717.1| PREDICTED: uncharacterized protein LOC101219269 [Cucumis sativus]
           gi|449501291|ref|XP_004161330.1| PREDICTED:
           uncharacterized protein LOC101225993 [Cucumis sativus]
          Length = 215

 Score =  183 bits (465), Expect = 1e-43
 Identities = 89/192 (46%), Positives = 133/192 (69%)
 Frame = -1

Query: 960 QYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHIR 781
           Q V++L  Y  P   + RL R                  L+PSDP + +VRL+L+R  + 
Sbjct: 24  QNVVVLSLYRPPPCRHRRLLRLCAFYSAAFLLLFAVAFLLFPSDPSLQLVRLKLNRVKVH 83

Query: 780 TFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYVN 601
             P++SLD++  ++++VRNKNF+S++YN L +S+GYRG++LGYV S  G   ARGSSYVN
Sbjct: 84  LVPVVSLDLSFSVSLRVRNKNFFSLNYNFLGVSVGYRGRRLGYVSSEGGRVSARGSSYVN 143

Query: 600 ATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDIH 421
           ATL L+G+E++ DV+ LL D+ +G I FDT T ++G +GL F ++P++A+VSCE+ V+ +
Sbjct: 144 ATLDLNGLEVVHDVLYLLADLGKGIIPFDTETDVEGSMGLFFIKIPIKARVSCEVLVNTN 203

Query: 420 NQTIEHQNCYPQ 385
           NQTIEHQ+CYP+
Sbjct: 204 NQTIEHQDCYPE 215


>gb|EOY30818.1| Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein
           family isoform 1 [Theobroma cacao]
          Length = 214

 Score =  182 bits (463), Expect = 2e-43
 Identities = 91/195 (46%), Positives = 132/195 (67%)
 Frame = -1

Query: 969 PQPQYVILLPHYYDPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRF 790
           P  Q VI+LP YY       R  RR +I              L+PSDP + +VRL+L+  
Sbjct: 20  PNQQNVIVLPVYYSRPNQNYRCLRRCLIFTGIVVLLSAAVFFLYPSDPTLQLVRLQLNHV 79

Query: 789 HIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSS 610
            + + P ++LD++  LT++VRN++F+S+DY+ LV+S+GYRG++LG V S  G  +ARGSS
Sbjct: 80  RVNSSPALTLDLSFSLTIRVRNRDFFSLDYDKLVVSVGYRGRELGVVSSEGGRVRARGSS 139

Query: 609 YVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINV 430
           YVNATL L+G E++ DVI L+ D A+G I FDT T++DG LGL  F+ P++A+VSCE+ V
Sbjct: 140 YVNATLDLNGFEVVHDVIYLIADWAKGVIPFDTNTKVDGDLGLFLFKAPIKAEVSCEVYV 199

Query: 429 DIHNQTIEHQNCYPQ 385
           + +NQTI  Q+CY +
Sbjct: 200 NTNNQTIVRQDCYAE 214


>gb|EXB37026.1| hypothetical protein L484_020812 [Morus notabilis]
          Length = 250

 Score =  182 bits (462), Expect = 2e-43
 Identities = 91/193 (47%), Positives = 136/193 (70%), Gaps = 1/193 (0%)
 Frame = -1

Query: 960 QYVILLPHYY-DPTRYYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLRLHRFHI 784
           Q V++LP+Y   P++   R   R ++              L+PSDP + +VR+ L+R  +
Sbjct: 24  QNVVVLPYYRPSPSKRRSRRLCRCLLASAAVLLLIAAVFILYPSDPSLQLVRVHLNRVRV 83

Query: 783 RTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARGSSYV 604
            + P ++LD++  LTVKV N++F+S+DY+SL +S+GYRG++LG+V S+ G  +ARGSSYV
Sbjct: 84  NSSPDLTLDLSFFLTVKVFNRDFFSLDYDSLAVSVGYRGRELGFVNSDGGKIRARGSSYV 143

Query: 603 NATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEINVDI 424
           +ATL L+G  I+ DV  LLED+ARG I FDTVT+++G LGL  F++PL+A VSCE+ V+ 
Sbjct: 144 DATLDLNGFAIIQDVFYLLEDLARGVIPFDTVTKVEGNLGLFLFKIPLKASVSCEVYVNT 203

Query: 423 HNQTIEHQNCYPQ 385
           +NQTI  Q+CYP+
Sbjct: 204 NNQTIARQDCYPE 216


>ref|XP_003534194.1| PREDICTED: uncharacterized protein LOC100793858 [Glycine max]
          Length = 257

 Score =  181 bits (459), Expect = 5e-43
 Identities = 95/198 (47%), Positives = 135/198 (68%), Gaps = 3/198 (1%)
 Frame = -1

Query: 969 PQPQYVILLPHYYDPTRYYCRLWRRRII---CXXXXXXXXXXXXXLWPSDPDVSVVRLRL 799
           P PQ V++L   Y P  ++ R  RR II                 L+PSDP++ + R+RL
Sbjct: 45  PYPQNVVVLLPSYRP--HFQRRRRRCIIYSAALFLFLLVAGAAFLLYPSDPEIRLARIRL 102

Query: 798 HRFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKAR 619
            R  IRT P   LD++  LTVKVRN++F+S+ Y+SL +S+GYRG+QLG+V +  G  +AR
Sbjct: 103 DRIGIRTNPRPILDLSFSLTVKVRNRDFFSLSYDSLTVSVGYRGRQLGFVTAGGGSIRAR 162

Query: 618 GSSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCE 439
           GSSYV+ATL + G E++ D   LLED+A+G I FDT T+++G+LGL FF +PL+A VSCE
Sbjct: 163 GSSYVDATLTIDGFEVIYDAFYLLEDIAKGVIPFDTDTRVEGKLGLFFFTVPLKATVSCE 222

Query: 438 INVDIHNQTIEHQNCYPQ 385
           ++V+I+ QTI  Q+CYP+
Sbjct: 223 VDVNINQQTIVRQDCYPK 240


>ref|XP_006465533.1| PREDICTED: uncharacterized protein LOC102615257 [Citrus sinensis]
          Length = 219

 Score =  180 bits (457), Expect = 8e-43
 Identities = 89/199 (44%), Positives = 135/199 (67%), Gaps = 3/199 (1%)
 Frame = -1

Query: 972 EPQPQYVILLPHYY--DPTR-YYCRLWRRRIICXXXXXXXXXXXXXLWPSDPDVSVVRLR 802
           +PQ +   +LP+YY  +P R +Y  +    I+               WPS+P++ + RL 
Sbjct: 27  QPQDENYTILPYYYLENPRRNWYATIAISLILLAALLYVF-------WPSEPELKIERLH 79

Query: 801 LHRFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKA 622
           L  FH+R  P I +DI+L++T+KV N++ YS++Y SL +S+GYRG++LG+V+SN G  KA
Sbjct: 80  LAHFHVRMKPAICIDISLNVTLKVHNRDVYSVNYKSLDVSVGYRGRKLGHVKSNHGRVKA 139

Query: 621 RGSSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSC 442
             SS+++A L L  V++L+DV+ LLED+ARG + FDT+T++ G LGL F E PLEA+VSC
Sbjct: 140 LASSFIDAELQLKCVKVLSDVVYLLEDLARGTVPFDTITKVTGHLGLFFLEFPLEARVSC 199

Query: 441 EINVDIHNQTIEHQNCYPQ 385
           E+ ++  +QTI  QNCYP+
Sbjct: 200 EVLINTTSQTIARQNCYPK 218


>ref|XP_006451136.1| hypothetical protein CICLE_v10009328mg [Citrus clementina]
           gi|568843422|ref|XP_006475609.1| PREDICTED:
           uncharacterized protein LOC102611769 [Citrus sinensis]
           gi|557554362|gb|ESR64376.1| hypothetical protein
           CICLE_v10009328mg [Citrus clementina]
          Length = 213

 Score =  180 bits (456), Expect = 1e-42
 Identities = 93/197 (47%), Positives = 133/197 (67%), Gaps = 7/197 (3%)
 Frame = -1

Query: 954 VILLPHYYDPTRYYCRLWRRR-----IICXXXXXXXXXXXXXL--WPSDPDVSVVRLRLH 796
           VI+LP YY P     R WRRR      +C                +PSDP + + R+ L+
Sbjct: 20  VIVLPVYYQPD---LRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLN 76

Query: 795 RFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARG 616
              + + P  +LD++  L VKV N++F+S++Y+SL +SIGYRG++LG VRS+ G  +ARG
Sbjct: 77  HIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARG 136

Query: 615 SSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEI 436
           SSYVNA+L L+G+E++ DVI L+ED+ +G I FDTVT + G+LG+ FFE+PL+AKVSCE+
Sbjct: 137 SSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEV 196

Query: 435 NVDIHNQTIEHQNCYPQ 385
            V+  NQTI  Q+CYP+
Sbjct: 197 YVNTSNQTIVRQDCYPE 213


>ref|XP_006451135.1| hypothetical protein CICLE_v10009328mg [Citrus clementina]
           gi|557554361|gb|ESR64375.1| hypothetical protein
           CICLE_v10009328mg [Citrus clementina]
          Length = 240

 Score =  180 bits (456), Expect = 1e-42
 Identities = 93/197 (47%), Positives = 133/197 (67%), Gaps = 7/197 (3%)
 Frame = -1

Query: 954 VILLPHYYDPTRYYCRLWRRR-----IICXXXXXXXXXXXXXL--WPSDPDVSVVRLRLH 796
           VI+LP YY P     R WRRR      +C                +PSDP + + R+ L+
Sbjct: 20  VIVLPVYYQPD---LRRWRRRRNLSRCLCTAAAIASLLAVLVFIFYPSDPYLQLARIHLN 76

Query: 795 RFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQLGYVRSNQGHFKARG 616
              + + P  +LD++  L VKV N++F+S++Y+SL +SIGYRG++LG VRS+ G  +ARG
Sbjct: 77  HIRVNSSPQPTLDLSFSLVVKVHNRDFFSLNYDSLDVSIGYRGRELGSVRSHGGRVRARG 136

Query: 615 SSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGLSFFELPLEAKVSCEI 436
           SSYVNA+L L+G+E++ DVI L+ED+ +G I FDTVT + G+LG+ FFE+PL+AKVSCE+
Sbjct: 137 SSYVNASLKLNGLEVIHDVIYLIEDLIKGVIPFDTVTMVKGELGVLFFEIPLKAKVSCEV 196

Query: 435 NVDIHNQTIEHQNCYPQ 385
            V+  NQTI  Q+CYP+
Sbjct: 197 YVNTSNQTIVRQDCYPE 213


>gb|EOY26762.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           isoform 4 [Theobroma cacao]
          Length = 167

 Score =  180 bits (456), Expect = 1e-42
 Identities = 83/151 (54%), Positives = 116/151 (76%)
 Frame = -1

Query: 840 WPSDPDVSVVRLRLHRFHIRTFPIISLDINLDLTVKVRNKNFYSIDYNSLVISIGYRGKQ 661
           WPS P+V +VR+ + R  + T PII+LDI+L +T+KVRN + YS+D+ SL +++GYRGK 
Sbjct: 15  WPSQPEVKIVRMHVKRMQMHTVPIIALDISLLVTLKVRNSDVYSMDFTSLDMAVGYRGKM 74

Query: 660 LGYVRSNQGHFKARGSSYVNATLALSGVEILADVIPLLEDVARGEIIFDTVTQIDGQLGL 481
           LG+V+S   H +A GSSY+ A L L+GVE+L+DV+ +LED+ARG + FDT+T++ G LGL
Sbjct: 75  LGHVKSEHDHLRAWGSSYLQAELELNGVEVLSDVVYMLEDLARGTVPFDTITEVAGWLGL 134

Query: 480 SFFELPLEAKVSCEINVDIHNQTIEHQNCYP 388
           S F+ PL+ K+SCEI V+  NQ I HQNCYP
Sbjct: 135 SLFKFPLKVKISCEIVVNRTNQIIIHQNCYP 165


Top