BLASTX nr result

ID: Forsythia23_contig00023402 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00023402
         (1054 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012828647.1| PREDICTED: uncharacterized protein LOC105949...   180   1e-42
gb|EYU18126.1| hypothetical protein MIMGU_mgv1a026169mg, partial...   180   1e-42
ref|XP_009776837.1| PREDICTED: uncharacterized protein LOC104226...   138   6e-30
ref|XP_009788019.1| PREDICTED: uncharacterized protein LOC104235...   136   3e-29
ref|XP_012835824.1| PREDICTED: uncharacterized protein LOC105956...   135   4e-29
ref|XP_009627615.1| PREDICTED: uncharacterized protein LOC104118...   131   9e-28
ref|XP_012074425.1| PREDICTED: protein YLS9 [Jatropha curcas]         118   6e-24
gb|EYU38691.1| hypothetical protein MIMGU_mgv1a020389mg, partial...   118   6e-24
emb|CDP10609.1| unnamed protein product [Coffea canephora]            116   3e-23
ref|XP_006290371.1| hypothetical protein CARUB_v10017684mg [Caps...   111   8e-22
ref|XP_010426861.1| PREDICTED: uncharacterized protein LOC104711...   107   1e-20
ref|XP_006403802.1| hypothetical protein EUTSA_v10010982mg [Eutr...   107   2e-20
ref|XP_002309246.2| hydroxyproline-rich glycoprotein [Populus tr...   107   2e-20
ref|NP_190814.1| hydroxyproline-rich glycoprotein family protein...   107   2e-20
emb|CDX73634.1| BnaC08g23690D [Brassica napus]                        106   2e-20
ref|XP_010503988.1| PREDICTED: uncharacterized protein LOC104781...   106   3e-20
ref|XP_002532542.1| conserved hypothetical protein [Ricinus comm...   105   7e-20
gb|EYU19400.1| hypothetical protein MIMGU_mgv1a022723mg, partial...   104   9e-20
ref|XP_010109196.1| hypothetical protein L484_002234 [Morus nota...   103   2e-19
emb|CDX78099.1| BnaA09g32940D [Brassica napus]                        103   2e-19

>ref|XP_012828647.1| PREDICTED: uncharacterized protein LOC105949888 [Erythranthe
           guttatus]
          Length = 326

 Score =  180 bits (457), Expect = 1e-42
 Identities = 102/294 (34%), Positives = 160/294 (54%), Gaps = 33/294 (11%)
 Frame = -2

Query: 897 VASPSDDRSKIVMGYPPIN------XXXXXXXXXXXPDLGYHNHKGYVDSVSP------- 757
           V  P D + KIVMGYP ++                      H+H+ Y  + S        
Sbjct: 3   VPPPDDPKHKIVMGYPSMDRYHLQATPAPYPPPTVYGSSSIHHHQLYPSTSSQPPPPPFQ 62

Query: 756 --------------NPYSQSFDGYYYQQPYEPLVP----EPSRTTSVGRAMISLLIVLTI 631
                         N Y+Q ++ YYYQQ Y+PLVP      S ++S GR M+ L+IVL  
Sbjct: 63  HDGQYPNAAPPLGGNQYNQPYNEYYYQQHYKPLVPLNNNNDSDSSSFGRVMLILMIVLVA 122

Query: 630 GMCMLSIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPH 451
            MCM+S++++ LF T +P+F + SL V NF AT+++L   W   ++VTN N+   + F  
Sbjct: 123 SMCMMSLVMWFLFGTYIPEFEVASLKVSNFTATDTTLTGTWEVELSVTNTNKELAVNFDR 182

Query: 450 VMSSVFYEEDMLAISPLQPFEIETKQNLDLSFRV-TTGRSNQEQLHAGVFPNIVQDRSTG 274
           VMSS+FY+E +L IS LQPF+++      ++F +    + N  +L + V P + QD+S G
Sbjct: 183 VMSSIFYKEALLGISTLQPFQVDQMNRSTVNFSMPAEQKPNDMKLQSWVLPTLSQDQSNG 242

Query: 273 FVIFSLRLSLKAQY-MSNSFSRKASIKVYCKNLQVSFSPTGEGKLTEDFPNECL 115
            V+FSLRL++K  +  +N   R+ +++V C+N+QV F+  G+G ++    N CL
Sbjct: 243 VVVFSLRLAMKTNFTTANLVYRQENLRVLCENVQVVFTVGGKGTMSPGLGNICL 296


>gb|EYU18126.1| hypothetical protein MIMGU_mgv1a026169mg, partial [Erythranthe
           guttata]
          Length = 304

 Score =  180 bits (457), Expect = 1e-42
 Identities = 102/294 (34%), Positives = 160/294 (54%), Gaps = 33/294 (11%)
 Frame = -2

Query: 897 VASPSDDRSKIVMGYPPIN------XXXXXXXXXXXPDLGYHNHKGYVDSVSP------- 757
           V  P D + KIVMGYP ++                      H+H+ Y  + S        
Sbjct: 3   VPPPDDPKHKIVMGYPSMDRYHLQATPAPYPPPTVYGSSSIHHHQLYPSTSSQPPPPPFQ 62

Query: 756 --------------NPYSQSFDGYYYQQPYEPLVP----EPSRTTSVGRAMISLLIVLTI 631
                         N Y+Q ++ YYYQQ Y+PLVP      S ++S GR M+ L+IVL  
Sbjct: 63  HDGQYPNAAPPLGGNQYNQPYNEYYYQQHYKPLVPLNNNNDSDSSSFGRVMLILMIVLVA 122

Query: 630 GMCMLSIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPH 451
            MCM+S++++ LF T +P+F + SL V NF AT+++L   W   ++VTN N+   + F  
Sbjct: 123 SMCMMSLVMWFLFGTYIPEFEVASLKVSNFTATDTTLTGTWEVELSVTNTNKELAVNFDR 182

Query: 450 VMSSVFYEEDMLAISPLQPFEIETKQNLDLSFRV-TTGRSNQEQLHAGVFPNIVQDRSTG 274
           VMSS+FY+E +L IS LQPF+++      ++F +    + N  +L + V P + QD+S G
Sbjct: 183 VMSSIFYKEALLGISTLQPFQVDQMNRSTVNFSMPAEQKPNDMKLQSWVLPTLSQDQSNG 242

Query: 273 FVIFSLRLSLKAQY-MSNSFSRKASIKVYCKNLQVSFSPTGEGKLTEDFPNECL 115
            V+FSLRL++K  +  +N   R+ +++V C+N+QV F+  G+G ++    N CL
Sbjct: 243 VVVFSLRLAMKTNFTTANLVYRQENLRVLCENVQVVFTVGGKGTMSPGLGNICL 296


>ref|XP_009776837.1| PREDICTED: uncharacterized protein LOC104226526 [Nicotiana
           sylvestris]
          Length = 279

 Score =  138 bits (348), Expect = 6e-30
 Identities = 74/227 (32%), Positives = 133/227 (58%), Gaps = 4/227 (1%)
 Frame = -2

Query: 783 KGYVDSVSPNPYSQSFDGYY--YQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLSI 610
           +GY  + +PN Y+     YY  Y Q Y PL  E +   S GR M +++++L IGM M S+
Sbjct: 56  QGYPSNYNPNSYAT----YYVNYSQKYTPLQEENNSRASFGRYMTTMMLILIIGMIMFSL 111

Query: 609 IIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFY 430
           +I+LLF T+ P+F +VS+ VP+   TN+S+   W  N+++ N N+   I+  +  +++ Y
Sbjct: 112 VIWLLFGTEKPEFHLVSMQVPSLMVTNTSILGNWQVNVSMKNSNKDLGIKLNNGNTAILY 171

Query: 429 EEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRL 250
           + ++LA +P+ PF++ ++ +  L   +TT  S    L  G+      +R+ G + F+L++
Sbjct: 172 KTNVLAETPVDPFKLASQSSAILFSNLTT--SAGTFLDKGILAEATGERNNGVLKFTLQI 229

Query: 249 SLKAQYMSNSFSRKASIKVYCKNLQVSF--SPTGEGKLTEDFPNECL 115
            L  QY S + S+   +++YC +++V F   P  +G+LT+  P +CL
Sbjct: 230 YLGIQYTSKTESKNERLRIYCNDVKVHFGHGPQDKGELTKADPIDCL 276


>ref|XP_009788019.1| PREDICTED: uncharacterized protein LOC104235889 [Nicotiana
           sylvestris]
          Length = 282

 Score =  136 bits (342), Expect = 3e-29
 Identities = 83/277 (29%), Positives = 139/277 (50%), Gaps = 18/277 (6%)
 Frame = -2

Query: 891 SPSDDRSKIVMGYPPINXXXXXXXXXXXPDLGYHN-----HKGYVDSV----SPNPYSQS 739
           S  +++ K VMGYP I+                +       +GY+  +    +PNP    
Sbjct: 4   SQDNEQQKHVMGYPSISKYNQSIKQGFPSQYDSNQLYNSFSQGYILPIQGYPNPNPNPNL 63

Query: 738 F---------DGYYYQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLSIIIFLLFAT 586
           +         + Y     Y P+  + + ++S GR M+ L+++L +GM MLS++ +L F T
Sbjct: 64  YVSSSSNYPSNKYVTMAAYNPMEEQNNGSSSFGRLMVILMLILVVGMIMLSLVFWLFFGT 123

Query: 585 DLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFYEEDMLAIS 406
           + P F I++L+VP+F   NSS+   W  N+T+ N N+  KI+  H  +S+FY+ ++LA++
Sbjct: 124 EGPVFHILALSVPSFRIINSSIVGNWQVNLTMCNMNDHSKIKVFHGKTSIFYKTNLLAVT 183

Query: 405 PLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLKAQYMS 226
           P  P  +E K   ++   +TT       L       I +D++ G + FSL +SLK    S
Sbjct: 184 PFDPVRLEVKGTKNMLSNLTT-LPEGNVLDQLAISEISKDKNDGDIEFSLEISLKVVLAS 242

Query: 225 NSFSRKASIKVYCKNLQVSFSPTGEGKLTEDFPNECL 115
           +S  +   + VYC NL+V F P   G+L      +CL
Sbjct: 243 DSEWQSHKMMVYCNNLKVKFGPEDRGELVAADKIKCL 279


>ref|XP_012835824.1| PREDICTED: uncharacterized protein LOC105956520 [Erythranthe
           guttatus]
          Length = 331

 Score =  135 bits (341), Expect = 4e-29
 Identities = 82/249 (32%), Positives = 132/249 (53%), Gaps = 36/249 (14%)
 Frame = -2

Query: 753 PYSQSFDGYYY------------QQPY--EPLVPEPSRTTSVGRAMISLLIVLTIGMCML 616
           PY+ S  GYYY            QQPY    ++   + ++S GR M+ L++VL   MCM+
Sbjct: 75  PYNNSNAGYYYNNNNNNSSNNYNQQPYIQGEIIRNETFSSSFGRMMLILMVVLVAAMCMM 134

Query: 615 SIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSV 436
           S+  +LL+ T +P+F + SL V NF+ATN++L   W A++ V N NE   + F  V S V
Sbjct: 135 SLATWLLYGTYVPEFEVASLKVSNFSATNTTLRGTWIADVIVYNPNEELAVNFERVRSLV 194

Query: 435 FYEEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQ-------------------LHA 313
           FY+  ++  S L  F++E+    +++F V    +++E+                   LH 
Sbjct: 195 FYKGVIVGASTLDSFQVESNLRFNMNFSVAAAEADEERVVSVGPGSSSINNNNNNNNLHG 254

Query: 312 GVFPNIVQDRSTGFVIFSLRLSLKAQYMS-NSFSRKASIKVYCKNLQVSF--SPTGEGKL 142
            V   + QD S G V+FSLR++L A+  S N   R+ +++V C +L+V F  +   EG+L
Sbjct: 255 LVLSALAQDWSNGAVVFSLRIALDAKLASPNQVYRQDNLRVSCDDLEVKFMAADMDEGRL 314

Query: 141 TEDFPNECL 115
           +     +CL
Sbjct: 315 SRGLGAQCL 323


>ref|XP_009627615.1| PREDICTED: uncharacterized protein LOC104118138 [Nicotiana
           tomentosiformis] gi|697146927|ref|XP_009627616.1|
           PREDICTED: uncharacterized protein LOC104118138
           [Nicotiana tomentosiformis]
           gi|697146929|ref|XP_009627617.1| PREDICTED:
           uncharacterized protein LOC104118138 [Nicotiana
           tomentosiformis]
          Length = 286

 Score =  131 bits (329), Expect = 9e-28
 Identities = 71/228 (31%), Positives = 127/228 (55%), Gaps = 5/228 (2%)
 Frame = -2

Query: 783 KGYVDSVSPNPYSQSFDGYYY---QQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLS 613
           +GY  + +PN Y    D  +Y    Q Y PL  E +   S GR M +++++L IGM M S
Sbjct: 58  QGYPSNYNPNSYPLGADATFYVNYSQKYMPLQEENNSRASFGRYMTTMMLILIIGMIMFS 117

Query: 612 IIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVF 433
           ++I+LLF T+ P+F +VS+ V +   T++S+   W  N+++ N N    I+     +++ 
Sbjct: 118 LVIWLLFGTEKPEFHLVSIQVSSLMITSTSILGNWQVNVSMKNSNNDLDIKLNTGKTAIL 177

Query: 432 YEEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLR 253
           Y+ ++LA +P+ PF + ++ +  L   +TT  S    L  G+      +R  G + F+L+
Sbjct: 178 YKTNVLAETPVDPFNLASQSSTILFSNLTT--SPGTFLDKGILLEATGERDNGVLKFTLQ 235

Query: 252 LSLKAQYMSNSFSRKASIKVYCKNLQVSF--SPTGEGKLTEDFPNECL 115
           + L  QY S + S+   +++YC +++V F   P  +G+LT+  P +CL
Sbjct: 236 IYLGFQYTSKTESKNERLRIYCNDVKVQFGHGPQDKGELTKADPIDCL 283


>ref|XP_012074425.1| PREDICTED: protein YLS9 [Jatropha curcas]
          Length = 293

 Score =  118 bits (296), Expect = 6e-24
 Identities = 85/259 (32%), Positives = 137/259 (52%), Gaps = 7/259 (2%)
 Frame = -2

Query: 861 MGYPPINXXXXXXXXXXXPDLGYHNHK-GYVDSVSPNPYSQSFDGYYYQQPYEPLVPEPS 685
           MGYPP                GY N+  GY ++    PY+Q+    YY  P+   V + S
Sbjct: 40  MGYPP-GPAPPGYPSPSPGQQGYPNYSNGYNNNY---PYTQAPPASYYN-PHLYNVQQES 94

Query: 684 RTTSVGRAMISLLIVLTIGMCMLSIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWY 505
             +S  R +I  L++L I MC+ SI+++L+    +P   + +L+V NFN ++S   A W 
Sbjct: 95  PVSSFIRGVIGGLVLLIIFMCIASILMWLILRPAIPVMHVDTLSVSNFNGSSSFFTADWD 154

Query: 504 ANITVTNQNEGFKIQFPHVMSSVFYEE-DMLAISPLQPFEIETKQNLDLSFRVTTGRSNQ 328
           A I V N N   KI F  +   ++++E D+LA S   PF +ET QN  +  ++    S+ 
Sbjct: 155 AKIAVENPNTKLKIYFDQIQVFLYFDESDLLASSFAHPFLLETHQNNVIKTKLAANNSDH 214

Query: 327 EQLHAG--VFPNIVQDRS-TGFVIFSLRLSLKAQYMSNS-FSRKASIKVYCKNLQVSF-S 163
            Q   G  V   +  D+S TG + F LR+++ + + S + +++ A+I+VYC++L+V F S
Sbjct: 215 TQAGVGSWVVEKMANDKSTTGKLHFGLRMAIWSTFKSGTWWAKHATIRVYCEDLEVVFGS 274

Query: 162 PTGEGKLTEDFPNECLTFS 106
               GKL     N+CL F+
Sbjct: 275 SKSNGKLNTSNANDCLIFA 293


>gb|EYU38691.1| hypothetical protein MIMGU_mgv1a020389mg, partial [Erythranthe
           guttata]
          Length = 287

 Score =  118 bits (296), Expect = 6e-24
 Identities = 75/216 (34%), Positives = 111/216 (51%), Gaps = 20/216 (9%)
 Frame = -2

Query: 753 PYSQSFDGYYY------------QQPY--EPLVPEPSRTTSVGRAMISLLIVLTIGMCML 616
           PY+ S  GYYY            QQPY    ++   + ++S GR M+ L++VL   MCM+
Sbjct: 75  PYNNSNAGYYYNNNNNNSSNNYNQQPYIQGEIIRNETFSSSFGRMMLILMVVLVAAMCMM 134

Query: 615 SIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSV 436
           S+  +LL+ T +P+F + SL V NF+ATN++L   W A++ V N NE   + F  V S V
Sbjct: 135 SLATWLLYGTYVPEFEVASLKVSNFSATNTTLRGTWIADVIVYNPNEELAVNFERVRSLV 194

Query: 435 FY-----EEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGF 271
           FY     E D   +  + P       N           +N   LH  V   + QD S G 
Sbjct: 195 FYKGVIVEADEERVVSVGPGSSSINNN-----------NNNNNLHGLVLSALAQDWSNGA 243

Query: 270 VIFSLRLSLKAQYMS-NSFSRKASIKVYCKNLQVSF 166
           V+FSLR++L A+  S N   R+ +++V C +L+V F
Sbjct: 244 VVFSLRIALDAKLASPNQVYRQDNLRVSCDDLEVKF 279


>emb|CDP10609.1| unnamed protein product [Coffea canephora]
          Length = 260

 Score =  116 bits (290), Expect = 3e-23
 Identities = 76/267 (28%), Positives = 126/267 (47%), Gaps = 8/267 (2%)
 Frame = -2

Query: 885 SDDRSKIVMGYPPINXXXXXXXXXXXPDLGYHNHK-----GYVDSVSPNPYSQSFDGYYY 721
           +D+ ++ VMGY P+               GY+ H       Y +   P P +     YY 
Sbjct: 3   NDETTRPVMGYAPVGYPQPFPPPQQ----GYYGHPYTAYPNYYNGAGPPPGTV----YYS 54

Query: 720 QQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLSIIIFLLFATDLPDFRIVSLTVPNF 541
                 + P+PS+     R  + ++IVL +     S+  +L   + +PDF++ S  VP F
Sbjct: 55  SAQLPNISPQPSKGYEFARLALIIMIVLMVCTITFSLFTWLFLGSGVPDFKVESFNVPYF 114

Query: 540 NATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFYEEDMLAISPLQPFEIE--TKQNL 367
           +  NS+L A W  NITV N N+  +I FPH+   + Y   ++  + + P  +E  T+ +L
Sbjct: 115 DIANSTLKARWETNITVKNTNQKSRISFPHIQGYLVYRNRLVDAAMIDPLHLEGKTEASL 174

Query: 366 DLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLKAQYMSNSF-SRKASIKVY 190
             +F +   R N  +  A V   + + R  G + F LRL ++A Y+S S+ SR+  ++V 
Sbjct: 175 RANFSLPDPRGNLPE--ASVVDAMGEGRKIGILDFYLRLDMRATYVSGSYWSRETMLRVI 232

Query: 189 CKNLQVSFSPTGEGKLTEDFPNECLTF 109
           C +L V+F     G        EC T+
Sbjct: 233 CGDLWVNFPAPIGGGTWNGTSGECSTY 259


>ref|XP_006290371.1| hypothetical protein CARUB_v10017684mg [Capsella rubella]
           gi|482559078|gb|EOA23269.1| hypothetical protein
           CARUB_v10017684mg [Capsella rubella]
          Length = 310

 Score =  111 bits (278), Expect = 8e-22
 Identities = 68/225 (30%), Positives = 121/225 (53%), Gaps = 10/225 (4%)
 Frame = -2

Query: 753 PYSQSFDGYYYQQPYEPLV-PEPSRTTSVG--RAMISLLIVLTIGMCMLSIIIFLLFATD 583
           PY+Q+    YY   Y P   P   R  S G  R +++ L+VL + +C+ + I +L+    
Sbjct: 84  PYAQAPPASYYGSSYPPQQNPVYQRPDSSGFFRGILTGLVVLVVLLCISTTITWLILRPQ 143

Query: 582 LPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFY-----EEDM 418
           +P F + + +V NFN T    +A W AN+T+ NQN   K  F  +   +++     E+D 
Sbjct: 144 IPGFSLNNFSVSNFNVTGPVFSAQWTANLTIENQNTKLKGYFDRIQGLLYHQNAIGEDDF 203

Query: 417 LAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLKA 238
           LA S  QP  +ETK+++ +   +T G   Q ++ + V   + ++R TG + F+LR+++  
Sbjct: 204 LASSFFQPVYVETKKSVVIGETLTAGEKEQPKVPSWVVEEMKKERDTGTLTFNLRMAVWV 263

Query: 237 QYMSNSFS-RKASIKVYCKNLQVSF-SPTGEGKLTEDFPNECLTF 109
            + ++ +S R+  +KV+C  L+V+F   +G G +    P  CL +
Sbjct: 264 TFKTDGWSARERGLKVFCGKLKVAFEGVSGNGAVLLPKPLPCLVY 308


>ref|XP_010426861.1| PREDICTED: uncharacterized protein LOC104711796 [Camelina sativa]
          Length = 306

 Score =  107 bits (268), Expect = 1e-20
 Identities = 67/225 (29%), Positives = 116/225 (51%), Gaps = 10/225 (4%)
 Frame = -2

Query: 753 PYSQSFDGYYYQQPYEPLV-PEPSRTTSVG--RAMISLLIVLTIGMCMLSIIIFLLFATD 583
           PY+Q+    YY   Y P   P   R  S G  R +++ L+VL +  C+ + I +L+    
Sbjct: 80  PYAQAPPASYYGSSYPPQQNPVYQRPASSGFFRGIVTGLVVLVVLFCISTTITWLILRPQ 139

Query: 582 LPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFY-----EEDM 418
           +P F + + +V NFN T    +  W AN+T+ NQN   K  F  +   V++     E+D 
Sbjct: 140 IPVFSLNNFSVSNFNVTGPVFSPQWTANLTIENQNTKLKGYFDRIQGLVYHQNAIGEDDF 199

Query: 417 LAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLKA 238
           LA +  QP  +ETK++  +   +T G   Q ++ + V   + ++R TG V F+LR+++  
Sbjct: 200 LATAFFQPVFVETKKSAAIGETLTAGEKEQPKIPSWVVDEMKKERETGTVTFNLRMAVWV 259

Query: 237 QYMSNSF-SRKASIKVYCKNLQVSFSPTGE-GKLTEDFPNECLTF 109
            + ++ + +R+  +KV+C  L+V F    E G +    P  CL +
Sbjct: 260 TFKTDGWVARERGLKVFCGKLKVGFEGASENGAVLLPKPLPCLVY 304


>ref|XP_006403802.1| hypothetical protein EUTSA_v10010982mg [Eutrema salsugineum]
           gi|557104921|gb|ESQ45255.1| hypothetical protein
           EUTSA_v10010982mg [Eutrema salsugineum]
          Length = 294

 Score =  107 bits (266), Expect = 2e-20
 Identities = 66/235 (28%), Positives = 121/235 (51%), Gaps = 7/235 (2%)
 Frame = -2

Query: 792 HNHKGYVDSVSPNPYSQSFDGYYYQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLS 613
           +NH+ Y  + +P     S+ G  Y     P+   P+  +   R +++ L+VL + +C+ +
Sbjct: 63  YNHQQYAYAQAP---PASYYGSSYPAQQNPVYQRPA-PSGFFRGILAGLVVLVVLLCIST 118

Query: 612 IIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVF 433
            I +L+    +P F + + +V NFN T    +A W AN+TV N N   K  F  +   ++
Sbjct: 119 TITWLVLRPQIPVFSVTNFSVSNFNVTGPVFSAQWTANLTVENPNSKLKGYFDRIQGFIY 178

Query: 432 Y-----EEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFV 268
                 E+D LA +  QP  +ETK+++ +   +T G   Q ++ + V   + ++R TG V
Sbjct: 179 NQNAIGEDDFLAKAFFQPVYVETKKSVSIGETLTAGEKEQPKVPSWVGEEMKKERETGTV 238

Query: 267 IFSLRLSLKAQYMSNSFS-RKASIKVYCKNLQVSF-SPTGEGKLTEDFPNECLTF 109
            F LR+++   + +  +S R++ +KV+C  L+V+F   +G G      P  CL +
Sbjct: 239 SFDLRMAVWVTFKTEGWSARESGLKVFCGKLKVAFEGASGNGAALLPKPLPCLVY 293


>ref|XP_002309246.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|550336834|gb|EEE92769.2| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 287

 Score =  107 bits (266), Expect = 2e-20
 Identities = 79/253 (31%), Positives = 128/253 (50%), Gaps = 23/253 (9%)
 Frame = -2

Query: 798 GYHNHKGYVDSVS-PNPYSQSFDGY----YYQQPYEPLVPEPS-------------RTTS 673
           GY    GY  S+  P P    + GY    YY  PY    P  +             R++ 
Sbjct: 38  GYPPAMGYPPSMDYPPPPPGQYPGYPPPGYY--PYAQAPPAAAYYNATVHQQQGYERSSG 95

Query: 672 VGRAMISLLIVLTIGMCMLSIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANIT 493
             R  ++ +I LT+ +   SII++L+    LP F + + +V N NAT  +  A W AN+T
Sbjct: 96  FSRCFLTTIIFLTLLIFTSSIIMWLVLRPQLPVFHVDNFSVSNLNATLPTFTANWEANLT 155

Query: 492 VTNQNEGFKIQFPHVMSSVFYEEDMLAISPL--QPFEIETKQNLDLSFRVTTGRSNQEQL 319
           V N N   KI+F  + + VFYEED L  S +  +PF +ETK +  ++ +++   +N++ L
Sbjct: 156 VRNPNTRLKIEFSELQNFVFYEEDYLLASAITSRPFSLETKTSGVINAKLS--ENNKDNL 213

Query: 318 -HAGVFPNIVQDRSTGFVIFSLRLSLKAQYMSNS-FSRKASIKVYCKNLQVSF-SPTGEG 148
               V   + ++RS G V F+ R+ +   + S   + R  SIKV C+++QV+F   +G G
Sbjct: 214 VENWVVDKLAKERSNGSVSFNFRMLVWTTFRSGLWWKRNLSIKVMCEDIQVTFVGASGNG 273

Query: 147 KLTEDFPNECLTF 109
            +  +   +CL F
Sbjct: 274 NIAANGLRDCLVF 286


>ref|NP_190814.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|4886281|emb|CAB43432.1| putative protein
           [Arabidopsis thaliana] gi|332645427|gb|AEE78948.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 300

 Score =  107 bits (266), Expect = 2e-20
 Identities = 66/226 (29%), Positives = 121/226 (53%), Gaps = 11/226 (4%)
 Frame = -2

Query: 753 PYSQSFDGYYYQQPY----EPLVPEPSRTTSVGRAMISLLIVLTIGMCMLSIIIFLLFAT 586
           PY+Q+    YY   Y     P+   P+ +  V R + + LIVL + +C+ + I +L+   
Sbjct: 75  PYAQAPPASYYGSSYPAQQNPVYQRPASSGFV-RGIFTGLIVLVVLLCISTTITWLVLRP 133

Query: 585 DLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFY-----EED 421
            +P F + + +V NFN T    +A W AN+T+ NQN   K  F  +   V++     E++
Sbjct: 134 QIPLFSVNNFSVSNFNVTGPVFSAQWTANLTIENQNTKLKGYFDRIQGLVYHQNAVGEDE 193

Query: 420 MLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLK 241
            LA +  QP  +ETK+++ +   +T G   Q ++ + V   + ++R TG V FSLR+++ 
Sbjct: 194 FLATAFFQPVFVETKKSVVIGETLTAGDKEQPKVPSWVVDEMKKERETGTVTFSLRMAVW 253

Query: 240 AQYMSNSF-SRKASIKVYCKNLQVSFSP-TGEGKLTEDFPNECLTF 109
             + ++ + +R++ +KV+C  L+V F   +G G +    P  C+ +
Sbjct: 254 VTFKTDGWAARESGLKVFCGKLKVGFEGISGNGAVLLPKPLPCVVY 299


>emb|CDX73634.1| BnaC08g23690D [Brassica napus]
          Length = 302

 Score =  106 bits (265), Expect = 2e-20
 Identities = 67/235 (28%), Positives = 121/235 (51%), Gaps = 7/235 (2%)
 Frame = -2

Query: 792 HNHKGYVDSVSPNPYSQSFDGYYYQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLS 613
           +NH+ Y  + +P     S+ G  Y     P+   P+  +   R +++ LIVL + +C+ +
Sbjct: 71  YNHQQYAYAQAP---PASYYGSSYPAQQNPVYQRPA-PSGFFRGILTGLIVLVVLLCIST 126

Query: 612 IIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVF 433
            I +L+    +P F + S +V NFN T    +A W AN+TV N N      F  + + ++
Sbjct: 127 TITWLVLRPQIPVFSVTSFSVSNFNLTGPVFSAQWTANLTVENPNTKLNGYFDRIQAFIY 186

Query: 432 Y-----EEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFV 268
                 E+D LA++  QP  +ETK++  +   +T G   Q ++ + V   + ++R TG V
Sbjct: 187 NQNAIGEDDFLAMAFFQPVSVETKKSASIGETLTAGGKEQPKVPSWVGEEMKKERDTGMV 246

Query: 267 IFSLRLSLKAQYMSNSFS-RKASIKVYCKNLQVSF-SPTGEGKLTEDFPNECLTF 109
            F LR+ +   + ++ +S R+  +KV+C  L+V+F   +G G +    P  CL +
Sbjct: 247 SFDLRMLVWVTFKTDGWSARERGLKVFCGKLKVAFEGGSGNGAVLLPKPLPCLVY 301


>ref|XP_010503988.1| PREDICTED: uncharacterized protein LOC104781090 [Camelina sativa]
          Length = 304

 Score =  106 bits (264), Expect = 3e-20
 Identities = 66/225 (29%), Positives = 117/225 (52%), Gaps = 10/225 (4%)
 Frame = -2

Query: 753 PYSQSFDGYYYQQPYEPLV-PEPSRTTSVG--RAMISLLIVLTIGMCMLSIIIFLLFATD 583
           PY+++    YY   Y P   P   R  S G  R + + L+VL + +C+ + I +L+    
Sbjct: 78  PYARAPPASYYGSSYPPQQNPVYQRPASSGFFRGIFTGLVVLVLLLCISTTITWLILRPQ 137

Query: 582 LPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFY-----EEDM 418
           +P F + + +V NFN T    +  W AN+T+ NQN   K  F  +   V++     E+D 
Sbjct: 138 IPVFSLNNFSVSNFNVTGPVFSPQWTANLTIENQNTKLKGYFDRIQGLVYHQNAIGEDDF 197

Query: 417 LAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLSLKA 238
           LA +  QP  +ETK+++ +   +T G   Q ++ + V   + ++R TG V F+LR+++  
Sbjct: 198 LATAFFQPVFVETKKSVAIGETLTAGEKEQPKIPSWVVDEMKKERETGTVTFNLRMAVWV 257

Query: 237 QYMSNSF-SRKASIKVYCKNLQVSFSPTGE-GKLTEDFPNECLTF 109
            + ++ + +R+  +KV+C  L+V F    E G +    P  CL +
Sbjct: 258 TFKTDGWVARERGLKVFCGKLKVGFEGASENGAVLLPKPLPCLVY 302


>ref|XP_002532542.1| conserved hypothetical protein [Ricinus communis]
           gi|223527731|gb|EEF29836.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 305

 Score =  105 bits (261), Expect = 7e-20
 Identities = 77/240 (32%), Positives = 128/240 (53%), Gaps = 8/240 (3%)
 Frame = -2

Query: 801 LGYHNHK-GYVDSVSPNPYSQSFDGYYYQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGM 625
           +GY N+  GY ++    PY+Q+    YY        PE  R     R +I  L+ L I  
Sbjct: 70  VGYPNYNNGYNNNY---PYAQAPPTAYYNTQMYQTQPE-CRINGFLRGIIGGLVFLLILT 125

Query: 624 CMLSIIIFLLFATDLPDFRIVSLTVPNFNATNS-SLAAIWYANITVTNQNEGFKIQFPHV 448
           C +SI ++++    +P F + +L+V NFN ++S +  A W ANITV N N   K+ F  +
Sbjct: 126 CAISIFMWIILRPVIPVFHVNNLSVSNFNLSSSPTFHANWDANITVGNPNTKLKVYFDQI 185

Query: 447 MSSVFY-EEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAG--VFPNIVQDRS- 280
              ++Y E+D+LA S   PF +ET  N  +  ++    ++++Q   G  V   + +D+S 
Sbjct: 186 EVFIYYNEDDLLATSFSNPFFLETGGNSVVQAKLEANNADRKQAGVGSWVVDKMAKDKST 245

Query: 279 TGFVIFSLRLSLKAQYMSNS-FSRKASIKVYCKNLQVSF-SPTGEGKLTEDFPNECLTFS 106
           TG V F +R++L + + S S ++R  +I+VYC++L VSF   +G          +CL F+
Sbjct: 246 TGNVTFDIRMALWSTFKSGSWWARHVTIRVYCEDLVVSFMGNSGTANFANGKSKDCLVFA 305


>gb|EYU19400.1| hypothetical protein MIMGU_mgv1a022723mg, partial [Erythranthe
           guttata]
          Length = 203

 Score =  104 bits (260), Expect = 9e-20
 Identities = 64/187 (34%), Positives = 103/187 (55%), Gaps = 3/187 (1%)
 Frame = -2

Query: 666 RAMISLLIVLTIGMCMLSIIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVT 487
           R M+ L+IVL + MCM+S+  + L+ T +P+F + SL V NF ATN++L   W A++ V 
Sbjct: 1   RLMLILMIVLVVAMCMMSLATWFLYGTYVPEFEVTSLRVSNFTATNTTLRGTWNADVIVY 60

Query: 486 NQNEGFKIQFPHVMSSVFYEEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGV 307
           N NE   + F  + S VFY   ++ +S  Q     T++   +        +N + LH  V
Sbjct: 61  NPNEELAVDFQGIRSLVFY-RGVVVVSRRQR---RTRRGWSM---WGLANNNNDNLHGVV 113

Query: 306 FPNIVQDRSTGFVIFSLRLSLKAQYMS-NSFSRKASIKVYCKNLQVSF--SPTGEGKLTE 136
            P + +D S G V+FSL ++L A+  S +   R+ S++V C +L+V F  +   EG+L+ 
Sbjct: 114 LPALARDWSNGAVVFSLSIALDAKLESPHQVYRQDSLRVSCDDLEVRFMSADVDEGRLSR 173

Query: 135 DFPNECL 115
                CL
Sbjct: 174 GLGAPCL 180


>ref|XP_010109196.1| hypothetical protein L484_002234 [Morus notabilis]
           gi|587934295|gb|EXC21224.1| hypothetical protein
           L484_002234 [Morus notabilis]
          Length = 308

 Score =  103 bits (258), Expect = 2e-19
 Identities = 63/208 (30%), Positives = 110/208 (52%), Gaps = 11/208 (5%)
 Frame = -2

Query: 756 NPYSQSFDGYYYQQPYEPLVPEPSRTTSVG----------RAMISLLIVLTIGMCMLSII 607
           NPY+Q    +YY   Y       S  T+ G          R  ++++I L    C++SII
Sbjct: 80  NPYTQPPPPHYY---YANQNYTYSTATAGGGGAGSGSAFVRGFLAMIITLITITCVVSII 136

Query: 606 IFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVFYE 427
            +L+   ++P F +    V   N +    +A W A++TV N N   KI    V   V+Y+
Sbjct: 137 TWLVLRPEIPVFHVDKFAVSGLNISGPEFSAKWDASVTVENPNHKLKIYLEQVQGFVYYK 196

Query: 426 EDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFVIFSLRLS 247
           E+ L+ SP+ P  +ET+    ++ ++ T  + Q  +   V  +I ++RS G V F+LR+ 
Sbjct: 197 ENFLSSSPVDPMLLETRSRNSIAMKLATNNTEQHLVGRWVEEDIAKERSGGAVSFNLRML 256

Query: 246 LKAQYMSNS-FSRKASIKVYCKNLQVSF 166
           + A + S + ++R AS+KV+C++L+V+F
Sbjct: 257 VLATFKSGAWWTRHASMKVFCEDLKVNF 284


>emb|CDX78099.1| BnaA09g32940D [Brassica napus]
          Length = 302

 Score =  103 bits (258), Expect = 2e-19
 Identities = 65/235 (27%), Positives = 118/235 (50%), Gaps = 7/235 (2%)
 Frame = -2

Query: 792 HNHKGYVDSVSPNPYSQSFDGYYYQQPYEPLVPEPSRTTSVGRAMISLLIVLTIGMCMLS 613
           HN   Y  +   + Y  S     Y     P+   P+  +   R +++ LIVL + +C+ +
Sbjct: 73  HNQYAYAQAPPASYYGSS-----YPAQQNPVYQRPA-PSGFFRGILTGLIVLVVLLCIST 126

Query: 612 IIIFLLFATDLPDFRIVSLTVPNFNATNSSLAAIWYANITVTNQNEGFKIQFPHVMSSVF 433
            I +L+    +P F + S +V NFN T    +A W AN+TV N N      F  + + ++
Sbjct: 127 TITWLVLRPQIPVFSVTSFSVSNFNLTRPVFSAQWMANLTVENPNTKLNGYFDRIQAFIY 186

Query: 432 -----YEEDMLAISPLQPFEIETKQNLDLSFRVTTGRSNQEQLHAGVFPNIVQDRSTGFV 268
                 E+D LA++  QP  ++TK+++ +   +T G   Q ++ + V   + ++R TG V
Sbjct: 187 NQNAIEEDDFLAMAFFQPVSVQTKKSVSIGETLTAGGKEQPKVPSWVGEEMKKERDTGMV 246

Query: 267 IFSLRLSLKAQYMSNSFS-RKASIKVYCKNLQVSF-SPTGEGKLTEDFPNECLTF 109
            F LR+ +   + ++ +S R+  +KV+C  L+V+F   +G G +    P  CL +
Sbjct: 247 SFDLRMLVWVTFKTDGWSARERGLKVFCGKLKVAFEGGSGNGAVLLPKPLPCLVY 301


Top