BLASTX nr result

ID: Akebia23_contig00009814 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00009814
         (1101 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]   220   7e-55
ref|XP_006421765.1| hypothetical protein CICLE_v10007148mg, part...   212   3e-52
gb|EXC20892.1| hypothetical protein L484_012968 [Morus notabilis]     202   2e-49
ref|XP_007219612.1| hypothetical protein PRUPE_ppa023224mg, part...   201   6e-49
ref|XP_006490259.1| PREDICTED: uncharacterized protein LOC102625...   199   1e-48
ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus c...   197   7e-48
ref|XP_004308157.1| PREDICTED: uncharacterized protein LOC101314...   180   1e-42
ref|XP_002321853.1| hydroxyproline-rich glycoprotein [Populus tr...   177   7e-42
ref|XP_007038486.1| Hydroxyproline-rich glycoprotein family prot...   177   9e-42
ref|XP_006348055.1| PREDICTED: uncharacterized protein LOC102604...   161   5e-37
ref|XP_004234156.1| PREDICTED: uncharacterized protein LOC101245...   160   7e-37
ref|XP_007143328.1| hypothetical protein PHAVU_007G063100g [Phas...   158   3e-36
ref|XP_004496654.1| PREDICTED: uncharacterized protein LOC101496...   157   6e-36
gb|ACU21406.1| unknown [Glycine max]                                  141   6e-31
ref|XP_006280760.1| hypothetical protein CARUB_v10026727mg [Caps...   124   9e-26
ref|XP_002864122.1| hydroxyproline-rich glycoprotein family prot...   118   5e-24
dbj|BAB11237.1| unnamed protein product [Arabidopsis thaliana]        117   9e-24
ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein...   117   9e-24
emb|CBI24501.3| unnamed protein product [Vitis vinifera]              109   2e-21
ref|XP_006401924.1| hypothetical protein EUTSA_v10014011mg [Eutr...    98   5e-18

>emb|CAN66820.1| hypothetical protein VITISV_003496 [Vitis vinifera]
          Length = 341

 Score =  220 bits (561), Expect = 7e-55
 Identities = 136/312 (43%), Positives = 177/312 (56%), Gaps = 33/312 (10%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVPPV----------------------- 227
           K + Q  SVPFLWEEKPG PKKDWKPE   +N  PP                        
Sbjct: 11  KQIRQPPSVPFLWEEKPGIPKKDWKPEVTAVNPPPPPPPPPPPPPPPPPPPPPPPPPPPP 70

Query: 228 ----VKLVASIPFKWEEKPGKPLPSFLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXI 395
               +KL+ASIPF WEEKPGKPLP F     +  LL FPP KL+                
Sbjct: 71  PPPPIKLIASIPFTWEEKPGKPLPFFSGTPHDDSLLLFPPKKLV---CCSSLSDADSKDY 127

Query: 396 HEDNDDEEEEMLGP--ETNSFDIDEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSI 569
            +D DDE + +     E   F+ D+  + APSLLANRLMS  AISTA+PV +  +N    
Sbjct: 128 EDDGDDEHDGIFESDFEAFGFETDDSFSSAPSLLANRLMSTVAISTAVPVQKTSLN---- 183

Query: 570 VAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHE 749
             E+     E+ +SP SET+S+ S YATG++S +G+SF +CLFPLF P++GF  +VG  E
Sbjct: 184 --EDSNDQPESPSSPASETNSSTSXYATGTTSLVGSSFLDCLFPLFPPNSGFLAKVGCPE 241

Query: 750 KSSSLTLPDPRIEELGC--QSNSGLVVKRTPTLGELILMSRRRNH-ANATHIRKRNLPLV 920
            S     P P ++  G   ++NS ++V+R PTLGELI+ SRRR++   A  +RK NL +V
Sbjct: 242 GSP----PPPELQNKGLDRETNSSVIVRRAPTLGELIMKSRRRSYRRKAVQMRKHNLSVV 297

Query: 921 ILQL-FKLLVFI 953
           I QL F L+ FI
Sbjct: 298 ICQLSFLLMTFI 309


>ref|XP_006421765.1| hypothetical protein CICLE_v10007148mg, partial [Citrus clementina]
           gi|557523638|gb|ESR35005.1| hypothetical protein
           CICLE_v10007148mg, partial [Citrus clementina]
          Length = 273

 Score =  212 bits (539), Expect = 3e-52
 Identities = 126/280 (45%), Positives = 171/280 (61%), Gaps = 5/280 (1%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINN--VPPVVKLVASIPFKWEEKPGKPLPS 290
           K + Q  +VPFLWE+KPG PKKDWKPE   ++   V P VKL+ASIPF WEEKPG PLPS
Sbjct: 1   KHVRQPPAVPFLWEQKPGIPKKDWKPEDSSVSPIVVTPPVKLIASIPFDWEEKPGTPLPS 60

Query: 291 FLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDID--E 464
           F QP     +L  PP KL+               I  +ND+  ++      +SFD D  +
Sbjct: 61  FSQP----AVLPNPPEKLLASPPPPPMYSQGYYGIF-NNDEASDDDHDKRNDSFDFDTDD 115

Query: 465 GHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSS 644
             + APSLLAN L+ + AIS+A+PV +++   SS    +     E  +SP SE +S+ SS
Sbjct: 116 SFSSAPSLLANCLVPSVAISSAVPVQRSL---SSDTTTDEL---EIPSSPASEAESSTSS 169

Query: 645 YATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGLVV 824
           Y TG+SS +GASF ECLFPL  P T F  +  + E+ + +  P+ + ++  C+SNS +V+
Sbjct: 170 YETGTSSLVGASFLECLFPLLPPKTSFLEKARYTERDTVIDTPEVKSKDFDCESNSTVVI 229

Query: 825 KRTPTLGELILMSRRR-NHANATHIRKRNLPLVILQLFKL 941
           +R  TLGELI+MSRRR N  NA  +RK+NL +V  QL  L
Sbjct: 230 RRPTTLGELIMMSRRRSNQRNAVQMRKQNLSMVNPQLLPL 269


>gb|EXC20892.1| hypothetical protein L484_012968 [Morus notabilis]
          Length = 322

 Score =  202 bits (515), Expect = 2e-49
 Identities = 124/283 (43%), Positives = 168/283 (59%), Gaps = 11/283 (3%)
 Frame = +3

Query: 114 EK*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVP----PVVKLVASIPFKWEEKPGKP 281
           +K + Q  SVPFLWE KPG  KKDWKPE   +++VP    P VKL+AS+PFKWEEKPG P
Sbjct: 14  KKHVRQPPSVPFLWEVKPGIAKKDWKPEFPSVSSVPIVPLPPVKLIASVPFKWEEKPGTP 73

Query: 282 LPSFLQPNTNS--PLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPE----T 443
           LPSF QP+  S  PLL  PP+    +                + D ++EE  G +    T
Sbjct: 74  LPSFSQPSQESASPLLPLPPIDNYPYEGVNVYQDSSEDSSSNEGDGQDEEQRGFKLDLGT 133

Query: 444 NSFDIDEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSE 623
              + D+    APSLLAN L+S+ AISTA+P  QNV      + E+     E+ +SP SE
Sbjct: 134 FGSEADDSFCSAPSLLANCLVSSVAISTAVPA-QNVS-----LPEDKSGPLESPSSPASE 187

Query: 624 TDSNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQ 803
           T+ + SSY TG+SS +G+S  ECLFPLF P +GF  +VG  ++      P    +    +
Sbjct: 188 TEISTSSYETGTSSLVGSSLLECLFPLFPPKSGFLEKVGNLDEPLK-PPPQQWNQNFNYE 246

Query: 804 SNSGLVVKRTPTLGELILMSRRRNH-ANATHIRKRNLPLVILQ 929
           S   + V+R PTLGELI+MSRRR++  NAT +RK+NL +  ++
Sbjct: 247 STGNITVRRPPTLGELIMMSRRRSYRRNATQMRKQNLSMEFMK 289


>ref|XP_007219612.1| hypothetical protein PRUPE_ppa023224mg, partial [Prunus persica]
           gi|462416074|gb|EMJ20811.1| hypothetical protein
           PRUPE_ppa023224mg, partial [Prunus persica]
          Length = 284

 Score =  201 bits (510), Expect = 6e-49
 Identities = 119/275 (43%), Positives = 155/275 (56%), Gaps = 17/275 (6%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPETIPINN---VPPVVKLVASIPFKWEEKPGKPLPSFLQPNT 308
           +VPFLWEE+PG PKKDWKP  +  N+    P +VKLVAS+PFKWEEKPG PLPSF +P  
Sbjct: 15  AVPFLWEERPGIPKKDWKPPVVSSNSSFPAPHIVKLVASVPFKWEEKPGTPLPSFSEPTL 74

Query: 309 NSPLLTFPPLKLIGFXXXXXXXXXXXXXIHE-----------DNDDEEEEMLGPETNSFD 455
            S   +  PL+LI F                           D +D    M   E  +FD
Sbjct: 75  ESACPSSLPLQLITFPSPPISSHQYDYDGENEDYGDDISGNGDGEDGAPSMFNLELEAFD 134

Query: 456 I--DEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETD 629
              D+    AP+LLAN L+ + AISTA+P ++      S   E+   W E  +SP SE  
Sbjct: 135 FETDDSFISAPALLANCLVPSIAISTAVPADK------STPTEDKSAWPETPSSPASEAG 188

Query: 630 SNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSN 809
           S+ SSYATG SS +GASF ECLFPL   ++GF  ++G    +SSLT P+P+      +SN
Sbjct: 189 SSTSSYATGVSSLVGASFLECLFPLIPANSGFLEKIG-QSGNSSLTPPEPKSAHFDRESN 247

Query: 810 SGLVVKRTPTLGELILMSRRRNH-ANATHIRKRNL 911
              +V R  TLGELI+MSR+ ++   A  +RK NL
Sbjct: 248 GSAIVWRPKTLGELIMMSRKGSYRRKAVQMRKHNL 282


>ref|XP_006490259.1| PREDICTED: uncharacterized protein LOC102625222 [Citrus sinensis]
          Length = 296

 Score =  199 bits (507), Expect = 1e-48
 Identities = 115/258 (44%), Positives = 159/258 (61%), Gaps = 4/258 (1%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINN--VPPVVKLVASIPFKWEEKPGKPLPS 290
           K + Q  +VPFLWE+KPG PKKDWKP+   ++   V P VKL+ASIPF WEEKPG PLPS
Sbjct: 12  KNVRQPPAVPFLWEQKPGIPKKDWKPKDSSVSPIVVTPPVKLIASIPFDWEEKPGTPLPS 71

Query: 291 FLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDID--E 464
           F QP     +L  PP KL+               I  +ND+  ++    + +SFD D  +
Sbjct: 72  FSQP----AVLPNPPEKLLALPPPPPMYSQGYYGIF-NNDEASDDDHDKQNDSFDFDTDD 126

Query: 465 GHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSS 644
             + APSLLAN L+ + AIS+A+PV +++   SS    +     E  +SP SE +S+ SS
Sbjct: 127 SFSSAPSLLANCLVPSVAISSAVPVQRSL---SSDTTTDEL---EIPSSPASEAESSTSS 180

Query: 645 YATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGLVV 824
           Y TG+SS +GASF ECLFPL  P T F  +  + E  S +  P+ + ++  C+SNS +V+
Sbjct: 181 YETGTSSLVGASFLECLFPLLPPKTSFLEKARYTESDSVIVTPEVKRKDFDCESNSTVVI 240

Query: 825 KRTPTLGELILMSRRRNH 878
           +R  TLGELI+MSRRR++
Sbjct: 241 RRPTTLGELIMMSRRRSY 258


>ref|XP_002510900.1| hypothetical protein RCOM_1498790 [Ricinus communis]
           gi|223550015|gb|EEF51502.1| hypothetical protein
           RCOM_1498790 [Ricinus communis]
          Length = 278

 Score =  197 bits (501), Expect = 7e-48
 Identities = 127/297 (42%), Positives = 158/297 (53%), Gaps = 12/297 (4%)
 Frame = +3

Query: 87  EKEFVVPCYEK*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNV--PPVVKLVASIPFKW 260
           E E +     K + Q   VPFLWEE+PG  KKDWKP    +  +  PP VKL+AS+PF W
Sbjct: 3   ENEIIEASKRKHIRQPPFVPFLWEERPGIAKKDWKPVVSSVTTLALPPPVKLIASVPFNW 62

Query: 261 EEKPGKPLPSFLQPNTNSPLLTFP--PLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEM-- 428
           EEKPGKPLP F QP   SP  T    P   + +                DN  E+EE   
Sbjct: 63  EEKPGKPLPCFSQPPMESPPATLNSLPSPPMYYQRCDDCEFNNENRAGHDNYGEKEEGIF 122

Query: 429 -LGPETNSFDIDEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQ 605
            L  E+ SF+ D+  + APSLLAN L+S+ A+S A+PV+                  E  
Sbjct: 123 DLDIESFSFETDDSLSSAPSLLANCLVSSVAVSDAVPVDHL----------------ETP 166

Query: 606 ASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRI 785
           +SP S+TDS+ SSYATG SS  GAS  ECLFPL++PD+GF   V    K S +       
Sbjct: 167 SSPASDTDSSTSSYATGISSLTGASLLECLFPLYAPDSGFLETVAHSTKGSLIA-----T 221

Query: 786 EELGCQSNSG----LVVKRTPTLGELILMSRRRN-HANATHIRKRNLPLVILQLFKL 941
           E   C SN      +  KRTPTLGELI+MSRRR+    A  +  RNLP+V  Q   L
Sbjct: 222 EVQNCNSNRASDNIVTTKRTPTLGELIMMSRRRSCQRKAIQMGNRNLPMVNSQFMLL 278


>ref|XP_004308157.1| PREDICTED: uncharacterized protein LOC101314801 [Fragaria vesca
           subsp. vesca]
          Length = 308

 Score =  180 bits (456), Expect = 1e-42
 Identities = 115/272 (42%), Positives = 158/272 (58%), Gaps = 14/272 (5%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPETIPINNVPPV--VKLVASIPFKWEEKPGKPLPSFLQPNTN 311
           SVPFLWEE+PG PKKDWKP T+  NNV P+  VKL+AS+PF WEEKPG PLP F++ ++ 
Sbjct: 14  SVPFLWEERPGIPKKDWKP-TVSSNNVAPIPPVKLIASVPFIWEEKPGTPLPYFMESSSE 72

Query: 312 SPLLTFPPLKLIGFXXXXXXXXXXXXXIHE------DNDDEEEEM-----LGPETNSFDI 458
           S   T  P+ LI +               E       NDD E+E+     L  +   F+ 
Sbjct: 73  SA--TTEPMMLITYPSPPICSQHNDHGGEEYSDASNGNDDGEDEIQSVFKLDMQAFDFET 130

Query: 459 DEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNP 638
           D+  + APSLLAN L+S+ AISTA+P  ++         E+     +  +SP+SE  S+ 
Sbjct: 131 DDSFSSAPSLLANCLVSSLAISTAVPAPED---------ESDQTETDTPSSPLSEAGSST 181

Query: 639 SSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGL 818
           SSYATG+SS +G +F ECLFPL     GF  +VG H  + +LT    + +    ++N G 
Sbjct: 182 SSYATGTSSLVGGAFLECLFPLLPAKAGFLEKVG-HSDNRTLTPQASKTKYFDRETN-GS 239

Query: 819 VVKRTPTLGELILMSRRRNH-ANATHIRKRNL 911
           V+ R  TLGELILMSR+ ++   A  + K+NL
Sbjct: 240 VILRPRTLGELILMSRKCSYRRKAVQMGKQNL 271


>ref|XP_002321853.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
           gi|222868849|gb|EEF05980.1| hydroxyproline-rich
           glycoprotein [Populus trichocarpa]
          Length = 333

 Score =  177 bits (449), Expect = 7e-42
 Identities = 113/275 (41%), Positives = 154/275 (56%), Gaps = 10/275 (3%)
 Frame = +3

Query: 114 EK*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVP-PVVKLVASIPFKWEEKPGKPLPS 290
           +K + Q  SVPFLWE +PG  K+DWKPE   +  V  P VKL+AS+PF WEEKPGKPL  
Sbjct: 31  KKHIRQPPSVPFLWEVRPGVAKRDWKPEVSSVTPVQLPPVKLIASVPFNWEEKPGKPLSC 90

Query: 291 FLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGP--------ETN 446
           F Q +  S  +T P   L+                 ED D  EE             E+ 
Sbjct: 91  FSQ-SPESAFIT-PQANLLALPWHVTCSQGDDNHKQEDGDSGEENFGDEQVMFNSDLESF 148

Query: 447 SFDIDEGHALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSET 626
           SF+ DE  + A SLLAN ++S+ AISTA+PV       ++   ++     E  +SP SET
Sbjct: 149 SFETDESFSSAQSLLANCMVSSVAISTAVPVQ------TTSPTDDSNGQQETPSSPPSET 202

Query: 627 DSNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQS 806
           DS+ SSYATG SS  GA+F E LFPL++P +GF  +   H +  S T P+    +   + 
Sbjct: 203 DSSTSSYATGVSSLEGAAFLEWLFPLYTPKSGFLGKAS-HPRKESFT-PELNSRDFDYER 260

Query: 807 NSGLVVKRTPTLGELILMSRRRN-HANATHIRKRN 908
           NS +++++  TLGELI+MSRRR+    A  +RK+N
Sbjct: 261 NSSVMIRKPLTLGELIMMSRRRSCQRKAVQMRKQN 295


>ref|XP_007038486.1| Hydroxyproline-rich glycoprotein family protein, putative
           [Theobroma cacao] gi|508775731|gb|EOY22987.1|
           Hydroxyproline-rich glycoprotein family protein,
           putative [Theobroma cacao]
          Length = 313

 Score =  177 bits (448), Expect = 9e-42
 Identities = 108/257 (42%), Positives = 145/257 (56%), Gaps = 10/257 (3%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPETIPIN-NVPP--VVKLVASIPFKWEEKPGKPLPSFLQPNT 308
           SVPFLWE +PG  KKDWKP    +   +PP   +KL+AS+PF WEEKPG PLP F QP  
Sbjct: 20  SVPFLWEVRPGIAKKDWKPGVSSVTPTLPPRTPIKLIASVPFNWEEKPGTPLPRFSQPPV 79

Query: 309 -------NSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDIDEG 467
                  ++ L+T PP  +                   D  D   EM   ET  F+ D+ 
Sbjct: 80  EPAAVPLSANLMTLPPRPVYTPAYFNGYDNNDDRGDGSDEQDVVPEM-DLETFGFETDDS 138

Query: 468 HALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSY 647
            + APSLLAN L+++ AI TA+PV +      +  A+N     E  +SP SET+S+ SSY
Sbjct: 139 FSSAPSLLANCLVASTAICTAVPVQK------TYHADNSSDHPETPSSPASETESSTSSY 192

Query: 648 ATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGLVVK 827
           ATG+SS +GASF ECLFPL  P++GF  +  +     S T  D        +SN+ +V++
Sbjct: 193 ATGTSSLVGASFLECLFPLLPPNSGFLEKARYPNHQGSQTQND-----FDRESNNTVVIR 247

Query: 828 RTPTLGELILMSRRRNH 878
           R  TLGELI+MSRR ++
Sbjct: 248 RPATLGELIMMSRRMSY 264


>ref|XP_006348055.1| PREDICTED: uncharacterized protein LOC102604397 [Solanum tuberosum]
          Length = 329

 Score =  161 bits (407), Expect = 5e-37
 Identities = 101/264 (38%), Positives = 138/264 (52%), Gaps = 10/264 (3%)
 Frame = +3

Query: 129 QRGSVPFLWEEKPGTPKKDWKPETIPINNVP------PVVKLVASIPFKWEEKPGKPLPS 290
           Q+ S+PF+WEE+PG P KDWKP+ + +          P VKL+AS+PF+WEEKPG PLP 
Sbjct: 26  QQISIPFIWEERPGIPIKDWKPKPVAMATTSGAFTFTPPVKLIASVPFEWEEKPGTPLPF 85

Query: 291 FLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFD---ID 461
           F Q + +  ++  P +                  I +    EE+EM   E  + D   I 
Sbjct: 86  FSQTSPHGNIVGLPSIVRDVHEGRDDFWAGIGEYIDQHGSHEEDEMSESEVEASDSESIY 145

Query: 462 EGHALAPS-LLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNP 638
           E  + APS LLAN  +    IS+A+PV Q     +S  A+ H+   +   SP SE  S+ 
Sbjct: 146 ESFSSAPSSLLANGFIPTVDISSAVPVEQ-----TSPTADIHHSQLQTPLSPTSEAGSSV 200

Query: 639 SSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGL 818
            SYATG++S +G +F E LFPL SPDT F       EK  S   P         ++N   
Sbjct: 201 LSYATGTTSLVGTAFLEKLFPLLSPDTSFLQNCSNPEKGGSHVPPKALNNNQVRENNCST 260

Query: 819 VVKRTPTLGELILMSRRRNHANAT 890
            V+   TLGELI+MSRRR++   T
Sbjct: 261 KVRHPLTLGELIMMSRRRSYQRKT 284


>ref|XP_004234156.1| PREDICTED: uncharacterized protein LOC101245523 [Solanum
           lycopersicum]
          Length = 328

 Score =  160 bits (406), Expect = 7e-37
 Identities = 100/264 (37%), Positives = 139/264 (52%), Gaps = 10/264 (3%)
 Frame = +3

Query: 129 QRGSVPFLWEEKPGTPKKDWKPETIPINNVP------PVVKLVASIPFKWEEKPGKPLPS 290
           Q+ S+PF+WEE+PG P KDWKP+ +            P VKL+AS+PF+WEEKPG PLP 
Sbjct: 24  QQISIPFIWEERPGIPIKDWKPKPVATATTSGAFTFTPPVKLIASVPFEWEEKPGTPLPF 83

Query: 291 FLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFD---ID 461
           F Q + +  ++  P                    I +  + EE+EM   E  + D   I 
Sbjct: 84  FSQTSPHENIVGLPSTVRAVHEGGDDFWAGIGEYIDQRGNHEEDEMTESEVEASDSESIY 143

Query: 462 EGHALAPS-LLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNP 638
           E  + APS LLAN  +    IS+A+PV Q     +S  A+ H+   ++  SP SE  S+ 
Sbjct: 144 ESFSSAPSSLLANGFIPTVDISSAVPVEQ-----TSPTADIHHTQLQSPLSPTSEAGSSV 198

Query: 639 SSYATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGL 818
            SYATG++S +G +F E LFPL SP+T F       EK  S   P         ++N  +
Sbjct: 199 LSYATGTTSLVGTAFLEKLFPLLSPNTSFLQNCSNPEKGGSHVPPKALNNNQVRENNCSI 258

Query: 819 VVKRTPTLGELILMSRRRNHANAT 890
            V+   TLGELI+MSRRR++   T
Sbjct: 259 KVRHPLTLGELIMMSRRRSYQRKT 282


>ref|XP_007143328.1| hypothetical protein PHAVU_007G063100g [Phaseolus vulgaris]
           gi|561016518|gb|ESW15322.1| hypothetical protein
           PHAVU_007G063100g [Phaseolus vulgaris]
          Length = 300

 Score =  158 bits (400), Expect = 3e-36
 Identities = 108/275 (39%), Positives = 147/275 (53%), Gaps = 16/275 (5%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPET--IPINNVPPV-VKLVASIPFKWEEKPGKPLPSFLQPNT 308
           +VPF+WE KPG PKKDWK E     + + P   +KL+AS+PF WEEKPGKPLP+F   + 
Sbjct: 15  AVPFIWEVKPGIPKKDWKAEAEVSSLGHFPQTPLKLIASVPFVWEEKPGKPLPNFSDVSV 74

Query: 309 NSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDND-------DEEEEMLGPETNSFDIDEG 467
           + P+L  P   LI                H+D D        E    L  E  +FD DE 
Sbjct: 75  D-PVLPKPEKTLIHIASSSGFSVACNFG-HDDKDKGSCSYDSESITSLDLEAFTFDADES 132

Query: 468 HALAPSLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSY 647
             L PSLLAN L+ +A +S+AIP+ +   + +S                 SETDS+ SSY
Sbjct: 133 FGLVPSLLANCLVPSAKVSSAIPLAETPSSPAS-----------------SETDSSISSY 175

Query: 648 ATGSSSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGCQSNSGL--- 818
           ATG SS +GA+F E LFPL++P      Q GF E+  +L        E+G +    +   
Sbjct: 176 ATGRSSPIGATFLESLFPLYAP------QSGFLERDENLQKETSSTHEVGAKDFDHVDIA 229

Query: 819 --VVKRTPTLGELILMSRRRN-HANATHIRKRNLP 914
             +++R PTLGELI+MSRRR+    A  ++K +LP
Sbjct: 230 SDMIRRPPTLGELIMMSRRRSCRRKAVQMKKWDLP 264


>ref|XP_004496654.1| PREDICTED: uncharacterized protein LOC101496421 [Cicer arietinum]
          Length = 262

 Score =  157 bits (398), Expect = 6e-36
 Identities = 100/260 (38%), Positives = 135/260 (51%), Gaps = 4/260 (1%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPETIPINNVPP--VVKLVASIPFKWEEKPGKPLPSFLQPNTN 311
           S+PF+WE KPG PKKDWKP    ++   P   +K +AS+PF WEEKPGKPL +F   +  
Sbjct: 15  SIPFIWEAKPGIPKKDWKPVASSLSQSLPKTPLKQIASVPFVWEEKPGKPLHNFSHVSV- 73

Query: 312 SPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDIDEGHALAPSLL 491
                                          N+ E    L  E+ SF+ DE  +L PSLL
Sbjct: 74  -------------------------------NESESITSLDLESFSFENDESVSLVPSLL 102

Query: 492 ANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFM 671
           AN L+S+  +S+AIP+ QN +  SS  A              SETD + SSY TG SS  
Sbjct: 103 ANCLVSSTKVSSAIPLQQNSLYVSSSPAS-------------SETDCSISSYETGMSSLT 149

Query: 672 GASFFECLFPLFSPDTGF--PNQVGFHEKSSSLTLPDPRIEELGCQSNSGLVVKRTPTLG 845
           G++F ECLFPLF P +GF   N  G  EK       D +IE+   +  + ++ ++ PTLG
Sbjct: 150 GSAFLECLFPLFPPKSGFLERNNTGHTEK-------DIKIEDFEHEDYTCVISRKPPTLG 202

Query: 846 ELILMSRRRNHANATHIRKR 905
           ELI+MSRRR+  N   +  +
Sbjct: 203 ELIMMSRRRSCRNKASLMNK 222


>gb|ACU21406.1| unknown [Glycine max]
          Length = 222

 Score =  141 bits (355), Expect = 6e-31
 Identities = 88/201 (43%), Positives = 109/201 (54%), Gaps = 5/201 (2%)
 Frame = +3

Query: 138 SVPFLWEEKPGTPKKDWKPETIPINNVPPV-VKLVASIPFKWEEKPGKPLPSFLQPNTNS 314
           SVPF+WE KPG PKKDWKPE  P   VP   +KL+AS+PF WEEKPGKPLP+F   +   
Sbjct: 15  SVPFIWEVKPGIPKKDWKPEPEP--EVPKTPLKLIASVPFVWEEKPGKPLPNFSVDHPVP 72

Query: 315 P---LLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDIDEGHALA-P 482
           P   L+         F                 +D+E    L  E  SFD DE    + P
Sbjct: 73  PKPLLIHVASSSAFSFACNFGHDHDKDKGSLSSSDNESITTLDLEAFSFDEDESFVSSVP 132

Query: 483 SLLANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSS 662
           SLLAN L+ +A +STAIP+ +   ++ +                 SETDS  SSYATG S
Sbjct: 133 SLLANCLVPSAKVSTAIPLRETTPSSPA---------------SSSETDSGTSSYATGMS 177

Query: 663 SFMGASFFECLFPLFSPDTGF 725
           S +GA+F ECLFPLF P +GF
Sbjct: 178 SPIGATFLECLFPLFPPKSGF 198


>ref|XP_006280760.1| hypothetical protein CARUB_v10026727mg [Capsella rubella]
           gi|482549464|gb|EOA13658.1| hypothetical protein
           CARUB_v10026727mg [Capsella rubella]
          Length = 341

 Score =  124 bits (310), Expect = 9e-26
 Identities = 97/305 (31%), Positives = 137/305 (44%), Gaps = 41/305 (13%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVPPV--------VKLVASIPFKWEEKP 272
           K + Q  SVPF+WEE+PG PKKDW+P        PP         VKLV S+PF+WEE P
Sbjct: 11  KQLRQPPSVPFIWEERPGYPKKDWQPSLATFVPSPPPLPPPVPVPVKLVTSVPFRWEETP 70

Query: 273 GKPLPSFLQPNTNSPLLTFPPLKL-------------------IGFXXXXXXXXXXXXXI 395
           GKPLP+    + N P L  PPL+                    + F             +
Sbjct: 71  GKPLPA---SSNNQPQLPHPPLETATTTSLPPPVPVPVKLVTSVPFDWEETPGQPYPCFV 127

Query: 396 HEDNDDEEEEMLGP-------ETNSFDIDEGHALAPSLLANRLMSAAAISTAIPVNQNVV 554
             +  +  ++ L P       ETNS   D+    A S   + + S  A + ++ ++  VV
Sbjct: 128 DFNPREPLDQPLPPPPMYGEVETNSDIFDD----ASSDSFSSVPSLLATNRSVSISNTVV 183

Query: 555 NASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDTGFPNQ 734
                  + H       +SP  E+D + SSY TG+SS +GASF E LFP        P +
Sbjct: 184 AMDEFDDKQHRETSSTPSSPTYESDDSTSSYMTGASSLVGASFLEKLFPRL-----LPAE 238

Query: 735 VGFHEKSSSLTLPDPRIEELGCQSNS------GLVVKRTPTLGELILMSRRRNH-ANATH 893
                 S  + +P   + E G  +        G  V+   TLGELI+MSRRR++   A  
Sbjct: 239 KVKAADSEDVQVPTHPLNEEGKLTTESDNMSIGFPVRMPQTLGELIMMSRRRSYMRRAVE 298

Query: 894 IRKRN 908
           +RK+N
Sbjct: 299 MRKQN 303


>ref|XP_002864122.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
           subsp. lyrata] gi|297309957|gb|EFH40381.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 343

 Score =  118 bits (295), Expect = 5e-24
 Identities = 101/309 (32%), Positives = 136/309 (44%), Gaps = 45/309 (14%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVPPV--------VKLVASIPFKWEEKP 272
           K + Q  SVPF+WEE+PG PKK+W+P        PP+        VKLV S+PF+WEE P
Sbjct: 14  KQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPLLPPPVPVPVKLVTSVPFRWEETP 73

Query: 273 GKPLPSFLQPNTNSP-LLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEE-------- 425
           GKPLP    P++N P  L  PPL+                 +     D EE         
Sbjct: 74  GKPLP----PSSNDPPQLPHPPLETATTTPLPPPVPVPVKQVTSVPFDWEETPGQPYPCF 129

Query: 426 -----------------MLGPETNSFDI-----DEGHALAPSLLANRLMSAAAISTAIPV 539
                            M G    S DI      +  +  PSLLA     + +IS A+ V
Sbjct: 130 VDTNPPELLDQPLPPPPMYGEVETSSDIFDDASSDSFSSVPSLLATN--RSVSISGAVAV 187

Query: 540 NQNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDT 719
           ++   N + +             SP  E+D + SSY TG+SS +GASF E LFP   P  
Sbjct: 188 DEFDDNLNRVTRSM-------PTSPAYESDDSTSSYMTGASSLVGASFLEKLFPRLLP-- 238

Query: 720 GFPNQVGFHEKSSSLTLPDPRIEELGCQSNS-----GLVVKRTPTLGELILMSRRRNH-A 881
               +V   +         P  EE+   + S     G  V+   TLGELI+MSRRR++  
Sbjct: 239 --LEKVKSADSEDVQVSTHPLHEEVKLTTESDNMSIGFPVRAPQTLGELIMMSRRRSYMR 296

Query: 882 NATHIRKRN 908
            A  +RK+N
Sbjct: 297 RAVEMRKQN 305


>dbj|BAB11237.1| unnamed protein product [Arabidopsis thaliana]
          Length = 325

 Score =  117 bits (293), Expect = 9e-24
 Identities = 97/304 (31%), Positives = 136/304 (44%), Gaps = 40/304 (13%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVPPV--------VKLVASIPFKWEEKP 272
           K + Q  SVPF+WEE+PG PKK+W+P        PP         VKLV S+PF+WEE P
Sbjct: 14  KQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPPIPVPVKLVTSVPFRWEETP 73

Query: 273 GKPLPSFLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEE--------- 425
           GKPLP+    + + P L  PPL+                 +     D EE          
Sbjct: 74  GKPLPA---SSNDPPQLPHPPLETATPTPLPPPVPVPVKQVTSVPFDWEETPGQPYPCFV 130

Query: 426 ----------------MLGPETNSFDI-----DEGHALAPSLLANRLMSAAAISTAIPVN 542
                           M G    S DI      +  +  PSLLA     + +IS A+ V+
Sbjct: 131 DTSPPELLDQPLPPPPMYGDVETSSDIFDDASSDSFSSVPSLLATN--RSVSISGAVAVD 188

Query: 543 QNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDTG 722
           +   N +++ +           SP  E+D + SSY TG+SS +GASF E LFP   P   
Sbjct: 189 EFDDNLNTVTSSM-------PTSPAYESDDSTSSYMTGASSLVGASFLEKLFPRLLPSEK 241

Query: 723 FPNQVGFHEKSSSLTL-PDPRIEELGCQSNSGLVVKRTPTLGELILMSRRRNH-ANATHI 896
               V    + S+  L  + ++       + G  V+   TLGELI+MSRRR++   A  +
Sbjct: 242 VKAAVSEDVQVSTHPLHEEVKLTTETDNMSIGFPVRTPQTLGELIMMSRRRSYMRRAVEM 301

Query: 897 RKRN 908
           RK+N
Sbjct: 302 RKQN 305


>ref|NP_199981.2| hydroxyproline-rich glycoprotein family protein [Arabidopsis
           thaliana] gi|332008731|gb|AED96114.1|
           hydroxyproline-rich glycoprotein family protein
           [Arabidopsis thaliana]
          Length = 343

 Score =  117 bits (293), Expect = 9e-24
 Identities = 97/304 (31%), Positives = 136/304 (44%), Gaps = 40/304 (13%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWKPETIPINNVPPV--------VKLVASIPFKWEEKP 272
           K + Q  SVPF+WEE+PG PKK+W+P        PP         VKLV S+PF+WEE P
Sbjct: 14  KQLRQPPSVPFIWEERPGFPKKNWQPSLATFVPSPPPLPPPIPVPVKLVTSVPFRWEETP 73

Query: 273 GKPLPSFLQPNTNSPLLTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEE--------- 425
           GKPLP+    + + P L  PPL+                 +     D EE          
Sbjct: 74  GKPLPA---SSNDPPQLPHPPLETATPTPLPPPVPVPVKQVTSVPFDWEETPGQPYPCFV 130

Query: 426 ----------------MLGPETNSFDI-----DEGHALAPSLLANRLMSAAAISTAIPVN 542
                           M G    S DI      +  +  PSLLA     + +IS A+ V+
Sbjct: 131 DTSPPELLDQPLPPPPMYGDVETSSDIFDDASSDSFSSVPSLLATN--RSVSISGAVAVD 188

Query: 543 QNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASFFECLFPLFSPDTG 722
           +   N +++ +           SP  E+D + SSY TG+SS +GASF E LFP   P   
Sbjct: 189 EFDDNLNTVTSSM-------PTSPAYESDDSTSSYMTGASSLVGASFLEKLFPRLLPSEK 241

Query: 723 FPNQVGFHEKSSSLTL-PDPRIEELGCQSNSGLVVKRTPTLGELILMSRRRNH-ANATHI 896
               V    + S+  L  + ++       + G  V+   TLGELI+MSRRR++   A  +
Sbjct: 242 VKAAVSEDVQVSTHPLHEEVKLTTETDNMSIGFPVRTPQTLGELIMMSRRRSYMRRAVEM 301

Query: 897 RKRN 908
           RK+N
Sbjct: 302 RKQN 305


>emb|CBI24501.3| unnamed protein product [Vitis vinifera]
          Length = 166

 Score =  109 bits (273), Expect = 2e-21
 Identities = 63/139 (45%), Positives = 91/139 (65%), Gaps = 3/139 (2%)
 Frame = +3

Query: 504 MSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGSSSFMGASF 683
           MS  AISTA+PV +  +N      E+     E+ +SP SET+S+ S+YATG++S +G+SF
Sbjct: 1   MSTVAISTAVPVQKTSLN------EDSNDQPESPSSPASETNSSTSTYATGTTSLVGSSF 54

Query: 684 FECLFPLFSPDTGFPNQVGFHEKSSSLTLPDPRIEELGC--QSNSGLVVKRTPTLGELIL 857
            +CLFPLF P++GF  +VG  E S     P P ++  G   ++NS ++V+R PTLGELI+
Sbjct: 55  LDCLFPLFPPNSGFLAKVGCPEGSP----PPPELQNKGLDRETNSSVIVRRAPTLGELIM 110

Query: 858 MSRRRNH-ANATHIRKRNL 911
            SRRR++   A  +RK NL
Sbjct: 111 KSRRRSYRRKAVQMRKHNL 129


>ref|XP_006401924.1| hypothetical protein EUTSA_v10014011mg [Eutrema salsugineum]
           gi|557103014|gb|ESQ43377.1| hypothetical protein
           EUTSA_v10014011mg [Eutrema salsugineum]
          Length = 343

 Score = 98.2 bits (243), Expect = 5e-18
 Identities = 96/328 (29%), Positives = 131/328 (39%), Gaps = 64/328 (19%)
 Frame = +3

Query: 117 K*MTQRGSVPFLWEEKPGTPKKDWK---------------PETIPINNV----------- 218
           K + Q  SVPF+WEE+PG PKK+W+               P  +P+  V           
Sbjct: 11  KQLRQPPSVPFIWEERPGLPKKNWQPSLATFVPSPPPLPPPIPVPVKLVTSVPFRWEQTP 70

Query: 219 ----------------------------PPV---VKLVASIPFKWEEKPGKPLPSFLQPN 305
                                       PPV   VKLV S+PF  EE PG+P P F+  N
Sbjct: 71  GKPLPSSSNDPPQLPHPPLETATAPPLPPPVPVPVKLVTSVPFVREETPGQPYPCFVDTN 130

Query: 306 TNSPL-LTFPPLKLIGFXXXXXXXXXXXXXIHEDNDDEEEEMLGPETNSFDIDEGHALAP 482
              PL    PP  + G                E N D  ++     ++SF      +  P
Sbjct: 131 QTEPLDQPLPPPPMYGEV--------------ETNSDIYDD---ASSDSF------SSVP 167

Query: 483 SLL-ANRLMSAAAISTAIPVNQNVVNASSIVAENHYVWDENQASPVSETDSNPSSYATGS 659
           SLL  NR +  +   T    ++N+   +S V            SP  E+D + SSY TG+
Sbjct: 168 SLLTGNRSVPVSGAVTVDEFDENLNRETSSV----------PTSPGYESDDSTSSYMTGA 217

Query: 660 SSFMGASFFECLFPLFSPDTGFPNQVGFHEKSSSLTL----PDPRIEELGCQSNSGLVVK 827
           SS +GASF E LFP   P           E    +T      + ++       N G  V+
Sbjct: 218 SSLVGASFLEKLFPRLLPHEKVEAAAASSEDHLQVTTRTLHEEVKLTTASDNMNIGFPVR 277

Query: 828 RTPTLGELILMSRRRNH-ANATHIRKRN 908
              TLGELI+MSRRR++   A  +RK N
Sbjct: 278 TPQTLGELIMMSRRRSYMRRAVEMRKHN 305