BLASTX nr result

ID: Sinomenium22_contig00016844 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00016844
         (1253 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26785.3| unnamed protein product [Vitis vinifera]              433   e-119
ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252...   429   e-117
gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]     421   e-115
ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309...   416   e-113
ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809...   406   e-111
ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family prot...   402   e-109
ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794...   402   e-109
emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]   398   e-108
ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family prot...   397   e-108
ref|XP_002527549.1| conserved hypothetical protein [Ricinus comm...   394   e-107
ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family prot...   393   e-106
ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prun...   387   e-105
ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phas...   384   e-104
gb|ABK95394.1| unknown [Populus trichocarpa]                          382   e-103
ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Popu...   380   e-103
ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809...   376   e-101
ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210...   375   e-101
ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phas...   370   e-100
ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618...   367   8e-99
ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citr...   366   1e-98

>emb|CBI26785.3| unnamed protein product [Vitis vinifera]
          Length = 672

 Score =  433 bits (1113), Expect = e-119
 Identities = 239/430 (55%), Positives = 300/430 (69%), Gaps = 13/430 (3%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV YALQQ  W +QQRH D +K + K+ ++    GV  R+  R ET K++
Sbjct: 94   LHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKR---YGVAYRQGQRGETAKDS 150

Query: 181  HSSD---SCAQSLSLGSEKGGEQT------IKGEEAKKKVEIERSDGKDSLPSEDKK-GV 330
            H+S+       + S G+ + GE+       +KG +  K   + + + KD   +E+KK G 
Sbjct: 151  HNSNFENHSHDANSSGTLEKGERVSEIYDDVKGGD--KGDVVGKLEDKDLAAAEEKKAGT 208

Query: 331  DATTNCHTDESLKSSENPGGTDTEKSIFEA--VHDEGTSNVNGTCNTLQKSGFNTTENHD 504
            DA    + +   KSSEN  G+    S  EA  + D GT N  G+CN + ++  +  +N +
Sbjct: 209  DAVAKPNANSCSKSSENSEGSRCGISETEANDMDDGGTLNPKGSCNMIMENNAHPVQNQN 268

Query: 505  EKQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQL 684
            EK N   +PKTF+G E FDGKAVNVV+GL LYEEL D+ E+SK V L N+LR+AG+RGQL
Sbjct: 269  EKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQL 328

Query: 685  Q-GQTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERL 861
            Q GQT+VVS+RPMKG GRE+IQLG+PIADAP EDE++VG S+D + E+IP LL+D+I  L
Sbjct: 329  QAGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHL 388

Query: 862  VQSQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGD 1041
            V SQV+TVKPD+CIIDF+NEGDHSQPH+ P WFGRPVCILFLTEC+MTFGRVIG DHPGD
Sbjct: 389  VGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGD 448

Query: 1042 YXXXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLS 1221
            Y              VMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+  SDGQRL L 
Sbjct: 449  YRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRL-LP 507

Query: 1222 VSASALPWGP 1251
             +A +  W P
Sbjct: 508  PAAQSSHWVP 517


>ref|XP_002281644.1| PREDICTED: uncharacterized protein LOC100252594 [Vitis vinifera]
          Length = 698

 Score =  429 bits (1103), Expect = e-117
 Identities = 237/428 (55%), Positives = 298/428 (69%), Gaps = 11/428 (2%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV YALQQ  W +QQRH D +K + K+ ++    GV  R+  R ET K++
Sbjct: 94   LHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKR---YGVAYRQGQRGETAKDS 150

Query: 181  HSSD---SCAQSLSLGSEKGGEQT------IKGEEAKKKVEIERSDGKDSLPSEDKK-GV 330
            H+S+       + S G+ + GE+       +KG +  K   + + + KD   +E+KK G 
Sbjct: 151  HNSNFENHSHDANSSGTLEKGERVSEIYDDVKGGD--KGDVVGKLEDKDLAAAEEKKAGT 208

Query: 331  DATTNCHTDESLKSSENPGGTDTEKSIFEAVH-DEGTSNVNGTCNTLQKSGFNTTENHDE 507
            DA    + +   KSSEN  G+    S  EA   D+G     G+CN + ++  +  +N +E
Sbjct: 209  DAVAKPNANSCSKSSENSEGSRCGISETEANDMDDG-----GSCNMIMENNAHPVQNQNE 263

Query: 508  KQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQ 687
            K N   +PKTF+G E FDGKAVNVV+GL LYEEL D+ E+SK V L N+LR+AG+RGQLQ
Sbjct: 264  KPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRAAGKRGQLQ 323

Query: 688  GQTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQ 867
            GQT+VVS+RPMKG GRE+IQLG+PIADAP EDE++VG S+D + E+IP LL+D+I  LV 
Sbjct: 324  GQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKDRRTESIPSLLQDVIGHLVG 383

Query: 868  SQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYX 1047
            SQV+TVKPD+CIIDF+NEGDHSQPH+ P WFGRPVCILFLTEC+MTFGRVIG DHPGDY 
Sbjct: 384  SQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFGRVIGADHPGDYR 443

Query: 1048 XXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVS 1227
                         VMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+  SDGQRL L  +
Sbjct: 444  GSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTMASDGQRL-LPPA 502

Query: 1228 ASALPWGP 1251
            A +  W P
Sbjct: 503  AQSSHWVP 510


>gb|EXC05040.1| hypothetical protein L484_019288 [Morus notabilis]
          Length = 681

 Score =  421 bits (1082), Expect = e-115
 Identities = 225/418 (53%), Positives = 276/418 (66%), Gaps = 1/418 (0%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV +ALQQ AW +QQR +D +K+  K+ ++S   GVG ++W R ++ K+ 
Sbjct: 91   LHMQQYFSVAEVMFALQQVAWRRQQRFYDPVKMGNKEFKRS---GVGFKQWQRNDSFKDG 147

Query: 181  HSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHTDE 360
             +S + +  L   S  G   + KG   K   E+  SD + S+P+  +K  D+      D 
Sbjct: 148  RNSAAESHCLDGNSSFGNAASEKGGSDKSGDEVGNSDDRGSMPAAKEKN-DSAAKSQEDG 206

Query: 361  SLKSSEN-PGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLLPTPKT 537
            ++KS  N  G     +    AV D       G  ++ +++  ++T   +E  NL   PKT
Sbjct: 207  NVKSLGNFEGVVSGSEPEVHAVDD-------GCTSSSKENDSHSTPKQNENSNLANVPKT 259

Query: 538  FMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTYVVSRRP 717
            F GNE FDGK VNVVEGL LYEE   + E+SKLV L N+LRSAG+RG  Q QTYVVS+RP
Sbjct: 260  FSGNEMFDGKPVNVVEGLKLYEEFCADTEVSKLVALVNDLRSAGERGHFQSQTYVVSKRP 319

Query: 718  MKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKPDS 897
            MKG GRE IQLGLPIADAP EDE   G  +D + EAIP LL+D+ ERLV  QV TVKPDS
Sbjct: 320  MKGHGREKIQLGLPIADAPVEDEISAGTLKDRRTEAIPPLLQDVAERLVSMQVATVKPDS 379

Query: 898  CIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXXXXXXXX 1077
            CIIDF+NEGDHSQPH+ P WFGRPVC+LFLTEC+MTFGRV  IDHPGDY           
Sbjct: 380  CIIDFYNEGDHSQPHLWPSWFGRPVCVLFLTECDMTFGRVFAIDHPGDYRGALKLSLKPG 439

Query: 1078 XXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASALPWGP 1251
                MQGKSADFAKHAI S+R+QRILVTFTKSQPKKS  SDGQR+P    A +  WGP
Sbjct: 440  SLLAMQGKSADFAKHAIPSLRRQRILVTFTKSQPKKSMPSDGQRMPSPGVAPSSHWGP 497


>ref|XP_004297399.1| PREDICTED: uncharacterized protein LOC101309147 [Fragaria vesca
            subsp. vesca]
          Length = 682

 Score =  416 bits (1069), Expect = e-113
 Identities = 230/419 (54%), Positives = 290/419 (69%), Gaps = 2/419 (0%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV YALQQ AW +QQR+++ +K+  KD ++S   GVG +   R E +KE 
Sbjct: 93   LHMQQYFSVAEVIYALQQVAWRRQQRYYEPVKMGNKDYKRSN-SGVGFKP--RNEPVKEW 149

Query: 181  HSSDSCAQSLS-LGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHTD 357
            H++    +S    G EK G +    EE K   E  + D K S      KGV   T  H  
Sbjct: 150  HTASVEYRSYDGSGLEKVGSEM--REEVKPGGEAGKVDDKGSAAGAVTKGV--LTKPHEY 205

Query: 358  ESLKSSENPGGTDTEKSIFE-AVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLLPTPK 534
             S +SS N  GT +  S  E AV +EG ++      +++++  N+ +  +EKQNL   PK
Sbjct: 206  ISSRSSANSQGTISGNSESEDAVVNEGCTS------SIKENESNSIQIQNEKQNLSLIPK 259

Query: 535  TFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTYVVSRR 714
            TF+GNETFDGK VNVV+GL LYEE L + E+SKL  L N+LR+ G+RGQLQGQTYV+S+R
Sbjct: 260  TFVGNETFDGKTVNVVDGLKLYEEFLGDTEVSKLFSLVNDLRTTGRRGQLQGQTYVLSKR 319

Query: 715  PMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKPD 894
            PMKG GRE+IQLG+PIAD P EDE   G S+D +MEAIP LL+D+I+RL+ +QV+T KPD
Sbjct: 320  PMKGHGREMIQLGIPIADGPQEDEISAGISKDRRMEAIPSLLQDVIDRLIGTQVLTDKPD 379

Query: 895  SCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXXXXXXX 1074
            SCIIDFFNEGDHS PHM PPWFGRPV +LFLTEC++TFG+V+G+DHPGDY          
Sbjct: 380  SCIIDFFNEGDHSHPHMWPPWFGRPVSVLFLTECDLTFGKVLGMDHPGDYRGALRLSLTP 439

Query: 1075 XXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASALPWGP 1251
                ++QGKSAD+AKHAI SIRKQRILVTFTKSQP+KS  +DGQRLP    + +  W P
Sbjct: 440  GSLLLLQGKSADYAKHAIPSIRKQRILVTFTKSQPRKSFPTDGQRLPSPGPSQSPYWSP 498


>ref|XP_003524142.1| PREDICTED: uncharacterized protein LOC100809865 isoform X1 [Glycine
            max]
          Length = 681

 Score =  406 bits (1044), Expect = e-111
 Identities = 227/425 (53%), Positives = 283/425 (66%), Gaps = 8/425 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            L MQQYFSVA+V +ALQQ AW +QQR  D +KV  K+ RKS   G G R   R E +KE 
Sbjct: 96   LMMQQYFSVADVAHALQQVAWRRQQRPLDPVKVGAKEFRKS---GSGYRHGQRFEPVKEG 152

Query: 181  HSSD-------SCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDAT 339
            ++S            +++ G+EKG     K EE K   ++E+   K    +EDKK  DA 
Sbjct: 153  YNSSVESYNQYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASAEDKK--DAI 210

Query: 340  TNCHTDESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNL 519
            T   TD SLKS+ +  G+ +       V+DE  SN  G  +       ++ +N  + Q+L
Sbjct: 211  TKHQTDGSLKSTRSTEGSLSNLESEAVVNDECISNSKGDDS-------HSVQNQHQSQSL 263

Query: 520  LPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQG-QT 696
                KTF+GNE FDGK VNVV+GL LYE+L D+ EI+ LV L N+LR +G++GQLQG Q 
Sbjct: 264  STKAKTFIGNEMFDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQA 323

Query: 697  YVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQV 876
            Y+VSRRPMKG GRE+IQLG+PIADAPAE ENM G S+D  +E IP L +DIIER+V SQV
Sbjct: 324  YIVSRRPMKGHGREMIQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQV 383

Query: 877  MTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXX 1056
            MTVKPD CI+DF+NEGDHSQPH  P W+GRPV ILFLTEC MTFGRVI  +HPGDY    
Sbjct: 384  MTVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGI 443

Query: 1057 XXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASA 1236
                      VM+GKS+DFAKHA+ S+RKQRILVTFTKSQP+KS  SD QR  L+ +A++
Sbjct: 444  KLSLVPGSLLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQR--LASTATS 501

Query: 1237 LPWGP 1251
              WGP
Sbjct: 502  SHWGP 506


>ref|XP_007045468.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
            [Theobroma cacao] gi|590697545|ref|XP_007045470.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709403|gb|EOY01300.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao] gi|508709405|gb|EOY01302.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 2 [Theobroma cacao]
          Length = 680

 Score =  402 bits (1033), Expect = e-109
 Identities = 224/422 (53%), Positives = 276/422 (65%), Gaps = 5/422 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV+YALQQ AW ++QRH++  KV  K+ ++S     G R  +  E     
Sbjct: 108  LHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSG 167

Query: 181  HSSD--SCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHT 354
              SD  S   ++S  +E+G E   K EE K   E+ + + K S  +EDKK   +  +   
Sbjct: 168  VDSDGNSTVTAVSERNERGSE---KREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGD 224

Query: 355  DESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSG-FNTTENHDEKQNLLPTP 531
             ES+                       T +VNG C +  K     + +N +EKQNL   P
Sbjct: 225  AESV-----------------------TEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGP 261

Query: 532  KTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTYVVSR 711
            KTF+GNE FDGK VNVV+GL LYEEL D+ E+  LV L N+LR+AG+RGQLQGQTYV ++
Sbjct: 262  KTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQGQTYVAAK 321

Query: 712  RPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKP 891
            RPMKG GRE+IQLGLPIADAP +DEN  G S+D ++E IP LL+D IERLV  QVMTVKP
Sbjct: 322  RPMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKP 381

Query: 892  DSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGI-DHPGDYXXXXXXXX 1068
            DSCIID +NEGDHSQP M PPWFG+PVCI+FLTEC++TFGRV+ + DHPGDY        
Sbjct: 382  DSCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSL 441

Query: 1069 XXXXXXVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSTVSDGQRLPLSVSASALPW 1245
                  VMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKST +D QRL     + +  W
Sbjct: 442  APGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKST-TDNQRLSSPSVSQSSQW 500

Query: 1246 GP 1251
            GP
Sbjct: 501  GP 502


>ref|XP_006585073.1| PREDICTED: uncharacterized protein LOC100794176 [Glycine max]
          Length = 683

 Score =  402 bits (1032), Expect = e-109
 Identities = 224/424 (52%), Positives = 283/424 (66%), Gaps = 7/424 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            L MQQYFSVA+V YALQQ AW +QQR  D MKV  K+ RKS   G G R   R E++KE 
Sbjct: 100  LMMQQYFSVADVAYALQQVAWRRQQRPLDPMKVGAKEVRKS---GSGYRHGQRFESVKEG 156

Query: 181  HSSDSCAQS------LSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATT 342
            ++S   + S      ++ G+EKG     K EE K   ++E+   K     E+KK  DA T
Sbjct: 157  YNSSVESYSHDANVAVTGGTEKGTPVVEKSEEHKSGGKVEKVGDKGLASVEEKK--DAIT 214

Query: 343  NCHTDESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLL 522
            N  ++ SLKS+ +  G+ +       V+D   SN  G       +  ++ +N  + Q+L 
Sbjct: 215  NHQSEGSLKSARSTEGSLSNLESEAVVNDGCISNSKG-------NDLHSVQNQSQSQSLS 267

Query: 523  PTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQG-QTY 699
               KTF+GNE FDGK VNVV+GL LY++L D+ E++ LV L N+LR +G++GQLQG Q Y
Sbjct: 268  NIAKTFIGNEMFDGKTVNVVDGLKLYDDLFDSTEVANLVSLVNDLRVSGKKGQLQGSQAY 327

Query: 700  VVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVM 879
            +VSRRPMKG GRE+IQLG+ IADAPAE ENM G S+D  +E+IP L +DIIER+V SQVM
Sbjct: 328  IVSRRPMKGHGREMIQLGVRIADAPAEGENMTGASKDMNVESIPSLFQDIIERMVSSQVM 387

Query: 880  TVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXX 1059
            TVKPD CI+DF+NEGDHSQPH  P W+GRPV +LFLTEC MTFGRVI  +HPGDY     
Sbjct: 388  TVKPDCCIVDFYNEGDHSQPHSWPSWYGRPVYVLFLTECEMTFGRVIASEHPGDYRGSIK 447

Query: 1060 XXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASAL 1239
                     VMQGKS+DFAKHA+ S RKQRILVTFTKSQP+KS  SD Q+L  +V++S  
Sbjct: 448  LSLVPGSLLVMQGKSSDFAKHALPSTRKQRILVTFTKSQPRKSLSSDAQQLASAVASS-- 505

Query: 1240 PWGP 1251
             WGP
Sbjct: 506  HWGP 509


>emb|CAN65462.1| hypothetical protein VITISV_002198 [Vitis vinifera]
          Length = 1145

 Score =  398 bits (1022), Expect = e-108
 Identities = 228/440 (51%), Positives = 295/440 (67%), Gaps = 23/440 (5%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV YALQQ  W +QQRH D +K + K+ ++    GV  R+  R ET K++
Sbjct: 92   LHMQQYFSVAEVIYALQQVGWRRQQRHLDPVKGAGKEYKR---YGVAYRQGQRGETAKDS 148

Query: 181  HSSD---SCAQSLSLGSEKGGEQT------IKGEEAKKKVEIERSDGKD-SLPSEDKKGV 330
            H+S+       + S G+ + GE+       +KG +  K   + + + KD S  +E K+ +
Sbjct: 149  HNSNFENHSHDANSSGTLEKGERVSEIYDDVKGGD--KGDVVGKLEDKDLSAAAEKKEVM 206

Query: 331  DATTNCHTDESLKSSENPGGT------DTEKS---IFEAVHDEGTSNVNGTCNTLQKSGF 483
            +       ++ L   +NP          T+K     F+ +          +CN + ++  
Sbjct: 207  NFVIFGQLEQMLL--QNPMQIAVRRVQKTQKDPDVAFQRLRPMTWMMEARSCNMIMENNA 264

Query: 484  NTTENHDEKQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRS 663
            +  +N +EK N   +PKTF+G E FDGKAVNVV+GL LYEEL D+ E+SK V L N+LR+
Sbjct: 265  HPVQNQNEKPNPTTSPKTFVGTEIFDGKAVNVVDGLKLYEELFDDSEVSKFVSLVNDLRA 324

Query: 664  AGQRGQLQGQTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSE----DGKMEAIP 831
            AG+RGQLQGQT+VVS+RPMKG GRE+IQLG+PIADAP EDE++VG S+    + + E+IP
Sbjct: 325  AGKRGQLQGQTFVVSKRPMKGHGREMIQLGVPIADAPLEDESVVGTSKGMFHNRRTESIP 384

Query: 832  VLLKDIIERLVQSQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFG 1011
             LL+D+I +LV SQV+TVKPD+CIIDF+NEGDHSQPH+ P WFGRPVCILFLTEC+MTFG
Sbjct: 385  SLLQDVIGQLVGSQVLTVKPDACIIDFYNEGDHSQPHIWPTWFGRPVCILFLTECDMTFG 444

Query: 1012 RVIGIDHPGDYXXXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKST 1191
            RVIG DHPGDY              VMQGKSADFAKHAI S+RKQRILVTFTKSQPKK+T
Sbjct: 445  RVIGADHPGDYRGSLKLSLVPGSLLVMQGKSADFAKHAIPSLRKQRILVTFTKSQPKKTT 504

Query: 1192 VSDGQRLPLSVSASALPWGP 1251
             SDGQRL L  +A +  W P
Sbjct: 505  ASDGQRL-LPPAAQSSHWVP 523


>ref|XP_007045467.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|590697542|ref|XP_007045469.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709402|gb|EOY01299.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao] gi|508709404|gb|EOY01301.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 681

 Score =  397 bits (1021), Expect = e-108
 Identities = 224/423 (52%), Positives = 276/423 (65%), Gaps = 6/423 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LHMQQYFSVAEV+YALQQ AW ++QRH++  KV  K+ ++S     G R  +  E     
Sbjct: 108  LHMQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSG 167

Query: 181  HSSD--SCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHT 354
              SD  S   ++S  +E+G E   K EE K   E+ + + K S  +EDKK   +  +   
Sbjct: 168  VDSDGNSTVTAVSERNERGSE---KREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGD 224

Query: 355  DESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSG-FNTTENHDEKQNLLPTP 531
             ES+                       T +VNG C +  K     + +N +EKQNL   P
Sbjct: 225  AESV-----------------------TEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGP 261

Query: 532  KTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQ-GQTYVVS 708
            KTF+GNE FDGK VNVV+GL LYEEL D+ E+  LV L N+LR+AG+RGQLQ GQTYV +
Sbjct: 262  KTFVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAA 321

Query: 709  RRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVK 888
            +RPMKG GRE+IQLGLPIADAP +DEN  G S+D ++E IP LL+D IERLV  QVMTVK
Sbjct: 322  KRPMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVK 381

Query: 889  PDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGI-DHPGDYXXXXXXX 1065
            PDSCIID +NEGDHSQP M PPWFG+PVCI+FLTEC++TFGRV+ + DHPGDY       
Sbjct: 382  PDSCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLS 441

Query: 1066 XXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSTVSDGQRLPLSVSASALP 1242
                   VMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKST +D QRL     + +  
Sbjct: 442  LAPGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKST-TDNQRLSSPSVSQSSQ 500

Query: 1243 WGP 1251
            WGP
Sbjct: 501  WGP 503


>ref|XP_002527549.1| conserved hypothetical protein [Ricinus communis]
            gi|223533099|gb|EEF34858.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 697

 Score =  394 bits (1013), Expect = e-107
 Identities = 222/435 (51%), Positives = 283/435 (65%), Gaps = 18/435 (4%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRH------------FDKMKVSEKDSRKSAFQGVGS 144
            LHMQQYFSV EV  ALQQ A  KQQ+H            +D+ KV  KD ++++  G   
Sbjct: 104  LHMQQYFSVGEVILALQQVALRKQQQHQHQHQHQQHRYYYDQPKVGGKDFKRNSSMGFNK 163

Query: 145  RKWIRTETIKE-NHSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDK 321
                  E +KE N+ ++S       G+  G E   K  E K   +  R + K    +EDK
Sbjct: 164  GHRGGGEVVKEVNYGAESHGLD---GNTSGNE---KFNEIKSGGDSGRLENKSLATAEDK 217

Query: 322  KGVDATTNCHTDESLKSSENPGGT-----DTEKSIFEAVHDEGTSNVNGTCNTLQKSGFN 486
            K  DA +  H D +LKSS N  G+     +TE    EAVH++ +   + +         +
Sbjct: 218  K--DAASKPHVD-NLKSSGNSEGSLSGNLETEA---EAVHEQSSPKEHDS---------H 262

Query: 487  TTENHDEKQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSA 666
              +N   K NL  TPKTF+G E  DGK+VNVV+GL LYE+LLD++E+SKLV L N+LR+A
Sbjct: 263  FIQNQIVKLNLTTTPKTFVGAEMVDGKSVNVVDGLKLYEQLLDDVEVSKLVSLVNDLRAA 322

Query: 667  GQRGQLQGQTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKD 846
            G++GQ QGQ YVVS+RPMKG GRE+IQLGLPIADAPAE+EN  G S+D K+E+IP LL++
Sbjct: 323  GRKGQFQGQAYVVSKRPMKGHGREMIQLGLPIADAPAEEENAAGTSKDRKIESIPTLLQE 382

Query: 847  IIERLVQSQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGI 1026
            +IER V  Q+MT+KPDSCIID +NEGDHSQPHM PPWFG+P+ +LFLTEC++TFGRVI  
Sbjct: 383  VIERFVSMQIMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPISVLFLTECDLTFGRVITA 442

Query: 1027 DHPGDYXXXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQ 1206
            DHPGDY              VMQGK+ DFAKHAI +IRKQR+L+TFTKSQPKK   SDGQ
Sbjct: 443  DHPGDYRGSLKLPLAPGSLLVMQGKATDFAKHAIPAIRKQRVLLTFTKSQPKKFVQSDGQ 502

Query: 1207 RLPLSVSASALPWGP 1251
            RL    ++ +  WGP
Sbjct: 503  RLTSPAASPSSHWGP 517


>ref|XP_007045471.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 5
            [Theobroma cacao] gi|508709406|gb|EOY01303.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 5 [Theobroma cacao]
          Length = 572

 Score =  393 bits (1009), Expect = e-106
 Identities = 222/421 (52%), Positives = 274/421 (65%), Gaps = 6/421 (1%)
 Frame = +1

Query: 7    MQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKENHS 186
            MQQYFSVAEV+YALQQ AW ++QRH++  KV  K+ ++S     G R  +  E       
Sbjct: 1    MQQYFSVAEVSYALQQVAWRRRQRHYESGKVGGKEFKRSGMGFKGQRMEVAKEGQNSGVD 60

Query: 187  SD--SCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHTDE 360
            SD  S   ++S  +E+G E   K EE K   E+ + + K S  +EDKK   +  +    E
Sbjct: 61   SDGNSTVTAVSERNERGSE---KREEVKSCGEVGKVEDKCSTFTEDKKDTGSKPHAGDAE 117

Query: 361  SLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSG-FNTTENHDEKQNLLPTPKT 537
            S+                       T +VNG C +  K     + +N +EKQNL   PKT
Sbjct: 118  SV-----------------------TEDVNGGCTSSYKENDLCSIQNQNEKQNLAAGPKT 154

Query: 538  FMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQ-GQTYVVSRR 714
            F+GNE FDGK VNVV+GL LYEEL D+ E+  LV L N+LR+AG+RGQLQ GQTYV ++R
Sbjct: 155  FVGNEMFDGKMVNVVDGLKLYEELFDDKEVLDLVSLVNDLRAAGKRGQLQAGQTYVAAKR 214

Query: 715  PMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKPD 894
            PMKG GRE+IQLGLPIADAP +DEN  G S+D ++E IP LL+D IERLV  QVMTVKPD
Sbjct: 215  PMKGHGREMIQLGLPIADAPLDDENAAGTSKDRRIEGIPPLLQDTIERLVNLQVMTVKPD 274

Query: 895  SCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGI-DHPGDYXXXXXXXXX 1071
            SCIID +NEGDHSQP M PPWFG+PVCI+FLTEC++TFGRV+ + DHPGDY         
Sbjct: 275  SCIIDVYNEGDHSQPRMWPPWFGKPVCIMFLTECDITFGRVVIVADHPGDYRGSLKLSLA 334

Query: 1072 XXXXXVMQGKSADFAKHAISSIRKQRILVTFTK-SQPKKSTVSDGQRLPLSVSASALPWG 1248
                 VMQGKSADFAKHA+ S+RKQRILVTFTK  QPKKST +D QRL     + +  WG
Sbjct: 335  PGSLLVMQGKSADFAKHALPSVRKQRILVTFTKYCQPKKST-TDNQRLSSPSVSQSSQWG 393

Query: 1249 P 1251
            P
Sbjct: 394  P 394


>ref|XP_007225122.1| hypothetical protein PRUPE_ppa002630mg [Prunus persica]
            gi|462422058|gb|EMJ26321.1| hypothetical protein
            PRUPE_ppa002630mg [Prunus persica]
          Length = 650

 Score =  387 bits (994), Expect = e-105
 Identities = 217/418 (51%), Positives = 261/418 (62%), Gaps = 1/418 (0%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWI-RTETIKE 177
            LHMQQYFSVAEV YALQ  AW +QQR++D +K   K+ ++S   GVG  K   R E  KE
Sbjct: 94   LHMQQYFSVAEVIYALQHVAWRRQQRYYDPVKAGAKEFKRS---GVGFNKGQQRAEAFKE 150

Query: 178  NHSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHTD 357
             H+S                           +E   +DG  S       GV A       
Sbjct: 151  GHNST--------------------------LESHSNDGNSS-------GVVAPEKFERG 177

Query: 358  ESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLLPTPKT 537
              +     PGG +  K   + +   G   VN +         ++ +  ++KQNL   PKT
Sbjct: 178  SEVGEEVEPGG-EVGKLNDKGLAPAGEKKVNES---------HSIQIQNQKQNLSIVPKT 227

Query: 538  FMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTYVVSRRP 717
            F+GNE  DGK VNVV+GL LYE+ L + E+SKLV L N+LR+AG+R QLQGQTYVVS+RP
Sbjct: 228  FIGNEISDGKTVNVVDGLKLYEDFLGDTEVSKLVSLVNDLRAAGKRRQLQGQTYVVSKRP 287

Query: 718  MKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKPDS 897
            MKG GRE+IQLG+PIADAP EDE   G S+D K+E IP LL+D+I+RLV   VMTVKPDS
Sbjct: 288  MKGHGREMIQLGIPIADAPPEDEISAGTSKDRKIEPIPSLLQDVIDRLVGMHVMTVKPDS 347

Query: 898  CIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXXXXXXXX 1077
            CIID +NEGDHSQPH  P WFGRPVC L+LTEC+MTFGR++ +DHPGDY           
Sbjct: 348  CIIDVYNEGDHSQPHTWPSWFGRPVCALYLTECDMTFGRLLLMDHPGDYRGSLRLSLTPG 407

Query: 1078 XXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASALPWGP 1251
               +MQGKSADFAKHAI SIRKQRILVT TKSQPKKST SDGQR P    A +  WGP
Sbjct: 408  SILLMQGKSADFAKHAIPSIRKQRILVTLTKSQPKKSTTSDGQRFPAPAPAQSSYWGP 465


>ref|XP_007158785.1| hypothetical protein PHAVU_002G181800g [Phaseolus vulgaris]
            gi|561032200|gb|ESW30779.1| hypothetical protein
            PHAVU_002G181800g [Phaseolus vulgaris]
          Length = 671

 Score =  384 bits (987), Expect = e-104
 Identities = 222/425 (52%), Positives = 268/425 (63%), Gaps = 8/425 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            L MQQYFSVA+VTY LQQ AW KQQR  D +KV  K+ RK    G G R   R E  KE 
Sbjct: 94   LLMQQYFSVADVTYTLQQVAWRKQQRPLDPVKVGAKEVRKP---GPGYRYGHRFEPSKEG 150

Query: 181  HSSDSCAQS------LSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATT 342
            ++S   + S       + G EKG     K EE K   ++E+   K     E+KK  DA  
Sbjct: 151  YNSSVESYSHDGNATFTRGMEKGTPTVDKSEEHKSGSKVEKVGDKGLASPEEKK--DAII 208

Query: 343  NCHTDESLKSS-ENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNL 519
               TD +LKS+  + G     +S    V+DE  SN  G  +       ++ E+  + Q+ 
Sbjct: 209  KHQTDGNLKSTGSSEGYLSNLESEAVVVNDEFISNSKGNDS-------DSVESQHQSQSF 261

Query: 520  LPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQG-QT 696
                KTF+GNE  DGK VN+ +GL LYE++ D+ E+S LV L N+LR +G++GQLQG Q 
Sbjct: 262  STIAKTFIGNEMIDGKMVNLADGLKLYEDIFDSTEVSNLVSLVNDLRISGKKGQLQGNQA 321

Query: 697  YVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQV 876
            YVVSRRPMKG GRE+IQLG+PIADAP E ENM G S+   +E IP L +DIIER+V SQV
Sbjct: 322  YVVSRRPMKGHGREMIQLGVPIADAPVEGENMTGASKVMNVEPIPSLFEDIIERMVSSQV 381

Query: 877  MTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXX 1056
            MT KPD CI+DF+NEGDHSQPH  P WFGRPV  LFLTEC MTFGR+I  +HPGDY    
Sbjct: 382  MTTKPDCCIVDFYNEGDHSQPHSWPSWFGRPVYTLFLTECEMTFGRLIASEHPGDYRGSL 441

Query: 1057 XXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASA 1236
                       MQGKS DFAKHA+ SIRKQRILVTFTKSQPKKS  SD QRL L  ++S 
Sbjct: 442  KLSLVPGSLLAMQGKSCDFAKHALPSIRKQRILVTFTKSQPKKSVPSDAQRLYLPAASS- 500

Query: 1237 LPWGP 1251
              WGP
Sbjct: 501  -QWGP 504


>gb|ABK95394.1| unknown [Populus trichocarpa]
          Length = 694

 Score =  382 bits (981), Expect = e-103
 Identities = 212/428 (49%), Positives = 263/428 (61%), Gaps = 11/428 (2%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQ----GVGSRKWIRTET 168
            LHMQQYFSV EV  ALQQ    +QQ+   + +  +    +  F      VG R + R+ +
Sbjct: 98   LHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSS 157

Query: 169  IKENHS-------SDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKG 327
               N          D+  + ++   E         E  + +   E   G D   S+DKK 
Sbjct: 158  AGFNRGHRGGGGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKK- 216

Query: 328  VDATTNCHTDESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDE 507
             DAT   HTD    SS N  GT +  S   AV D  +          ++S  + + N +E
Sbjct: 217  -DATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSP---------EESDSHPSNNQNE 266

Query: 508  KQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQ 687
            KQNL  TPKTF+  E  DG+ VNVV+GL LYE LLD LE+SKLV L NELR+ G+RGQ Q
Sbjct: 267  KQNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQ 326

Query: 688  GQTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQ 867
            GQTY++S+RPMKG GRE+IQLGLPIADAPAEDEN  G S++ ++E+IP LL+D+IE  V 
Sbjct: 327  GQTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVA 386

Query: 868  SQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYX 1047
             QVMT+KPDSCIID +NEGDHSQPHM PPWFG+PV +LFLTEC +TFG+VI   H GDY 
Sbjct: 387  MQVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYK 446

Query: 1048 XXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVS 1227
                         VMQGKS+D AKHAI  I+KQR+LVTFTKSQPKK T +DG RLP    
Sbjct: 447  GSLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAV 506

Query: 1228 ASALPWGP 1251
            A +  WGP
Sbjct: 507  APSSHWGP 514


>ref|XP_006379789.1| hypothetical protein POPTR_0008s13830g [Populus trichocarpa]
            gi|550333016|gb|ERP57586.1| hypothetical protein
            POPTR_0008s13830g [Populus trichocarpa]
          Length = 693

 Score =  380 bits (977), Expect = e-103
 Identities = 213/427 (49%), Positives = 262/427 (61%), Gaps = 10/427 (2%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQG-VGSRKWIRTETIKE 177
            LHMQQYFSV EV  ALQQ    +QQ+   + +      R     G VG R + R+ +   
Sbjct: 98   LHMQQYFSVGEVIVALQQVVLRRQQQQQQQQQNHHHQQRFYYDHGKVGGRDFKRSSSAGF 157

Query: 178  NHS---------SDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGV 330
            N            D+  + ++   E         E  + +   E   G D   S+DKK  
Sbjct: 158  NRGHRGGGGGGGGDAVKEGVNSSVENHSFNGNSSENIRSEKFEEVKSGGDGGKSDDKK-- 215

Query: 331  DATTNCHTDESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEK 510
            DAT   HTD    SS N  GT +  S   AV D  +          ++S  + + N +EK
Sbjct: 216  DATAKSHTDNHKNSSGNAQGTFSGNSEAVAVDDRSSP---------EESDSHPSNNQNEK 266

Query: 511  QNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQG 690
            QNL  TPKTF+  E  DG+ VNVV+GL LYE LLD LE+SKLV L NELR+ G+RGQ QG
Sbjct: 267  QNLAITPKTFVAEEKIDGQMVNVVDGLKLYENLLDGLEVSKLVSLVNELRATGRRGQCQG 326

Query: 691  QTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQS 870
            QTY++S+RPMKG GRE+IQLGLPIADAPAEDEN  G S++ ++E+IP LL+D+IE  V  
Sbjct: 327  QTYILSKRPMKGHGREMIQLGLPIADAPAEDENATGTSKERRVESIPALLQDVIEHFVAM 386

Query: 871  QVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXX 1050
            QVMT+KPDSCIID +NEGDHSQPHM PPWFG+PV +LFLTEC +TFG+VI   H GDY  
Sbjct: 387  QVMTMKPDSCIIDIYNEGDHSQPHMWPPWFGKPVSVLFLTECELTFGKVIDTLHHGDYKG 446

Query: 1051 XXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSA 1230
                        VMQGKS+D AKHAI  I+KQR+LVTFTKSQPKK T +DG RLP    A
Sbjct: 447  SLKLSVAPGSLLVMQGKSSDLAKHAIPMIKKQRMLVTFTKSQPKKLTSNDGPRLPSHAVA 506

Query: 1231 SALPWGP 1251
             +  WGP
Sbjct: 507  PSSHWGP 513


>ref|XP_006580091.1| PREDICTED: uncharacterized protein LOC100809865 isoform X2 [Glycine
            max]
          Length = 641

 Score =  376 bits (965), Expect = e-101
 Identities = 212/418 (50%), Positives = 266/418 (63%), Gaps = 1/418 (0%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            L MQQYFSVA+V +ALQQ AW +QQR  D +KV  K+ RKS   G G R   R E +KE 
Sbjct: 96   LMMQQYFSVADVAHALQQVAWRRQQRPLDPVKVGAKEFRKS---GSGYRHGQRFEPVKEG 152

Query: 181  HSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTNCHTDE 360
            ++S     S+   ++     T+ G   K    +E+S+   S    +K G         D+
Sbjct: 153  YNS-----SVESYNQYDANVTVTGGTEKGTPVVEKSEEHKSGGKVEKVG---------DK 198

Query: 361  SLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLLPTPKTF 540
             L S+E+  G D+                            ++ +N  + Q+L    KTF
Sbjct: 199  GLASAEDKKGDDS----------------------------HSVQNQHQSQSLSTKAKTF 230

Query: 541  MGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQG-QTYVVSRRP 717
            +GNE FDGK VNVV+GL LYE+L D+ EI+ LV L N+LR +G++GQLQG Q Y+VSRRP
Sbjct: 231  IGNEMFDGKMVNVVDGLKLYEDLFDSTEIANLVSLVNDLRVSGKKGQLQGSQAYIVSRRP 290

Query: 718  MKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTVKPDS 897
            MKG GRE+IQLG+PIADAPAE ENM G S+D  +E IP L +DIIER+V SQVMTVKPD 
Sbjct: 291  MKGHGREMIQLGVPIADAPAEGENMTGASKDMNVEPIPSLFQDIIERMVSSQVMTVKPDC 350

Query: 898  CIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXXXXXXXX 1077
            CI+DF+NEGDHSQPH  P W+GRPV ILFLTEC MTFGRVI  +HPGDY           
Sbjct: 351  CIVDFYNEGDHSQPHSWPSWYGRPVYILFLTECEMTFGRVIASEHPGDYRGGIKLSLVPG 410

Query: 1078 XXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASALPWGP 1251
               VM+GKS+DFAKHA+ S+RKQRILVTFTKSQP+KS  SD QR  L+ +A++  WGP
Sbjct: 411  SLLVMEGKSSDFAKHALPSVRKQRILVTFTKSQPRKSLSSDAQR--LASTATSSHWGP 466


>ref|XP_004142291.1| PREDICTED: uncharacterized protein LOC101210274 [Cucumis sativus]
            gi|449481289|ref|XP_004156139.1| PREDICTED:
            uncharacterized LOC101210274 [Cucumis sativus]
          Length = 684

 Score =  375 bits (964), Expect = e-101
 Identities = 211/424 (49%), Positives = 272/424 (64%), Gaps = 7/424 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSA-----FQGVGSRKWIRTE 165
            LHMQQYFSVAEV YALQQ    +QQR+ D +KV  K  R+        QG  +   ++ E
Sbjct: 96   LHMQQYFSVAEVMYALQQVTSRRQQRYMDPVKVGPKLYRRPGPGFKQQQGHRAEATVKEE 155

Query: 166  TIKENHSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDAT-- 339
            TI    S +    S  + S K  + +   +E+K   E E+   KDS  + D K       
Sbjct: 156  TITCAESCNGGNSSTFVSSRKVEQVSNTCDESKASGEDEKLSEKDSGSAVDNKDTHGKDQ 215

Query: 340  TNCHTDESLKSSENPGGTDTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNL 519
            +NC T    KS+EN       K       D      +G  ++ +     + ++ + KQ  
Sbjct: 216  SNCKT----KSAENLEDNAINK-------DSQVEPDDGCSSSHRDKELQSVQSQNGKQYA 264

Query: 520  LPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTY 699
              TP+TF+ +E FDGK VNV++GL L+EELLD+ E+SKL+ L N+LR++G+RGQ QGQTY
Sbjct: 265  ATTPRTFVASEMFDGKMVNVMDGLKLFEELLDDAEVSKLLSLVNDLRASGKRGQFQGQTY 324

Query: 700  VVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVM 879
            VVS+RPMKG GRE+IQLG PIADAP ED+N +G S+D ++E IP LL+D+I+RLV  QVM
Sbjct: 325  VVSKRPMKGHGREMIQLGFPIADAPHEDDNSLGLSKDRRIEPIPSLLQDLIDRLVGDQVM 384

Query: 880  TVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXX 1059
            TVKPDSCIIDF+NEGDHSQPH+ P WFGRPV +L LTEC +TFGRVIG DH G+Y     
Sbjct: 385  TVKPDSCIIDFYNEGDHSQPHVWPSWFGRPVGVLLLTECEITFGRVIGTDHSGNYRGAMK 444

Query: 1060 XXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASAL 1239
                     V+QGKSADFAKHA+ +IRKQRILVT TKSQPK++  +DGQR  L+V   + 
Sbjct: 445  LSLTPGNLLVVQGKSADFAKHALPAIRKQRILVTLTKSQPKRAAPADGQRTSLNVGTFS- 503

Query: 1240 PWGP 1251
             WGP
Sbjct: 504  GWGP 507


>ref|XP_007153188.1| hypothetical protein PHAVU_003G014200g [Phaseolus vulgaris]
            gi|561026542|gb|ESW25182.1| hypothetical protein
            PHAVU_003G014200g [Phaseolus vulgaris]
          Length = 691

 Score =  370 bits (949), Expect = e-100
 Identities = 219/437 (50%), Positives = 272/437 (62%), Gaps = 20/437 (4%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRK--SAFQGVGSRKWI------ 156
            L MQQYFSV+EV YALQQ AW +QQR  D  K   K+ RK  S F+    R         
Sbjct: 93   LLMQQYFSVSEVVYALQQVAWRRQQRFVDPAKAGSKEFRKFGSGFRQGQHRNEASKEGYN 152

Query: 157  --RTETIKENHSS-------DSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLP 309
              R E  KE ++S       +  A  ++ G EKG     K  E     ++   D      
Sbjct: 153  NSRNEAAKEGYNSKVESFGREMNAVVVTGGVEKGTRVIDKNGELNSGGKVGTMDNNSIAS 212

Query: 310  SEDKKGVDATTNCHTDESLKSSENPGGTDTEKSIFEAV--HDEGTSNVNGTCNTLQKSGF 483
             E+ K  D  TN   D  L  S N  G+    S  EAV  ++E TSN  G  +       
Sbjct: 213  PEESK--DTITNDQLDGILNGSGNFQGS-LSSSECEAVGENEECTSNSKGNDS------- 262

Query: 484  NTTENHDEKQNLLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRS 663
            ++ +N  + QN     KTF+GNE F+GK VNVV+GL LYE+L+D+ E+SKLV L N++R 
Sbjct: 263  HSVQNQHQSQNASTIGKTFIGNEMFEGKMVNVVDGLKLYEDLIDSAEVSKLVSLVNDMRV 322

Query: 664  AGQRGQLQG-QTYVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLL 840
            AG+RGQ QG QT+VVS+RP+KGRGRE+IQLG+PIADAP + +N+ G S+D K+E+IP L 
Sbjct: 323  AGKRGQFQGSQTFVVSKRPIKGRGREMIQLGVPIADAPPDVDNVTGLSKDKKVESIPSLF 382

Query: 841  KDIIERLVQSQVMTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVI 1020
            +DIIERL  SQVMTVKPD+CI+DFFNEGDHSQP+ CPPWFGRPV +LFLTEC++TFGR I
Sbjct: 383  EDIIERLAASQVMTVKPDACIVDFFNEGDHSQPNSCPPWFGRPVYMLFLTECDITFGRTI 442

Query: 1021 GIDHPGDYXXXXXXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSD 1200
              DHPGDY              VMQGKS D AKHA+ SI KQRILVTFTKSQPK S  +D
Sbjct: 443  VSDHPGDYRGAVKLSLVPGSLLVMQGKSTDLAKHALPSIHKQRILVTFTKSQPKTSLPND 502

Query: 1201 GQRLPLSVSASALPWGP 1251
             QRL  +V++    W P
Sbjct: 503  SQRLSPAVTSH---WAP 516


>ref|XP_006469304.1| PREDICTED: uncharacterized protein LOC102618872 [Citrus sinensis]
          Length = 627

 Score =  367 bits (941), Expect = 8e-99
 Identities = 213/424 (50%), Positives = 257/424 (60%), Gaps = 8/424 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKD-----SRKSAFQGVGSRKWIRTE 165
            LH+QQYFSV+EV  ALQQ AW KQQR FD     ++      +++SAF     +K     
Sbjct: 65   LHLQQYFSVSEVMLALQQVAWRKQQRSFDHHHHHQQQHHLNRTKRSAFV----KKDFHNN 120

Query: 166  TIKENHSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPSEDKKGVDATTN 345
                NH+ DS                                  +S   +DKK  D    
Sbjct: 121  NNNNNHAFDS----------------------------------NSSAFDDKK--DVVMK 144

Query: 346  CHTDESLKSSENPGGT---DTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQN 516
             H D S KS  N   T   D E    EA+ D       G    L+++   + ++ +EKQN
Sbjct: 145  AHDDGSAKSLGNSEITQVGDAEPKA-EALDD-------GCTPGLKENDSQSVQSQNEKQN 196

Query: 517  LLPTPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQT 696
                 K+F+G E  DGK VNVV+GL LYEE+  N E+SKLV L N+LR+AG+RGQ+QG  
Sbjct: 197  QSMAAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPA 256

Query: 697  YVVSRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQV 876
            YVVS+RP++G GRE+IQLGLPI D P EDE   G S D ++E IP LL+D+I+RLV  Q+
Sbjct: 257  YVVSKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQI 316

Query: 877  MTVKPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXX 1056
            MTVKPDSCI+D FNEGDHSQPH+ P WFGRPVCILFLTEC+MTFGR+IGIDHPGDY    
Sbjct: 317  MTVKPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTL 376

Query: 1057 XXXXXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASA 1236
                      VMQGKSAD AKHAISSIRKQRILVTFTKSQPKK T +DGQRL     A +
Sbjct: 377  RLSVAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPGIAPS 436

Query: 1237 LPWG 1248
              WG
Sbjct: 437  PHWG 440


>ref|XP_006448091.1| hypothetical protein CICLE_v10014588mg [Citrus clementina]
            gi|557550702|gb|ESR61331.1| hypothetical protein
            CICLE_v10014588mg [Citrus clementina]
          Length = 635

 Score =  366 bits (939), Expect = 1e-98
 Identities = 213/422 (50%), Positives = 252/422 (59%), Gaps = 5/422 (1%)
 Frame = +1

Query: 1    LHMQQYFSVAEVTYALQQAAWSKQQRHFDKMKVSEKDSRKSAFQGVGSRKWIRTETIKEN 180
            LH+QQYFSV+EV  ALQQ AW KQQR FD                              +
Sbjct: 65   LHLQQYFSVSEVMLALQQVAWRKQQRSFDH----------------------------HH 96

Query: 181  HSSDSCAQSLSLGSEKGGEQTIKGEEAKKKVEIERSDGKDSLPS--EDKKGVDATTNCHT 354
            H      Q   L   K      K            +   DS  S  +DKK  D     H 
Sbjct: 97   HHHHHHQQQHHLNRTKRSAFVKKDFHNNNNNNNNNNHAFDSNSSAFDDKK--DVVMKAHD 154

Query: 355  DESLKSSENPGGT---DTEKSIFEAVHDEGTSNVNGTCNTLQKSGFNTTENHDEKQNLLP 525
            D S KS  N   T   D E    EA+ D       G   +L+++   + ++ +EKQN   
Sbjct: 155  DGSAKSLGNSEITQVGDAEPKA-EALDD-------GCTPSLKENDSQSVQSQNEKQNQSM 206

Query: 526  TPKTFMGNETFDGKAVNVVEGLILYEELLDNLEISKLVQLSNELRSAGQRGQLQGQTYVV 705
              K+F+G E  DGK VNVV+GL LYEE+  N E+SKLV L N+LR+AG+RGQ+QG  YVV
Sbjct: 207  AAKSFVGTEMVDGKMVNVVDGLKLYEEVSGNSEVSKLVSLVNDLRTAGKRGQIQGPAYVV 266

Query: 706  SRRPMKGRGREIIQLGLPIADAPAEDENMVGNSEDGKMEAIPVLLKDIIERLVQSQVMTV 885
            S+RP++G GRE+IQLGLPI D P EDE   G S D ++E IP LL+D+I+RLV  Q+MTV
Sbjct: 267  SKRPIRGHGREVIQLGLPIVDGPPEDEIAAGTSRDRRIEPIPSLLQDVIDRLVGLQIMTV 326

Query: 886  KPDSCIIDFFNEGDHSQPHMCPPWFGRPVCILFLTECNMTFGRVIGIDHPGDYXXXXXXX 1065
            KPDSCI+D FNEGDHSQPH+ P WFGRPVCILFLTEC+MTFGR+IGIDHPGDY       
Sbjct: 327  KPDSCIVDVFNEGDHSQPHISPSWFGRPVCILFLTECDMTFGRMIGIDHPGDYRGTLRLS 386

Query: 1066 XXXXXXXVMQGKSADFAKHAISSIRKQRILVTFTKSQPKKSTVSDGQRLPLSVSASALPW 1245
                   VMQGKSAD AKHAISSIRKQRILVTFTKSQPKK T +DGQRL     A +  W
Sbjct: 387  VAPGSLLVMQGKSADIAKHAISSIRKQRILVTFTKSQPKKLTPTDGQRLASPGIAPSPHW 446

Query: 1246 GP 1251
            GP
Sbjct: 447  GP 448

