BLASTX nr result

ID: Rehmannia23_contig00000623

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00000623
         (1680 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002527961.1| hypothetical protein RCOM_0204720 [Ricinus c...   231   5e-58
ref|XP_002330809.1| predicted protein [Populus trichocarpa] gi|5...   226   2e-56
gb|EMJ24315.1| hypothetical protein PRUPE_ppa003768mg [Prunus pe...   214   9e-53
emb|CBI20768.3| unnamed protein product [Vitis vinifera]              212   4e-52
ref|XP_004291231.1| PREDICTED: uncharacterized protein LOC101293...   209   2e-51
ref|XP_006450162.1| hypothetical protein CICLE_v10007752mg [Citr...   202   5e-49
ref|XP_006450160.1| hypothetical protein CICLE_v10007752mg [Citr...   202   5e-49
ref|XP_006450159.1| hypothetical protein CICLE_v10007752mg [Citr...   202   5e-49
ref|XP_006450157.1| hypothetical protein CICLE_v10007752mg [Citr...   202   5e-49
ref|XP_006450156.1| hypothetical protein CICLE_v10007752mg [Citr...   202   5e-49
ref|XP_004136451.1| PREDICTED: uncharacterized protein LOC101212...   202   5e-49
ref|XP_006450161.1| hypothetical protein CICLE_v10007752mg [Citr...   201   6e-49
ref|XP_006483572.1| PREDICTED: flocculation protein FLO11-like i...   201   1e-48
ref|XP_006483571.1| PREDICTED: flocculation protein FLO11-like i...   200   1e-48
ref|XP_006382167.1| hypothetical protein POPTR_0006s29010g [Popu...   199   4e-48
ref|XP_006483573.1| PREDICTED: flocculation protein FLO11-like i...   196   3e-47
ref|XP_006578200.1| PREDICTED: dentin sialophosphoprotein-like i...   194   1e-46
ref|XP_003523717.1| PREDICTED: dentin sialophosphoprotein-like i...   194   1e-46
ref|XP_006578198.1| PREDICTED: dentin sialophosphoprotein-like i...   193   2e-46
gb|EOY29204.1| Uncharacterized protein isoform 1 [Theobroma cacao]    192   5e-46

>ref|XP_002527961.1| hypothetical protein RCOM_0204720 [Ricinus communis]
            gi|223532587|gb|EEF34373.1| hypothetical protein
            RCOM_0204720 [Ricinus communis]
          Length = 561

 Score =  231 bits (590), Expect = 5e-58
 Identities = 172/524 (32%), Positives = 257/524 (49%), Gaps = 21/524 (4%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + +++   R +A+GAGK+L S+F                     +  E     D++ 
Sbjct: 49   NWLSRLILSPTRMIATGAGKVL-SVFRNDSSSSSSSSSSGGDFSSESDTDEA--EDDDIS 105

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVSRE 449
            ++      +   H    Q   W+SETKR IEQL+MQE+FSREECD+L  +L SRV  S  
Sbjct: 106  SQDANKLEKNSRHAIIPQAKEWKSETKRAIEQLLMQETFSREECDRLTYILKSRVVDSPV 165

Query: 450  KNMLAGS----PGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNLNST 617
               + G     P   + +  +   + + A+ EAKKW ++KK+GS+S +EL +GTC LN++
Sbjct: 166  TRCIDGRLTEIPDTTIGSDPDLPALCSTAITEAKKWLEEKKLGSNSKSELEYGTCTLNTS 225

Query: 618  GLGHVETGG-SSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVGQDS 791
             L HV  G   SPVD+A+SYM+ RPPWASP+ RN++  +PS   ++LFKE T YS G++S
Sbjct: 226  MLPHVTEGDVGSPVDLAKSYMRARPPWASPSMRNIQSLSPSPVGIQLFKEETPYSFGRNS 285

Query: 792  LSSSKK-RNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRVESIGAG 965
            L  SK  R+S  +GSWNIQEE+R+VRSKATEDMLR  PS+ ID S  A + I+       
Sbjct: 286  LPISKLIRDSSATGSWNIQEEIRKVRSKATEDMLRVRPSSVIDWSTLA-SDIKQSPRSLV 344

Query: 966  EEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDPETVPADGECAASEFAHP 1145
                    ++ +     E+ P D        S        +DP  +         +    
Sbjct: 345  AYKAEFVSQVAQDGLQNESSPPDPATSVPEQS--------QDPRAIKITDSAKGLQDGSE 396

Query: 1146 SRSDHAEEHHTDPCL---SIINGVPATEVTENGGKQ-----IVNXXXXXXXXXXXXXXXX 1301
             R+ H ++      +   S      A  + +  G+Q     IV                 
Sbjct: 397  GRATHGQKRQPSEDVKAESQSGSAAADALKDADGEQQRLDSIVGIQGSQAIRLYGGQERE 456

Query: 1302 LNKEQHDEENQNADTVKEKKPANIINVEGNCELLSEAYMEVPVVTETDSIASGSQNSMGM 1481
               +  +E+  + D+  +K   N   V+  CELLSE+Y+EVP+V E+    +GSQNS  M
Sbjct: 457  QKSKASEEQQISVDSGHDKMTRN-TPVDETCELLSESYIEVPIVNESHIGGTGSQNSSSM 515

Query: 1482 QYDELSLDMAQPSPDEKVN-----IVAGKQQGRKTAKPKRQRKR 1598
             ++ LS  ++  S   K        V  KQQ  K    ++ + R
Sbjct: 516  HHEGLSQGVSPRSLKRKAGKSNDVTVTEKQQVGKQRYVRKGKGR 559


>ref|XP_002330809.1| predicted protein [Populus trichocarpa]
            gi|566178712|ref|XP_006382166.1| hypothetical protein
            POPTR_0006s29010g [Populus trichocarpa]
            gi|550337321|gb|ERP59963.1| hypothetical protein
            POPTR_0006s29010g [Populus trichocarpa]
          Length = 571

 Score =  226 bits (576), Expect = 2e-56
 Identities = 187/539 (34%), Positives = 269/539 (49%), Gaps = 36/539 (6%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPY------ 251
            NW +  ++  +R LA+GAGK+  ++F                     +            
Sbjct: 50   NWLSRFILSPSRILATGAGKVFSTVFGSESSASSSSSSDVDEEEEGDSGSTSEGEMEDVN 109

Query: 252  --NGDNMLNETNVTSSEIMHHGQKSQLSV-WRSETKRTIEQLIMQESFSREECDKLIKVL 422
              NG +  +E    ++EI+++ +K   +V W++ T R I QL+MQE+FSREECD+L  ++
Sbjct: 110  DGNGSSQSDEKENQTTEIVNYSKKDLPAVEWKTATLRVIAQLLMQETFSREECDRLTHII 169

Query: 423  NSRVTVSREKNMLA-GSPGKAVD----NGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTEL 587
             SRV  S        G P K +D    N  +  DI N AV EAKKWF+ KK+GS+S + +
Sbjct: 170  KSRVVDSPITGSTKDGRPSKTLDKTVGNDVDTPDICNTAVTEAKKWFEGKKLGSNSKS-V 228

Query: 588  AHGTCNLNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKE 761
             +GTC LN+    H   G   SPVD+A+SYM++RPPWASP+ N ++L++P +   +LF E
Sbjct: 229  EYGTCILNTAP--HATEGEMGSPVDLAKSYMRERPPWASPSTNHIQLQSPPSMGKELFVE 286

Query: 762  GTLYSVGQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVA 935
             T +SV   SLS SK  R+ LV+GSWNIQEELR+VRS+ATE+MLRT PS+K+D S  A A
Sbjct: 287  ATPFSVSGKSLSQSKLNRDFLVTGSWNIQEELRKVRSRATEEMLRTRPSSKMDWSALASA 346

Query: 936  PIRVESIGAGEEVLGMGERMTETVSLRE-TKPVDALVDAG-VSSDPALTFFLKDPETVPA 1109
                     G  VLG GE       L   T+ +D  +  G  +++  LT      +T  A
Sbjct: 347  ------YKGGPSVLGAGEFSGAKNKLSNFTQLIDVPLKWGSAANNSGLT------DTQMA 394

Query: 1110 DGECAASEFAHPSRSDHAEEHHTDPCLSIINGVPAT-----EVTENGGKQIVNXXXXXXX 1274
                   +F+  + +   E+           G+ A+     EV        VN       
Sbjct: 395  QVRLQKDDFSPNAATSVPEKSQGLGLTPTTEGMAASKEVAGEVAGRDDSVTVNGFPSSAS 454

Query: 1275 XXXXXXXXXLNK----EQHDEENQNADTVKEKKPANIINVEGNCELLSEAYMEVPVVTET 1442
                            E+H+    + D +    PA     E  C+LLSEA MEVP V E 
Sbjct: 455  SLPEAQEREQKSMPCGEEHNPVGPDHDKMTRTAPA-----EETCKLLSEASMEVPNVNEN 509

Query: 1443 DSIASGSQNSMGMQYDELSLD---MAQPSP----DEKVNIVAGKQQGRKTAKPKRQRKR 1598
            DS+A+ SQ+S  M + E SL    +AQP+P      +   V+ KQQGR  +    +R R
Sbjct: 510  DSVATDSQDSSSM-HQEGSLQAQALAQPNPKRGLGSRTTGVSEKQQGRIVSSRYNKRGR 567


>gb|EMJ24315.1| hypothetical protein PRUPE_ppa003768mg [Prunus persica]
          Length = 550

 Score =  214 bits (545), Expect = 9e-53
 Identities = 166/521 (31%), Positives = 262/521 (50%), Gaps = 21/521 (4%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NWF+ ++    R +ASGAGKI+ S+F                     +  +I    D+ L
Sbjct: 46   NWFSRLIYSPTRMIASGAGKIISSVFSPDSSSSSSSEDGTDDEDVDDD--DISTQEDDGL 103

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRV----- 434
            N+ N TS ++    ++   ++ +S+ K  IEQL+MQE+FSREECD+LIK++ SRV     
Sbjct: 104  NKRNGTSGKLSFFRKEPPATLGKSDNKHVIEQLLMQETFSREECDRLIKIIKSRVVGFTT 163

Query: 435  TVSREKNMLAGSPGKAV--DNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNL 608
                E    +  P K V  ++  +  D    AV EAKKW +++++GSSS ++  HGTC L
Sbjct: 164  AEDAENTRPSEIPNKTVGSESDVDTPDFCGTAVTEAKKWLKERRLGSSSKSDSDHGTCTL 223

Query: 609  NSTGLGH-VETGGSSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVG 782
            NS       E  G SPVDVA+ YM+ RPPWASP+ ++ ELR+PS+T M+LF E T YS+G
Sbjct: 224  NSLMFPQGAEDEGGSPVDVAKLYMRARPPWASPSIKHGELRSPSSTGMQLFNEETPYSIG 283

Query: 783  QDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRVES- 953
             +S+S+ K KR+S  +GSWNIQ+E+RRVRSKATE++LR+ PS +ID S   +        
Sbjct: 284  GNSVSTLKLKRDSRATGSWNIQDEIRRVRSKATEELLRSLPSTRIDWSASTLGNRSTSGY 343

Query: 954  IGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDPETVPADGECAASE 1133
            +  G++ + MG+++  + +              + S+       K+   +PA      S 
Sbjct: 344  LVDGKQEVEMGDKIHNSKN-------------SIVSEKTQYELQKEALPLPA----IISS 386

Query: 1134 FAHPSRSDHAEEHHTD------PCLSIINGVPATEVTENGGKQIVNXXXXXXXXXXXXXX 1295
              +   S+  E+ +TD         S ++ +  +   E  G +                 
Sbjct: 387  EQNQKNSNWTEQRNTDIGGTSEVGDSKLHDITCSTTGEVTGSRSAYTTNGFPSSVASLSA 446

Query: 1296 XXLNKEQHDEEN--QNADTVKEKKPANIINVEGNC-ELLSEAYMEVPVVTETDSIASGSQ 1466
              L  E++   N   N  T   +K A  + VE    E  + A +EV    E D    G++
Sbjct: 447  PDLGIEENPILNGETNPVTSSHEKVAVDLTVEEEAHEFFNNATVEVANKNEND--VDGTK 504

Query: 1467 NSMGMQYDELSLDMAQPSPDEKVNIVAGKQQGRKTAKPKRQ 1589
             + G+   E S++     P+ K   V  KQ+G++ ++  R+
Sbjct: 505  ENDGVPLSEASIE-ELTQPNSKSTPVVEKQKGKRLSRYNRR 544


>emb|CBI20768.3| unnamed protein product [Vitis vinifera]
          Length = 546

 Score =  212 bits (539), Expect = 4e-52
 Identities = 142/348 (40%), Positives = 200/348 (57%), Gaps = 17/348 (4%)
 Frame = +3

Query: 108  VVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNM------- 266
            +V + R +ASGAGK++ S+F                         +  + ++M       
Sbjct: 48   LVSTTRMIASGAGKLISSVFGSDSSSSSSSSSSASSGGESSAEDNVDDDNNDMDTSSHRA 107

Query: 267  --LNETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVT- 437
              L +T   +  I    ++ Q S  +SETK  IEQL+MQE+FSREECD+LI+++ SR   
Sbjct: 108  DKLTKTEAATEIIKSFRKEPQPSTGKSETKCLIEQLLMQETFSREECDRLIEIIRSRAIG 167

Query: 438  -VSREKNM---LAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
              + E  +   L+  P + VD+     D+   AV EAKKW ++KK+ SS  + + H T  
Sbjct: 168  CPTAEDGLYGRLSEHPDRIVDSDAPMPDL-RTAVMEAKKWLEEKKLASSLKSGVHHETST 226

Query: 606  LNSTGLGHVETG-GSSPVDVARSYMKDRPPWASPTRNVELRTPSTTTMKLFKEGTLYSVG 782
            LNS  L HV  G   SPVD+A+SYM+ RPPWASP+ + EL+TPS T M LFKE T YS+G
Sbjct: 227  LNSVMLPHVNEGEAGSPVDMAKSYMRTRPPWASPSMSNELKTPSPTGMHLFKEETPYSLG 286

Query: 783  QDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDML-RTPSAKIDSSLFAVAPIRVESI 956
             +SLSSSK KR++  SGSWNIQEE+RRVR+KATEDML  +PS KID S F     +  S+
Sbjct: 287  HNSLSSSKLKRDAFASGSWNIQEEIRRVRAKATEDMLGSSPSMKIDLSEFGHKASQ-NSL 345

Query: 957  GAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDPET 1100
             A    +G+ ++M  + SL   K ++A   + ++S PA    L   +T
Sbjct: 346  VADRTGVGLRDKMHYSNSLTALKSINA--SSNLASGPATCLGLAVSDT 391


>ref|XP_004291231.1| PREDICTED: uncharacterized protein LOC101293162 [Fragaria vesca
            subsp. vesca]
          Length = 552

 Score =  209 bits (533), Expect = 2e-51
 Identities = 174/529 (32%), Positives = 253/529 (47%), Gaps = 28/529 (5%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW    +    R + SGAGK+L S+F                     ++  +    D+ L
Sbjct: 50   NWLARFLYTPTRSIVSGAGKVLSSVFRSDSSSSSSSEIGSDDEEGDDDY--VSSQEDDGL 107

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVSRE 449
            N+ N TS  +   G          E+K  IEQL+ QE+FSREECDKLIK++ SRV     
Sbjct: 108  NQRNGTSEPLFRQG----------ESKHAIEQLLRQETFSREECDKLIKIIKSRVV---- 153

Query: 450  KNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNLNSTGLGH 629
                +    +      +  D+   A+ EAKKW  +K++GS+S +EL HGT  L   G   
Sbjct: 154  -GCTSTEDAQNTRLNMDATDLSATAISEAKKWVMEKRLGSASKSELGHGTI-LFPQG--- 208

Query: 630  VETGGSSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVGQDSLSSSK 806
             E  G SPVDVA+SYM+  PPWASP+ ++ ELR+ S   ++LF E T YS+G  S++S  
Sbjct: 209  AEDDGGSPVDVAKSYMRALPPWASPSSQHGELRSTSPLGLQLFNEETPYSLGGTSVTSKL 268

Query: 807  KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFA-VAPIRVESIGAGEEVLG 980
            KR++  +GSWNIQEE+RRVR+KATE+MLR+ PS KID S F+      + S+  G++   
Sbjct: 269  KRDAPATGSWNIQEEIRRVRTKATEEMLRSLPSTKIDWSAFSRENRSTLNSLQDGKQEAD 328

Query: 981  MGERMTETVSLRETKPVDALVDAGVSSDPAL---TFFLKDPETVPADG---ECAASEFAH 1142
            +G+++   ++            AG+++DP L   T    D      DG   E   S    
Sbjct: 329  LGDKLKNPITA-----------AGLTTDPPLGISTTHSFDVTEKTQDGLQKEALTSGTEQ 377

Query: 1143 PSR-----------SDHAEEHHTDPCLSIINGVPATEVTE-----NGG-KQIVNXXXXXX 1271
            P              D       D      NG P +E  E     NG   Q+ +      
Sbjct: 378  PDAIIEGTIQYSKVHDQTCSTEKDATAHTTNGFPYSEPREETTVLNGEINQVGSSHGKMV 437

Query: 1272 XXXXXXXXXXLNKE--QHDEENQNADTVKEKKPANIINVEGNCELLSEAYMEVPVVTETD 1445
                      L+K   +++E + +  T  E  P N  +VE +   ++    ++      D
Sbjct: 438  TTLLVEETFELDKVILENNEMDIDGMTGPESMPVNAASVEASKVNVN----DIDASKGND 493

Query: 1446 SIASGSQNSMGMQYDELSLDMAQPSPDEKVNIVAGKQQGRKTAKPKRQR 1592
            S ASGSQNS  M  DELS ++ Q  P + +NI   K++G    KPK +R
Sbjct: 494  SAASGSQNSSSMP-DELSQELTQSQP-KSINIAVAKEEG-VVQKPKAKR 539


>ref|XP_006450162.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|557553388|gb|ESR63402.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 612

 Score =  202 bits (513), Expect = 5e-49
 Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 380


>ref|XP_006450160.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|557553386|gb|ESR63400.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 624

 Score =  202 bits (513), Expect = 5e-49
 Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 380


>ref|XP_006450159.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|557553385|gb|ESR63399.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 450

 Score =  202 bits (513), Expect = 5e-49
 Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 380


>ref|XP_006450157.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|557553383|gb|ESR63397.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 467

 Score =  202 bits (513), Expect = 5e-49
 Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 380


>ref|XP_006450156.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|567916304|ref|XP_006450158.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
            gi|557553382|gb|ESR63396.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
            gi|557553384|gb|ESR63398.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 410

 Score =  202 bits (513), Expect = 5e-49
 Identities = 138/343 (40%), Positives = 192/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 380


>ref|XP_004136451.1| PREDICTED: uncharacterized protein LOC101212538 [Cucumis sativus]
            gi|449522948|ref|XP_004168487.1| PREDICTED:
            uncharacterized LOC101212538 [Cucumis sativus]
          Length = 581

 Score =  202 bits (513), Expect = 5e-49
 Identities = 175/554 (31%), Positives = 263/554 (47%), Gaps = 51/554 (9%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            +W +  +    R +ASGAGK+L S+F                            + D++ 
Sbjct: 47   SWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSESDSEDD------------DEDDVP 94

Query: 270  NETNVTSSEI--MHHGQKSQLSVWRSE-------TKRTIEQLIMQESFSREECDKLIKVL 422
            +E +V         +G    +S++R +       +K  IEQL+MQE+FSR ECDKL++++
Sbjct: 95   DERHVFQGAEGGKKNGTSEMVSLFRKDFPPEKKDSKHLIEQLLMQETFSRAECDKLVQII 154

Query: 423  NSRVTVSREKNMLAGSPGKAVDNGTEDVD-----IYNKAVQEAKKWFQQKKVG--SSSVT 581
             SRV   +     A      + N T D D     + + A+ EAKKW  +K++G  S+S  
Sbjct: 155  ESRVVECQTFEGQAAGRLTEISNRTVDSDDGRPAVCSSAILEAKKWLNEKRLGLVSTSTL 214

Query: 582  ELAHGTCNLNSTGLGHVETGG-SSPVDVARSYMKDRPPWASP-TRNVELRTPSTTTMKLF 755
            +L  G C LNST L  V      SPVDVA+SYM+ RPPWASP T N E ++PS   ++LF
Sbjct: 215  KLDDGPCTLNSTMLPMVNNEEMGSPVDVAKSYMQARPPWASPSTNNFEFKSPSPLGLQLF 274

Query: 756  KEGTLYSVGQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRTPSAKIDSSLFAV 932
            KE T YS+  + LSSS+ KR S  SGSWNIQEELRRVRSKATE+MLR+PS+K+D S  A 
Sbjct: 275  KEETSYSISGNPLSSSRIKRESPTSGSWNIQEELRRVRSKATEEMLRSPSSKLDWSSLAS 334

Query: 933  A---PIRVESIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDPETV 1103
                   + S       +  G+++   V     KP+D  ++   S+   +T  L + +T 
Sbjct: 335  GSDYKTNLSSTHFNHLKIPSGDKIQHAV-----KPIDKSMN--WSAVNTVTHNLTESKTA 387

Query: 1104 PADGECAASEFAHPS-------RSDHAEEHHTDPCLSIINGVPATEVTENG----GKQIV 1250
                E  A +    S        +D  +     P ++ +   P T++  +      ++  
Sbjct: 388  EDVSENEACQLGTTSIVLQQDKVTDFQKGFAGPPAVNDLETNPTTQMKVSNSSLDARECS 447

Query: 1251 NXXXXXXXXXXXXXXXXLNKEQHDEENQNADTVKEKKPANIIN------VEGNCELLSEA 1412
                              ++E   E+N   + V+E   +   +      VE  CELLSE 
Sbjct: 448  TPHKDAGLANGFPPLPSSSRELGVEQNHFNNIVEESNSSGHDHKGKDPPVEERCELLSEV 507

Query: 1413 YMEVP-VVTETDSIAS-GSQNSMGMQYDELSLDMAQPS----------PDEKVNIVAGKQ 1556
             MEVP + T+TD + S G+  S  +  D  S  +++ +          P     + AGK 
Sbjct: 508  SMEVPDIETDTDKVVSDGNDASKVVSEDNSSCQISKENGGGNVKSVEKPSSASGVAAGK- 566

Query: 1557 QGRKTAKPKRQRKR 1598
             G  TA  +R R+R
Sbjct: 567  TGSGTAYLRRGRRR 580


>ref|XP_006450161.1| hypothetical protein CICLE_v10007752mg [Citrus clementina]
            gi|557553387|gb|ESR63401.1| hypothetical protein
            CICLE_v10007752mg [Citrus clementina]
          Length = 611

 Score =  201 bits (512), Expect = 6e-49
 Identities = 138/343 (40%), Positives = 191/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                        +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSDSE--------DIDDEDENDA 101

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 102  TDTMKKKGTLDIIEHVRSPHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 161

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    A+ EAKKW ++KK GSS  +EL  GTC 
Sbjct: 162  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAIMEAKKWLEEKKSGSSPNSELELGTCA 221

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 222  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 281

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 282  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 341

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +     TKPV A V+       +  F
Sbjct: 342  SLVASEALTSLRDKVHSS-----TKPVAASVNVATGLSTSYGF 379


>ref|XP_006483572.1| PREDICTED: flocculation protein FLO11-like isoform X2 [Citrus
            sinensis]
          Length = 623

 Score =  201 bits (510), Expect = 1e-48
 Identities = 138/342 (40%), Positives = 191/342 (55%), Gaps = 12/342 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET--NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVS 443
             +T    T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  S
Sbjct: 103  TDTMKKGTLDIIEHVRSAHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVDS 162

Query: 444  -----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNL 608
                  E   L+    + + +  +  D    AV EAKKW ++KK GSS  +EL  GTC L
Sbjct: 163  PVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAVMEAKKWLEEKKSGSSPNSELELGTCAL 222

Query: 609  NSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSVG 782
            NS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS G
Sbjct: 223  NSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYSTG 282

Query: 783  QDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-ES 953
              S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  S
Sbjct: 283  YTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSNS 342

Query: 954  IGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            + A E +  + +++  +      KPV A V+       +  F
Sbjct: 343  LVASEALTSLRDKVHSS-----AKPVAASVNVATGLSTSYGF 379


>ref|XP_006483571.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Citrus
            sinensis]
          Length = 624

 Score =  200 bits (509), Expect = 1e-48
 Identities = 138/343 (40%), Positives = 191/343 (55%), Gaps = 13/343 (3%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSAHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  S-----REKNMLAGSPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCN 605
            S      E   L+    + + +  +  D    AV EAKKW ++KK GSS  +EL  GTC 
Sbjct: 163  SPVIRDTEDWRLSEPRNRTIGSDVDIPDYRCTAVMEAKKWLEEKKSGSSPNSELELGTCA 222

Query: 606  LNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLYSV 779
            LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T YS 
Sbjct: 223  LNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPYST 282

Query: 780  GQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV-E 950
            G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +  
Sbjct: 283  GYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSMSN 342

Query: 951  SIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
            S+ A E +  + +++  +      KPV A V+       +  F
Sbjct: 343  SLVASEALTSLRDKVHSS-----AKPVAASVNVATGLSTSYGF 380


>ref|XP_006382167.1| hypothetical protein POPTR_0006s29010g [Populus trichocarpa]
           gi|550337322|gb|ERP59964.1| hypothetical protein
           POPTR_0006s29010g [Populus trichocarpa]
          Length = 504

 Score =  199 bits (505), Expect = 4e-48
 Identities = 131/318 (41%), Positives = 184/318 (57%), Gaps = 18/318 (5%)
 Frame = +3

Query: 90  NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPY------ 251
           NW +  ++  +R LA+GAGK+  ++F                     +            
Sbjct: 50  NWLSRFILSPSRILATGAGKVFSTVFGSESSASSSSSSDVDEEEEGDSGSTSEGEMEDVN 109

Query: 252 --NGDNMLNETNVTSSEIMHHGQKSQLSV-WRSETKRTIEQLIMQESFSREECDKLIKVL 422
             NG +  +E    ++EI+++ +K   +V W++ T R I QL+MQE+FSREECD+L  ++
Sbjct: 110 DGNGSSQSDEKENQTTEIVNYSKKDLPAVEWKTATLRVIAQLLMQETFSREECDRLTHII 169

Query: 423 NSRVTVSREKNMLA-GSPGKAVD----NGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTEL 587
            SRV  S        G P K +D    N  +  DI N AV EAKKWF+ KK+GS+S + +
Sbjct: 170 KSRVVDSPITGSTKDGRPSKTLDKTVGNDVDTPDICNTAVTEAKKWFEGKKLGSNSKS-V 228

Query: 588 AHGTCNLNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKE 761
            +GTC LN+    H   G   SPVD+A+SYM++RPPWASP+ N ++L++P +   +LF E
Sbjct: 229 EYGTCILNTAP--HATEGEMGSPVDLAKSYMRERPPWASPSTNHIQLQSPPSMGKELFVE 286

Query: 762 GTLYSVGQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVA 935
            T +SV   SLS SK  R+ LV+GSWNIQEELR+VRS+ATE+MLRT PS+K+D S  A A
Sbjct: 287 ATPFSVSGKSLSQSKLNRDFLVTGSWNIQEELRKVRSRATEEMLRTRPSSKMDWSALASA 346

Query: 936 PIRVESIGAGEEVLGMGE 989
                    G  VLG GE
Sbjct: 347 ------YKGGPSVLGAGE 358


>ref|XP_006483573.1| PREDICTED: flocculation protein FLO11-like isoform X3 [Citrus
            sinensis]
          Length = 614

 Score =  196 bits (497), Expect = 3e-47
 Identities = 139/345 (40%), Positives = 188/345 (54%), Gaps = 15/345 (4%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW + ++    R LA+GAGK+L S+F                     +  +I    +N  
Sbjct: 50   NWLSRLIYSPTRMLATGAGKLLSSVFTNDDSSSSSSSDSD-------SEEDIDDEDENDA 102

Query: 270  NET---NVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTV 440
             +T     T   I H     Q +V +SETKR IEQL++Q +FSREEC++L  ++ SRV  
Sbjct: 103  TDTMKKKGTLDIIEHVRSAHQPTVGKSETKRLIEQLLVQVTFSREECNRLTGIIKSRVVD 162

Query: 441  SREKNMLAGSPGKAVDNGTEDVDI-------YNKAVQEAKKWFQQKKVGSSSVTELAHGT 599
            S             V   TED  +          AV EAKKW ++KK GSS  +EL  GT
Sbjct: 163  S------------PVIRDTEDWRLSEPRNRTIGSAVMEAKKWLEEKKSGSSPNSELELGT 210

Query: 600  CNLNSTGLGHVETGG-SSPVDVARSYMKDRPPWASPTRN-VELRTPSTTTMKLFKEGTLY 773
            C LNS    HV  G   SPVD+A+SYM+ RPPWASP+ N +E  +PS T ++LFKE T Y
Sbjct: 211  CALNSAMSPHVNEGELGSPVDMAKSYMQTRPPWASPSANHIECGSPSPTGIQLFKEETPY 270

Query: 774  SVGQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFAVAPIRV 947
            S G  S +SSK K++S  SGSWNI EE+R+VRSKATE+MLRT PS+KID S FA+    +
Sbjct: 271  STGYTSFTSSKMKKDSPASGSWNILEEIRKVRSKATEEMLRTPPSSKIDWSSFALENKSM 330

Query: 948  -ESIGAGEEVLGMGERMTETVSLRETKPVDALVDAGVSSDPALTF 1079
              S+ A E +  + +++  +      KPV A V+       +  F
Sbjct: 331  SNSLVASEALTSLRDKVHSS-----AKPVAASVNVATGLSTSYGF 370


>ref|XP_006578200.1| PREDICTED: dentin sialophosphoprotein-like isoform X4 [Glycine max]
          Length = 604

 Score =  194 bits (493), Expect = 1e-46
 Identities = 176/569 (30%), Positives = 261/569 (45%), Gaps = 66/569 (11%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW +  V+  +RF+ASGAGKI  S+                      +         +  
Sbjct: 41   NWLSRFVISPSRFIASGAGKIFSSVLDLDNSPSDSSSATCSLSSSANDSDAEEVGTFDDE 100

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVSRE 449
            N+        +  G   Q  V  S+ K  IEQL+M+ESFSREECD+LIK++ SRV     
Sbjct: 101  NDNPSEGDVALSKGL--QPFVRNSKNKHMIEQLLMKESFSREECDRLIKIIRSRVVDPAN 158

Query: 450  KNMLAGSP----GKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNLNST 617
             +     P     K + + T+  ++++ A+ EAKKW Q+KK    + T++ +G+ +LN  
Sbjct: 159  DDDGDKRPTDMSNKILGSDTDSPELHDVAIMEAKKWLQEKKSALDTNTDIGYGSLSLNLV 218

Query: 618  GLGHVETGGSSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVGQDSL 794
             L        SPVDVA+SYM  RPPWASP+  + + +TPS   ++LFKE T Y  G +S+
Sbjct: 219  ALPQDPKDEGSPVDVAKSYMCTRPPWASPSIDHTKPQTPS--GIQLFKEETPYLFGNNSM 276

Query: 795  SSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFA------VAPIRVE 950
             SSK KR+S  +GSW+IQ+E+RRVRS+ATE++LR+ PS+KID S FA      V    +E
Sbjct: 277  PSSKLKRDSAATGSWSIQDEIRRVRSRATEELLRSLPSSKIDWSAFAMENKNNVNSSAIE 336

Query: 951  SIGA--GEEV----------LGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDP 1094
            +IGA  GE V          + +   +   VS      +D      V S+P  T F ++ 
Sbjct: 337  NIGASLGERVHNSTNLVDASVNLARGLGSQVSPDLESKLDEFQPESVLSNPVNTNFEQNQ 396

Query: 1095 ETVPAD--GECAASEFAHPS-RSDHAEEHHTDPCLSIINGVPATE--------------- 1220
             +V      E  + E      R   +++ H D  L  +NG+  T                
Sbjct: 397  GSVAVQQTREDGSREITTSGLRDGSSDDMHRDGSLVKVNGISDTNGSGHQLDSVEETRDA 456

Query: 1221 ------------VTENGGKQ--IVNXXXXXXXXXXXXXXXXLNKEQHDEENQNADTVKEK 1358
                        + E  G +  + N                 N +  D +    D+ +E+
Sbjct: 457  INSRLQDSNHLVIKEKVGAEDALANGFPSSGPSFNAGQVIEQNTKTLDNKPNTTDSSQER 516

Query: 1359 KPANIINVEGNCELLSEAYMEVPVVTETDS----IASGSQNSMGMQYDELSLDMAQPS-- 1520
                ++  E  C+ L E+  EVP V   DS    +ASGSQNS  M   E+  D +QP   
Sbjct: 517  TAQGVLEQE-ECQTLRES-TEVPDVIGDDSVADRVASGSQNSSSMY--EVQHDTSQPGVE 572

Query: 1521 ---PDEKVNIVAGKQQGRKTAKPKRQRKR 1598
               P    +I   KQ+GR+      +R R
Sbjct: 573  LGLPATPTSI--AKQKGRRITTRYNRRGR 599


>ref|XP_003523717.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 603

 Score =  194 bits (492), Expect = 1e-46
 Identities = 175/571 (30%), Positives = 262/571 (45%), Gaps = 68/571 (11%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW +  V+  +RF+ASGAGKI  S+                      N  +    G    
Sbjct: 41   NWLSRFVISPSRFIASGAGKIFSSVLDLDNSPSDSSSATCSLSSSA-NDSDAEEVGTFDD 99

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVSRE 449
               N +  ++      S+  V  S+ K  IEQL+M+ESFSREECD+LIK++ SRV     
Sbjct: 100  ENDNPSEGDVA----LSKPFVRNSKNKHMIEQLLMKESFSREECDRLIKIIRSRVVDPAN 155

Query: 450  KNMLAGSP----GKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNLNST 617
             +     P     K + + T+  ++++ A+ EAKKW Q+KK    + T++ +G+ +LN  
Sbjct: 156  DDDGDKRPTDMSNKILGSDTDSPELHDVAIMEAKKWLQEKKSALDTNTDIGYGSLSLNLV 215

Query: 618  GLGHVETGGSSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVGQDSL 794
             L        SPVDVA+SYM  RPPWASP+  + + +TPS   ++LFKE T Y  G +S+
Sbjct: 216  ALPQDPKDEGSPVDVAKSYMCTRPPWASPSIDHTKPQTPS--GIQLFKEETPYLFGNNSM 273

Query: 795  SSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFA------VAPIRVE 950
             SSK KR+S  +GSW+IQ+E+RRVRS+ATE++LR+ PS+KID S FA      V    +E
Sbjct: 274  PSSKLKRDSAATGSWSIQDEIRRVRSRATEELLRSLPSSKIDWSAFAMENKNNVNSSAIE 333

Query: 951  SIGA--GEEV----------LGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDP 1094
            +IGA  GE V          + +   +   VS      +D      V S+P  T F ++ 
Sbjct: 334  NIGASLGERVHNSTNLVDASVNLARGLGSQVSPDLESKLDEFQPESVLSNPVNTNFEQNQ 393

Query: 1095 ETVPADGECAASEFAHP-----SRSDHAEEHHTDPCLSIINGVPATE------------- 1220
             +V         + +        R   +++ H D  L  +NG+  T              
Sbjct: 394  GSVAVQQTRGTEDGSREITTSGLRDGSSDDMHRDGSLVKVNGISDTNGSGHQLDSVEETR 453

Query: 1221 --------------VTENGGKQ--IVNXXXXXXXXXXXXXXXXLNKEQHDEENQNADTVK 1352
                          + E  G +  + N                 N +  D +    D+ +
Sbjct: 454  DAINSRLQDSNHLVIKEKVGAEDALANGFPSSGPSFNAGQVIEQNTKTLDNKPNTTDSSQ 513

Query: 1353 EKKPANIINVEGNCELLSEAYMEVPVVTETDS----IASGSQNSMGMQYDELSLDMAQPS 1520
            E+    ++  E  C+ L E+  EVP V   DS    +ASGSQNS  M   E+  D +QP 
Sbjct: 514  ERTAQGVLEQE-ECQTLRES-TEVPDVIGDDSVADRVASGSQNSSSMY--EVQHDTSQPG 569

Query: 1521 -----PDEKVNIVAGKQQGRKTAKPKRQRKR 1598
                 P    +I   KQ+GR+      +R R
Sbjct: 570  VELGLPATPTSI--AKQKGRRITTRYNRRGR 598


>ref|XP_006578198.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 606

 Score =  193 bits (491), Expect = 2e-46
 Identities = 174/571 (30%), Positives = 260/571 (45%), Gaps = 68/571 (11%)
 Frame = +3

Query: 90   NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
            NW +  V+  +RF+ASGAGKI  S+                      +         +  
Sbjct: 41   NWLSRFVISPSRFIASGAGKIFSSVLDLDNSPSDSSSATCSLSSSANDSDAEEVGTFDDE 100

Query: 270  NETNVTSSEIMHHGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVSRE 449
            N+        +  G   Q  V  S+ K  IEQL+M+ESFSREECD+LIK++ SRV     
Sbjct: 101  NDNPSEGDVALSKGL--QPFVRNSKNKHMIEQLLMKESFSREECDRLIKIIRSRVVDPAN 158

Query: 450  KNMLAGSP----GKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGTCNLNST 617
             +     P     K + + T+  ++++ A+ EAKKW Q+KK    + T++ +G+ +LN  
Sbjct: 159  DDDGDKRPTDMSNKILGSDTDSPELHDVAIMEAKKWLQEKKSALDTNTDIGYGSLSLNLV 218

Query: 618  GLGHVETGGSSPVDVARSYMKDRPPWASPT-RNVELRTPSTTTMKLFKEGTLYSVGQDSL 794
             L        SPVDVA+SYM  RPPWASP+  + + +TPS   ++LFKE T Y  G +S+
Sbjct: 219  ALPQDPKDEGSPVDVAKSYMCTRPPWASPSIDHTKPQTPS--GIQLFKEETPYLFGNNSM 276

Query: 795  SSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFA------VAPIRVE 950
             SSK KR+S  +GSW+IQ+E+RRVRS+ATE++LR+ PS+KID S FA      V    +E
Sbjct: 277  PSSKLKRDSAATGSWSIQDEIRRVRSRATEELLRSLPSSKIDWSAFAMENKNNVNSSAIE 336

Query: 951  SIGA--GEEV----------LGMGERMTETVSLRETKPVDALVDAGVSSDPALTFFLKDP 1094
            +IGA  GE V          + +   +   VS      +D      V S+P  T F ++ 
Sbjct: 337  NIGASLGERVHNSTNLVDASVNLARGLGSQVSPDLESKLDEFQPESVLSNPVNTNFEQNQ 396

Query: 1095 ETVPADGECAASEFAHP-----SRSDHAEEHHTDPCLSIINGVPATE------------- 1220
             +V         + +        R   +++ H D  L  +NG+  T              
Sbjct: 397  GSVAVQQTRGTEDGSREITTSGLRDGSSDDMHRDGSLVKVNGISDTNGSGHQLDSVEETR 456

Query: 1221 --------------VTENGGKQ--IVNXXXXXXXXXXXXXXXXLNKEQHDEENQNADTVK 1352
                          + E  G +  + N                 N +  D +    D+ +
Sbjct: 457  DAINSRLQDSNHLVIKEKVGAEDALANGFPSSGPSFNAGQVIEQNTKTLDNKPNTTDSSQ 516

Query: 1353 EKKPANIINVEGNCELLSEAYMEVPVVTETDS----IASGSQNSMGMQYDELSLDMAQPS 1520
            E+    ++  E  C+ L E+  EVP V   DS    +ASGSQNS  M   E+  D +QP 
Sbjct: 517  ERTAQGVLEQE-ECQTLRES-TEVPDVIGDDSVADRVASGSQNSSSMY--EVQHDTSQPG 572

Query: 1521 -----PDEKVNIVAGKQQGRKTAKPKRQRKR 1598
                 P    +I   KQ+GR+      +R R
Sbjct: 573  VELGLPATPTSI--AKQKGRRITTRYNRRGR 601


>gb|EOY29204.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 599

 Score =  192 bits (487), Expect = 5e-46
 Identities = 127/294 (43%), Positives = 169/294 (57%), Gaps = 14/294 (4%)
 Frame = +3

Query: 90  NWFTGIVVPSARFLASGAGKILYSIFXXXXXXXXXXXXXXXXXXXXXNHFEIPYNGDNML 269
           NW +  V    R + +GAG+IL S+F                       F      DN  
Sbjct: 46  NWISRHVFSPTRTIVTGAGRILSSVFGYESSSSSSSSSSSDCD------FSSDDTDDN-- 97

Query: 270 NETNVTSSEIMH--HGQKSQLSVWRSETKRTIEQLIMQESFSREECDKLIKVLNSRVTVS 443
           N+     S+ +H    ++ Q    ++ETKR IEQL++QE+FSREECDKL  ++ SRV   
Sbjct: 98  NDDKDVLSQGVHTIEHREPQSFAGKTETKRLIEQLLVQETFSREECDKLTNIIKSRVM-- 155

Query: 444 REKNMLAG--------SPGKAVDNGTEDVDIYNKAVQEAKKWFQQKKVGSSSVTELAHGT 599
            +  ML G        +P +   +  E  D+ + AV EA+KW ++KK+GSSS +EL + T
Sbjct: 156 -DSPMLTGMGDARLNETPNRTGGSDVEIHDLCSAAVMEARKWLEEKKLGSSSKSELDNET 214

Query: 600 CNLNSTGLGH-VETGGSSPVDVARSYMKDRPPWASP-TRNVELRTPSTTTMKLFKEGTLY 773
              N     H  E    SPVDVA+SYM+ RPPWASP T+N+  R+ S   M LFKE T Y
Sbjct: 215 SARNPVTFTHGAEEETGSPVDVAKSYMRTRPPWASPSTKNIGFRSSSPIGMPLFKEDTPY 274

Query: 774 SVGQDSLSSSK-KRNSLVSGSWNIQEELRRVRSKATEDMLRT-PSAKIDSSLFA 929
           S+G +S SSSK KR S  +GSWNIQEE+R+VRSKATE+MLRT  S+KID S F+
Sbjct: 275 SIGGNSFSSSKLKRGSPATGSWNIQEEIRKVRSKATEEMLRTRSSSKIDWSSFS 328

