BLASTX nr result

ID: Rehmannia28_contig00025141 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00025141
         (2107 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011083668.1| PREDICTED: flocculation protein FLO11 [Sesam...   396   e-124
ref|XP_006473518.1| PREDICTED: proteoglycan 4 isoform X1 [Citrus...   243   5e-67
ref|XP_006435013.1| hypothetical protein CICLE_v10000483mg [Citr...   242   9e-67
ref|XP_009785473.1| PREDICTED: flocculation protein FLO11 [Nicot...   236   5e-64
ref|XP_015384333.1| PREDICTED: neurofilament heavy polypeptide i...   231   3e-63
gb|KDO84591.1| hypothetical protein CISIN_1g005888mg [Citrus sin...   230   1e-62
ref|XP_009616925.1| PREDICTED: mucin-2-like [Nicotiana tomentosi...   228   4e-61
ref|XP_007017567.1| Uncharacterized protein TCM_034063 [Theobrom...   216   4e-58
ref|XP_002302027.2| isoflavone reductase family protein [Populus...   210   3e-56
ref|XP_011028856.1| PREDICTED: mucin-2 [Populus euphratica]           205   2e-54
ref|XP_015161887.1| PREDICTED: flocculation protein FLO11-like [...   206   3e-54
ref|XP_010320396.1| PREDICTED: cell wall protein DAN4 [Solanum l...   201   2e-52
ref|XP_015072007.1| PREDICTED: mucin-5AC [Solanum pennellii]          198   2e-51
ref|XP_012851385.1| PREDICTED: uncharacterized protein LOC105971...   186   9e-51
ref|XP_002306875.1| isoflavone reductase family protein [Populus...   194   1e-50
ref|XP_002510405.1| PREDICTED: serine/arginine repetitive matrix...   194   2e-50
ref|XP_009607887.1| PREDICTED: cell wall protein DAN4-like [Nico...   195   3e-50
ref|XP_011007468.1| PREDICTED: mucin-2-like [Populus euphratica]      188   2e-48
ref|XP_002281318.1| PREDICTED: proteoglycan 4 [Vitis vinifera]        177   9e-44
ref|XP_012071885.1| PREDICTED: proteoglycan 4 [Jatropha curcas] ...   169   3e-42

>ref|XP_011083668.1| PREDICTED: flocculation protein FLO11 [Sesamum indicum]
          Length = 746

 Score =  396 bits (1017), Expect = e-124
 Identities = 251/491 (51%), Positives = 299/491 (60%), Gaps = 53/491 (10%)
 Frame = +2

Query: 794  PQARSITSSSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSK 973
            PQARS T SSPS+MGS   ++P+T QRA QPRSPS               K    +SPSK
Sbjct: 225  PQARS-TISSPSRMGS---QSPSTIQRASQPRSPSRLSSRSPKLTSSTSSKHMQPNSPSK 280

Query: 974  ESRETNQDISPSSEPQTLASGEKDS-------------------KPVASQQE-------- 1072
            E+R T Q      EPQ  A  + DS                   K V S+          
Sbjct: 281  EARRTTQQ-----EPQLKAEAKSDSAKNFNGETVDGTRSKKDEVKAVPSKSSEAGATTNP 335

Query: 1073 --------PQLK----------AEVKSDTVDDF-NGVDDGTNFKHNEVKVMPSKPSQLGT 1195
                    P  K          AEVKS++ +   N   D T+ K +EV V  SK S+LGT
Sbjct: 336  VQGSETTMPTPKSDSSNMNGSSAEVKSESSNHSKNQTVDETSSKRDEVNVTSSKSSELGT 395

Query: 1196 TTTISQSPDKPTQTEKLDSSFINGGNEP---LKAMPXXXXXXXXMVQARKTEANEVTKKE 1366
            +TT  Q  + PT T+K  SS +NG ++P   L A P        +VQ  K + NE TKKE
Sbjct: 396  STTPVQGLETPTPTQK--SSTVNGEHKPVMELTAKPEETQEVKEVVQETKVKINEETKKE 453

Query: 1367 VNDFLAPKSESEEPVIEKSEISRKPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPK 1546
            VN+ L PKS S   +IEK+EI  K ++ + EKQ  LD  E + TS+PS++ TNTTTSQ K
Sbjct: 454  VNNILTPKSGSGVNIIEKAEIPSKSNQKNAEKQEALDTKEILPTSVPSLKQTNTTTSQLK 513

Query: 1547 KRSTILGTHSKSARTSGEHVSLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGEN 1726
            KRS I  T   SA+ SGEHV+L+K+I DD+ST VN+   G  K AI D+ VS+ITLAGEN
Sbjct: 514  KRSMISDTQKISAQPSGEHVALHKDIRDDLSTFVNRVTAGDPKTAINDKPVSVITLAGEN 573

Query: 1727 RGASMQMGYNSSK---GVHIHRGYKIKPDENAEATTDGEGSF-KGKQSEDAKATEDQPTE 1894
            RGASMQMG NSS+    VHIHRGYKI PDE AEATTDG+GS  KGK+SEDAK TEDQP E
Sbjct: 574  RGASMQMGSNSSRTEGPVHIHRGYKINPDETAEATTDGDGSSNKGKKSEDAKTTEDQPKE 633

Query: 1895 AYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRA 2074
             YVNNNAQGINNSIVFN SI ERNPGVHM V H PKE IQS++K  P ETR+AEFNM+R 
Sbjct: 634  VYVNNNAQGINNSIVFNVSITERNPGVHMSVAHAPKESIQSTDKRGPPETRRAEFNMNRM 693

Query: 2075 EKLTYEPTVRR 2107
            EKL YEP +RR
Sbjct: 694  EKLPYEPKIRR 704


>ref|XP_006473518.1| PREDICTED: proteoglycan 4 isoform X1 [Citrus sinensis]
          Length = 687

 Score =  243 bits (619), Expect = 5e-67
 Identities = 170/472 (36%), Positives = 239/472 (50%), Gaps = 32/472 (6%)
 Frame = +2

Query: 788  VAPQARSITSS-SPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSS 964
            + PQ+   TSS SPS MG      P T     Q  SP                  TH  S
Sbjct: 178  LGPQSTGQTSSRSPSPMGKATKVRP-TAGAVSQLPSPQKQSEPSTKETSQPPLNITHPLS 236

Query: 965  PSKESRETNQDISP---SSEPQTLASGEKDSKPVASQQ---EPQLKAEVKSDTVDDFNGV 1126
               E +ET +   P   S+ P +  S E++SKP AS+Q   EPQ K ++KS+   +    
Sbjct: 237  SYSEEKETKESSQPPLTSTHPLSSLSPERESKPAASKQLLQEPQPKTQIKSEIDSESQYR 296

Query: 1127 DDG-TNFKHNEVKVMPSKPSQLGTTT--TISQSPDKPTQTEKLDSSFINGGNEPLKAMPX 1297
              G T FK +      ++ S LGTT   T S +P  P   EKL    +    + ++ +  
Sbjct: 297  TGGETTFKPDTTAAQTTQASDLGTTIPPTSSVTPGAPVAREKLP---LTESEKKIEGVER 353

Query: 1298 XXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISR-------------- 1435
                   + +A+  +  EV ++ V D     S  E+     SE+                
Sbjct: 354  KKDMQQIVNEAKPQDGKEVARELVKDEKTNGSADEQMPRTISELLTAASGLETRSKELFG 413

Query: 1436 ---KPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHV 1606
               K +E   EK  + +R +  +T     +   T +S   K   I+G H K   ++G+  
Sbjct: 414  AKIKTEERKQEKHEIFERKKPQSTPESEEKHIKTVSSTHAKDRNIIGNHQKPGVSNGDRT 473

Query: 1607 SLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHI 1777
             L+KEI +DIS  V++ AIG +K  + DR VS+IT+AGENRGASMQ+G  S+K    V I
Sbjct: 474  PLHKEIKEDISKFVHKLAIGHSKQPVDDRPVSIITVAGENRGASMQLGAESAKKGGSVPI 533

Query: 1778 HRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIA 1957
            HRGYK+K DE AE TTDGE S KG+   D     +Q T AY+N+N Q INNSI+FN+S+ 
Sbjct: 534  HRGYKLKSDEIAETTTDGEESSKGRSPRDPTTKNNQATTAYINSNIQSINNSIMFNSSVN 593

Query: 1958 ERNPGVHMVVTHVPKEPIQSSEKTSPLET--RKAEFNMSRAEKLTYEPTVRR 2107
            ERNPGVH+V +H   EP + + K   LET   +A+  ++ +EKLTY+PTVRR
Sbjct: 594  ERNPGVHLVFSHNLAEPTKPATKPETLETHGHEAKVTITPSEKLTYQPTVRR 645


>ref|XP_006435013.1| hypothetical protein CICLE_v10000483mg [Citrus clementina]
            gi|557537135|gb|ESR48253.1| hypothetical protein
            CICLE_v10000483mg [Citrus clementina]
          Length = 687

 Score =  242 bits (617), Expect = 9e-67
 Identities = 174/472 (36%), Positives = 243/472 (51%), Gaps = 32/472 (6%)
 Frame = +2

Query: 788  VAPQARSITSS-SPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTH-LS 961
            + PQ+   TSS SPS MG      P T     Q  SP                  TH LS
Sbjct: 178  LGPQSTGQTSSRSPSPMGKATKVRP-TAGAVSQLPSPQKQSEPSTKETSQPPLNITHPLS 236

Query: 962  SPS--KESRETNQDISPSSEPQTLASGEKDSKPVASQQ---EPQLKAEVKSDTVDDFNGV 1126
            S S  KE++ET+Q    S+ P +  S EK+SKP AS+Q   EPQ K ++KS+   +    
Sbjct: 237  SYSEEKETKETSQPPLTSTHPLSSLSPEKESKPAASKQLLQEPQPKTQIKSEIDSESQYR 296

Query: 1127 DDG-TNFKHNEVKVMPSKPSQLGTTT--TISQSPDKPTQTEKLDSSFINGGNEPLKAMPX 1297
              G T FK +      ++ S LGTT   T S +P  P   EKL    +    + ++ +  
Sbjct: 297  TGGETTFKPDTTAAQTTQASDLGTTIPPTSSVTPGAPVAREKLP---LTESEKKIEGVER 353

Query: 1298 XXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISR-------------- 1435
                   + +A+  +  EV ++ V D     S  E+     SE+                
Sbjct: 354  KKDMQQIVNEAKPQDGKEVARELVKDEKTNGSADEQMPRTISELLTAASGLETRSKELFG 413

Query: 1436 ---KPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHV 1606
               K +E   EK  + +R +  +T     +   T +S   K   I+G H K   ++G+  
Sbjct: 414  AKIKTEERKQEKHEIFERKKPQSTPESEEKHIKTVSSTHAKDRNIIGNHQKPGVSNGDRT 473

Query: 1607 SLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHI 1777
             L+KEI +DIS  V++ AIG +K  + DR VS+IT+AGENRGASMQ+G  S+K    V I
Sbjct: 474  PLHKEIKEDISKFVHKLAIGHSKQPVDDRPVSIITVAGENRGASMQLGAESAKKGGSVPI 533

Query: 1778 HRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIA 1957
            HRGYK+K DE AE TTDGE S KG+   D     +Q T A +N+N Q INNSI+FN+S+ 
Sbjct: 534  HRGYKLKSDEIAETTTDGEESSKGRSPRDPTTKNNQATTACINSNIQSINNSIMFNSSVN 593

Query: 1958 ERNPGVHMVVTHVPKEPIQSSEKTSPLET--RKAEFNMSRAEKLTYEPTVRR 2107
            ERNPGVH+V +H   EP + + K   LET   +A+  ++ +EKLTY+PTVRR
Sbjct: 594  ERNPGVHLVFSHNLAEPTKPATKPETLETHGHEAKVTITPSEKLTYQPTVRR 645


>ref|XP_009785473.1| PREDICTED: flocculation protein FLO11 [Nicotiana sylvestris]
          Length = 758

 Score =  236 bits (601), Expect = 5e-64
 Identities = 158/459 (34%), Positives = 248/459 (54%), Gaps = 20/459 (4%)
 Frame = +2

Query: 791  APQARSITS-SSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSP 967
            APQ+R +   SSP++       T     + P P S S +             K   ++ P
Sbjct: 287  APQSREMNPISSPTRKDPQVLSTDQLTSKVPPPASQSTSSDLPSGMTSQMQEKDKTIALP 346

Query: 968  S--KESRETNQDISPSSEPQTLASGEKDS----KPVASQQEPQLKAEVKSDTVDDFNGVD 1129
            +   +  ++++  S S+E  T  SG++      +P + Q E  LK E  SDTV +     
Sbjct: 347  TFPLKQLDSSEPSSKSTEALTSISGKEPKFTAIRPQSEQME-LLKKETISDTVAE----- 400

Query: 1130 DGTNFKHNEVKVMPSKPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXX 1309
                 K  E  +  S+ S++ +T  I+Q P + +     DSS I   +EP          
Sbjct: 401  --AKVKSPEKVMKSSEISEVKSTKGITQEPSQIS-----DSSGII--DEPK--------- 442

Query: 1310 XXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKS----EISRKPD------EVHTE 1459
                   R++E N    +EVN+ +           EK       +++P       + +TE
Sbjct: 443  -----MVRQSETNIQETREVNEVVQEMRNKNYGTGEKIGGLLTSTKQPGTAFQSKKAYTE 497

Query: 1460 KQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIMDDIS 1639
            KQ+  D ++     + +   T T +SQPK ++ +  +  ++A  + + + LNKE+ D+IS
Sbjct: 498  KQSNSDNDQIRVNLVSNGNHTKTVSSQPKNKTIVNSSSKETAGFAEQDIPLNKEVKDNIS 557

Query: 1640 TIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDEN 1810
              +++ A+G  K  + +  VS+ITLAG+NRGASMQ+G +SS     +HIHRGYK+ PDE+
Sbjct: 558  KFIHRMAVGDGKQNLEEGPVSVITLAGDNRGASMQLGSDSSSKEGAIHIHRGYKLNPDES 617

Query: 1811 AEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVT 1990
            A+ATTD EG  +G++ +DA+  EDQ  EAY+N N QG+NNSI F+++IA +NPG+HM+  
Sbjct: 618  ADATTDAEGYSEGRRPKDARTMEDQEIEAYLNCNVQGLNNSITFDSAIAAKNPGIHMLFP 677

Query: 1991 HVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            H+P EPI+SS +T P E  KAEFN++ A+KLTYEP +RR
Sbjct: 678  HMPSEPIRSSGRTGPFEAHKAEFNVTPAQKLTYEPKIRR 716


>ref|XP_015384333.1| PREDICTED: neurofilament heavy polypeptide isoform X2 [Citrus
            sinensis]
          Length = 612

 Score =  231 bits (588), Expect = 3e-63
 Identities = 163/473 (34%), Positives = 238/473 (50%), Gaps = 35/473 (7%)
 Frame = +2

Query: 794  PQARSITSSSPSQMGSHNSRTP-----NTPQR-APQPRSPSLNXXXXXXXXXXXXXKGT- 952
            P   S  +S P+      ++TP      +P R APQ R+ S+              +   
Sbjct: 111  PPPESRVTSQPASPSRARTQTPVASQIRSPSRPAPQARAASVPPSPPRTAWTQPGIQAAA 170

Query: 953  HLSSPSKESRETNQDISPSSEPQTLASGEKDSKPVASQQ---EPQLKAEVKSDTVDDFNG 1123
               SPS+   ++   +SP          E++SKP AS+Q   EPQ K ++KS+   +   
Sbjct: 171  EPRSPSRLGPQSTSSLSP----------ERESKPAASKQLLQEPQPKTQIKSEIDSESQY 220

Query: 1124 VDDG-TNFKHNEVKVMPSKPSQLGTTT--TISQSPDKPTQTEKLDSSFINGGNEPLKAMP 1294
               G T FK +      ++ S LGTT   T S +P  P   EKL    +    + ++ + 
Sbjct: 221  RTGGETTFKPDTTAAQTTQASDLGTTIPPTSSVTPGAPVAREKLP---LTESEKKIEGVE 277

Query: 1295 XXXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISR------------- 1435
                    + +A+  +  EV ++ V D     S  E+     SE+               
Sbjct: 278  RKKDMQQIVNEAKPQDGKEVARELVKDEKTNGSADEQMPRTISELLTAASGLETRSKELF 337

Query: 1436 ----KPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEH 1603
                K +E   EK  + +R +  +T     +   T +S   K   I+G H K   ++G+ 
Sbjct: 338  GAKIKTEERKQEKHEIFERKKPQSTPESEEKHIKTVSSTHAKDRNIIGNHQKPGVSNGDR 397

Query: 1604 VSLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVH 1774
              L+KEI +DIS  V++ AIG +K  + DR VS+IT+AGENRGASMQ+G  S+K    V 
Sbjct: 398  TPLHKEIKEDISKFVHKLAIGHSKQPVDDRPVSIITVAGENRGASMQLGAESAKKGGSVP 457

Query: 1775 IHRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASI 1954
            IHRGYK+K DE AE TTDGE S KG+   D     +Q T AY+N+N Q INNSI+FN+S+
Sbjct: 458  IHRGYKLKSDEIAETTTDGEESSKGRSPRDPTTKNNQATTAYINSNIQSINNSIMFNSSV 517

Query: 1955 AERNPGVHMVVTHVPKEPIQSSEKTSPLET--RKAEFNMSRAEKLTYEPTVRR 2107
             ERNPGVH+V +H   EP + + K   LET   +A+  ++ +EKLTY+PTVRR
Sbjct: 518  NERNPGVHLVFSHNLAEPTKPATKPETLETHGHEAKVTITPSEKLTYQPTVRR 570


>gb|KDO84591.1| hypothetical protein CISIN_1g005888mg [Citrus sinensis]
          Length = 671

 Score =  230 bits (587), Expect = 1e-62
 Identities = 170/462 (36%), Positives = 238/462 (51%), Gaps = 22/462 (4%)
 Frame = +2

Query: 788  VAPQARSITSS-SPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTH-LS 961
            + PQ+   TSS SPS MG      P T     Q  SP                  TH LS
Sbjct: 178  LGPQSTGQTSSRSPSPMGKATKVRP-TAGAVSQLPSPQKQSEPSTKETSQTPLNITHPLS 236

Query: 962  SPS--KESRETNQDISPSSEPQTLASGEKDSKPVASQQ---EPQLKAEVKSDTVDDFNGV 1126
            S S  KE++ET+Q    S+ P +  S EK+SKP AS+Q   EPQ K ++KS+   +    
Sbjct: 237  SYSEEKETKETSQPPLTSTHPLSSLSPEKESKPAASKQLLQEPQPKTQIKSEIDSESQYR 296

Query: 1127 DDG-TNFKHNEVKVMPSKPSQLGTTT--TISQSPDKPTQTEKLDSSFINGGNEPLKAMPX 1297
              G T FK +      ++ S LGTT   T S +P  P   EKL    +    + ++ +  
Sbjct: 297  TGGETTFKPDTTAAQTTQASDLGTTIPPTSSVTPGAPVAREKLP---LTESEKKIEGVER 353

Query: 1298 XXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESE-------EPVIEKSEISRKPDEVHT 1456
                   + +A+  +  EV ++ V D     S  E       E +   S +  +  E+  
Sbjct: 354  KKDMQQIVNEAKPQDGKEVARELVKDEKTNGSADEQMPRTISELLTAASGLETRSKELFG 413

Query: 1457 EKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIMDDI 1636
             K    +R +         +P +T  S+ K   T      K   ++G+   L+KEI +DI
Sbjct: 414  AKIKTEERKQEKHEIFERKKPQSTPESEEKHIKT------KPGVSNGDRTPLHKEIKEDI 467

Query: 1637 STIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDE 1807
            S  V++ AIG +K  + DR VS+IT+AGENRGASMQ+G  S+K    V IHRGYK+K DE
Sbjct: 468  SKFVHKLAIGHSKQPVDDRPVSIITVAGENRGASMQLGAESAKKGGSVPIHRGYKLKSDE 527

Query: 1808 NAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVV 1987
             AE TTDGE S KG+   D     +Q T A +N+N Q INNSI+FN+S+ ERNPGVH+V 
Sbjct: 528  IAETTTDGEESSKGRSPRDPTTKNNQATTACINSNIQSINNSIMFNSSVNERNPGVHLVF 587

Query: 1988 THVPKEPIQSSEKTSPLET--RKAEFNMSRAEKLTYEPTVRR 2107
            +H   EP + + K   LET   +A+  ++ +EKLTY+PTVRR
Sbjct: 588  SHNLAEPTKPATKPETLETHGHEAKVTITPSEKLTYQPTVRR 629


>ref|XP_009616925.1| PREDICTED: mucin-2-like [Nicotiana tomentosiformis]
          Length = 806

 Score =  228 bits (582), Expect = 4e-61
 Identities = 151/455 (33%), Positives = 246/455 (54%), Gaps = 17/455 (3%)
 Frame = +2

Query: 794  PQARSITS-SSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPS 970
            PQ+R +   SSP++       T     + P P S S++             K   ++ P+
Sbjct: 341  PQSREMNPISSPTRKDPQVLSTDQLTSKVPPPASQSISSNLPSGITSQMQQKDQTIAQPT 400

Query: 971  KESRE--TNQDISPSSEPQTLASGEKDSKPVASQQEPQLKAEVKSDTVDDFNGVDDGTNF 1144
               ++  +++  S S+E  T   G++        Q  Q++   K +T+ D          
Sbjct: 401  SPLKQLDSSEPSSKSTEALTSIYGKEPKLTAIRPQSEQMELP-KKETISDTEA-----KV 454

Query: 1145 KHNEVKVMPSKPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXXXMV 1324
            +  E  + P K S++ +T  I++ P + +     DSS ING  +P+            MV
Sbjct: 455  RSPEKIMQPLKLSEVKSTRGITEEPSQIS-----DSSEING--KPM------------MV 495

Query: 1325 QARKTEANEVTK-KEVNDFLAPKSESEEPVIEKSEISRKPD-------EVHTEKQNVLDR 1480
               +T   E  + KEV   +  K+      I     S+K         + +TE+Q   + 
Sbjct: 496  TQSETNIQETREVKEVMQEMRDKNYGAGENIGGFLTSKKQPGIVFQSKQAYTEQQASSNN 555

Query: 1481 NE---NVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIMDDISTIVN 1651
            N+   N  +++ + + T T  SQPK ++    +  ++A ++ + + LNKE+ D+IS +++
Sbjct: 556  NQIRVNPVSNVSNRKHTRTVLSQPKNKTIASSSSKETAVSTEQDIPLNKEVKDNISKLIH 615

Query: 1652 QTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDENAEAT 1822
            + A+G  K  + D  VS+ITLAG+NRGASMQ+G +SS+    +HI RGYK+ PDE+A+AT
Sbjct: 616  RMAVGEGKQNLEDGPVSVITLAGDNRGASMQLGSDSSRKEGAIHIRRGYKLNPDESADAT 675

Query: 1823 TDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVPK 2002
            TD EG+  G++ +DA+  +DQ  EAY+N N QG+NNSI F+++I  +NPG+HM+   +P 
Sbjct: 676  TDAEGNSVGRRPKDARIVDDQEIEAYLNCNVQGMNNSITFDSAIEAKNPGIHMLFYCMPS 735

Query: 2003 EPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            EPI+SSE+T P E  KAEFN++ A+KLTYEP +RR
Sbjct: 736  EPIRSSERTGPFEAHKAEFNVTPAQKLTYEPKIRR 770


>ref|XP_007017567.1| Uncharacterized protein TCM_034063 [Theobroma cacao]
            gi|508722895|gb|EOY14792.1| Uncharacterized protein
            TCM_034063 [Theobroma cacao]
          Length = 569

 Score =  216 bits (549), Expect = 4e-58
 Identities = 147/401 (36%), Positives = 217/401 (54%), Gaps = 19/401 (4%)
 Frame = +2

Query: 962  SPSK---ESRETNQDISPSSEPQTLAS---GEKDSKPVA-SQQEPQLKAEVKSDTVDDFN 1120
            SPS+   + + T Q +S    P  LAS   G+  S+P + S++  Q +++  S T+   +
Sbjct: 144  SPSRIASQPQSTAQTVSEQQSPSRLASQPPGQTSSQPSSPSRRATQERSQPPSSTLPPLS 203

Query: 1121 GVDDGTNFKHNEVKVMPSKPSQLG-------TTTTISQSPDKPTQTEKLDSSFINGGNEP 1279
               + T F+   V V PS+ S            T    +P KP + E+   +      E 
Sbjct: 204  ASQE-TTFRPFGVAVEPSQASAQAKEVAPIIAATETPSAPLKPKEREERKKA----AEER 258

Query: 1280 LKAMPXXXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKPDEVHTE 1459
             KA            + + +   E  ++ +   LA  +++     E    + +    H E
Sbjct: 259  RKA------------KTKGSTHEEPEQRTITKLLAAAADAGTKTRELLGAAFETGIRHQE 306

Query: 1460 KQNVLDRNENVATSLPSVRPTNTTTSQ-PKKRSTILGTHSKSARTSGEHVSLNKEIMDDI 1636
            KQ  ++R +   TS    +   T +S  PK+ ST   +H K A ++ E V L+KEI +DI
Sbjct: 307  KQEDIERKKIWTTSSTDEKQIKTVSSTYPKEGSTPTNSHQKHATSTWEQVPLHKEIREDI 366

Query: 1637 STIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSKG---VHIHRGYKIKPDE 1807
            S  V++ A G  K    ++ ++++TLAGENRGAS  MG  S+K    VHIHRGYKI PD+
Sbjct: 367  SKFVHKMATGQPKLPTDEKSIAVLTLAGENRGASFYMGSESAKKDGLVHIHRGYKINPDD 426

Query: 1808 NAEATT-DGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMV 1984
            + +ATT DGEGS +G++ +D+   E+    AYVN+N Q INNS+VF +S+ ERNPGVH+ 
Sbjct: 427  SPDATTTDGEGSSRGRKPKDSMTRENPAPRAYVNSNTQSINNSVVFESSVNERNPGVHLE 486

Query: 1985 VTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
                  EP +S+ K  PLETRKAEFN++ AEKLTYEPTVRR
Sbjct: 487  FLQNSAEPTKSNAKAGPLETRKAEFNVTPAEKLTYEPTVRR 527


>ref|XP_002302027.2| isoflavone reductase family protein [Populus trichocarpa]
            gi|550344208|gb|EEE81300.2| isoflavone reductase family
            protein [Populus trichocarpa]
          Length = 552

 Score =  210 bits (534), Expect = 3e-56
 Identities = 153/456 (33%), Positives = 223/456 (48%), Gaps = 19/456 (4%)
 Frame = +2

Query: 797  QARSITSSSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSKE 976
            Q+R    S  + +    SR     Q APQ +SPS               +    S     
Sbjct: 111  QSRETPQSRAASVPPSPSRAKTQSQAAPQTQSPS-----------RATPQSRAASVAPSP 159

Query: 977  SRETNQDISPSSE--PQTLASGEKDSKPVASQQEPQLKAEVKSDTVDDFNGVDDGTNFKH 1150
            SR T+Q  S ++   PQT  S  + +  V  ++ PQL +  K+ T               
Sbjct: 160  SRTTSQPQSAAAVTVPQT-QSPSRLATQVPGRKSPQLSSPSKTATQ------------VQ 206

Query: 1151 NEVKVMPSKPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXXXMVQA 1330
              V   PSK  QL T   ISQ P K TQ++                            + 
Sbjct: 207  PTVSQSPSKKLQLATQE-ISQPPPKSTQSDTQQE------------------------ET 241

Query: 1331 RKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKPDE--------------VHTEKQN 1468
            + T A E    + +D +   + + EP +E     +K D               V   K++
Sbjct: 242  KPTPAAEPV--QASDAVTAPTPTPEPALETPASLQKSDSRTIGADHPMPLSQPVKRVKED 299

Query: 1469 VLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIMDDISTIV 1648
            + +R +    S     P   T    + RS    +H K++ +SGE V L KEI +DIS  V
Sbjct: 300  IFERKKTTTIS-----PNGETIKTARTRSAFGESHQKTSMSSGEKVPLQKEIREDISKFV 354

Query: 1649 NQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDENAEA 1819
            +   +   ++ I ++ VS++TLAGENRGA+M +G  SS+    VHIHRGYKI PDE++EA
Sbjct: 355  HNLGMEHMEHPIGEKPVSVVTLAGENRGATMYVGSESSRKDGSVHIHRGYKINPDESSEA 414

Query: 1820 TTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVP 1999
            TTDGEGS +G+ S+D    ED   +AY+N+N Q +NNSI+F  S++ER+PGV + +++  
Sbjct: 415  TTDGEGSSRGRSSKDLLTKEDPARKAYINSNTQSVNNSILFETSVSERSPGVQLSLSYND 474

Query: 2000 KEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            +EP ++S K   LETRKAEF ++ AEKL+YEP VRR
Sbjct: 475  EEPSKNSSKPVRLETRKAEFQVTPAEKLSYEPKVRR 510


>ref|XP_011028856.1| PREDICTED: mucin-2 [Populus euphratica]
          Length = 553

 Score =  205 bits (522), Expect = 2e-54
 Identities = 154/464 (33%), Positives = 229/464 (49%), Gaps = 24/464 (5%)
 Frame = +2

Query: 788  VAPQARSITSSSPSQMGSHNSRT------PNTPQRAPQPRSPSLNXXXXXXXXXXXXXKG 949
            V P   +I S   SQ  S +  T      P+  +  PQ R+ S+                
Sbjct: 80   VPPHRAAIESQVNSQQVSQSPETRAASVPPSPSRETPQSRAASVPPSPSRAKTQSPAASQ 139

Query: 950  THLSSPSK---ESRETNQDISPS---SEPQTLA--------SGEKDSKPVASQQEPQLKA 1087
            T   SPS+   +SR  +   SPS   S+PQ+ A        S  + +  V  ++ PQL +
Sbjct: 140  TQ--SPSRATPQSRAASVAPSPSRTTSQPQSAAAVTVPQTQSPSRLATQVPGRKSPQLSS 197

Query: 1088 EVKSDTVDDFNGVDDGTNFKHNEVKVMPSKPSQLGTTTTISQSPDKPTQTE-KLDSSFIN 1264
              K+ T                 V   PSK  QL T   ISQ P K TQ++ + + +   
Sbjct: 198  PSKTATE------------VQPTVSQSPSKKLQLATQE-ISQPPPKSTQSDTQQEETKPT 244

Query: 1265 GGNEPLKAMPXXXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKPD 1444
               EP++A                T     T +   +  A   +S+   I          
Sbjct: 245  PAAEPVQASDAV------------TATPTPTPEPALETPASLQKSDSRTIGADHPMPISQ 292

Query: 1445 EVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEI 1624
             V   K+++ +R +    S     P   T    + R     +H K++ ++GE V L KEI
Sbjct: 293  PVKRVKEDIFERKKTTTAS-----PNGETIKTARSRYAFRESHQKTSMSNGEKVLLQKEI 347

Query: 1625 MDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKI 1795
             +DIS  V++  +   K+ + ++ VS++TLAGENRGA+M +G  SS+    VHIHRGYKI
Sbjct: 348  REDISKFVHKLGMEHIKHPMGEKPVSVVTLAGENRGATMYVGSESSRKDGSVHIHRGYKI 407

Query: 1796 KPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGV 1975
             PDE++EATTDGEGS +G+ S+     E    +AY+N+N Q +NNSI+F  S++ER+PGV
Sbjct: 408  NPDESSEATTDGEGSSRGRSSKGLLTKEGPARKAYINSNTQSVNNSILFETSVSERSPGV 467

Query: 1976 HMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
             + +++  +EP ++S K  PLETRKAEF ++ AEKL+YEP VRR
Sbjct: 468  QLSLSYKDEEPSKNSSKPVPLETRKAEFQVTPAEKLSYEPRVRR 511


>ref|XP_015161887.1| PREDICTED: flocculation protein FLO11-like [Solanum tuberosum]
          Length = 632

 Score =  206 bits (525), Expect = 3e-54
 Identities = 141/403 (34%), Positives = 207/403 (51%), Gaps = 20/403 (4%)
 Frame = +2

Query: 959  SSPSKESRETNQDISPSSE-PQTLASGEKDSKPV--ASQQEPQLKAEVKSDTVDDFNGVD 1129
            S P+ +S+E +Q  SP+S  PQ L++ +   K V  AS Q     +++ S    +   + 
Sbjct: 216  SRPAPQSQEMSQKSSPTSNGPQVLSTDQLTPKAVSTASGQTSSNPSDITSQMQPNNQTIS 275

Query: 1130 DGTNFKHNEVKVMPSKPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXX 1309
              T+         P K SQ         S   P  TE L S   N   EP   MP     
Sbjct: 276  QPTS---------PPKQSQ-------DSSEISPKSTETLPS---NSEKEP---MPATVKP 313

Query: 1310 XXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKPDEVHTEKQNVLDRNEN 1489
                ++  K EA   T  E  D    KS       + S I+ +P      + N+ +  E 
Sbjct: 314  QSEQMELPKKEAISDTLTEAKD----KSPENVKPSDSSRITSEPKTARPLETNIPETKEV 369

Query: 1490 VA----TSLPSV----------RPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIM 1627
                  T+ P +          +  N+   Q +      G  +++     + + LNKE+ 
Sbjct: 370  KEVVQQTTDPDIVFQSKQAYTEKQANSDNDQIRVNRVSNGKQTRTISAEPD-IPLNKEVK 428

Query: 1628 DDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSS---KGVHIHRGYKIK 1798
            D+IS  +N+  +G  K  + +  VS+ITLAG+NRGASMQ+  NSS   K VHIHRGYK+ 
Sbjct: 429  DNISKFINRMTVGDGKQKLEEGPVSVITLAGDNRGASMQLSSNSSRKGKAVHIHRGYKLN 488

Query: 1799 PDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVH 1978
             DE+A+ATTD EG+ KGKQ+ DA+  +DQ  EAY+N N QG+NNSI F+++I  +NPG+H
Sbjct: 489  ADESADATTDAEGTSKGKQTNDARTMKDQELEAYMNCNVQGLNNSITFDSTIEGKNPGIH 548

Query: 1979 MVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            M    +P EP ++SEKT   E  K E+N++RA+K TY+PT++R
Sbjct: 549  MTFPRMPSEPPKTSEKTGLFEAHKTEYNVTRAQKHTYKPTIKR 591


>ref|XP_010320396.1| PREDICTED: cell wall protein DAN4 [Solanum lycopersicum]
          Length = 613

 Score =  201 bits (511), Expect = 2e-52
 Identities = 136/454 (29%), Positives = 219/454 (48%), Gaps = 16/454 (3%)
 Frame = +2

Query: 794  PQARSITSSSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSK 973
            P      S S     S+ S+ P+     P P+S  ++                  SSP++
Sbjct: 166  PSQEQPASLSDVPTNSNVSQIPSPSPSRPAPQSQEMSPK----------------SSPTR 209

Query: 974  ESRETNQDISPSSEPQTLASGEKDSKP--VASQQEPQLKAEVKSDTVDDFNGVDDGTNFK 1147
               +       +S+  + AS +  + P  + SQ +P    ++ S          D +   
Sbjct: 210  NGPQVLSTDQLTSKAASTASDQTSANPSDITSQMQPN---DIISQPTSPPKQSHDSSEIS 266

Query: 1148 HNEVKVMPSKPSQLGTTTTISQSPDK----------PTQTEKLDSSFINGGNEPLKAMPX 1297
                + +PS   +    TT+    ++           T TE  D S       P K  P 
Sbjct: 267  PKSTETLPSNSEKEPMLTTVEPQSEQMELLKKEAISETSTEAKDKS-------PQKVKPS 319

Query: 1298 XXXXXXXMVQ-ARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKPDEVHTEKQNVL 1474
                     + AR  E N    KEV + +  +  +++   E+ +I  +  + +TEKQ   
Sbjct: 320  DSSRIITEPKTARPLEINIPETKEVKEVV--QETTDKNYREQPDIVFQSKQAYTEKQASS 377

Query: 1475 DRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKEIMDDISTIVNQ 1654
            D ++     + + + T T +++                   + V LNKE+ D+IS ++N+
Sbjct: 378  DNDQIRVNCVSNGKQTRTISAE-------------------QDVPLNKEVKDNISKLINR 418

Query: 1655 TAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSS---KGVHIHRGYKIKPDENAEATT 1825
              +G  K  + +  VS++TLAG+NRGASMQ+  NSS   K VHIHRGYK+  DE+ +ATT
Sbjct: 419  MTVGDGKQKLEEGPVSVVTLAGDNRGASMQLSSNSSRKGKAVHIHRGYKLNADESTDATT 478

Query: 1826 DGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVPKE 2005
            D EG+ KGKQ+ DA+  +DQ  EAY+N N QG+NNSI F+++I  +NPG+HM+   +P E
Sbjct: 479  DAEGTSKGKQTNDARTMKDQGLEAYMNCNVQGLNNSITFDSTIQGKNPGIHMIFPRMPSE 538

Query: 2006 PIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            P +S E T  +E  K+E+N+ R +K TY+PT++R
Sbjct: 539  PTKSCETTGLIEAHKSEYNVIRPQKHTYKPTIKR 572


>ref|XP_015072007.1| PREDICTED: mucin-5AC [Solanum pennellii]
          Length = 613

 Score =  198 bits (503), Expect = 2e-51
 Identities = 147/462 (31%), Positives = 222/462 (48%), Gaps = 27/462 (5%)
 Frame = +2

Query: 803  RSITSSSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSKESR 982
            +S T+++P+   +      +T  + PQ  +PS                 + + + S  S 
Sbjct: 126  QSSTTNAPASPKTETQTPTSTTSQRPQSSAPSGTESQPLIPSQEQPASLSDVPTNSNVS- 184

Query: 983  ETNQDISPSSE---PQTLASGEKDSKP-----VASQQEPQLKAEVKSDTVDDFNGVDDGT 1138
               Q +SPS     PQ+   G K S       V S  +   KA   +      N  D  +
Sbjct: 185  ---QILSPSPSRPAPQSQEMGPKSSPTRNGPQVLSTDQLTSKAASTASDQTSSNPSDITS 241

Query: 1139 NFKHNEVKVMP-SKPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXX 1315
              + N++   P S P Q   ++ IS     P  TE L S   N   EP   MP       
Sbjct: 242  QMQPNDIISQPTSPPKQSQDSSEIS-----PKSTETLPS---NSEKEP---MPATVEPQS 290

Query: 1316 XMVQARKTEANEVTKKEVNDFLAPKSE-SEEPVIEKSEISRKPDEVH------------- 1453
              ++  K EA   T  E  D    + + S+  +I     S +P E +             
Sbjct: 291  EQMEILKKEAISETSTEAKDKSPERVKPSDSSIIISEPKSARPLETNIPETKEVKEVVQE 350

Query: 1454 TEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEH-VSLNKEIMD 1630
            T  +N  ++ + V  S  +       +   + R   +    ++   S E  V LNKE+ D
Sbjct: 351  TTDKNYREQPDIVFQSKQAYTEKQANSDNDQIRVNRVSNGKQTRTISAEQDVPLNKEVKD 410

Query: 1631 DISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSS---KGVHIHRGYKIKP 1801
            +IS ++N+  +G  K  + +  VS+ITLAG+NRGASMQ+  NSS   K VHIHRGYK+  
Sbjct: 411  NISKLINRMTVGDGKQKLEEGPVSVITLAGDNRGASMQLSSNSSRKGKAVHIHRGYKLNA 470

Query: 1802 DENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHM 1981
            DE+ +A TD EG+ KGKQ+ D +  +DQ  EAY+N N QG+NNSI F+++I  +NPG+HM
Sbjct: 471  DESTDAMTDAEGTSKGKQTNDERTMKDQGLEAYMNCNVQGLNNSITFDSTIQGKNPGIHM 530

Query: 1982 VVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            +   +P EP +S E T  +E  KAE+N+ R +K TY+PT++R
Sbjct: 531  IFPRMPSEPTKSCETTGLIEAHKAEYNVIRPQKHTYKPTIKR 572


>ref|XP_012851385.1| PREDICTED: uncharacterized protein LOC105971083 [Erythranthe guttata]
          Length = 238

 Score =  186 bits (471), Expect = 9e-51
 Identities = 104/168 (61%), Positives = 126/168 (75%), Gaps = 6/168 (3%)
 Frame = +2

Query: 1622 IMDDISTIVNQTAIGG-TKNA-IYDRHVSLITLAGENRGASMQMGYNSSKG---VHIHRG 1786
            I DD+ST++N+ A    TK +  YDR VS+ITLAGENRGASM +G +SS     VHIHR 
Sbjct: 40   IKDDVSTLLNRVATSDDTKGSTFYDRAVSVITLAGENRGASMHVGPDSSNRDGPVHIHRS 99

Query: 1787 Y-KIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAER 1963
            Y KI PDEN +A    E S + K+SEDAK  +D+P+EAY+NNNAQGINNSIVFN S+ ER
Sbjct: 100  YNKINPDENNDAE---ESSREKKKSEDAK--QDKPSEAYINNNAQGINNSIVFNGSVTER 154

Query: 1964 NPGVHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            NPGVHM VTHV KEP++S+      E R+AEFN +RAEKLTYEPT+RR
Sbjct: 155  NPGVHMAVTHVAKEPVRSTNDGIAPEARRAEFNTTRAEKLTYEPTIRR 202


>ref|XP_002306875.1| isoflavone reductase family protein [Populus trichocarpa]
            gi|222856324|gb|EEE93871.1| isoflavone reductase family
            protein [Populus trichocarpa]
          Length = 545

 Score =  194 bits (494), Expect = 1e-50
 Identities = 139/440 (31%), Positives = 215/440 (48%), Gaps = 12/440 (2%)
 Frame = +2

Query: 821  SPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSKESRETNQDI 1000
            SPS+     S  P+  +  PQ R+ S+                T   SPS+ + ++    
Sbjct: 109  SPSRETRAASVPPSPSRATPQSRAASVPPSPSRATTQTQAASQTQ--SPSRATPQSRTAS 166

Query: 1001 SPSSEPQTLASGEKDSKPVASQQEP-QLKAEVKSDTVDDFNGVDDGTNFKHNEVKVMPSK 1177
             P S  +T +     +  V   + P +L + V   T    +            V   PSK
Sbjct: 167  VPPSPSRTTSQPRTAALAVQQPESPSRLASRVPGKTSSQPSSPSKIATQVQPTVSRSPSK 226

Query: 1178 PSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXXXMVQARKTEANEVT 1357
              QL T  T SQ P   TQ+           + PL  +P           +R+ + +E  
Sbjct: 227  KLQLATQET-SQPPPTSTQSATQQQETNPALSFPLSQVPQEKTEIRAENVSRQQQQSEPV 285

Query: 1358 KKEVNDFLAPKSESEEPVIEKSEISRK-------PDEVHTEKQNVLDRNENVATSLPSVR 1516
            +       A  + +    +E    S+K       PD      + + +  E++     +  
Sbjct: 286  QASGVVRAATATPTSVAALEIPAASQKSDSYTIGPDHPMNLSEQLKNVKEDIFERKKTTV 345

Query: 1517 PTNTTTSQPKKRSTILG-THSKSARTSGEHVSLNKEIMDDISTIVNQTAIGGTKNAIYDR 1693
             +N  T++  +   +LG +H KS+ ++GE V L+KEI +DIS  V++  +   K+ I ++
Sbjct: 346  SSNGETAKSARARYVLGESHQKSSMSNGEKVPLHKEIREDISKFVHKLGMEHIKHPIGEK 405

Query: 1694 HVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDENAEATTDGEGSFKGKQSED 1864
             VS++TLAGENRGASM  G   ++    VHIH GYKI PDE++E  TDGEGS KG + +D
Sbjct: 406  PVSIVTLAGENRGASMYEGSEPTRKDGSVHIHHGYKINPDESSENPTDGEGSSKGGKFKD 465

Query: 1865 AKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVPKEPIQSSEKTSPLET 2044
                ED   +AY+N+N Q +NNSI+F +S+ ER+PGV + ++H  +EP + S K  PLET
Sbjct: 466  LLTKEDPAMKAYINSNTQSVNNSILFESSLNERSPGVQLHLSHNDEEPSEYSAKPGPLET 525

Query: 2045 RKAEFNMSRAEKLTYEPTVR 2104
             K EF ++ AE LT+EPTV+
Sbjct: 526  HKGEFKVTPAEMLTHEPTVK 545


>ref|XP_002510405.1| PREDICTED: serine/arginine repetitive matrix protein 1 [Ricinus
            communis] gi|223551106|gb|EEF52592.1| oxidoreductase,
            putative [Ricinus communis]
          Length = 551

 Score =  194 bits (492), Expect = 2e-50
 Identities = 147/473 (31%), Positives = 222/473 (46%), Gaps = 35/473 (7%)
 Frame = +2

Query: 794  PQARSITSSSPSQMGSHNSRTPNTPQRAPQPR----SPSLNXXXXXXXXXXXXXKGTHLS 961
            P   SI S  P Q  +  S  P TP   P+ R    SPS                 T L 
Sbjct: 51   PGLASIQSPPPPQARAPQSTEPQTPVAIPESRIIVQSPSSVKTQTRAASVPPSPSQTSLP 110

Query: 962  SPSK-ESRETNQDISPS---------SEPQTLASGEKDSKPVASQQEPQLKAEVKSDTVD 1111
            S +K E R  +Q  SPS         S P   ++ +  ++P AS   P+  +++   T  
Sbjct: 111  SRAKSEGRVVSQTRSPSRAASQPRAASVPPFRSTQQTVAQPQAS---PRSASQLAGRTSS 167

Query: 1112 DFNGVDDGTNFKHNEVKVMPSKPSQLGTTTTISQSPDKPTQTEKLD--SSFINGGNEPLK 1285
              +     T  +    +  P       T    SQ P    + E     SS  +   +P  
Sbjct: 168  QPSPPPRRTTQQPTGSQPPPPLRKLQSTAQETSQPPPSAYRQEPKPEGSSLFSQVAQPTA 227

Query: 1286 AM-PXXXXXXXXMVQARKTEANEVTKKE--------------VNDFLA-PKSESEEPVIE 1417
             M P            +  EA +V KKE              + + L  P + SE   + 
Sbjct: 228  GMKPLLEPTEREKENGQGKEAPDVLKKEGKTKVPAEEEPKKAITELLTGPMTGSETRELH 287

Query: 1418 KSEISRKPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSG 1597
                S +  +   E +   +R E++ T          + + PK R+    +H K + ++G
Sbjct: 288  SVAFSSEQKQHEKEDKKTANRGEHIKT---------VSATHPKARNKSTESHQKPSMSNG 338

Query: 1598 EHVSLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---G 1768
            EHV L+K I DDIS  +++  IG  K  + ++ V ++T+AGEN GASM +G   ++    
Sbjct: 339  EHVPLHKGIRDDISKFIHKLGIGQVKYPVDEKPVCVMTIAGENTGASMHVGAEPARKDGS 398

Query: 1769 VHIHRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNA 1948
            +HIHRGYK+ PD++  ATTDGEGS  G+     K  E   T+AY+N+N Q +NNSI+ ++
Sbjct: 399  IHIHRGYKLNPDDSTGATTDGEGSRDGRSKNPTK--EVPVTKAYLNSNTQSVNNSIILDS 456

Query: 1949 SIAERNPGVHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
            S+ E++PGVH+ ++H   E  + S K  PLET K EFN++R++KLTYEPT+RR
Sbjct: 457  SVNEQDPGVHLALSHNLAESSKPSAKPEPLETHKTEFNVTRSQKLTYEPTIRR 509


>ref|XP_009607887.1| PREDICTED: cell wall protein DAN4-like [Nicotiana tomentosiformis]
          Length = 647

 Score =  195 bits (496), Expect = 3e-50
 Identities = 160/457 (35%), Positives = 228/457 (49%), Gaps = 35/457 (7%)
 Frame = +2

Query: 818  SSPSQMG--SHNSRTPNTPQ------RAPQPRSPSLNXXXXXXXXXXXXX--KGTHL--- 958
            SSPSQ    S N+  P +P       RA Q +SPS +               K  H+   
Sbjct: 204  SSPSQATNKSRNTSQPASPSGAANRSRASQSQSPSRSAPQSPAMTPTSSPTRKAPHVLSS 263

Query: 959  -------SSPS---KESRETNQDIS-PSSEPQTLASGEKDS-------KPVASQQEPQLK 1084
                   +SPS    +  E NQ+ + P+S PQ+L    K S       K    Q E + K
Sbjct: 264  KAGQESSNSPSVVKSQMHEKNQETTLPTSPPQSLHGISKPSSKSIEADKEATEQMELRTK 323

Query: 1085 AEVKSDTVDDFNGVDDGTNFKHNEVKVMPSKPS-QLGTTTTISQSPDKPTQTEKLDSSFI 1261
             E KSD        D     +  +  + PS+ S    T T I++ P     +EK DSS I
Sbjct: 324  KETKSDA-------DSEAKIRFTDKAMQPSEESVPKSTITGITKGP-----SEKSDSSSI 371

Query: 1262 NGGNEPLKAMPXXXXXXXXMVQARKTEANEVTKKEVNDFLAPKSESEEPVIEKSEISRKP 1441
            +G  + LK                +++ N    KEV          +E   +      + 
Sbjct: 372  SGEPKMLK----------------ESDTNNQETKEV---------VQESRRKDYGAGERT 406

Query: 1442 DEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILGTHSKSARTSGEHVSLNKE 1621
             EV + KQ    +++ +  S  + R T TTT+QP+ ++ + G+  K+  ++  H+ ++KE
Sbjct: 407  PEVQS-KQEGSSKDQVLKNSGSNGRQTRTTTTQPRNKTVVSGSSHKTVVSNEAHIPIHKE 465

Query: 1622 IMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSS-KG--VHIHRGYK 1792
            I D+IS ++N+ A+G          V++ITLAG+NRGASMQ+G +SS KG  +HIHRGYK
Sbjct: 466  IKDNISKVLNRVAVGDDTQKTGKTPVNVITLAGDNRGASMQLGSDSSTKGGRIHIHRGYK 525

Query: 1793 IKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPG 1972
            I PDE+A+ TTDGEGS K     D+K  EDQ   AY+N N QGINNSIVFN+SI ER+PG
Sbjct: 526  INPDESADTTTDGEGSPK-----DSKTKEDQTIAAYINCNVQGINNSIVFNSSITERDPG 580

Query: 1973 VHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKL 2083
            VHM    +P EP+ S    S  +  KAE N++ A  +
Sbjct: 581  VHM---SIPSEPVHS----SLFDAHKAEVNVTPARTI 610


>ref|XP_011007468.1| PREDICTED: mucin-2-like [Populus euphratica]
          Length = 547

 Score =  188 bits (477), Expect = 2e-48
 Identities = 141/445 (31%), Positives = 216/445 (48%), Gaps = 16/445 (3%)
 Frame = +2

Query: 818  SSPSQMGSHNSRTPNTPQRAPQPRSPSLNXXXXXXXXXXXXXKGTHLSSPSKESRETNQD 997
            +SPSQ     S  P+  +  PQ R+ S+                T   SPS+ + ++   
Sbjct: 110  TSPSQETRAASVPPSPSRATPQSRAASVPPSASRATTQTQAASQTQ--SPSRATPQSRAA 167

Query: 998  ISPSSEPQTLASGEKDSKPVASQQEP-QLKAEVKSDTVDDFNGVDDGTNFKHNEVKVMPS 1174
              P S  +T +     +  V   + P +L + V   T    +                PS
Sbjct: 168  SVPPSPSRTTSQPRTAALAVQQPESPSRLASRVPGKTSSQPSSPSKIATQVQPTGSRSPS 227

Query: 1175 KPSQLGTTTTISQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXXXMVQARKTEANEV 1354
            K  QL T  T SQ P   TQ+           + PL  +P         ++A      + 
Sbjct: 228  KKLQLATQET-SQPPPTSTQSITQQQETKPAISFPLSQVPLEKTE----IRAENVSLRQQ 282

Query: 1355 TKKEVNDFLAPKSESEEPV----IEKSEISRKPDE-----VH--TEKQNVLDRNENVATS 1501
             ++ V    A ++ +  P     +E    S+K D      VH  T  + + +  E++   
Sbjct: 283  QEEPVQASGAVRAATATPASVAALEIPAASQKSDSNAIGPVHPMTLSEQLKNVKEDIFQR 342

Query: 1502 LPSVRPTNTTTSQPKKRSTILGT-HSKSARTSGEHVSLNKEIMDDISTIVNQTAIGGTKN 1678
              +   +N  T++  +    LG  H K + ++GE V L+KEI +DI   V++  +   K+
Sbjct: 343  KRTTVSSNGETAKSARARYALGEYHQKPSMSNGEKVPLHKEIREDIFKFVHKLGMEHIKH 402

Query: 1679 AIYDRHVSLITLAGENRGASMQMGYNSSK---GVHIHRGYKIKPDENAEATTDGEGSFKG 1849
               ++ VS++TLAGENRGASM  G   ++    +HIH GYKI PDE++E  TDGEGS KG
Sbjct: 403  PKGEKPVSIVTLAGENRGASMYEGSEPTRRDGSIHIHHGYKINPDESSENPTDGEGSSKG 462

Query: 1850 KQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASIAERNPGVHMVVTHVPKEPIQSSEKT 2029
             +S+D   TED   +AY+N+N Q +NNSI+F +S+ ER+PGV + ++H  +EP + S K 
Sbjct: 463  GKSKDLLTTEDPAMKAYINSNTQSVNNSILFESSLNERSPGVQLHLSHNDEEPSEYSAKP 522

Query: 2030 SPLETRKAEFNMSRAEKLTYEPTVR 2104
             PLET KAEF ++ AE L YEPTV+
Sbjct: 523  GPLETHKAEFKVTPAEMLAYEPTVK 547


>ref|XP_002281318.1| PREDICTED: proteoglycan 4 [Vitis vinifera]
          Length = 694

 Score =  177 bits (450), Expect = 9e-44
 Identities = 153/531 (28%), Positives = 231/531 (43%), Gaps = 92/531 (17%)
 Frame = +2

Query: 791  APQARSITSSSPSQMGSHNSRTPNTPQRAP---------QPRSPSLNXXXXXXXXXXXXX 943
            +P     TS + ++        P++P R P         QP SPS               
Sbjct: 138  SPVTSQPTSPTRAEYQFRTGSVPSSPSRGPSQYAGPTSSQPTSPSRTATRVPPTSQV--- 194

Query: 944  KGTHLSSPSKESRETNQDISPSSE-------PQTLASGEKDSKPVASQ---QEPQLKAEV 1093
                +S P+  SR     I  SS+       PQTLA+G ++ KP  SQ   QEPQ KA++
Sbjct: 195  ----VSQPASPSRRLQSSIQESSQAPPTYRRPQTLATGPEEMKPAVSQMWSQEPQTKAQI 250

Query: 1094 -----------------------------KSDT---------VDDFNGVDDGTNFKHNEV 1159
                                         KSD+         V +++ +   +     E 
Sbjct: 251  TTETPKSQDQNILGTTVVVAAPKAPASPQKSDSSTLIDDAKPVGEYDQISKASQVDTQEP 310

Query: 1160 KVMPSKPSQLGTTTTI----SQSP-------------DKPTQTEKLDSSFINGGNEPLKA 1288
            +     PS    T+ +    SQ+P             + P+  +K DS  +    +   +
Sbjct: 311  QSKAEIPSWTLPTSRVVAQPSQAPGPVAAALPTSAAIETPSGPQKPDSYAVTSVWKRFSS 370

Query: 1289 MPXXXXXXXXMVQARKTEANEVTKKEVND---FLAPKSESE---EPVIEKSEISRKPDEV 1450
             P          + RK    E+ K+E  +   +  PK  +E     +   S+I  K  + 
Sbjct: 371  EPKSEE-----TEERKKVEQELIKEEKTNGPAYETPKQMAEAEIHTITPHSKIQIKEQQA 425

Query: 1451 HT--------EKQNVLDRNENVATSLPSVRPTNTTTS-QPKKRSTILGTHSKSARTSGEH 1603
             T         K+ +LDR E + TS  S + + T  S  PK RS +     K A  +GE 
Sbjct: 426  VTFPGLQKQPGKREILDRKEILLTSGTSGKQSKTVISTHPKDRSPVSEPRPKPATFNGER 485

Query: 1604 VSLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQMGYNSSK---GVH 1774
              L+KEI +DIS  V++   G  K  + ++ +S+  L GENRGASM     S+K    +H
Sbjct: 486  APLHKEIREDISKFVHKLTTGHPKQPMDEKPISITNLVGENRGASMHFVPESAKKDASLH 545

Query: 1775 IHRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQGINNSIVFNASI 1954
            IHRGYK   DE      +GEGS +    ED+   E + T+AY+N+N Q INNSI  +A++
Sbjct: 546  IHRGYKANTDEGGVENIEGEGSSR----EDSMIKEVEATKAYINSNVQSINNSIFCDAAV 601

Query: 1955 AERNPGVHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPTVRR 2107
             ERNPGV + ++  P    +   K  PLE  KAEF+++ ++KLTYEP ++R
Sbjct: 602  TERNPGVQLGISCTPTAQAKFGCKEEPLEAHKAEFSVTPSQKLTYEPVIKR 652


>ref|XP_012071885.1| PREDICTED: proteoglycan 4 [Jatropha curcas]
            gi|643731182|gb|KDP38520.1| hypothetical protein
            JCGZ_04445 [Jatropha curcas]
          Length = 450

 Score =  169 bits (427), Expect = 3e-42
 Identities = 125/423 (29%), Positives = 202/423 (47%), Gaps = 5/423 (1%)
 Frame = +2

Query: 854  TPNTPQRAP-QPRSPSLNXXXXXXXXXXXXXKGTHLSS-PSKESRETNQDISPSSEPQTL 1027
            TP  P  +P Q R+PS                 T  +S PS  SR  +Q +SP  + Q+ 
Sbjct: 41   TPRPPFLSPPQSRAPSPPRATTESQVVVQQTTQTGATSVPSSPSRVASQPVSPPKKLQSA 100

Query: 1028 ASGEKDSKPVASQQEPQLKAEVKSDTVDDFNGVDDGTNFKHNEVKVMPSKPSQLGTTTTI 1207
               ++ SKP     +P L A                        K  P     L  + T 
Sbjct: 101  T--QESSKPSIISTQPLLSAP-----------------------KQEPKPTVSLLMSQTS 135

Query: 1208 SQSPDKPTQTEKLDSSFINGGNEPLKAMPXXXXXXXXMVQARKTEANEVTKKEVNDFLAP 1387
            SQ     T+T K+ S+     ++P                A++T     T+ ++     P
Sbjct: 136  SQPSSPSTRTTKMQSTV----SQP---------------SAQETSKTAQTQTDLKPLSEP 176

Query: 1388 KSESEEPVIEKSEISRKPDEVHTEKQNVLDRNENVATSLPSVRPTNTTTSQPKKRSTILG 1567
              + +E    K    +    +  E++   D+   +++S   ++  ++T +  K + T   
Sbjct: 177  TEKMKETGKAKEVAEKLKTTIPYEQRQQEDKKTTMSSSGEHIKTVSSTHATTKNKLT--- 233

Query: 1568 THSKSARTSGEHVSLNKEIMDDISTIVNQTAIGGTKNAIYDRHVSLITLAGENRGASMQM 1747
                  +++GE VSL K+I DDI   V++  +   K  + ++ V+++T+AGENRG SM +
Sbjct: 234  --ESYQKSNGEQVSLQKQIRDDIFKFVHKLGVSQLKYPMEEKPVTIVTIAGENRGGSMHV 291

Query: 1748 G---YNSSKGVHIHRGYKIKPDENAEATTDGEGSFKGKQSEDAKATEDQPTEAYVNNNAQ 1918
                      +HIHRGYK  PDE    TTDGE S K ++S   K  ++   +AY+N+NAQ
Sbjct: 292  TAEPVTKDGSIHIHRGYKTNPDET---TTDGEVSTKKRKS---KTRQEPAKKAYLNSNAQ 345

Query: 1919 GINNSIVFNASIAERNPGVHMVVTHVPKEPIQSSEKTSPLETRKAEFNMSRAEKLTYEPT 2098
            GINNS++F+ S+ ER+PG+ + +++   EP + S K   +E+ KAEFN++ A+KLTYEPT
Sbjct: 346  GINNSMIFDTSVNERSPGIQLSLSNNVVEPTKPSAKPETIESHKAEFNVTPAQKLTYEPT 405

Query: 2099 VRR 2107
            +RR
Sbjct: 406  IRR 408


Top