BLASTX nr result

ID: Catharanthus22_contig00005092 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00005092
         (1863 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   515   e-143
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   511   e-142
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...   472   e-130
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]     464   e-128
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...   463   e-127
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...   463   e-127
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   458   e-126
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   454   e-125
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...   453   e-124
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...   447   e-123
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...   442   e-121
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...   441   e-121
ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...   402   e-109
ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutr...   393   e-106
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...   392   e-106
ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798...   390   e-106
gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus...   388   e-105
gb|AFK46430.1| unknown [Medicago truncatula]                          388   e-105
ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806...   382   e-103
ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818...   381   e-103

>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  515 bits (1326), Expect = e-143
 Identities = 283/455 (62%), Positives = 321/455 (70%), Gaps = 4/455 (0%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            MSSV N+              ESRVQPST QKRRW SCWSLYWCFGS+K+SKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE  APG   PV +N NH                  L SDPPSATQSPA GLLSL S S+
Sbjct: 61   PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPA-GLLSLKSLSI 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGGTASIFAIGPY HETQLV+PPVFS FTTEPSTA FTPPPE V +TTP SPEVPF
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQSPGSPGSHLISPASAISNSGTSSPFLDK 1225
            AQLLTSSLARNRR+SG N +FPLSQY+F PYQ PGSPGS+LISP S +SNSGTSSPF  K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1226 LPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 1402
             PI+EFR GE PKFLGYEHF T KWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1403 TQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVS 1579
            T              +TPNGGEP  ++SYLLE QISEVASLA+S+  SE  E ++D RVS
Sbjct: 300  T--------------VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVS 345

Query: 1580 FELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNE 1753
            FELTGE V   +  E V +H     P+   ++   N  +S S  +E    E   G   + 
Sbjct: 346  FELTGEDVPSCREKEPVMSHSQQTLPMDV-SNLLANEMKSGSSMAE----EKTYG---SP 397

Query: 1754 EKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
             KA E     C +KH +++ GSSKDF+FD++K E+
Sbjct: 398  RKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEV 432


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum
            lycopersicum]
          Length = 470

 Score =  511 bits (1315), Expect = e-142
 Identities = 278/455 (61%), Positives = 318/455 (69%), Gaps = 4/455 (0%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            MSSV N+              ESRVQPST QKRRW SCWSLYWCFGS+K+SKRIGHAVLV
Sbjct: 1    MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE  APG   PV +N NH                  L SDPPSATQSPA GLLSL + S+
Sbjct: 61   PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPA-GLLSLKALSI 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGGTASIFAIGPY HETQLV+PPVFS FTTEPSTA FTPPPE V +TTP SPEVPF
Sbjct: 120  NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQSPGSPGSHLISPASAISNSGTSSPFLDK 1225
            AQLLTSSLARNRR+SG N +FPLSQY+F PYQ PGSPGS+LISP S +SNSGTSSPF  K
Sbjct: 180  AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239

Query: 1226 LPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 1402
             PI+EFR GE PKFLGYEHF T KWGSRVGSGS+TPSGWGSRLGSGTLTPNGGISRLGSG
Sbjct: 240  CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299

Query: 1403 TQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVS 1579
            T              +TPNGGEP  ++SYLLE+QISEVASLA+S+  SE  E ++D RVS
Sbjct: 300  T--------------VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVS 345

Query: 1580 FELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNE 1753
            FELT E V   +  E V +H     P+  +     N   S  +   +   E   G   + 
Sbjct: 346  FELTEEDVPSCREKEPVMSHSQPTLPMDVS-----NLLASEMRSGSSMAEEKTYG---SP 397

Query: 1754 EKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
             KA E     C +KH +++ GSSKDF+FD++K E+
Sbjct: 398  RKASESGEDECHRKHRNITFGSSKDFDFDNVKIEV 432


>gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica]
          Length = 499

 Score =  472 bits (1215), Expect = e-130
 Identities = 269/464 (57%), Positives = 317/464 (68%), Gaps = 12/464 (2%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            M SV++S              E+R QP+T  KRRW SCWSLYWCFG +KN KRIGHAVLV
Sbjct: 1    MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHKN-KRIGHAVLV 59

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE   PGA     DN                     L SDPPSATQSPA G LSL S S 
Sbjct: 60   PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPA-GFLSLKSLSA 118

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGG ASIF+IGPY +ETQLV+PPVFS F TEPSTA FTPPPESVQLTTPSSPEVPF
Sbjct: 119  NAYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPF 178

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLD 1222
            AQLLTSSL RNRR+SG N +F LS Y+FQPYQ  PGSPG +LISP SA+SNSGTSSPF D
Sbjct: 179  AQLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPD 238

Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGGI--S 1387
            + P++EFRMGEAPK  G++HF T KWGSR+GSGSLTP   G GSRLGSG+LTP+G    S
Sbjct: 239  RHPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGS 298

Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKS-ENEEE 1558
            RLGSG  TPNG GI SRLGSG LTP+G  P  ++S+LLE+QISEVASLA+SE   +  E 
Sbjct: 299  RLGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVET 358

Query: 1559 LLDPRVSFELTGEHV--LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYC 1732
            + D RVSFELTGE V     ++AV ++ T        A + P+   ++S  S N C    
Sbjct: 359  VFDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSV 418

Query: 1733 NGQIVNEEKALEGEGK-HCIKKHHSVSLGSSKDFNFDSMKQELP 1861
                    + + GEG+    +KH S++LGS+KDFNFD+ K E+P
Sbjct: 419  EESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVP 462


>gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]
          Length = 521

 Score =  464 bits (1193), Expect = e-128
 Identities = 269/485 (55%), Positives = 324/485 (66%), Gaps = 34/485 (7%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            M +V+NS              E+R QP+   KRRW SCWSLYWCFGS+KNSKRIGHAVLV
Sbjct: 1    MRTVNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE   PGA AP  +N                     LQSDPPSATQSPA GLLSLTS S+
Sbjct: 61   PEPVLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSI 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGG  SIFAIGPY +ETQLV+PPVFS FTTEPSTA FTPPPESVQLTTPSSPEVPF
Sbjct: 120  NAYSPGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 1046 AQLLTSSLARNRRH-SGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFL 1219
            AQLLTSSL R RR+ SG N +F LS  +FQPYQ  PGSPG +LISP S +SNSGTSSPF 
Sbjct: 180  AQLLTSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFP 239

Query: 1220 DKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS------------------GWG 1342
            DK PI+ FRMGEAP+ LG+EHF T+KWGSR+GSGSLTP                   G G
Sbjct: 240  DKHPILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLG 299

Query: 1343 SRLGSGTLTPNG-GI-SRLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQIS 1510
            SRLGSG+LTP+G G+ SRLGSG  TPNG G+ SRLGSG+LTP+G   V  +S+LLE+QIS
Sbjct: 300  SRLGSGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQIS 359

Query: 1511 EVASLAHSEKS-ENEEELLDPRVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAP 1678
            EVASLA+S+   +N+  ++D RVSFELTGE V   L    A +   T  E +  +  + P
Sbjct: 360  EVASLANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECP 419

Query: 1679 N-----NCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDS 1843
                  +  ++   ++  C E  + +   +    EGE  H  +KH S++LGS K+FNFD+
Sbjct: 420  TKKDGISANNVDSPNDQSCVEETSNK-TPQSDCREGEDDHFYQKHRSITLGSIKEFNFDN 478

Query: 1844 MKQEL 1858
             K ++
Sbjct: 479  TKADV 483


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
            gi|223547583|gb|EEF49078.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 510

 Score =  463 bits (1192), Expect = e-127
 Identities = 264/441 (59%), Positives = 313/441 (70%), Gaps = 13/441 (2%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLVPESTAPGATAPVADNVNHXXX 748
            ESRVQP+T QKRRW  CWSLYWCFGS+K +KRIGHAVL PE    GA    A+N +    
Sbjct: 36   ESRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTA 94

Query: 749  XXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYDHET 928
                           LQSDPPSATQSPA GLLSLTS S+N++SPGG ASIFAIGPY HET
Sbjct: 95   ITVPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSVNAYSPGGPASIFAIGPYAHET 153

Query: 929  QLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNLRF 1108
            QLVTPP FSAFTTEPSTA FTPPPESVQLTTPSSPEVPFAQLLTSSL R RR+SG N +F
Sbjct: 154  QLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKF 213

Query: 1109 PLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYEHF 1285
             LS Y+FQ Y   PGSPG  LISP S ISNSGTSSPF D+ PI+EFRMGEAPK LG+EHF
Sbjct: 214  ALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHF 273

Query: 1286 -TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPN--GGISRLGSGTQTPNG-GI-SRLGSG 1444
             T KWGSR+GSG++TP   G GSRLGSGT+TP+  G  SRLGSGT TP+G G+ S LGSG
Sbjct: 274  TTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSG 333

Query: 1445 SLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFDEA 1621
            SLTP+   P  ++ + LE+QISEVASLA+SE  S+ +E ++D RVSFEL+GE V +  E+
Sbjct: 334  SLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLES 393

Query: 1622 VTAHD----TVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKHCI 1789
             +       +   P +   DQ  +    M    EN      +G+   E+ + E E +HC 
Sbjct: 394  KSLASCRAFSECPPDSMAEDQIKSG--KMLMTDENLPTGETSGE-TPEKPSGEMEEEHCY 450

Query: 1790 KKHHSVSLGSSKDFNFDSMKQ 1852
            +KH S++LGS K+FNFD+ K+
Sbjct: 451  RKHRSITLGSIKEFNFDNSKE 471


>ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|222858882|gb|EEE96429.1| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 507

 Score =  463 bits (1192), Expect = e-127
 Identities = 267/448 (59%), Positives = 312/448 (69%), Gaps = 18/448 (4%)
 Frame = +2

Query: 569  ESRVQPSTN--QKRRWASCWSLYWCFGSY---KNSKRIGHAVLVPESTAPGATAPVADNV 733
            ESRVQPS++  QKRRW  CWSLYWCFGS+   KNSKRIGHAVLVPE   PGA +   +N 
Sbjct: 23   ESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVLVPEPEVPGAVSSSTENQ 82

Query: 734  NHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGP 913
                                LQSDPPS+TQSPA GLLSLTS S N++SP G ASIFAIGP
Sbjct: 83   TQSTPILLPFIAPPSSPASFLQSDPPSSTQSPA-GLLSLTSLSANAYSPRGPASIFAIGP 141

Query: 914  YDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSG 1093
            Y HETQLVTPPVFSAFTTEPSTA FTPPPESVQLTTPSSPEVPFAQLLTSSL R RR+SG
Sbjct: 142  YAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSG 201

Query: 1094 PNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFL 1270
            PN +F LS Y+FQ Y   PGSPG  +ISP SAISNSGTSSPF D+ P++EFRMGEAPK L
Sbjct: 202  PNQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPDRHPMLEFRMGEAPKLL 261

Query: 1271 GYEHF-TYKWGSRVGSGSL----TPSGWG-SRLGSGTLTPNG-GISRLGSGTQTPNGG-- 1423
            G+EHF T KWGSR+GSGSL    TP G G SRLGSGT+TP+G G+SRL SGT TP+G   
Sbjct: 262  GFEHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMGLSRLCSGTATPDGAGL 321

Query: 1424 ISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEH 1600
             SRLGSG+LTP+   P  Q  +LLE+QISEVASL +SE  S+ EE ++  RVSFEL+GE 
Sbjct: 322  RSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTEENVVHHRVSFELSGEE 381

Query: 1601 VLKFDE--AVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGE 1774
            V +  E  +V +  T  E       + P     ++   E C         + E+ + E E
Sbjct: 382  VARCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQNGEASSEMPEKNSEETE 441

Query: 1775 GKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
              H  +KH S++LGS K+FNFD+ K E+
Sbjct: 442  EDHVYRKHRSITLGSIKEFNFDNSKGEV 469


>ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina]
            gi|557523850|gb|ESR35217.1| hypothetical protein
            CICLE_v10004813mg [Citrus clementina]
          Length = 500

 Score =  458 bits (1178), Expect = e-126
 Identities = 258/465 (55%), Positives = 321/465 (69%), Gaps = 14/465 (3%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            MSSVH+S              ESR++P+  QKRRW SCWSLYWCFGS+K SKRI HAVLV
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE    GA AP A+   H                  LQSDPPSATQSPA GLLSL S S+
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPA-GLLSLNSLSV 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGG AS+FAIGPY HETQLVTPPVFSAFTTEPSTA  TPPPESVQLTTPSSPEVPF
Sbjct: 120  NAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222
            AQLLTSSL R RR+SG N +  LS Y +QPYQ  PGSPG  LISP S +S SGTSSPF D
Sbjct: 180  AQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPD 239

Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNG-GI-S 1387
            + PI++F    APK LG+EHF T KWGSR+GSGS+TP   G GSR+GSG+LTP+G G+ S
Sbjct: 240  RHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGS 299

Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEE 1558
            RLGSGT TP+G G+ SRLGSGSLTP+G  P  ++ ++ E+QISEVASLA+S+  ++++E 
Sbjct: 300  RLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEH 359

Query: 1559 LLDPRVSFELTGEHVLKFDEAVTAHDTVLEP-----VTTNADQAPNNCQSMSKKSENCCC 1723
            ++D RVSFEL+GE V +     +A    + P     +    +   +   + S+     C 
Sbjct: 360  IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419

Query: 1724 EYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
            E  + + + E+   +GE ++C +KH S++LGS K+FNFD+ + E+
Sbjct: 420  EESSNR-MPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEV 463


>ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis]
          Length = 500

 Score =  454 bits (1167), Expect = e-125
 Identities = 256/465 (55%), Positives = 320/465 (68%), Gaps = 14/465 (3%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            MSSVH+S              ESR++P+  QKRRW SCWSLYWCFGS+K SKRI HAVL+
Sbjct: 1    MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE    GA AP A+   H                  LQSDP SATQSPA GLLSL S S+
Sbjct: 61   PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPA-GLLSLNSLSV 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SPGG AS+FAIGPY HETQLVTPPVFSAFTTEPSTA  TPPPESVQLTTPSSPEVPF
Sbjct: 120  NAYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222
            AQLLTSSL R RR+SG N +  LS Y +QPYQ  PGSPG  LISP S +S SGTSSPF D
Sbjct: 180  AQLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPD 239

Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNG-GI-S 1387
            + PI++F    APK LG+EHF T KWGSR+GSGS+TP   G GSR+GSG+LTP+G G+ S
Sbjct: 240  RHPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGS 299

Query: 1388 RLGSGTQTPNG-GI-SRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEE 1558
            RLGSGT TP+G G+ SRLGSGSLTP+G  P  ++ ++ E+QISEVASLA+S+  ++++E 
Sbjct: 300  RLGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEH 359

Query: 1559 LLDPRVSFELTGEHVLKFDEAVTAHDTVLEP-----VTTNADQAPNNCQSMSKKSENCCC 1723
            ++D RVSFEL+GE V +     +A    + P     +    +   +   + S+     C 
Sbjct: 360  IIDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCP 419

Query: 1724 EYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
            E  + + + E+   +GE ++C +KH S++LGS K+FNFD+ + E+
Sbjct: 420  EESSNR-MPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEV 463


>gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma
            cacao]
          Length = 485

 Score =  453 bits (1166), Expect = e-124
 Identities = 266/461 (57%), Positives = 314/461 (68%), Gaps = 11/461 (2%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            M SV++S              +SRVQP+T QK+RW SCW LYWCFGS KNSKRIGHAVLV
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE   PGA+   A+NV++                  LQSDPPSATQSPA GLLSLTS S+
Sbjct: 61   PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPA-GLLSLTSLSV 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SP G ASIFAIGPY HETQLVTPPVFSA TTEPSTA FTPPPESVQLTTPSSPEVPF
Sbjct: 120  NAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222
            AQLLTSSL R RR+SG N +F LS Y+FQ YQ  PGSPG +LISP SAISNSGTSSPF D
Sbjct: 180  AQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPD 239

Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGGISRL 1393
            + PI+EFRMGEAPK LG+E+F T KWGSR+GSGSLTP   G GSRLGSG++TP+G    +
Sbjct: 240  RRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG----M 295

Query: 1394 GSGTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAH-SEKSENEEELLDP 1570
            G G        SRLGSGSLTP+G  P  ++ +L+ SQISEVA LA+ +   +N+E ++D 
Sbjct: 296  GLG--------SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDH 347

Query: 1571 RVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNGQ 1741
            RVSFEL+GE V   L+    + +      P    A+      +   KK     CE    +
Sbjct: 348  RVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKE--RDGIKKDLESSCELFIRE 405

Query: 1742 IVNE--EKAL-EGEGKHCIKKHHSVSLGSSKDFNFDSMKQE 1855
              NE  EKA  E E +H  +KH SV+LGS K+FNFD+ K E
Sbjct: 406  TSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 446


>gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma
            cacao]
          Length = 489

 Score =  447 bits (1151), Expect = e-123
 Identities = 266/465 (57%), Positives = 314/465 (67%), Gaps = 15/465 (3%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQ----KRRWASCWSLYWCFGSYKNSKRIGH 673
            M SV++S              +SRVQP+T Q    K+RW SCW LYWCFGS KNSKRIGH
Sbjct: 1    MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60

Query: 674  AVLVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLT 853
            AVLVPE   PGA+   A+NV++                  LQSDPPSATQSPA GLLSLT
Sbjct: 61   AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPA-GLLSLT 119

Query: 854  SFSMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSP 1033
            S S+N++SP G ASIFAIGPY HETQLVTPPVFSA TTEPSTA FTPPPESVQLTTPSSP
Sbjct: 120  SLSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSP 179

Query: 1034 EVPFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSS 1210
            EVPFAQLLTSSL R RR+SG N +F LS Y+FQ YQ  PGSPG +LISP SAISNSGTSS
Sbjct: 180  EVPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSS 239

Query: 1211 PFLDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPS--GWGSRLGSGTLTPNGG 1381
            PF D+ PI+EFRMGEAPK LG+E+F T KWGSR+GSGSLTP   G GSRLGSG++TP+G 
Sbjct: 240  PFPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDG- 298

Query: 1382 ISRLGSGTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAH-SEKSENEEE 1558
               +G G        SRLGSGSLTP+G  P  ++ +L+ SQISEVA LA+ +   +N+E 
Sbjct: 299  ---MGLG--------SRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDET 347

Query: 1559 LLDPRVSFELTGEHV---LKFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729
            ++D RVSFEL+GE V   L+    + +      P    A+      +   KK     CE 
Sbjct: 348  IVDHRVSFELSGEDVAPCLESKSLLPSRAVSEYPKDLVAEGRKE--RDGIKKDLESSCEL 405

Query: 1730 CNGQIVNE--EKAL-EGEGKHCIKKHHSVSLGSSKDFNFDSMKQE 1855
               +  NE  EKA  E E +H  +KH SV+LGS K+FNFD+ K E
Sbjct: 406  FIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGE 450


>ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus]
          Length = 497

 Score =  442 bits (1137), Expect = e-121
 Identities = 257/463 (55%), Positives = 316/463 (68%), Gaps = 12/463 (2%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFG--SYKNSKRIGHAV 679
            M+S++NS              E+RVQP+T  KRRW SCWSLYWCFG  S K++KRIGHAV
Sbjct: 1    MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60

Query: 680  LVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859
            LVPE   PGA AP  ++                     LQS+P S TQSPA GLLSLT+ 
Sbjct: 61   LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPA-GLLSLTAL 119

Query: 860  SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEV 1039
            S+N++SP G ASIFAIGPY ++TQLV+PPVFSAFTTEPSTA  TPPPESVQLTTPSSPEV
Sbjct: 120  SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 179

Query: 1040 PFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPF 1216
            PFA+LLTSSL+   +  G N +F LS  DFQPYQ  PGSPG+HLISP S ISNSGTSSPF
Sbjct: 180  PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 239

Query: 1217 LDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWG--SRLGSGTLTPNG-GI 1384
             DK PI+EFRM +APK LG EHF T KW SR+GSGSLTP G G  SRLGSGTLTP+G G+
Sbjct: 240  PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 299

Query: 1385 -SRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE 1555
             SRLGSG+ TPNG    SRLGSG+LTP+G     Q+S LL++QISEVASLA+SE +  + 
Sbjct: 300  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSE-TGCQN 358

Query: 1556 ELLDPRVSFELTGEHVLK--FDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729
            ++ + RVSFELTGE V +   ++++T+  T  E     +    N  +  S+++E   CE+
Sbjct: 359  DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAET--CEF 416

Query: 1730 CNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
             + +     +   GE   C +   +V+LGS K+FNFD  K E+
Sbjct: 417  FDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEI 459


>ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus]
          Length = 497

 Score =  441 bits (1133), Expect = e-121
 Identities = 256/463 (55%), Positives = 315/463 (68%), Gaps = 12/463 (2%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFG--SYKNSKRIGHAV 679
            M+S++NS              E+RVQP+T  KRRW SCWSLYWCFG  S K++KRIGHAV
Sbjct: 1    MASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAV 60

Query: 680  LVPESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859
            LVPE   PGA AP  ++                     LQS+P S TQSPA GLLS T+ 
Sbjct: 61   LVPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPA-GLLSFTAL 119

Query: 860  SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEV 1039
            S+N++SP G ASIFAIGPY ++TQLV+PPVFSAFTTEPSTA  TPPPESVQLTTPSSPEV
Sbjct: 120  SVNNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEV 179

Query: 1040 PFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPF 1216
            PFA+LLTSSL+   +  G N +F LS  DFQPYQ  PGSPG+HLISP S ISNSGTSSPF
Sbjct: 180  PFAKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPF 239

Query: 1217 LDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWG--SRLGSGTLTPNG-GI 1384
             DK PI+EFRM +APK LG EHF T KW SR+GSGSLTP G G  SRLGSGTLTP+G G+
Sbjct: 240  PDKHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGM 299

Query: 1385 -SRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE 1555
             SRLGSG+ TPNG    SRLGSG+LTP+G     Q+S LL++QISEVASLA+SE +  + 
Sbjct: 300  GSRLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSE-TGCQN 358

Query: 1556 ELLDPRVSFELTGEHVLK--FDEAVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEY 1729
            ++ + RVSFELTGE V +   ++++T+  T  E     +    N  +  S+++E   CE+
Sbjct: 359  DVTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAET--CEF 416

Query: 1730 CNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
             + +     +   GE   C +   +V+LGS K+FNFD  K E+
Sbjct: 417  FDIKTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEI 459


>ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera]
          Length = 448

 Score =  402 bits (1034), Expect = e-109
 Identities = 240/463 (51%), Positives = 289/463 (62%), Gaps = 12/463 (2%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            M SV+NS              ESRVQP+T QKRRW SC SLYWCFGS+++SKRIGHAVLV
Sbjct: 1    MRSVNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLV 60

Query: 686  PESTAPGATAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSM 865
            PE   PGA AP ++N+N                   LQSDPPS+TQSPA G LSLT+ S+
Sbjct: 61   PEPMVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPA-GFLSLTALSV 119

Query: 866  NSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPF 1045
            N++SP G AS+FAIGPY HETQLV+PPVFS F TEPSTA FTPPPESVQLTTPSSPEVPF
Sbjct: 120  NAYSPSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 1046 AQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTSSPFLD 1222
            AQLLTSSL R+RR+SG N +  LS Y+FQPYQ  P SP  HLISP   ISNSGTSSPF D
Sbjct: 180  AQLLTSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPD 236

Query: 1223 KLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGS 1399
            + PIV     EAPK LG+EHF T +WGSR+GSGSLTP G G                   
Sbjct: 237  RRPIV-----EAPKLLGFEHFSTRRWGSRLGSGSLTPDGAG------------------- 272

Query: 1400 GTQTPNGGISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRV 1576
                                   P  ++S+LLE+QISEVASLA+SE  S+N E ++D RV
Sbjct: 273  -----------------------PASRDSFLLENQISEVASLANSESGSQNGETVIDHRV 309

Query: 1577 SFELTGEHVLKFDE------AVTAHDTVLEPVTTNADQAPNNCQSMSKKSENCCCEYCNG 1738
            SFEL GE V    E      A T  +T+ + V     +       +S+ +EN CCE+C G
Sbjct: 310  SFELAGEDVAVCVEKKPVASAETVQNTLQDIV--EEGEIERERDGISESTEN-CCEFCVG 366

Query: 1739 QIV---NEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
            + +   +E+ + EGE + C KKH  +  GS K+FNFD+ K E+
Sbjct: 367  EALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEV 409


>ref|XP_006413289.1| hypothetical protein EUTSA_v10025027mg [Eutrema salsugineum]
            gi|557114459|gb|ESQ54742.1| hypothetical protein
            EUTSA_v10025027mg [Eutrema salsugineum]
          Length = 489

 Score =  393 bits (1010), Expect = e-106
 Identities = 241/470 (51%), Positives = 299/470 (63%), Gaps = 19/470 (4%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLV 685
            M +V+NS              ESRVQPS+ QK+RW SCWSLYWCFGS KN+KRIGHAVLV
Sbjct: 1    MRNVNNSVDTVNAAASAIVSAESRVQPSSVQKKRWGSCWSLYWCFGSQKNNKRIGHAVLV 60

Query: 686  PESTAPGA--TAPVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSF 859
            PE  + G+   APV ++  +                  LQS PPS + +P AGLLSLT  
Sbjct: 61   PEPVSSGSVPVAPVQNSSTNSTSIFLPFIAPPSSPASFLQSGPPSVSHTPPAGLLSLT-- 118

Query: 860  SMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQL--TTPSSP 1033
             +N++S    AS FAIGPY HETQ VTPPV SAFTT PSTA FTPPPES Q+  TTPSSP
Sbjct: 119  -VNTYSRNEPASAFAIGPYAHETQPVTPPVDSAFTTRPSTAPFTPPPESAQMASTTPSSP 177

Query: 1034 EVPFAQLLTSSLARNRRHS-GPNLRFPLSQYDFQPYQ-SPGSPGSHLISPASAISNSGTS 1207
            EVPFAQLLTSSL R RR+S G N +F  + Y+F  +Q  PGSPG +LISP S ISNSGTS
Sbjct: 178  EVPFAQLLTSSLERARRNSGGMNQKFSAAHYEFHSHQVFPGSPGGNLISPGSVISNSGTS 237

Query: 1208 SPFLDKLPIVEFRMGEAPKFLGYEHFT-YKWGSRVGSGSLTPSGWGSRLGSGTLTPNGG- 1381
            SP+  K  I+EFR+GE PKFLG+EHFT  KWGSR GSGS+TP+G GSRLGSG LTP+GG 
Sbjct: 238  SPYPGKCSIIEFRIGEPPKFLGFEHFTARKWGSRFGSGSITPAGQGSRLGSGALTPDGGG 297

Query: 1382 -ISRLGSGTQTPNGG--ISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENE 1552
              S+L SG  TPNG   +SR GSG++TP        ES LL+ QISEVASLA+S+   + 
Sbjct: 298  LGSKLASGAVTPNGAEMVSRKGSGNVTP-------LESSLLDCQISEVASLANSDHGSSR 350

Query: 1553 EE----LLDPRVSFELTGEHVLKFDEA----VTAHDTVLEPVTTNADQAPNNCQSMSKKS 1708
             +    ++  RVSFELTGE V +   +        D + E    N D    N +++S  +
Sbjct: 351  HDEAVAVVSHRVSFELTGEDVARCFASKLNRAGLDDCLHE--KANGDHTDTN-EAVSPTN 407

Query: 1709 ENCCCEYCNGQIVNEEKALEGEGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
                    +G +   + + E E +  +K   S+SLGSSK+F FD+ K+E+
Sbjct: 408  R------WSGSVPGSKTSGETESEQSLKL-RSISLGSSKEFKFDNTKEEM 450


>ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum]
          Length = 492

 Score =  392 bits (1006), Expect = e-106
 Identities = 237/445 (53%), Positives = 304/445 (68%), Gaps = 15/445 (3%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSYKNSKRIGHAVLVPESTAPGATAPVADNV-NHXX 745
            ESRVQPST+ K+RW SC+SL  CFGS+K+SKRIGHAVLVPE  AP    PVA +  N   
Sbjct: 23   ESRVQPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEPVAP--IVPVAHSAPNPST 80

Query: 746  XXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNS-FSPGGTASIFAIGPYDH 922
                            LQSDPPS+T SPAAGLLS    S+N+ +S  G+ASIF IGPY +
Sbjct: 81   VIVMPFIAPPSSPASFLQSDPPSSTHSPAAGLLSP---SVNAAYSSSGSASIFTIGPYAY 137

Query: 923  ETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNL 1102
            ETQLV+PPVFS FTTEPSTA FTPPPESVQ+TTPSSPEVPFAQLL SSL R R+++G + 
Sbjct: 138  ETQLVSPPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQLLASSLDRARKNNGSH- 196

Query: 1103 RFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYE 1279
            +F L  Y+FQPYQ  PGSPG+ L+SP S IS SGTS+PF D+   +E   GE PK LG+E
Sbjct: 197  KFALYNYEFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELSRGETPKILGFE 256

Query: 1280 HF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG--GISRLGSGTQTPN--GGISRLG 1438
            HF T +W SR+GSGSLTP  +G GSRLGSG+LTP+G    SRLGSG  TP+  G  SRLG
Sbjct: 257  HFSTRRWNSRIGSGSLTPDGAGQGSRLGSGSLTPDGFAHASRLGSGCTTPDGLGQDSRLG 316

Query: 1439 SGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFD 1615
            SGSLTP+G  P  +ES  +++QISE  S+A+SE  S++   L+D RVSFELTGE V +  
Sbjct: 317  SGSLTPDGAGPTTRES--MQNQISEDVSVANSEHGSQSNATLVDHRVSFELTGEDVARC- 373

Query: 1616 EAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKH 1783
                    +L  +++++     + P + + + K++ N CC+ C+ +  ++       G+ 
Sbjct: 374  -LANKAGALLRNMSSSSQGILAKDPIDRERILKET-NGCCDVCSRKTNDKSDNSCAGGEQ 431

Query: 1784 CIKKHHSVSLGSSKDFNFDSMKQEL 1858
            C +K +SVS  SSK+FNFD+ K ++
Sbjct: 432  CCQKRNSVS--SSKEFNFDNRKGDV 454


>ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798631 isoform X1 [Glycine
            max]
          Length = 504

 Score =  390 bits (1003), Expect = e-106
 Identities = 238/470 (50%), Positives = 302/470 (64%), Gaps = 22/470 (4%)
 Frame = +2

Query: 506  MSSVHNSXXXXXXXXXXXXXXESRVQPSTN-QKRRWASCWSLYWCFGSYKNSKRIGHAVL 682
            M +V+N+              ESR+QP+T   K+RW SCWSL WCFG +KNSKR+G+AVL
Sbjct: 1    MGTVNNTVDTVNAAASAIVYAESRIQPTTTVPKKRWGSCWSLCWCFGPHKNSKRVGNAVL 60

Query: 683  VPESTAPGATA---PVADNVNHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLT 853
            VPE   P       P     N                   LQSDPPSATQSP  GL SL+
Sbjct: 61   VPEPVEPIGPVGFHPATAAPNPSTAIVMPFIVPPSSPASFLQSDPPSATQSPV-GLFSLS 119

Query: 854  SFSMNSFSPGGTASIFAIGPYDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSP 1033
            S ++N+   GG ASIFAIGPY +ETQLV+PPVFS FTTEPSTA FTPPPESVQLTTPSSP
Sbjct: 120  SLTVNA--SGGPASIFAIGPYTYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSP 177

Query: 1034 EVPFAQLLTSSLARNRRHSGPNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSS 1210
            EVPFAQLL SSL RN + +G N RF LS Y+FQPYQ  PGSPG+ L+SP S IS SG+S+
Sbjct: 178  EVPFAQLLASSLDRNCKSNGTNQRFALSNYEFQPYQQYPGSPGTQLVSPRSIISTSGSST 237

Query: 1211 PFLDKLPIVEFRMGEAPKFLGYEHF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNGG 1381
            PF D+ P++EF  GEAPK LG+E+F T+KW SR+GSGSLTP  +G GSRLGSG+ TP+  
Sbjct: 238  PFPDRHPVLEFHKGEAPKLLGFENFLTHKWNSRLGSGSLTPDSAGQGSRLGSGSFTPDAV 297

Query: 1382 --ISRLGSGTQTPNG--GISRLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSE-KSE 1546
               S+LGSG  TP+G    SR GSGSLTP+   P  +    +  QISEV S+ +SE + +
Sbjct: 298  KLASQLGSGCLTPDGLCQDSRFGSGSLTPDAVAPTARNDIDIGKQISEVTSIVNSENECQ 357

Query: 1547 NEEELLDPRVSFELTGEHVLKFDEAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSEN 1714
             +  L+D RVSFELTG  V +   A  +  ++L  ++ ++     + P + + + K S N
Sbjct: 358  PKAALVDHRVSFELTGVDVPRC-LANKSGSSLLGNMSGSSQGTLVEDPVDIEKIQKNS-N 415

Query: 1715 CCCEYCNGQIVN--EEKALE--GEG-KHCIKKHHSVSLGSSKDFNFDSMK 1849
              C +C+ +  N   +K+    GEG + C +KHH  S  SSK+FNFD+ K
Sbjct: 416  SSCAFCSRKTSNASNDKSCNSPGEGAEQCCRKHH--SFNSSKEFNFDNRK 463


>gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus vulgaris]
          Length = 500

 Score =  388 bits (997), Expect = e-105
 Identities = 234/444 (52%), Positives = 289/444 (65%), Gaps = 17/444 (3%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSYKN---SKRIGHAVLVPESTAP-GATAPVADNVN 736
            ESRVQP+T  K+RW  CWS YWCFGSYK+   SKRIGHAVLVPE  AP G  A  A   N
Sbjct: 23   ESRVQPTTVPKKRWGGCWSQYWCFGSYKSTKSSKRIGHAVLVPEPVAPTGPAAAAAAPPN 82

Query: 737  HXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPY 916
                               +QSDPPSA QSP  GLLSL+S + +++S GG AS+F IGPY
Sbjct: 83   PSTAIVMPFIAPPSSPASLIQSDPPSAIQSPP-GLLSLSSLAASAYSSGGPASMFTIGPY 141

Query: 917  DHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGP 1096
             +ETQLV+PPVFS FTTEPSTA FTPPPESV  TTPSSP+VPFAQLL SSL R R+ +G 
Sbjct: 142  AYETQLVSPPVFSNFTTEPSTAPFTPPPESVHQTTPSSPDVPFAQLLASSLDRARKSNG- 200

Query: 1097 NLRFPLSQYDFQPY-QSPGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLG 1273
            N +F L  YDFQPY Q PGSPG  LISP SA S SGTS+PF D+ P +EFR GE PK LG
Sbjct: 201  NQKFALYNYDFQPYHQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFRKGETPKILG 260

Query: 1274 YEHF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG-GI-SRLGSGTQTPN--GGISR 1432
             EHF T +W SR+GSGSLTP  +G GSRLGSG++TP+G G+ SRLGSG  TP+  G  SR
Sbjct: 261  VEHFSTQRWSSRLGSGSLTPDGAGQGSRLGSGSVTPDGVGLASRLGSGCATPDGLGQESR 320

Query: 1433 LGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSE-NEEELLDPRVSFELTGEHVLK 1609
            LGSG LTP+G   + + +  +++QIS+ A+LA+S+    +   L+D RVSFELTGE V +
Sbjct: 321  LGSGCLTPDGVGQINENNLPVQNQISKEATLANSDNGHPSNATLIDHRVSFELTGEDVAR 380

Query: 1610 FDEAVTAHDTVLEPVTTNADQAPNNCQSMSK----KSENCCCEYCNGQIVNEEKALEGEG 1777
                +     VL    + + Q       + +    +  +  C  C  +  ++     GEG
Sbjct: 381  ---CLANKTGVLLRNMSGSSQGILAKDPVDRERVLRDTDASCNVCTEKTDDKPYNPIGEG 437

Query: 1778 KHCIKKHHSVSLGSSKDFNFDSMK 1849
            + C  K +SV+  SSK+FNFDS K
Sbjct: 438  EQCFHKQNSVN--SSKEFNFDSSK 459


>gb|AFK46430.1| unknown [Medicago truncatula]
          Length = 487

 Score =  388 bits (997), Expect = e-105
 Identities = 235/445 (52%), Positives = 299/445 (67%), Gaps = 15/445 (3%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSY-KNSKRIGHAVLVPESTAPGATAPVADNV-NHX 742
            ESRVQP+++ K+RW SC+SL  CFGS+ K S+RIGHAVLVPE  AP  T PVA+   N  
Sbjct: 23   ESRVQPTSSPKKRWGSCFSLPSCFGSHNKTSERIGHAVLVPEPVAP--TVPVANAAPNPS 80

Query: 743  XXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYDH 922
                             LQSDPPS+T SPAAGLLSL+S S N++S  G AS+F IGPY +
Sbjct: 81   TAIVIPFIAPPSSPASFLQSDPPSSTHSPAAGLLSLSSLSANAYSTSGPASMFTIGPYAY 140

Query: 923  ETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPNL 1102
            ETQLV+PPVFS FT EPSTA FTPPPESV +TTPSSPEVPFAQLL SSL R R+    N 
Sbjct: 141  ETQLVSPPVFSNFTAEPSTANFTPPPESVLMTTPSSPEVPFAQLLASSLDRARK---SNH 197

Query: 1103 RFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFLGYE 1279
            +F L  Y++QPYQ  PGSPG+ L+SP S IS SGTS+PF D+   +E R GEAPK LG+E
Sbjct: 198  KFALYNYEYQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSLELRKGEAPKILGFE 257

Query: 1280 HF-TYKWGSRVGSGSLTP--SGWGSRLGSGTLTPNG--GISRLGSGTQTPN--GGISRLG 1438
            HF T KW SR+GSGSLTP  +G GSRLGSG+LTP+G    SRLGSG  TP+  G  SRLG
Sbjct: 258  HFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLGSGCATPDGLGQDSRLG 317

Query: 1439 SGSLTPNGGEPVCQESYLLESQISEVASLAHSEK-SENEEELLDPRVSFELTGEHVLKFD 1615
            SGSLTP+G  P  + S  +++QI    S+A+S+  S+    L+D RVSFELTGE V +  
Sbjct: 318  SGSLTPDGVGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVDHRVSFELTGEDVARCL 377

Query: 1616 EAVTAHDTVLEPVTTNAD----QAPNNCQSMSKKSENCCCEYCNGQIVNEEKALEGEGKH 1783
               T    +L  +++++     + P + + + K++ N CC+ C+G+ +         G+H
Sbjct: 378  ANKTG--ALLRNMSSSSQGILAKDPIDREKILKET-NSCCDVCSGKAIG--------GEH 426

Query: 1784 CIKKHHSVSLGSSKDFNFDSMKQEL 1858
            C  K +SVS  SSK+FNFD+ K ++
Sbjct: 427  CCPKRNSVS--SSKEFNFDNRKGDV 449


>ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806399 [Glycine max]
          Length = 515

 Score =  382 bits (982), Expect = e-103
 Identities = 232/448 (51%), Positives = 289/448 (64%), Gaps = 18/448 (4%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSYKNSK---RIGHAVLVPESTAPG--ATAPVADNV 733
            ESRVQP+   K+RW  CWS YWCFGS K+SK   RIGHAVLVPE  AP   A A  A   
Sbjct: 37   ESRVQPTDAPKKRWGGCWSQYWCFGSRKSSKSSKRIGHAVLVPEPAAPTGPAAAATAAAP 96

Query: 734  NHXXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGP 913
            N                   LQSDPPS  QSP  GLLSL++ + N++S GG A++F IGP
Sbjct: 97   NPSTAIVMPFIAPPSSPASFLQSDPPSGIQSPP-GLLSLSALAANAYSSGGPATMFTIGP 155

Query: 914  YDHETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSG 1093
            Y +ETQLV+PPVFSAFTTEPSTA +TPPPESVQ TTPSSP+VPFAQLL SSL R R+ +G
Sbjct: 156  YAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLLASSLDRARKCNG 215

Query: 1094 PNLRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRMGEAPKFL 1270
             + +FPL  Y+F PYQ  PGSPG  LISP SA S SGTS+PF D+ P +EF  GE PK L
Sbjct: 216  -HQKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFPKGETPKIL 274

Query: 1271 GYEHF-TYKWGSRVGSGSLTP-SGW-GSRLGSGTLTPNG-GI-SRLGSGTQTPN--GGIS 1429
            G EHF T +WGSR+GSGSLTP S W GSRLGSG+LTP+G G+ SRLGSG  TP+  G  S
Sbjct: 275  GVEHFSTRRWGSRLGSGSLTPDSAWQGSRLGSGSLTPDGVGLASRLGSGCVTPDGLGQES 334

Query: 1430 RLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSE-NEEELLDPRVSFELTGEHVL 1606
            RLGSG LTP+   P  Q +  +++QIS+ A+LA S+    +   L+D RVSFELTGE V 
Sbjct: 335  RLGSGCLTPDSAGPTNQNNISVQNQISKEATLADSDNGHPSNATLVDHRVSFELTGEDVA 394

Query: 1607 KFDEAVTAHDTVLEPVTTNADQAPNNCQSMSKK----SENCCCEYCNGQIVNEEKALEGE 1774
            +    +     VL    + + Q       + ++      N  C  C  +  ++     G+
Sbjct: 395  R---CLANKTGVLLRNMSGSSQGILTKDPVDRERVQIDTNSSCNACTEKTDDKPDNPVGK 451

Query: 1775 GKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
            G+ C+ K +SV+  SSK+FNFD+ K ++
Sbjct: 452  GEQCLHKQNSVN--SSKEFNFDNRKGDV 477


>ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818313 isoform X1 [Glycine
            max]
          Length = 509

 Score =  381 bits (979), Expect = e-103
 Identities = 231/449 (51%), Positives = 293/449 (65%), Gaps = 19/449 (4%)
 Frame = +2

Query: 569  ESRVQPSTNQKRRWASCWSLYWCFGSYKNSK---RIGHAVLVPESTAPGATAPVADNVNH 739
            ESRVQP+   K+RW  CWS YWCFGS K+SK   RIGHAVLVPE  AP   A  A   N 
Sbjct: 36   ESRVQPTDAPKKRWGGCWSQYWCFGSCKSSKSSKRIGHAVLVPEPAAPTGPAAAAAAPNP 95

Query: 740  XXXXXXXXXXXXXXXXXXLQSDPPSATQSPAAGLLSLTSFSMNSFSPGGTASIFAIGPYD 919
                              LQSDPPS  QSP  GLLSL++ + N++S GG AS+F IGPY 
Sbjct: 96   SAAIVMPFIAPPSSPASFLQSDPPSGIQSPP-GLLSLSALAANAYSSGGPASMFTIGPYA 154

Query: 920  HETQLVTPPVFSAFTTEPSTACFTPPPESVQLTTPSSPEVPFAQLLTSSLARNRRHSGPN 1099
            +ETQLV+PPVFSAFTTEPSTA +TPPPESVQ TTPSSP+VPFAQLL SSL R R+ +G N
Sbjct: 155  YETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLLASSLDRARKSNG-N 213

Query: 1100 LRFPLSQYDFQPYQS-PGSPGSHLISPASAISNSGTSSPFLDKLPIVEFRM--GEAPKFL 1270
             +FPL  Y+F PYQ  PGSPG  LISP SA S SGTS+PF D+ P +EF    GE P+ L
Sbjct: 214  HKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPTLEFPFPKGETPRIL 273

Query: 1271 GYEHF-TYKWGSRVGSGSLTPSG-W-GSRLGSGTLTPNG-GI-SRLGSGTQTPNG-GI-S 1429
            G+EHF T +WGSR+GSGSLTP G W GSRLGSG+LTP+G G+ SRLGSG  TP+G G+ S
Sbjct: 274  GFEHFSTRRWGSRLGSGSLTPDGAWQGSRLGSGSLTPDGIGLASRLGSGCVTPDGLGLES 333

Query: 1430 RLGSGSLTPNGGEPVCQESYLLESQISEVASLAHSEKSENEE-ELLDPRVSFELTGEHVL 1606
            RLGSG LTP+   P+ Q +  +++QIS+ A+LA ++   +    L+D RVSFELTGE V 
Sbjct: 334  RLGSGCLTPDSAGPINQNNISVQNQISKEATLADTDNGHSSNATLIDHRVSFELTGEDVA 393

Query: 1607 KFDEAVTAHDTVLEPVTTNADQA-----PNNCQSMSKKSENCCCEYCNGQIVNEEKALEG 1771
            +    +     VL    + + Q      P + + + K ++ C       +  +++     
Sbjct: 394  R---CLANKTGVLLRNMSGSSQGILSKDPVDRERVQKDTDTCT------EKTDDKPDNSV 444

Query: 1772 EGKHCIKKHHSVSLGSSKDFNFDSMKQEL 1858
             G+ C+ K +SV+  SSK+FNFD+ K ++
Sbjct: 445  GGEQCLHKQNSVN--SSKEFNFDNRKGDV 471


Top