BLASTX nr result

ID: Sinomenium21_contig00005678 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00005678
         (2541 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma...   662   0.0  
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              653   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   653   0.0  
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   649   0.0  
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   644   0.0  
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   642   0.0  
ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thali...   637   e-180
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     634   e-179
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   634   e-179
ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma...   631   e-178
ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma...   622   e-175
gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus...   621   e-175
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   620   e-174
ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutr...   619   e-174
emb|CAB82813.1| putative protein [Arabidopsis thaliana]               618   e-174
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   616   e-173
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   615   e-173
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   615   e-173
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   613   e-173
ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phas...   613   e-172

>ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590680339|ref|XP_007040835.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508778078|gb|EOY25334.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  662 bits (1707), Expect = 0.0
 Identities = 340/478 (71%), Positives = 385/478 (80%), Gaps = 9/478 (1%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTEE-AVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            LLFL S   C    QL+SALAR ++ S E+  VWEV+G K TKL+ D  +DAFV +   +
Sbjct: 104  LLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIV 163

Query: 1924 FTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQS 1769
                SL        C D  ++L+LPEG+PDSVTSDYL+YSLWRGVQGVASQISGVLATQ+
Sbjct: 164  NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQA 223

Query: 1768 LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 1589
            LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF
Sbjct: 224  LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 283

Query: 1588 GMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGM 1409
            G+E+LTPAFPHLFV I                ATRSCFYAGFA+QRNFAEVIAKGEAQGM
Sbjct: 284  GLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGM 343

Query: 1408 VSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVF 1229
            VSKSIGI+LGIALAN +GSST L+LASF VVT VHM+CNLKSYQS+ LRTLN YRASLVF
Sbjct: 344  VSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVF 403

Query: 1228 SEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGS 1049
            SEYLLSGQ P +KEVNDEEPLFP +PF++     + +S  LS+EAK AA +IE+RL LGS
Sbjct: 404  SEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGS 463

Query: 1048 KLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLE 869
            KLS+++NNKEDA+ALF LY  EGY+LTEH G+FC+VLKE S PQDMLKSLFQVNYLYWLE
Sbjct: 464  KLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLE 523

Query: 868  KNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIR 695
            +N GIE+   S DC+ GG+LQIS++Y QREFNHVK D E  GW  DGLIARPLPNRIR
Sbjct: 524  RNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIR 581


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  653 bits (1684), Expect = 0.0
 Identities = 333/481 (69%), Positives = 383/481 (79%), Gaps = 11/481 (2%)
 Frame = -3

Query: 2104 ILLFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            +LLF+ S+    +HFQL +AL+   K+  EE VWEVRGGK  K++ D  KD F++    I
Sbjct: 5    VLLFVFSVLYSFFHFQLDTALS---KEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGI 61

Query: 1924 FTWNS-----------LCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLA 1778
                +            C + FL+LMLPEG+P SVTSDYL+Y+LWRGVQGVASQISGVLA
Sbjct: 62   GAVGAPKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLA 121

Query: 1777 TQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLEN 1598
            TQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLEN
Sbjct: 122  TQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLEN 181

Query: 1597 AAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEA 1418
            AA+G+EILTPAFPH F++I                +TRSCFYAGFA+QRNFAEVIAKGEA
Sbjct: 182  AAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEA 241

Query: 1417 QGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRAS 1238
            QGMVSKSIGIMLGIALAN IGSS PLS ASF VVT VHMFCNLKSYQS+ LRTLNPYRAS
Sbjct: 242  QGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRAS 301

Query: 1237 LVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLL 1058
            LVFSEYLLSGQVP +KEVN+EEPLFP +P ++A    K QS  LS EAK AA EIE+RL 
Sbjct: 302  LVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQ 361

Query: 1057 LGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLY 878
            LGSKLSE++++KED +ALFDLY +E Y+LTEH+GRF ++LKE  SPQDMLKS+F VNYLY
Sbjct: 362  LGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLY 421

Query: 877  WLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRI 698
            WLE+N GI S   SDDC+ GG+LQISL+Y QREFNH+K D E  GW  DGLIARPLPNRI
Sbjct: 422  WLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRI 481

Query: 697  R 695
            R
Sbjct: 482  R 482


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  653 bits (1684), Expect = 0.0
 Identities = 333/481 (69%), Positives = 383/481 (79%), Gaps = 11/481 (2%)
 Frame = -3

Query: 2104 ILLFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            +LLF+ S+    +HFQL +AL+   K+  EE VWEVRGGK  K++ D  KD F++    I
Sbjct: 207  VLLFVFSVLYSFFHFQLDTALS---KEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGI 263

Query: 1924 FTWNS-----------LCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLA 1778
                +            C + FL+LMLPEG+P SVTSDYL+Y+LWRGVQGVASQISGVLA
Sbjct: 264  GAVGAPKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLA 323

Query: 1777 TQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLEN 1598
            TQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLEN
Sbjct: 324  TQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLEN 383

Query: 1597 AAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEA 1418
            AA+G+EILTPAFPH F++I                +TRSCFYAGFA+QRNFAEVIAKGEA
Sbjct: 384  AAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEA 443

Query: 1417 QGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRAS 1238
            QGMVSKSIGIMLGIALAN IGSS PLS ASF VVT VHMFCNLKSYQS+ LRTLNPYRAS
Sbjct: 444  QGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRAS 503

Query: 1237 LVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLL 1058
            LVFSEYLLSGQVP +KEVN+EEPLFP +P ++A    K QS  LS EAK AA EIE+RL 
Sbjct: 504  LVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQ 563

Query: 1057 LGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLY 878
            LGSKLSE++++KED +ALFDLY +E Y+LTEH+GRF ++LKE  SPQDMLKS+F VNYLY
Sbjct: 564  LGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLY 623

Query: 877  WLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRI 698
            WLE+N GI S   SDDC+ GG+LQISL+Y QREFNH+K D E  GW  DGLIARPLPNRI
Sbjct: 624  WLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRI 683

Query: 697  R 695
            R
Sbjct: 684  R 684


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  649 bits (1675), Expect = 0.0
 Identities = 336/488 (68%), Positives = 379/488 (77%), Gaps = 15/488 (3%)
 Frame = -3

Query: 2104 ILLFLLSLFGCVWHFQLASALARAHKQSTEE------AVWEVRGGKRTKLLSDPWKDAFV 1943
            +LLF+ SL  C  H Q+A+A+AR    S ++      AVWEV+G KRTKL+ D  KDAFV
Sbjct: 92   LLLFVPSLLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFV 151

Query: 1942 LAETSIFTWNSL---------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQIS 1790
            +A  S  + +SL         C + F+Q MLPEG+PDSVTSDYL YSLWR VQGVASQIS
Sbjct: 152  VASASNASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 211

Query: 1789 GVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFAD 1610
            GVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFAD
Sbjct: 212  GVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFAD 271

Query: 1609 LLENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIA 1430
            LLENAAFG+E+LTPAFPH FV I                +TRSCFYAGFA++RNFAEVIA
Sbjct: 272  LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 331

Query: 1429 KGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNP 1250
            KGEAQGMVSK+IGIMLGIALAN+IGSS P +LASF+VVT +HM+CNLKSYQS+ LRTLNP
Sbjct: 332  KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 391

Query: 1249 YRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIE 1070
            YRASLVFSEYLLSGQ P VKEVNDEEPLFP   F       K Q   LS+EAK AA EIE
Sbjct: 392  YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 451

Query: 1069 QRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQV 890
             RL LGSKLS+++NNKEDA ALF LY  EGY+LTEH G+FC+VLKE + PQDMLKSLFQ 
Sbjct: 452  HRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 511

Query: 889  NYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPL 710
            +YLYWLE+N GI +   S DC  GG+L+ISLDY QREFNHVK D    GW  DGLIARPL
Sbjct: 512  SYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 571

Query: 709  PNRIRLCY 686
            PNRIR  Y
Sbjct: 572  PNRIRPGY 579


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  644 bits (1660), Expect = 0.0
 Identities = 332/493 (67%), Positives = 384/493 (77%), Gaps = 15/493 (3%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQL--ASALARAHKQSTE-----EAVWEVRGGKRTKLLSDPWKDAFV 1943
            L FL+ +  C +HF+L  ASA+A+A    ++     E VWEVRG KR +L+ D  KD FV
Sbjct: 160  LCFLVLVLSCFFHFRLSAASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFV 219

Query: 1942 LAETSIFTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISG 1787
              E +    +SL        C     Q +LPEGYP+SVTSDYL+YSLWRGVQG+ASQISG
Sbjct: 220  SEEAAFELSSSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISG 279

Query: 1786 VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADL 1607
            VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADL
Sbjct: 280  VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADL 339

Query: 1606 LENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAK 1427
            LENAAFGME+LTP FP  FV+I                ATRSCF AGFASQRNFAEVIAK
Sbjct: 340  LENAAFGMEMLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAK 399

Query: 1426 GEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPY 1247
            GEAQGMVSKS+GI+LGI +AN IG+ST L+LA+F VVT +HM+ NLKSYQ + LRTLNPY
Sbjct: 400  GEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPY 459

Query: 1246 RASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQ 1067
            RASLVFSEYL+SGQ PL+KEVNDEEPLFP + F++     K+Q   LS+EAK+AA +IE+
Sbjct: 460  RASLVFSEYLISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEE 519

Query: 1066 RLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVN 887
            RL LGSKLS++I+NKE+AIALFDLY +EGY+LTEHRGRFC++LKE SSPQDML+SLFQVN
Sbjct: 520  RLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVN 579

Query: 886  YLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLP 707
            YLYWLEKN GIE      DCK GG+L ISLDY +REF H K D E  GW  +GLIARPLP
Sbjct: 580  YLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLP 639

Query: 706  NRIRLCYDASALT 668
             RIRL YD+  L+
Sbjct: 640  TRIRLGYDSEPLS 652


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  642 bits (1655), Expect = 0.0
 Identities = 333/482 (69%), Positives = 381/482 (79%), Gaps = 6/482 (1%)
 Frame = -3

Query: 2098 LFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFV------LA 1937
            +FL ++  C  H +LA ALA    +   E+VWEV+GGK TKL  D  +DAFV      L 
Sbjct: 113  IFLAAVACCFCHLRLAYALA---SEEDAESVWEVKGGKWTKLAPDFVRDAFVADGGGGLG 169

Query: 1936 ETSIFTWNSLCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQSLLYA 1757
              S  +    C   F+QLMLPEG+PDSVTSDYL+YSLWR VQGVASQ+SGVLATQ+LLYA
Sbjct: 170  SISFESLGLQCKSLFVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQALLYA 229

Query: 1756 VGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEI 1577
            VGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME+
Sbjct: 230  VGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEM 289

Query: 1576 LTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGMVSKS 1397
            LTP FP+ F++I                ATRSCFYAGFA+QRNFAEVIAKGEAQGMVSK 
Sbjct: 290  LTPVFPNHFLLIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKF 349

Query: 1396 IGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVFSEYL 1217
            IGIMLGIALAN IGSST L LASF++VT +HMFCNLKSYQ++ LRTLNPYRASLVFSEYL
Sbjct: 350  IGIMLGIALANQIGSSTSLGLASFSLVTCIHMFCNLKSYQAIQLRTLNPYRASLVFSEYL 409

Query: 1216 LSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGSKLSE 1037
            LSGQ P VK+VN+EEPLFP +PF++     K Q   LS+EAK AA EIEQRL LG KLS+
Sbjct: 410  LSGQAPPVKDVNEEEPLFPAVPFLNWKPANKGQPTVLSSEAKDAAAEIEQRLQLGCKLSD 469

Query: 1036 LINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLEKNMG 857
            LINNKED  ALF+LY  EGY+LTEHRGR+C+VLKE SS QDMLK+LF VNYLYWLEKN G
Sbjct: 470  LINNKEDVHALFNLYKEEGYILTEHRGRYCVVLKETSSLQDMLKALFHVNYLYWLEKNAG 529

Query: 856  IESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIRLCYDAS 677
            IE+   S DC+ GG+L++SLDY +REF+ +K DGE  GW  DGLIARP PNRIR  Y+AS
Sbjct: 530  IEAKGTSIDCRPGGRLEMSLDYVRREFDIIKTDGESVGWVTDGLIARPAPNRIRPVYEAS 589

Query: 676  AL 671
            ++
Sbjct: 590  SV 591


>ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  637 bits (1643), Expect = e-180
 Identities = 329/492 (66%), Positives = 382/492 (77%), Gaps = 15/492 (3%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTE-------EAVWEVRGGKRTKLLSDPWKDAFV 1943
            L FLL    C +HF+L++A A A  Q+++       E VWEVRG KR +L+ D  KD FV
Sbjct: 110  LCFLLLGLSCFFHFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFV 169

Query: 1942 LAETSIFTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISG 1787
              E++    +SL        C +   Q +LPEG+P+SVTSDYL+YSLWRGVQG+ASQISG
Sbjct: 170  SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISG 229

Query: 1786 VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADL 1607
            VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADL
Sbjct: 230  VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADL 289

Query: 1606 LENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAK 1427
            LENAAFGME+LTP FP  FV+I                ATRSCF AGFASQRNFAEVIAK
Sbjct: 290  LENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAK 349

Query: 1426 GEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPY 1247
            GEAQGMVSKS+GI+LGI +AN IG+ST L+LA+F VVT +HM+ NLKSYQ + LRTLNPY
Sbjct: 350  GEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPY 409

Query: 1246 RASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQ 1067
            RASLVFSEYL+SGQ PL+KEVNDEEPLFP + F +     K+Q   LS+EAKAAA +IE+
Sbjct: 410  RASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEE 469

Query: 1066 RLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVN 887
            RL LGSKLS++I+NKE+AIALFDLY +EGY+LTEH+GRFC++LKE S+PQDML+SLFQVN
Sbjct: 470  RLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVN 529

Query: 886  YLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLP 707
            YLYWLEKN GIE      DCK GG+L ISLDY +REF H K D E  GW  +GLIARPLP
Sbjct: 530  YLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLP 589

Query: 706  NRIRLCYDASAL 671
             RIRL +D   L
Sbjct: 590  TRIRLGHDRELL 601


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  634 bits (1635), Expect = e-179
 Identities = 330/485 (68%), Positives = 374/485 (77%), Gaps = 11/485 (2%)
 Frame = -3

Query: 2098 LFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAE----- 1934
            + LLSLF C       S LARA  QS   +VWEV+GGK   L+ +   D FV+       
Sbjct: 100  ILLLSLFFC-------SRLARA--QSLSSSVWEVKGGKWILLVPNDLDDTFVVDSLFPST 150

Query: 1933 ------TSIFTWNSLCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQ 1772
                  + +  W   C    ++LMLPEGYP+SVTSDYL+YSLWR VQGVASQIS VLATQ
Sbjct: 151  SSTRPVSPLNLWLEKCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQ 210

Query: 1771 SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1592
            SLLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA
Sbjct: 211  SLLYAVGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 270

Query: 1591 FGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQG 1412
            FG E+LTPAFPHLFV I                ATRSCF+AGFA+QRNFAEVIAKGEAQG
Sbjct: 271  FGFEMLTPAFPHLFVPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQG 330

Query: 1411 MVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLV 1232
            MVSKSIGI +GI LAN IG+STPL+LASF+VVT +HM+CNLKSYQS+ LRTLNPYRASLV
Sbjct: 331  MVSKSIGIAMGIGLANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLV 390

Query: 1231 FSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLG 1052
            FSEYLLSGQ P +KEVNDE+PLFP +P ++   + K Q   LSAEAK AA EI+ RLLLG
Sbjct: 391  FSEYLLSGQAPPIKEVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLG 450

Query: 1051 SKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWL 872
            SKLS+++NN +D +ALFDLY +EGY+LTEH GRFC+VLKE  SP DMLK++F VNYLYWL
Sbjct: 451  SKLSDVVNNHKDVLALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWL 510

Query: 871  EKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIRL 692
            EKN GI+      D K GG+LQISLDY +REFNHVK DGE AGW  DGLIARPLPNRIR 
Sbjct: 511  EKNAGIDGASPYLDSKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRP 570

Query: 691  CYDAS 677
             + AS
Sbjct: 571  GFVAS 575


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  634 bits (1635), Expect = e-179
 Identities = 327/491 (66%), Positives = 383/491 (78%), Gaps = 15/491 (3%)
 Frame = -3

Query: 2095 FLLSLFGCVWHFQL--ASALARAHKQST-----EEAVWEVRGGKRTKLLSDPWKDAFVLA 1937
            FL+    C +HF+L  ASA+A+A    +     +E VWEVRG KR +L+ D  KD FV  
Sbjct: 118  FLVLGLSCFFHFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSE 177

Query: 1936 ETSIFTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVL 1781
            E++    +SL        C +   Q +LPEG+P+SVTSDYL+YSLWRGVQG+ASQ+SGVL
Sbjct: 178  ESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVL 237

Query: 1780 ATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLE 1601
            ATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLE
Sbjct: 238  ATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLE 297

Query: 1600 NAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGE 1421
            NAAFGME+LTP FP  FV+I                ATRSCF AGFASQRNFAEVIAKGE
Sbjct: 298  NAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGE 357

Query: 1420 AQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRA 1241
            AQGMVSKS+GI+LGI +AN IG+ST L+LA+F VVT +HM+ NLKSYQ + LRTLNPYRA
Sbjct: 358  AQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRA 417

Query: 1240 SLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRL 1061
            SLVFSEYL+SGQ PL+KEVNDEEPLFP + F++     K+Q   LS+EAKAAA +IE+RL
Sbjct: 418  SLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERL 477

Query: 1060 LLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYL 881
             LGSKLS++I+NKE+AIALFDLY +EGY+LTEHRGRFC++LKE S+PQDML+SLFQVNYL
Sbjct: 478  QLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYL 537

Query: 880  YWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNR 701
            YWLEKN GIE      DCK GG+L ISLDY +REF H K D +  GW  +GLIARPLP R
Sbjct: 538  YWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTR 597

Query: 700  IRLCYDASALT 668
            IRL +D   L+
Sbjct: 598  IRLGHDREPLS 608


>ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508778081|gb|EOY25337.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 577

 Score =  631 bits (1627), Expect = e-178
 Identities = 329/478 (68%), Positives = 373/478 (78%), Gaps = 9/478 (1%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTEE-AVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            LLFL S   C    QL+SALAR ++ S E+  VWEV+G K TKL+ D  +DAFV +   +
Sbjct: 104  LLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIV 163

Query: 1924 FTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQS 1769
                SL        C D  ++L+LPEG+PDSVTSDYL+YSLWRGVQGVASQISGVLATQ+
Sbjct: 164  NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQA 223

Query: 1768 LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 1589
            LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF
Sbjct: 224  LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 283

Query: 1588 GMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGM 1409
            G+E+LTPAFPHLFV I                ATRSCFYAGFA+QRNFAEVIAKGEAQGM
Sbjct: 284  GLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGM 343

Query: 1408 VSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVF 1229
            VSKSIGI+LGIALAN +GSST L+LASF VVT VHM+CNLKSYQS+ LRTLN YRASLVF
Sbjct: 344  VSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVF 403

Query: 1228 SEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGS 1049
            SEYLLSGQ P +KEVNDEEPLFP +PF++     + +S  LS+EAK AA +IE+RL LGS
Sbjct: 404  SEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGS 463

Query: 1048 KLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLE 869
            KLS+++NNKEDA+ALF LY  EGY+LTEH G+FC              SLFQVNYLYWLE
Sbjct: 464  KLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWLE 509

Query: 868  KNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIR 695
            +N GIE+   S DC+ GG+LQIS++Y QREFNHVK D E  GW  DGLIARPLPNRIR
Sbjct: 510  RNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIR 567


>ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508778082|gb|EOY25338.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 573

 Score =  622 bits (1604), Expect = e-175
 Identities = 325/478 (67%), Positives = 369/478 (77%), Gaps = 9/478 (1%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTEE-AVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            LLFL S   C    QL+SALAR ++ S E+  VWEV+G K TKL+ D  +DAFV +   +
Sbjct: 104  LLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIV 163

Query: 1924 FTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQS 1769
                SL        C D  ++L+LPEG+PDSVTSDYL+YSLWRGVQGVASQISGVLATQ+
Sbjct: 164  NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQA 223

Query: 1768 LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 1589
            LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF
Sbjct: 224  LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 283

Query: 1588 GMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGM 1409
            G+E+LTPAFPHLFV I                ATRSCFYAGFA+QRNFAEVIAKGEAQGM
Sbjct: 284  GLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGM 343

Query: 1408 VSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVF 1229
            VSKSIGI+LGIALAN +GSST L+LASF VVT VHM+CNLKSYQS+ LRTLN YRASLVF
Sbjct: 344  VSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVF 403

Query: 1228 SEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGS 1049
            SEYLLSGQ P +KEVNDEEPLFP +PF++     + +S  LS+EAK AA +IE+RL LGS
Sbjct: 404  SEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGS 463

Query: 1048 KLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLE 869
            KLS+++NNKEDA+ALF LY  EGY+LTEH G+FC                  VNYLYWLE
Sbjct: 464  KLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC------------------VNYLYWLE 505

Query: 868  KNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIR 695
            +N GIE+   S DC+ GG+LQIS++Y QREFNHVK D E  GW  DGLIARPLPNRIR
Sbjct: 506  RNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIR 563


>gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus guttatus]
          Length = 606

 Score =  621 bits (1601), Expect = e-175
 Identities = 313/456 (68%), Positives = 363/456 (79%), Gaps = 15/456 (3%)
 Frame = -3

Query: 2014 EAVWEVRGGKRTKLLSDPWKDAFVLAETSIFTWNSL---------------CSDFFLQLM 1880
            E V+E+R GKR +L+ D  KD FV+ E +   W                  C D  + LM
Sbjct: 139  EVVFEIRAGKRVELVPDYSKDEFVVPEKNWSWWLKAAKSNPSSNLADVWMKCRDVAMSLM 198

Query: 1879 LPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLK 1700
            LPEG+P+SVTSDYLEYSLWRGVQG+A+Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLK
Sbjct: 199  LPEGFPESVTSDYLEYSLWRGVQGIAAQVSGVLATQALLYAVGLGKGAIPTAAAVNWVLK 258

Query: 1699 DGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVVIXXXXXXX 1520
            DGIGYLSKIMLSKYGRHFDVNPKGWRL AD LENAAFG+EILTPAFPHLFV I       
Sbjct: 259  DGIGYLSKIMLSKYGRHFDVNPKGWRLCADFLENAAFGLEILTPAFPHLFVPIGAVAGAG 318

Query: 1519 XXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIGSSTPL 1340
                     ATRSCFYAGFA+QRNFAEVIAKGEAQGMVSKSIGIMLGIALAN + SS PL
Sbjct: 319  RSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSIPL 378

Query: 1339 SLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVFSEYLLSGQVPLVKEVNDEEPLFP 1160
            +LASF+V+T +HMFCNLKSYQS+ LRTLNPYRASLVFS+YLLSG VP VKEVNDEEPLFP
Sbjct: 379  ALASFSVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPLFP 438

Query: 1159 GLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGSKLSELINNKEDAIALFDLYWSEG 980
              P +      + Q + LS +AK AA  I++RL LGSKLS+++ ++E+AIALFDLY SEG
Sbjct: 439  AFPLLIVKPTSEEQVEVLSPDAKHAASNIDRRLKLGSKLSDVVKSREEAIALFDLYKSEG 498

Query: 979  YMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLEKNMGIESGRVSDDCKQGGKLQIS 800
            Y+LTEH+GR+C+VLKE S PQDMLKSLFQV+YLYWLE+N GI+S    DDC+ GG+LQIS
Sbjct: 499  YILTEHQGRYCVVLKESSMPQDMLKSLFQVSYLYWLERNAGIKSTTTIDDCRPGGRLQIS 558

Query: 799  LDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIRL 692
            ++Y QREF H+K D + AGW VDGLIARPLP+RIR+
Sbjct: 559  MEYVQREFTHIKNDSQFAGWVVDGLIARPLPHRIRI 594


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  620 bits (1598), Expect = e-174
 Identities = 326/493 (66%), Positives = 378/493 (76%), Gaps = 16/493 (3%)
 Frame = -3

Query: 2104 ILLFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            +LLFL+S    +    L ++  +A K +  E V+E+RGGKR +L+ D  KD FVL +T  
Sbjct: 114  LLLFLVSASSSITCCLLLASFVQA-KTNNGEIVYEIRGGKRFELVPDYSKDEFVLTKTM- 171

Query: 1924 FTWNSL----------------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQI 1793
              W+ L                C +    L LPEG+P+SVTSDYLEY+LWRGVQG+A+QI
Sbjct: 172  --WSQLWPDSTSGSFVSNLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQI 229

Query: 1792 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFA 1613
            SGVLATQ+LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFA
Sbjct: 230  SGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFA 289

Query: 1612 DLLENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVI 1433
            DLLENAA+G+EILTPAFPHLFV I                ATRSCFYAGFA+QRNFAEVI
Sbjct: 290  DLLENAAYGLEILTPAFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVI 349

Query: 1432 AKGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLN 1253
            AKGEAQGMVSK+IGIMLGIALANY  SST L+LASF VVT +HMFCNLKSYQS+ LRTLN
Sbjct: 350  AKGEAQGMVSKAIGIMLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLN 409

Query: 1252 PYRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEI 1073
            PYRASLVFSEYLLSG VP VKEVNDEEPLFP    ++     + Q++ LS  AK AA  I
Sbjct: 410  PYRASLVFSEYLLSGLVPSVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGI 468

Query: 1072 EQRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQ 893
             +RL LGSKLS++  ++ED +ALF+LY +EGY+LTEH GRFCIVLKE SSPQDMLKSLF 
Sbjct: 469  VRRLQLGSKLSDVATSQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFH 528

Query: 892  VNYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARP 713
            VNYLYWLE N GI+S  V++DC+ GG+LQ+SL+Y +REFNHVK DGE+AGW  D LIARP
Sbjct: 529  VNYLYWLETNAGIKSSSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARP 588

Query: 712  LPNRIRLCYDASA 674
            LP RIRL Y A +
Sbjct: 589  LPVRIRLDYAAES 601


>ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum]
            gi|557096914|gb|ESQ37422.1| hypothetical protein
            EUTSA_v10002446mg [Eutrema salsugineum]
          Length = 611

 Score =  619 bits (1595), Expect = e-174
 Identities = 321/487 (65%), Positives = 376/487 (77%), Gaps = 14/487 (2%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTE-------EAVWEVRGGKRTKLLSDPWKDAFV 1943
            L FL  ++ C +  +L++A+A A    ++       E VWEVRG KR +L+ D  +D F 
Sbjct: 115  LCFLFLVYSCFFQLRLSAAIAIAKAPESDSNGDTEKETVWEVRGSKRKRLVPDFVRDEFF 174

Query: 1942 LAE----TSIFTWNSL---CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGV 1784
            ++     +S  T  +L   C +   Q +LPEG+P+SVTSDYL+YSLWRGVQG+ASQISGV
Sbjct: 175  VSPEETTSSPLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGV 234

Query: 1783 LATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLL 1604
            LATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLL
Sbjct: 235  LATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLL 294

Query: 1603 ENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKG 1424
            EN+AFGME+LTP FP  FV+I                ATRSCF AGFASQRNFAEVIAKG
Sbjct: 295  ENSAFGMEMLTPLFPQFFVLIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKG 354

Query: 1423 EAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYR 1244
            EAQGMVSKSIGI+LGI +AN IG+ST L+LASF VVT +HM+ NLKSYQ + LRTLNPYR
Sbjct: 355  EAQGMVSKSIGILLGIVVANCIGTSTSLALASFGVVTSIHMYTNLKSYQCIQLRTLNPYR 414

Query: 1243 ASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQR 1064
            ASLVFSEYL+SGQ P +KEVNDEEPLFP +  ++     K Q   LS+EAKAAA +IE+R
Sbjct: 415  ASLVFSEYLISGQAPPIKEVNDEEPLFPTVRSLNIKSAEKRQDFVLSSEAKAAAADIEER 474

Query: 1063 LLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNY 884
            L LGSKLS++++NKE+A+ALFDLY  EGY+LTEHRGRFC++LKE SSPQDML+SLFQVNY
Sbjct: 475  LQLGSKLSDVVHNKEEAVALFDLYRDEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNY 534

Query: 883  LYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPN 704
            LYWLEKN GIE+     DCK GG+L ISLDY +REF   K D EL GW  +GLIARPL  
Sbjct: 535  LYWLEKNAGIEASNTYLDCKPGGRLHISLDYVRREFELAKEDSELVGWVTEGLIARPLST 594

Query: 703  RIRLCYD 683
            RIRL YD
Sbjct: 595  RIRLDYD 601


>emb|CAB82813.1| putative protein [Arabidopsis thaliana]
          Length = 631

 Score =  618 bits (1594), Expect = e-174
 Identities = 326/515 (63%), Positives = 380/515 (73%), Gaps = 38/515 (7%)
 Frame = -3

Query: 2101 LLFLLSLFGCVWHFQLASALARAHKQSTE-------EAVWEVRGGKRTKLLSDPWKDAFV 1943
            L FLL    C +HF+L++A A A  Q+++       E VWEVRG KR +L+ D  KD FV
Sbjct: 110  LCFLLLGLSCFFHFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFV 169

Query: 1942 LAETSIFTWNSL--------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISG 1787
              E++    +SL        C +   Q +LPEG+P+SVTSDYL+YSLWRGVQG+ASQISG
Sbjct: 170  SEESAFELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISG 229

Query: 1786 VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADL 1607
            VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADL
Sbjct: 230  VLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADL 289

Query: 1606 LENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVI-- 1433
            LENAAFGME+LTP FP  FV+I                ATRSCF AGFASQRNFAEV   
Sbjct: 290  LENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVYNN 349

Query: 1432 ---------------------AKGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVV 1316
                                 + GEAQGMVSKS+GI+LGI +AN IG+ST L+LA+F VV
Sbjct: 350  FYMALVLITYQQLFVFLNYSGSLGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVV 409

Query: 1315 TGVHMFCNLKSYQSVLLRTLNPYRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISAN 1136
            T +HM+ NLKSYQ + LRTLNPYRASLVFSEYL+SGQ PL+KEVNDEEPLFP + F +  
Sbjct: 410  TTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMK 469

Query: 1135 LMRKVQSQQLSAEAKAAAYEIEQRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRG 956
               K+Q   LS+EAKAAA +IE+RL LGSKLS++I+NKE+AIALFDLY +EGY+LTEH+G
Sbjct: 470  SPEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKG 529

Query: 955  RFCIVLKEGSSPQDMLKSLFQVNYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREF 776
            RFC++LKE S+PQDML+SLFQVNYLYWLEKN GIE      DCK GG+L ISLDY +REF
Sbjct: 530  RFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREF 589

Query: 775  NHVKYDGELAGWTVDGLIARPLPNRIRLCYDASAL 671
             H K D E  GW  +GLIARPLP RIRL +D   L
Sbjct: 590  EHAKEDSESVGWVTEGLIARPLPTRIRLGHDRELL 624


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  616 bits (1589), Expect = e-173
 Identities = 314/466 (67%), Positives = 360/466 (77%), Gaps = 13/466 (2%)
 Frame = -3

Query: 2053 ASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAETSIFTWNSL----------- 1907
            +S  +    +  ++ +WEV+GG   KL  D  KD F+ +  + F+  S            
Sbjct: 120  SSCSSSIENEILKQPIWEVKGGNFIKLFPDHLKDIFIASNPTFFSELSSLNVSQVPSFLY 179

Query: 1906 --CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQSLLYAVGLGKGAI 1733
              C +F ++LMLPEG+P+SVTSDYLEYSLWRGVQGVA Q+SGVLATQ+LLYAVGLGKGAI
Sbjct: 180  TKCKEFTVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAI 239

Query: 1732 PTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHL 1553
            PTAAAINWVLKDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHL
Sbjct: 240  PTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHL 299

Query: 1552 FVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQGMVSKSIGIMLGIA 1373
            FV I                +TRSCF+AGFA+QRNFAEVIAKGE QGM S+ IGI LGI 
Sbjct: 300  FVPIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIG 359

Query: 1372 LANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLVFSEYLLSGQVPLV 1193
            L N IGSSTPL LASF VVT VHM+CNLKSYQS+ LRTLNPYRASLVFSEYLLSGQ P V
Sbjct: 360  LGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPV 419

Query: 1192 KEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLGSKLSELINNKEDA 1013
            KEVNDEEPLFP LP ++A    K QS  LS+EAK AA EIE RL LGSKLSE+I+NKE+ 
Sbjct: 420  KEVNDEEPLFPALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEV 479

Query: 1012 IALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWLEKNMGIESGRVSD 833
            +ALF LY +EGY+L+EH G+FC+VLKE  S  DMLK+LFQVNYLYWLEKN GIE      
Sbjct: 480  LALFSLYKNEGYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALY 539

Query: 832  DCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIR 695
            DCK GG+L+ISL+YA+REFNH + DGE AGW  DGLIARPLPNRIR
Sbjct: 540  DCKPGGRLRISLEYAEREFNHARNDGESAGWIADGLIARPLPNRIR 585


>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  615 bits (1587), Expect = e-173
 Identities = 325/491 (66%), Positives = 374/491 (76%), Gaps = 16/491 (3%)
 Frame = -3

Query: 2104 ILLFLLSLFGCVWHFQLASALARAHKQSTEEAVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            +LLFL+S    +    L ++  +A K +  E V E+RGGKR +L+ D  KD FVL +T  
Sbjct: 117  LLLFLVSASSSITCCLLLASFVQA-KTNNGEIVHEIRGGKRFELVPDYSKDEFVLTKTM- 174

Query: 1924 FTWNSL----------------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQI 1793
              W+ L                C +    L+LPEG+PDSVTSDYLEY+LWRGVQGVA+QI
Sbjct: 175  --WSRLLPDSKSGSFVSNLWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQI 232

Query: 1792 SGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFA 1613
            SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFA
Sbjct: 233  SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFA 292

Query: 1612 DLLENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVI 1433
            DLLENAA+G+EILTPAFPHLFV I                ATRSCFYAGFA+QRNFAEVI
Sbjct: 293  DLLENAAYGLEILTPAFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVI 352

Query: 1432 AKGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLN 1253
            AKGEAQGMVSK+IGIMLGIALAN   SST L+LASF VVT +HMFCNLKSY S+ LRTLN
Sbjct: 353  AKGEAQGMVSKAIGIMLGIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLN 412

Query: 1252 PYRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEI 1073
            PYRASLVFSEYLLSG VP VKEVNDEEPLFP    ++     + Q + LS  AK AA  I
Sbjct: 413  PYRASLVFSEYLLSGLVPSVKEVNDEEPLFPA-AILNLKAAYETQMEVLSVHAKQAAAGI 471

Query: 1072 EQRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQ 893
             +RL LGSKLS++  ++ED +ALF+LY +EGY+LTEH GRFCIVLKE SSPQDMLKSLF 
Sbjct: 472  VRRLQLGSKLSDVATSREDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFH 531

Query: 892  VNYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARP 713
            VNYLYWLE   GI+S  V++DC+ GG+LQ+SL+Y +REFNHVK DGE+AGW  D LIARP
Sbjct: 532  VNYLYWLETKAGIKSSSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARP 591

Query: 712  LPNRIRLCYDA 680
            LPNRIRL Y A
Sbjct: 592  LPNRIRLDYTA 602


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  615 bits (1587), Expect = e-173
 Identities = 319/490 (65%), Positives = 375/490 (76%), Gaps = 20/490 (4%)
 Frame = -3

Query: 2107 DILLFLLSLFGCVWHFQLASALARA-----HKQSTEEAVWEVRGGKRTKLLSDPWKDAFV 1943
            D  ++LL  F  +W    +SA AR       ++  E++VW V+G KR +L+ D  KD F+
Sbjct: 41   DYFVWLLCCFVALWLQSASSAFARTTLKEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFL 100

Query: 1942 LAETSIFTWNSL---------------CSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQG 1808
            +  +   +++ +               C   F++LMLPEGYP SVTSDYL+YSLWRGVQG
Sbjct: 101  VNPSLPSSYDDIISSSWLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQG 160

Query: 1807 VASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKG 1628
            VASQISGVLATQ+LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDVNPKG
Sbjct: 161  VASQISGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKG 220

Query: 1627 WRLFADLLENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRN 1448
            WRLFADLLENAAFG+EILTPAFPHLFV I                ATRSCFYAGFA+QRN
Sbjct: 221  WRLFADLLENAAFGLEILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRN 280

Query: 1447 FAEVIAKGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVL 1268
            FAEVIAKGEAQGMVSK IGIMLGI LAN IGSS PL+LASF+VVT +HMFCNLKSYQS+ 
Sbjct: 281  FAEVIAKGEAQGMVSKFIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQ 340

Query: 1267 LRTLNPYRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKA 1088
            LRTLNPYRASLVFSEYLLSGQ P +K+VNDEEPLFP + F       K     LS EA+ 
Sbjct: 341  LRTLNPYRASLVFSEYLLSGQAPPIKDVNDEEPLFPAV-FPHFKSADKPSLVVLSLEARD 399

Query: 1087 AAYEIEQRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDML 908
            AA EIE+RL LGSKLS+++N+KED +ALF+LY  EGY+LTE++GRFC+VLKE  S QDML
Sbjct: 400  AATEIERRLQLGSKLSDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDML 459

Query: 907  KSLFQVNYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDG 728
            K+LFQVNYLYWLE+N G+++   S DC+ GG+LQ+SL+Y QREF+HV+ D    GW  DG
Sbjct: 460  KALFQVNYLYWLERNAGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADG 519

Query: 727  LIARPLPNRI 698
            LIARPLPNRI
Sbjct: 520  LIARPLPNRI 529


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  613 bits (1582), Expect = e-173
 Identities = 320/486 (65%), Positives = 365/486 (75%), Gaps = 20/486 (4%)
 Frame = -3

Query: 2089 LSLFGCVWHFQLASALARAHKQSTE-----EAVWEVRGGKRTKLLSDPWKDAFVLAETSI 1925
            L  F  + H +LA A   +   + +     E V+EV+GGK TKL+ D   D FV A+   
Sbjct: 98   LCFFCHLLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGF 157

Query: 1924 ---------------FTWNSLCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQIS 1790
                           F W   CSD F +LMLPEG+P+SVTSDYLEYSLWR VQGVA Q+S
Sbjct: 158  LSELSSLKVPSQLATFVWLK-CSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVS 216

Query: 1789 GVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFAD 1610
            GVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFAD
Sbjct: 217  GVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFAD 276

Query: 1609 LLENAAFGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIA 1430
            LLENAAFG+E+ TPAFP  FV+I                +TRSCF+AGFA+QRNFAEVIA
Sbjct: 277  LLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIA 336

Query: 1429 KGEAQGMVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNP 1250
            KGE QGM S+ IGI LGI L N IGSSTPL LASF V+T +HM+CNLKSYQS+ LRTLNP
Sbjct: 337  KGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNP 396

Query: 1249 YRASLVFSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIE 1070
            YRASLVFSEYLLSGQ P VKEVNDEEPLFP +P ++A    K QS  LS+EAK AA EIE
Sbjct: 397  YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIE 456

Query: 1069 QRLLLGSKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQV 890
             RL LGSKLSE++N+KED +ALF LY +EGY+L+E+ G+FC+VLKE  S QDMLK+LFQV
Sbjct: 457  HRLQLGSKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQV 516

Query: 889  NYLYWLEKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPL 710
            NYLYWLEKN GI      +D K GG+L ISLDY +REFNHVK DGEL GW  DGLIARPL
Sbjct: 517  NYLYWLEKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPL 576

Query: 709  PNRIRL 692
            PNRIR+
Sbjct: 577  PNRIRI 582


>ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
            gi|561031470|gb|ESW30049.1| hypothetical protein
            PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  613 bits (1581), Expect = e-172
 Identities = 316/480 (65%), Positives = 361/480 (75%), Gaps = 17/480 (3%)
 Frame = -3

Query: 2080 FGCVWHFQLASALARAHKQSTE---EAVWEVRGGKRTKLLSDPWKDAFVLAETSI----- 1925
            FG +   +LA+A   +     E   E VWEV+GGK T+L+ DP  D FV A   +     
Sbjct: 103  FGHLLLVKLANAKTWSSSSDNELLSEPVWEVKGGKWTRLVPDPTNDVFVSAHPGLLAELQ 162

Query: 1924 ---------FTWNSLCSDFFLQLMLPEGYPDSVTSDYLEYSLWRGVQGVASQISGVLATQ 1772
                     F W   C D F +LMLPEG+P+SVTSDYLEYSLWR VQGVA Q+SGVLATQ
Sbjct: 163  SLKPSQFATFVWLK-CRDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQ 221

Query: 1771 SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1592
            SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAA
Sbjct: 222  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAA 281

Query: 1591 FGMEILTPAFPHLFVVIXXXXXXXXXXXXXXXXATRSCFYAGFASQRNFAEVIAKGEAQG 1412
            FG+E+ TPAFP  FV+I                +TRSCF+AGFA+QRNFAEVIAKGE QG
Sbjct: 282  FGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 341

Query: 1411 MVSKSIGIMLGIALANYIGSSTPLSLASFAVVTGVHMFCNLKSYQSVLLRTLNPYRASLV 1232
            M S+ IGI LGI L N IGSSTPL LASF V+T +HM+CNLKSYQS+ LRTLNPYRASLV
Sbjct: 342  MASRFIGIGLGIGLGNCIGSSTPLVLASFIVLTWIHMYCNLKSYQSIQLRTLNPYRASLV 401

Query: 1231 FSEYLLSGQVPLVKEVNDEEPLFPGLPFISANLMRKVQSQQLSAEAKAAAYEIEQRLLLG 1052
            FSEYLLSGQ P VK+VNDEEPLFP +P ++A    K +S  LS+EAK AA EIE+RL LG
Sbjct: 402  FSEYLLSGQAPPVKDVNDEEPLFPAVPILNATFANKARSIALSSEAKDAAAEIERRLQLG 461

Query: 1051 SKLSELINNKEDAIALFDLYWSEGYMLTEHRGRFCIVLKEGSSPQDMLKSLFQVNYLYWL 872
            SKLSE++N KED +ALF LY  EGY+L+EH G+FC+VLKE  S QDMLK+LFQVNYLYWL
Sbjct: 462  SKLSEIVNGKEDVLALFRLYKKEGYILSEHMGKFCVVLKENCSQQDMLKALFQVNYLYWL 521

Query: 871  EKNMGIESGRVSDDCKQGGKLQISLDYAQREFNHVKYDGELAGWTVDGLIARPLPNRIRL 692
            EKN GI      +D + GG+L  SLDY +REFNH+K DGE  GW  DGLIARPLPNRIR+
Sbjct: 522  EKNAGIGGRGTLNDSRPGGRLHTSLDYVEREFNHLKNDGESVGWVTDGLIARPLPNRIRI 581

