BLASTX nr result

ID: Rauwolfia21_contig00008034 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00008034
         (2717 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   728   0.0  
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   721   0.0  
gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma caca...   701   0.0  
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   672   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   671   0.0  
gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]    670   0.0  
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              665   0.0  
gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]    661   0.0  
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   649   0.0  
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     645   0.0  
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   642   0.0  
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   642   0.0  
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   639   e-180
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   632   e-178
ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thalia...   632   e-178
ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago ...   630   e-178
gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus...   628   e-177
ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutr...   627   e-176
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   626   e-176
ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786...   625   e-176

>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  728 bits (1879), Expect = 0.0
 Identities = 375/544 (68%), Positives = 428/544 (78%)
 Frame = +1

Query: 529  WNSFFNFDKNPLFLSVICHKLTNENDTFVIIRDTVRQLFIFFLSASTFLSFYSCVASVAE 708
            WN+FFNFDK      ++   +  + DTF+    + + L +F +SAS+ ++    +AS  +
Sbjct: 83   WNNFFNFDK------ILLLPIFRDEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASFVQ 136

Query: 709  AKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGSEQKLK 888
            AKT              ++V EIRGGK+ EL+PDY +DEF++ KT+ +  W   +     
Sbjct: 137  AKT-----------NNGEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWPDSTSGSF- 184

Query: 889  LSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAV 1068
              V +LWMQC+EL   L LPEGFPESVT+DYLEY+LWRGVQG+AAQ+SGVLATQALLYAV
Sbjct: 185  --VSNLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAV 242

Query: 1069 GLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEIL 1248
            GLGKGAIPTAAA+NWVLKDGIGYLSKILLS YGRHFDVNPK WRLFADLLENAAYG+EIL
Sbjct: 243  GLGKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEIL 302

Query: 1249 TPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1428
            TPAFPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+I
Sbjct: 303  TPAFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAI 362

Query: 1429 GIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLL 1608
            GIMLGIALANY +SST LAL+SFGVVTWIHM+CNLKSYQSIQLRTLNPYRA LVFSEYLL
Sbjct: 363  GIMLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLL 422

Query: 1609 SGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDV 1788
            SGLVPSVKEVNDEEPLFPA   LN+K A     EVLS  AK AAA I RRLQLGSKLSDV
Sbjct: 423  SGLVPSVKEVNDEEPLFPAA-ILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDV 481

Query: 1789 VKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGI 1968
              ++ED LALF+L+++EGYILTEHEGRFC+ LKESSSPQDMLKSLF V+YLYWLE NAGI
Sbjct: 482  ATSQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGI 541

Query: 1969 KSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPGYATVS 2148
            KSSS ++DC+PGGRL+MSLEYV+REFNHVK+DGE AGWVTD LIARPLP RI   YA  S
Sbjct: 542  KSSSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRLDYAAES 601

Query: 2149 LASE 2160
              +E
Sbjct: 602  SVAE 605


>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  721 bits (1861), Expect = 0.0
 Identities = 375/544 (68%), Positives = 427/544 (78%)
 Frame = +1

Query: 529  WNSFFNFDKNPLFLSVICHKLTNENDTFVIIRDTVRQLFIFFLSASTFLSFYSCVASVAE 708
            W++FFNFDK     S++   +    DTF+    + + L +F +SAS+ ++    +AS  +
Sbjct: 83   WSNFFNFDKRR---SLLLLPIFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASFVQ 139

Query: 709  AKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGSEQKLK 888
            AKT              ++V EIRGGK+ EL+PDY +DEF++ KT+   W     + K  
Sbjct: 140  AKT-----------NNGEIVHEIRGGKRFELVPDYSKDEFVLTKTM---WSRLLPDSKSG 185

Query: 889  LSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAV 1068
              V +LWMQC+EL   L+LPEGFP+SVT+DYLEY+LWRGVQGVAAQ+SGVLATQALLYAV
Sbjct: 186  SFVSNLWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAV 245

Query: 1069 GLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEIL 1248
            GLGKGAIPTAAA+NWVLKDGIGYLSKILLS YGRHFDVNPK WRLFADLLENAAYG+EIL
Sbjct: 246  GLGKGAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEIL 305

Query: 1249 TPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1428
            TPAFPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+I
Sbjct: 306  TPAFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAI 365

Query: 1429 GIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLL 1608
            GIMLGIALAN  +SST LAL+SFGVVTWIHM+CNLKSY SIQLRTLNPYRA LVFSEYLL
Sbjct: 366  GIMLGIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLL 425

Query: 1609 SGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDV 1788
            SGLVPSVKEVNDEEPLFPA   LN+K A     EVLS  AK AAA I RRLQLGSKLSDV
Sbjct: 426  SGLVPSVKEVNDEEPLFPAA-ILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDV 484

Query: 1789 VKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGI 1968
              +RED LALF+L+++EGYILTEHEGRFC+ LKESSSPQDMLKSLF V+YLYWLE  AGI
Sbjct: 485  ATSREDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGI 544

Query: 1969 KSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPGYATVS 2148
            KSSS ++DC+PGGRL+MSLEYV+REFNHVK+DGE AGWVTD LIARPLPNRI   Y  VS
Sbjct: 545  KSSSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRLDYTAVS 604

Query: 2149 LASE 2160
              +E
Sbjct: 605  SVAE 608


>gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  701 bits (1809), Expect = 0.0
 Identities = 353/505 (69%), Positives = 408/505 (80%)
 Frame = +1

Query: 643  FIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRD 822
            F+ FLS        S VA    ++ +S ++    +  +D VV E++G K  +LIPD+  D
Sbjct: 103  FLLFLS--------SFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSED 154

Query: 823  EFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWR 1002
             F+    ++             LS+ ++W QCR++ M L+LPEGFP+SVT+DYL+YSLWR
Sbjct: 155  AFVASNGIV--------NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWR 206

Query: 1003 GVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDV 1182
            GVQGVA+Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDV
Sbjct: 207  GVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV 266

Query: 1183 NPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFA 1362
            NPKGWRLFADLLENAA+G+E+LTPAFPHLFVPI                 TRSCFYAGFA
Sbjct: 267  NPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFA 326

Query: 1363 AQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSY 1542
            AQRNFAEVIAKGEAQGMVSKSIGI+LGIALAN + SST LAL+SFGVVTW+HMYCNLKSY
Sbjct: 327  AQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSY 386

Query: 1543 QSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSE 1722
            QSIQLRTLN YRA LVFSEYLLSG  PS+KEVNDEEPLFPAVPFLN+  A RE + VLS 
Sbjct: 387  QSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSS 446

Query: 1723 DAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSP 1902
            +AK AAA+IERRLQLGSKLSD+V N+EDALALF L++DEGYILTEHEG+FCV LKESS P
Sbjct: 447  EAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLKESSLP 506

Query: 1903 QDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGW 2082
            QDMLKSLFQV+YLYWLE+NAGI++S  S DC+PGGRL++S+EYVQREFNHVK D E+ GW
Sbjct: 507  QDMLKSLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGW 566

Query: 2083 VTDGLIARPLPNRICPGYATVSLAS 2157
            VTDGLIARPLPNRI PG+   S AS
Sbjct: 567  VTDGLIARPLPNRIRPGHRDASTAS 591


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  672 bits (1734), Expect = 0.0
 Identities = 345/495 (69%), Positives = 386/495 (77%)
 Frame = +1

Query: 673  LSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLA 852
            L +  C   VA A   +  SSE     +   V E++G K+ +LIPD+ +D F+V     A
Sbjct: 99   LLYCFCHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASASNA 158

Query: 853  SWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMS 1032
            S           LSV  LW +CRELF+  MLPEGFP+SVT+DYL YSLWR VQGVA+Q+S
Sbjct: 159  SL-------SSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQIS 211

Query: 1033 GVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFAD 1212
            GVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDVNPKGWRLFAD
Sbjct: 212  GVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFAD 271

Query: 1213 LLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIA 1392
            LLENAA+G+E+LTPAFPH FV I                 TRSCFYAGFAA+RNFAEVIA
Sbjct: 272  LLENAAFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIA 331

Query: 1393 KGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNP 1572
            KGEAQGMVSK+IGIMLGIALAN+I SS P AL+SF VVTWIHMYCNLKSYQSI+LRTLNP
Sbjct: 332  KGEAQGMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNP 391

Query: 1573 YRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIE 1752
            YRA LVFSEYLLSG  P VKEVNDEEPLFPA  F  IK A +    VLS +AKDAA  IE
Sbjct: 392  YRASLVFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIE 451

Query: 1753 RRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQV 1932
             RLQLGSKLSDVV N+EDA ALF L+ DEGYILTEH G+FCV LKES+ PQDMLKSLFQ 
Sbjct: 452  HRLQLGSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQA 511

Query: 1933 SYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPL 2112
            SYLYWLE+NAGI ++STS DC PGGRL +SL+YVQREFNHVKSD  + GWVTDGLIARPL
Sbjct: 512  SYLYWLERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPL 571

Query: 2113 PNRICPGYATVSLAS 2157
            PNRI PGY   S+AS
Sbjct: 572  PNRIRPGYVEPSVAS 586


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  671 bits (1732), Expect = 0.0
 Identities = 353/541 (65%), Positives = 419/541 (77%), Gaps = 3/541 (0%)
 Frame = +1

Query: 523  TNWN-SFFNFDKNPLFLSVICHKLTNEN--DTFVIIRDTVRQLFIFFLSASTFLSFYSCV 693
            +NWN  ++  ++N LF+   C ++ +E+  +T  ++R  +  LF+F    S   SF+   
Sbjct: 170  SNWNWGWWGNEENALFI-FFCSRVLHEHGSETAHMLRAVL--LFVF----SVLYSFFHFQ 222

Query: 694  ASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGS 873
               A +K           E +++ V E+RGGK  ++IPD  +DEF+V    + +     S
Sbjct: 223  LDTALSK-----------EKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGIGAVGAPKS 271

Query: 874  EQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQA 1053
                  ++ +LW+QC+ELF+ LMLPEGFP SVT+DYL+Y+LWRGVQGVA+Q+SGVLATQA
Sbjct: 272  S-----TLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQA 326

Query: 1054 LLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAY 1233
            LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKILLSKYGRHFDV+PKGWRLFADLLENAAY
Sbjct: 327  LLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAY 386

Query: 1234 GMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGM 1413
            G+EILTPAFPH F+ I                 TRSCFYAGFAAQRNFAEVIAKGEAQGM
Sbjct: 387  GLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGM 446

Query: 1414 VSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVF 1593
            VSKSIGIMLGIALAN I SS PL+ +SF VVT +HM+CNLKSYQSIQLRTLNPYRA LVF
Sbjct: 447  VSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVF 506

Query: 1594 SEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGS 1773
            SEYLLSG VPS+KEVN+EEPLFP VP LN KP  +  + VLS +AKDAAA IERRLQLGS
Sbjct: 507  SEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGS 566

Query: 1774 KLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLE 1953
            KLS+VV ++ED LALFDL+R+E YILTEH+GRF V LKES SPQDMLKS+F V+YLYWLE
Sbjct: 567  KLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLE 626

Query: 1954 KNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPG 2133
            +NAGI S   SDDC+PGGRL++SLEYVQREFNH+K+D E  GW TDGLIARPLPNRI PG
Sbjct: 627  RNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPG 686

Query: 2134 Y 2136
            +
Sbjct: 687  H 687


>gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 577

 Score =  670 bits (1728), Expect = 0.0
 Identities = 341/505 (67%), Positives = 396/505 (78%)
 Frame = +1

Query: 643  FIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRD 822
            F+ FLS        S VA    ++ +S ++    +  +D VV E++G K  +LIPD+  D
Sbjct: 103  FLLFLS--------SFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSED 154

Query: 823  EFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWR 1002
             F+    ++             LS+ ++W QCR++ M L+LPEGFP+SVT+DYL+YSLWR
Sbjct: 155  AFVASNGIV--------NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWR 206

Query: 1003 GVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDV 1182
            GVQGVA+Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDV
Sbjct: 207  GVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV 266

Query: 1183 NPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFA 1362
            NPKGWRLFADLLENAA+G+E+LTPAFPHLFVPI                 TRSCFYAGFA
Sbjct: 267  NPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFA 326

Query: 1363 AQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSY 1542
            AQRNFAEVIAKGEAQGMVSKSIGI+LGIALAN + SST LAL+SFGVVTW+HMYCNLKSY
Sbjct: 327  AQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSY 386

Query: 1543 QSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSE 1722
            QSIQLRTLN YRA LVFSEYLLSG  PS+KEVNDEEPLFPAVPFLN+  A RE + VLS 
Sbjct: 387  QSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSS 446

Query: 1723 DAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSP 1902
            +AK AAA+IERRLQLGSKLSD+V N+EDALALF L++DEGYILTEHEG+FC         
Sbjct: 447  EAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC--------- 497

Query: 1903 QDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGW 2082
                 SLFQV+YLYWLE+NAGI++S  S DC+PGGRL++S+EYVQREFNHVK D E+ GW
Sbjct: 498  -----SLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGW 552

Query: 2083 VTDGLIARPLPNRICPGYATVSLAS 2157
            VTDGLIARPLPNRI PG+   S AS
Sbjct: 553  VTDGLIARPLPNRIRPGHRDASTAS 577


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  665 bits (1717), Expect = 0.0
 Identities = 335/466 (71%), Positives = 385/466 (82%)
 Frame = +1

Query: 748  ETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCREL 927
            E +++ V E+RGGK  ++IPD  +DEF+V    + +     S      ++ +LW+QC+EL
Sbjct: 28   EKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGIGAVGAPKSS-----TLPNLWLQCKEL 82

Query: 928  FMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAAL 1107
            F+ LMLPEGFP SVT+DYL+Y+LWRGVQGVA+Q+SGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 83   FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 142

Query: 1108 NWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXX 1287
            NWVLKDGIGYLSKILLSKYGRHFDV+PKGWRLFADLLENAAYG+EILTPAFPH F+ I  
Sbjct: 143  NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 202

Query: 1288 XXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQ 1467
                           TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN I 
Sbjct: 203  VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 262

Query: 1468 SSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDE 1647
            SS PL+ +SF VVT +HM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSG VPS+KEVN+E
Sbjct: 263  SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEE 322

Query: 1648 EPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDL 1827
            EPLFP VP LN KP  +  + VLS +AKDAAA IERRLQLGSKLS+VV ++ED LALFDL
Sbjct: 323  EPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDL 382

Query: 1828 FRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGG 2007
            +R+E YILTEH+GRF V LKES SPQDMLKS+F V+YLYWLE+NAGI S   SDDC+PGG
Sbjct: 383  YRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGG 442

Query: 2008 RLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPGYATV 2145
            RL++SLEYVQREFNH+K+D E  GW TDGLIARPLPNRI PG+  +
Sbjct: 443  RLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHIVI 488


>gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 573

 Score =  661 bits (1705), Expect = 0.0
 Identities = 337/505 (66%), Positives = 392/505 (77%)
 Frame = +1

Query: 643  FIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRD 822
            F+ FLS        S VA    ++ +S ++    +  +D VV E++G K  +LIPD+  D
Sbjct: 103  FLLFLS--------SFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSED 154

Query: 823  EFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWR 1002
             F+    ++             LS+ ++W QCR++ M L+LPEGFP+SVT+DYL+YSLWR
Sbjct: 155  AFVASNGIV--------NLTKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWR 206

Query: 1003 GVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDV 1182
            GVQGVA+Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDV
Sbjct: 207  GVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDV 266

Query: 1183 NPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFA 1362
            NPKGWRLFADLLENAA+G+E+LTPAFPHLFVPI                 TRSCFYAGFA
Sbjct: 267  NPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFA 326

Query: 1363 AQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSY 1542
            AQRNFAEVIAKGEAQGMVSKSIGI+LGIALAN + SST LAL+SFGVVTW+HMYCNLKSY
Sbjct: 327  AQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSY 386

Query: 1543 QSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSE 1722
            QSIQLRTLN YRA LVFSEYLLSG  PS+KEVNDEEPLFPAVPFLN+  A RE + VLS 
Sbjct: 387  QSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSS 446

Query: 1723 DAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSP 1902
            +AK AAA+IERRLQLGSKLSD+V N+EDALALF L++DEGYILTEHEG+FC         
Sbjct: 447  EAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC--------- 497

Query: 1903 QDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGW 2082
                     V+YLYWLE+NAGI++S  S DC+PGGRL++S+EYVQREFNHVK D E+ GW
Sbjct: 498  ---------VNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGW 548

Query: 2083 VTDGLIARPLPNRICPGYATVSLAS 2157
            VTDGLIARPLPNRI PG+   S AS
Sbjct: 549  VTDGLIARPLPNRIRPGHRDASTAS 573


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  649 bits (1674), Expect = 0.0
 Identities = 335/488 (68%), Positives = 391/488 (80%)
 Frame = +1

Query: 670  FLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVL 849
            F++ +   AS A A+T      E   E  +  V  ++G K+I LIPD+ +DEF+V  ++ 
Sbjct: 50   FVALWLQSASSAFARTTL---KEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLP 106

Query: 850  ASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQM 1029
            +S+    S   L     +LW+QCR LF+ LMLPEG+P SVT+DYL+YSLWRGVQGVA+Q+
Sbjct: 107  SSYDDIISSSWLHFG-RTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQI 165

Query: 1030 SGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFA 1209
            SGVLATQALLYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFA
Sbjct: 166  SGVLATQALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFA 225

Query: 1210 DLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVI 1389
            DLLENAA+G+EILTPAFPHLFV I                 TRSCFYAGFAAQRNFAEVI
Sbjct: 226  DLLENAAFGLEILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVI 285

Query: 1390 AKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLN 1569
            AKGEAQGMVSK IGIMLGI LAN I SS PLAL+SF VVTWIHM+CNLKSYQSIQLRTLN
Sbjct: 286  AKGEAQGMVSKFIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLN 345

Query: 1570 PYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANI 1749
            PYRA LVFSEYLLSG  P +K+VNDEEPLFPAV F + K A +    VLS +A+DAA  I
Sbjct: 346  PYRASLVFSEYLLSGQAPPIKDVNDEEPLFPAV-FPHFKSADKPSLVVLSLEARDAATEI 404

Query: 1750 ERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQ 1929
            ERRLQLGSKLSDVV ++ED LALF+L++DEGYILTE++GRFCV LKES S QDMLK+LFQ
Sbjct: 405  ERRLQLGSKLSDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQ 464

Query: 1930 VSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARP 2109
            V+YLYWLE+NAG+ +  TS DC+ GGRL++SLEY+QREF+HV++D  + GWV DGLIARP
Sbjct: 465  VNYLYWLERNAGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARP 524

Query: 2110 LPNRICPG 2133
            LPNRI PG
Sbjct: 525  LPNRIYPG 532


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  645 bits (1665), Expect = 0.0
 Identities = 329/466 (70%), Positives = 377/466 (80%), Gaps = 1/466 (0%)
 Frame = +1

Query: 742  RNETKDKVVCEIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQ-C 918
            R ++    V E++GGK I L+P+   D F+V     ++   R       +S  +LW++ C
Sbjct: 113  RAQSLSSSVWEVKGGKWILLVPNDLDDTFVVDSLFPSTSSTR------PVSPLNLWLEKC 166

Query: 919  RELFMNLMLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTA 1098
            R+L M LMLPEG+PESVT+DYL+YSLWR VQGVA+Q+S VLATQ+LLYAVGLGKGAIPTA
Sbjct: 167  RQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYAVGLGKGAIPTA 226

Query: 1099 AALNWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVP 1278
            AALNWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA+G E+LTPAFPHLFVP
Sbjct: 227  AALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEMLTPAFPHLFVP 286

Query: 1279 IXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN 1458
            I                 TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGI +GI LAN
Sbjct: 287  IGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIAMGIGLAN 346

Query: 1459 YIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEV 1638
             I +STPLAL+SF VVT+IHMYCNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P +KEV
Sbjct: 347  CIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKEV 406

Query: 1639 NDEEPLFPAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDVVKNREDALAL 1818
            NDE+PLFPAVP LN+KP  +E   VLS +AK AAA I+ RL LGSKLSDVV N +D LAL
Sbjct: 407  NDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSDVVNNHKDVLAL 466

Query: 1819 FDLFRDEGYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCK 1998
            FDL+R+EGYILTEH GRFCV LKE+ SP DMLK++F V+YLYWLEKNAGI  +S   D K
Sbjct: 467  FDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAGIDGASPYLDSK 526

Query: 1999 PGGRLRMSLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPGY 2136
            PGGRL++SL+YV+REFNHVK DGE+AGW TDGLIARPLPNRI PG+
Sbjct: 527  PGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPGF 572


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  642 bits (1657), Expect = 0.0
 Identities = 328/499 (65%), Positives = 380/499 (76%), Gaps = 1/499 (0%)
 Frame = +1

Query: 640  LFIFFLSASTFLSFYSCV-ASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYD 816
            LF+  L +S    F+  + A  A A+T S  SS   NE   + + E++GG  I+L PD+ 
Sbjct: 92   LFLSLLCSSVICYFFQLLLAKFAMARTPSSCSSSIENEILKQPIWEVKGGNFIKLFPDHL 151

Query: 817  RDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSL 996
            +D FI       S     S   +      L+ +C+E  + LMLPEGFP SVT+DYLEYSL
Sbjct: 152  KDIFIASNPTFFS---ELSSLNVSQVPSFLYTKCKEFTVRLMLPEGFPNSVTSDYLEYSL 208

Query: 997  WRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHF 1176
            WRGVQGVA Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKILLS +GRHF
Sbjct: 209  WRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHF 268

Query: 1177 DVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAG 1356
            DVNPKGWRLFADLLENAA+G+E+ TPAFPHLFVPI                 TRSCF+AG
Sbjct: 269  DVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGASRSAASLIQASTRSCFFAG 328

Query: 1357 FAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLK 1536
            FAAQRNFAEVIAKGE QGM S+ IGI LGI L N I SSTPL L+SF VVTW+HMYCNLK
Sbjct: 329  FAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLK 388

Query: 1537 SYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVL 1716
            SYQSIQLRTLNPYRA LVFSEYLLSG  P VKEVNDEEPLFPA+P LN   A +  + VL
Sbjct: 389  SYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPALPILNACFANKAQSIVL 448

Query: 1717 SEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESS 1896
            S +AKDAA  IE RLQLGSKLS+++ N+E+ LALF L+++EGYIL+EH G+FCV LKE+ 
Sbjct: 449  SSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNEGYILSEHTGKFCVVLKENC 508

Query: 1897 SPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAA 2076
            S  DMLK+LFQV+YLYWLEKNAGI+      DCKPGGRLR+SLEY +REFNH ++DGE+A
Sbjct: 509  SQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRISLEYAEREFNHARNDGESA 568

Query: 2077 GWVTDGLIARPLPNRICPG 2133
            GW+ DGLIARPLPNRI PG
Sbjct: 569  GWIADGLIARPLPNRIRPG 587


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  642 bits (1656), Expect = 0.0
 Identities = 334/500 (66%), Positives = 390/500 (78%)
 Frame = +1

Query: 655  LSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRDEFIV 834
            L +S FL+  +C       + A  ++SE   E+    V E++GGK  +L PD+ RD F+ 
Sbjct: 109  LFSSIFLAAVACC--FCHLRLAYALASEEDAES----VWEVKGGKWTKLAPDFVRDAFVA 162

Query: 835  PKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLWRGVQG 1014
                       G      +S  SL +QC+ LF+ LMLPEGFP+SVT+DYL+YSLWR VQG
Sbjct: 163  D----------GGGGLGSISFESLGLQCKSLFVQLMLPEGFPDSVTSDYLDYSLWRAVQG 212

Query: 1015 VAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFDVNPKG 1194
            VA+Q+SGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKI+LSKYGRHFDVNPKG
Sbjct: 213  VASQVSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKG 272

Query: 1195 WRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRN 1374
            WRLFADLLENAA+GME+LTP FP+ F+ I                 TRSCFYAGFAAQRN
Sbjct: 273  WRLFADLLENAAFGMEMLTPVFPNHFLLIGAAAGAGRSAAALIQAATRSCFYAGFAAQRN 332

Query: 1375 FAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKSYQSIQ 1554
            FAEVIAKGEAQGMVSK IGIMLGIALAN I SST L L+SF +VT IHM+CNLKSYQ+IQ
Sbjct: 333  FAEVIAKGEAQGMVSKFIGIMLGIALANQIGSSTSLGLASFSLVTCIHMFCNLKSYQAIQ 392

Query: 1555 LRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLSEDAKD 1734
            LRTLNPYRA LVFSEYLLSG  P VK+VN+EEPLFPAVPFLN KPA +    VLS +AKD
Sbjct: 393  LRTLNPYRASLVFSEYLLSGQAPPVKDVNEEEPLFPAVPFLNWKPANKGQPTVLSSEAKD 452

Query: 1735 AAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSSPQDML 1914
            AAA IE+RLQLG KLSD++ N+ED  ALF+L+++EGYILTEH GR+CV LKE+SS QDML
Sbjct: 453  AAAEIEQRLQLGCKLSDLINNKEDVHALFNLYKEEGYILTEHRGRYCVVLKETSSLQDML 512

Query: 1915 KSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAGWVTDG 2094
            K+LF V+YLYWLEKNAGI++  TS DC+PGGRL MSL+YV+REF+ +K+DGE+ GWVTDG
Sbjct: 513  KALFHVNYLYWLEKNAGIEAKGTSIDCRPGGRLEMSLDYVRREFDIIKTDGESVGWVTDG 572

Query: 2095 LIARPLPNRICPGYATVSLA 2154
            LIARP PNRI P Y   S+A
Sbjct: 573  LIARPAPNRIRPVYEASSVA 592


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  639 bits (1647), Expect = e-180
 Identities = 332/509 (65%), Positives = 384/509 (75%)
 Frame = +1

Query: 631  VRQLFIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPD 810
            +R L    L  S F  F    AS       SD    T  ET    V E+RG K+  L+PD
Sbjct: 157  LRFLCFLVLVLSCFFHFRLSAASAVAKAENSDSDDSTEKET----VWEVRGSKRKRLVPD 212

Query: 811  YDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEY 990
            + +DEF+  +           E    L+  +L  QCR L    +LPEG+P SVT+DYL+Y
Sbjct: 213  FVKDEFVSEEAAF--------ELSSSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDY 264

Query: 991  SLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGR 1170
            SLWRGVQG+A+Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGR
Sbjct: 265  SLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGR 324

Query: 1171 HFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFY 1350
            HFDV+PKGWRLFADLLENAA+GME+LTP FP  FV I                 TRSCF 
Sbjct: 325  HFDVHPKGWRLFADLLENAAFGMEMLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFN 384

Query: 1351 AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCN 1530
            AGFA+QRNFAEVIAKGEAQGMVSKS+GI+LGI +AN I +ST LAL++FGVVT IHMY N
Sbjct: 385  AGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTN 444

Query: 1531 LKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAE 1710
            LKSYQ IQLRTLNPYRA LVFSEYL+SG  P +KEVNDEEPLFPAV FLNIK   +    
Sbjct: 445  LKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDF 504

Query: 1711 VLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKE 1890
            VLS +AK AAA+IE RLQLGSKLSDV+ N+E+A+ALFDL+R+EGYILTEH GRFCV LKE
Sbjct: 505  VLSSEAKSAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKE 564

Query: 1891 SSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGE 2070
            SSSPQDML+SLFQV+YLYWLEKNAGI+ +ST  DCKPGGRL +SL+YV+REF H K D E
Sbjct: 565  SSSPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSE 624

Query: 2071 AAGWVTDGLIARPLPNRICPGYATVSLAS 2157
            + GWVT+GLIARPLP RI  GY +  L+S
Sbjct: 625  SVGWVTEGLIARPLPTRIRLGYDSEPLSS 653


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  632 bits (1629), Expect = e-178
 Identities = 327/509 (64%), Positives = 384/509 (75%)
 Frame = +1

Query: 631  VRQLFIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPD 810
            +R      L  S F  F    AS     + SD S +T  ET    V E+RG K+  L+PD
Sbjct: 113  LRYFCFLVLGLSCFFHFRLSAASAIAKASDSDSSGDTDKET----VWEVRGSKRKRLVPD 168

Query: 811  YDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEY 990
            + +DEF+  ++          E    L+  +L  QCR L    +LPEGFP SVT+DYL+Y
Sbjct: 169  FVKDEFVSEESAF--------ELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDY 220

Query: 991  SLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGR 1170
            SLWRGVQG+A+Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGR
Sbjct: 221  SLWRGVQGIASQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGR 280

Query: 1171 HFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFY 1350
            HFDV+PKGWRLFADLLENAA+GME+LTP FP  FV I                 TRSCF 
Sbjct: 281  HFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFN 340

Query: 1351 AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCN 1530
            AGFA+QRNFAEVIAKGEAQGMVSKS+GI+LGI +AN I +ST LAL++FGVVT IHMY N
Sbjct: 341  AGFASQRNFAEVIAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTN 400

Query: 1531 LKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAE 1710
            LKSYQ IQLRTLNPYRA LVFSEYL+SG  P +KEVNDEEPLFP V FLN+K   +    
Sbjct: 401  LKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDF 460

Query: 1711 VLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKE 1890
            VLS +AK AA +IE RLQLGSKLSDV+ N+E+A+ALFDL+R+EGYILTEH GRFCV LKE
Sbjct: 461  VLSSEAKAAAEDIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKE 520

Query: 1891 SSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGE 2070
            SS+PQDML+SLFQV+YLYWLEKNAGI+ +ST  DCKPGGRL +SL+YV+REF H K D +
Sbjct: 521  SSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQ 580

Query: 2071 AAGWVTDGLIARPLPNRICPGYATVSLAS 2157
            + GWVT+GLIARPLP RI  G+    L+S
Sbjct: 581  SVGWVTEGLIARPLPTRIRLGHDREPLSS 609


>ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  632 bits (1629), Expect = e-178
 Identities = 324/502 (64%), Positives = 383/502 (76%)
 Frame = +1

Query: 631  VRQLFIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPD 810
            +R L    L  S F  F    AS      A D +S++  +   + V E+RG K+  L+PD
Sbjct: 107  LRYLCFLLLGLSCFFHFRLSAASAI----AKDQNSDSNGDAVKETVWEVRGSKRKRLVPD 162

Query: 811  YDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEY 990
            + +DEF+  ++          E    L+  +L  QCR L    +LPEGFP SVT+DYL+Y
Sbjct: 163  FVKDEFVSEESAF--------ELSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDY 214

Query: 991  SLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGR 1170
            SLWRGVQG+A+Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGR
Sbjct: 215  SLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGR 274

Query: 1171 HFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFY 1350
            HFDV+PKGWRLFADLLENAA+GME+LTP FP  FV I                 TRSCF 
Sbjct: 275  HFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFN 334

Query: 1351 AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCN 1530
            AGFA+QRNFAEVIAKGEAQGMVSKS+GI+LGI +AN I +ST LAL++FGVVT IHMY N
Sbjct: 335  AGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTN 394

Query: 1531 LKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAE 1710
            LKSYQ IQLRTLNPYRA LVFSEYL+SG  P +KEVNDEEPLFP V F N+K   +    
Sbjct: 395  LKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDF 454

Query: 1711 VLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKE 1890
            VLS +AK AAA+IE RLQLGSKLSDV+ N+E+A+ALFDL+R+EGYILTEH+GRFCV LKE
Sbjct: 455  VLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKE 514

Query: 1891 SSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGE 2070
            SS+PQDML+SLFQV+YLYWLEKNAGI+ +ST  DCKPGGRL +SL+YV+REF H K D E
Sbjct: 515  SSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSE 574

Query: 2071 AAGWVTDGLIARPLPNRICPGY 2136
            + GWVT+GLIARPLP RI  G+
Sbjct: 575  SVGWVTEGLIARPLPTRIRLGH 596


>ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
            gi|355513788|gb|AES95411.1| hypothetical protein
            MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  630 bits (1626), Expect = e-178
 Identities = 329/499 (65%), Positives = 383/499 (76%), Gaps = 1/499 (0%)
 Frame = +1

Query: 640  LFIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDR 819
            LF    S+ TF     C+  +A AKT S +SSE  ++   + + E++GG  I+L PD  +
Sbjct: 90   LFTLLFSSVTF-----CLCQLAMAKTRS-LSSE--DDILTQPIYEVKGGNLIKLFPDNLK 141

Query: 820  DEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLW 999
            D FI     L S     +  ++      L+ +CRE  + LMLPEGFP SVT+DYLEYSLW
Sbjct: 142  DIFIASNPGLFSELSSLNSSQVPTF---LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLW 198

Query: 1000 RGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFD 1179
            RGVQGVA Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKILLS +GRHFD
Sbjct: 199  RGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFD 258

Query: 1180 VNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGF 1359
            VNPKGWRLFADLLENAA+G+E+ TPAFPHLFVPI                 TRSCF+AGF
Sbjct: 259  VNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGF 318

Query: 1360 AAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKS 1539
            AAQRNFAEVIAKGE QGMVS+ IGI +GI L N I SSTPL L+SF VVTW+HMYCNLKS
Sbjct: 319  AAQRNFAEVIAKGEVQGMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKS 378

Query: 1540 YQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAE-VL 1716
            YQSIQLRTLNP+RA LVFSEYLLSG  P VKEVN EEPLFPAVP LN   A +E    VL
Sbjct: 379  YQSIQLRTLNPHRASLVFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVL 438

Query: 1717 SEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESS 1896
            S +AKDAA  IE RLQLGSKLS+++ N+E+ LALF L+++EGYIL+EH G+FCV LKE+ 
Sbjct: 439  SSEAKDAAVEIESRLQLGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKETC 498

Query: 1897 SPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAA 2076
            S  DMLK+LFQV+YLYWLEKNAGI+   T  DCKPGGRL++SLEY +REFNHV++DGE+ 
Sbjct: 499  SQLDMLKALFQVNYLYWLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGESV 558

Query: 2077 GWVTDGLIARPLPNRICPG 2133
            GW+TDGLIARPLPNR  PG
Sbjct: 559  GWITDGLIARPLPNRCRPG 577


>gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  628 bits (1619), Expect = e-177
 Identities = 331/508 (65%), Positives = 383/508 (75%), Gaps = 3/508 (0%)
 Frame = +1

Query: 634  RQLFIFFLSASTFLSF-YSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPD 810
            R LF+  L +S    F +  +  +A AKT S   S + NE   + V E++GGK   L+PD
Sbjct: 88   RILFLSLLCSSAVCFFGHLLLVKLANAKTWS---SSSDNELLSEPVWEVKGGKWTRLVPD 144

Query: 811  YDRDEFIVPKTVLASWWVRGSEQKLKLSVGS--LWMQCRELFMNLMLPEGFPESVTTDYL 984
               D F+     L +       Q LK S  +  +W++CR++F  LMLPEGFPESVT+DYL
Sbjct: 145  PTNDVFVSAHPGLLA-----ELQSLKPSQFATFVWLKCRDIFTRLMLPEGFPESVTSDYL 199

Query: 985  EYSLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKY 1164
            EYSLWR VQGVA Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +
Sbjct: 200  EYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNF 259

Query: 1165 GRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSC 1344
            GRHFDVNPKGWRLFADLLENAA+G+E+ TPAFP  FV I                 TRSC
Sbjct: 260  GRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSC 319

Query: 1345 FYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMY 1524
            F+AGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N I SSTPL L+SF V+TWIHMY
Sbjct: 320  FFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFIVLTWIHMY 379

Query: 1525 CNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACREL 1704
            CNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P VK+VNDEEPLFPAVP LN   A +  
Sbjct: 380  CNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNDEEPLFPAVPILNATFANKAR 439

Query: 1705 AEVLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVAL 1884
            +  LS +AKDAAA IERRLQLGSKLS++V  +ED LALF L++ EGYIL+EH G+FCV L
Sbjct: 440  SIALSSEAKDAAAEIERRLQLGSKLSEIVNGKEDVLALFRLYKKEGYILSEHMGKFCVVL 499

Query: 1885 KESSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSD 2064
            KE+ S QDMLK+LFQV+YLYWLEKNAGI    T +D +PGGRL  SL+YV+REFNH+K+D
Sbjct: 500  KENCSQQDMLKALFQVNYLYWLEKNAGIGGRGTLNDSRPGGRLHTSLDYVEREFNHLKND 559

Query: 2065 GEAAGWVTDGLIARPLPNRICPGYATVS 2148
            GE+ GWVTDGLIARPLPNRI  G  T S
Sbjct: 560  GESVGWVTDGLIARPLPNRIRIGDTTSS 587


>ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum]
            gi|557096914|gb|ESQ37422.1| hypothetical protein
            EUTSA_v10002446mg [Eutrema salsugineum]
          Length = 611

 Score =  627 bits (1616), Expect = e-176
 Identities = 327/495 (66%), Positives = 382/495 (77%), Gaps = 1/495 (0%)
 Frame = +1

Query: 643  FIFFLSASTFLSFYSCVASVAEAKTASDISSETRNETKDKVVCEIRGGKKIELIPDYDRD 822
            F+F + +  F    S   ++A+A       S++  +T+ + V E+RG K+  L+PD+ RD
Sbjct: 117  FLFLVYSCFFQLRLSAAIAIAKAP-----ESDSNGDTEKETVWEVRGSKRKRLVPDFVRD 171

Query: 823  EFIV-PKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPEGFPESVTTDYLEYSLW 999
            EF V P+   +S           L+  +L  QCR L    +LPEGFP SVT+DYL+YSLW
Sbjct: 172  EFFVSPEETTSS----------PLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLW 221

Query: 1000 RGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGIGYLSKILLSKYGRHFD 1179
            RGVQG+A+Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFD
Sbjct: 222  RGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFD 281

Query: 1180 VNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGF 1359
            V+PKGWRLFADLLEN+A+GME+LTP FP  FV I                 TRSCF AGF
Sbjct: 282  VHPKGWRLFADLLENSAFGMEMLTPLFPQFFVLIGAAAGAGRSAAALIQAATRSCFNAGF 341

Query: 1360 AAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALSSFGVVTWIHMYCNLKS 1539
            A+QRNFAEVIAKGEAQGMVSKSIGI+LGI +AN I +ST LAL+SFGVVT IHMY NLKS
Sbjct: 342  ASQRNFAEVIAKGEAQGMVSKSIGILLGIVVANCIGTSTSLALASFGVVTSIHMYTNLKS 401

Query: 1540 YQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVPFLNIKPACRELAEVLS 1719
            YQ IQLRTLNPYRA LVFSEYL+SG  P +KEVNDEEPLFP V  LNIK A +    VLS
Sbjct: 402  YQCIQLRTLNPYRASLVFSEYLISGQAPPIKEVNDEEPLFPTVRSLNIKSAEKRQDFVLS 461

Query: 1720 EDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYILTEHEGRFCVALKESSS 1899
             +AK AAA+IE RLQLGSKLSDVV N+E+A+ALFDL+RDEGYILTEH GRFCV LKESSS
Sbjct: 462  SEAKAAAADIEERLQLGSKLSDVVHNKEEAVALFDLYRDEGYILTEHRGRFCVMLKESSS 521

Query: 1900 PQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEYVQREFNHVKSDGEAAG 2079
            PQDML+SLFQV+YLYWLEKNAGI++S+T  DCKPGGRL +SL+YV+REF   K D E  G
Sbjct: 522  PQDMLRSLFQVNYLYWLEKNAGIEASNTYLDCKPGGRLHISLDYVRREFELAKEDSELVG 581

Query: 2080 WVTDGLIARPLPNRI 2124
            WVT+GLIARPL  RI
Sbjct: 582  WVTEGLIARPLSTRI 596


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  626 bits (1615), Expect = e-176
 Identities = 312/406 (76%), Positives = 349/406 (85%)
 Frame = +1

Query: 940  MLPEGFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVL 1119
            MLP+GFP SVT+DYL+YSLWR VQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 1120 KDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXX 1299
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 1300 XXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTP 1479
                       TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN I SSTP
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 1480 LALSSFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLF 1659
            LAL+SF VVTWIHM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P VKE+NDEEPLF
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 1660 PAVPFLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDE 1839
            PAVPFLNI       + VLS +A++AAA IE+RLQLGSKLSDVV N++D LALF+L+RDE
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 1840 GYILTEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRM 2019
            GYILTEH+GRFCV LKESSSP DMLKSLFQV+YLYWLE+NAGI++ S S DC+P GRL++
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 2020 SLEYVQREFNHVKSDGEAAGWVTDGLIARPLPNRICPGYATVSLAS 2157
            SLEY +REFNHVK+D  + GWV DGLIARP P R+CPG    S+AS
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGNIASSIAS 406


>ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  625 bits (1613), Expect = e-176
 Identities = 327/514 (63%), Positives = 384/514 (74%), Gaps = 1/514 (0%)
 Frame = +1

Query: 595  NENDTFVIIRDTVRQLFIFFLSASTFLSF-YSCVASVAEAKTASDISSETRNETKDKVVC 771
            N  D+F    ++   LF+  L +S    F +  +A +A+AKT S  SS   +   + V  
Sbjct: 73   NPFDSFDSNSNSHHTLFLSMLCSSALCFFGHLLLAKLAKAKTLSSSSSSDTSLFSEPVY- 131

Query: 772  EIRGGKKIELIPDYDRDEFIVPKTVLASWWVRGSEQKLKLSVGSLWMQCRELFMNLMLPE 951
            E++GGK  +L+PD   D F+  +    S     S  K       +W++C ++F  LMLPE
Sbjct: 132  EVKGGKWTKLVPDPTDDVFVSAQQGFLS---ELSSLKPSQLATFVWLKCSDIFTRLMLPE 188

Query: 952  GFPESVTTDYLEYSLWRGVQGVAAQMSGVLATQALLYAVGLGKGAIPTAAALNWVLKDGI 1131
            GFPESVT+DYLEYSLWR VQGVA Q+SGVLATQ+LLYAVGLGKGAIPTAAA+NWVLKDGI
Sbjct: 189  GFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGI 248

Query: 1132 GYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXXXXXX 1311
            GYLSKI+LS +GRHFDVNPKGWRLFADLLENAA+G+E+ TPA P  FV I          
Sbjct: 249  GYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMSTPACPQFFVLIGAVAGASRSA 308

Query: 1312 XXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANYIQSSTPLALS 1491
                   TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI+LGI L N I SSTPL L+
Sbjct: 309  ASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIVLGIGLGNCIGSSTPLVLA 368

Query: 1492 SFGVVTWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLFPAVP 1671
            SF V+TWIHMYCNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P VKEVNDEEPLFPAVP
Sbjct: 369  SFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPAVP 428

Query: 1672 FLNIKPACRELAEVLSEDAKDAAANIERRLQLGSKLSDVVKNREDALALFDLFRDEGYIL 1851
             LN   A +  +  LS +AKDAAA IE RLQLGSKLS++V ++ED LALF L+++EGYIL
Sbjct: 429  ILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEIVNSKEDVLALFGLYKNEGYIL 488

Query: 1852 TEHEGRFCVALKESSSPQDMLKSLFQVSYLYWLEKNAGIKSSSTSDDCKPGGRLRMSLEY 2031
            +EH G++ V LKE  S  DMLK+LFQV+YLYWLEKNAGI+   T +D KPGGRL +SL+Y
Sbjct: 489  SEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGIEGRGTLNDSKPGGRLHISLDY 548

Query: 2032 VQREFNHVKSDGEAAGWVTDGLIARPLPNRICPG 2133
            V+REFNHVK+DGE  GWVTDGLIARPLPNRIC G
Sbjct: 549  VEREFNHVKNDGELVGWVTDGLIARPLPNRICIG 582


Top