BLASTX nr result

ID: Rehmannia23_contig00004494 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00004494
         (2416 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   670   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   669   0.0  
gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma caca...   653   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   640   e-180
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              639   e-180
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   633   e-178
gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]    624   e-176
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   619   e-174
gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]    615   e-173
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   610   e-172
gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus...   608   e-171
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   605   e-170
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     605   e-170
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   603   e-169
ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786...   602   e-169
ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thali...   602   e-169
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   601   e-169
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   598   e-168
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   597   e-167
ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A...   595   e-167

>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  670 bits (1728), Expect = 0.0
 Identities = 355/520 (68%), Positives = 404/520 (77%), Gaps = 3/520 (0%)
 Frame = -3

Query: 2066 RRPSFLLPFHFIFSQEE---DSYSICLPKHIYXXXXXXXXXLGYFIFSSSAARAKTDDLS 1896
            RR   LLP   IF  E+   DS   C P  ++             +  +S  +AKT    
Sbjct: 92   RRSLLLLP---IFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLL--ASFVQAKT---- 142

Query: 1895 RNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDLA 1716
             N+ E++ EIRGGKR ELVPDYSKDEF++ + M                 ++WM+C++L 
Sbjct: 143  -NNGEIVHEIRGGKRFELVPDYSKDEFVLTKTMWSRLLPDSKSGSFVS--NLWMQCKELT 199

Query: 1715 MSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVN 1536
             +++LPEGFP+SVTSDYLEY+LWRGVQG+AAQISGVLATQALLYAVGLGKGAIPTAAAVN
Sbjct: 200  TTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAVN 259

Query: 1535 WVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXX 1356
            WVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAFPHLFV I   
Sbjct: 260  WVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAV 319

Query: 1355 XXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQS 1176
                         ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGIMLGIALAN  +S
Sbjct: 320  AGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANCTRS 379

Query: 1175 STALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEE 996
            ST+L+LASFGV+TWIHMFCNLKSY SIQLRTLNPYRASLVFS+YLLSGLVPSVKEVNDEE
Sbjct: 380  STSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLVPSVKEVNDEE 439

Query: 995  PFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELY 816
            P FPA  +L LK   E Q+EVLS  AK AA+ I RRLQLGSKLSD+  SRE+ +AL ELY
Sbjct: 440  PLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSREDVLALFELY 498

Query: 815  KAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIFDDCRPGSR 636
            K EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE  AGIKS+++ +DCRPG R
Sbjct: 499  KNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSSVANDCRPGGR 558

Query: 635  LQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            LQ+SLEYV+REFNHVK DGE+AGWV D LIARP PNRIR+
Sbjct: 559  LQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRL 598


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  669 bits (1726), Expect = 0.0
 Identities = 344/472 (72%), Positives = 388/472 (82%)
 Frame = -3

Query: 1931 SSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 1752
            +S  +AKT     N+ E+++EIRGGKR ELVPDYSKDEF++ + M               
Sbjct: 132  ASFVQAKT-----NNGEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWPDSTSGSFVS 186

Query: 1751 XGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1572
              ++WM+C++L  ++ LPEGFPESVTSDYLEY+LWRGVQGIAAQISGVLATQALLYAVGL
Sbjct: 187  --NLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGL 244

Query: 1571 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1392
            GKGAIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTP
Sbjct: 245  GKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTP 304

Query: 1391 AFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1212
            AFPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGI
Sbjct: 305  AFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGI 364

Query: 1211 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1032
            MLGIALAN  +SST+L+LASFGV+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 365  MLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 424

Query: 1031 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 852
            LVPSVKEVNDEEP FPA  +L LK   E Q EVLS  AK AA+ I RRLQLGSKLSD+  
Sbjct: 425  LVPSVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVAT 483

Query: 851  SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 672
            S+E+ +AL ELYK EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE NAGIKS
Sbjct: 484  SQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKS 543

Query: 671  TTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            +++ +DCRPG RLQ+SLEYV+REFNHVK DGE+AGWV D LIARP P RIR+
Sbjct: 544  SSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRL 595


>gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  653 bits (1685), Expect = 0.0
 Identities = 334/479 (69%), Positives = 389/479 (81%)
 Frame = -3

Query: 1928 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1749
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1748 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1569
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1568 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1389
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1388 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1209
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1208 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1029
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1028 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 849
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 848  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 669
            +E+A+AL  LYK EGYIL E +G++CVVLKESS  QDMLKSLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERNAGIEAS 531

Query: 668  TIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 492
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 532  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 590


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  640 bits (1650), Expect = e-180
 Identities = 327/480 (68%), Positives = 379/480 (78%)
 Frame = -3

Query: 1946 YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 1767
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 218  FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 268

Query: 1766 XXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1587
                   ++W++C++L + +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 269  PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 328

Query: 1586 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1407
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 329  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 388

Query: 1406 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1227
            EILTPAFPH F+LI                +TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 389  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 448

Query: 1226 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1047
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 449  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 508

Query: 1046 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 867
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 509  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 568

Query: 866  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 687
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 569  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 628

Query: 686  AGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 507
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G++
Sbjct: 629  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  639 bits (1649), Expect = e-180
 Identities = 327/479 (68%), Positives = 378/479 (78%)
 Frame = -3

Query: 1946 YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 1767
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 16   FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 66

Query: 1766 XXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1587
                   ++W++C++L + +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 67   PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 126

Query: 1586 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1407
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 127  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 186

Query: 1406 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1227
            EILTPAFPH F+LI                +TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 187  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 246

Query: 1226 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1047
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 247  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 306

Query: 1046 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 867
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 307  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 366

Query: 866  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 687
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 367  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 426

Query: 686  AGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 510
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G+
Sbjct: 427  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 485


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  633 bits (1632), Expect = e-178
 Identities = 324/487 (66%), Positives = 379/487 (77%), Gaps = 4/487 (0%)
 Frame = -3

Query: 1934 SSSAARAKT---DDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXX 1764
            S+S+A A+T   +      E+ ++ ++G KR+ L+PD+ KDEF++  ++           
Sbjct: 57   SASSAFARTTLKEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLPSSYDDIISSS 116

Query: 1763 XXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLY 1584
                   +W++CR L + +MLPEG+P SVTSDYL+YSLWRGVQG+A+QISGVLATQALLY
Sbjct: 117  WLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLY 176

Query: 1583 AVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME 1404
            A+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+E
Sbjct: 177  AIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLE 236

Query: 1403 ILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 1224
            ILTPAFPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK
Sbjct: 237  ILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 296

Query: 1223 SIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQY 1044
             IGIMLGI LAN + SS  L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+Y
Sbjct: 297  FIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEY 356

Query: 1043 LLSGLVPSVKEVNDEEPFFPA-FPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 867
            LLSG  P +K+VNDEEP FPA FP    K   +  + VLS +A+DAA+ I+RRLQLGSKL
Sbjct: 357  LLSGQAPPIKDVNDEEPLFPAVFP--HFKSADKPSLVVLSLEARDAATEIERRLQLGSKL 414

Query: 866  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 687
            SD+V S+E+ +AL  LYK EGYIL E KGR+CVVLKES SAQDMLK+LFQV YLYWLERN
Sbjct: 415  SDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERN 474

Query: 686  AGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 507
            AG+ +     DCR G RLQ+SLEY+QREF+HV+ND    GWV DGLIARP PNRI  G+ 
Sbjct: 475  AGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGDL 534

Query: 506  IASPAVS 486
            +AS  VS
Sbjct: 535  VASSIVS 541


>gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 577

 Score =  624 bits (1608), Expect = e-176
 Identities = 322/479 (67%), Positives = 377/479 (78%)
 Frame = -3

Query: 1928 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1749
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1748 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1569
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1568 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1389
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1388 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1209
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1208 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1029
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1028 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 849
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 848  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 669
            +E+A+AL  LYK EGYIL E +G++C              SLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWLERNAGIEAS 517

Query: 668  TIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 492
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 518  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 576


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  619 bits (1597), Expect = e-174
 Identities = 312/470 (66%), Positives = 367/470 (78%)
 Frame = -3

Query: 1922 ARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGD 1743
            A +  DD ++ ++ V +E++G KR +L+PD++KD F++                      
Sbjct: 116  ATSSEDDGNKEYDAV-WEVKGSKRTKLIPDFTKDAFVVAS------ASNASLSSLLSVNK 168

Query: 1742 VWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKG 1563
            +W +CR+L +  MLPEGFP+SVTSDYL YSLWR VQG+A+QISGVLATQALLYA+GLGKG
Sbjct: 169  LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 228

Query: 1562 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1383
            AIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+LTPAFP
Sbjct: 229  AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 288

Query: 1382 HLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1203
            H FV I                +TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IGIMLG
Sbjct: 289  HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 348

Query: 1202 IALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVP 1023
            IALAN + SS   +LASF V+TWIHM+CNLKSYQSI+LRTLNPYRASLVFS+YLLSG  P
Sbjct: 349  IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 408

Query: 1022 SVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSRE 843
             VKEVNDEEP FPAF    +K  ++ Q+ VLS++AKDAA  I+ RLQLGSKLSD+V ++E
Sbjct: 409  PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 468

Query: 842  EAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTI 663
            +A AL  LY+ EGYIL E  G++CVVLKES+  QDMLKSLFQ  YLYWLERNAGI +T+ 
Sbjct: 469  DAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATST 528

Query: 662  FDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG 513
              DC PG RL+ISL+YVQREFNHVK+D    GWV DGLIARP PNRIR G
Sbjct: 529  SADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 578


>gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 573

 Score =  615 bits (1585), Expect = e-173
 Identities = 318/479 (66%), Positives = 373/479 (77%)
 Frame = -3

Query: 1928 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1749
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1748 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1569
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1568 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1389
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1388 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1209
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1208 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1029
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1028 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 849
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 848  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 669
            +E+A+AL  LYK EGYIL E +G++C                  V YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC------------------VNYLYWLERNAGIEAS 513

Query: 668  TIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 492
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 514  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 572


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  610 bits (1573), Expect = e-172
 Identities = 316/490 (64%), Positives = 368/490 (75%), Gaps = 5/490 (1%)
 Frame = -3

Query: 1940 IFSSSAARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1776
            +  +  A+AKT   S   +  LF     E++GGK  +LVPD + D F+  +         
Sbjct: 104  LLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGFLSELSS 163

Query: 1775 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1596
                       VW+KC D+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ
Sbjct: 164  LKVPSQLATF-VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQ 222

Query: 1595 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1416
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAA
Sbjct: 223  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAA 282

Query: 1415 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1236
            FG+E+ TPAFP  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QG
Sbjct: 283  FGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 342

Query: 1235 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1056
            M S+ IGI LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLV
Sbjct: 343  MASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLV 402

Query: 1055 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 876
            FS+YLLSG  P VKEVNDEEP FPA P+L     ++ Q  VLS++AKDAA+ I+ RLQLG
Sbjct: 403  FSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLG 462

Query: 875  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 696
            SKLS+IV S+E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWL
Sbjct: 463  SKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWL 522

Query: 695  ERNAGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            E+NAGI      +D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRIRI
Sbjct: 523  EKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRI 582

Query: 515  GNQIASPAVS 486
            G+   S +VS
Sbjct: 583  GDTPPSNSVS 592


>gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  608 bits (1567), Expect = e-171
 Identities = 314/482 (65%), Positives = 365/482 (75%), Gaps = 3/482 (0%)
 Frame = -3

Query: 1922 ARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 1752
            A AKT   S ++E   E ++E++GGK   LVPD + D F+                    
Sbjct: 112  ANAKTWSSSSDNELLSEPVWEVKGGKWTRLVPDPTNDVFVSAH--PGLLAELQSLKPSQF 169

Query: 1751 XGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1572
               VW+KCRD+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAVGL
Sbjct: 170  ATFVWLKCRDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGL 229

Query: 1571 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1392
            GKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TP
Sbjct: 230  GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTP 289

Query: 1391 AFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1212
            AFP  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI
Sbjct: 290  AFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGI 349

Query: 1211 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1032
             LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 350  GLGIGLGNCIGSSTPLVLASFIVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 409

Query: 1031 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 852
              P VK+VNDEEP FPA P+L     ++ +   LS++AKDAA+ I+RRLQLGSKLS+IV 
Sbjct: 410  QAPPVKDVNDEEPLFPAVPILNATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVN 469

Query: 851  SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 672
             +E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWLE+NAGI  
Sbjct: 470  GKEDVLALFRLYKKEGYILSEHMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGG 529

Query: 671  TTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 492
                +D RPG RL  SL+YV+REFNH+KNDGE  GWV DGLIARP PNRIRIG+  +S +
Sbjct: 530  RGTLNDSRPGGRLHTSLDYVEREFNHLKNDGESVGWVTDGLIARPLPNRIRIGDTTSSNS 589

Query: 491  VS 486
            VS
Sbjct: 590  VS 591


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  605 bits (1561), Expect = e-170
 Identities = 306/460 (66%), Positives = 362/460 (78%)
 Frame = -3

Query: 1898 SRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDL 1719
            S    E ++E++GGK  +L PD+ +D F+                       + ++C+ L
Sbjct: 133  SEEDAESVWEVKGGKWTKLAPDFVRDAFVADGG---------GGLGSISFESLGLQCKSL 183

Query: 1718 AMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAV 1539
             + +MLPEGFP+SVTSDYL+YSLWR VQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 184  FVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQALLYAVGLGKGAIPTAAAL 243

Query: 1538 NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXX 1359
            NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME+LTP FP+ F+LI  
Sbjct: 244  NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEMLTPVFPNHFLLIGA 303

Query: 1358 XXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQ 1179
                          ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + 
Sbjct: 304  AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANQIG 363

Query: 1178 SSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDE 999
            SST+L LASF ++T IHMFCNLKSYQ+IQLRTLNPYRASLVFS+YLLSG  P VK+VN+E
Sbjct: 364  SSTSLGLASFSLVTCIHMFCNLKSYQAIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNEE 423

Query: 998  EPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLEL 819
            EP FPA P L  KP ++ Q  VLS++AKDAA+ I++RLQLG KLSD++ ++E+  AL  L
Sbjct: 424  EPLFPAVPFLNWKPANKGQPTVLSSEAKDAAAEIEQRLQLGCKLSDLINNKEDVHALFNL 483

Query: 818  YKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIFDDCRPGS 639
            YK EGYIL E +GRYCVVLKE+SS QDMLK+LF V YLYWLE+NAGI++     DCRPG 
Sbjct: 484  YKEEGYILTEHRGRYCVVLKETSSLQDMLKALFHVNYLYWLEKNAGIEAKGTSIDCRPGG 543

Query: 638  RLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIR 519
            RL++SL+YV+REF+ +K DGE  GWV DGLIARP+PNRIR
Sbjct: 544  RLEMSLDYVRREFDIIKTDGESVGWVTDGLIARPAPNRIR 583


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  605 bits (1559), Expect = e-170
 Identities = 315/482 (65%), Positives = 371/482 (76%), Gaps = 1/482 (0%)
 Frame = -3

Query: 1937 FSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1758
            F S  ARA++   S      ++E++GGK + LVP+   D F++                 
Sbjct: 107  FCSRLARAQSLSSS------VWEVKGGKWILLVPNDLDDTFVVDS-----LFPSTSSTRP 155

Query: 1757 XXXGDVWM-KCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYA 1581
                ++W+ KCR L M +MLPEG+PESVTSDYL+YSLWR VQG+A+QIS VLATQ+LLYA
Sbjct: 156  VSPLNLWLEKCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYA 215

Query: 1580 VGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEI 1401
            VGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG E+
Sbjct: 216  VGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEM 275

Query: 1400 LTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKS 1221
            LTPAFPHLFV I                ATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKS
Sbjct: 276  LTPAFPHLFVPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKS 335

Query: 1220 IGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYL 1041
            IGI +GI LAN + +ST L+LASF V+T+IHM+CNLKSYQSIQLRTLNPYRASLVFS+YL
Sbjct: 336  IGIAMGIGLANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYL 395

Query: 1040 LSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSD 861
            LSG  P +KEVNDE+P FPA P+L +KP ++EQ  VLS +AK AA+ ID RL LGSKLSD
Sbjct: 396  LSGQAPPIKEVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSD 455

Query: 860  IVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAG 681
            +V + ++ +AL +LY+ EGYIL E  GR+CVVLKE+ S  DMLK++F V YLYWLE+NAG
Sbjct: 456  VVNNHKDVLALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAG 515

Query: 680  IKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIA 501
            I   + + D +PG RLQISL+YV+REFNHVK DGE AGW  DGLIARP PNRIR G  +A
Sbjct: 516  IDGASPYLDSKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPG-FVA 574

Query: 500  SP 495
            SP
Sbjct: 575  SP 576


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  603 bits (1554), Expect = e-169
 Identities = 310/487 (63%), Positives = 374/487 (76%), Gaps = 4/487 (0%)
 Frame = -3

Query: 1934 SSSAARAKTDDLSRNHE-EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1758
            +S+ A+A+  D   + E E ++E+RG KR  LVPD+ KDEF+  E               
Sbjct: 178  ASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEE-------AAFELSSS 230

Query: 1757 XXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1578
                ++  +CR L    +LPEG+P SVTSDYL+YSLWRGVQGIA+QISGVLATQ+LLYAV
Sbjct: 231  LTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAV 290

Query: 1577 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1398
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGME+L
Sbjct: 291  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 350

Query: 1397 TPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1218
            TP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+
Sbjct: 351  TPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSM 410

Query: 1217 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1038
            GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLVFS+YL+
Sbjct: 411  GILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 470

Query: 1037 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 858
            SG  P +KEVNDEEP FPA   L +K   + Q  VLS++AK AA+ I+ RLQLGSKLSD+
Sbjct: 471  SGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEERLQLGSKLSDV 530

Query: 857  VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 678
            + ++EEAIAL +LY+ EGYIL E +GR+CV+LKESSS QDML+SLFQV YLYWLE+NAGI
Sbjct: 531  IHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGI 590

Query: 677  KSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG---NQ 507
            +  + + DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+G     
Sbjct: 591  EPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGYDSEP 650

Query: 506  IASPAVS 486
            ++SP+ S
Sbjct: 651  LSSPSSS 657


>ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  602 bits (1552), Expect = e-169
 Identities = 314/484 (64%), Positives = 363/484 (75%), Gaps = 5/484 (1%)
 Frame = -3

Query: 1922 ARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1758
            A+AKT   S + +  LF     E++GGK  +LVPD + D F+  +               
Sbjct: 110  AKAKTLSSSSSSDTSLFSEPVYEVKGGKWTKLVPDPTDDVFVSAQQGFLSELSSLKPSQL 169

Query: 1757 XXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1578
                 VW+KC D+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAV
Sbjct: 170  ATF--VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAV 227

Query: 1577 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1398
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ 
Sbjct: 228  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMS 287

Query: 1397 TPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1218
            TPA P  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ I
Sbjct: 288  TPACPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFI 347

Query: 1217 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1038
            GI+LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLL
Sbjct: 348  GIVLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLL 407

Query: 1037 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 858
            SG  P VKEVNDEEP FPA P+L     S+ Q   LS++AKDAA+ I+ RLQLGSKLS+I
Sbjct: 408  SGQAPPVKEVNDEEPLFPAVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEI 467

Query: 857  VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 678
            V S+E+ +AL  LYK EGYIL+E  G+Y VVLKE  S  DMLK+LFQV YLYWLE+NAGI
Sbjct: 468  VNSKEDVLALFGLYKNEGYILSEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGI 527

Query: 677  KSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIAS 498
            +     +D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRI IG+   S
Sbjct: 528  EGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRICIGDTAPS 587

Query: 497  PAVS 486
             +VS
Sbjct: 588  NSVS 591


>ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  602 bits (1551), Expect = e-169
 Identities = 307/482 (63%), Positives = 371/482 (76%), Gaps = 3/482 (0%)
 Frame = -3

Query: 1946 YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1776
            +F  S+++A AK  +   N +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 122  HFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 181

Query: 1775 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1596
                      ++  +CR+L    +LPEGFP SVTSDYL+YSLWRGVQGIA+QISGVLATQ
Sbjct: 182  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQ 234

Query: 1595 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1416
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 235  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 294

Query: 1415 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1236
            FGME+LTP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 295  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 354

Query: 1235 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1056
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 355  MVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 414

Query: 1055 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 876
            FS+YL+SG  P +KEVNDEEP FP      +K   + Q  VLS++AK AA+ I+ RLQLG
Sbjct: 415  FSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLG 474

Query: 875  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 696
            SKLSD++ ++EEAIAL +LY+ EGYIL E KGR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 475  SKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWL 534

Query: 695  ERNAGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            E+NAGI+  + + DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+
Sbjct: 535  EKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRL 594

Query: 515  GN 510
            G+
Sbjct: 595  GH 596


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  601 bits (1550), Expect = e-169
 Identities = 308/493 (62%), Positives = 377/493 (76%), Gaps = 6/493 (1%)
 Frame = -3

Query: 1946 YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1776
            +F  S+++A AK  D   + +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 128  HFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 187

Query: 1775 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1596
                      ++  +CR+L    +LPEGFP SVTSDYL+YSLWRGVQGIA+Q+SGVLATQ
Sbjct: 188  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQ 240

Query: 1595 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1416
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 241  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 300

Query: 1415 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1236
            FGME+LTP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 301  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 360

Query: 1235 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1056
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 361  MVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 420

Query: 1055 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 876
            FS+YL+SG  P +KEVNDEEP FP    L +K   + Q  VLS++AK AA  I+ RLQLG
Sbjct: 421  FSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLG 480

Query: 875  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 696
            SKLSD++ ++EEAIAL +LY+ EGYIL E +GR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 481  SKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYLYWL 540

Query: 695  ERNAGIKSTTIFDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            E+NAGI+  + + DC+PG RL ISL+YV+REF H K D +  GWV +GLIARP P RIR+
Sbjct: 541  EKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTRIRL 600

Query: 515  GNQ---IASPAVS 486
            G+    ++SP+ S
Sbjct: 601  GHDREPLSSPSSS 613


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  598 bits (1542), Expect = e-168
 Identities = 301/459 (65%), Positives = 354/459 (77%)
 Frame = -3

Query: 1886 EEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDLAMSV 1707
            ++ ++E++GG  ++L PD+ KD FI                       ++ KC++  + +
Sbjct: 132  KQPIWEVKGGNFIKLFPDHLKDIFIASNPTFFSELSSLNVSQVPSF--LYTKCKEFTVRL 189

Query: 1706 MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1527
            MLPEGFP SVTSDYLEYSLWRGVQG+A Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 190  MLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVL 249

Query: 1526 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1347
            KDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHLFV I      
Sbjct: 250  KDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGA 309

Query: 1346 XXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1167
                      +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SST 
Sbjct: 310  SRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTP 369

Query: 1166 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 987
            L LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKEVNDEEP F
Sbjct: 370  LVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 429

Query: 986  PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 807
            PA P+L     ++ Q  VLS++AKDAA  I+ RLQLGSKLS+I+ ++EE +AL  LYK E
Sbjct: 430  PALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNE 489

Query: 806  GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIFDDCRPGSRLQI 627
            GYIL+E  G++CVVLKE+ S  DMLK+LFQV YLYWLE+NAGI+      DC+PG RL+I
Sbjct: 490  GYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRI 549

Query: 626  SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 510
            SLEY +REFNH +NDGE AGW+ DGLIARP PNRIR GN
Sbjct: 550  SLEYAEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  597 bits (1538), Expect = e-167
 Identities = 303/405 (74%), Positives = 338/405 (83%)
 Frame = -3

Query: 1706 MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1527
            MLP+GFP SVTSDYL+YSLWR VQGIA+QISGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 1526 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1347
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAAFG+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 1346 XXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1167
                      ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + SST 
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 1166 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 987
            L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKE+NDEEP F
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 986  PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 807
            PA P L +      Q  VLS++A++AA+ I++RLQLGSKLSD+V ++++ +AL  LY+ E
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 806  GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIFDDCRPGSRLQI 627
            GYIL E KGR+CVVLKESSS  DMLKSLFQV YLYWLERNAGI++ +I  DCRP  RLQI
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 626  SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 492
            SLEY +REFNHVKND    GWV DGLIARPSP R+  GN  +S A
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGNIASSIA 405


>ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda]
            gi|548831916|gb|ERM94718.1| hypothetical protein
            AMTR_s00011p00244680 [Amborella trichopoda]
          Length = 565

 Score =  595 bits (1534), Expect = e-167
 Identities = 301/461 (65%), Positives = 353/461 (76%)
 Frame = -3

Query: 1898 SRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDL 1719
            S+  E V +E++GGK   +  D SKDE      +                   W+ CR+L
Sbjct: 101  SKPGEVVAWEVKGGKWSPVYADSSKDELFADNALRLLSSGVLDLGKILGSS--WLWCREL 158

Query: 1718 AMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAV 1539
            A+ +MLPEG+P SV+SDYLEYSLWR VQG+A+QI+GVL TQALLYAVGLGKGAIPTAAAV
Sbjct: 159  AVRLMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGLGKGAIPTAAAV 218

Query: 1538 NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXX 1359
            NWVLKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPA+P  FVLI  
Sbjct: 219  NWVLKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTPAYPQFFVLIGA 278

Query: 1358 XXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQ 1179
                          ATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN + 
Sbjct: 279  AAGAGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANHIG 338

Query: 1178 SSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDE 999
            +S  L+ ASFGV+T +HMFCNLKSYQSIQLRTLNPYR SLVFS+YLLSG VP VKEVNDE
Sbjct: 339  ASGPLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSGEVPPVKEVNDE 398

Query: 998  EPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLEL 819
            EP F     L + P    Q +VLS +AK+AA+ I+ RLQLG KLSD+V  +E+ +AL +L
Sbjct: 399  EPLFSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVSKKEDVLALFDL 458

Query: 818  YKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIFDDCRPGS 639
            ++ EGYIL E KG+YCVVLKE  S QDMLKSLFQV YLYWLERNAGI S +   DC+PG 
Sbjct: 459  FEKEGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDSRSASTDCKPGG 518

Query: 638  RLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 516
            ++Q+S +YVQREFNHVKND + AGW+ DGLIARP P R+R+
Sbjct: 519  KMQLSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRVRV 559

