BLASTX nr result

ID: Rehmannia24_contig00012430 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00012430
         (2401 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   671   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   670   0.0  
gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma caca...   653   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   640   e-180
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              639   e-180
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   633   e-178
gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]    624   e-176
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   619   e-174
gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]    615   e-173
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   610   e-172
gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus...   608   e-171
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   605   e-170
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     603   e-169
ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786...   602   e-169
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   601   e-169
ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thalia...   600   e-169
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   600   e-168
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   598   e-168
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   597   e-167
ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A...   595   e-167

>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  671 bits (1730), Expect = 0.0
 Identities = 355/520 (68%), Positives = 404/520 (77%), Gaps = 3/520 (0%)
 Frame = -2

Query: 2100 RRPSFLLPFHFIFSQEE---DSYSICLPKHIYXXXXXXXXXLGYFIFSSSAARAKTDDLS 1930
            RR   LLP   IF  E+   DS   C P  ++             +  +S  +AKT    
Sbjct: 92   RRSLLLLP---IFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLL--ASFVQAKT---- 142

Query: 1929 RNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDLA 1750
             N+ E++ EIRGGKR ELVPDYSKDEF++ + M                 ++WM+C++L 
Sbjct: 143  -NNGEIVHEIRGGKRFELVPDYSKDEFVLTKTMWSRLLPDSKSGSFVS--NLWMQCKELT 199

Query: 1749 MSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVN 1570
             +++LPEGFP+SVTSDYLEY+LWRGVQG+AAQISGVLATQALLYAVGLGKGAIPTAAAVN
Sbjct: 200  TTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAVN 259

Query: 1569 WVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXX 1390
            WVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAFPHLFV I   
Sbjct: 260  WVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAV 319

Query: 1389 XXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQS 1210
                         ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGIMLGIALAN  +S
Sbjct: 320  AGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANCTRS 379

Query: 1209 STALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEE 1030
            ST+L+LASFGV+TWIHMFCNLKSY SIQLRTLNPYRASLVFS+YLLSGLVPSVKEVNDEE
Sbjct: 380  STSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLVPSVKEVNDEE 439

Query: 1029 PFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELY 850
            P FPA  +L LK   E Q+EVLS  AK AA+ I RRLQLGSKLSD+  SRE+ +AL ELY
Sbjct: 440  PLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSREDVLALFELY 498

Query: 849  KAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSR 670
            K EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE  AGIKS+++ +DCRPG R
Sbjct: 499  KNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSSVANDCRPGGR 558

Query: 669  LQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            LQ+SLEYV+REFNHVK DGE+AGWV D LIARP PNRIR+
Sbjct: 559  LQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRL 598


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  670 bits (1728), Expect = 0.0
 Identities = 344/472 (72%), Positives = 388/472 (82%)
 Frame = -2

Query: 1965 SSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 1786
            +S  +AKT     N+ E+++EIRGGKR ELVPDYSKDEF++ + M               
Sbjct: 132  ASFVQAKT-----NNGEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWPDSTSGSFVS 186

Query: 1785 XGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1606
              ++WM+C++L  ++ LPEGFPESVTSDYLEY+LWRGVQGIAAQISGVLATQALLYAVGL
Sbjct: 187  --NLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGL 244

Query: 1605 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1426
            GKGAIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTP
Sbjct: 245  GKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTP 304

Query: 1425 AFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1246
            AFPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGI
Sbjct: 305  AFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGI 364

Query: 1245 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1066
            MLGIALAN  +SST+L+LASFGV+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 365  MLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 424

Query: 1065 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 886
            LVPSVKEVNDEEP FPA  +L LK   E Q EVLS  AK AA+ I RRLQLGSKLSD+  
Sbjct: 425  LVPSVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVAT 483

Query: 885  SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 706
            S+E+ +AL ELYK EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE NAGIKS
Sbjct: 484  SQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKS 543

Query: 705  TTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            +++ +DCRPG RLQ+SLEYV+REFNHVK DGE+AGWV D LIARP P RIR+
Sbjct: 544  SSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRL 595


>gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  653 bits (1685), Expect = 0.0
 Identities = 334/479 (69%), Positives = 389/479 (81%)
 Frame = -2

Query: 1962 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1783
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1782 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1603
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1602 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1423
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1422 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1243
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1242 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1063
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1062 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 883
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 882  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 703
            +E+A+AL  LYK EGYIL E +G++CVVLKESS  QDMLKSLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERNAGIEAS 531

Query: 702  TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 526
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 532  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 590


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  640 bits (1650), Expect = e-180
 Identities = 327/480 (68%), Positives = 379/480 (78%)
 Frame = -2

Query: 1980 YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 1801
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 218  FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 268

Query: 1800 XXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1621
                   ++W++C++L + +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 269  PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 328

Query: 1620 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1441
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 329  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 388

Query: 1440 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1261
            EILTPAFPH F+LI                +TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 389  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 448

Query: 1260 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1081
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 449  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 508

Query: 1080 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 901
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 509  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 568

Query: 900  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 721
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 569  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 628

Query: 720  AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 541
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G++
Sbjct: 629  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  639 bits (1649), Expect = e-180
 Identities = 327/479 (68%), Positives = 378/479 (78%)
 Frame = -2

Query: 1980 YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 1801
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 16   FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 66

Query: 1800 XXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1621
                   ++W++C++L + +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 67   PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 126

Query: 1620 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1441
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 127  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 186

Query: 1440 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1261
            EILTPAFPH F+LI                +TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 187  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 246

Query: 1260 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1081
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 247  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 306

Query: 1080 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 901
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 307  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 366

Query: 900  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 721
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 367  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 426

Query: 720  AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 544
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G+
Sbjct: 427  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 485


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  633 bits (1632), Expect = e-178
 Identities = 324/487 (66%), Positives = 379/487 (77%), Gaps = 4/487 (0%)
 Frame = -2

Query: 1968 SSSAARAKT---DDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXX 1798
            S+S+A A+T   +      E+ ++ ++G KR+ L+PD+ KDEF++  ++           
Sbjct: 57   SASSAFARTTLKEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLPSSYDDIISSS 116

Query: 1797 XXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLY 1618
                   +W++CR L + +MLPEG+P SVTSDYL+YSLWRGVQG+A+QISGVLATQALLY
Sbjct: 117  WLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLY 176

Query: 1617 AVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME 1438
            A+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+E
Sbjct: 177  AIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLE 236

Query: 1437 ILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 1258
            ILTPAFPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK
Sbjct: 237  ILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 296

Query: 1257 SIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQY 1078
             IGIMLGI LAN + SS  L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+Y
Sbjct: 297  FIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEY 356

Query: 1077 LLSGLVPSVKEVNDEEPFFPA-FPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 901
            LLSG  P +K+VNDEEP FPA FP    K   +  + VLS +A+DAA+ I+RRLQLGSKL
Sbjct: 357  LLSGQAPPIKDVNDEEPLFPAVFP--HFKSADKPSLVVLSLEARDAATEIERRLQLGSKL 414

Query: 900  SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 721
            SD+V S+E+ +AL  LYK EGYIL E KGR+CVVLKES SAQDMLK+LFQV YLYWLERN
Sbjct: 415  SDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERN 474

Query: 720  AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 541
            AG+ +     DCR G RLQ+SLEY+QREF+HV+ND    GWV DGLIARP PNRI  G+ 
Sbjct: 475  AGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGDL 534

Query: 540  IASPAVS 520
            +AS  VS
Sbjct: 535  VASSIVS 541


>gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 577

 Score =  624 bits (1608), Expect = e-176
 Identities = 322/479 (67%), Positives = 377/479 (78%)
 Frame = -2

Query: 1962 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1783
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1782 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1603
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1602 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1423
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1422 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1243
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1242 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1063
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1062 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 883
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 882  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 703
            +E+A+AL  LYK EGYIL E +G++C              SLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWLERNAGIEAS 517

Query: 702  TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 526
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 518  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 576


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  619 bits (1597), Expect = e-174
 Identities = 312/470 (66%), Positives = 367/470 (78%)
 Frame = -2

Query: 1956 ARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGD 1777
            A +  DD ++ ++ V +E++G KR +L+PD++KD F++                      
Sbjct: 116  ATSSEDDGNKEYDAV-WEVKGSKRTKLIPDFTKDAFVVAS------ASNASLSSLLSVNK 168

Query: 1776 VWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKG 1597
            +W +CR+L +  MLPEGFP+SVTSDYL YSLWR VQG+A+QISGVLATQALLYA+GLGKG
Sbjct: 169  LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 228

Query: 1596 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1417
            AIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+LTPAFP
Sbjct: 229  AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 288

Query: 1416 HLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1237
            H FV I                +TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IGIMLG
Sbjct: 289  HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 348

Query: 1236 IALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVP 1057
            IALAN + SS   +LASF V+TWIHM+CNLKSYQSI+LRTLNPYRASLVFS+YLLSG  P
Sbjct: 349  IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 408

Query: 1056 SVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSRE 877
             VKEVNDEEP FPAF    +K  ++ Q+ VLS++AKDAA  I+ RLQLGSKLSD+V ++E
Sbjct: 409  PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 468

Query: 876  EAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTI 697
            +A AL  LY+ EGYIL E  G++CVVLKES+  QDMLKSLFQ  YLYWLERNAGI +T+ 
Sbjct: 469  DAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATST 528

Query: 696  VDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG 547
              DC PG RL+ISL+YVQREFNHVK+D    GWV DGLIARP PNRIR G
Sbjct: 529  SADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 578


>gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 573

 Score =  615 bits (1585), Expect = e-173
 Identities = 318/479 (66%), Positives = 373/479 (77%)
 Frame = -2

Query: 1962 SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 1783
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 1782 GDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1603
              VW +CRD+ M ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1602 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1423
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1422 FPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1243
            FPHLFV I                ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1242 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1063
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1062 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 883
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 882  REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 703
            +E+A+AL  LYK EGYIL E +G++C                  V YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC------------------VNYLYWLERNAGIEAS 513

Query: 702  TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 526
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 514  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 572


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  610 bits (1574), Expect = e-172
 Identities = 316/490 (64%), Positives = 369/490 (75%), Gaps = 5/490 (1%)
 Frame = -2

Query: 1974 IFSSSAARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1810
            +  +  A+AKT   S   +  LF     E++GGK  +LVPD + D F+  +         
Sbjct: 104  LLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGFLSELSS 163

Query: 1809 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1630
                       VW+KC D+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ
Sbjct: 164  LKVPSQLATF-VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQ 222

Query: 1629 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1450
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAA
Sbjct: 223  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAA 282

Query: 1449 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1270
            FG+E+ TPAFP  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QG
Sbjct: 283  FGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 342

Query: 1269 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1090
            M S+ IGI LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLV
Sbjct: 343  MASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLV 402

Query: 1089 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 910
            FS+YLLSG  P VKEVNDEEP FPA P+L     ++ Q  VLS++AKDAA+ I+ RLQLG
Sbjct: 403  FSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLG 462

Query: 909  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 730
            SKLS+IV S+E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWL
Sbjct: 463  SKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWL 522

Query: 729  ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            E+NAGI     ++D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRIRI
Sbjct: 523  EKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRI 582

Query: 549  GNQIASPAVS 520
            G+   S +VS
Sbjct: 583  GDTPPSNSVS 592


>gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  608 bits (1568), Expect = e-171
 Identities = 314/482 (65%), Positives = 366/482 (75%), Gaps = 3/482 (0%)
 Frame = -2

Query: 1956 ARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 1786
            A AKT   S ++E   E ++E++GGK   LVPD + D F+                    
Sbjct: 112  ANAKTWSSSSDNELLSEPVWEVKGGKWTRLVPDPTNDVFVSAH--PGLLAELQSLKPSQF 169

Query: 1785 XGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1606
               VW+KCRD+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAVGL
Sbjct: 170  ATFVWLKCRDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGL 229

Query: 1605 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1426
            GKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TP
Sbjct: 230  GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTP 289

Query: 1425 AFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1246
            AFP  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI
Sbjct: 290  AFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGI 349

Query: 1245 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1066
             LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 350  GLGIGLGNCIGSSTPLVLASFIVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 409

Query: 1065 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 886
              P VK+VNDEEP FPA P+L     ++ +   LS++AKDAA+ I+RRLQLGSKLS+IV 
Sbjct: 410  QAPPVKDVNDEEPLFPAVPILNATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVN 469

Query: 885  SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 706
             +E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWLE+NAGI  
Sbjct: 470  GKEDVLALFRLYKKEGYILSEHMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGG 529

Query: 705  TTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 526
               ++D RPG RL  SL+YV+REFNH+KNDGE  GWV DGLIARP PNRIRIG+  +S +
Sbjct: 530  RGTLNDSRPGGRLHTSLDYVEREFNHLKNDGESVGWVTDGLIARPLPNRIRIGDTTSSNS 589

Query: 525  VS 520
            VS
Sbjct: 590  VS 591


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  605 bits (1561), Expect = e-170
 Identities = 306/460 (66%), Positives = 362/460 (78%)
 Frame = -2

Query: 1932 SRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDL 1753
            S    E ++E++GGK  +L PD+ +D F+                       + ++C+ L
Sbjct: 133  SEEDAESVWEVKGGKWTKLAPDFVRDAFVADGG---------GGLGSISFESLGLQCKSL 183

Query: 1752 AMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAV 1573
             + +MLPEGFP+SVTSDYL+YSLWR VQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 184  FVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQALLYAVGLGKGAIPTAAAL 243

Query: 1572 NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXX 1393
            NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME+LTP FP+ F+LI  
Sbjct: 244  NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEMLTPVFPNHFLLIGA 303

Query: 1392 XXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQ 1213
                          ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + 
Sbjct: 304  AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANQIG 363

Query: 1212 SSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDE 1033
            SST+L LASF ++T IHMFCNLKSYQ+IQLRTLNPYRASLVFS+YLLSG  P VK+VN+E
Sbjct: 364  SSTSLGLASFSLVTCIHMFCNLKSYQAIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNEE 423

Query: 1032 EPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLEL 853
            EP FPA P L  KP ++ Q  VLS++AKDAA+ I++RLQLG KLSD++ ++E+  AL  L
Sbjct: 424  EPLFPAVPFLNWKPANKGQPTVLSSEAKDAAAEIEQRLQLGCKLSDLINNKEDVHALFNL 483

Query: 852  YKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGS 673
            YK EGYIL E +GRYCVVLKE+SS QDMLK+LF V YLYWLE+NAGI++     DCRPG 
Sbjct: 484  YKEEGYILTEHRGRYCVVLKETSSLQDMLKALFHVNYLYWLEKNAGIEAKGTSIDCRPGG 543

Query: 672  RLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIR 553
            RL++SL+YV+REF+ +K DGE  GWV DGLIARP+PNRIR
Sbjct: 544  RLEMSLDYVRREFDIIKTDGESVGWVTDGLIARPAPNRIR 583


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  603 bits (1555), Expect = e-169
 Identities = 315/482 (65%), Positives = 370/482 (76%), Gaps = 1/482 (0%)
 Frame = -2

Query: 1971 FSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1792
            F S  ARA++   S      ++E++GGK + LVP+   D F++                 
Sbjct: 107  FCSRLARAQSLSSS------VWEVKGGKWILLVPNDLDDTFVVDS-----LFPSTSSTRP 155

Query: 1791 XXXGDVWM-KCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYA 1615
                ++W+ KCR L M +MLPEG+PESVTSDYL+YSLWR VQG+A+QIS VLATQ+LLYA
Sbjct: 156  VSPLNLWLEKCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYA 215

Query: 1614 VGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEI 1435
            VGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG E+
Sbjct: 216  VGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEM 275

Query: 1434 LTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKS 1255
            LTPAFPHLFV I                ATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKS
Sbjct: 276  LTPAFPHLFVPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKS 335

Query: 1254 IGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYL 1075
            IGI +GI LAN + +ST L+LASF V+T+IHM+CNLKSYQSIQLRTLNPYRASLVFS+YL
Sbjct: 336  IGIAMGIGLANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYL 395

Query: 1074 LSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSD 895
            LSG  P +KEVNDE+P FPA P+L +KP ++EQ  VLS +AK AA+ ID RL LGSKLSD
Sbjct: 396  LSGQAPPIKEVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSD 455

Query: 894  IVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAG 715
            +V + ++ +AL +LY+ EGYIL E  GR+CVVLKE+ S  DMLK++F V YLYWLE+NAG
Sbjct: 456  VVNNHKDVLALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAG 515

Query: 714  IKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIA 535
            I   +   D +PG RLQISL+YV+REFNHVK DGE AGW  DGLIARP PNRIR G  +A
Sbjct: 516  IDGASPYLDSKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPG-FVA 574

Query: 534  SP 529
            SP
Sbjct: 575  SP 576


>ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  602 bits (1553), Expect = e-169
 Identities = 314/484 (64%), Positives = 364/484 (75%), Gaps = 5/484 (1%)
 Frame = -2

Query: 1956 ARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1792
            A+AKT   S + +  LF     E++GGK  +LVPD + D F+  +               
Sbjct: 110  AKAKTLSSSSSSDTSLFSEPVYEVKGGKWTKLVPDPTDDVFVSAQQGFLSELSSLKPSQL 169

Query: 1791 XXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1612
                 VW+KC D+   +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAV
Sbjct: 170  ATF--VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAV 227

Query: 1611 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1432
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ 
Sbjct: 228  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMS 287

Query: 1431 TPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1252
            TPA P  FVLI                +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ I
Sbjct: 288  TPACPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFI 347

Query: 1251 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1072
            GI+LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLL
Sbjct: 348  GIVLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLL 407

Query: 1071 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 892
            SG  P VKEVNDEEP FPA P+L     S+ Q   LS++AKDAA+ I+ RLQLGSKLS+I
Sbjct: 408  SGQAPPVKEVNDEEPLFPAVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEI 467

Query: 891  VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 712
            V S+E+ +AL  LYK EGYIL+E  G+Y VVLKE  S  DMLK+LFQV YLYWLE+NAGI
Sbjct: 468  VNSKEDVLALFGLYKNEGYILSEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGI 527

Query: 711  KSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIAS 532
            +    ++D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRI IG+   S
Sbjct: 528  EGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRICIGDTAPS 587

Query: 531  PAVS 520
             +VS
Sbjct: 588  NSVS 591


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  601 bits (1550), Expect = e-169
 Identities = 310/487 (63%), Positives = 373/487 (76%), Gaps = 4/487 (0%)
 Frame = -2

Query: 1968 SSSAARAKTDDLSRNHE-EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 1792
            +S+ A+A+  D   + E E ++E+RG KR  LVPD+ KDEF+  E               
Sbjct: 178  ASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEE-------AAFELSSS 230

Query: 1791 XXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1612
                ++  +CR L    +LPEG+P SVTSDYL+YSLWRGVQGIA+QISGVLATQ+LLYAV
Sbjct: 231  LTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAV 290

Query: 1611 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1432
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGME+L
Sbjct: 291  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 350

Query: 1431 TPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1252
            TP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+
Sbjct: 351  TPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSM 410

Query: 1251 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1072
            GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLVFS+YL+
Sbjct: 411  GILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 470

Query: 1071 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 892
            SG  P +KEVNDEEP FPA   L +K   + Q  VLS++AK AA+ I+ RLQLGSKLSD+
Sbjct: 471  SGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEERLQLGSKLSDV 530

Query: 891  VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 712
            + ++EEAIAL +LY+ EGYIL E +GR+CV+LKESSS QDML+SLFQV YLYWLE+NAGI
Sbjct: 531  IHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGI 590

Query: 711  KSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG---NQ 541
            +  +   DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+G     
Sbjct: 591  EPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGYDSEP 650

Query: 540  IASPAVS 520
            ++SP+ S
Sbjct: 651  LSSPSSS 657


>ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  600 bits (1547), Expect = e-169
 Identities = 307/482 (63%), Positives = 370/482 (76%), Gaps = 3/482 (0%)
 Frame = -2

Query: 1980 YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1810
            +F  S+++A AK  +   N +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 122  HFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 181

Query: 1809 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1630
                      ++  +CR+L    +LPEGFP SVTSDYL+YSLWRGVQGIA+QISGVLATQ
Sbjct: 182  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQ 234

Query: 1629 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1450
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 235  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 294

Query: 1449 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1270
            FGME+LTP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 295  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 354

Query: 1269 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1090
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 355  MVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 414

Query: 1089 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 910
            FS+YL+SG  P +KEVNDEEP FP      +K   + Q  VLS++AK AA+ I+ RLQLG
Sbjct: 415  FSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLG 474

Query: 909  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 730
            SKLSD++ ++EEAIAL +LY+ EGYIL E KGR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 475  SKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWL 534

Query: 729  ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            E+NAGI+  +   DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+
Sbjct: 535  EKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRL 594

Query: 549  GN 544
            G+
Sbjct: 595  GH 596


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  600 bits (1546), Expect = e-168
 Identities = 308/493 (62%), Positives = 376/493 (76%), Gaps = 6/493 (1%)
 Frame = -2

Query: 1980 YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 1810
            +F  S+++A AK  D   + +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 128  HFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 187

Query: 1809 XXXXXXXXXGDVWMKCRDLAMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1630
                      ++  +CR+L    +LPEGFP SVTSDYL+YSLWRGVQGIA+Q+SGVLATQ
Sbjct: 188  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQ 240

Query: 1629 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1450
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 241  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 300

Query: 1449 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQG 1270
            FGME+LTP FP  FV+I                ATRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 301  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 360

Query: 1269 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1090
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 361  MVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 420

Query: 1089 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 910
            FS+YL+SG  P +KEVNDEEP FP    L +K   + Q  VLS++AK AA  I+ RLQLG
Sbjct: 421  FSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLG 480

Query: 909  SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 730
            SKLSD++ ++EEAIAL +LY+ EGYIL E +GR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 481  SKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYLYWL 540

Query: 729  ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            E+NAGI+  +   DC+PG RL ISL+YV+REF H K D +  GWV +GLIARP P RIR+
Sbjct: 541  EKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTRIRL 600

Query: 549  GNQ---IASPAVS 520
            G+    ++SP+ S
Sbjct: 601  GHDREPLSSPSSS 613


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  598 bits (1543), Expect = e-168
 Identities = 301/459 (65%), Positives = 355/459 (77%)
 Frame = -2

Query: 1920 EEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDLAMSV 1741
            ++ ++E++GG  ++L PD+ KD FI                       ++ KC++  + +
Sbjct: 132  KQPIWEVKGGNFIKLFPDHLKDIFIASNPTFFSELSSLNVSQVPSF--LYTKCKEFTVRL 189

Query: 1740 MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1561
            MLPEGFP SVTSDYLEYSLWRGVQG+A Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 190  MLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVL 249

Query: 1560 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1381
            KDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHLFV I      
Sbjct: 250  KDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGA 309

Query: 1380 XXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1201
                      +TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SST 
Sbjct: 310  SRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTP 369

Query: 1200 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 1021
            L LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKEVNDEEP F
Sbjct: 370  LVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 429

Query: 1020 PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 841
            PA P+L     ++ Q  VLS++AKDAA  I+ RLQLGSKLS+I+ ++EE +AL  LYK E
Sbjct: 430  PALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNE 489

Query: 840  GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSRLQI 661
            GYIL+E  G++CVVLKE+ S  DMLK+LFQV YLYWLE+NAGI+    + DC+PG RL+I
Sbjct: 490  GYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRI 549

Query: 660  SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 544
            SLEY +REFNH +NDGE AGW+ DGLIARP PNRIR GN
Sbjct: 550  SLEYAEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  597 bits (1538), Expect = e-167
 Identities = 303/405 (74%), Positives = 338/405 (83%)
 Frame = -2

Query: 1740 MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1561
            MLP+GFP SVTSDYL+YSLWR VQGIA+QISGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 1560 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1381
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAAFG+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 1380 XXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1201
                      ATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + SST 
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 1200 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 1021
            L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKE+NDEEP F
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 1020 PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 841
            PA P L +      Q  VLS++A++AA+ I++RLQLGSKLSD+V ++++ +AL  LY+ E
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 840  GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSRLQI 661
            GYIL E KGR+CVVLKESSS  DMLKSLFQV YLYWLERNAGI++ +I  DCRP  RLQI
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 660  SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 526
            SLEY +REFNHVKND    GWV DGLIARPSP R+  GN  +S A
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGNIASSIA 405


>ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda]
            gi|548831916|gb|ERM94718.1| hypothetical protein
            AMTR_s00011p00244680 [Amborella trichopoda]
          Length = 565

 Score =  595 bits (1534), Expect = e-167
 Identities = 301/461 (65%), Positives = 353/461 (76%)
 Frame = -2

Query: 1932 SRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXGDVWMKCRDL 1753
            S+  E V +E++GGK   +  D SKDE      +                   W+ CR+L
Sbjct: 101  SKPGEVVAWEVKGGKWSPVYADSSKDELFADNALRLLSSGVLDLGKILGSS--WLWCREL 158

Query: 1752 AMSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAV 1573
            A+ +MLPEG+P SV+SDYLEYSLWR VQG+A+QI+GVL TQALLYAVGLGKGAIPTAAAV
Sbjct: 159  AVRLMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGLGKGAIPTAAAV 218

Query: 1572 NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXX 1393
            NWVLKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPA+P  FVLI  
Sbjct: 219  NWVLKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTPAYPQFFVLIGA 278

Query: 1392 XXXXXXXXXXXXXXATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQ 1213
                          ATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN + 
Sbjct: 279  AAGAGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANHIG 338

Query: 1212 SSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDE 1033
            +S  L+ ASFGV+T +HMFCNLKSYQSIQLRTLNPYR SLVFS+YLLSG VP VKEVNDE
Sbjct: 339  ASGPLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSGEVPPVKEVNDE 398

Query: 1032 EPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLEL 853
            EP F     L + P    Q +VLS +AK+AA+ I+ RLQLG KLSD+V  +E+ +AL +L
Sbjct: 399  EPLFSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVSKKEDVLALFDL 458

Query: 852  YKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGS 673
            ++ EGYIL E KG+YCVVLKE  S QDMLKSLFQV YLYWLERNAGI S +   DC+PG 
Sbjct: 459  FEKEGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDSRSASTDCKPGG 518

Query: 672  RLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 550
            ++Q+S +YVQREFNHVKND + AGW+ DGLIARP P R+R+
Sbjct: 519  KMQLSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRVRV 559


Top