BLASTX nr result

ID: Rehmannia25_contig00016655 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00016655
         (2676 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   673   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   672   0.0  
gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma caca...   651   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   639   e-180
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              638   e-180
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   632   e-178
gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]    621   e-175
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   619   e-174
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   613   e-172
gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]    612   e-172
gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus...   610   e-172
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   605   e-170
ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786...   605   e-170
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   603   e-170
ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thali...   602   e-169
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   602   e-169
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     601   e-169
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   598   e-168
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   597   e-167
ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   597   e-167

>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  673 bits (1736), Expect = 0.0
 Identities = 355/520 (68%), Positives = 404/520 (77%), Gaps = 3/520 (0%)
 Frame = +1

Query: 544  RRPSFLLPFHFIFSQEE---DSYSICLPKHIYXXXXXXXXXXGYFIFSSSAARAKTDDLS 714
            RR   LLP   IF  E+   DS   C P  ++             +  +S  +AKT    
Sbjct: 92   RRSLLLLP---IFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLL--ASFVQAKT---- 142

Query: 715  RNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXXDVWMKCRDLA 894
             N+ E++ EIRGGKR ELVPDYSKDEF++ + M                 ++WM+C++L 
Sbjct: 143  -NNGEIVHEIRGGKRFELVPDYSKDEFVLTKTMWSRLLPDSKSGSFVS--NLWMQCKELT 199

Query: 895  TSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVN 1074
            T+++LPEGFP+SVTSDYLEY+LWRGVQG+AAQISGVLATQALLYAVGLGKGAIPTAAAVN
Sbjct: 200  TTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAVN 259

Query: 1075 WVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXX 1254
            WVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAFPHLFV I   
Sbjct: 260  WVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFPHLFVPIGAV 319

Query: 1255 XXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQS 1434
                          TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGIMLGIALAN  +S
Sbjct: 320  AGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLGIALANCTRS 379

Query: 1435 STALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEE 1614
            ST+L+LASFGV+TWIHMFCNLKSY SIQLRTLNPYRASLVFS+YLLSGLVPSVKEVNDEE
Sbjct: 380  STSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLVPSVKEVNDEE 439

Query: 1615 PFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELY 1794
            P FPA  +L LK   E Q+EVLS  AK AA+ I RRLQLGSKLSD+  SRE+ +AL ELY
Sbjct: 440  PLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSREDVLALFELY 498

Query: 1795 KAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSR 1974
            K EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE  AGIKS+++ +DCRPG R
Sbjct: 499  KNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSSVANDCRPGGR 558

Query: 1975 LQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 2094
            LQ+SLEYV+REFNHVK DGE+AGWV D LIARP PNRIR+
Sbjct: 559  LQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRL 598


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  672 bits (1734), Expect = 0.0
 Identities = 344/472 (72%), Positives = 388/472 (82%)
 Frame = +1

Query: 679  SSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 858
            +S  +AKT     N+ E+++EIRGGKR ELVPDYSKDEF++ + M               
Sbjct: 132  ASFVQAKT-----NNGEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWPDSTSGSFVS 186

Query: 859  XXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1038
              ++WM+C++L T++ LPEGFPESVTSDYLEY+LWRGVQGIAAQISGVLATQALLYAVGL
Sbjct: 187  --NLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGL 244

Query: 1039 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1218
            GKGAIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTP
Sbjct: 245  GKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTP 304

Query: 1219 AFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1398
            AFPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGI
Sbjct: 305  AFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGI 364

Query: 1399 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1578
            MLGIALAN  +SST+L+LASFGV+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 365  MLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 424

Query: 1579 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 1758
            LVPSVKEVNDEEP FPA  +L LK   E Q EVLS  AK AA+ I RRLQLGSKLSD+  
Sbjct: 425  LVPSVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVAT 483

Query: 1759 SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 1938
            S+E+ +AL ELYK EGYIL E +GR+C+VLKESSS QDMLKSLF V YLYWLE NAGIKS
Sbjct: 484  SQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKS 543

Query: 1939 TTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 2094
            +++ +DCRPG RLQ+SLEYV+REFNHVK DGE+AGWV D LIARP P RIR+
Sbjct: 544  SSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRL 595


>gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  651 bits (1679), Expect = 0.0
 Identities = 332/479 (69%), Positives = 387/479 (80%)
 Frame = +1

Query: 682  SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 861
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 862  XDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1041
              VW +CRD+   ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1042 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1221
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1222 FPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1401
            FPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1402 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1581
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1582 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 1761
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 1762 REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 1941
            +E+A+AL  LYK EGYIL E +G++CVVLKESS  QDMLKSLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERNAGIEAS 531

Query: 1942 TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 2118
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 532  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 590


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  639 bits (1647), Expect = e-180
 Identities = 327/480 (68%), Positives = 377/480 (78%)
 Frame = +1

Query: 664  YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 843
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 218  FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 268

Query: 844  XXXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1023
                   ++W++C++L   +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 269  PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 328

Query: 1024 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1203
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 329  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 388

Query: 1204 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1383
            EILTPAFPH F+LI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 389  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 448

Query: 1384 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1563
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 449  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 508

Query: 1564 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 1743
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 509  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 568

Query: 1744 SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 1923
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 569  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 628

Query: 1924 AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 2103
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G++
Sbjct: 629  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  638 bits (1646), Expect = e-180
 Identities = 327/479 (68%), Positives = 376/479 (78%)
 Frame = +1

Query: 664  YFIFSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXX 843
            +F F    A +K  +     EE ++E+RGGK  +++PD SKDEF++              
Sbjct: 16   FFHFQLDTALSKEKE-----EEGVWEVRGGKWHKIIPDSSKDEFLV----VTPGIGAVGA 66

Query: 844  XXXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALL 1023
                   ++W++C++L   +MLPEGFP SVTSDYL+Y+LWRGVQG+A+QISGVLATQALL
Sbjct: 67   PKSSTLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALL 126

Query: 1024 YAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 1203
            YAVGLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+
Sbjct: 127  YAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGL 186

Query: 1204 EILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 1383
            EILTPAFPH F+LI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 187  EILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVS 246

Query: 1384 KSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQ 1563
            KSIGIMLGIALAN + SS  LS ASF V+T +HMFCNLKSYQSIQLRTLNPYRASLVFS+
Sbjct: 247  KSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSE 306

Query: 1564 YLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 1743
            YLLSG VPS+KEVN+EEP FP  PLL  KPT + Q  VLST+AKDAA+ I+RRLQLGSKL
Sbjct: 307  YLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKL 366

Query: 1744 SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 1923
            S++V S+E+ +AL +LY+ E YIL E KGR+ V+LKES S QDMLKS+F V YLYWLERN
Sbjct: 367  SEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERN 426

Query: 1924 AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 2100
            AGI S    DDCRPG RLQISLEYVQREFNH+KND E  GW  DGLIARP PNRIR G+
Sbjct: 427  AGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 485


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  632 bits (1631), Expect = e-178
 Identities = 323/487 (66%), Positives = 377/487 (77%), Gaps = 4/487 (0%)
 Frame = +1

Query: 676  SSSAARAKT---DDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXX 846
            S+S+A A+T   +      E+ ++ ++G KR+ L+PD+ KDEF++  ++           
Sbjct: 57   SASSAFARTTLKEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLPSSYDDIISSS 116

Query: 847  XXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLY 1026
                   +W++CR L   +MLPEG+P SVTSDYL+YSLWRGVQG+A+QISGVLATQALLY
Sbjct: 117  WLHFGRTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLY 176

Query: 1027 AVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME 1206
            A+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+E
Sbjct: 177  AIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLE 236

Query: 1207 ILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 1386
            ILTPAFPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK
Sbjct: 237  ILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 296

Query: 1387 SIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQY 1566
             IGIMLGI LAN + SS  L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+Y
Sbjct: 297  FIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEY 356

Query: 1567 LLSGLVPSVKEVNDEEPFFPA-FPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKL 1743
            LLSG  P +K+VNDEEP FPA FP    K   +  + VLS +A+DAA+ I+RRLQLGSKL
Sbjct: 357  LLSGQAPPIKDVNDEEPLFPAVFP--HFKSADKPSLVVLSLEARDAATEIERRLQLGSKL 414

Query: 1744 SDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERN 1923
            SD+V S+E+ +AL  LYK EGYIL E KGR+CVVLKES SAQDMLK+LFQV YLYWLERN
Sbjct: 415  SDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERN 474

Query: 1924 AGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQ 2103
            AG+ +     DCR G RLQ+SLEY+QREF+HV+ND    GWV DGLIARP PNRI  G+ 
Sbjct: 475  AGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGDL 534

Query: 2104 IASPAVS 2124
            +AS  VS
Sbjct: 535  VASSIVS 541


>gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 577

 Score =  621 bits (1602), Expect = e-175
 Identities = 320/479 (66%), Positives = 375/479 (78%)
 Frame = +1

Query: 682  SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 861
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 862  XDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1041
              VW +CRD+   ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1042 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1221
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1222 FPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1401
            FPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1402 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1581
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1582 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 1761
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 1762 REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 1941
            +E+A+AL  LYK EGYIL E +G++C              SLFQV YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWLERNAGIEAS 517

Query: 1942 TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 2118
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 518  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 576


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  619 bits (1596), Expect = e-174
 Identities = 312/470 (66%), Positives = 365/470 (77%)
 Frame = +1

Query: 688  ARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXXD 867
            A +  DD ++ ++ V +E++G KR +L+PD++KD F++                      
Sbjct: 116  ATSSEDDGNKEYDAV-WEVKGSKRTKLIPDFTKDAFVVAS------ASNASLSSLLSVNK 168

Query: 868  VWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKG 1047
            +W +CR+L    MLPEGFP+SVTSDYL YSLWR VQG+A+QISGVLATQALLYA+GLGKG
Sbjct: 169  LWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLGKG 228

Query: 1048 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1227
            AIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+LTPAFP
Sbjct: 229  AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPAFP 288

Query: 1228 HLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1407
            H FV I                 TRSCFYAGFAA+RNFAEVIAKGEAQGMVSK+IGIMLG
Sbjct: 289  HHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIMLG 348

Query: 1408 IALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVP 1587
            IALAN + SS   +LASF V+TWIHM+CNLKSYQSI+LRTLNPYRASLVFS+YLLSG  P
Sbjct: 349  IALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQAP 408

Query: 1588 SVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSRE 1767
             VKEVNDEEP FPAF    +K  ++ Q+ VLS++AKDAA  I+ RLQLGSKLSD+V ++E
Sbjct: 409  PVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNNKE 468

Query: 1768 EAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTI 1947
            +A AL  LY+ EGYIL E  G++CVVLKES+  QDMLKSLFQ  YLYWLERNAGI +T+ 
Sbjct: 469  DAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVATST 528

Query: 1948 VDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG 2097
              DC PG RL+ISL+YVQREFNHVK+D    GWV DGLIARP PNRIR G
Sbjct: 529  SADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 578


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  613 bits (1580), Expect = e-172
 Identities = 317/490 (64%), Positives = 369/490 (75%), Gaps = 5/490 (1%)
 Frame = +1

Query: 670  IFSSSAARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 834
            +  +  A+AKT   S   +  LF     E++GGK  +LVPD + D F+  +         
Sbjct: 104  LLHAKLAKAKTLSPSTTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGFLSELSS 163

Query: 835  XXXXXXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1014
                       VW+KC D+ T +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ
Sbjct: 164  LKVPSQLATF-VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQ 222

Query: 1015 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1194
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAA
Sbjct: 223  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAA 282

Query: 1195 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1374
            FG+E+ TPAFP  FVLI                 TRSCF+AGFAAQRNFAEVIAKGE QG
Sbjct: 283  FGLEMCTPAFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 342

Query: 1375 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1554
            M S+ IGI LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLV
Sbjct: 343  MASRFIGIGLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLV 402

Query: 1555 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 1734
            FS+YLLSG  P VKEVNDEEP FPA P+L     ++ Q  VLS++AKDAA+ I+ RLQLG
Sbjct: 403  FSEYLLSGQAPPVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLG 462

Query: 1735 SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 1914
            SKLS+IV S+E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWL
Sbjct: 463  SKLSEIVNSKEDVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWL 522

Query: 1915 ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 2094
            E+NAGI     ++D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRIRI
Sbjct: 523  EKNAGIGGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRI 582

Query: 2095 GNQIASPAVS 2124
            G+   S +VS
Sbjct: 583  GDTPPSNSVS 592


>gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 573

 Score =  612 bits (1579), Expect = e-172
 Identities = 316/479 (65%), Positives = 371/479 (77%)
 Frame = +1

Query: 682  SAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXX 861
            S+A A+T++ S+  ++V++E++G K  +L+PD+S+D F+    +                
Sbjct: 120  SSALARTNEDSQE-DDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNLTKSLSLST----- 173

Query: 862  XDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLG 1041
              VW +CRD+   ++LPEGFP+SVTSDYL+YSLWRGVQG+A+QISGVLATQALLYAVGLG
Sbjct: 174  --VWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAVGLG 231

Query: 1042 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1221
            KGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 232  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 291

Query: 1222 FPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1401
            FPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI+
Sbjct: 292  FPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIV 351

Query: 1402 LGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGL 1581
            LGIALAN V SST+L+LASFGV+TW+HM+CNLKSYQSIQLRTLN YRASLVFS+YLLSG 
Sbjct: 352  LGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSEYLLSGQ 411

Query: 1582 VPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKS 1761
             PS+KEVNDEEP FPA P L L   + E+  VLS++AK AA+ I+RRLQLGSKLSDIV +
Sbjct: 412  APSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKLSDIVNN 471

Query: 1762 REEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKST 1941
            +E+A+AL  LYK EGYIL E +G++C                  V YLYWLERNAGI+++
Sbjct: 472  KEDALALFSLYKDEGYILTEHEGKFC------------------VNYLYWLERNAGIEAS 513

Query: 1942 TIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 2118
                DCRPG RLQIS+EYVQREFNHVK D E  GWV DGLIARP PNRIR G++ AS A
Sbjct: 514  GASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHRDASTA 572


>gb|ESW30049.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  610 bits (1574), Expect = e-172
 Identities = 315/482 (65%), Positives = 366/482 (75%), Gaps = 3/482 (0%)
 Frame = +1

Query: 688  ARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXX 858
            A AKT   S ++E   E ++E++GGK   LVPD + D F+                    
Sbjct: 112  ANAKTWSSSSDNELLSEPVWEVKGGKWTRLVPDPTNDVFVSAH--PGLLAELQSLKPSQF 169

Query: 859  XXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGL 1038
               VW+KCRD+ T +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAVGL
Sbjct: 170  ATFVWLKCRDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGL 229

Query: 1039 GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 1218
            GKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TP
Sbjct: 230  GKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTP 289

Query: 1219 AFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 1398
            AFP  FVLI                 TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI
Sbjct: 290  AFPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGI 349

Query: 1399 MLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 1578
             LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 350  GLGIGLGNCIGSSTPLVLASFIVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 409

Query: 1579 LVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVK 1758
              P VK+VNDEEP FPA P+L     ++ +   LS++AKDAA+ I+RRLQLGSKLS+IV 
Sbjct: 410  QAPPVKDVNDEEPLFPAVPILNATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVN 469

Query: 1759 SREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKS 1938
             +E+ +AL  LYK EGYIL+E  G++CVVLKE+ S QDMLK+LFQV YLYWLE+NAGI  
Sbjct: 470  GKEDVLALFRLYKKEGYILSEHMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGG 529

Query: 1939 TTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 2118
               ++D RPG RL  SL+YV+REFNH+KNDGE  GWV DGLIARP PNRIRIG+  +S +
Sbjct: 530  RGTLNDSRPGGRLHTSLDYVEREFNHLKNDGESVGWVTDGLIARPLPNRIRIGDTTSSNS 589

Query: 2119 VS 2124
            VS
Sbjct: 590  VS 591


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  605 bits (1560), Expect = e-170
 Identities = 305/460 (66%), Positives = 360/460 (78%)
 Frame = +1

Query: 712  SRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXXDVWMKCRDL 891
            S    E ++E++GGK  +L PD+ +D F+                       + ++C+ L
Sbjct: 133  SEEDAESVWEVKGGKWTKLAPDFVRDAFVADGG---------GGLGSISFESLGLQCKSL 183

Query: 892  ATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAV 1071
               +MLPEGFP+SVTSDYL+YSLWR VQG+A+Q+SGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 184  FVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQALLYAVGLGKGAIPTAAAL 243

Query: 1072 NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXX 1251
            NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGME+LTP FP+ F+LI  
Sbjct: 244  NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEMLTPVFPNHFLLIGA 303

Query: 1252 XXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQ 1431
                           TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + 
Sbjct: 304  AAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANQIG 363

Query: 1432 SSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDE 1611
            SST+L LASF ++T IHMFCNLKSYQ+IQLRTLNPYRASLVFS+YLLSG  P VK+VN+E
Sbjct: 364  SSTSLGLASFSLVTCIHMFCNLKSYQAIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNEE 423

Query: 1612 EPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLEL 1791
            EP FPA P L  KP ++ Q  VLS++AKDAA+ I++RLQLG KLSD++ ++E+  AL  L
Sbjct: 424  EPLFPAVPFLNWKPANKGQPTVLSSEAKDAAAEIEQRLQLGCKLSDLINNKEDVHALFNL 483

Query: 1792 YKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGS 1971
            YK EGYIL E +GRYCVVLKE+SS QDMLK+LF V YLYWLE+NAGI++     DCRPG 
Sbjct: 484  YKEEGYILTEHRGRYCVVLKETSSLQDMLKALFHVNYLYWLEKNAGIEAKGTSIDCRPGG 543

Query: 1972 RLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIR 2091
            RL++SL+YV+REF+ +K DGE  GWV DGLIARP+PNRIR
Sbjct: 544  RLEMSLDYVRREFDIIKTDGESVGWVTDGLIARPAPNRIR 583


>ref|XP_003538922.1| PREDICTED: uncharacterized protein LOC100786144 [Glycine max]
          Length = 592

 Score =  605 bits (1559), Expect = e-170
 Identities = 315/484 (65%), Positives = 364/484 (75%), Gaps = 5/484 (1%)
 Frame = +1

Query: 688  ARAKTDDLSRNHEEVLF-----EIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 852
            A+AKT   S + +  LF     E++GGK  +LVPD + D F+  +               
Sbjct: 110  AKAKTLSSSSSSDTSLFSEPVYEVKGGKWTKLVPDPTDDVFVSAQQGFLSELSSLKPSQL 169

Query: 853  XXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1032
                 VW+KC D+ T +MLPEGFPESVTSDYLEYSLWR VQG+A Q+SGVLATQ+LLYAV
Sbjct: 170  ATF--VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAV 227

Query: 1033 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1212
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ 
Sbjct: 228  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMS 287

Query: 1213 TPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1392
            TPA P  FVLI                 TRSCF+AGFAAQRNFAEVIAKGE QGM S+ I
Sbjct: 288  TPACPQFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFI 347

Query: 1393 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1572
            GI+LGI L N + SST L LASF V+TWIHM+CNLKSYQSIQLRTLNPYRASLVFS+YLL
Sbjct: 348  GIVLGIGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLL 407

Query: 1573 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 1752
            SG  P VKEVNDEEP FPA P+L     S+ Q   LS++AKDAA+ I+ RLQLGSKLS+I
Sbjct: 408  SGQAPPVKEVNDEEPLFPAVPILNATFASKAQSFALSSEAKDAAAEIEHRLQLGSKLSEI 467

Query: 1753 VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 1932
            V S+E+ +AL  LYK EGYIL+E  G+Y VVLKE  S  DMLK+LFQV YLYWLE+NAGI
Sbjct: 468  VNSKEDVLALFGLYKNEGYILSEHMGKYSVVLKEKCSQLDMLKALFQVNYLYWLEKNAGI 527

Query: 1933 KSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIAS 2112
            +    ++D +PG RL ISL+YV+REFNHVKNDGE+ GWV DGLIARP PNRI IG+   S
Sbjct: 528  EGRGTLNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRICIGDTAPS 587

Query: 2113 PAVS 2124
             +VS
Sbjct: 588  NSVS 591


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  603 bits (1556), Expect = e-170
 Identities = 310/487 (63%), Positives = 373/487 (76%), Gaps = 4/487 (0%)
 Frame = +1

Query: 676  SSSAARAKTDDLSRNHE-EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 852
            +S+ A+A+  D   + E E ++E+RG KR  LVPD+ KDEF+  E               
Sbjct: 178  ASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEE-------AAFELSSS 230

Query: 853  XXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAV 1032
                ++  +CR L T  +LPEG+P SVTSDYL+YSLWRGVQGIA+QISGVLATQ+LLYAV
Sbjct: 231  LTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAV 290

Query: 1033 GLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEIL 1212
            GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGME+L
Sbjct: 291  GLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEML 350

Query: 1213 TPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSI 1392
            TP FP  FV+I                 TRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+
Sbjct: 351  TPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSM 410

Query: 1393 GIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLL 1572
            GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLVFS+YL+
Sbjct: 411  GILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLI 470

Query: 1573 SGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDI 1752
            SG  P +KEVNDEEP FPA   L +K   + Q  VLS++AK AA+ I+ RLQLGSKLSD+
Sbjct: 471  SGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEERLQLGSKLSDV 530

Query: 1753 VKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGI 1932
            + ++EEAIAL +LY+ EGYIL E +GR+CV+LKESSS QDML+SLFQV YLYWLE+NAGI
Sbjct: 531  IHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGI 590

Query: 1933 KSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIG---NQ 2103
            +  +   DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+G     
Sbjct: 591  EPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGYDSEP 650

Query: 2104 IASPAVS 2124
            ++SP+ S
Sbjct: 651  LSSPSSS 657


>ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  602 bits (1553), Expect = e-169
 Identities = 307/482 (63%), Positives = 370/482 (76%), Gaps = 3/482 (0%)
 Frame = +1

Query: 664  YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 834
            +F  S+++A AK  +   N +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 122  HFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 181

Query: 835  XXXXXXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1014
                      ++  +CR+L T  +LPEGFP SVTSDYL+YSLWRGVQGIA+QISGVLATQ
Sbjct: 182  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQ 234

Query: 1015 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1194
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 235  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 294

Query: 1195 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1374
            FGME+LTP FP  FV+I                 TRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 295  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 354

Query: 1375 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1554
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 355  MVSKSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 414

Query: 1555 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 1734
            FS+YL+SG  P +KEVNDEEP FP      +K   + Q  VLS++AK AA+ I+ RLQLG
Sbjct: 415  FSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLG 474

Query: 1735 SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 1914
            SKLSD++ ++EEAIAL +LY+ EGYIL E KGR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 475  SKLSDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWL 534

Query: 1915 ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 2094
            E+NAGI+  +   DC+PG RL ISL+YV+REF H K D E  GWV +GLIARP P RIR+
Sbjct: 535  EKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRL 594

Query: 2095 GN 2100
            G+
Sbjct: 595  GH 596


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  602 bits (1552), Expect = e-169
 Identities = 308/493 (62%), Positives = 376/493 (76%), Gaps = 6/493 (1%)
 Frame = +1

Query: 664  YFIFSSSAARAKTDDLSRNHE---EVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXX 834
            +F  S+++A AK  D   + +   E ++E+RG KR  LVPD+ KDEF+  E+        
Sbjct: 128  HFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFELSSSL 187

Query: 835  XXXXXXXXXXDVWMKCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQ 1014
                      ++  +CR+L T  +LPEGFP SVTSDYL+YSLWRGVQGIA+Q+SGVLATQ
Sbjct: 188  TPE-------NLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQ 240

Query: 1015 ALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 1194
            +LLYAVGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAA
Sbjct: 241  SLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAA 300

Query: 1195 FGMEILTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1374
            FGME+LTP FP  FV+I                 TRSCF AGFA+QRNFAEVIAKGEAQG
Sbjct: 301  FGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQG 360

Query: 1375 MVSKSIGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLV 1554
            MVSKS+GI+LGI +AN + +ST+L+LA+FGV+T IHM+ NLKSYQ IQLRTLNPYRASLV
Sbjct: 361  MVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLV 420

Query: 1555 FSQYLLSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLG 1734
            FS+YL+SG  P +KEVNDEEP FP    L +K   + Q  VLS++AK AA  I+ RLQLG
Sbjct: 421  FSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLG 480

Query: 1735 SKLSDIVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWL 1914
            SKLSD++ ++EEAIAL +LY+ EGYIL E +GR+CV+LKESS+ QDML+SLFQV YLYWL
Sbjct: 481  SKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYLYWL 540

Query: 1915 ERNAGIKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRI 2094
            E+NAGI+  +   DC+PG RL ISL+YV+REF H K D +  GWV +GLIARP P RIR+
Sbjct: 541  EKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTRIRL 600

Query: 2095 GNQ---IASPAVS 2124
            G+    ++SP+ S
Sbjct: 601  GHDREPLSSPSSS 613


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  601 bits (1549), Expect = e-169
 Identities = 313/482 (64%), Positives = 368/482 (76%), Gaps = 1/482 (0%)
 Frame = +1

Query: 673  FSSSAARAKTDDLSRNHEEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXX 852
            F S  ARA++   S      ++E++GGK + LVP+   D F++                 
Sbjct: 107  FCSRLARAQSLSSS------VWEVKGGKWILLVPNDLDDTFVVDS-----LFPSTSSTRP 155

Query: 853  XXXXDVWM-KCRDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYA 1029
                ++W+ KCR L   +MLPEG+PESVTSDYL+YSLWR VQG+A+QIS VLATQ+LLYA
Sbjct: 156  VSPLNLWLEKCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYA 215

Query: 1030 VGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEI 1209
            VGLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG E+
Sbjct: 216  VGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEM 275

Query: 1210 LTPAFPHLFVLIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKS 1389
            LTPAFPHLFV I                 TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKS
Sbjct: 276  LTPAFPHLFVPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKS 335

Query: 1390 IGIMLGIALANGVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYL 1569
            IGI +GI LAN + +ST L+LASF V+T+IHM+CNLKSYQSIQLRTLNPYRASLVFS+YL
Sbjct: 336  IGIAMGIGLANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYL 395

Query: 1570 LSGLVPSVKEVNDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSD 1749
            LSG  P +KEVNDE+P FPA P+L +KP ++EQ  VLS +AK AA+ ID RL LGSKLSD
Sbjct: 396  LSGQAPPIKEVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSD 455

Query: 1750 IVKSREEAIALLELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAG 1929
            +V + ++ +AL +LY+ EGYIL E  GR+CVVLKE+ S  DMLK++F V YLYWLE+NAG
Sbjct: 456  VVNNHKDVLALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAG 515

Query: 1930 IKSTTIVDDCRPGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIA 2109
            I   +   D +PG RLQISL+YV+REFNHVK DGE AGW  DGLIARP PNRIR G  +A
Sbjct: 516  IDGASPYLDSKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPG-FVA 574

Query: 2110 SP 2115
            SP
Sbjct: 575  SP 576


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  598 bits (1542), Expect = e-168
 Identities = 301/459 (65%), Positives = 353/459 (76%)
 Frame = +1

Query: 724  EEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXXDVWMKCRDLATSV 903
            ++ ++E++GG  ++L PD+ KD FI                       ++ KC++    +
Sbjct: 132  KQPIWEVKGGNFIKLFPDHLKDIFIASNPTFFSELSSLNVSQVPSF--LYTKCKEFTVRL 189

Query: 904  MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1083
            MLPEGFP SVTSDYLEYSLWRGVQG+A Q+SGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 190  MLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVL 249

Query: 1084 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1263
            KDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHLFV I      
Sbjct: 250  KDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGA 309

Query: 1264 XXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1443
                       TRSCF+AGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SST 
Sbjct: 310  SRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTP 369

Query: 1444 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 1623
            L LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKEVNDEEP F
Sbjct: 370  LVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLF 429

Query: 1624 PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 1803
            PA P+L     ++ Q  VLS++AKDAA  I+ RLQLGSKLS+I+ ++EE +AL  LYK E
Sbjct: 430  PALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNE 489

Query: 1804 GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSRLQI 1983
            GYIL+E  G++CVVLKE+ S  DMLK+LFQV YLYWLE+NAGI+    + DC+PG RL+I
Sbjct: 490  GYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRI 549

Query: 1984 SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGN 2100
            SLEY +REFNH +NDGE AGW+ DGLIARP PNRIR GN
Sbjct: 550  SLEYAEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  597 bits (1538), Expect = e-167
 Identities = 302/405 (74%), Positives = 337/405 (83%)
 Frame = +1

Query: 904  MLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTAAAVNWVL 1083
            MLP+GFP SVTSDYL+YSLWR VQGIA+QISGVLATQALLYAVGLGKGAIPTAAA+NWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 1084 KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVLIXXXXXX 1263
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAAFG+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 1264 XXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGVQSSTA 1443
                       TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN + SST 
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 1444 LSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVNDEEPFF 1623
            L+LASF V+TWIHMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG  P VKE+NDEEP F
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 1624 PAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIALLELYKAE 1803
            PA P L +      Q  VLS++A++AA+ I++RLQLGSKLSD+V ++++ +AL  LY+ E
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 1804 GYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCRPGSRLQI 1983
            GYIL E KGR+CVVLKESSS  DMLKSLFQV YLYWLERNAGI++ +I  DCRP  RLQI
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 1984 SLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRIRIGNQIASPA 2118
            SLEY +REFNHVKND    GWV DGLIARPSP R+  GN  +S A
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGNIASSIA 405


>ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 611

 Score =  597 bits (1538), Expect = e-167
 Identities = 302/462 (65%), Positives = 363/462 (78%), Gaps = 2/462 (0%)
 Frame = +1

Query: 709  LSRNH--EEVLFEIRGGKRVELVPDYSKDEFIIPENMXXXXXXXXXXXXXXXXXDVWMKC 882
            L+RN+   E ++E++GGKR+ L+ D  +DEF +   M                 +VW++C
Sbjct: 151  LARNNMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSSSLSFSFV-------NVWLRC 203

Query: 883  RDLATSVMLPEGFPESVTSDYLEYSLWRGVQGIAAQISGVLATQALLYAVGLGKGAIPTA 1062
             D+ T +MLPEGFP+SVTSDYLEYSLWRGVQGIA+Q+SGVLATQALLYAVGLGKGAIPTA
Sbjct: 204  SDIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTA 263

Query: 1063 AAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVL 1242
            AAVNWVLKDG GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+GME+LTPAFP  FV+
Sbjct: 264  AAVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVV 323

Query: 1243 IXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN 1422
            I                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG+MLGI LAN
Sbjct: 324  IGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLAN 383

Query: 1423 GVQSSTALSLASFGVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEV 1602
             ++SST+L+L  F ++T IHMFCNLKSY+SIQLRTLNPYRASLVFS+YLLSG VPS+K+V
Sbjct: 384  RIRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDV 443

Query: 1603 NDEEPFFPAFPLLILKPTSEEQVEVLSTDAKDAASHIDRRLQLGSKLSDIVKSREEAIAL 1782
            N+EEP FPA PLL  K     +  +LS +AK++A++I++RLQLGSKLSD+    E+ + L
Sbjct: 444  NNEEPLFPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLEL 503

Query: 1783 LELYKAEGYILAELKGRYCVVLKESSSAQDMLKSLFQVCYLYWLERNAGIKSTTIVDDCR 1962
            L L+  E YIL+E +G+YCV+LKES+S  DMLK++F V YL+WLERNAGI + +  +DCR
Sbjct: 504  LSLFNKENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCR 563

Query: 1963 PGSRLQISLEYVQREFNHVKNDGEIAGWVVDGLIARPSPNRI 2088
            PG RLQ+SLEYV+REF HVK DGE+AGW  DGLIARP   RI
Sbjct: 564  PGGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRI 605

