BLASTX nr result

ID: Catharanthus22_contig00010090 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00010090
         (2538 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma caca...   704   0.0  
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   700   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   697   0.0  
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              679   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   679   0.0  
gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]    673   0.0  
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   670   0.0  
gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]    667   0.0  
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     654   0.0  
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   647   0.0  
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   641   0.0  
ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   639   e-180
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   635   e-179
ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago ...   631   e-178
ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutr...   630   e-177
ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thalia...   629   e-177
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   626   e-176
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   622   e-175
ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   619   e-174
ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A...   618   e-174

>gb|EOY25334.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  704 bits (1817), Expect = 0.0
 Identities = 347/488 (71%), Positives = 403/488 (82%)
 Frame = +2

Query: 494  IASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPG 673
            +A    ++ +S ++  N +  ED++V E++G K  +LIPD+  D F+    I+       
Sbjct: 111  VACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNL----- 165

Query: 674  SEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQ 853
                  LS+ ++W QCR++ M L+LPEGFP SVTSDYL+YSLWRGVQGVA+QISGVLATQ
Sbjct: 166  ---TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 222

Query: 854  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAA 1033
            ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA
Sbjct: 223  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 282

Query: 1034 YGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1213
            +G+E+LTPAFPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQG
Sbjct: 283  FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQG 342

Query: 1214 MVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLV 1393
            MVSKSIGI+LGIALANC+ SST LAL+SFGV+TW+HMYCNLKSYQSIQLRTLN YRA LV
Sbjct: 343  MVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLV 402

Query: 1394 FSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLG 1573
            FSEYLLSG  PS+KEVNDEEPLFPAVP LN+  A RE++ VLS +AK AAA+IE RL LG
Sbjct: 403  FSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLG 462

Query: 1574 SKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWL 1753
            SKLSD+V N+EDALALF L++DEGY+LTEHEGKFCV LKESS PQDMLKS+F V+YLYWL
Sbjct: 463  SKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWL 522

Query: 1754 EKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISP 1933
            E+NAGI++S  S DC+PGGRLQ+S+EY++REFNHVK DSE+ GWVTDGLIARPLPNRI P
Sbjct: 523  ERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRP 582

Query: 1934 GYGNASVA 1957
            G+ +AS A
Sbjct: 583  GHRDASTA 590


>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  700 bits (1806), Expect = 0.0
 Identities = 354/473 (74%), Positives = 392/473 (82%)
 Frame = +2

Query: 521  ASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSI 700
            AS V ++  N     IV EIRGGK+ EL+PDY +DEF++ KT+   W R   +      +
Sbjct: 135  ASFVQAKTNN---GEIVHEIRGGKRFELVPDYSKDEFVLTKTM---WSRLLPDSKSGSFV 188

Query: 701  GSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLG 880
             +LWMQC+EL   L+LPEGFP SVTSDYLEY+LWRGVQGVAAQISGVLATQALLYAVGLG
Sbjct: 189  SNLWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLG 248

Query: 881  KGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPA 1060
            KGAIPTAAA+NWVLKDGIGYLSKILLS YGRHFDVNPK WRLFADLLENAAYG+EILTPA
Sbjct: 249  KGAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPA 308

Query: 1061 FPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1240
            FPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGIM
Sbjct: 309  FPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIM 368

Query: 1241 LGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGL 1420
            LGIALANC +SST LAL+SFGV+TWIHM+CNLKSY SIQLRTLNPYRA LVFSEYLLSGL
Sbjct: 369  LGIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGL 428

Query: 1421 VPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKN 1600
            VPSVKEVNDEEPLFPA  +LN+K AY  Q EVLS  AK AAA I  RL LGSKLSDV  +
Sbjct: 429  VPSVKEVNDEEPLFPAA-ILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATS 487

Query: 1601 REDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSS 1780
            RED LALF+L+++EGY+LTEHEG+FC+ LKESSSPQDMLKS+FHV+YLYWLE  AGIKSS
Sbjct: 488  REDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSS 547

Query: 1781 STSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            S ++DC+PGGRLQMSLEY+ REFNHVK+D E AGWVTD LIARPLPNRI   Y
Sbjct: 548  SVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRLDY 600


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  697 bits (1798), Expect = 0.0
 Identities = 353/477 (74%), Positives = 392/477 (82%)
 Frame = +2

Query: 521  ASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSI 700
            AS V ++  N     IV EIRGGK+ EL+PDY +DEF++ KT+    W   +  +    +
Sbjct: 132  ASFVQAKTNN---GEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWPDSTSGSF---V 185

Query: 701  GSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLG 880
             +LWMQC+EL   L LPEGFP SVTSDYLEY+LWRGVQG+AAQISGVLATQALLYAVGLG
Sbjct: 186  SNLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGLG 245

Query: 881  KGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPA 1060
            KGAIPTAAAINWVLKDGIGYLSKILLS YGRHFDVNPK WRLFADLLENAAYG+EILTPA
Sbjct: 246  KGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPA 305

Query: 1061 FPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1240
            FPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK+IGIM
Sbjct: 306  FPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIM 365

Query: 1241 LGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGL 1420
            LGIALAN  +SST LAL+SFGV+TWIHM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSGL
Sbjct: 366  LGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGL 425

Query: 1421 VPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKN 1600
            VPSVKEVNDEEPLFPA  +LN+K AY  Q EVLS  AK AAA I  RL LGSKLSDV  +
Sbjct: 426  VPSVKEVNDEEPLFPAA-ILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVATS 484

Query: 1601 REDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSS 1780
            +ED LALF+L+++EGY+LTEHEG+FC+ LKESSSPQDMLKS+FHV+YLYWLE NAGIKSS
Sbjct: 485  QEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKSS 544

Query: 1781 STSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGYGNAS 1951
            S ++DC+PGGRLQMSLEY+ REFNHVK+D E AGWVTD LIARPLP RI   Y   S
Sbjct: 545  SVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRLDYAAES 601


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  679 bits (1751), Expect = 0.0
 Identities = 338/463 (73%), Positives = 388/463 (83%)
 Frame = +2

Query: 551  ETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWMQCREL 730
            E E+  V E+RGGK  ++IPD  +DEF+V    + +   P S      ++ +LW+QC+EL
Sbjct: 28   EKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGIGAVGAPKSS-----TLPNLWLQCKEL 82

Query: 731  FMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAI 910
            F+ LMLPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 83   FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 142

Query: 911  NWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXX 1090
            NWVLKDGIGYLSKILLSKYGRHFDV+PKGWRLFADLLENAAYG+EILTPAFPH F+ I  
Sbjct: 143  NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 202

Query: 1091 XXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIQ 1270
                           TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCI 
Sbjct: 203  VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 262

Query: 1271 SSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDE 1450
            SS PL+ +SF V+T +HM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSG VPS+KEVN+E
Sbjct: 263  SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEE 322

Query: 1451 EPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALFDL 1630
            EPLFP VPLLN KP Y+ Q+ VLS +AKDAAA IE RL LGSKLS+VV ++ED LALFDL
Sbjct: 323  EPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDL 382

Query: 1631 FQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKPGG 1810
            +++E Y+LTEH+G+F V LKES SPQDMLKS+FHV+YLYWLE+NAGI S   SDDC+PGG
Sbjct: 383  YRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGG 442

Query: 1811 RLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            RLQ+SLEY++REFNH+K+DSE  GW TDGLIARPLPNRI PG+
Sbjct: 443  RLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 485


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  679 bits (1751), Expect = 0.0
 Identities = 338/463 (73%), Positives = 388/463 (83%)
 Frame = +2

Query: 551  ETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWMQCREL 730
            E E+  V E+RGGK  ++IPD  +DEF+V    + +   P S      ++ +LW+QC+EL
Sbjct: 230  EKEEEGVWEVRGGKWHKIIPDSSKDEFLVVTPGIGAVGAPKSS-----TLPNLWLQCKEL 284

Query: 731  FMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAI 910
            F+ LMLPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQALLYAVGLGKGAIPTAAA+
Sbjct: 285  FLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAV 344

Query: 911  NWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXX 1090
            NWVLKDGIGYLSKILLSKYGRHFDV+PKGWRLFADLLENAAYG+EILTPAFPH F+ I  
Sbjct: 345  NWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLLIGA 404

Query: 1091 XXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIQ 1270
                           TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCI 
Sbjct: 405  VAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIG 464

Query: 1271 SSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDE 1450
            SS PL+ +SF V+T +HM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSG VPS+KEVN+E
Sbjct: 465  SSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEVNEE 524

Query: 1451 EPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALFDL 1630
            EPLFP VPLLN KP Y+ Q+ VLS +AKDAAA IE RL LGSKLS+VV ++ED LALFDL
Sbjct: 525  EPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLALFDL 584

Query: 1631 FQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKPGG 1810
            +++E Y+LTEH+G+F V LKES SPQDMLKS+FHV+YLYWLE+NAGI S   SDDC+PGG
Sbjct: 585  YRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCRPGG 644

Query: 1811 RLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            RLQ+SLEY++REFNH+K+DSE  GW TDGLIARPLPNRI PG+
Sbjct: 645  RLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 687


>gb|EOY25337.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 577

 Score =  673 bits (1736), Expect = 0.0
 Identities = 335/488 (68%), Positives = 391/488 (80%)
 Frame = +2

Query: 494  IASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPG 673
            +A    ++ +S ++  N +  ED++V E++G K  +LIPD+  D F+    I+       
Sbjct: 111  VACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNL----- 165

Query: 674  SEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQ 853
                  LS+ ++W QCR++ M L+LPEGFP SVTSDYL+YSLWRGVQGVA+QISGVLATQ
Sbjct: 166  ---TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 222

Query: 854  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAA 1033
            ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA
Sbjct: 223  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 282

Query: 1034 YGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1213
            +G+E+LTPAFPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQG
Sbjct: 283  FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQG 342

Query: 1214 MVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLV 1393
            MVSKSIGI+LGIALANC+ SST LAL+SFGV+TW+HMYCNLKSYQSIQLRTLN YRA LV
Sbjct: 343  MVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLV 402

Query: 1394 FSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLG 1573
            FSEYLLSG  PS+KEVNDEEPLFPAVP LN+  A RE++ VLS +AK AAA+IE RL LG
Sbjct: 403  FSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLG 462

Query: 1574 SKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWL 1753
            SKLSD+V N+EDALALF L++DEGY+LTEHEGKFC              S+F V+YLYWL
Sbjct: 463  SKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWL 508

Query: 1754 EKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISP 1933
            E+NAGI++S  S DC+PGGRLQ+S+EY++REFNHVK DSE+ GWVTDGLIARPLPNRI P
Sbjct: 509  ERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRP 568

Query: 1934 GYGNASVA 1957
            G+ +AS A
Sbjct: 569  GHRDASTA 576


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  670 bits (1729), Expect = 0.0
 Identities = 340/489 (69%), Positives = 384/489 (78%)
 Frame = +2

Query: 491  CIASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRP 670
            C   VA A   +  SSE+    E + V E++G K+ +LIPD+ +D F+V           
Sbjct: 104  CHLQVATAIARTATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVASA-------S 156

Query: 671  GSEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLAT 850
             +  +  LS+  LW +CRELF+  MLPEGFP SVTSDYL YSLWR VQGVA+QISGVLAT
Sbjct: 157  NASLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLAT 216

Query: 851  QALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENA 1030
            QALLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENA
Sbjct: 217  QALLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENA 276

Query: 1031 AYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQ 1210
            A+G+E+LTPAFPH FV I                 TRSCFYAGFAA+RNFAEVIAKGEAQ
Sbjct: 277  AFGLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQ 336

Query: 1211 GMVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGL 1390
            GMVSK+IGIMLGIALAN I SS P AL+SF V+TWIHMYCNLKSYQSI+LRTLNPYRA L
Sbjct: 337  GMVSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASL 396

Query: 1391 VFSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHL 1570
            VFSEYLLSG  P VKEVNDEEPLFPA     IK A + Q  VLS +AKDAA  IE RL L
Sbjct: 397  VFSEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQL 456

Query: 1571 GSKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYW 1750
            GSKLSDVV N+EDA ALF L++DEGY+LTEH GKFCV LKES+ PQDMLKS+F  SYLYW
Sbjct: 457  GSKLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYW 516

Query: 1751 LEKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRIS 1930
            LE+NAGI ++STS DC PGGRL++SL+Y++REFNHVKSDS + GWVTDGLIARPLPNRI 
Sbjct: 517  LERNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIR 576

Query: 1931 PGYGNASVA 1957
            PGY   SVA
Sbjct: 577  PGYVEPSVA 585


>gb|EOY25338.1| Uncharacterized protein isoform 5 [Theobroma cacao]
          Length = 573

 Score =  667 bits (1720), Expect = 0.0
 Identities = 333/488 (68%), Positives = 388/488 (79%)
 Frame = +2

Query: 494  IASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPG 673
            +A    ++ +S ++  N +  ED++V E++G K  +LIPD+  D F+    I+       
Sbjct: 111  VACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDFSEDAFVASNGIVNL----- 165

Query: 674  SEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQ 853
                  LS+ ++W QCR++ M L+LPEGFP SVTSDYL+YSLWRGVQGVA+QISGVLATQ
Sbjct: 166  ---TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQ 222

Query: 854  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAA 1033
            ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA
Sbjct: 223  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAA 282

Query: 1034 YGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1213
            +G+E+LTPAFPHLFVPI                 TRSCFYAGFAAQRNFAEVIAKGEAQG
Sbjct: 283  FGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQG 342

Query: 1214 MVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLV 1393
            MVSKSIGI+LGIALANC+ SST LAL+SFGV+TW+HMYCNLKSYQSIQLRTLN YRA LV
Sbjct: 343  MVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLV 402

Query: 1394 FSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLG 1573
            FSEYLLSG  PS+KEVNDEEPLFPAVP LN+  A RE++ VLS +AK AAA+IE RL LG
Sbjct: 403  FSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLG 462

Query: 1574 SKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWL 1753
            SKLSD+V N+EDALALF L++DEGY+LTEHEGKFC                  V+YLYWL
Sbjct: 463  SKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC------------------VNYLYWL 504

Query: 1754 EKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISP 1933
            E+NAGI++S  S DC+PGGRLQ+S+EY++REFNHVK DSE+ GWVTDGLIARPLPNRI P
Sbjct: 505  ERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRP 564

Query: 1934 GYGNASVA 1957
            G+ +AS A
Sbjct: 565  GHRDASTA 572


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  654 bits (1686), Expect = 0.0
 Identities = 328/466 (70%), Positives = 378/466 (81%), Gaps = 1/466 (0%)
 Frame = +2

Query: 545  RNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWMQ-C 721
            R ++  + V E++GGK I L+P+   D F+V          P +     +S  +LW++ C
Sbjct: 113  RAQSLSSSVWEVKGGKWILLVPNDLDDTFVVDSLF------PSTSSTRPVSPLNLWLEKC 166

Query: 722  RELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTA 901
            R+L M LMLPEG+P SVTSDYL+YSLWR VQGVA+QIS VLATQ+LLYAVGLGKGAIPTA
Sbjct: 167  RQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYAVGLGKGAIPTA 226

Query: 902  AAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVP 1081
            AA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA+G E+LTPAFPHLFVP
Sbjct: 227  AALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEMLTPAFPHLFVP 286

Query: 1082 IXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN 1261
            I                 TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGI +GI LAN
Sbjct: 287  IGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIAMGIGLAN 346

Query: 1262 CIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEV 1441
            CI +STPLAL+SF V+T+IHMYCNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P +KEV
Sbjct: 347  CIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIKEV 406

Query: 1442 NDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALAL 1621
            NDE+PLFPAVP+LN+KP  +EQ  VLS +AK AAA I+ RL LGSKLSDVV N +D LAL
Sbjct: 407  NDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSDVVNNHKDVLAL 466

Query: 1622 FDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCK 1801
            FDL+++EGY+LTEH G+FCV LKE+ SP DMLK+MFHV+YLYWLEKNAGI  +S   D K
Sbjct: 467  FDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAGIDGASPYLDSK 526

Query: 1802 PGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            PGGRLQ+SL+Y+ REFNHVK D E+AGW TDGLIARPLPNRI PG+
Sbjct: 527  PGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPGF 572


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  647 bits (1668), Expect = 0.0
 Identities = 332/480 (69%), Positives = 385/480 (80%)
 Frame = +2

Query: 497  ASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGS 676
            AS A ART      E   E  ++ V  ++G K+I LIPD+ +DEF+V  ++  S+    S
Sbjct: 58   ASSAFARTTL---KEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLPSSYDDIIS 114

Query: 677  EKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQA 856
               L     +LW+QCR LF+ LMLPEG+P SVTSDYL+YSLWRGVQGVA+QISGVLATQA
Sbjct: 115  SSWLHFG-RTLWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQA 173

Query: 857  LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAY 1036
            LLYA+GLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA+
Sbjct: 174  LLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAF 233

Query: 1037 GMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGM 1216
            G+EILTPAFPHLFV I                 TRSCFYAGFAAQRNFAEVIAKGEAQGM
Sbjct: 234  GLEILTPAFPHLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGM 293

Query: 1217 VSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVF 1396
            VSK IGIMLGI LANCI SS PLAL+SF V+TWIHM+CNLKSYQSIQLRTLNPYRA LVF
Sbjct: 294  VSKFIGIMLGIGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVF 353

Query: 1397 SEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGS 1576
            SEYLLSG  P +K+VNDEEPLFPAV   + K A +    VLS +A+DAA  IE RL LGS
Sbjct: 354  SEYLLSGQAPPIKDVNDEEPLFPAV-FPHFKSADKPSLVVLSLEARDAATEIERRLQLGS 412

Query: 1577 KLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLE 1756
            KLSDVV ++ED LALF+L++DEGY+LTE++G+FCV LKES S QDMLK++F V+YLYWLE
Sbjct: 413  KLSDVVNSKEDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLE 472

Query: 1757 KNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPG 1936
            +NAG+ +  TS DC+ GGRLQ+SLEY++REF+HV++DS + GWV DGLIARPLPNRI PG
Sbjct: 473  RNAGLDARGTSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPG 532


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  641 bits (1653), Expect = 0.0
 Identities = 322/481 (66%), Positives = 373/481 (77%)
 Frame = +2

Query: 494  IASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPG 673
            +A  A ART S  SS   NE     + E++GG  I+L PD+ +D FI       +++   
Sbjct: 110  LAKFAMARTPSSCSSSIENEILKQPIWEVKGGNFIKLFPDHLKDIFIASNP---TFFSEL 166

Query: 674  SEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQ 853
            S  N+      L+ +C+E  + LMLPEGFP SVTSDYLEYSLWRGVQGVA Q+SGVLATQ
Sbjct: 167  SSLNVSQVPSFLYTKCKEFTVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQ 226

Query: 854  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAA 1033
            ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLS +GRHFDVNPKGWRLFADLLENAA
Sbjct: 227  ALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAA 286

Query: 1034 YGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQG 1213
            +G+E+ TPAFPHLFVPI                 TRSCF+AGFAAQRNFAEVIAKGE QG
Sbjct: 287  FGLEMCTPAFPHLFVPIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQG 346

Query: 1214 MVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLV 1393
            M S+ IGI LGI L NCI SSTPL L+SF V+TW+HMYCNLKSYQSIQLRTLNPYRA LV
Sbjct: 347  MASRFIGIALGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLV 406

Query: 1394 FSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLG 1573
            FSEYLLSG  P VKEVNDEEPLFPA+P+LN   A + Q+ VLS +AKDAA  IE RL LG
Sbjct: 407  FSEYLLSGQAPPVKEVNDEEPLFPALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLG 466

Query: 1574 SKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWL 1753
            SKLS+++ N+E+ LALF L+++EGY+L+EH GKFCV LKE+ S  DMLK++F V+YLYWL
Sbjct: 467  SKLSEIIHNKEEVLALFSLYKNEGYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWL 526

Query: 1754 EKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISP 1933
            EKNAGI+      DCKPGGRL++SLEY  REFNH ++D E+AGW+ DGLIARPLPNRI P
Sbjct: 527  EKNAGIEGRGALYDCKPGGRLRISLEYAEREFNHARNDGESAGWIADGLIARPLPNRIRP 586

Query: 1934 G 1936
            G
Sbjct: 587  G 587


>ref|XP_004292905.1| PREDICTED: UPF0420 protein C16orf58 homolog [Fragaria vesca subsp.
            vesca]
          Length = 593

 Score =  639 bits (1647), Expect = e-180
 Identities = 322/470 (68%), Positives = 376/470 (80%)
 Frame = +2

Query: 548  NETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWMQCRE 727
            +E +   V E++GGK  +L PD+ RD F+            G      +S  SL +QC+ 
Sbjct: 133  SEEDAESVWEVKGGKWTKLAPDFVRDAFVAD----------GGGGLGSISFESLGLQCKS 182

Query: 728  LFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAA 907
            LF+ LMLPEGFP SVTSDYL+YSLWR VQGVA+Q+SGVLATQALLYAVGLGKGAIPTAAA
Sbjct: 183  LFVQLMLPEGFPDSVTSDYLDYSLWRAVQGVASQVSGVLATQALLYAVGLGKGAIPTAAA 242

Query: 908  INWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIX 1087
            +NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAA+GME+LTP FP+ F+ I 
Sbjct: 243  LNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEMLTPVFPNHFLLIG 302

Query: 1088 XXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCI 1267
                            TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALAN I
Sbjct: 303  AAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANQI 362

Query: 1268 QSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVND 1447
             SST L L+SF ++T IHM+CNLKSYQ+IQLRTLNPYRA LVFSEYLLSG  P VK+VN+
Sbjct: 363  GSSTSLGLASFSLVTCIHMFCNLKSYQAIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNE 422

Query: 1448 EEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALFD 1627
            EEPLFPAVP LN KPA + Q  VLS +AKDAAA IE RL LG KLSD++ N+ED  ALF+
Sbjct: 423  EEPLFPAVPFLNWKPANKGQPTVLSSEAKDAAAEIEQRLQLGCKLSDLINNKEDVHALFN 482

Query: 1628 LFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKPG 1807
            L+++EGY+LTEH G++CV LKE+SS QDMLK++FHV+YLYWLEKNAGI++  TS DC+PG
Sbjct: 483  LYKEEGYILTEHRGRYCVVLKETSSLQDMLKALFHVNYLYWLEKNAGIEAKGTSIDCRPG 542

Query: 1808 GRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGYGNASVA 1957
            GRL+MSL+Y+RREF+ +K+D E+ GWVTDGLIARP PNRI P Y  +SVA
Sbjct: 543  GRLEMSLDYVRREFDIIKTDGESVGWVTDGLIARPAPNRIRPVYEASSVA 592


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  635 bits (1638), Expect = e-179
 Identities = 320/478 (66%), Positives = 377/478 (78%)
 Frame = +2

Query: 506  ADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKN 685
            A +  A   +S++ + TE   V E+RG K+  L+PD+ +DEF+  +   E         +
Sbjct: 177  AASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEEAAFEL--------S 228

Query: 686  LKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLY 865
              L+  +L  QCR L    +LPEG+P SVTSDYL+YSLWRGVQG+A+QISGVLATQ+LLY
Sbjct: 229  SSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLY 288

Query: 866  AVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGME 1045
            AVGLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+GME
Sbjct: 289  AVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGME 348

Query: 1046 ILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 1225
            +LTP FP  FV I                 TRSCF AGFA+QRNFAEVIAKGEAQGMVSK
Sbjct: 349  MLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSK 408

Query: 1226 SIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEY 1405
            S+GI+LGI +ANCI +ST LAL++FGV+T IHMY NLKSYQ IQLRTLNPYRA LVFSEY
Sbjct: 409  SMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPYRASLVFSEY 468

Query: 1406 LLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLS 1585
            L+SG  P +KEVNDEEPLFPAV  LNIK   + Q  VLS +AK AAA+IE RL LGSKLS
Sbjct: 469  LISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEERLQLGSKLS 528

Query: 1586 DVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNA 1765
            DV+ N+E+A+ALFDL+++EGY+LTEH G+FCV LKESSSPQDML+S+F V+YLYWLEKNA
Sbjct: 529  DVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKNA 588

Query: 1766 GIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            GI+ +ST  DCKPGGRL +SL+Y+RREF H K DSE+ GWVT+GLIARPLP RI  GY
Sbjct: 589  GIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGY 646


>ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
            gi|355513788|gb|AES95411.1| hypothetical protein
            MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  631 bits (1628), Expect = e-178
 Identities = 326/483 (67%), Positives = 376/483 (77%), Gaps = 1/483 (0%)
 Frame = +2

Query: 491  CIASVADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRP 670
            C+  +A A+T S +SSE+   T+   + E++GG  I+L PD  +D FI     L S    
Sbjct: 101  CLCQLAMAKTRS-LSSEDDILTQP--IYEVKGGNLIKLFPDNLKDIFIASNPGLFSEL-- 155

Query: 671  GSEKNLKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLAT 850
             S  N       L+ +CRE  + LMLPEGFP SVTSDYLEYSLWRGVQGVA Q+SGVLAT
Sbjct: 156  -SSLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLAT 214

Query: 851  QALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENA 1030
            QALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLS +GRHFDVNPKGWRLFADLLENA
Sbjct: 215  QALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENA 274

Query: 1031 AYGMEILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQ 1210
            A+G+E+ TPAFPHLFVPI                 TRSCF+AGFAAQRNFAEVIAKGE Q
Sbjct: 275  AFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQ 334

Query: 1211 GMVSKSIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGL 1390
            GMVS+ IGI +GI L NCI SSTPL L+SF V+TW+HMYCNLKSYQSIQLRTLNP+RA L
Sbjct: 335  GMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASL 394

Query: 1391 VFSEYLLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYRE-QAEVLSEDAKDAAANIECRLH 1567
            VFSEYLLSG  P VKEVN EEPLFPAVP+LN   A +E Q+ VLS +AKDAA  IE RL 
Sbjct: 395  VFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSEAKDAAVEIESRLQ 454

Query: 1568 LGSKLSDVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLY 1747
            LGSKLS+++ N+E+ LALF L+++EGY+L+EH GKFCV LKE+ S  DMLK++F V+YLY
Sbjct: 455  LGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKETCSQLDMLKALFQVNYLY 514

Query: 1748 WLEKNAGIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRI 1927
            WLEKNAGI+   T  DCKPGGRLQ+SLEY  REFNHV++D E+ GW+TDGLIARPLPNR 
Sbjct: 515  WLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGESVGWITDGLIARPLPNRC 574

Query: 1928 SPG 1936
             PG
Sbjct: 575  RPG 577


>ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum]
            gi|557096914|gb|ESQ37422.1| hypothetical protein
            EUTSA_v10002446mg [Eutrema salsugineum]
          Length = 611

 Score =  630 bits (1624), Expect = e-177
 Identities = 321/468 (68%), Positives = 369/468 (78%)
 Frame = +2

Query: 536  SENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWM 715
            S++  +TE   V E+RG K+  L+PD+ RDEF V          P    +  L+  +L  
Sbjct: 142  SDSNGDTEKETVWEVRGSKRKRLVPDFVRDEFFVS---------PEETTSSPLTPENLLA 192

Query: 716  QCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIP 895
            QCR L    +LPEGFP SVTSDYL+YSLWRGVQG+A+QISGVLATQ+LLYAVGLGKGAIP
Sbjct: 193  QCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIP 252

Query: 896  TAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLF 1075
            TAAAINWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLEN+A+GME+LTP FP  F
Sbjct: 253  TAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENSAFGMEMLTPLFPQFF 312

Query: 1076 VPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIAL 1255
            V I                 TRSCF AGFA+QRNFAEVIAKGEAQGMVSKSIGI+LGI +
Sbjct: 313  VLIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSIGILLGIVV 372

Query: 1256 ANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVK 1435
            ANCI +ST LAL+SFGV+T IHMY NLKSYQ IQLRTLNPYRA LVFSEYL+SG  P +K
Sbjct: 373  ANCIGTSTSLALASFGVVTSIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPPIK 432

Query: 1436 EVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDAL 1615
            EVNDEEPLFP V  LNIK A + Q  VLS +AK AAA+IE RL LGSKLSDVV N+E+A+
Sbjct: 433  EVNDEEPLFPTVRSLNIKSAEKRQDFVLSSEAKAAAADIEERLQLGSKLSDVVHNKEEAV 492

Query: 1616 ALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDD 1795
            ALFDL++DEGY+LTEH G+FCV LKESSSPQDML+S+F V+YLYWLEKNAGI++S+T  D
Sbjct: 493  ALFDLYRDEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGIEASNTYLD 552

Query: 1796 CKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            CKPGGRL +SL+Y+RREF   K DSE  GWVT+GLIARPL  RI   Y
Sbjct: 553  CKPGGRLHISLDYVRREFELAKEDSELVGWVTEGLIARPLSTRIRLDY 600


>ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  629 bits (1623), Expect = e-177
 Identities = 315/478 (65%), Positives = 376/478 (78%)
 Frame = +2

Query: 506  ADARTASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKN 685
            A +  A D +S++  +     V E+RG K+  L+PD+ +DEF+  ++  E         +
Sbjct: 127  AASAIAKDQNSDSNGDAVKETVWEVRGSKRKRLVPDFVKDEFVSEESAFEL--------S 178

Query: 686  LKLSIGSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLY 865
              L+  +L  QCR L    +LPEGFP SVTSDYL+YSLWRGVQG+A+QISGVLATQ+LLY
Sbjct: 179  SSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLLY 238

Query: 866  AVGLGKGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGME 1045
            AVGLGKGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+GME
Sbjct: 239  AVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGME 298

Query: 1046 ILTPAFPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSK 1225
            +LTP FP  FV I                 TRSCF AGFA+QRNFAEVIAKGEAQGMVSK
Sbjct: 299  MLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSK 358

Query: 1226 SIGIMLGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEY 1405
            S+GI+LGI +ANCI +ST LAL++FGV+T IHMY NLKSYQ IQLRTLNPYRA LVFSEY
Sbjct: 359  SVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEY 418

Query: 1406 LLSGLVPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLS 1585
            L+SG  P +KEVNDEEPLFP V   N+K   + Q  VLS +AK AAA+IE RL LGSKLS
Sbjct: 419  LISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLGSKLS 478

Query: 1586 DVVKNREDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNA 1765
            DV+ N+E+A+ALFDL+++EGY+LTEH+G+FCV LKESS+PQDML+S+F V+YLYWLEKNA
Sbjct: 479  DVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNA 538

Query: 1766 GIKSSSTSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            GI+ +ST  DCKPGGRL +SL+Y+RREF H K DSE+ GWVT+GLIARPLP RI  G+
Sbjct: 539  GIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGH 596


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  626 bits (1615), Expect = e-176
 Identities = 314/473 (66%), Positives = 375/473 (79%)
 Frame = +2

Query: 521  ASDVSSENRNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSI 700
            ASD  S++  +T+   V E+RG K+  L+PD+ +DEF+  ++  E         +  L+ 
Sbjct: 140  ASD--SDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFEL--------SSSLTP 189

Query: 701  GSLWMQCRELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLG 880
             +L  QCR L    +LPEGFP SVTSDYL+YSLWRGVQG+A+Q+SGVLATQ+LLYAVGLG
Sbjct: 190  ENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLLYAVGLG 249

Query: 881  KGAIPTAAAINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPA 1060
            KGAIPTAAAINWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+GME+LTP 
Sbjct: 250  KGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPV 309

Query: 1061 FPHLFVPIXXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1240
            FP  FV I                 TRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+GI+
Sbjct: 310  FPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSMGIL 369

Query: 1241 LGIALANCIQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGL 1420
            LGI +ANCI +ST LAL++FGV+T IHMY NLKSYQ IQLRTLNPYRA LVFSEYL+SG 
Sbjct: 370  LGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQ 429

Query: 1421 VPSVKEVNDEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKN 1600
             P +KEVNDEEPLFP V  LN+K   + Q  VLS +AK AA +IE RL LGSKLSDV+ N
Sbjct: 430  APLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLGSKLSDVIHN 489

Query: 1601 REDALALFDLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSS 1780
            +E+A+ALFDL+++EGY+LTEH G+FCV LKESS+PQDML+S+F V+YLYWLEKNAGI+ +
Sbjct: 490  KEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPA 549

Query: 1781 STSDDCKPGGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGY 1939
            ST  DCKPGGRL +SL+Y+RREF H K DS++ GWVT+GLIARPLP RI  G+
Sbjct: 550  STYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTRIRLGH 602


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  622 bits (1604), Expect = e-175
 Identities = 310/405 (76%), Positives = 347/405 (85%)
 Frame = +2

Query: 743  MLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAINWVL 922
            MLP+GFP SVTSDYL+YSLWR VQG+A+QISGVLATQALLYAVGLGKGAIPTAAAINWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 923  KDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXXX 1102
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 1103 XXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIQSSTP 1282
                       TRSCFYAGFAAQRNFAEVIAKGEAQGMVSK IGIMLGIALANCI SSTP
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 1283 LALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPLF 1462
            LAL+SF V+TWIHM+CNLKSYQSIQLRTLNPYRA LVFSEYLLSG  P VKE+NDEEPLF
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 1463 PAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALFDLFQDE 1642
            PAVP LNI      Q+ VLS +A++AAA IE RL LGSKLSDVV N++D LALF+L++DE
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 1643 GYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKPGGRLQM 1822
            GY+LTEH+G+FCV LKESSSP DMLKS+F V+YLYWLE+NAGI++ S S DC+P GRLQ+
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 1823 SLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRISPGYGNASVA 1957
            SLEY RREFNHVK+DS + GWV DGLIARP P R+ PG   +S+A
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGNIASSIA 405


>ref|XP_004145647.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus]
          Length = 611

 Score =  619 bits (1597), Expect = e-174
 Identities = 316/461 (68%), Positives = 361/461 (78%)
 Frame = +2

Query: 545  RNETEDNIVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLSIGSLWMQCR 724
            RN      + E++GGK+I LI D  RDEF V   +  S        +L  S  ++W++C 
Sbjct: 153  RNNMNTESIWEVKGGKRIRLILDTYRDEFHVATGMPSS--------SLSFSFVNVWLRCS 204

Query: 725  ELFMNLMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAA 904
            ++F  LMLPEGFP SVTSDYLEYSLWRGVQG+A+Q+SGVLATQALLYAVGLGKGAIPTAA
Sbjct: 205  DIFTRLMLPEGFPDSVTSDYLEYSLWRGVQGIASQVSGVLATQALLYAVGLGKGAIPTAA 264

Query: 905  AINWVLKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPI 1084
            A+NWVLKDG GYLSKI LSKYGRHFDV+PKGWRLFADLLENAAYGME+LTPAFP  FV I
Sbjct: 265  AVNWVLKDGFGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGMEMLTPAFPLHFVVI 324

Query: 1085 XXXXXXXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANC 1264
                             TRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG+MLGI LAN 
Sbjct: 325  GAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGMMLGITLANR 384

Query: 1265 IQSSTPLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVN 1444
            I+SST LAL  F ++T IHM+CNLKSY+SIQLRTLNPYRA LVFSEYLLSG VPS+K+VN
Sbjct: 385  IRSSTSLALGCFSIVTLIHMFCNLKSYKSIQLRTLNPYRASLVFSEYLLSGEVPSIKDVN 444

Query: 1445 DEEPLFPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALF 1624
            +EEPLFPAVPLLN K     +  +LS +AK++AANIE RL LGSKLSDV    ED L L 
Sbjct: 445  NEEPLFPAVPLLNRKAPDWSRDFLLSAEAKESAANIEKRLQLGSKLSDVATCEEDVLELL 504

Query: 1625 DLFQDEGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKP 1804
             LF  E Y+L+EH GK+CV LKES+SP DMLK++FHV+YL+WLE+NAGI + S S+DC+P
Sbjct: 505  SLFNKENYILSEHRGKYCVMLKESASPVDMLKAVFHVNYLHWLERNAGITARSASNDCRP 564

Query: 1805 GGRLQMSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRI 1927
            GGRLQMSLEY+ REF HVK D E AGW TDGLIARPL  RI
Sbjct: 565  GGRLQMSLEYVEREFKHVKYDGELAGWSTDGLIARPLTTRI 605


>ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda]
            gi|548831916|gb|ERM94718.1| hypothetical protein
            AMTR_s00011p00244680 [Amborella trichopoda]
          Length = 565

 Score =  618 bits (1594), Expect = e-174
 Identities = 312/456 (68%), Positives = 359/456 (78%), Gaps = 2/456 (0%)
 Frame = +2

Query: 566  IVCEIRGGKKIELIPDYDRDEFIVPKTILESWWRPGSEKNLKLS--IGSLWMQCRELFMN 739
            +  E++GGK   +  D  +DE      +     R  S   L L   +GS W+ CREL + 
Sbjct: 107  VAWEVKGGKWSPVYADSSKDELFADNAL-----RLLSSGVLDLGKILGSSWLWCRELAVR 161

Query: 740  LMLPEGFPGSVTSDYLEYSLWRGVQGVAAQISGVLATQALLYAVGLGKGAIPTAAAINWV 919
            LMLPEG+P SV+SDYLEYSLWR VQGVA+QI+GVL TQALLYAVGLGKGAIPTAAA+NWV
Sbjct: 162  LMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGLGKGAIPTAAAVNWV 221

Query: 920  LKDGIGYLSKILLSKYGRHFDVNPKGWRLFADLLENAAYGMEILTPAFPHLFVPIXXXXX 1099
            LKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAAYG+E+LTPA+P  FV I     
Sbjct: 222  LKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTPAYPQFFVLIGAAAG 281

Query: 1100 XXXXXXXXXXXXTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANCIQSST 1279
                        TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN I +S 
Sbjct: 282  AGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANHIGASG 341

Query: 1280 PLALSSFGVITWIHMYCNLKSYQSIQLRTLNPYRAGLVFSEYLLSGLVPSVKEVNDEEPL 1459
            PLA +SFGV+T +HM+CNLKSYQSIQLRTLNPYR  LVFSEYLLSG VP VKEVNDEEPL
Sbjct: 342  PLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSGEVPPVKEVNDEEPL 401

Query: 1460 FPAVPLLNIKPAYREQAEVLSEDAKDAAANIECRLHLGSKLSDVVKNREDALALFDLFQD 1639
            F     L + P    Q++VLS +AK+AAA IE RL LG KLSDVV  +ED LALFDLF+ 
Sbjct: 402  FSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVSKKEDVLALFDLFEK 461

Query: 1640 EGYVLTEHEGKFCVALKESSSPQDMLKSMFHVSYLYWLEKNAGIKSSSTSDDCKPGGRLQ 1819
            EGY+LTE +GK+CV LKE  SPQDMLKS+F VSYLYWLE+NAGI S S S DCKPGG++Q
Sbjct: 462  EGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDSRSASTDCKPGGKMQ 521

Query: 1820 MSLEYIRREFNHVKSDSEAAGWVTDGLIARPLPNRI 1927
            +S +Y++REFNHVK+DS+AAGW+TDGLIARPLP R+
Sbjct: 522  LSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRV 557


Top