BLASTX nr result

ID: Mentha29_contig00000943 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00000943
         (2042 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus...   772   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   705   0.0  
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   702   0.0  
ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma...   689   0.0  
ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma...   659   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   656   0.0  
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   654   0.0  
ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma...   652   0.0  
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              652   0.0  
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   640   0.0  
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   637   e-180
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   635   e-179
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   632   e-178
ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thali...   632   e-178
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     631   e-178
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   630   e-178
ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago ...   627   e-177
ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutr...   625   e-176
ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phas...   624   e-176
ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A...   622   e-175

>gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus guttatus]
          Length = 606

 Score =  772 bits (1993), Expect = 0.0
 Identities = 392/530 (73%), Positives = 440/530 (83%)
 Frame = -3

Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKTG 1783
            +D SNNFFN  RN  FL PSH IFS  E L                  I +S P      
Sbjct: 101  DDWSNNFFNFSRNPFFLFPSHFIFSREENL------------------ISTSLPKH---- 138

Query: 1782 ETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRD 1603
               + V+EI+ GKR+ +VPDYSKDEFVVPEK W W   +   N +S+   + DVW KCRD
Sbjct: 139  ---EVVFEIRAGKRVELVPDYSKDEFVVPEKNWSWWLKAAKSNPSSN---LADVWMKCRD 192

Query: 1602 LTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAA 1423
            +  SL+LPEGFPESVTSDYLEYSLWRGVQG+AAQ+SGVLATQA+LYA+GLGKGAIPTAAA
Sbjct: 193  VAMSLMLPEGFPESVTSDYLEYSLWRGVQGIAAQVSGVLATQALLYAVGLGKGAIPTAAA 252

Query: 1422 VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIX 1243
            VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRL AD LENAAFG+EILTPAFPHLFVPI 
Sbjct: 253  VNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLCADFLENAAFGLEILTPAFPHLFVPIG 312

Query: 1242 XXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAV 1063
                      ALIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN V
Sbjct: 313  AVAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANGV 372

Query: 1062 QSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVND 883
            QSS PLALASF VITW+HMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSGLVPSV+EVND
Sbjct: 373  QSSIPLALASFSVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSGLVPSVKEVND 432

Query: 882  EEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFD 703
            EEPLFPAFPLLIVK TSEEQ E+LS DAK AA+ IDRRL+LGSKLSDV+K+RE+A+ALFD
Sbjct: 433  EEPLFPAFPLLIVKPTSEEQVEVLSPDAKHAASNIDRRLKLGSKLSDVVKSREEAIALFD 492

Query: 702  LYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPG 523
            LY+SE YILTE +GRYCV LKESS PQDML+SL+QV YLYWLERNAGIKS++ +DDCRPG
Sbjct: 493  LYKSEGYILTEHQGRYCVVLKESSMPQDMLKSLFQVSYLYWLERNAGIKSTTTIDDCRPG 552

Query: 522  GKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSA 373
            G+LQIS+EYV+REF H+KNDS+ AGW++DGLIARPLP+RIR+G+++ S A
Sbjct: 553  GRLQISMEYVQREFTHIKNDSQFAGWVVDGLIARPLPHRIRIGDETASPA 602


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  705 bits (1819), Expect = 0.0
 Identities = 372/536 (69%), Positives = 430/536 (80%), Gaps = 8/536 (1%)
 Frame = -3

Query: 1959 DGSNNFFNSDRNYLFLLPSHLIFSSNEE-----LRSVPYAL-LVSVAASLGCFILSSSPA 1798
            D  NNFFN D+  + LLP   IF   +      L   P  L LVS ++S+ C +L +S  
Sbjct: 81   DWWNNFFNFDK--ILLLP---IFRDEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASFV 135

Query: 1797 RAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVW--FWPWSSKDGNLTSSQMTMGD 1624
            +AKT    + VYEI+GGKR  +VPDYSKDEFV+ + +W   WP  S  G+  S+      
Sbjct: 136  QAKTNN-GEIVYEIRGGKRFELVPDYSKDEFVLTKTMWSQLWP-DSTSGSFVSN------ 187

Query: 1623 VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 1444
            +W +C++LT +L LPEGFPESVTSDYLEY+LWRGVQG+AAQISGVLATQA+LYA+GLGKG
Sbjct: 188  LWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQALLYAVGLGKG 247

Query: 1443 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1264
            AIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAFP
Sbjct: 248  AIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAFP 307

Query: 1263 HLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1084
            HLFVPI           +LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK+IGIMLG
Sbjct: 308  HLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIMLG 367

Query: 1083 IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 904
            I LAN  +SST LALASFGV+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP
Sbjct: 368  IALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 427

Query: 903  SVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRE 724
            SV+EVNDEEPLFPA  +L +K   E Q+E+LS  AK AAA I RRLQLGSKLSDV  ++E
Sbjct: 428  SVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSKLSDVATSQE 486

Query: 723  DAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSI 544
            D +ALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE NAGIKSSS+
Sbjct: 487  DVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETNAGIKSSSV 546

Query: 543  VDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSS 376
             +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLP RIRL   + SS
Sbjct: 547  ANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRLDYAAESS 602


>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  702 bits (1813), Expect = 0.0
 Identities = 374/537 (69%), Positives = 430/537 (80%), Gaps = 9/537 (1%)
 Frame = -3

Query: 1959 DGSNNFFNSD-RNYLFLLPSHLIFSSNEE-----LRSVPYAL-LVSVAASLGCFILSSSP 1801
            D  +NFFN D R  L LLP   IF + +      L   P  L LVS ++S+ C +L +S 
Sbjct: 81   DWWSNFFNFDKRRSLLLLP---IFRNEDTFIDSVLSCKPLLLFLVSASSSITCCLLLASF 137

Query: 1800 ARAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVW--FWPWSSKDGNLTSSQMTMG 1627
             +AKT    + V+EI+GGKR  +VPDYSKDEFV+ + +W    P  SK G+  S+     
Sbjct: 138  VQAKTNN-GEIVHEIRGGKRFELVPDYSKDEFVLTKTMWSRLLP-DSKSGSFVSN----- 190

Query: 1626 DVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGK 1447
             +W +C++LT +LLLPEGFP+SVTSDYLEY+LWRGVQGVAAQISGVLATQA+LYA+GLGK
Sbjct: 191  -LWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGLGK 249

Query: 1446 GAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAF 1267
            GAIPTAAAVNWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTPAF
Sbjct: 250  GAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTPAF 309

Query: 1266 PHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIML 1087
            PHLFVPI           +LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK+IGIML
Sbjct: 310  PHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGIML 369

Query: 1086 GIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLV 907
            GI LAN  +SST LALASFGV+TW+HMFCNLKSY SIQLRTLNPYRASLVFSEYLLSGLV
Sbjct: 370  GIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSGLV 429

Query: 906  PSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNR 727
            PSV+EVNDEEPLFPA  +L +K   E Q E+LS  AK AAA I RRLQLGSKLSDV  +R
Sbjct: 430  PSVKEVNDEEPLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVATSR 488

Query: 726  EDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSS 547
            ED +ALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE  AGIKSSS
Sbjct: 489  EDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKSSS 548

Query: 546  IVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSS 376
            + +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLPNRIRL   + SS
Sbjct: 549  VANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRLDYTAVSS 605


>ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590680339|ref|XP_007040835.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508778078|gb|EOY25334.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  689 bits (1779), Expect = 0.0
 Identities = 354/509 (69%), Positives = 414/509 (81%), Gaps = 4/509 (0%)
 Frame = -3

Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720
            +++   S  +  L+ +++ + CF  S  S A A+T E    DD V+E+KG K   ++PD+
Sbjct: 92   NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151

Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540
            S+D FV    +          NLT S +++  VW +CRD+   LLLPEGFP+SVTSDYL+
Sbjct: 152  SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201

Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360
            YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG
Sbjct: 202  YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261

Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180
            RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI           ALIQAATRSCF
Sbjct: 262  RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321

Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000
            +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C
Sbjct: 322  YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381

Query: 999  NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820
            NLKSYQSIQLRTLN YRASLVFSEYLLSG  PS++EVNDEEPLFPA P L +   + E+S
Sbjct: 382  NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441

Query: 819  ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640
             +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++CV LK
Sbjct: 442  VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLK 501

Query: 639  ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460
            ESS PQDML+SL+QV YLYWLERNAGI++S    DCRPGG+LQIS+EYV+REFNHVK DS
Sbjct: 502  ESSLPQDMLKSLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 561

Query: 459  ESAGWILDGLIARPLPNRIRLGNQSTSSA 373
            ES GW+ DGLIARPLPNRIR G++  S+A
Sbjct: 562  ESVGWVTDGLIARPLPNRIRPGHRDASTA 590


>ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508778081|gb|EOY25337.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 577

 Score =  659 bits (1701), Expect = 0.0
 Identities = 343/509 (67%), Positives = 402/509 (78%), Gaps = 4/509 (0%)
 Frame = -3

Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720
            +++   S  +  L+ +++ + CF  S  S A A+T E    DD V+E+KG K   ++PD+
Sbjct: 92   NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151

Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540
            S+D FV    +          NLT S +++  VW +CRD+   LLLPEGFP+SVTSDYL+
Sbjct: 152  SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201

Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360
            YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG
Sbjct: 202  YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261

Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180
            RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI           ALIQAATRSCF
Sbjct: 262  RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321

Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000
            +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C
Sbjct: 322  YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381

Query: 999  NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820
            NLKSYQSIQLRTLN YRASLVFSEYLLSG  PS++EVNDEEPLFPA P L +   + E+S
Sbjct: 382  NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441

Query: 819  ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640
             +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++C    
Sbjct: 442  VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFC---- 497

Query: 639  ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460
                      SL+QV YLYWLERNAGI++S    DCRPGG+LQIS+EYV+REFNHVK DS
Sbjct: 498  ----------SLFQVNYLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 547

Query: 459  ESAGWILDGLIARPLPNRIRLGNQSTSSA 373
            ES GW+ DGLIARPLPNRIR G++  S+A
Sbjct: 548  ESVGWVTDGLIARPLPNRIRPGHRDASTA 576


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  656 bits (1693), Expect = 0.0
 Identities = 345/527 (65%), Positives = 412/527 (78%), Gaps = 4/527 (0%)
 Frame = -3

Query: 1956 GSN---NFFNSDRNYLFLL-PSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAK 1789
            GSN    ++ ++ N LF+   S ++     E   +  A+L+ V + L  F          
Sbjct: 169  GSNWNWGWWGNEENALFIFFCSRVLHEHGSETAHMLRAVLLFVFSVLYSFFHFQLDTALS 228

Query: 1788 TGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKC 1609
              + ++ V+E++GGK   ++PD SKDEF+V       P     G   SS  T+ ++W +C
Sbjct: 229  KEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVT-----PGIGAVGAPKSS--TLPNLWLQC 281

Query: 1608 RDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTA 1429
            ++L   L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTA
Sbjct: 282  KELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTA 341

Query: 1428 AAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVP 1249
            AAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+EILTPAFPH F+ 
Sbjct: 342  AAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILTPAFPHQFLL 401

Query: 1248 IXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLAN 1069
            I           ALIQA+TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN
Sbjct: 402  IGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALAN 461

Query: 1068 AVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREV 889
             + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG VPS++EV
Sbjct: 462  CIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQVPSIKEV 521

Query: 888  NDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVAL 709
            N+EEPLFP  PLL  K T + QS +LS++AKDAAA I+RRLQLGSKLS+V+ ++ED +AL
Sbjct: 522  NEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVVSSKEDVLAL 581

Query: 708  FDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCR 529
            FDLY++EAYILTE +GR+ V LKES SPQDML+S++ V YLYWLERNAGI S    DDCR
Sbjct: 582  FDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGIISMGASDDCR 641

Query: 528  PGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 388
            PGG+LQISLEYV+REFNH+KNDSE  GW  DGLIARPLPNRIR G++
Sbjct: 642  PGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  654 bits (1687), Expect = 0.0
 Identities = 342/539 (63%), Positives = 411/539 (76%), Gaps = 9/539 (1%)
 Frame = -3

Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFI---LSSSPARA 1792
            + G+NN  N++ N       H     +++     Y+LL+ V + L CF    ++++ AR 
Sbjct: 56   SSGNNNNNNNNNNPSGSWWWHGGNGGDDDSSGSFYSLLLFVPSLLYCFCHLQVATAIART 115

Query: 1791 KTGETDD------PVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTM 1630
             T   DD       V+E+KG KR  ++PD++KD FVV         S+ + +L SS +++
Sbjct: 116  ATSSEDDGNKEYDAVWEVKGSKRTKLIPDFTKDAFVVA--------SASNASL-SSLLSV 166

Query: 1629 GDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLG 1450
              +W +CR+L    +LPEGFP+SVTSDYL YSLWR VQGVA+QISGVLATQA+LYAIGLG
Sbjct: 167  NKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQALLYAIGLG 226

Query: 1449 KGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPA 1270
            KGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+LTPA
Sbjct: 227  KGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMLTPA 286

Query: 1269 FPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIM 1090
            FPH FV I           ALIQA+TRSCF+AGFAA+RNFAEVIAKGEAQGMVSK+IGIM
Sbjct: 287  FPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGMVSKAIGIM 346

Query: 1089 LGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGL 910
            LGI LAN + SS P ALASF V+TW+HM+CNLKSYQSI+LRTLNPYRASLVFSEYLLSG 
Sbjct: 347  LGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVFSEYLLSGQ 406

Query: 909  VPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKN 730
             P V+EVNDEEPLFPAF    +K  ++ Q  +LSS+AKDAA  I+ RLQLGSKLSDV+ N
Sbjct: 407  APPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGSKLSDVVNN 466

Query: 729  REDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSS 550
            +EDA ALF LY+ E YILTE  G++CV LKES+ PQDML+SL+Q  YLYWLERNAGI ++
Sbjct: 467  KEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLERNAGIVAT 526

Query: 549  SIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSA 373
            S   DC PGG+L+ISL+YV+REFNHVK+DS S GW+ DGLIARPLPNRIR G    S A
Sbjct: 527  STSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPGYVEPSVA 585


>ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508778082|gb|EOY25338.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 573

 Score =  652 bits (1682), Expect = 0.0
 Identities = 340/509 (66%), Positives = 398/509 (78%), Gaps = 4/509 (0%)
 Frame = -3

Query: 1887 SNEELRSVPYALLVSVAASLGCFILSS-SPARAKTGET---DDPVYEIKGGKRIAVVPDY 1720
            +++   S  +  L+ +++ + CF  S  S A A+T E    DD V+E+KG K   ++PD+
Sbjct: 92   NDDSSSSHSHPFLLFLSSFVACFCPSQLSSALARTNEDSQEDDVVWEVKGSKWTKLIPDF 151

Query: 1719 SKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLE 1540
            S+D FV    +          NLT S +++  VW +CRD+   LLLPEGFP+SVTSDYL+
Sbjct: 152  SEDAFVASNGIV---------NLTKS-LSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLD 201

Query: 1539 YSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYG 1360
            YSLWRGVQGVA+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYG
Sbjct: 202  YSLWRGVQGVASQISGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYG 261

Query: 1359 RHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCF 1180
            RHFDVNPKGWRLFADLLENAAFG+E+LTPAFPHLFVPI           ALIQAATRSCF
Sbjct: 262  RHFDVNPKGWRLFADLLENAAFGLEMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCF 321

Query: 1179 FAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFC 1000
            +AGFAAQRNFAEVIAKGEAQGMVSKSIGI+LGI LAN V SST LALASFGV+TWVHM+C
Sbjct: 322  YAGFAAQRNFAEVIAKGEAQGMVSKSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYC 381

Query: 999  NLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQS 820
            NLKSYQSIQLRTLN YRASLVFSEYLLSG  PS++EVNDEEPLFPA P L +   + E+S
Sbjct: 382  NLKSYQSIQLRTLNSYRASLVFSEYLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERS 441

Query: 819  ELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALK 640
             +LSS+AK AAA I+RRLQLGSKLSD++ N+EDA+ALF LY+ E YILTE EG++CV   
Sbjct: 442  VVLSSEAKQAAADIERRLQLGSKLSDIVNNKEDALALFSLYKDEGYILTEHEGKFCVN-- 499

Query: 639  ESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDS 460
                            YLYWLERNAGI++S    DCRPGG+LQIS+EYV+REFNHVK DS
Sbjct: 500  ----------------YLYWLERNAGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDS 543

Query: 459  ESAGWILDGLIARPLPNRIRLGNQSTSSA 373
            ES GW+ DGLIARPLPNRIR G++  S+A
Sbjct: 544  ESVGWVTDGLIARPLPNRIRPGHRDASTA 572


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  652 bits (1681), Expect = 0.0
 Identities = 337/489 (68%), Positives = 395/489 (80%)
 Frame = -3

Query: 1857 ALLVSVAASLGCFILSSSPARAKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFW 1678
            A+L+ V + L  F            + ++ V+E++GGK   ++PD SKDEF+V       
Sbjct: 4    AVLLFVFSVLYSFFHFQLDTALSKEKEEEGVWEVRGGKWHKIIPDSSKDEFLVVT----- 58

Query: 1677 PWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQI 1498
            P     G   SS  T+ ++W +C++L   L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QI
Sbjct: 59   PGIGAVGAPKSS--TLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQI 116

Query: 1497 SGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFA 1318
            SGVLATQA+LYA+GLGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFA
Sbjct: 117  SGVLATQALLYAVGLGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFA 176

Query: 1317 DLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVI 1138
            DLLENAA+G+EILTPAFPH F+ I           ALIQA+TRSCF+AGFAAQRNFAEVI
Sbjct: 177  DLLENAAYGLEILTPAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVI 236

Query: 1137 AKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLN 958
            AKGEAQGMVSKSIGIMLGI LAN + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLN
Sbjct: 237  AKGEAQGMVSKSIGIMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLN 296

Query: 957  PYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYI 778
            PYRASLVFSEYLLSG VPS++EVN+EEPLFP  PLL  K T + QS +LS++AKDAAA I
Sbjct: 297  PYRASLVFSEYLLSGQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEI 356

Query: 777  DRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQ 598
            +RRLQLGSKLS+V+ ++ED +ALFDLY++EAYILTE +GR+ V LKES SPQDML+S++ 
Sbjct: 357  ERRLQLGSKLSEVVSSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFH 416

Query: 597  VCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARP 418
            V YLYWLERNAGI S    DDCRPGG+LQISLEYV+REFNH+KNDSE  GW  DGLIARP
Sbjct: 417  VNYLYWLERNAGIISMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARP 476

Query: 417  LPNRIRLGN 391
            LPNRIR G+
Sbjct: 477  LPNRIRPGH 485


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  640 bits (1650), Expect = 0.0
 Identities = 344/540 (63%), Positives = 408/540 (75%), Gaps = 7/540 (1%)
 Frame = -3

Query: 1965 KNDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKT 1786
            +  GSNN  N++ N     P    ++ N +     +  L+    +L     SS+ AR   
Sbjct: 9    RGSGSNNNNNNNNNNNPFDPWWW-WNENNKNNCDYFVWLLCCFVALWLQSASSAFARTTL 67

Query: 1785 GE-----TDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMG-D 1624
             E      +D V+ +KG KRI ++PD+ KDEF+V   +     SS D  ++SS +  G  
Sbjct: 68   KEKEEEGAEDSVWVVKGSKRIRLIPDFIKDEFLVNPSLP----SSYDDIISSSWLHFGRT 123

Query: 1623 VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 1444
            +W +CR L   L+LPEG+P SVTSDYL+YSLWRGVQGVA+QISGVLATQA+LYAIGLGKG
Sbjct: 124  LWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKG 183

Query: 1443 AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 1264
            AIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+EILTPAFP
Sbjct: 184  AIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFP 243

Query: 1263 HLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 1084
            HLFV I           ALIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK IGIMLG
Sbjct: 244  HLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLG 303

Query: 1083 IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 904
            I LAN + SS PLALASF V+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P
Sbjct: 304  IGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 363

Query: 903  SVREVNDEEPLFPA-FPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNR 727
             +++VNDEEPLFPA FP    K   +    +LS +A+DAA  I+RRLQLGSKLSDV+ ++
Sbjct: 364  PIKDVNDEEPLFPAVFPHF--KSADKPSLVVLSLEARDAATEIERRLQLGSKLSDVVNSK 421

Query: 726  EDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSS 547
            ED +ALF+LY+ E YILTE +GR+CV LKES S QDML++L+QV YLYWLERNAG+ +  
Sbjct: 422  EDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERNAGLDARG 481

Query: 546  IVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAES 367
               DCR GG+LQ+SLEY++REF+HV+NDS S GW+ DGLIARPLPNRI  G+   SS  S
Sbjct: 482  TSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGDLVASSIVS 541


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  637 bits (1642), Expect = e-180
 Identities = 325/515 (63%), Positives = 394/515 (76%), Gaps = 14/515 (2%)
 Frame = -3

Query: 1893 FSSNEELRSVPYALLVSVAAS----------LGCFILSSSPARAKTGETDD----PVYEI 1756
            F S++   +  Y L +S+  S          L  F ++ +P+   +   ++    P++E+
Sbjct: 79   FDSDDSSSNSRYTLFLSLLCSSVICYFFQLLLAKFAMARTPSSCSSSIENEILKQPIWEV 138

Query: 1755 KGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPE 1576
            KGG  I + PD+ KD F+     +F   SS    L  SQ+    ++TKC++ T  L+LPE
Sbjct: 139  KGGNFIKLFPDHLKDIFIASNPTFFSELSS----LNVSQVP-SFLYTKCKEFTVRLMLPE 193

Query: 1575 GFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGI 1396
            GFP SVTSDYLEYSLWRGVQGVA Q+SGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGI
Sbjct: 194  GFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGI 253

Query: 1395 GYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXX 1216
            GYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFPHLFVPI          
Sbjct: 254  GYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAVAGASRSA 313

Query: 1215 XALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALA 1036
             +LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SSTPL LA
Sbjct: 314  ASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALGIGLGNCIGSSTPLVLA 373

Query: 1035 SFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFP 856
            SF V+TWVHM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P V+EVNDEEPLFPA P
Sbjct: 374  SFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEVNDEEPLFPALP 433

Query: 855  LLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYIL 676
            +L     ++ QS +LSS+AKDAA  I+ RLQLGSKLS+++ N+E+ +ALF LY++E YIL
Sbjct: 434  ILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKEEVLALFSLYKNEGYIL 493

Query: 675  TELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEY 496
            +E  G++CV LKE+ S  DML++L+QV YLYWLE+NAGI+    + DC+PGG+L+ISLEY
Sbjct: 494  SEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGALYDCKPGGRLRISLEY 553

Query: 495  VKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 391
             +REFNH +ND ESAGWI DGLIARPLPNRIR GN
Sbjct: 554  AEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  635 bits (1639), Expect = e-179
 Identities = 326/492 (66%), Positives = 387/492 (78%), Gaps = 10/492 (2%)
 Frame = -3

Query: 1830 LGCFI-----LSSSPARAKTGETDDP-----VYEIKGGKRIAVVPDYSKDEFVVPEKVWF 1681
            L CF       +S+ A+A+  ++DD      V+E++G KR  +VPD+ KDEFV  E  + 
Sbjct: 167  LSCFFHFRLSAASAVAKAENSDSDDSTEKETVWEVRGSKRKRLVPDFVKDEFVSEEAAFE 226

Query: 1680 WPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQ 1501
                       SS +T  ++  +CR L    LLPEG+P SVTSDYL+YSLWRGVQG+A+Q
Sbjct: 227  ----------LSSSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQ 276

Query: 1500 ISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLF 1321
            ISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLF
Sbjct: 277  ISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLF 336

Query: 1320 ADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEV 1141
            ADLLENAAFGME+LTP FP  FV I           ALIQAATRSCF AGFA+QRNFAEV
Sbjct: 337  ADLLENAAFGMEMLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEV 396

Query: 1140 IAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTL 961
            IAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTL
Sbjct: 397  IAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTL 456

Query: 960  NPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAY 781
            NPYRASLVFSEYL+SG  P ++EVNDEEPLFPA   L +K   + Q  +LSS+AK AAA 
Sbjct: 457  NPYRASLVFSEYLISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAAD 516

Query: 780  IDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLY 601
            I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE  GR+CV LKESSSPQDMLRSL+
Sbjct: 517  IEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLF 576

Query: 600  QVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIAR 421
            QV YLYWLE+NAGI+ +S   DC+PGG+L ISL+YV+REF H K DSES GW+ +GLIAR
Sbjct: 577  QVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIAR 636

Query: 420  PLPNRIRLGNQS 385
            PLP RIRLG  S
Sbjct: 637  PLPTRIRLGYDS 648


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  632 bits (1630), Expect = e-178
 Identities = 330/539 (61%), Positives = 404/539 (74%), Gaps = 9/539 (1%)
 Frame = -3

Query: 1953 SNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAASLGCFILSSSPARAKT---- 1786
            +NN  N++    +  P     S++    ++  +LL S A    C +L +  A+AKT    
Sbjct: 59   NNNNNNNNNGGSWGNPFDSSDSNSNSHHTLFLSLLCSSALCFFCHLLHAKLAKAKTLSPS 118

Query: 1785 --GETD---DPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDV 1621
               +T    +PVYE+KGGK   +VPD + D FV  ++ +    SS    + S   T   V
Sbjct: 119  TTADTSLFSEPVYEVKGGKWTKLVPDLTNDVFVSAQQGFLSELSSL--KVPSQLATF--V 174

Query: 1620 WTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGA 1441
            W KC D+   L+LPEGFPESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKGA
Sbjct: 175  WLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGA 234

Query: 1440 IPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPH 1261
            IPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAAFG+E+ TPAFP 
Sbjct: 235  IPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFPQ 294

Query: 1260 LFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI 1081
             FV I           +LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI
Sbjct: 295  FFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGI 354

Query: 1080 VLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPS 901
             L N + SSTPL LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P 
Sbjct: 355  GLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPP 414

Query: 900  VREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRED 721
            V+EVNDEEPLFPA P+L     ++ QS +LSS+AKDAAA I+ RLQLGSKLS+++ ++ED
Sbjct: 415  VKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLGSKLSEIVNSKED 474

Query: 720  AVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIV 541
             +ALF LY++E YIL+E  G++CV LKE+ S QDML++L+QV YLYWLE+NAGI     +
Sbjct: 475  VLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGTL 534

Query: 540  DDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAEST 364
            +D +PGG+L ISL+YV+REFNHVKND E  GW+ DGLIARPLPNRIR+G+   S++ S+
Sbjct: 535  NDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRIGDTPPSNSVSS 593


>ref|NP_190175.2| proteinROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  632 bits (1630), Expect = e-178
 Identities = 329/509 (64%), Positives = 395/509 (77%), Gaps = 10/509 (1%)
 Frame = -3

Query: 1887 SNEELRSVPYALLVSVAASLGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRI 1738
            S+ +LR + + LL      L CF    LS++ A AK   +D       + V+E++G KR 
Sbjct: 103  SSFDLRYLCFLLL-----GLSCFFHFRLSAASAIAKDQNSDSNGDAVKETVWEVRGSKRK 157

Query: 1737 AVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESV 1558
             +VPD+ KDEFV  E  +            SS +T  ++  +CR+L    LLPEGFP SV
Sbjct: 158  RLVPDFVKDEFVSEESAFE----------LSSSLTPENLLAQCRNLLTQFLLPEGFPNSV 207

Query: 1557 TSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKI 1378
            TSDYL+YSLWRGVQG+A+QISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI
Sbjct: 208  TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 267

Query: 1377 MLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQA 1198
            MLSKYGRHFDV+PKGWRLFADLLENAAFGME+LTP FP  FV I           ALIQA
Sbjct: 268  MLSKYGRHFDVHPKGWRLFADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQA 327

Query: 1197 ATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVIT 1018
            ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T
Sbjct: 328  ATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSVGILLGIVVANCIGTSTSLALAAFGVVT 387

Query: 1017 WVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKR 838
             +HM+ NLKSYQ IQLRTLNPYRASLVFSEYL+SG  P ++EVNDEEPLFP      +K 
Sbjct: 388  TIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFSNMKS 447

Query: 837  TSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGR 658
              + Q  +LSS+AK AAA I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE +GR
Sbjct: 448  PEKLQDFVLSSEAKAAAADIEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHKGR 507

Query: 657  YCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFN 478
            +CV LKESS+PQDMLRSL+QV YLYWLE+NAGI+ +S   DC+PGG+L ISL+YV+REF 
Sbjct: 508  FCVMLKESSTPQDMLRSLFQVNYLYWLEKNAGIEPASTYSDCKPGGRLHISLDYVRREFE 567

Query: 477  HVKNDSESAGWILDGLIARPLPNRIRLGN 391
            H K DSES GW+ +GLIARPLP RIRLG+
Sbjct: 568  HAKEDSESVGWVTEGLIARPLPTRIRLGH 596


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  631 bits (1627), Expect = e-178
 Identities = 322/467 (68%), Positives = 375/467 (80%)
 Frame = -3

Query: 1794 AKTGETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWT 1615
            A+       V+E+KGGK I +VP+   D FVV      +P +S    ++   + +     
Sbjct: 112  ARAQSLSSSVWEVKGGKWILLVPNDLDDTFVVDS---LFPSTSSTRPVSPLNLWL----E 164

Query: 1614 KCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIP 1435
            KCR L   L+LPEG+PESVTSDYL+YSLWR VQGVA+QIS VLATQ++LYA+GLGKGAIP
Sbjct: 165  KCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLATQSLLYAVGLGKGAIP 224

Query: 1434 TAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLF 1255
            TAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG E+LTPAFPHLF
Sbjct: 225  TAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGFEMLTPAFPHLF 284

Query: 1254 VPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVL 1075
            VPI            LIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI +GI L
Sbjct: 285  VPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIAMGIGL 344

Query: 1074 ANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVR 895
            AN + +STPLALASF V+T++HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P ++
Sbjct: 345  ANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPIK 404

Query: 894  EVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAV 715
            EVNDE+PLFPA P+L VK  ++EQ  +LS++AK AAA ID RL LGSKLSDV+ N +D +
Sbjct: 405  EVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLLLGSKLSDVVNNHKDVL 464

Query: 714  ALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDD 535
            ALFDLY++E YILTE  GR+CV LKE+ SP DML++++ V YLYWLE+NAGI  +S   D
Sbjct: 465  ALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLYWLEKNAGIDGASPYLD 524

Query: 534  CRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLG 394
             +PGG+LQISL+YV+REFNHVK D ESAGW  DGLIARPLPNRIR G
Sbjct: 525  SKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRIRPG 571


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  630 bits (1625), Expect = e-178
 Identities = 322/490 (65%), Positives = 386/490 (78%), Gaps = 10/490 (2%)
 Frame = -3

Query: 1830 LGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRIAVVPDYSKDEFVVPEKVWF 1681
            L CF    LS++ A AK  ++D       + V+E++G KR  +VPD+ KDEFV  E  + 
Sbjct: 123  LSCFFHFRLSAASAIAKASDSDSSGDTDKETVWEVRGSKRKRLVPDFVKDEFVSEESAFE 182

Query: 1680 WPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQ 1501
                       SS +T  ++  +CR+L    LLPEGFP SVTSDYL+YSLWRGVQG+A+Q
Sbjct: 183  ----------LSSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQ 232

Query: 1500 ISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLF 1321
            +SGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLF
Sbjct: 233  VSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLF 292

Query: 1320 ADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEV 1141
            ADLLENAAFGME+LTP FP  FV I           ALIQAATRSCF AGFA+QRNFAEV
Sbjct: 293  ADLLENAAFGMEMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEV 352

Query: 1140 IAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTL 961
            IAKGEAQGMVSKS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTL
Sbjct: 353  IAKGEAQGMVSKSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTL 412

Query: 960  NPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAY 781
            NPYRASLVFSEYL+SG  P ++EVNDEEPLFP    L +K   + Q  +LSS+AK AA  
Sbjct: 413  NPYRASLVFSEYLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAED 472

Query: 780  IDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLY 601
            I+ RLQLGSKLSDV+ N+E+A+ALFDLY++E YILTE  GR+CV LKESS+PQDMLRSL+
Sbjct: 473  IEERLQLGSKLSDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLF 532

Query: 600  QVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIAR 421
            QV YLYWLE+NAGI+ +S   DC+PGG+L ISL+YV+REF H K DS+S GW+ +GLIAR
Sbjct: 533  QVNYLYWLEKNAGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIAR 592

Query: 420  PLPNRIRLGN 391
            PLP RIRLG+
Sbjct: 593  PLPTRIRLGH 602


>ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
            gi|355513788|gb|AES95411.1| hypothetical protein
            MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  627 bits (1618), Expect = e-177
 Identities = 325/497 (65%), Positives = 385/497 (77%), Gaps = 7/497 (1%)
 Frame = -3

Query: 1860 YALLVSVAASLGCFIL---SSSPARAKTGETD---DPVYEIKGGKRIAVVPDYSKDEFVV 1699
            Y LL ++  S   F L   + +  R+ + E D    P+YE+KGG  I + PD  KD F+ 
Sbjct: 87   YTLLFTLLFSSVTFCLCQLAMAKTRSLSSEDDILTQPIYEVKGGNLIKLFPDNLKDIFIA 146

Query: 1698 PEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGV 1519
                 F   SS    L SSQ+    ++ KCR+    L+LPEGFP SVTSDYLEYSLWRGV
Sbjct: 147  SNPGLFSELSS----LNSSQVPTF-LYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGV 201

Query: 1518 QGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNP 1339
            QGVA Q+SGVLATQA+LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDVNP
Sbjct: 202  QGVACQVSGVLATQALLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNP 261

Query: 1338 KGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQAATRSCFFAGFAAQ 1159
            KGWRLFADLLENAAFG+E+ TPAFPHLFVPI           +LIQA+TRSCFFAGFAAQ
Sbjct: 262  KGWRLFADLLENAAFGLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQ 321

Query: 1158 RNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQS 979
            RNFAEVIAKGE QGMVS+ IGI +GI L N + SSTPL LASF V+TWVHM+CNLKSYQS
Sbjct: 322  RNFAEVIAKGEVQGMVSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQS 381

Query: 978  IQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEE-QSELLSSD 802
            IQLRTLNP+RASLVFSEYLLSG  P V+EVN EEPLFPA P+L     ++E QS +LSS+
Sbjct: 382  IQLRTLNPHRASLVFSEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSE 441

Query: 801  AKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGRYCVALKESSSPQ 622
            AKDAA  I+ RLQLGSKLS+++ N+E+ +ALF LY++E YIL+E  G++CV LKE+ S  
Sbjct: 442  AKDAAVEIESRLQLGSKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKETCSQL 501

Query: 621  DMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWI 442
            DML++L+QV YLYWLE+NAGI+    + DC+PGG+LQISLEY +REFNHV+ND ES GWI
Sbjct: 502  DMLKALFQVNYLYWLEKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGESVGWI 561

Query: 441  LDGLIARPLPNRIRLGN 391
             DGLIARPLPNR R GN
Sbjct: 562  TDGLIARPLPNRCRPGN 578


>ref|XP_006418986.1| hypothetical protein EUTSA_v10002446mg [Eutrema salsugineum]
            gi|557096914|gb|ESQ37422.1| hypothetical protein
            EUTSA_v10002446mg [Eutrema salsugineum]
          Length = 611

 Score =  625 bits (1611), Expect = e-176
 Identities = 332/519 (63%), Positives = 396/519 (76%), Gaps = 12/519 (2%)
 Frame = -3

Query: 1884 NEELRSVPYALLVSVAASLGCFI---LSSSPARAKTGETD-------DPVYEIKGGKRIA 1735
            N +  S P   L  +     CF    LS++ A AK  E+D       + V+E++G KR  
Sbjct: 104  NSDGSSSPLRFLCFLFLVYSCFFQLRLSAAIAIAKAPESDSNGDTEKETVWEVRGSKRKR 163

Query: 1734 VVPDYSKDEFVV-PEKVWFWPWSSKDGNLTSSQMTMGDVWTKCRDLTASLLLPEGFPESV 1558
            +VPD+ +DEF V PE+             TSS +T  ++  +CR+L    LLPEGFP SV
Sbjct: 164  LVPDFVRDEFFVSPEET------------TSSPLTPENLLAQCRNLLTQFLLPEGFPNSV 211

Query: 1557 TSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKI 1378
            TSDYL+YSLWRGVQG+A+QISGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI
Sbjct: 212  TSDYLDYSLWRGVQGIASQISGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYLSKI 271

Query: 1377 MLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXALIQA 1198
            MLSKYGRHFDV+PKGWRLFADLLEN+AFGME+LTP FP  FV I           ALIQA
Sbjct: 272  MLSKYGRHFDVHPKGWRLFADLLENSAFGMEMLTPLFPQFFVLIGAAAGAGRSAAALIQA 331

Query: 1197 ATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFGVIT 1018
            ATRSCF AGFA+QRNFAEVIAKGEAQGMVSKSIGI+LGIV+AN + +ST LALASFGV+T
Sbjct: 332  ATRSCFNAGFASQRNFAEVIAKGEAQGMVSKSIGILLGIVVANCIGTSTSLALASFGVVT 391

Query: 1017 WVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKR 838
             +HM+ NLKSYQ IQLRTLNPYRASLVFSEYL+SG  P ++EVNDEEPLFP    L +K 
Sbjct: 392  SIHMYTNLKSYQCIQLRTLNPYRASLVFSEYLISGQAPPIKEVNDEEPLFPTVRSLNIKS 451

Query: 837  TSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTELEGR 658
              + Q  +LSS+AK AAA I+ RLQLGSKLSDV+ N+E+AVALFDLY+ E YILTE  GR
Sbjct: 452  AEKRQDFVLSSEAKAAAADIEERLQLGSKLSDVVHNKEEAVALFDLYRDEGYILTEHRGR 511

Query: 657  YCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFN 478
            +CV LKESSSPQDMLRSL+QV YLYWLE+NAGI++S+   DC+PGG+L ISL+YV+REF 
Sbjct: 512  FCVMLKESSSPQDMLRSLFQVNYLYWLEKNAGIEASNTYLDCKPGGRLHISLDYVRREFE 571

Query: 477  HVKNDSESAGWILDGLIARPLPNRIRLG-NQSTSSAEST 364
              K DSE  GW+ +GLIARPL  RIRL  ++  SS+ S+
Sbjct: 572  LAKEDSELVGWVTEGLIARPLSTRIRLDYDREPSSSPSS 610


>ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
            gi|561031470|gb|ESW30049.1| hypothetical protein
            PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  624 bits (1610), Expect = e-176
 Identities = 325/521 (62%), Positives = 390/521 (74%), Gaps = 12/521 (2%)
 Frame = -3

Query: 1890 SSNEELRSVPYALLVSVAASLGCFILSSSPARAKTGETD-------DPVYEIKGGKRIAV 1732
            S++   R +  +LL S A      +L    A AKT  +        +PV+E+KGGK   +
Sbjct: 82   SNSNSHRILFLSLLCSSAVCFFGHLLLVKLANAKTWSSSSDNELLSEPVWEVKGGKWTRL 141

Query: 1731 VPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGD-----VWTKCRDLTASLLLPEGFP 1567
            VPD + D FV          S+  G L   Q          VW KCRD+   L+LPEGFP
Sbjct: 142  VPDPTNDVFV----------SAHPGLLAELQSLKPSQFATFVWLKCRDIFTRLMLPEGFP 191

Query: 1566 ESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYL 1387
            ESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYL
Sbjct: 192  ESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKGAIPTAAAINWVLKDGIGYL 251

Query: 1386 SKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXXXXXXXAL 1207
            SKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFP  FV I           +L
Sbjct: 252  SKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFPQFFVLIGAVAGASRSAASL 311

Query: 1206 IQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTPLALASFG 1027
            IQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LGI L N + SSTPL LASF 
Sbjct: 312  IQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLGIGLGNCIGSSTPLVLASFI 371

Query: 1026 VITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLFPAFPLLI 847
            V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P V++VNDEEPLFPA P+L 
Sbjct: 372  VLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKDVNDEEPLFPAVPILN 431

Query: 846  VKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALFDLYQSEAYILTEL 667
                ++ +S  LSS+AKDAAA I+RRLQLGSKLS+++  +ED +ALF LY+ E YIL+E 
Sbjct: 432  ATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVNGKEDVLALFRLYKKEGYILSEH 491

Query: 666  EGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQISLEYVKR 487
             G++CV LKE+ S QDML++L+QV YLYWLE+NAGI     ++D RPGG+L  SL+YV+R
Sbjct: 492  MGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGTLNDSRPGGRLHTSLDYVER 551

Query: 486  EFNHVKNDSESAGWILDGLIARPLPNRIRLGNQSTSSAEST 364
            EFNH+KND ES GW+ DGLIARPLPNRIR+G+ ++S++ S+
Sbjct: 552  EFNHLKNDGESVGWVTDGLIARPLPNRIRIGDTTSSNSVSS 592


>ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda]
            gi|548831916|gb|ERM94718.1| hypothetical protein
            AMTR_s00011p00244680 [Amborella trichopoda]
          Length = 565

 Score =  622 bits (1605), Expect = e-175
 Identities = 329/523 (62%), Positives = 387/523 (73%), Gaps = 1/523 (0%)
 Frame = -3

Query: 1962 NDGSNNFFNSDRNYLFLLPSHLIFSSNEELRSVPYALLVSVAA-SLGCFILSSSPARAKT 1786
            N+ +NN  N++ NY            N  + +  + LL+S +      F L+S P     
Sbjct: 54   NNNNNNGSNNNNNY-----GDSWSDDNNGIPNTSFCLLLSFSLFPNNLFSLASKPGEVVA 108

Query: 1785 GETDDPVYEIKGGKRIAVVPDYSKDEFVVPEKVWFWPWSSKDGNLTSSQMTMGDVWTKCR 1606
                   +E+KGGK   V  D SKDE      +         G L   ++ +G  W  CR
Sbjct: 109  -------WEVKGGKWSPVYADSSKDELFADNALRLL----SSGVLDLGKI-LGSSWLWCR 156

Query: 1605 DLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAA 1426
            +L   L+LPEG+P SV+SDYLEYSLWR VQGVA+QI+GVL TQA+LYA+GLGKGAIPTAA
Sbjct: 157  ELAVRLMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGLGKGAIPTAA 216

Query: 1425 AVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPI 1246
            AVNWVLKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTPA+P  FV I
Sbjct: 217  AVNWVLKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTPAYPQFFVLI 276

Query: 1245 XXXXXXXXXXXALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANA 1066
                       ALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGI LAN 
Sbjct: 277  GAAAGAGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIALANH 336

Query: 1065 VQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVN 886
            + +S PLA ASFGV+T VHMFCNLKSYQSIQLRTLNPYR SLVFSEYLLSG VP V+EVN
Sbjct: 337  IGASGPLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSGEVPPVKEVN 396

Query: 885  DEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDAVALF 706
            DEEPLF     L V      QS++LS++AK+AAA I+ RLQLG KLSDV+  +ED +ALF
Sbjct: 397  DEEPLFSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVSKKEDVLALF 456

Query: 705  DLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRP 526
            DL++ E YILTE +G+YCV LKE  SPQDML+SL+QV YLYWLERNAGI S S   DC+P
Sbjct: 457  DLFEKEGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDSRSASTDCKP 516

Query: 525  GGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 397
            GGK+Q+S +YV+REFNHVKNDS++AGWI DGLIARPLP R+R+
Sbjct: 517  GGKMQLSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRVRV 559

