BLASTX nr result

ID: Mentha25_contig00004104 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00004104
         (1577 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus...   700   0.0  
ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma...   659   0.0  
ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   649   0.0  
ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog ...   648   0.0  
ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257...   633   e-179
emb|CBI21809.3| unnamed protein product [Vitis vinifera]              632   e-178
ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma...   629   e-177
ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Ci...   622   e-175
ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma...   621   e-175
gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]     618   e-174
ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Popu...   615   e-173
ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510...   614   e-173
ref|XP_002519954.1| conserved hypothetical protein [Ricinus comm...   610   e-172
ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Caps...   609   e-172
ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [A...   609   e-171
ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thali...   607   e-171
ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago ...   605   e-170
ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arab...   604   e-170
ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phas...   603   e-170
ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778...   603   e-170

>gb|EYU37499.1| hypothetical protein MIMGU_mgv1a003124mg [Mimulus guttatus]
          Length = 606

 Score =  700 bits (1807), Expect = 0.0
 Identities = 343/415 (82%), Positives = 380/415 (91%)
 Frame = +3

Query: 24   MGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGL 203
            + DVW KCRD+  SL+LPEGFPESVTSDYLEYSLWRGVQG+AAQ+SGVLATQA+LYA+GL
Sbjct: 183  LADVWMKCRDVAMSLMLPEGFPESVTSDYLEYSLWRGVQGIAAQVSGVLATQALLYAVGL 242

Query: 204  GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 383
            GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRL AD LENAAFG+EILTP
Sbjct: 243  GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLCADFLENAAFGLEILTP 302

Query: 384  AFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI 563
            AFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIGI
Sbjct: 303  AFPHLFVPIGAVAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIGI 362

Query: 564  MLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 743
            MLGI LAN VQSS PLALASF VITW+HMFCNLKSYQSIQLRTLNPYRASLVFS+YLLSG
Sbjct: 363  MLGIALANGVQSSIPLALASFSVITWIHMFCNLKSYQSIQLRTLNPYRASLVFSQYLLSG 422

Query: 744  LVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMK 923
            LVPSV+EVNDEEPLFPAFPLLIVK TSEEQ E+LS DAK AA+ IDRRL+LGSKLSDV+K
Sbjct: 423  LVPSVKEVNDEEPLFPAFPLLIVKPTSEEQVEVLSPDAKHAASNIDRRLKLGSKLSDVVK 482

Query: 924  NREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKS 1103
            +RE+A+ALFDLY+SE YILTE +GRYCV LKESS PQDML+SL+QV YLYWLERNAGIKS
Sbjct: 483  SREEAIALFDLYKSEGYILTEHQGRYCVVLKESSMPQDMLKSLFQVSYLYWLERNAGIKS 542

Query: 1104 SSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 1268
            ++ +DDCRPGG+LQIS+EYV+REF H+KNDS+ AGW++DGLIARPLP+RIR+G++
Sbjct: 543  TTTIDDCRPGGRLQISMEYVQREFTHIKNDSQFAGWVVDGLIARPLPHRIRIGDE 597


>ref|XP_007040833.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590680339|ref|XP_007040835.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508778078|gb|EOY25334.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778080|gb|EOY25336.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 591

 Score =  659 bits (1699), Expect = 0.0
 Identities = 325/420 (77%), Positives = 367/420 (87%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            +  +++  VW +CRD+   LLLPEGFP+SVTSDYL+YSLWRGVQGVA+QISGVLATQA+L
Sbjct: 166  TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALL 225

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+
Sbjct: 226  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGL 285

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTPAFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 286  EMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 345

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KSIGI+LGI LAN V SST LALASFGV+TWVHM+CNLKSYQSIQLRTLN YRASLVFSE
Sbjct: 346  KSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSE 405

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YLLSG  PS++EVNDEEPLFPA P L +   + E+S +LSS+AK AAA I+RRLQLGSKL
Sbjct: 406  YLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKL 465

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SD++ N+EDALALF LY+ E YILTE EG++CV LKESS PQDML+SL+QV YLYWLERN
Sbjct: 466  SDIVNNKEDALALFSLYKDEGYILTEHEGKFCVVLKESSLPQDMLKSLFQVNYLYWLERN 525

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 1268
            AGI++S    DCRPGG+LQIS+EYV+REFNHVK DSES GW+ DGLIARPLPNRIR G++
Sbjct: 526  AGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHR 585


>ref|XP_006361229.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum]
          Length = 609

 Score =  649 bits (1675), Expect = 0.0
 Identities = 324/412 (78%), Positives = 361/412 (87%)
 Frame = +3

Query: 24   MGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGL 203
            + ++W +C++LT +LLLPEGFP+SVTSDYLEY+LWRGVQGVAAQISGVLATQA+LYA+GL
Sbjct: 188  VSNLWMQCKELTTTLLLPEGFPDSVTSDYLEYALWRGVQGVAAQISGVLATQALLYAVGL 247

Query: 204  GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 383
            GKGAIPTAAAVNWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G+EILTP
Sbjct: 248  GKGAIPTAAAVNWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYGLEILTP 307

Query: 384  AFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI 563
            AFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK+IGI
Sbjct: 308  AFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKAIGI 367

Query: 564  MLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 743
            MLGI LAN  +SST LALASFGV+TW+HMFCNLKSY SIQLRTLNPYRASLVFSEYLLSG
Sbjct: 368  MLGIALANCTRSSTSLALASFGVVTWIHMFCNLKSYHSIQLRTLNPYRASLVFSEYLLSG 427

Query: 744  LVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMK 923
            LVPSV+EVNDEEPLFPA  +L +K   E Q E+LS  AK AAA I RRLQLGSKLSDV  
Sbjct: 428  LVPSVKEVNDEEPLFPA-AILNLKAAYETQMEVLSVHAKQAAAGIVRRLQLGSKLSDVAT 486

Query: 924  NREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKS 1103
            +RED LALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE  AGIKS
Sbjct: 487  SREDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLETKAGIKS 546

Query: 1104 SSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 1259
            SS+ +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLPNRIRL
Sbjct: 547  SSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPNRIRL 598


>ref|XP_004244433.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum lycopersicum]
          Length = 606

 Score =  648 bits (1671), Expect = 0.0
 Identities = 323/418 (77%), Positives = 365/418 (87%)
 Frame = +3

Query: 6    TSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAM 185
            ++S   + ++W +C++LT +L LPEGFPESVTSDYLEY+LWRGVQG+AAQISGVLATQA+
Sbjct: 179  STSGSFVSNLWMQCKELTTTLFLPEGFPESVTSDYLEYALWRGVQGIAAQISGVLATQAL 238

Query: 186  LYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG 365
            LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS YGRHFDVNPK WRLFADLLENAA+G
Sbjct: 239  LYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSNYGRHFDVNPKSWRLFADLLENAAYG 298

Query: 366  MEILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMV 545
            +EILTPAFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMV
Sbjct: 299  LEILTPAFPHLFVPIGAVAGAGRSAASLIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMV 358

Query: 546  SKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFS 725
            SK+IGIMLGI LAN  +SST LALASFGV+TW+HMFCNLKSYQSIQLRTLNPYRASLVFS
Sbjct: 359  SKAIGIMLGIALANYTRSSTSLALASFGVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFS 418

Query: 726  EYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSK 905
            EYLLSGLVPSV+EVNDEEPLFPA  +L +K   E Q+E+LS  AK AAA I RRLQLGSK
Sbjct: 419  EYLLSGLVPSVKEVNDEEPLFPA-AILNLKAAYETQTEVLSVHAKQAAAGIVRRLQLGSK 477

Query: 906  LSDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLER 1085
            LSDV  ++ED LALF+LY++E YILTE EGR+C+ LKESSSPQDML+SL+ V YLYWLE 
Sbjct: 478  LSDVATSQEDVLALFELYKNEGYILTEHEGRFCIVLKESSSPQDMLKSLFHVNYLYWLET 537

Query: 1086 NAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 1259
            NAGIKSSS+ +DCRPGG+LQ+SLEYV+REFNHVK D E AGW+ D LIARPLP RIRL
Sbjct: 538  NAGIKSSSVANDCRPGGRLQMSLEYVEREFNHVKTDGEVAGWVTDSLIARPLPVRIRL 595


>ref|XP_002269838.1| PREDICTED: uncharacterized protein LOC100257731 [Vitis vinifera]
          Length = 713

 Score =  633 bits (1632), Expect = e-179
 Identities = 315/416 (75%), Positives = 361/416 (86%)
 Frame = +3

Query: 21   TMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIG 200
            T+ ++W +C++L   L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQA+LYA+G
Sbjct: 273  TLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVG 332

Query: 201  LGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILT 380
            LGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+EILT
Sbjct: 333  LGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILT 392

Query: 381  PAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIG 560
            PAFPH F+ I            LIQA+TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIG
Sbjct: 393  PAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG 452

Query: 561  IMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS 740
            IMLGI LAN + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS
Sbjct: 453  IMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS 512

Query: 741  GLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVM 920
            G VPS++EVN+EEPLFP  PLL  K T + QS +LS++AKDAAA I+RRLQLGSKLS+V+
Sbjct: 513  GQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV 572

Query: 921  KNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIK 1100
             ++ED LALFDLY++EAYILTE +GR+ V LKES SPQDML+S++ V YLYWLERNAGI 
Sbjct: 573  SSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGII 632

Query: 1101 SSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 1268
            S    DDCRPGG+LQISLEYV+REFNH+KNDSE  GW  DGLIARPLPNRIR G++
Sbjct: 633  SMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGHK 688


>emb|CBI21809.3| unnamed protein product [Vitis vinifera]
          Length = 537

 Score =  632 bits (1631), Expect = e-178
 Identities = 315/415 (75%), Positives = 360/415 (86%)
 Frame = +3

Query: 21   TMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIG 200
            T+ ++W +C++L   L+LPEGFP SVTSDYL+Y+LWRGVQGVA+QISGVLATQA+LYA+G
Sbjct: 71   TLPNLWLQCKELFLRLMLPEGFPHSVTSDYLDYTLWRGVQGVASQISGVLATQALLYAVG 130

Query: 201  LGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILT 380
            LGKGAIPTAAAVNWVLKDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAA+G+EILT
Sbjct: 131  LGKGAIPTAAAVNWVLKDGIGYLSKILLSKYGRHFDVHPKGWRLFADLLENAAYGLEILT 190

Query: 381  PAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIG 560
            PAFPH F+ I            LIQA+TRSCF+AGFAAQRNFAEVIAKGEAQGMVSKSIG
Sbjct: 191  PAFPHQFLLIGAVAGAGRSAAALIQASTRSCFYAGFAAQRNFAEVIAKGEAQGMVSKSIG 250

Query: 561  IMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS 740
            IMLGI LAN + SS PL+ ASF V+T VHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS
Sbjct: 251  IMLGIALANCIGSSAPLSFASFTVVTAVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLS 310

Query: 741  GLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVM 920
            G VPS++EVN+EEPLFP  PLL  K T + QS +LS++AKDAAA I+RRLQLGSKLS+V+
Sbjct: 311  GQVPSIKEVNEEEPLFPVVPLLNAKPTYKAQSAVLSTEAKDAAAEIERRLQLGSKLSEVV 370

Query: 921  KNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIK 1100
             ++ED LALFDLY++EAYILTE +GR+ V LKES SPQDML+S++ V YLYWLERNAGI 
Sbjct: 371  SSKEDVLALFDLYRNEAYILTEHKGRFFVILKESCSPQDMLKSVFHVNYLYWLERNAGII 430

Query: 1101 SSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            S    DDCRPGG+LQISLEYV+REFNH+KNDSE  GW  DGLIARPLPNRIR G+
Sbjct: 431  SMGASDDCRPGGRLQISLEYVQREFNHLKNDSEFVGWATDGLIARPLPNRIRPGH 485


>ref|XP_007040836.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508778081|gb|EOY25337.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 577

 Score =  629 bits (1621), Expect = e-177
 Identities = 314/420 (74%), Positives = 355/420 (84%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            +  +++  VW +CRD+   LLLPEGFP+SVTSDYL+YSLWRGVQGVA+QISGVLATQA+L
Sbjct: 166  TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALL 225

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+
Sbjct: 226  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGL 285

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTPAFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 286  EMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 345

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KSIGI+LGI LAN V SST LALASFGV+TWVHM+CNLKSYQSIQLRTLN YRASLVFSE
Sbjct: 346  KSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSE 405

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YLLSG  PS++EVNDEEPLFPA P L +   + E+S +LSS+AK AAA I+RRLQLGSKL
Sbjct: 406  YLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKL 465

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SD++ N+EDALALF LY+ E YILTE EG++C              SL+QV YLYWLERN
Sbjct: 466  SDIVNNKEDALALFSLYKDEGYILTEHEGKFC--------------SLFQVNYLYWLERN 511

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 1268
            AGI++S    DCRPGG+LQIS+EYV+REFNHVK DSES GW+ DGLIARPLPNRIR G++
Sbjct: 512  AGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHR 571


>ref|XP_006482412.1| PREDICTED: UPF0420 protein C16orf58-like [Citrus sinensis]
          Length = 586

 Score =  622 bits (1605), Expect = e-175
 Identities = 307/420 (73%), Positives = 353/420 (84%)
 Frame = +3

Query: 3    STSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQA 182
            S SS +++  +W +CR+L    +LPEGFP+SVTSDYL YSLWR VQGVA+QISGVLATQA
Sbjct: 159  SLSSLLSVNKLWDECRELFVQFMLPEGFPDSVTSDYLNYSLWRSVQGVASQISGVLATQA 218

Query: 183  MLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 362
            +LYAIGLGKGAIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAF
Sbjct: 219  LLYAIGLGKGAIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAF 278

Query: 363  GMEILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGM 542
            G+E+LTPAFPH FV I            LIQA+TRSCF+AGFAA+RNFAEVIAKGEAQGM
Sbjct: 279  GLEMLTPAFPHHFVFIGAAAGAGRSAAALIQASTRSCFYAGFAARRNFAEVIAKGEAQGM 338

Query: 543  VSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVF 722
            VSK+IGIMLGI LAN + SS P ALASF V+TW+HM+CNLKSYQSI+LRTLNPYRASLVF
Sbjct: 339  VSKAIGIMLGIALANHIGSSMPFALASFSVVTWIHMYCNLKSYQSIELRTLNPYRASLVF 398

Query: 723  SEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGS 902
            SEYLLSG  P V+EVNDEEPLFPAF    +K  ++ Q  +LSS+AKDAA  I+ RLQLGS
Sbjct: 399  SEYLLSGQAPPVKEVNDEEPLFPAFHFFKIKSANKSQLLVLSSEAKDAAVEIEHRLQLGS 458

Query: 903  KLSDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLE 1082
            KLSDV+ N+EDA ALF LY+ E YILTE  G++CV LKES+ PQDML+SL+Q  YLYWLE
Sbjct: 459  KLSDVVNNKEDAHALFSLYEDEGYILTEHGGKFCVVLKESALPQDMLKSLFQASYLYWLE 518

Query: 1083 RNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLG 1262
            RNAGI ++S   DC PGG+L+ISL+YV+REFNHVK+DS S GW+ DGLIARPLPNRIR G
Sbjct: 519  RNAGIVATSTSADCAPGGRLEISLDYVQREFNHVKSDSASVGWVTDGLIARPLPNRIRPG 578


>ref|XP_007040837.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508778082|gb|EOY25338.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 573

 Score =  621 bits (1602), Expect = e-175
 Identities = 311/420 (74%), Positives = 351/420 (83%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            +  +++  VW +CRD+   LLLPEGFP+SVTSDYL+YSLWRGVQGVA+QISGVLATQA+L
Sbjct: 166  TKSLSLSTVWRQCRDIVMRLLLPEGFPDSVTSDYLDYSLWRGVQGVASQISGVLATQALL 225

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFG+
Sbjct: 226  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGL 285

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTPAFPHLFVPI            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVS
Sbjct: 286  EMLTPAFPHLFVPIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVS 345

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KSIGI+LGI LAN V SST LALASFGV+TWVHM+CNLKSYQSIQLRTLN YRASLVFSE
Sbjct: 346  KSIGIVLGIALANCVGSSTSLALASFGVVTWVHMYCNLKSYQSIQLRTLNSYRASLVFSE 405

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YLLSG  PS++EVNDEEPLFPA P L +   + E+S +LSS+AK AAA I+RRLQLGSKL
Sbjct: 406  YLLSGQAPSIKEVNDEEPLFPAVPFLNLLSANRERSVVLSSEAKQAAADIERRLQLGSKL 465

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SD++ N+EDALALF LY+ E YILTE EG++CV                   YLYWLERN
Sbjct: 466  SDIVNNKEDALALFSLYKDEGYILTEHEGKFCVN------------------YLYWLERN 507

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGNQ 1268
            AGI++S    DCRPGG+LQIS+EYV+REFNHVK DSES GW+ DGLIARPLPNRIR G++
Sbjct: 508  AGIEASGASTDCRPGGRLQISVEYVQREFNHVKIDSESVGWVTDGLIARPLPNRIRPGHR 567


>gb|EXB41003.1| hypothetical protein L484_020738 [Morus notabilis]
          Length = 579

 Score =  618 bits (1593), Expect = e-174
 Identities = 312/423 (73%), Positives = 356/423 (84%), Gaps = 3/423 (0%)
 Frame = +3

Query: 3    STSSQMTMG--DVWT-KCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLA 173
            STSS   +   ++W  KCR L   L+LPEG+PESVTSDYL+YSLWR VQGVA+QIS VLA
Sbjct: 149  STSSTRPVSPLNLWLEKCRQLVMRLMLPEGYPESVTSDYLDYSLWRAVQGVASQISAVLA 208

Query: 174  TQAMLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLEN 353
            TQ++LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLEN
Sbjct: 209  TQSLLYAVGLGKGAIPTAAALNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLEN 268

Query: 354  AAFGMEILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEA 533
            AAFG E+LTPAFPHLFVPI            LIQAATRSCFFAGFAAQRNFAEVIAKGEA
Sbjct: 269  AAFGFEMLTPAFPHLFVPIGAVAGAGRSAATLIQAATRSCFFAGFAAQRNFAEVIAKGEA 328

Query: 534  QGMVSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRAS 713
            QGMVSKSIGI +GI LAN + +STPLALASF V+T++HM+CNLKSYQSIQLRTLNPYRAS
Sbjct: 329  QGMVSKSIGIAMGIGLANCIGTSTPLALASFSVVTFIHMYCNLKSYQSIQLRTLNPYRAS 388

Query: 714  LVFSEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQ 893
            LVFSEYLLSG  P ++EVNDE+PLFPA P+L VK  ++EQ  +LS++AK AAA ID RL 
Sbjct: 389  LVFSEYLLSGQAPPIKEVNDEDPLFPAVPVLNVKPVNKEQPAVLSAEAKVAAAEIDNRLL 448

Query: 894  LGSKLSDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLY 1073
            LGSKLSDV+ N +D LALFDLY++E YILTE  GR+CV LKE+ SP DML++++ V YLY
Sbjct: 449  LGSKLSDVVNNHKDVLALFDLYRNEGYILTEHNGRFCVVLKETCSPHDMLKAMFHVNYLY 508

Query: 1074 WLERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRI 1253
            WLE+NAGI  +S   D +PGG+LQISL+YV+REFNHVK D ESAGW  DGLIARPLPNRI
Sbjct: 509  WLEKNAGIDGASPYLDSKPGGRLQISLDYVEREFNHVKIDGESAGWATDGLIARPLPNRI 568

Query: 1254 RLG 1262
            R G
Sbjct: 569  RPG 571


>ref|XP_006369220.1| hypothetical protein POPTR_0001s19390g [Populus trichocarpa]
            gi|550347673|gb|ERP65789.1| hypothetical protein
            POPTR_0001s19390g [Populus trichocarpa]
          Length = 406

 Score =  615 bits (1587), Expect = e-173
 Identities = 305/399 (76%), Positives = 345/399 (86%)
 Frame = +3

Query: 69   LLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKGAIPTAAAVNWVL 248
            +LP+GFP SVTSDYL+YSLWR VQG+A+QISGVLATQA+LYA+GLGKGAIPTAAA+NWVL
Sbjct: 1    MLPQGFPRSVTSDYLDYSLWRAVQGIASQISGVLATQALLYAVGLGKGAIPTAAAINWVL 60

Query: 249  KDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFPHLFVPIXXXXXX 428
            KDGIGYLSKI+LSKYGRHFDV+PKGWRLFADLLENAAFG+E+LTPAFPHLFV I      
Sbjct: 61   KDGIGYLSKIVLSKYGRHFDVHPKGWRLFADLLENAAFGLEMLTPAFPHLFVFIGATAGA 120

Query: 429  XXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLGIVLANAVQSSTP 608
                  LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK IGIMLGI LAN + SSTP
Sbjct: 121  GRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLGIALANCIGSSTP 180

Query: 609  LALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVPSVREVNDEEPLF 788
            LALASF V+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P V+E+NDEEPLF
Sbjct: 181  LALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAPPVKEINDEEPLF 240

Query: 789  PAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNREDALALFDLYQSE 968
            PA P L +      QS +LSS+A++AAA I++RLQLGSKLSDV+ N++D LALF+LY+ E
Sbjct: 241  PAVPFLNIYSKGNVQSIVLSSEARNAAAEIEQRLQLGSKLSDVVNNKDDVLALFNLYRDE 300

Query: 969  AYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSIVDDCRPGGKLQI 1148
             YILTE +GR+CV LKESSSP DML+SL+QV YLYWLERNAGI++ SI  DCRP G+LQI
Sbjct: 301  GYILTEHKGRFCVVLKESSSPHDMLKSLFQVNYLYWLERNAGIEARSISADCRPEGRLQI 360

Query: 1149 SLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            SLEY +REFNHVKNDS S GW+ DGLIARP P R+  GN
Sbjct: 361  SLEYARREFNHVKNDSVSMGWVADGLIARPSPIRVCPGN 399


>ref|XP_004512305.1| PREDICTED: uncharacterized protein LOC101510665 [Cicer arietinum]
          Length = 590

 Score =  614 bits (1583), Expect = e-173
 Identities = 300/411 (72%), Positives = 347/411 (84%)
 Frame = +3

Query: 33   VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 212
            ++TKC++ T  L+LPEGFP SVTSDYLEYSLWRGVQGVA Q+SGVLATQA+LYA+GLGKG
Sbjct: 178  LYTKCKEFTVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQALLYAVGLGKG 237

Query: 213  AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 392
            AIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFP
Sbjct: 238  AIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFP 297

Query: 393  HLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 572
            HLFVPI            LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LG
Sbjct: 298  HLFVPIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIALG 357

Query: 573  IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 752
            I L N + SSTPL LASF V+TWVHM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P
Sbjct: 358  IGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 417

Query: 753  SVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRE 932
             V+EVNDEEPLFPA P+L     ++ QS +LSS+AKDAA  I+ RLQLGSKLS+++ N+E
Sbjct: 418  PVKEVNDEEPLFPALPILNACFANKAQSIVLSSEAKDAAVEIESRLQLGSKLSEIIHNKE 477

Query: 933  DALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSI 1112
            + LALF LY++E YIL+E  G++CV LKE+ S  DML++L+QV YLYWLE+NAGI+    
Sbjct: 478  EVLALFSLYKNEGYILSEHTGKFCVVLKENCSQLDMLKALFQVNYLYWLEKNAGIEGRGA 537

Query: 1113 VDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            + DC+PGG+L+ISLEY +REFNH +ND ESAGWI DGLIARPLPNRIR GN
Sbjct: 538  LYDCKPGGRLRISLEYAEREFNHARNDGESAGWIADGLIARPLPNRIRPGN 588


>ref|XP_002519954.1| conserved hypothetical protein [Ricinus communis]
            gi|223541000|gb|EEF42558.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 541

 Score =  610 bits (1573), Expect = e-172
 Identities = 306/412 (74%), Positives = 348/412 (84%), Gaps = 1/412 (0%)
 Frame = +3

Query: 33   VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 212
            +W +CR L   L+LPEG+P SVTSDYL+YSLWRGVQGVA+QISGVLATQA+LYAIGLGKG
Sbjct: 124  LWLQCRALFVRLMLPEGYPHSVTSDYLDYSLWRGVQGVASQISGVLATQALLYAIGLGKG 183

Query: 213  AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 392
            AIPTAAA+NWVLKDGIGYLSKI+LSKYGRHFDVNPKGWRLFADLLENAAFG+EILTPAFP
Sbjct: 184  AIPTAAAINWVLKDGIGYLSKIVLSKYGRHFDVNPKGWRLFADLLENAAFGLEILTPAFP 243

Query: 393  HLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 572
            HLFV I            LIQAATRSCF+AGFAAQRNFAEVIAKGEAQGMVSK IGIMLG
Sbjct: 244  HLFVFIGAAAGAGRSAAALIQAATRSCFYAGFAAQRNFAEVIAKGEAQGMVSKFIGIMLG 303

Query: 573  IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 752
            I LAN + SS PLALASF V+TW+HMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P
Sbjct: 304  IGLANCIGSSIPLALASFSVVTWIHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 363

Query: 753  SVREVNDEEPLFPA-FPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNR 929
             +++VNDEEPLFPA FP    K   +    +LS +A+DAA  I+RRLQLGSKLSDV+ ++
Sbjct: 364  PIKDVNDEEPLFPAVFPHF--KSADKPSLVVLSLEARDAATEIERRLQLGSKLSDVVNSK 421

Query: 930  EDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSS 1109
            ED LALF+LY+ E YILTE +GR+CV LKES S QDML++L+QV YLYWLERNAG+ +  
Sbjct: 422  EDVLALFNLYKDEGYILTEYKGRFCVVLKESCSAQDMLKALFQVNYLYWLERNAGLDARG 481

Query: 1110 IVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
               DCR GG+LQ+SLEY++REF+HV+NDS S GW+ DGLIARPLPNRI  G+
Sbjct: 482  TSADCRSGGRLQVSLEYMQREFSHVRNDSISVGWVADGLIARPLPNRIYPGD 533


>ref|XP_006290708.1| hypothetical protein CARUB_v10016806mg [Capsella rubella]
            gi|482559415|gb|EOA23606.1| hypothetical protein
            CARUB_v10016806mg [Capsella rubella]
          Length = 657

 Score =  609 bits (1571), Expect = e-172
 Identities = 302/418 (72%), Positives = 351/418 (83%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            SS +T  ++  +CR L    LLPEG+P SVTSDYL+YSLWRGVQG+A+QISGVLATQ++L
Sbjct: 228  SSSLTPENLLAQCRSLLTQFLLPEGYPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLL 287

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGM
Sbjct: 288  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGM 347

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTP FP  FV I            LIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVS
Sbjct: 348  EMLTPLFPQFFVMIGAGAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVS 407

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTLNPYRASLVFSE
Sbjct: 408  KSMGILLGIVVANCIGTSTSLALAAFGVVTAIHMYTNLKSYQCIQLRTLNPYRASLVFSE 467

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YL+SG  P ++EVNDEEPLFPA   L +K   + Q  +LSS+AK AAA I+ RLQLGSKL
Sbjct: 468  YLISGQAPLIKEVNDEEPLFPAVRFLNIKSPGKLQDFVLSSEAKSAAADIEERLQLGSKL 527

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SDV+ N+E+A+ALFDLY++E YILTE  GR+CV LKESSSPQDMLRSL+QV YLYWLE+N
Sbjct: 528  SDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSSPQDMLRSLFQVNYLYWLEKN 587

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLG 1262
            AGI+ +S   DC+PGG+L ISL+YV+REF H K DSES GW+ +GLIARPLP RIRLG
Sbjct: 588  AGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLG 645


>ref|XP_006878573.1| hypothetical protein AMTR_s00011p00244680 [Amborella trichopoda]
            gi|548831916|gb|ERM94718.1| hypothetical protein
            AMTR_s00011p00244680 [Amborella trichopoda]
          Length = 565

 Score =  609 bits (1570), Expect = e-171
 Identities = 302/412 (73%), Positives = 344/412 (83%)
 Frame = +3

Query: 24   MGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGL 203
            +G  W  CR+L   L+LPEG+P SV+SDYLEYSLWR VQGVA+QI+GVL TQA+LYA+GL
Sbjct: 148  LGSSWLWCRELAVRLMLPEGYPASVSSDYLEYSLWRAVQGVASQINGVLTTQALLYAVGL 207

Query: 204  GKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTP 383
            GKGAIPTAAAVNWVLKDG+GYLSKI LSKYGRHFDV+PKGWRLFADLLENAA+G+E+LTP
Sbjct: 208  GKGAIPTAAAVNWVLKDGLGYLSKIFLSKYGRHFDVHPKGWRLFADLLENAAYGLELLTP 267

Query: 384  AFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI 563
            A+P  FV I            LIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI
Sbjct: 268  AYPQFFVLIGAAAGAGRSAAALIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGI 327

Query: 564  MLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSG 743
            MLGI LAN + +S PLA ASFGV+T VHMFCNLKSYQSIQLRTLNPYR SLVFSEYLLSG
Sbjct: 328  MLGIALANHIGASGPLAAASFGVVTAVHMFCNLKSYQSIQLRTLNPYRGSLVFSEYLLSG 387

Query: 744  LVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMK 923
             VP V+EVNDEEPLF     L V      QS++LS++AK+AAA I+ RLQLG KLSDV+ 
Sbjct: 388  EVPPVKEVNDEEPLFSGSSFLKVVPVQHAQSQVLSAEAKEAAAQIESRLQLGCKLSDVVS 447

Query: 924  NREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKS 1103
             +ED LALFDL++ E YILTE +G+YCV LKE  SPQDML+SL+QV YLYWLERNAGI S
Sbjct: 448  KKEDVLALFDLFEKEGYILTEQKGKYCVVLKEDYSPQDMLKSLFQVSYLYWLERNAGIDS 507

Query: 1104 SSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 1259
             S   DC+PGGK+Q+S +YV+REFNHVKNDS++AGWI DGLIARPLP R+R+
Sbjct: 508  RSASTDCKPGGKMQLSYDYVQREFNHVKNDSQAAGWITDGLIARPLPCRVRV 559


>ref|NP_190175.2| protein ROOT UVB SENSITIVE 1 [Arabidopsis thaliana]
            gi|30793915|gb|AAP40410.1| unknown protein [Arabidopsis
            thaliana] gi|30794095|gb|AAP40490.1| unknown protein
            [Arabidopsis thaliana] gi|110739240|dbj|BAF01534.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332644566|gb|AEE78087.1| protein root UVB sensitive 1
            [Arabidopsis thaliana]
          Length = 608

 Score =  607 bits (1564), Expect = e-171
 Identities = 300/419 (71%), Positives = 352/419 (84%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            SS +T  ++  +CR+L    LLPEGFP SVTSDYL+YSLWRGVQG+A+QISGVLATQ++L
Sbjct: 178  SSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQISGVLATQSLL 237

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGM
Sbjct: 238  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGM 297

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTP FP  FV I            LIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVS
Sbjct: 298  EMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVS 357

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTLNPYRASLVFSE
Sbjct: 358  KSVGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSE 417

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YL+SG  P ++EVNDEEPLFP      +K   + Q  +LSS+AK AAA I+ RLQLGSKL
Sbjct: 418  YLISGQAPLIKEVNDEEPLFPTVRFSNMKSPEKLQDFVLSSEAKAAAADIEERLQLGSKL 477

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SDV+ N+E+A+ALFDLY++E YILTE +GR+CV LKESS+PQDMLRSL+QV YLYWLE+N
Sbjct: 478  SDVIHNKEEAIALFDLYRNEGYILTEHKGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKN 537

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            AGI+ +S   DC+PGG+L ISL+YV+REF H K DSES GW+ +GLIARPLP RIRLG+
Sbjct: 538  AGIEPASTYSDCKPGGRLHISLDYVRREFEHAKEDSESVGWVTEGLIARPLPTRIRLGH 596


>ref|XP_003612453.1| hypothetical protein MTR_5g025160 [Medicago truncatula]
            gi|355513788|gb|AES95411.1| hypothetical protein
            MTR_5g025160 [Medicago truncatula]
          Length = 630

 Score =  605 bits (1559), Expect = e-170
 Identities = 300/422 (71%), Positives = 348/422 (82%), Gaps = 1/422 (0%)
 Frame = +3

Query: 3    STSSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQA 182
            S +S      ++ KCR+    L+LPEGFP SVTSDYLEYSLWRGVQGVA Q+SGVLATQA
Sbjct: 157  SLNSSQVPTFLYNKCREFVVRLMLPEGFPNSVTSDYLEYSLWRGVQGVACQVSGVLATQA 216

Query: 183  MLYAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAF 362
            +LYA+GLGKGAIPTAAA+NWVLKDGIGYLSKI+LS +GRHFDVNPKGWRLFADLLENAAF
Sbjct: 217  LLYAVGLGKGAIPTAAAINWVLKDGIGYLSKILLSDFGRHFDVNPKGWRLFADLLENAAF 276

Query: 363  GMEILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGM 542
            G+E+ TPAFPHLFVPI            LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM
Sbjct: 277  GLEMCTPAFPHLFVPIGAFAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGM 336

Query: 543  VSKSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVF 722
            VS+ IGI +GI L N + SSTPL LASF V+TWVHM+CNLKSYQSIQLRTLNP+RASLVF
Sbjct: 337  VSRFIGIGIGIGLGNCIGSSTPLVLASFCVVTWVHMYCNLKSYQSIQLRTLNPHRASLVF 396

Query: 723  SEYLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEE-QSELLSSDAKDAAAYIDRRLQLG 899
            SEYLLSG  P V+EVN EEPLFPA P+L     ++E QS +LSS+AKDAA  I+ RLQLG
Sbjct: 397  SEYLLSGQAPPVKEVNAEEPLFPAVPILNAPFANKETQSIVLSSEAKDAAVEIESRLQLG 456

Query: 900  SKLSDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWL 1079
            SKLS+++ N+E+ LALF LY++E YIL+E  G++CV LKE+ S  DML++L+QV YLYWL
Sbjct: 457  SKLSEIINNKEEVLALFSLYKNEGYILSEHTGKFCVVLKETCSQLDMLKALFQVNYLYWL 516

Query: 1080 ERNAGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRL 1259
            E+NAGI+    + DC+PGG+LQISLEY +REFNHV+ND ES GWI DGLIARPLPNR R 
Sbjct: 517  EKNAGIEGRGTLYDCKPGGRLQISLEYAEREFNHVRNDGESVGWITDGLIARPLPNRCRP 576

Query: 1260 GN 1265
            GN
Sbjct: 577  GN 578


>ref|XP_002875756.1| hypothetical protein ARALYDRAFT_905765 [Arabidopsis lyrata subsp.
            lyrata] gi|297321594|gb|EFH52015.1| hypothetical protein
            ARALYDRAFT_905765 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  604 bits (1557), Expect = e-170
 Identities = 298/419 (71%), Positives = 351/419 (83%)
 Frame = +3

Query: 9    SSQMTMGDVWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAML 188
            SS +T  ++  +CR+L    LLPEGFP SVTSDYL+YSLWRGVQG+A+Q+SGVLATQ++L
Sbjct: 184  SSSLTPENLLAQCRNLLTQFLLPEGFPNSVTSDYLDYSLWRGVQGIASQVSGVLATQSLL 243

Query: 189  YAIGLGKGAIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGM 368
            YA+GLGKGAIPTAAA+NWVLKDGIGYLSKIMLSKYGRHFDV+PKGWRLFADLLENAAFGM
Sbjct: 244  YAVGLGKGAIPTAAAINWVLKDGIGYLSKIMLSKYGRHFDVHPKGWRLFADLLENAAFGM 303

Query: 369  EILTPAFPHLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVS 548
            E+LTP FP  FV I            LIQAATRSCF AGFA+QRNFAEVIAKGEAQGMVS
Sbjct: 304  EMLTPVFPQFFVMIGAAAGAGRSAAALIQAATRSCFNAGFASQRNFAEVIAKGEAQGMVS 363

Query: 549  KSIGIMLGIVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSE 728
            KS+GI+LGIV+AN + +ST LALA+FGV+T +HM+ NLKSYQ IQLRTLNPYRASLVFSE
Sbjct: 364  KSMGILLGIVVANCIGTSTSLALAAFGVVTTIHMYTNLKSYQCIQLRTLNPYRASLVFSE 423

Query: 729  YLLSGLVPSVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKL 908
            YL+SG  P ++EVNDEEPLFP    L +K   + Q  +LSS+AK AA  I+ RLQLGSKL
Sbjct: 424  YLISGQAPLIKEVNDEEPLFPTVRFLNMKSPEKLQDFVLSSEAKAAAEDIEERLQLGSKL 483

Query: 909  SDVMKNREDALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERN 1088
            SDV+ N+E+A+ALFDLY++E YILTE  GR+CV LKESS+PQDMLRSL+QV YLYWLE+N
Sbjct: 484  SDVIHNKEEAIALFDLYRNEGYILTEHRGRFCVMLKESSTPQDMLRSLFQVNYLYWLEKN 543

Query: 1089 AGIKSSSIVDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            AGI+ +S   DC+PGG+L ISL+YV+REF H K DS+S GW+ +GLIARPLP RIRLG+
Sbjct: 544  AGIEPASTYTDCKPGGRLHISLDYVRREFEHAKEDSQSVGWVTEGLIARPLPTRIRLGH 602


>ref|XP_007158055.1| hypothetical protein PHAVU_002G120300g [Phaseolus vulgaris]
            gi|561031470|gb|ESW30049.1| hypothetical protein
            PHAVU_002G120300g [Phaseolus vulgaris]
          Length = 592

 Score =  603 bits (1556), Expect = e-170
 Identities = 296/411 (72%), Positives = 342/411 (83%)
 Frame = +3

Query: 33   VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 212
            VW KCRD+   L+LPEGFPESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKG
Sbjct: 173  VWLKCRDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 232

Query: 213  AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 392
            AIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDVNPKGWRLFADLLENAAFG+E+ TPAFP
Sbjct: 233  AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVNPKGWRLFADLLENAAFGLEMCTPAFP 292

Query: 393  HLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 572
              FV I            LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LG
Sbjct: 293  QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLG 352

Query: 573  IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 752
            I L N + SSTPL LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P
Sbjct: 353  IGLGNCIGSSTPLVLASFIVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 412

Query: 753  SVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRE 932
             V++VNDEEPLFPA P+L     ++ +S  LSS+AKDAAA I+RRLQLGSKLS+++  +E
Sbjct: 413  PVKDVNDEEPLFPAVPILNATFANKARSIALSSEAKDAAAEIERRLQLGSKLSEIVNGKE 472

Query: 933  DALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSI 1112
            D LALF LY+ E YIL+E  G++CV LKE+ S QDML++L+QV YLYWLE+NAGI     
Sbjct: 473  DVLALFRLYKKEGYILSEHMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGT 532

Query: 1113 VDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            ++D RPGG+L  SL+YV+REFNH+KND ES GW+ DGLIARPLPNRIR+G+
Sbjct: 533  LNDSRPGGRLHTSLDYVEREFNHLKNDGESVGWVTDGLIARPLPNRIRIGD 583


>ref|XP_006573502.1| PREDICTED: uncharacterized protein LOC100778944 [Glycine max]
          Length = 593

 Score =  603 bits (1555), Expect = e-170
 Identities = 295/411 (71%), Positives = 343/411 (83%)
 Frame = +3

Query: 33   VWTKCRDLTASLLLPEGFPESVTSDYLEYSLWRGVQGVAAQISGVLATQAMLYAIGLGKG 212
            VW KC D+   L+LPEGFPESVTSDYLEYSLWR VQGVA Q+SGVLATQ++LYA+GLGKG
Sbjct: 174  VWLKCSDIFTRLMLPEGFPESVTSDYLEYSLWRAVQGVACQVSGVLATQSLLYAVGLGKG 233

Query: 213  AIPTAAAVNWVLKDGIGYLSKIMLSKYGRHFDVNPKGWRLFADLLENAAFGMEILTPAFP 392
            AIPTAAA+NWVLKDGIGYLSKIMLS +GRHFDV+PKGWRLFADLLENAAFG+E+ TPAFP
Sbjct: 234  AIPTAAAINWVLKDGIGYLSKIMLSNFGRHFDVDPKGWRLFADLLENAAFGLEMCTPAFP 293

Query: 393  HLFVPIXXXXXXXXXXXXLIQAATRSCFFAGFAAQRNFAEVIAKGEAQGMVSKSIGIMLG 572
              FV I            LIQA+TRSCFFAGFAAQRNFAEVIAKGE QGM S+ IGI LG
Sbjct: 294  QFFVLIGAVAGASRSAASLIQASTRSCFFAGFAAQRNFAEVIAKGEVQGMASRFIGIGLG 353

Query: 573  IVLANAVQSSTPLALASFGVITWVHMFCNLKSYQSIQLRTLNPYRASLVFSEYLLSGLVP 752
            I L N + SSTPL LASF V+TW+HM+CNLKSYQSIQLRTLNPYRASLVFSEYLLSG  P
Sbjct: 354  IGLGNCIGSSTPLVLASFTVLTWIHMYCNLKSYQSIQLRTLNPYRASLVFSEYLLSGQAP 413

Query: 753  SVREVNDEEPLFPAFPLLIVKRTSEEQSELLSSDAKDAAAYIDRRLQLGSKLSDVMKNRE 932
             V+EVNDEEPLFPA P+L     ++ QS +LSS+AKDAAA I+ RLQLGSKLS+++ ++E
Sbjct: 414  PVKEVNDEEPLFPAVPILNATFANKAQSIVLSSEAKDAAAEIEHRLQLGSKLSEIVNSKE 473

Query: 933  DALALFDLYQSEAYILTELEGRYCVALKESSSPQDMLRSLYQVCYLYWLERNAGIKSSSI 1112
            D LALF LY++E YIL+E  G++CV LKE+ S QDML++L+QV YLYWLE+NAGI     
Sbjct: 474  DVLALFGLYKNEGYILSEYMGKFCVVLKENCSQQDMLKALFQVNYLYWLEKNAGIGGRGT 533

Query: 1113 VDDCRPGGKLQISLEYVKREFNHVKNDSESAGWILDGLIARPLPNRIRLGN 1265
            ++D +PGG+L ISL+YV+REFNHVKND E  GW+ DGLIARPLPNRIR+G+
Sbjct: 534  LNDSKPGGRLHISLDYVEREFNHVKNDGELVGWVTDGLIARPLPNRIRIGD 584

