BLASTX nr result

ID: Atropa21_contig00006225

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00006225
         (1386 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350310.1| PREDICTED: uncharacterized protein LOC102591...   589   e-166
ref|XP_004247100.1| PREDICTED: uncharacterized protein LOC101261...   582   e-163
ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citr...   382   e-103
ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citr...   382   e-103
ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citr...   381   e-103
gb|EOX93978.1| B3 domain-containing transcription factor VAL3, p...   379   e-102
gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]     369   1e-99
ref|XP_002521120.1| conserved hypothetical protein [Ricinus comm...   364   4e-98
ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300...   361   5e-97
ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300...   361   5e-97
ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263...   360   1e-96
ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Caps...   350   1e-93
ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Popu...   345   3e-92
gb|EMJ02549.1| hypothetical protein PRUPE_ppa007686mg [Prunus pe...   344   4e-92
ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791...   343   1e-91
ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arab...   342   2e-91
gb|EPS72073.1| hypothetical protein M569_02687, partial [Genlise...   342   2e-91
ref|NP_189118.1| uncharacterized protein [Arabidopsis thaliana] ...   339   2e-90
ref|XP_004492331.1| PREDICTED: uncharacterized protein LOC101499...   338   2e-90
gb|ESW12557.1| hypothetical protein PHAVU_008G123200g [Phaseolus...   336   1e-89

>ref|XP_006350310.1| PREDICTED: uncharacterized protein LOC102591236 isoform X1 [Solanum
            tuberosum] gi|565367302|ref|XP_006350311.1| PREDICTED:
            uncharacterized protein LOC102591236 isoform X2 [Solanum
            tuberosum] gi|565367304|ref|XP_006350312.1| PREDICTED:
            uncharacterized protein LOC102591236 isoform X3 [Solanum
            tuberosum]
          Length = 385

 Score =  589 bits (1519), Expect = e-166
 Identities = 302/388 (77%), Positives = 315/388 (81%), Gaps = 24/388 (6%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MASRKRSMSN VDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSY+HS
Sbjct: 1    MASRKRSMSNDVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYKHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVV------------- 1017
            NCLDRFKKL+AEN D+PPIMTQGNLDIAVETP EHLEL+NLSD TVV             
Sbjct: 61   NCLDRFKKLKAENRDNPPIMTQGNLDIAVETPAEHLELKNLSDRTVVHGYHDIPANEVVA 120

Query: 1016 ----------DGNSNRDTHVEMQEDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMC 867
                      +GNSNRD  +EMQE  LQTS AVT+WGSSHETA  DNSSDS LKLKCPMC
Sbjct: 121  TGAFPGGSEENGNSNRDNRMEMQEGGLQTSDAVTVWGSSHETANADNSSDSILKLKCPMC 180

Query: 866  RGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQR 687
            RGDVLGWKVVEEARKYLNLK RSCSRESCSF+GNYRELRRHARRDHPTARPADIDPSRQR
Sbjct: 181  RGDVLGWKVVEEARKYLNLKHRSCSRESCSFLGNYRELRRHARRDHPTARPADIDPSRQR 240

Query: 686  AWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLF 507
            AWRRLENQREYDDIVSAVRSAMPGAVV GDYVIEN                RWLSTFFLF
Sbjct: 241  AWRRLENQREYDDIVSAVRSAMPGAVVFGDYVIENGDRLSGERERGSGANGRWLSTFFLF 300

Query: 506  QMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSE-EDQPNLNL 330
            QMIGSMDPI EARGGRSRALSRHRRS GPLSRRRYPWGENLLGLQD D++E E +P+LN+
Sbjct: 301  QMIGSMDPISEARGGRSRALSRHRRSTGPLSRRRYPWGENLLGLQDHDNNEDEGEPDLNI 360

Query: 329  LSDMSDDMSTNPXXXXXXXXXXSDEDQQ 246
            L   S DMS NP          SDEDQQ
Sbjct: 361  L---SGDMSNNPRRRRRLMRSRSDEDQQ 385


>ref|XP_004247100.1| PREDICTED: uncharacterized protein LOC101261359 isoform 1 [Solanum
            lycopersicum] gi|460403239|ref|XP_004247101.1| PREDICTED:
            uncharacterized protein LOC101261359 isoform 2 [Solanum
            lycopersicum] gi|460403241|ref|XP_004247102.1| PREDICTED:
            uncharacterized protein LOC101261359 isoform 3 [Solanum
            lycopersicum]
          Length = 385

 Score =  582 bits (1500), Expect = e-163
 Identities = 298/388 (76%), Positives = 311/388 (80%), Gaps = 24/388 (6%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MASRKRSMSN VDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MASRKRSMSNDVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVV------------- 1017
            NCLDRFKKL+AEN D+PP MTQGNLDIAVE P EHLELRNLSD TVV             
Sbjct: 61   NCLDRFKKLKAENRDNPPTMTQGNLDIAVEIPAEHLELRNLSDRTVVHGYHDIPADEVVA 120

Query: 1016 ----------DGNSNRDTHVEMQEDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMC 867
                      +GNSNRD  +EMQE  LQTS AVT+WGSSHET   DNSSDS LKLKCPMC
Sbjct: 121  TGAFPGGSEENGNSNRDNRMEMQEGALQTSDAVTVWGSSHETVNADNSSDSILKLKCPMC 180

Query: 866  RGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQR 687
            RGDVLGWKVVEEARKYLNLK RSCSRESCSF+GNYRELRRHARRDHPTARPADIDPSRQR
Sbjct: 181  RGDVLGWKVVEEARKYLNLKHRSCSRESCSFLGNYRELRRHARRDHPTARPADIDPSRQR 240

Query: 686  AWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLF 507
            AWRRLENQREYDDIVSAVRSAMPGAVV GDYVIEN                RWLSTFFLF
Sbjct: 241  AWRRLENQREYDDIVSAVRSAMPGAVVFGDYVIENGDRLSVERERGSGANGRWLSTFFLF 300

Query: 506  QMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSE-EDQPNLNL 330
            QM GSMDPI EARGGRSRALSRHRRS GPLSRRRYPWGENLLGLQD +++E E +P++N+
Sbjct: 301  QMFGSMDPISEARGGRSRALSRHRRSTGPLSRRRYPWGENLLGLQDHNNNEDEGEPDVNI 360

Query: 329  LSDMSDDMSTNPXXXXXXXXXXSDEDQQ 246
            L   S DMS NP          SDEDQQ
Sbjct: 361  L---SGDMSNNPRRRRRLMRSRSDEDQQ 385


>ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902306|ref|XP_006443641.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902312|ref|XP_006443644.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902314|ref|XP_006443645.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902316|ref|XP_006443646.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902318|ref|XP_006443647.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853098|ref|XP_006480204.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X2 [Citrus
            sinensis] gi|568853100|ref|XP_006480205.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X3 [Citrus
            sinensis] gi|557545901|gb|ESR56879.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545903|gb|ESR56881.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545906|gb|ESR56884.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545907|gb|ESR56885.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545908|gb|ESR56886.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545909|gb|ESR56887.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 389

 Score =  382 bits (981), Expect = e-103
 Identities = 215/391 (54%), Positives = 261/391 (66%), Gaps = 28/391 (7%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR M    D+H L+KELD  SCPICMDHPHNAVLL+CSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIM-----------TQGNLDIAVETPN-EHLELRNLSDPTVV- 1017
            NCLDR+KKLR  + ++  +               ++++A+ T   E  E  NL+    + 
Sbjct: 61   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNSNASDMNLALRTDFVESSENLNLNGSNALS 120

Query: 1016 DG--NSNRDTHVEMQEDTLQTSGAVTL---WGSS---HETAR-----GDNSSDSKLKLKC 876
            DG      + +++  +  L+  G   L    G+S   HE         DNSS+S L LKC
Sbjct: 121  DGLPEGPGENNIQQADRLLEREGEGNLNPEAGNSQTFHERTELEGLDVDNSSESILTLKC 180

Query: 875  PMCRGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPS 696
            PMCRG +LGW+VVEEARKYLNLK R+CSRESCSF+GNY+ELRRHARR HPT RP+DIDPS
Sbjct: 181  PMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHARRAHPTTRPSDIDPS 240

Query: 695  RQRAWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLST 519
            R+RAWRRLE+QREY DIVSA+RS+MPGAVV+GDYVIEN                  W +T
Sbjct: 241  RERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGRESGNGEVNAPWWTT 300

Query: 518  FFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLS-RRRYPWGENLLGLQDDDDSEEDQP 342
            FFLF MIGSMD   E+R  RSRA +RHRR+ G LS RRR+ WGENLLGLQD++D EED  
Sbjct: 301  FFLFHMIGSMDGTGESR-ARSRAWTRHRRTAGALSERRRFLWGENLLGLQDEEDDEED-- 357

Query: 341  NLNLLSDMSDDMSTNPXXXXXXXXXXSDEDQ 249
            +L++ SD+ +D S  P          SDEDQ
Sbjct: 358  DLHIFSDVGEDTSPIPRRRRRLTQSRSDEDQ 388


>ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902304|ref|XP_006443640.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902308|ref|XP_006443642.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853096|ref|XP_006480203.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X1 [Citrus
            sinensis] gi|557545900|gb|ESR56878.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545902|gb|ESR56880.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545904|gb|ESR56882.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 415

 Score =  382 bits (981), Expect = e-103
 Identities = 215/391 (54%), Positives = 261/391 (66%), Gaps = 28/391 (7%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR M    D+H L+KELD  SCPICMDHPHNAVLL+CSSHDKGCRSYICDTSYRHS
Sbjct: 27   MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 86

Query: 1157 NCLDRFKKLRAENMDHPPIM-----------TQGNLDIAVETPN-EHLELRNLSDPTVV- 1017
            NCLDR+KKLR  + ++  +               ++++A+ T   E  E  NL+    + 
Sbjct: 87   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNSNASDMNLALRTDFVESSENLNLNGSNALS 146

Query: 1016 DG--NSNRDTHVEMQEDTLQTSGAVTL---WGSS---HETAR-----GDNSSDSKLKLKC 876
            DG      + +++  +  L+  G   L    G+S   HE         DNSS+S L LKC
Sbjct: 147  DGLPEGPGENNIQQADRLLEREGEGNLNPEAGNSQTFHERTELEGLDVDNSSESILTLKC 206

Query: 875  PMCRGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPS 696
            PMCRG +LGW+VVEEARKYLNLK R+CSRESCSF+GNY+ELRRHARR HPT RP+DIDPS
Sbjct: 207  PMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHARRAHPTTRPSDIDPS 266

Query: 695  RQRAWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLST 519
            R+RAWRRLE+QREY DIVSA+RS+MPGAVV+GDYVIEN                  W +T
Sbjct: 267  RERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGRESGNGEVNAPWWTT 326

Query: 518  FFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLS-RRRYPWGENLLGLQDDDDSEEDQP 342
            FFLF MIGSMD   E+R  RSRA +RHRR+ G LS RRR+ WGENLLGLQD++D EED  
Sbjct: 327  FFLFHMIGSMDGTGESR-ARSRAWTRHRRTAGALSERRRFLWGENLLGLQDEEDDEED-- 383

Query: 341  NLNLLSDMSDDMSTNPXXXXXXXXXXSDEDQ 249
            +L++ SD+ +D S  P          SDEDQ
Sbjct: 384  DLHIFSDVGEDTSPIPRRRRRLTQSRSDEDQ 414


>ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|557545905|gb|ESR56883.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 381

 Score =  381 bits (979), Expect = e-103
 Identities = 209/374 (55%), Positives = 249/374 (66%), Gaps = 11/374 (2%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR M    D+H L+KELD  SCPICMDHPHNAVLL+CSSHDKGCRSYICDTSYRHS
Sbjct: 27   MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 86

Query: 1157 NCLDRFKKLRAENMDHP---------PIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNS 1005
            NCLDR+KKLR  + ++          P   +G  +  ++  +  LE     +     GNS
Sbjct: 87   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNKGPGENNIQQADRLLEREGEGNLNPEAGNS 146

Query: 1004 NRDTHVEMQEDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEAR 825
             +  H   + + L                  DNSS+S L LKCPMCRG +LGW+VVEEAR
Sbjct: 147  -QTFHERTELEGLDV----------------DNSSESILTLKCPMCRGAILGWEVVEEAR 189

Query: 824  KYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDI 645
            KYLNLK R+CSRESCSF+GNY+ELRRHARR HPT RP+DIDPSR+RAWRRLE+QREY DI
Sbjct: 190  KYLNLKRRTCSRESCSFVGNYQELRRHARRAHPTTRPSDIDPSRERAWRRLEHQREYSDI 249

Query: 644  VSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEAR 468
            VSA+RS+MPGAVV+GDYVIEN                  W +TFFLF MIGSMD   E+R
Sbjct: 250  VSAIRSSMPGAVVVGDYVIENGDRFSAGRESGNGEVNAPWWTTFFLFHMIGSMDGTGESR 309

Query: 467  GGRSRALSRHRRSNGPLS-RRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMSTNPX 291
              RSRA +RHRR+ G LS RRR+ WGENLLGLQD++D EED  +L++ SD+ +D S  P 
Sbjct: 310  -ARSRAWTRHRRTAGALSERRRFLWGENLLGLQDEEDDEED--DLHIFSDVGEDTSPIPR 366

Query: 290  XXXXXXXXXSDEDQ 249
                     SDEDQ
Sbjct: 367  RRRRLTQSRSDEDQ 380


>gb|EOX93978.1| B3 domain-containing transcription factor VAL3, putative isoform 1
            [Theobroma cacao] gi|508702083|gb|EOX93979.1| B3
            domain-containing transcription factor VAL3, putative
            isoform 1 [Theobroma cacao]
          Length = 377

 Score =  379 bits (974), Expect = e-102
 Identities = 205/382 (53%), Positives = 247/382 (64%), Gaps = 19/382 (4%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +    D+  L+KELD  SCPICMDHPHNAVLLLCSSH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRIITDSDIRALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPI----------MTQGNLDIAVETPN-EHLELRNLSDPTVVDG 1011
            NCLDR+KKLRA +   P +           +  ++++A+ T   E    RNL++     G
Sbjct: 61   NCLDRYKKLRAYSSKSPMLPHPIPQNRQNSSTSDMNLALRTDFIEGNGSRNLNETNSTPG 120

Query: 1010 NSNRDTHVEMQEDTLQTSGAVTLWGSSHETARGD-------NSSDSKLKLKCPMCRGDVL 852
             S  +     +    Q  G + +  S     R +       N+S+SK  LKCP+CRGD+ 
Sbjct: 121  RSEGNIQEPNRHLDSQGEGIIEIGDSDSSQGRAESEELDAENTSESKSSLKCPLCRGDIH 180

Query: 851  GWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRL 672
            GW+VVEEAR YLNLK RSCSRESC++ GNY+ELRRHARR HPT RP+DIDPSR+R WRRL
Sbjct: 181  GWEVVEEARMYLNLKKRSCSRESCAYNGNYQELRRHARRVHPTTRPSDIDPSRERDWRRL 240

Query: 671  ENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLSTFFLFQMIG 495
            E+QREY DIVSA+RSAMPGA+V+GDY IEN                  W +TFFLFQMIG
Sbjct: 241  EHQREYGDIVSAIRSAMPGAIVVGDYAIENGDRLAADRDSGTGEESAPWWTTFFLFQMIG 300

Query: 494  SMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMS 315
            S+D + E R  RSR  SRHRR  G LS RR+ WGENLLGLQDDDD +     L +LSD+ 
Sbjct: 301  SIDSVGEPR-ARSRVWSRHRRPAGALSERRFLWGENLLGLQDDDDDD-----LRILSDVG 354

Query: 314  DDMSTNPXXXXXXXXXXSDEDQ 249
            +D S NP          SDEDQ
Sbjct: 355  EDPSPNPRRRRRLTRSRSDEDQ 376


>gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]
          Length = 373

 Score =  369 bits (948), Expect = 1e-99
 Identities = 202/362 (55%), Positives = 241/362 (66%), Gaps = 14/362 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA   R +    DM  L+KELD  SCPICMDHPHNAVLLLCSSHDKGCRSY+CDTSYRHS
Sbjct: 1    MAGVNRRICTDSDMRALHKELDEISCPICMDHPHNAVLLLCSSHDKGCRSYVCDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPP-----IMTQGNLDIAVETPNEHLELRNLSDPTVVD------G 1011
            NCLDRFKK+RA N ++P       +   NL   +   N++  L   +    VD       
Sbjct: 61   NCLDRFKKIRANNRNNPTPSSSLALNSNNLRPNLNEDNQNHNLNESNAVISVDLHGEPRE 120

Query: 1010 NSNRDTH--VEMQEDTLQTSGAVTLWGSSHETARG-DNSSDSKLKLKCPMCRGDVLGWKV 840
            N+ RD +  +E QE  ++   +  L         G +NSS+S L LKCP+CRG VLGW+V
Sbjct: 121  NNTRDLNRLLETQEGIVEAVDSEPLRERVEVDEFGVENSSESDLSLKCPLCRGTVLGWEV 180

Query: 839  VEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQR 660
            VEEARK+LNLK RSCSRESCSF GNY+ELRRHARR HPT RP+DIDPSR+RAW+RLE+QR
Sbjct: 181  VEEARKHLNLKRRSCSRESCSFSGNYQELRRHARRVHPTTRPSDIDPSRERAWQRLEHQR 240

Query: 659  EYDDIVSAVRSAMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPI 480
            E  D+VSA+RSA+PGAVV+GDYVIEN                 W +T FLFQMIG+MD  
Sbjct: 241  ELGDVVSAIRSAIPGAVVVGDYVIENGDRLGGERAGGDANGPWW-TTLFLFQMIGNMDNA 299

Query: 479  PEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMST 300
             + R  R RA +RHRRS G  S RR  WGENLLGLQDDDD ++    L +LSD  +D S 
Sbjct: 300  GDHR-ARPRAWTRHRRSGGANSDRRLIWGENLLGLQDDDDEDD----LRILSDNGEDTSP 354

Query: 299  NP 294
             P
Sbjct: 355  AP 356


>ref|XP_002521120.1| conserved hypothetical protein [Ricinus communis]
            gi|223539689|gb|EEF41271.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  364 bits (935), Expect = 4e-98
 Identities = 206/382 (53%), Positives = 239/382 (62%), Gaps = 34/382 (8%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            M   KRS     D+  L+ ELD  SCPICMDHPHNAVLLLCSSH+KGCRSYICDTS RHS
Sbjct: 1    MTGVKRSRYTDSDIRTLHNELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSSRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDG----------- 1011
            NCLDR+KKLR  +          N  +    P       N+SD ++  G           
Sbjct: 61   NCLDRYKKLRDSS--------GSNTTLDSSLPINSFSSSNISDTSLTLGARVLDSYENHN 112

Query: 1010 --NSNRDTHVEMQEDTLQTS-------------GAVTLWGSSH-------ETARGDNSSD 897
              +S+  T V M E  L+ S             G +    S         E A   NSS+
Sbjct: 113  QSDSDNITSVRMPEQLLENSIQHPNRQVETRGEGVLEAGDSESFPDRIELEEADVVNSSE 172

Query: 896  SKLKLKCPMCRGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTAR 717
            + L LKCP+CRG VLGW+VVEEARKYLNLK RSCSRESCSF GNY+ELRRHARR HPT R
Sbjct: 173  AGLSLKCPLCRGAVLGWEVVEEARKYLNLKKRSCSRESCSFCGNYQELRRHARRVHPTTR 232

Query: 716  PADIDPSRQRAWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXX 540
            P+D+DPSR+RAWR LE QREY DIVSA+RSAMPGAVV+GDYVIEN               
Sbjct: 233  PSDVDPSRERAWRCLERQREYGDIVSALRSAMPGAVVVGDYVIENGDRFSVEREGGAGEV 292

Query: 539  XXRWLSTFFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDD 360
               W +TFFLFQMIGS+D   E R  RSRA +RHRRS G L  RR+ WGENLLGLQDDD 
Sbjct: 293  NAPWWTTFFLFQMIGSIDGAAEPR-ARSRAWTRHRRSGGALPERRFLWGENLLGLQDDD- 350

Query: 359  SEEDQPNLNLLSDMSDDMSTNP 294
             E+D+ +L++LSD  +D S  P
Sbjct: 351  -EDDEGDLHILSDAGEDASPIP 371


>ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300301 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 385

 Score =  361 bits (926), Expect = 5e-97
 Identities = 196/378 (51%), Positives = 234/378 (61%), Gaps = 30/378 (7%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +  G ++  LYKELD  SCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSD-------------PTVV 1017
            NCLDRFKKLR  N +   +++          P  H    N  D             P ++
Sbjct: 61   NCLDRFKKLRENNTNSQSLVSS--------LPTNHHGSHNTPDMAFGTDLNEANGSPNLI 112

Query: 1016 DGNSNRDTHV--EMQEDTLQTSGAVTLWGS--------------SHETARGDNSSDSKLK 885
            +GN+    ++  + QE  +Q      L                  H     +NSS+S L 
Sbjct: 113  EGNAVTSANIPGQPQERVIQDLNMPLLPEELMGVADSESFQERVEHGELDVENSSESNLS 172

Query: 884  LKCPMCRGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADI 705
            LKCP+CRG +LGW+VVE+ RKYLNLK RSCSRE+CSF GNY+ELRRHARR HP  RP+DI
Sbjct: 173  LKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHARRVHPATRPSDI 232

Query: 704  DPSRQRAWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRW 528
            DPSR+RAWR LE+QRE+ D+VSA+ SA+PGAVV+GDYVIEN                  W
Sbjct: 233  DPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGGESGTGEANGPW 292

Query: 527  LSTFFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEED 348
             +T FLFQMIGS D   E R  R+RA  RHRRS G LS RR  WGENLLGLQDDD+ +ED
Sbjct: 293  WTTMFLFQMIGSADRGGEPR-ARARAWPRHRRSAGALSERRLLWGENLLGLQDDDEDDED 351

Query: 347  QPNLNLLSDMSDDMSTNP 294
                ++L     D+S  P
Sbjct: 352  DGEEDILMLNDRDVSPIP 369


>ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300301 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 439

 Score =  361 bits (926), Expect = 5e-97
 Identities = 196/378 (51%), Positives = 234/378 (61%), Gaps = 30/378 (7%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +  G ++  LYKELD  SCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 55   MAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 114

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSD-------------PTVV 1017
            NCLDRFKKLR  N +   +++          P  H    N  D             P ++
Sbjct: 115  NCLDRFKKLRENNTNSQSLVSS--------LPTNHHGSHNTPDMAFGTDLNEANGSPNLI 166

Query: 1016 DGNSNRDTHV--EMQEDTLQTSGAVTLWGS--------------SHETARGDNSSDSKLK 885
            +GN+    ++  + QE  +Q      L                  H     +NSS+S L 
Sbjct: 167  EGNAVTSANIPGQPQERVIQDLNMPLLPEELMGVADSESFQERVEHGELDVENSSESNLS 226

Query: 884  LKCPMCRGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADI 705
            LKCP+CRG +LGW+VVE+ RKYLNLK RSCSRE+CSF GNY+ELRRHARR HP  RP+DI
Sbjct: 227  LKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHARRVHPATRPSDI 286

Query: 704  DPSRQRAWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRW 528
            DPSR+RAWR LE+QRE+ D+VSA+ SA+PGAVV+GDYVIEN                  W
Sbjct: 287  DPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGGESGTGEANGPW 346

Query: 527  LSTFFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEED 348
             +T FLFQMIGS D   E R  R+RA  RHRRS G LS RR  WGENLLGLQDDD+ +ED
Sbjct: 347  WTTMFLFQMIGSADRGGEPR-ARARAWPRHRRSAGALSERRLLWGENLLGLQDDDEDDED 405

Query: 347  QPNLNLLSDMSDDMSTNP 294
                ++L     D+S  P
Sbjct: 406  DGEEDILMLNDRDVSPIP 423


>ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263112 [Vitis vinifera]
          Length = 347

 Score =  360 bits (923), Expect = 1e-96
 Identities = 198/367 (53%), Positives = 236/367 (64%), Gaps = 4/367 (1%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA +K+SMS   D+H L KE D  SCPICMDHPHNAVLLLCSSH+ GCRSYICDTSYRH+
Sbjct: 1    MAGKKQSMSTDADIHALPKEWDDVSCPICMDHPHNAVLLLCSSHEMGCRSYICDTSYRHA 60

Query: 1157 NCLDRFKKLRAE--NMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDTHVE 984
            NCLDRFK+L A   N    P  +  N   +      +L LR   D T   GN N +    
Sbjct: 61   NCLDRFKRLGANLPNTSLQPSSSTTNQSYSSNASIVNLGLRLGIDSTEAHGNGNPN---- 116

Query: 983  MQEDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLNLKP 804
                  + +G +++          +NSS+  L L CP+CRG VLGWKVVEEAR+ LNLKP
Sbjct: 117  ------EGNGLLSVRIPRRSELNAENSSELSLSLTCPLCRGAVLGWKVVEEARESLNLKP 170

Query: 803  RSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVRSA 624
            RSCSRESCSF GNYRELRRHARR HPT RPADIDPSR+R+WRRLE+QRE+ DI+SA+RSA
Sbjct: 171  RSCSRESCSFSGNYRELRRHARRVHPTTRPADIDPSRERSWRRLEHQREHGDIISAIRSA 230

Query: 623  MPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGGRSRAL 447
            MPGA+VLGDY IE+                  W +TFF FQMIGS++   E R  RSRAL
Sbjct: 231  MPGAIVLGDYAIESEDMLAGGRESGNEEGNGPWWTTFFWFQMIGSINSAAEPR-SRSRAL 289

Query: 446  SRHRRS-NGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMSTNPXXXXXXXX 270
            +R R+S    L+RRR+ WGENLLGLQDDDD          + D+ +D S  P        
Sbjct: 290  TRRRQSARAALTRRRFLWGENLLGLQDDDD----------VDDVGEDASPVPRRRRRLMR 339

Query: 269  XXSDEDQ 249
              S+EDQ
Sbjct: 340  SESNEDQ 346


>ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Capsella rubella]
            gi|565480774|ref|XP_006298027.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566735|gb|EOA30924.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566736|gb|EOA30925.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
          Length = 353

 Score =  350 bits (897), Expect = 1e-93
 Identities = 195/359 (54%), Positives = 229/359 (63%), Gaps = 11/359 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +S   D+H L+KELD  SCP+CMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHP-PIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDTHVEM 981
            NCLDRFKKL +E+ + P P     + +   E+ NEH      S      G+ NR      
Sbjct: 61   NCLDRFKKLHSESPNDPTPEANLASRETNNESQNEH---GTTSRSNFHSGSGNR------ 111

Query: 980  QEDTLQTSGAVTLWGSSHETARGDNSSDSK--LKLKCPMCRGDVLGWKVVEEARKYLNLK 807
                    G+V  + S     R ++   S+    LKCP+CRG VLGWKVVEE R YL+LK
Sbjct: 112  --------GSVGDYESLRRRRRVEDEEQSEDFTNLKCPLCRGTVLGWKVVEEVRTYLDLK 163

Query: 806  PRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVRS 627
             RSCSRESCSF GNY++LRRHARR HPT RP+D DPSR+RAWRRLENQREY DIVSA+RS
Sbjct: 164  NRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERAWRRLENQREYGDIVSAIRS 223

Query: 626  AMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGG----- 462
            AMPGAVV+GDYVIEN                 W +T  LFQMIGS+D    +  G     
Sbjct: 224  AMPGAVVVGDYVIENGDRFPGEREAGNGGSDLW-TTLVLFQMIGSLDSGGPSGSGSGSGS 282

Query: 461  ---RSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMSTNP 294
               RSRA   HRRS    S RRY WGENLLGLQD+ ++ +D+  L L +D  D  +  P
Sbjct: 283  RSHRSRAWRNHRRS----SDRRYLWGENLLGLQDEHNNNDDE-ELRLQNDAGDASTPVP 336


>ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Populus trichocarpa]
            gi|566159410|ref|XP_006386811.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159412|ref|XP_006386812.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159414|ref|XP_006386813.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|222843298|gb|EEE80845.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345588|gb|ERP64608.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345589|gb|ERP64609.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345590|gb|ERP64610.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
          Length = 368

 Score =  345 bits (884), Expect = 3e-92
 Identities = 187/359 (52%), Positives = 232/359 (64%), Gaps = 11/359 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA+ KR ++   D+H L+KELD  SCPIC+D PHNAVLLLCSS++KGC+SYICDTSYRHS
Sbjct: 1    MAALKRRLNTDSDIHALHKELDEVSCPICLDRPHNAVLLLCSSNEKGCKSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNR------- 999
            NCLD+FKK R  +  +  +  Q ++ I   + +   +          DGN N        
Sbjct: 61   NCLDQFKKSRGNSRSNATL--QSSMPINSVSSSTTTDASMTLRTHAFDGNENHNLNEISN 118

Query: 998  DTHVEMQEDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKY 819
            DT V + E+ + +          HE     NS +  L   CP+CRG +LGW+VV+EARKY
Sbjct: 119  DTFVRLPEELVDSESVQER--IEHEGVNA-NSPELSLSPGCPLCRGTILGWEVVDEARKY 175

Query: 818  LNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVS 639
            LNLK RSCSRESCSF GNY+ELRRHARR HPT RP+DIDPSR+RAWR LE+QREY DIVS
Sbjct: 176  LNLKKRSCSRESCSFSGNYQELRRHARRVHPTIRPSDIDPSRERAWRCLEHQREYGDIVS 235

Query: 638  AVRSAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGG 462
            AV SAMPGAVV+GDY+IEN                  W +TFF FQMIGS+D   E R  
Sbjct: 236  AVHSAMPGAVVVGDYIIENGDRLSVERESRTNEVNAPWWTTFFFFQMIGSIDGAAEPRTW 295

Query: 461  RSRALSRHRRSNGPLSRRRYPWGENLLGLQD---DDDSEEDQPNLNLLSDMSDDMSTNP 294
             SRA +RHR+S   L+ RR+ WGENLLGL D   DDD ++D   L++L +  +D S  P
Sbjct: 296  -SRAWTRHRQSAETLADRRFLWGENLLGLHDNDADDDDDDDNGYLHVLGNAGEDASPIP 353


>gb|EMJ02549.1| hypothetical protein PRUPE_ppa007686mg [Prunus persica]
          Length = 359

 Score =  344 bits (883), Expect = 4e-92
 Identities = 193/370 (52%), Positives = 232/370 (62%), Gaps = 36/370 (9%)
 Frame = -2

Query: 1250 MDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRFKKLRAENMDHPPIMTQGNLDIAV 1071
            MDHPHNAVLLLCSSH+KGCRSYICDTSYRHSNCLDRFKKLR    + P + +        
Sbjct: 1    MDHPHNAVLLLCSSHEKGCRSYICDTSYRHSNCLDRFKKLRENTRNSPTLPSS------- 53

Query: 1070 ETPNEHLELRNLSD---PTVVDGNSNRDTHVEMQEDTL---------------------- 966
              P  H +  N+SD   P   + N   +TH  ++ +TL                      
Sbjct: 54   -LPVNHSDSHNISDLNIPLRTESNEANETHNLIESNTLISVNLSGPPQGHIIQDLNRPLE 112

Query: 965  -QTSGAVTLWGSS-------HETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLNL 810
             QT G + +  S        H+   G+NS DSKL LKCP+CRG +LGW+VVE+ RKYLNL
Sbjct: 113  AQTEGVLEVADSESFQERVEHDELDGENSPDSKLSLKCPLCRGAILGWEVVEDVRKYLNL 172

Query: 809  KPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVR 630
            K RSCSRESCSF GNY+ELRRHARR HPT RP+DIDPSR+RAW+ LE+QRE+ D+VSA+ 
Sbjct: 173  KKRSCSRESCSFSGNYQELRRHARRVHPTTRPSDIDPSRERAWQHLEHQREFGDVVSAIH 232

Query: 629  SAMPGAVVLGDYVIEN-XXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGGRSR 453
            SA+PGAVV+GDYVIEN                  W +T FLFQMIGS+D   E R  RSR
Sbjct: 233  SAIPGAVVVGDYVIENGDRLAGGGESGAGEANGPWWTTLFLFQMIGSVDRAGEQR-ARSR 291

Query: 452  ALSRHRRSNGPLSRRRYPWGENLLGLQDD--DDSEEDQPNLNLLSDMSDDMSTNPXXXXX 279
            A +RHRRS G LS RR+ WGENLLGLQDD  DD ++D  NL +L+    D+S  P     
Sbjct: 292  AWTRHRRS-GALSERRFLWGENLLGLQDDEVDDEDDDDENLPILNHR--DLSPIPRRRRR 348

Query: 278  XXXXXSDEDQ 249
                 SDED+
Sbjct: 349  LTRSRSDEDR 358


>ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791202 isoform X1 [Glycine
            max] gi|571474560|ref|XP_006586265.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X2 [Glycine
            max] gi|571474562|ref|XP_006586266.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X3 [Glycine
            max] gi|571474564|ref|XP_006586267.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X4 [Glycine
            max] gi|571474566|ref|XP_006586268.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X5 [Glycine
            max] gi|571474568|ref|XP_006586269.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X6 [Glycine
            max]
          Length = 350

 Score =  343 bits (879), Expect = 1e-91
 Identities = 192/372 (51%), Positives = 233/372 (62%), Gaps = 9/372 (2%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR + +  D+H L+KELD  SCPICMDHPHNAVLLLCSSH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLR---AENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDTHV 987
            NCLDRFKK+R    EN + P  +        V T N      +  DP     N   D H 
Sbjct: 61   NCLDRFKKMRDNFKENQNLPSSL--------VNTNNSGSRQGDAQDP-----NRLLDQH- 106

Query: 986  EMQEDTLQTSGAVTLWGSSH-ETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLNL 810
               E  L+T+ +  L   +  E    DNSS+SKL LKCP+CRG VL WKVVEEAR YLN+
Sbjct: 107  --DEGILETADSENLQDRAVIEDLNADNSSESKLNLKCPLCRGAVLNWKVVEEARNYLNM 164

Query: 809  KPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVR 630
            K RSCSR+SCSF+G+Y ELRRHARR HPT+RP++IDP+R+RAWR  E+QREY DIVSA++
Sbjct: 165  KKRSCSRDSCSFVGDYLELRRHARRVHPTSRPSNIDPTRERAWRHFEDQREYGDIVSAIQ 224

Query: 629  SAMPGAVVLGDYVIEN-----XXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARG 465
            SA+PGAV++GDYV+EN                      WL+T  LFQM+ S   I     
Sbjct: 225  SAVPGAVLVGDYVLENGDGIGRLPDERAEGNIGNANGPWLTTTILFQMMDSTVEIVREPR 284

Query: 464  GRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMSTNPXXX 285
              S A +RHRRS+    RRRY WGENLLGL D+D  ++    L +  D  +D S  P   
Sbjct: 285  AHSSAWTRHRRSD---ERRRYLWGENLLGLHDNDIEDD----LRIFRDAGEDASPVPRRR 337

Query: 284  XXXXXXXSDEDQ 249
                   S+EDQ
Sbjct: 338  RRLTRTRSNEDQ 349


>ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arabidopsis lyrata subsp.
            lyrata] gi|297329394|gb|EFH59813.1| hypothetical protein
            ARALYDRAFT_479993 [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  342 bits (878), Expect = 2e-91
 Identities = 190/343 (55%), Positives = 224/343 (65%), Gaps = 12/343 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +S   D+H L+KELD  SCP+CMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDTHVEMQ 978
            NCLDRFKKL +E+ + P    +GNL  + E  NE L            G ++R +    +
Sbjct: 61   NCLDRFKKLHSESPNDPT--PEGNL-ASRENNNESLNEH---------GTASRSSF--HR 106

Query: 977  EDTLQTSGAVTLWGSSHETARG----DNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLNL 810
            E T + S     W S     R     +  S+    LKCP+CRG VLGWKVVEE R YL+L
Sbjct: 107  ESTNRGSA----WDSESLRRRRRVDEEEQSEDITNLKCPLCRGTVLGWKVVEEVRTYLDL 162

Query: 809  KPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVR 630
            K RSCSRESCSF GNY++LRRHARR HPT RP+D DPSR+RAWR LENQREY DIVSA+R
Sbjct: 163  KNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERAWRHLENQREYGDIVSAIR 222

Query: 629  SAMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGG---- 462
            SAMPGAVV+GDYVIEN                 W +T  LFQMIGS+D    +  G    
Sbjct: 223  SAMPGAVVVGDYVIENGDRFSGERETGNGGSDLW-TTLVLFQMIGSLDNGGSSASGSGGG 281

Query: 461  ----RSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQ 345
                RSRA   HRRS+   S RRY WGENLLGLQ++ ++ +D+
Sbjct: 282  SRSHRSRAWRNHRRSS---SDRRYLWGENLLGLQEEHNNNDDE 321


>gb|EPS72073.1| hypothetical protein M569_02687, partial [Genlisea aurea]
          Length = 344

 Score =  342 bits (877), Expect = 2e-91
 Identities = 189/366 (51%), Positives = 232/366 (63%), Gaps = 23/366 (6%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MASRKRS+SN  DM    KE D ASCPIC+DHPHNAVL++CSSHDKGCRS+ICDTSYRHS
Sbjct: 1    MASRKRSLSNDADMSAQQKEWDEASCPICLDHPHNAVLIICSSHDKGCRSFICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDTHVEMQ 978
            NCLDRFKKL+ +N++ P          A  + + H       D   V+ +S R T VE +
Sbjct: 61   NCLDRFKKLKQDNIELP----------ATSSISGH-------DHDSVNSSSRRRT-VEFE 102

Query: 977  EDTLQTSGAVTLWGSSHETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLNLKPRS 798
            +      GA+  W            S  ++ L+CP+CRG+VLGWKVVE+ RKYLNLKPRS
Sbjct: 103  DQ----EGAL-FWERLGSGESNTEKSAEQVSLRCPLCRGNVLGWKVVEDVRKYLNLKPRS 157

Query: 797  CSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAVRSAMP 618
            CSRESCSF GNY ELRRHAR+DHPT  PAD+DPSRQR W+ LE+QRE +DIVSA+RSAMP
Sbjct: 158  CSRESCSFTGNYGELRRHARKDHPTVCPADVDPSRQREWQHLEDQRELNDIVSAIRSAMP 217

Query: 617  GAVVLGDYVIE---NXXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGGRSRAL 447
            GA+++GDY IE   +                RWLST FLFQMIG+++     RGGRSR  
Sbjct: 218  GAILVGDYAIESSGDRPSRERIRSENAAERGRWLSTLFLFQMIGALEDGAPRRGGRSRG- 276

Query: 446  SRHRRSN-------GPLSRRRYPWGENLLGLQDDDDSEED-------------QPNLNLL 327
              HRR+          +    Y WGENLLGLQD+   EED             + +LN+ 
Sbjct: 277  --HRRAEQQQQQPVPAVRHHHYLWGENLLGLQDEAVDEEDEVADEEENGEESEEQDLNVP 334

Query: 326  SDMSDD 309
            SD+ D+
Sbjct: 335  SDLGDN 340


>ref|NP_189118.1| uncharacterized protein [Arabidopsis thaliana]
            gi|79313363|ref|NP_001030761.1| uncharacterized protein
            [Arabidopsis thaliana] gi|11994657|dbj|BAB02885.1|
            unnamed protein product [Arabidopsis thaliana]
            gi|19715579|gb|AAL91615.1| AT3g24740/K7P8_3 [Arabidopsis
            thaliana] gi|20334910|gb|AAM16211.1| AT3g24740/K7P8_3
            [Arabidopsis thaliana] gi|332643420|gb|AEE76941.1|
            uncharacterized protein AT3G24740 [Arabidopsis thaliana]
            gi|332643421|gb|AEE76942.1| uncharacterized protein
            AT3G24740 [Arabidopsis thaliana]
          Length = 354

 Score =  339 bits (869), Expect = 2e-90
 Identities = 184/344 (53%), Positives = 222/344 (64%), Gaps = 13/344 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR +S   D+H L+KELD  SCP+CMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVD-GNSNRDTHVEM 981
            NCLDRFKKL +E+ + P             TP  +L  R  ++ ++ + G ++R +    
Sbjct: 61   NCLDRFKKLHSESANDP-------------TPEANLASREHNNESLYEHGTASRSSFHR- 106

Query: 980  QEDTLQTSGAVTLWGSSHETARG----DNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLN 813
                 ++    + W S     R     +  S+    LKCP+CRG VLGWKVVEE R YL+
Sbjct: 107  -----ESGNRGSSWDSESLRRRRRVEEEVESEDITNLKCPLCRGTVLGWKVVEEVRTYLD 161

Query: 812  LKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAV 633
             K RSCSRESCSF GNY++LRRHARR HPT RP+D DPSR+RAWRRLENQREY DIVSA+
Sbjct: 162  HKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERAWRRLENQREYGDIVSAI 221

Query: 632  RSAMPGAVVLGDYVIENXXXXXXXXXXXXXXXXRWLSTFFLFQMIGSMDPIPEARGG--- 462
            RSAMPGAVV+GDYVIEN                 W +T  LFQMIGS+D    +  G   
Sbjct: 222  RSAMPGAVVVGDYVIENGDRFAGERETGNGGSDLW-TTLLLFQMIGSLDNGGSSASGSGG 280

Query: 461  -----RSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQ 345
                 RSRA   HRRS+   S R Y WGENLLGLQD+ ++ +D+
Sbjct: 281  GSRSHRSRAWRNHRRSS---SDRPYLWGENLLGLQDERNNNDDE 321


>ref|XP_004492331.1| PREDICTED: uncharacterized protein LOC101499234 isoform X1 [Cicer
            arietinum] gi|502103643|ref|XP_004492332.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X2 [Cicer
            arietinum] gi|502103648|ref|XP_004492333.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X3 [Cicer
            arietinum] gi|502103652|ref|XP_004492334.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X4 [Cicer
            arietinum]
          Length = 354

 Score =  338 bits (868), Expect = 2e-90
 Identities = 189/375 (50%), Positives = 244/375 (65%), Gaps = 12/375 (3%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR + +  D+H L+KELD  SCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS
Sbjct: 1    MAGFKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLRAENMDHPPIMTQGNLDIAVETPNEHLELRNLSDPTVVDGNSNRDT--HVE 984
            NCLDRFKK+R               D + E PN    L N ++     G++ +D   H++
Sbjct: 61   NCLDRFKKMR---------------DNSKENPNLPSSLINTNNSGSRQGDAAQDPSRHLD 105

Query: 983  MQED-TLQTSGAVTLWGSS--HETARGDNSSDSKLKLKCPMCRGDVLGWKVVEEARKYLN 813
              ++  L+T+ + TL   +   +    +NSSDS L L+CP+CRG VLGW+V+EEAR YLN
Sbjct: 106  QHDEGILETAESETLQDRAVLEDLDVDNNSSDSILSLQCPLCRGTVLGWEVIEEARNYLN 165

Query: 812  LKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQRAWRRLENQREYDDIVSAV 633
             K RSCSR+SCSF G+Y ELRRHARR HPT+RP+D+DP+R++AW++ E QREY DIVSA+
Sbjct: 166  NKKRSCSRDSCSFAGDYLELRRHARRVHPTSRPSDVDPTREQAWQQFERQREYGDIVSAI 225

Query: 632  RSAMPGAVVLGDYVIEN----XXXXXXXXXXXXXXXXRWL--STFFLFQMI-GSMDPIPE 474
            +SA+PGAVV+GDYV+EN                     WL  +T  LFQM+  +++ + E
Sbjct: 226  QSAIPGAVVVGDYVLENGDGIGRLSGDRDGNNGNGNGPWLTTTTTILFQMMDNTIEIVRE 285

Query: 473  ARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPNLNLLSDMSDDMSTNP 294
             R   S A SRHRRS+    RRRY WGENLLGLQD++  E+    L + +D+ +D ST P
Sbjct: 286  PRARSSSAWSRHRRSS---DRRRYLWGENLLGLQDNEVEED----LRIFNDLVEDASTVP 338

Query: 293  XXXXXXXXXXSDEDQ 249
                      S+EDQ
Sbjct: 339  RRRRRLNRTRSNEDQ 353


>gb|ESW12557.1| hypothetical protein PHAVU_008G123200g [Phaseolus vulgaris]
          Length = 385

 Score =  336 bits (862), Expect = 1e-89
 Identities = 189/390 (48%), Positives = 242/390 (62%), Gaps = 27/390 (6%)
 Frame = -2

Query: 1337 MASRKRSMSNGVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 1158
            MA  KR + +  D+H L+KELD  SCPICMDHPHNAVLLLCSSH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1157 NCLDRFKKLR---AENMDHPPIMTQGN-------LDIAVETPNEHL------ELRNLSDP 1026
            NCLDRFKK+R    EN + P  +   N       ++I +++    +      E+  L   
Sbjct: 61   NCLDRFKKMRDNSKENENLPSSLVNTNNSGNSFDINITMQSDMHDVNELHENEINTLLSV 120

Query: 1025 TVVDGNSNRDT-----HVEMQED-TLQTSGAVTLWGSSH-ETARGDNSSDSKLKLKCPMC 867
             +  G+   D      H++  ++  L+T+ + TL   +  E    DNSS+SKLKLKCP+C
Sbjct: 121  GLAQGSRQGDAQDPSRHLDPHDEGILETADSETLQDRAVLEDLGADNSSESKLKLKCPLC 180

Query: 866  RGDVLGWKVVEEARKYLNLKPRSCSRESCSFIGNYRELRRHARRDHPTARPADIDPSRQR 687
            RG VL W+V EEAR YLN+K RSCSR+SCSF+G Y ELRRHARR HPT+RP+DIDP+R+R
Sbjct: 181  RGAVLSWEVDEEARNYLNVKKRSCSRDSCSFVGGYLELRRHARRVHPTSRPSDIDPTRER 240

Query: 686  AWRRLENQREYDDIVSAVRSAMPGAVVLGDYVIEN----XXXXXXXXXXXXXXXXRWLST 519
            AWR  E QREY DI+SA++SAMPGAV++GDYV+EN                     WL+T
Sbjct: 241  AWRHFERQREYGDIMSAIQSAMPGAVLVGDYVLENGDGIGRLSDEREGNISNANGPWLTT 300

Query: 518  FFLFQMIGSMDPIPEARGGRSRALSRHRRSNGPLSRRRYPWGENLLGLQDDDDSEEDQPN 339
              LFQ++ S   I       +   SRHRRS+    RRRY WGENLLGL ++D  ++    
Sbjct: 301  TILFQVMDSTIEIVREPRAHASTWSRHRRSS---ERRRYLWGENLLGLNENDIEDD---- 353

Query: 338  LNLLSDMSDDMSTNPXXXXXXXXXXSDEDQ 249
            L + SD  +D S  P          S+EDQ
Sbjct: 354  LRIFSDAGEDPSPVPRRRRRLTRTRSNEDQ 383

