BLASTX nr result

ID: Catharanthus23_contig00000465 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00000465
         (2400 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350310.1| PREDICTED: uncharacterized protein LOC102591...   400   e-108
ref|XP_004247100.1| PREDICTED: uncharacterized protein LOC101261...   397   e-107
ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citr...   347   1e-92
ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citr...   347   1e-92
ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citr...   341   8e-91
ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263...   337   1e-89
gb|EOX93978.1| B3 domain-containing transcription factor VAL3, p...   333   3e-88
ref|XP_002521120.1| conserved hypothetical protein [Ricinus comm...   330   2e-87
gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]     328   5e-87
ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Popu...   310   2e-81
ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791...   306   4e-80
gb|EPS72073.1| hypothetical protein M569_02687, partial [Genlise...   305   6e-80
ref|XP_006602548.1| PREDICTED: uncharacterized protein LOC100807...   305   6e-80
ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300...   305   8e-80
ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300...   305   8e-80
ref|XP_006602541.1| PREDICTED: uncharacterized protein LOC100807...   303   2e-79
gb|ESW12557.1| hypothetical protein PHAVU_008G123200g [Phaseolus...   303   3e-79
ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Caps...   303   3e-79
ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arab...   303   3e-79
ref|XP_004492331.1| PREDICTED: uncharacterized protein LOC101499...   302   5e-79

>ref|XP_006350310.1| PREDICTED: uncharacterized protein LOC102591236 isoform X1 [Solanum
            tuberosum] gi|565367302|ref|XP_006350311.1| PREDICTED:
            uncharacterized protein LOC102591236 isoform X2 [Solanum
            tuberosum] gi|565367304|ref|XP_006350312.1| PREDICTED:
            uncharacterized protein LOC102591236 isoform X3 [Solanum
            tuberosum]
          Length = 385

 Score =  400 bits (1028), Expect = e-108
 Identities = 224/396 (56%), Positives = 261/396 (65%), Gaps = 9/396 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MASRKRSMS+D DMH LYKE D ASCPICMDHPHNAVLLLC+SHDKGCRSYICDTSY+HS
Sbjct: 1    MASRKRSMSNDVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYKHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMD-NTNSGNIGLGTSGNPSEVEGGNDVRIID 833
            NCLDRF+KLK EN DN   P + ++  L++   T + ++ L    + + V G +D+   +
Sbjct: 61   NCLDRFKKLKAENRDN---PPIMTQGNLDIAVETPAEHLELKNLSDRTVVHGYHDIPANE 117

Query: 832  LIATEDLSSGLEENSNHAAHNPLGVHDE--------TXXXXXXXXXXXXXXXXXXSKLRC 677
            ++AT     G EEN N    N + + +         T                   KL+C
Sbjct: 118  VVATGAFPGGSEENGNSNRDNRMEMQEGGLQTSDAVTVWGSSHETANADNSSDSILKLKC 177

Query: 676  PLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPT 497
            P+CRG +LGWKVVEEAR+YLN K RSCSRESCSF GNYREL             AD+DP+
Sbjct: 178  PMCRGDVLGWKVVEEARKYLNLKHRSCSRESCSFLGNYRELRRHARRDHPTARPADIDPS 237

Query: 496  RERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLSXXXXXXXXXXXRWLSTF 317
            R+RAWRRLENQREYDDIVSA+RSAMPGAVV GDYVIENG RLS           RWLSTF
Sbjct: 238  RQRAWRRLENQREYDDIVSAVRSAMPGAVVFGDYVIENGDRLSGERERGSGANGRWLSTF 297

Query: 316  FLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXX 137
            FLFQMIGSM+P++E RGGRSRALSRHRRST G   RRR+ WGENLLGL+           
Sbjct: 298  FLFQMIGSMDPISEARGGRSRALSRHRRST-GPLSRRRYPWGENLLGLQ--DHDNNEDEG 354

Query: 136  EPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQQ 29
            EP+ N LS    D S N       RLM SR+DEDQQ
Sbjct: 355  EPDLNILSG---DMSNN--PRRRRRLMRSRSDEDQQ 385


>ref|XP_004247100.1| PREDICTED: uncharacterized protein LOC101261359 isoform 1 [Solanum
            lycopersicum] gi|460403239|ref|XP_004247101.1| PREDICTED:
            uncharacterized protein LOC101261359 isoform 2 [Solanum
            lycopersicum] gi|460403241|ref|XP_004247102.1| PREDICTED:
            uncharacterized protein LOC101261359 isoform 3 [Solanum
            lycopersicum]
          Length = 385

 Score =  397 bits (1020), Expect = e-107
 Identities = 224/398 (56%), Positives = 258/398 (64%), Gaps = 11/398 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MASRKRSMS+D DMH LYKE D ASCPICMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 1    MASRKRSMSNDVDMHVLYKELDGASCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMD---NTNSGNIGLGTSGNPSEVEGGNDVRI 839
            NCLDRF+KLK EN DN  + +     Q N+D      + ++ L    + + V G +D+  
Sbjct: 61   NCLDRFKKLKAENRDNPPTMT-----QGNLDIAVEIPAEHLELRNLSDRTVVHGYHDIPA 115

Query: 838  IDLIATEDLSSGLEENSNHAAHNPLGVHDE--------TXXXXXXXXXXXXXXXXXXSKL 683
             +++AT     G EEN N    N + + +         T                   KL
Sbjct: 116  DEVVATGAFPGGSEENGNSNRDNRMEMQEGALQTSDAVTVWGSSHETVNADNSSDSILKL 175

Query: 682  RCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVD 503
            +CP+CRG +LGWKVVEEAR+YLN K RSCSRESCSF GNYREL             AD+D
Sbjct: 176  KCPMCRGDVLGWKVVEEARKYLNLKHRSCSRESCSFLGNYRELRRHARRDHPTARPADID 235

Query: 502  PTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLSXXXXXXXXXXXRWLS 323
            P+R+RAWRRLENQREYDDIVSA+RSAMPGAVV GDYVIENG RLS           RWLS
Sbjct: 236  PSRQRAWRRLENQREYDDIVSAVRSAMPGAVVFGDYVIENGDRLSVERERGSGANGRWLS 295

Query: 322  TFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXX 143
            TFFLFQM GSM+P++E RGGRSRALSRHRRST G   RRR+ WGENLLGL+         
Sbjct: 296  TFFLFQMFGSMDPISEARGGRSRALSRHRRST-GPLSRRRYPWGENLLGLQ--DHNNNED 352

Query: 142  XXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQQ 29
              EP+ N LS    D S N       RLM SR+DEDQQ
Sbjct: 353  EGEPDVNILSG---DMSNN--PRRRRRLMRSRSDEDQQ 385


>ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902306|ref|XP_006443641.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902312|ref|XP_006443644.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902314|ref|XP_006443645.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902316|ref|XP_006443646.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902318|ref|XP_006443647.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853098|ref|XP_006480204.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X2 [Citrus
            sinensis] gi|568853100|ref|XP_006480205.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X3 [Citrus
            sinensis] gi|557545901|gb|ESR56879.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545903|gb|ESR56881.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545906|gb|ESR56884.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545907|gb|ESR56885.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545908|gb|ESR56886.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545909|gb|ESR56887.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 389

 Score =  347 bits (890), Expect = 1e-92
 Identities = 203/410 (49%), Positives = 249/410 (60%), Gaps = 24/410 (5%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR M  D+D+HAL+KE D+ SCPICMDHPHNAVLL+C+SHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNST----SPS-------------------LNSRDQLNMDNTNSGN 899
            NCLDR++KL+  + +N+T    SPS                   + S + LN++ +N+ +
Sbjct: 61   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNSNASDMNLALRTDFVESSENLNLNGSNALS 120

Query: 898  IGLGTSGNPSEVEGGNDVRIIDLIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXX 719
             GL       E  G N+++  D +    L    E N N  A N    H+ T         
Sbjct: 121  DGL------PEGPGENNIQQADRL----LEREGEGNLNPEAGNSQTFHERTELEGLDVDN 170

Query: 718  XXXXXXXXXSKLRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXX 539
                       L+CP+CRG+ILGW+VVEEAR+YLN K R+CSRESCSF GNY+EL     
Sbjct: 171  SSESILT----LKCPMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHAR 226

Query: 538  XXXXXXXXADVDPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLS-XX 362
                    +D+DP+RERAWRRLE+QREY DIVSAIRS+MPGAVV+GDYVIENG R S   
Sbjct: 227  RAHPTTRPSDIDPSRERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGR 286

Query: 361  XXXXXXXXXRWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENL 182
                      W +TFFLF MIGSM+   E R  RSRA +RHRR+ G    RRRFLWGENL
Sbjct: 287  ESGNGEVNAPWWTTFFLFHMIGSMDGTGESR-ARSRAWTRHRRTAGALSERRRFLWGENL 345

Query: 181  LGLRXXXXXXXXXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
            LGL+           E + +  SD+GEDTSP        RL  SR+DEDQ
Sbjct: 346  LGLQ-----DEEDDEEDDLHIFSDVGEDTSP--IPRRRRRLTQSRSDEDQ 388


>ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902304|ref|XP_006443640.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902308|ref|XP_006443642.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853096|ref|XP_006480203.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X1 [Citrus
            sinensis] gi|557545900|gb|ESR56878.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545902|gb|ESR56880.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545904|gb|ESR56882.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 415

 Score =  347 bits (890), Expect = 1e-92
 Identities = 203/410 (49%), Positives = 249/410 (60%), Gaps = 24/410 (5%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR M  D+D+HAL+KE D+ SCPICMDHPHNAVLL+C+SHDKGCRSYICDTSYRHS
Sbjct: 27   MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 86

Query: 1009 NCLDRFRKLKDENGDNST----SPS-------------------LNSRDQLNMDNTNSGN 899
            NCLDR++KL+  + +N+T    SPS                   + S + LN++ +N+ +
Sbjct: 87   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNSNASDMNLALRTDFVESSENLNLNGSNALS 146

Query: 898  IGLGTSGNPSEVEGGNDVRIIDLIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXX 719
             GL       E  G N+++  D +    L    E N N  A N    H+ T         
Sbjct: 147  DGL------PEGPGENNIQQADRL----LEREGEGNLNPEAGNSQTFHERTELEGLDVDN 196

Query: 718  XXXXXXXXXSKLRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXX 539
                       L+CP+CRG+ILGW+VVEEAR+YLN K R+CSRESCSF GNY+EL     
Sbjct: 197  SSESILT----LKCPMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHAR 252

Query: 538  XXXXXXXXADVDPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLS-XX 362
                    +D+DP+RERAWRRLE+QREY DIVSAIRS+MPGAVV+GDYVIENG R S   
Sbjct: 253  RAHPTTRPSDIDPSRERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGR 312

Query: 361  XXXXXXXXXRWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENL 182
                      W +TFFLF MIGSM+   E R  RSRA +RHRR+ G    RRRFLWGENL
Sbjct: 313  ESGNGEVNAPWWTTFFLFHMIGSMDGTGESR-ARSRAWTRHRRTAGALSERRRFLWGENL 371

Query: 181  LGLRXXXXXXXXXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
            LGL+           E + +  SD+GEDTSP        RL  SR+DEDQ
Sbjct: 372  LGLQ-----DEEDDEEDDLHIFSDVGEDTSP--IPRRRRRLTQSRSDEDQ 414


>ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|557545905|gb|ESR56883.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 381

 Score =  341 bits (875), Expect = 8e-91
 Identities = 198/388 (51%), Positives = 237/388 (61%), Gaps = 2/388 (0%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR M  D+D+HAL+KE D+ SCPICMDHPHNAVLL+C+SHDKGCRSYICDTSYRHS
Sbjct: 27   MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 86

Query: 1009 NCLDRFRKLKDENGDNST-SPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIID 833
            NCLDR++KL+  + +N+T S S  S  Q N                     G N+++  D
Sbjct: 87   NCLDRYKKLRTSSRNNTTLSHSSPSHPQHNKG------------------PGENNIQQAD 128

Query: 832  LIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGSIL 653
             +    L    E N N  A N    H+ T                    L+CP+CRG+IL
Sbjct: 129  RL----LEREGEGNLNPEAGNSQTFHERTELEGLDVDNSSESILT----LKCPMCRGAIL 180

Query: 652  GWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWRRL 473
            GW+VVEEAR+YLN K R+CSRESCSF GNY+EL             +D+DP+RERAWRRL
Sbjct: 181  GWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHARRAHPTTRPSDIDPSRERAWRRL 240

Query: 472  ENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLS-XXXXXXXXXXXRWLSTFFLFQMIG 296
            E+QREY DIVSAIRS+MPGAVV+GDYVIENG R S             W +TFFLF MIG
Sbjct: 241  EHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGRESGNGEVNAPWWTTFFLFHMIG 300

Query: 295  SMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXXEPETNAL 116
            SM+   E R  RSRA +RHRR+ G    RRRFLWGENLLGL+           E + +  
Sbjct: 301  SMDGTGESR-ARSRAWTRHRRTAGALSERRRFLWGENLLGLQ-----DEEDDEEDDLHIF 354

Query: 115  SDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
            SD+GEDTSP        RL  SR+DEDQ
Sbjct: 355  SDVGEDTSP--IPRRRRRLTQSRSDEDQ 380


>ref|XP_002265815.1| PREDICTED: uncharacterized protein LOC100263112 [Vitis vinifera]
          Length = 347

 Score =  337 bits (865), Expect = 1e-89
 Identities = 197/400 (49%), Positives = 242/400 (60%), Gaps = 14/400 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA +K+SMS DAD+HAL KEWDD SCPICMDHPHNAVLLLC+SH+ GCRSYICDTSYRH+
Sbjct: 1    MAGKKQSMSTDADIHALPKEWDDVSCPICMDHPHNAVLLLCSSHEMGCRSYICDTSYRHA 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGT---------SGNPSEVEG 857
            NCLDRF++L     + S  PS ++ +Q    N +  N+GL           +GNP+E   
Sbjct: 61   NCLDRFKRLGANLPNTSLQPSSSTTNQSYSSNASIVNLGLRLGIDSTEAHGNGNPNE--- 117

Query: 856  GNDVRIIDLIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRC 677
            GN +  + +    +L++   ENS+  +                              L C
Sbjct: 118  GNGLLSVRIPRRSELNA---ENSSELS----------------------------LSLTC 146

Query: 676  PLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPT 497
            PLCRG++LGWKVVEEAR  LN K RSCSRESCSFSGNYREL             AD+DP+
Sbjct: 147  PLCRGAVLGWKVVEEARESLNLKPRSCSRESCSFSGNYRELRRHARRVHPTTRPADIDPS 206

Query: 496  RERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIEN-----GGRLSXXXXXXXXXXXR 332
            RER+WRRLE+QRE+ DI+SAIRSAMPGA+VLGDY IE+     GGR S            
Sbjct: 207  RERSWRRLEHQREHGDIISAIRSAMPGAIVLGDYAIESEDMLAGGRES----GNEEGNGP 262

Query: 331  WLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXX 152
            W +TFF FQMIGS+   AE R  RSRAL+R R+S   A  RRRFLWGENLLGL+      
Sbjct: 263  WWTTFFWFQMIGSINSAAEPR-SRSRALTRRRQSARAALTRRRFLWGENLLGLQ------ 315

Query: 151  XXXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
                   + + + D+GED SP        RLM S ++EDQ
Sbjct: 316  -------DDDDVDDVGEDASP--VPRRRRRLMRSESNEDQ 346


>gb|EOX93978.1| B3 domain-containing transcription factor VAL3, putative isoform 1
            [Theobroma cacao] gi|508702083|gb|EOX93979.1| B3
            domain-containing transcription factor VAL3, putative
            isoform 1 [Theobroma cacao]
          Length = 377

 Score =  333 bits (853), Expect = 3e-88
 Identities = 193/393 (49%), Positives = 239/393 (60%), Gaps = 7/393 (1%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+ AL+KE D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRIITDSDIRALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIIDL 830
            NCLDR++KL+     +S SP L      N  N+++ ++ L        +EG     + + 
Sbjct: 61   NCLDRYKKLR---AYSSKSPMLPHPIPQNRQNSSTSDMNLAL--RTDFIEGNGSRNLNET 115

Query: 829  IATEDLSSG-LEENSNHAAHNPLGV-----HDETXXXXXXXXXXXXXXXXXXSKLRCPLC 668
             +T   S G ++E + H      G+      D +                  S L+CPLC
Sbjct: 116  NSTPGRSEGNIQEPNRHLDSQGEGIIEIGDSDSSQGRAESEELDAENTSESKSSLKCPLC 175

Query: 667  RGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRER 488
            RG I GW+VVEEAR YLN K RSCSRESC+++GNY+EL             +D+DP+RER
Sbjct: 176  RGDIHGWEVVEEARMYLNLKKRSCSRESCAYNGNYQELRRHARRVHPTTRPSDIDPSRER 235

Query: 487  AWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRL-SXXXXXXXXXXXRWLSTFFL 311
             WRRLE+QREY DIVSAIRSAMPGA+V+GDY IENG RL +            W +TFFL
Sbjct: 236  DWRRLEHQREYGDIVSAIRSAMPGAIVVGDYAIENGDRLAADRDSGTGEESAPWWTTFFL 295

Query: 310  FQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXXEP 131
            FQMIGS++ V E R  RSR  SRHRR   GA   RRFLWGENLLGL+           + 
Sbjct: 296  FQMIGSIDSVGEPR-ARSRVWSRHRR-PAGALSERRFLWGENLLGLQ--------DDDDD 345

Query: 130  ETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
            +   LSD+GED SPN       RL  SR+DEDQ
Sbjct: 346  DLRILSDVGEDPSPN--PRRRRRLTRSRSDEDQ 376


>ref|XP_002521120.1| conserved hypothetical protein [Ricinus communis]
            gi|223539689|gb|EEF41271.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  330 bits (846), Expect = 2e-87
 Identities = 191/397 (48%), Positives = 237/397 (59%), Gaps = 13/397 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            M   KRS   D+D+  L+ E D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTS RHS
Sbjct: 1    MTGVKRSRYTDSDIRTLHNELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSSRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGT-------SGNPSEVEGGN 851
            NCLDR++KL+D +G N+T  S    +  +  N +  ++ LG        + N S+ +   
Sbjct: 61   NCLDRYKKLRDSSGSNTTLDSSLPINSFSSSNISDTSLTLGARVLDSYENHNQSDSDNIT 120

Query: 850  DVRIIDLIATEDLSSGLEENSNHAAHNPLGV-----HDETXXXXXXXXXXXXXXXXXXSK 686
             VR+ + +    L + ++  +        GV      +                      
Sbjct: 121  SVRMPEQL----LENSIQHPNRQVETRGEGVLEAGDSESFPDRIELEEADVVNSSEAGLS 176

Query: 685  LRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADV 506
            L+CPLCRG++LGW+VVEEAR+YLN K RSCSRESCSF GNY+EL             +DV
Sbjct: 177  LKCPLCRGAVLGWEVVEEARKYLNLKKRSCSRESCSFCGNYQELRRHARRVHPTTRPSDV 236

Query: 505  DPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLS-XXXXXXXXXXXRW 329
            DP+RERAWR LE QREY DIVSA+RSAMPGAVV+GDYVIENG R S             W
Sbjct: 237  DPSRERAWRCLERQREYGDIVSALRSAMPGAVVVGDYVIENGDRFSVEREGGAGEVNAPW 296

Query: 328  LSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXX 149
             +TFFLFQMIGS++  AE R  RSRA +RHRRS GGA P RRFLWGENLLGL+       
Sbjct: 297  WTTFFLFQMIGSIDGAAEPR-ARSRAWTRHRRS-GGALPERRFLWGENLLGLQDDDEDDE 354

Query: 148  XXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADE 38
                  + + LSD GED SP        RL  SR+D+
Sbjct: 355  G-----DLHILSDAGEDASP--IPRRRRRLTRSRSDD 384


>gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]
          Length = 373

 Score =  328 bits (842), Expect = 5e-87
 Identities = 200/398 (50%), Positives = 242/398 (60%), Gaps = 12/398 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA   R +  D+DM AL+KE D+ SCPICMDHPHNAVLLLC+SHDKGCRSY+CDTSYRHS
Sbjct: 1    MAGVNRRICTDSDMRALHKELDEISCPICMDHPHNAVLLLCSSHDKGCRSYVCDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDN---STSPSLNS---RDQLNMDNTN------SGNIGLGTSGNPSE 866
            NCLDRF+K++  N +N   S+S +LNS   R  LN DN N      +  I +   G P E
Sbjct: 61   NCLDRFKKIRANNRNNPTPSSSLALNSNNLRPNLNEDNQNHNLNESNAVISVDLHGEPRE 120

Query: 865  VEGGNDVRIIDLIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSK 686
                N+ R  DL    +   G+ E  +     PL    E                     
Sbjct: 121  ----NNTR--DLNRLLETQEGIVEAVDS---EPLRERVEVDEFGVENSSESDL------S 165

Query: 685  LRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADV 506
            L+CPLCRG++LGW+VVEEAR++LN K RSCSRESCSFSGNY+EL             +D+
Sbjct: 166  LKCPLCRGTVLGWEVVEEARKHLNLKRRSCSRESCSFSGNYQELRRHARRVHPTTRPSDI 225

Query: 505  DPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLSXXXXXXXXXXXRWL 326
            DP+RERAW+RLE+QRE  D+VSAIRSA+PGAVV+GDYVIENG RL             W 
Sbjct: 226  DPSRERAWQRLEHQRELGDVVSAIRSAIPGAVVVGDYVIENGDRLGGERAGGDANGPWW- 284

Query: 325  STFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXX 146
            +T FLFQMIG+M+   + R  R RA +RHRRS GGA   RR +WGENLLGL+        
Sbjct: 285  TTLFLFQMIGNMDNAGDHR-ARPRAWTRHRRS-GGANSDRRLIWGENLLGLQ-------D 335

Query: 145  XXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
               E +   LSD GEDTSP        RL  SR+DEDQ
Sbjct: 336  DDDEDDLRILSDNGEDTSP-APPRRRRRLTRSRSDEDQ 372


>ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Populus trichocarpa]
            gi|566159410|ref|XP_006386811.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159412|ref|XP_006386812.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159414|ref|XP_006386813.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|222843298|gb|EEE80845.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345588|gb|ERP64608.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345589|gb|ERP64609.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345590|gb|ERP64610.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
          Length = 368

 Score =  310 bits (793), Expect = 2e-81
 Identities = 174/376 (46%), Positives = 225/376 (59%), Gaps = 9/376 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA+ KR ++ D+D+HAL+KE D+ SCPIC+D PHNAVLLLC+S++KGC+SYICDTSYRHS
Sbjct: 1    MAALKRRLNTDSDIHALHKELDEVSCPICLDRPHNAVLLLCSSNEKGCKSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSG-------NPSEVEGGN 851
            NCLD+F+K +  +  N+T  S    + ++   T   ++ L T         N +E+    
Sbjct: 61   NCLDQFKKSRGNSRSNATLQSSMPINSVSSSTTTDASMTLRTHAFDGNENHNLNEISNDT 120

Query: 850  DVRIID-LIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCP 674
             VR+ + L+ +E +   +E    +A    L +                          CP
Sbjct: 121  FVRLPEELVDSESVQERIEHEGVNANSPELSLSPG-----------------------CP 157

Query: 673  LCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTR 494
            LCRG+ILGW+VV+EAR+YLN K RSCSRESCSFSGNY+EL             +D+DP+R
Sbjct: 158  LCRGTILGWEVVDEARKYLNLKKRSCSRESCSFSGNYQELRRHARRVHPTIRPSDIDPSR 217

Query: 493  ERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLS-XXXXXXXXXXXRWLSTF 317
            ERAWR LE+QREY DIVSA+ SAMPGAVV+GDY+IENG RLS             W +TF
Sbjct: 218  ERAWRCLEHQREYGDIVSAVHSAMPGAVVVGDYIIENGDRLSVERESRTNEVNAPWWTTF 277

Query: 316  FLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXX 137
            F FQMIGS++  AE R   SRA +RHR+S       RRFLWGENLLGL            
Sbjct: 278  FFFQMIGSIDGAAEPRTW-SRAWTRHRQS-AETLADRRFLWGENLLGLHDNDADDDDDDD 335

Query: 136  EPETNALSDIGEDTSP 89
                + L + GED SP
Sbjct: 336  NGYLHVLGNAGEDASP 351


>ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791202 isoform X1 [Glycine
            max] gi|571474560|ref|XP_006586265.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X2 [Glycine
            max] gi|571474562|ref|XP_006586266.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X3 [Glycine
            max] gi|571474564|ref|XP_006586267.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X4 [Glycine
            max] gi|571474566|ref|XP_006586268.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X5 [Glycine
            max] gi|571474568|ref|XP_006586269.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X6 [Glycine
            max]
          Length = 350

 Score =  306 bits (783), Expect = 4e-80
 Identities = 177/394 (44%), Positives = 230/394 (58%), Gaps = 8/394 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+HAL+KE D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIIDL 830
            NCLDRF+K++D   +N   PS       ++ NTN+     G + +P+ +   +D  I++ 
Sbjct: 61   NCLDRFKKMRDNFKENQNLPS-------SLVNTNNSGSRQGDAQDPNRLLDQHDEGILET 113

Query: 829  IATEDLSSGL---EENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGS 659
              +E+L       + N+++++ + L                          L+CPLCRG+
Sbjct: 114  ADSENLQDRAVIEDLNADNSSESKL-------------------------NLKCPLCRGA 148

Query: 658  ILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWR 479
            +L WKVVEEAR YLN K RSCSR+SCSF G+Y EL             +++DPTRERAWR
Sbjct: 149  VLNWKVVEEARNYLNMKKRSCSRDSCSFVGDYLELRRHARRVHPTSRPSNIDPTRERAWR 208

Query: 478  RLENQREYDDIVSAIRSAMPGAVVLGDYVIENG---GRL--SXXXXXXXXXXXRWLSTFF 314
              E+QREY DIVSAI+SA+PGAV++GDYV+ENG   GRL               WL+T  
Sbjct: 209  HFEDQREYGDIVSAIQSAVPGAVLVGDYVLENGDGIGRLPDERAEGNIGNANGPWLTTTI 268

Query: 313  LFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXXE 134
            LFQM+ S   +       S A +RHRRS      RRR+LWGENLLGL            E
Sbjct: 269  LFQMMDSTVEIVREPRAHSSAWTRHRRSD----ERRRYLWGENLLGLH-------DNDIE 317

Query: 133  PETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
             +     D GED SP        RL  +R++EDQ
Sbjct: 318  DDLRIFRDAGEDASP--VPRRRRRLTRTRSNEDQ 349


>gb|EPS72073.1| hypothetical protein M569_02687, partial [Genlisea aurea]
          Length = 344

 Score =  305 bits (781), Expect = 6e-80
 Identities = 171/346 (49%), Positives = 209/346 (60%), Gaps = 6/346 (1%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MASRKRS+S+DADM A  KEWD+ASCPIC+DHPHNAVL++C+SHDKGCRS+ICDTSYRHS
Sbjct: 1    MASRKRSLSNDADMSAQQKEWDEASCPICLDHPHNAVLIICSSHDKGCRSFICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIIDL 830
            NCLDRF+KLK +N +   + S++  D    D+ NS +          E            
Sbjct: 61   NCLDRFKKLKQDNIELPATSSISGHDH---DSVNSSSRRRTVEFEDQE----------GA 107

Query: 829  IATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGSILG 650
            +  E L SG E N+  +A                              LRCPLCRG++LG
Sbjct: 108  LFWERLGSG-ESNTEKSAEQ--------------------------VSLRCPLCRGNVLG 140

Query: 649  WKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWRRLE 470
            WKVVE+ R+YLN K RSCSRESCSF+GNY EL             ADVDP+R+R W+ LE
Sbjct: 141  WKVVEDVRKYLNLKPRSCSRESCSFTGNYGELRRHARKDHPTVCPADVDPSRQREWQHLE 200

Query: 469  NQREYDDIVSAIRSAMPGAVVLGDYVIENGG---RLSXXXXXXXXXXXRWLSTFFLFQMI 299
            +QRE +DIVSAIRSAMPGA+++GDY IE+ G                 RWLST FLFQMI
Sbjct: 201  DQRELNDIVSAIRSAMPGAILVGDYAIESSGDRPSRERIRSENAAERGRWLSTLFLFQMI 260

Query: 298  GSMEPVAELRGGRSRALSRHRRSTGGAFPRRR---FLWGENLLGLR 170
            G++E  A  RGGRSR   R  +      P  R   +LWGENLLGL+
Sbjct: 261  GALEDGAPRRGGRSRGHRRAEQQQQQPVPAVRHHHYLWGENLLGLQ 306


>ref|XP_006602548.1| PREDICTED: uncharacterized protein LOC100807316 isoform X8 [Glycine
            max]
          Length = 349

 Score =  305 bits (781), Expect = 6e-80
 Identities = 180/393 (45%), Positives = 226/393 (57%), Gaps = 7/393 (1%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+HAL+KE D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTSYRHS
Sbjct: 1    MAGIKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIIDL 830
            NCLDRF+K++D + +N   PS       ++ NTN+     G + +PS     +D  I++ 
Sbjct: 61   NCLDRFKKMRDNSKENQNLPS-------SLVNTNNSGSRQGDAQDPSRHLDQHDEGILET 113

Query: 829  IATEDLSSG--LEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGSI 656
              +E L     LE+    A+ + L                          L+CPLCRGS+
Sbjct: 114  ADSETLQDRAVLEDLDADASESKLN-------------------------LKCPLCRGSV 148

Query: 655  LGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWRR 476
            L W+VVEEAR YLN K RSCSR+SCSF G+Y EL             ++VDPTRERAWR 
Sbjct: 149  LNWEVVEEARNYLNMKKRSCSRDSCSFVGDYLELRRHARRVHPTSRPSNVDPTRERAWRH 208

Query: 475  LENQREYDDIVSAIRSAMPGAVVLGDYVIENG---GRL--SXXXXXXXXXXXRWLSTFFL 311
             E QREY DIVSAI+SAMPGAV++GDY +ENG   GRL               WL+T  L
Sbjct: 209  FERQREYGDIVSAIQSAMPGAVLVGDYALENGDGIGRLQDERVEGNIDNANRPWLATTIL 268

Query: 310  FQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXXEP 131
            FQM+ S   +       S A +RHRRS+     RRR+LWGE+LLGL            E 
Sbjct: 269  FQMMDSTIEIVREPRAHSSAWTRHRRSS----ERRRYLWGESLLGLH-------DNDIED 317

Query: 130  ETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
            +     D GED SP        RL  +R++EDQ
Sbjct: 318  DLRIFRDAGEDASP--VPRRRRRLTRTRSNEDQ 348


>ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300301 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 385

 Score =  305 bits (780), Expect = 8e-80
 Identities = 179/363 (49%), Positives = 222/363 (61%), Gaps = 23/363 (6%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +   +++ ALYKE D  SCPICMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNT-NSGNIGLGT-----SGNPSEVEGGN- 851
            NCLDRF+KL++   +N+ S SL S    N   + N+ ++  GT     +G+P+ +EG   
Sbjct: 61   NCLDRFKKLRE---NNTNSQSLVSSLPTNHHGSHNTPDMAFGTDLNEANGSPNLIEGNAV 117

Query: 850  ---------DVRII-DL---IATEDLSSGLEENS--NHAAHNPLGVHDETXXXXXXXXXX 716
                       R+I DL   +  E+L    +  S      H  L V + +          
Sbjct: 118  TSANIPGQPQERVIQDLNMPLLPEELMGVADSESFQERVEHGELDVENSSESNL------ 171

Query: 715  XXXXXXXXSKLRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXX 536
                      L+CPLCRG+ILGW+VVE+ R+YLN K RSCSRE+CSFSGNY+EL      
Sbjct: 172  ---------SLKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHARR 222

Query: 535  XXXXXXXADVDPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRL-SXXX 359
                   +D+DP+RERAWR LE+QRE+ D+VSAI SA+PGAVV+GDYVIENG RL     
Sbjct: 223  VHPATRPSDIDPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGGE 282

Query: 358  XXXXXXXXRWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLL 179
                     W +T FLFQMIGS +   E R  R+RA  RHRRS  GA   RR LWGENLL
Sbjct: 283  SGTGEANGPWWTTMFLFQMIGSADRGGEPR-ARARAWPRHRRS-AGALSERRLLWGENLL 340

Query: 178  GLR 170
            GL+
Sbjct: 341  GLQ 343


>ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300301 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 439

 Score =  305 bits (780), Expect = 8e-80
 Identities = 179/363 (49%), Positives = 222/363 (61%), Gaps = 23/363 (6%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +   +++ ALYKE D  SCPICMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 55   MAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 114

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNT-NSGNIGLGT-----SGNPSEVEGGN- 851
            NCLDRF+KL++   +N+ S SL S    N   + N+ ++  GT     +G+P+ +EG   
Sbjct: 115  NCLDRFKKLRE---NNTNSQSLVSSLPTNHHGSHNTPDMAFGTDLNEANGSPNLIEGNAV 171

Query: 850  ---------DVRII-DL---IATEDLSSGLEENS--NHAAHNPLGVHDETXXXXXXXXXX 716
                       R+I DL   +  E+L    +  S      H  L V + +          
Sbjct: 172  TSANIPGQPQERVIQDLNMPLLPEELMGVADSESFQERVEHGELDVENSSESNL------ 225

Query: 715  XXXXXXXXSKLRCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXX 536
                      L+CPLCRG+ILGW+VVE+ R+YLN K RSCSRE+CSFSGNY+EL      
Sbjct: 226  ---------SLKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHARR 276

Query: 535  XXXXXXXADVDPTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRL-SXXX 359
                   +D+DP+RERAWR LE+QRE+ D+VSAI SA+PGAVV+GDYVIENG RL     
Sbjct: 277  VHPATRPSDIDPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGGE 336

Query: 358  XXXXXXXXRWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLL 179
                     W +T FLFQMIGS +   E R  R+RA  RHRRS  GA   RR LWGENLL
Sbjct: 337  SGTGEANGPWWTTMFLFQMIGSADRGGEPR-ARARAWPRHRRS-AGALSERRLLWGENLL 394

Query: 178  GLR 170
            GL+
Sbjct: 395  GLQ 397


>ref|XP_006602541.1| PREDICTED: uncharacterized protein LOC100807316 isoform X1 [Glycine
            max] gi|571546730|ref|XP_006602542.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X2 [Glycine
            max] gi|571546734|ref|XP_006602543.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X3 [Glycine
            max] gi|571546738|ref|XP_006602544.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X4 [Glycine
            max] gi|571546742|ref|XP_006602545.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X5 [Glycine
            max] gi|571546745|ref|XP_006602546.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X6 [Glycine
            max] gi|571546749|ref|XP_006602547.1| PREDICTED:
            uncharacterized protein LOC100807316 isoform X7 [Glycine
            max]
          Length = 384

 Score =  303 bits (777), Expect = 2e-79
 Identities = 180/401 (44%), Positives = 227/401 (56%), Gaps = 15/401 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+HAL+KE D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTSYRHS
Sbjct: 1    MAGIKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNT-NSGNIGLGTSGNPSEVEGGNDVRIID 833
            NCLDRF+K++D + +N   PS      +N +N+ NS ++ +    +  +V   +   I  
Sbjct: 61   NCLDRFKKMRDNSKENQNLPS----SLVNTNNSGNSFDVNITVQSDMHDVNDLHQNEINT 116

Query: 832  LIATEDLSSGLEENSNHAAHNPLGVHDE---------TXXXXXXXXXXXXXXXXXXSKLR 680
            L++   L+ G  +         L  HDE         T                    L+
Sbjct: 117  LLSV-GLAQGSRQGDAQDPSRHLDQHDEGILETADSETLQDRAVLEDLDADASESKLNLK 175

Query: 679  CPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDP 500
            CPLCRGS+L W+VVEEAR YLN K RSCSR+SCSF G+Y EL             ++VDP
Sbjct: 176  CPLCRGSVLNWEVVEEARNYLNMKKRSCSRDSCSFVGDYLELRRHARRVHPTSRPSNVDP 235

Query: 499  TRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENG---GRL--SXXXXXXXXXXX 335
            TRERAWR  E QREY DIVSAI+SAMPGAV++GDY +ENG   GRL              
Sbjct: 236  TRERAWRHFERQREYGDIVSAIQSAMPGAVLVGDYALENGDGIGRLQDERVEGNIDNANR 295

Query: 334  RWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXX 155
             WL+T  LFQM+ S   +       S A +RHRRS+     RRR+LWGE+LLGL      
Sbjct: 296  PWLATTILFQMMDSTIEIVREPRAHSSAWTRHRRSS----ERRRYLWGESLLGLH----- 346

Query: 154  XXXXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
                  E +     D GED SP        RL  +R++EDQ
Sbjct: 347  --DNDIEDDLRIFRDAGEDASP--VPRRRRRLTRTRSNEDQ 383


>gb|ESW12557.1| hypothetical protein PHAVU_008G123200g [Phaseolus vulgaris]
          Length = 385

 Score =  303 bits (775), Expect = 3e-79
 Identities = 180/401 (44%), Positives = 228/401 (56%), Gaps = 15/401 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+HAL+KE D+ SCPICMDHPHNAVLLLC+SH+KGCRSYICDTSYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNT-NSGNIGLGTSGNPSEVEGGNDVRIID 833
            NCLDRF+K++D + +N   PS      +N +N+ NS +I +    +  +V   ++  I  
Sbjct: 61   NCLDRFKKMRDNSKENENLPS----SLVNTNNSGNSFDINITMQSDMHDVNELHENEINT 116

Query: 832  LIATEDLSSGLEENSNHAAHNPLGVHDE----------TXXXXXXXXXXXXXXXXXXSKL 683
            L++   L+ G  +         L  HDE                              KL
Sbjct: 117  LLSV-GLAQGSRQGDAQDPSRHLDPHDEGILETADSETLQDRAVLEDLGADNSSESKLKL 175

Query: 682  RCPLCRGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVD 503
            +CPLCRG++L W+V EEAR YLN K RSCSR+SCSF G Y EL             +D+D
Sbjct: 176  KCPLCRGAVLSWEVDEEARNYLNVKKRSCSRDSCSFVGGYLELRRHARRVHPTSRPSDID 235

Query: 502  PTRERAWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENG---GRLS-XXXXXXXXXXX 335
            PTRERAWR  E QREY DI+SAI+SAMPGAV++GDYV+ENG   GRLS            
Sbjct: 236  PTRERAWRHFERQREYGDIMSAIQSAMPGAVLVGDYVLENGDGIGRLSDEREGNISNANG 295

Query: 334  RWLSTFFLFQMIGSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXX 155
             WL+T  LFQ++ S   +       +   SRHRRS+     RRR+LWGENLLGL      
Sbjct: 296  PWLTTTILFQVMDSTIEIVREPRAHASTWSRHRRSS----ERRRYLWGENLLGLN----- 346

Query: 154  XXXXXXEPETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
                  E +    SD GED SP        RL  +R++EDQ
Sbjct: 347  --ENDIEDDLRIFSDAGEDPSP--VPRRRRRLTRTRSNEDQ 383


>ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Capsella rubella]
            gi|565480774|ref|XP_006298027.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566735|gb|EOA30924.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566736|gb|EOA30925.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
          Length = 353

 Score =  303 bits (775), Expect = 3e-79
 Identities = 174/354 (49%), Positives = 209/354 (59%), Gaps = 14/354 (3%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +S ++D+HAL+KE D+ SCP+CMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNST------SPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGND 848
            NCLDRF+KL  E+ ++ T      S   N+  Q     T+  N   G SGN   V     
Sbjct: 61   NCLDRFKKLHSESPNDPTPEANLASRETNNESQNEHGTTSRSNFHSG-SGNRGSVGDYES 119

Query: 847  VRIIDLIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLC 668
            +R    +  E+ S   E+ +N                                 L+CPLC
Sbjct: 120  LRRRRRVEDEEQS---EDFTN---------------------------------LKCPLC 143

Query: 667  RGSILGWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRER 488
            RG++LGWKVVEE R YL+ K RSCSRESCSF+GNY++L             +D DP+RER
Sbjct: 144  RGTVLGWKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRER 203

Query: 487  AWRRLENQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLSXXXXXXXXXXXRWLSTFFLF 308
            AWRRLENQREY DIVSAIRSAMPGAVV+GDYVIENG R              W +T  LF
Sbjct: 204  AWRRLENQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFPGEREAGNGGSDLW-TTLVLF 262

Query: 307  QMIGSMEPVAELRGG--------RSRALSRHRRSTGGAFPRRRFLWGENLLGLR 170
            QMIGS++       G        RSRA   HRRS+      RR+LWGENLLGL+
Sbjct: 263  QMIGSLDSGGPSGSGSGSGSRSHRSRAWRNHRRSSD-----RRYLWGENLLGLQ 311


>ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arabidopsis lyrata subsp.
            lyrata] gi|297329394|gb|EFH59813.1| hypothetical protein
            ARALYDRAFT_479993 [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  303 bits (775), Expect = 3e-79
 Identities = 174/348 (50%), Positives = 207/348 (59%), Gaps = 8/348 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +S ++D+HAL+KE D+ SCP+CMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSGNPSEVEGGNDVRIIDL 830
            NCLDRF+KL  E      SP+         D T  GN+    + N S  E G        
Sbjct: 61   NCLDRFKKLHSE------SPN---------DPTPEGNLASRENNNESLNEHG-------- 97

Query: 829  IATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGSILG 650
              T   SS   E++N       G   ++                  + L+CPLCRG++LG
Sbjct: 98   --TASRSSFHRESTNR------GSAWDSESLRRRRRVDEEEQSEDITNLKCPLCRGTVLG 149

Query: 649  WKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWRRLE 470
            WKVVEE R YL+ K RSCSRESCSF+GNY++L             +D DP+RERAWR LE
Sbjct: 150  WKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDPSRERAWRHLE 209

Query: 469  NQREYDDIVSAIRSAMPGAVVLGDYVIENGGRLSXXXXXXXXXXXRWLSTFFLFQMIGSM 290
            NQREY DIVSAIRSAMPGAVV+GDYVIENG R S            W +T  LFQMIGS+
Sbjct: 210  NQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFSGERETGNGGSDLW-TTLVLFQMIGSL 268

Query: 289  EPVAELRGG--------RSRALSRHRRSTGGAFPRRRFLWGENLLGLR 170
            +       G        RSRA   HRRS+      RR+LWGENLLGL+
Sbjct: 269  DNGGSSASGSGGGSRSHRSRAWRNHRRSSSD----RRYLWGENLLGLQ 312


>ref|XP_004492331.1| PREDICTED: uncharacterized protein LOC101499234 isoform X1 [Cicer
            arietinum] gi|502103643|ref|XP_004492332.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X2 [Cicer
            arietinum] gi|502103648|ref|XP_004492333.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X3 [Cicer
            arietinum] gi|502103652|ref|XP_004492334.1| PREDICTED:
            uncharacterized protein LOC101499234 isoform X4 [Cicer
            arietinum]
          Length = 354

 Score =  302 bits (773), Expect = 5e-79
 Identities = 183/394 (46%), Positives = 236/394 (59%), Gaps = 8/394 (2%)
 Frame = -3

Query: 1189 MASRKRSMSHDADMHALYKEWDDASCPICMDHPHNAVLLLCTSHDKGCRSYICDTSYRHS 1010
            MA  KR +  D+D+HAL+KE D+ SCPICMDHPHNAVLLLC+SHDKGCRSYICDTSYRHS
Sbjct: 1    MAGFKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1009 NCLDRFRKLKDENGDNSTSPSLNSRDQLNMDNTNSGNIGLGTSG-NPSEVEGGNDVRIID 833
            NCLDRF+K++D + +N   PS       ++ NTN+     G +  +PS     +D  I++
Sbjct: 61   NCLDRFKKMRDNSKENPNLPS-------SLINTNNSGSRQGDAAQDPSRHLDQHDEGILE 113

Query: 832  LIATEDLSSGLEENSNHAAHNPLGVHDETXXXXXXXXXXXXXXXXXXSKLRCPLCRGSIL 653
               +E L        + A    L V + +                    L+CPLCRG++L
Sbjct: 114  TAESETLQ-------DRAVLEDLDVDNNSSDSIL--------------SLQCPLCRGTVL 152

Query: 652  GWKVVEEARRYLNSKARSCSRESCSFSGNYRELXXXXXXXXXXXXXADVDPTRERAWRRL 473
            GW+V+EEAR YLN+K RSCSR+SCSF+G+Y EL             +DVDPTRE+AW++ 
Sbjct: 153  GWEVIEEARNYLNNKKRSCSRDSCSFAGDYLELRRHARRVHPTSRPSDVDPTREQAWQQF 212

Query: 472  ENQREYDDIVSAIRSAMPGAVVLGDYVIENG---GRLS-XXXXXXXXXXXRWL--STFFL 311
            E QREY DIVSAI+SA+PGAVV+GDYV+ENG   GRLS             WL  +T  L
Sbjct: 213  ERQREYGDIVSAIQSAIPGAVVVGDYVLENGDGIGRLSGDRDGNNGNGNGPWLTTTTTIL 272

Query: 310  FQMI-GSMEPVAELRGGRSRALSRHRRSTGGAFPRRRFLWGENLLGLRXXXXXXXXXXXE 134
            FQM+  ++E V E R   S A SRHRRS+     RRR+LWGENLLGL+           E
Sbjct: 273  FQMMDNTIEIVREPRARSSSAWSRHRRSS----DRRRYLWGENLLGLQ-------DNEVE 321

Query: 133  PETNALSDIGEDTSPNXXXXXXXRLMHSRADEDQ 32
             +    +D+ ED S         RL  +R++EDQ
Sbjct: 322  EDLRIFNDLVEDAS--TVPRRRRRLNRTRSNEDQ 353

