BLASTX nr result

ID: Cocculus23_contig00022778 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00022778
         (1198 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007049821.1| B3 domain-containing transcription factor VA...   357   4e-96
ref|XP_002521120.1| conserved hypothetical protein [Ricinus comm...   354   4e-95
ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citr...   353   8e-95
ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citr...   351   4e-94
ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208...   350   7e-94
ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208...   348   3e-93
gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]     344   4e-92
ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citr...   336   1e-89
ref|XP_004975029.1| PREDICTED: uncharacterized protein LOC101762...   333   7e-89
ref|XP_002447452.1| hypothetical protein SORBIDRAFT_06g001230 [S...   330   1e-87
ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300...   328   2e-87
gb|ACG33117.1| hypothetical protein [Zea mays]                        328   2e-87
ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Caps...   328   3e-87
ref|NP_001143747.1| uncharacterized protein LOC100276502 [Zea ma...   328   3e-87
ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arab...   327   5e-87
ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300...   327   8e-87
gb|AFW57825.1| hypothetical protein ZEAMMB73_396034 [Zea mays] g...   326   1e-86
ref|XP_007140563.1| hypothetical protein PHAVU_008G123200g [Phas...   325   2e-86
ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Popu...   324   5e-86
ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791...   323   7e-86

>ref|XP_007049821.1| B3 domain-containing transcription factor VAL3, putative isoform 1
            [Theobroma cacao] gi|590714105|ref|XP_007049822.1| B3
            domain-containing transcription factor VAL3, putative
            isoform 1 [Theobroma cacao] gi|508702082|gb|EOX93978.1|
            B3 domain-containing transcription factor VAL3, putative
            isoform 1 [Theobroma cacao] gi|508702083|gb|EOX93979.1|
            B3 domain-containing transcription factor VAL3, putative
            isoform 1 [Theobroma cacao]
          Length = 377

 Score =  357 bits (917), Expect = 4e-96
 Identities = 191/378 (50%), Positives = 248/378 (65%), Gaps = 11/378 (2%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K     ++++  LHKE DE  CPICMDHPHNAVLLLCSS++KGCR YICD+SYRHS
Sbjct: 1    MAGVKRRIITDSDIRALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKK---YFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEAD 837
            NCLDR+KK   Y   SP+              I Q   +  T++      +LR +  E +
Sbjct: 61   NCLDRYKKLRAYSSKSPM----------LPHPIPQNRQNSSTSDMNL---ALRTDFIEGN 107

Query: 836  EHNSLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPY-----SEMIGGG 672
               +LNE +  S+ GR E     N +EP R+L++QG  ++E G  +       SE +   
Sbjct: 108  GSRNLNETN--STPGRSE----GNIQEPNRHLDSQGEGIIEIGDSDSSQGRAESEELDAE 161

Query: 671  NLTDS--NLKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIH 498
            N ++S  +LKCPLCRG + GW++V+EAR YL+LK+RSC  +SC ++GNY ELRRHARR+H
Sbjct: 162  NTSESKSSLKCPLCRGDIHGWEVVEEARMYLNLKKRSCSRESCAYNGNYQELRRHARRVH 221

Query: 497  PTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENN 318
            PTTRP+ +DPSR+R WR LEHQ+EYGDIVSAIRSAMPGAIV+GDY IE+GD    DR++ 
Sbjct: 222  PTTRPSDIDPSRERDWRRLEHQREYGDIVSAIRSAMPGAIVVGDYAIENGDRLAADRDSG 281

Query: 317  SGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLG 138
            +GE + P  T+F L+QMI     V +    R  SR W R+ R +G  SERR LWGENLLG
Sbjct: 282  TGEESAPWWTTFFLFQMIGSIDSVGE---PRARSRVWSRHRRPAGALSERRFLWGENLLG 338

Query: 137  LH-EDDDDWNLASDMAED 87
            L  +DDDD  + SD+ ED
Sbjct: 339  LQDDDDDDLRILSDVGED 356


>ref|XP_002521120.1| conserved hypothetical protein [Ricinus communis]
            gi|223539689|gb|EEF41271.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  354 bits (909), Expect = 4e-95
 Identities = 185/379 (48%), Positives = 248/379 (65%), Gaps = 12/379 (3%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            MT  K S   ++++  LH E DE  CPICMDHPHNAVLLLCSS++KGCR YICD+S RHS
Sbjct: 1    MTGVKRSRYTDSDIRTLHNELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSSRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVA-SLRNNLAEADEH 831
            NCLDR+KK   +S                    P++ +++ +    + +L   + ++ E+
Sbjct: 61   NCLDRYKKLRDSSGSNTTLDSSL----------PINSFSSSNISDTSLTLGARVLDSYEN 110

Query: 830  NSLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIG-------GG 672
            ++ +++ N +S    E L +N+ + P R +ET+G  VLE G  E + + I          
Sbjct: 111  HNQSDSDNITSVRMPEQLLENSIQHPNRQVETRGEGVLEAGDSESFPDRIELEEADVVNS 170

Query: 671  NLTDSNLKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPT 492
            +    +LKCPLCRG V+GW++V+EAR+YL+LK+RSC  +SC F GNY ELRRHARR+HPT
Sbjct: 171  SEAGLSLKCPLCRGAVLGWEVVEEARKYLNLKKRSCSRESCSFCGNYQELRRHARRVHPT 230

Query: 491  TRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSG 312
            TRP+ VDPSR+RAWR LE Q+EYGDIVSA+RSAMPGA+V+GDYVIE+GD F  +RE  +G
Sbjct: 231  TRPSDVDPSRERAWRCLERQREYGDIVSALRSAMPGAVVVGDYVIENGDRFSVEREGGAG 290

Query: 311  EGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLH 132
            E N P  T+F L+QMI     +  +   R  SRAW R+ RS G   ERR LWGENLLGL 
Sbjct: 291  EVNAPWWTTFFLFQMI---GSIDGAAEPRARSRAWTRHRRSGGALPERRFLWGENLLGLQ 347

Query: 131  EDDD----DWNLASDMAED 87
            +DD+    D ++ SD  ED
Sbjct: 348  DDDEDDEGDLHILSDAGED 366


>ref|XP_006443638.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902304|ref|XP_006443640.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902308|ref|XP_006443642.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853096|ref|XP_006480203.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X1 [Citrus
            sinensis] gi|557545900|gb|ESR56878.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545902|gb|ESR56880.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545904|gb|ESR56882.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 415

 Score =  353 bits (906), Expect = 8e-95
 Identities = 194/406 (47%), Positives = 255/406 (62%), Gaps = 15/406 (3%)
 Frame = -3

Query: 1196 APKMTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSY 1017
            A KM   K     ++++H LHKE DE  CPICMDHPHNAVLL+CSS+DKGCR YICD+SY
Sbjct: 24   ACKMAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSY 83

Query: 1016 RHSNCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPV-ASLRNNLAEA 840
            RHSNCLDR+KK   +S                    P H   N +   +  +LR +  E+
Sbjct: 84   RHSNCLDRYKKLRTSSRNNTTLSH----------SSPSHPQHNSNASDMNLALRTDFVES 133

Query: 839  DEHNSLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVL--ETGGLEPYSEM--IGGG 672
             E+ +LN  SN  S G  E   +NN ++  R LE +G   L  E G  + + E   + G 
Sbjct: 134  SENLNLN-GSNALSDGLPEGPGENNIQQADRLLEREGEGNLNPEAGNSQTFHERTELEGL 192

Query: 671  NLTDSN-----LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHAR 507
            ++ +S+     LKCP+CRG ++GW++V+EAR+YL+LK+R+C  +SC F GNY ELRRHAR
Sbjct: 193  DVDNSSESILTLKCPMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHAR 252

Query: 506  RIHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDR 327
            R HPTTRP+ +DPSR+RAWR LEHQ+EY DIVSAIRS+MPGA+V+GDYVIE+GD F   R
Sbjct: 253  RAHPTTRPSDIDPSRERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGR 312

Query: 326  ENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRN-LWGE 150
            E+ +GE N P  T+F L+ MI     +  +  SR  SRAW R+ R++G  SERR  LWGE
Sbjct: 313  ESGNGEVNAPWWTTFFLFHMI---GSMDGTGESRARSRAWTRHRRTAGALSERRRFLWGE 369

Query: 149  NLLGLH----EDDDDWNLASDMAEDVXXXXXXXXXXXXXXPDEDLP 24
            NLLGL     +++DD ++ SD+ ED                DED P
Sbjct: 370  NLLGLQDEEDDEEDDLHIFSDVGEDTSPIPRRRRRLTQSRSDEDQP 415


>ref|XP_006443639.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|567902306|ref|XP_006443641.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902312|ref|XP_006443644.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902314|ref|XP_006443645.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902316|ref|XP_006443646.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|567902318|ref|XP_006443647.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|568853098|ref|XP_006480204.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X2 [Citrus
            sinensis] gi|568853100|ref|XP_006480205.1| PREDICTED:
            uncharacterized protein LOC102627851 isoform X3 [Citrus
            sinensis] gi|557545901|gb|ESR56879.1| hypothetical
            protein CICLE_v10020351mg [Citrus clementina]
            gi|557545903|gb|ESR56881.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545906|gb|ESR56884.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545907|gb|ESR56885.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545908|gb|ESR56886.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
            gi|557545909|gb|ESR56887.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 389

 Score =  351 bits (900), Expect = 4e-94
 Identities = 192/403 (47%), Positives = 253/403 (62%), Gaps = 15/403 (3%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K     ++++H LHKE DE  CPICMDHPHNAVLL+CSS+DKGCR YICD+SYRHS
Sbjct: 1    MAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPV-ASLRNNLAEADEH 831
            NCLDR+KK   +S                    P H   N +   +  +LR +  E+ E+
Sbjct: 61   NCLDRYKKLRTSSRNNTTLSH----------SSPSHPQHNSNASDMNLALRTDFVESSEN 110

Query: 830  NSLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVL--ETGGLEPYSEM--IGGGNLT 663
             +LN  SN  S G  E   +NN ++  R LE +G   L  E G  + + E   + G ++ 
Sbjct: 111  LNLN-GSNALSDGLPEGPGENNIQQADRLLEREGEGNLNPEAGNSQTFHERTELEGLDVD 169

Query: 662  DSN-----LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIH 498
            +S+     LKCP+CRG ++GW++V+EAR+YL+LK+R+C  +SC F GNY ELRRHARR H
Sbjct: 170  NSSESILTLKCPMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHARRAH 229

Query: 497  PTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENN 318
            PTTRP+ +DPSR+RAWR LEHQ+EY DIVSAIRS+MPGA+V+GDYVIE+GD F   RE+ 
Sbjct: 230  PTTRPSDIDPSRERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGRESG 289

Query: 317  SGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRN-LWGENLL 141
            +GE N P  T+F L+ MI     +  +  SR  SRAW R+ R++G  SERR  LWGENLL
Sbjct: 290  NGEVNAPWWTTFFLFHMI---GSMDGTGESRARSRAWTRHRRTAGALSERRRFLWGENLL 346

Query: 140  GLH----EDDDDWNLASDMAEDVXXXXXXXXXXXXXXPDEDLP 24
            GL     +++DD ++ SD+ ED                DED P
Sbjct: 347  GLQDEEDDEEDDLHIFSDVGEDTSPIPRRRRRLTQSRSDEDQP 389


>ref|XP_004139654.1| PREDICTED: uncharacterized protein LOC101208460 isoform 1 [Cucumis
            sativus]
          Length = 389

 Score =  350 bits (898), Expect = 7e-94
 Identities = 193/386 (50%), Positives = 255/386 (66%), Gaps = 18/386 (4%)
 Frame = -3

Query: 1190 KMTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRH 1011
            KM   K     ++++  LHKE DE  CPICMDHPHNAVLLLCSS+ KGC+PYICD+S+RH
Sbjct: 3    KMAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRH 62

Query: 1010 SNCLDRFKKY---FRNSPLQXXXXXXXXXXXXSIVQRPLHV----YTNESTFPVASLRNN 852
            SNC D+FKK     R SP                +  PL +    ++N ST  +  L  +
Sbjct: 63   SNCFDQFKKLREETRKSPR---------------LSSPLPINPYSFSNPSTNNLG-LSID 106

Query: 851  LAEADEHNSLNENSNGSSTGRIEV-LEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIG- 678
            L E D++ ++NE +  +S G   + L DN  E   R ++T     ++T G    +E +  
Sbjct: 107  LNEVDDNQNINERNTVASAGLPGLALGDNGTENSNRTVDTNEAGDMDTAGSGSITERVDQ 166

Query: 677  ----GGNLTD-SNLKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRH 513
                 GN ++ SNLKCP+CRG V+G ++++EAR+YL+LK+RSC  ++C FSGNY ELRRH
Sbjct: 167  EGLDAGNSSEYSNLKCPMCRGAVLGLEVIEEAREYLNLKKRSCSRETCSFSGNYQELRRH 226

Query: 512  ARRIHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGF-P 336
            ARR+HPT+RPAV+DPSR+RAWR LE Q+E GD+VSAIRSAMPGA+V+GDYVIE+GDG   
Sbjct: 227  ARRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVA 286

Query: 335  GDRENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSG--TFSERRN 162
            G+R+N +G+ NGPLLTSF L+ M   F  V  +R  R  SR+W R+ RS G    SERR 
Sbjct: 287  GERDNGTGDVNGPLLTSFFLFHM---FGSVEGAREPRPRSRSWVRHRRSGGGTPVSERRF 343

Query: 161  LWGENLLGLHED-DDDWNLASDMAED 87
            LWGENLLGL ED D+D+ +   M +D
Sbjct: 344  LWGENLLGLQEDTDEDFRIYIGMGDD 369


>ref|XP_004139655.1| PREDICTED: uncharacterized protein LOC101208460 isoform 2 [Cucumis
            sativus] gi|449443782|ref|XP_004139656.1| PREDICTED:
            uncharacterized protein LOC101208460 isoform 3 [Cucumis
            sativus] gi|449527327|ref|XP_004170663.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 1 [Cucumis
            sativus] gi|449527329|ref|XP_004170664.1| PREDICTED:
            uncharacterized protein LOC101225264 isoform 2 [Cucumis
            sativus]
          Length = 386

 Score =  348 bits (893), Expect = 3e-93
 Identities = 192/385 (49%), Positives = 254/385 (65%), Gaps = 18/385 (4%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K     ++++  LHKE DE  CPICMDHPHNAVLLLCSS+ KGC+PYICD+S+RHS
Sbjct: 1    MAGVKRRIHNDSDILALHKELDEVSCPICMDHPHNAVLLLCSSHHKGCKPYICDTSHRHS 60

Query: 1007 NCLDRFKKY---FRNSPLQXXXXXXXXXXXXSIVQRPLHV----YTNESTFPVASLRNNL 849
            NC D+FKK     R SP                +  PL +    ++N ST  +  L  +L
Sbjct: 61   NCFDQFKKLREETRKSPR---------------LSSPLPINPYSFSNPSTNNLG-LSIDL 104

Query: 848  AEADEHNSLNENSNGSSTGRIEV-LEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIG-- 678
             E D++ ++NE +  +S G   + L DN  E   R ++T     ++T G    +E +   
Sbjct: 105  NEVDDNQNINERNTVASAGLPGLALGDNGTENSNRTVDTNEAGDMDTAGSGSITERVDQE 164

Query: 677  ---GGNLTD-SNLKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHA 510
                GN ++ SNLKCP+CRG V+G ++++EAR+YL+LK+RSC  ++C FSGNY ELRRHA
Sbjct: 165  GLDAGNSSEYSNLKCPMCRGAVLGLEVIEEAREYLNLKKRSCSRETCSFSGNYQELRRHA 224

Query: 509  RRIHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGF-PG 333
            RR+HPT+RPAV+DPSR+RAWR LE Q+E GD+VSAIRSAMPGA+V+GDYVIE+GDG   G
Sbjct: 225  RRVHPTSRPAVIDPSRERAWRRLERQREVGDVVSAIRSAMPGALVVGDYVIENGDGMVAG 284

Query: 332  DRENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSG--TFSERRNL 159
            +R+N +G+ NGPLLTSF L+ M   F  V  +R  R  SR+W R+ RS G    SERR L
Sbjct: 285  ERDNGTGDVNGPLLTSFFLFHM---FGSVEGAREPRPRSRSWVRHRRSGGGTPVSERRFL 341

Query: 158  WGENLLGLHED-DDDWNLASDMAED 87
            WGENLLGL ED D+D+ +   M +D
Sbjct: 342  WGENLLGLQEDTDEDFRIYIGMGDD 366


>gb|EXC24174.1| hypothetical protein L484_015193 [Morus notabilis]
          Length = 373

 Score =  344 bits (883), Expect = 4e-92
 Identities = 186/371 (50%), Positives = 242/371 (65%), Gaps = 14/371 (3%)
 Frame = -3

Query: 1157 NTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKKYF 978
            +++M  LHKE DE  CPICMDHPHNAVLLLCSS+DKGCR Y+CD+SYRHSNCLDRFKK  
Sbjct: 11   DSDMRALHKELDEISCPICMDHPHNAVLLLCSSHDKGCRSYVCDTSYRHSNCLDRFKKIR 70

Query: 977  ---RNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVAS--LRNNLAEADEHNSLNEN 813
               RN+P                        T  S+  + S  LR NL E +++++LNE+
Sbjct: 71   ANNRNNP------------------------TPSSSLALNSNNLRPNLNEDNQNHNLNES 106

Query: 812  SNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIG-------GGNLTDSN 654
            +   S        +NN  +  R LETQ   ++E    EP  E +          + +D +
Sbjct: 107  NAVISVDLHGEPRENNTRDLNRLLETQ-EGIVEAVDSEPLRERVEVDEFGVENSSESDLS 165

Query: 653  LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVV 474
            LKCPLCRG V+GW++V+EAR++L+LK+RSC  +SC FSGNY ELRRHARR+HPTTRP+ +
Sbjct: 166  LKCPLCRGTVLGWEVVEEARKHLNLKRRSCSRESCSFSGNYQELRRHARRVHPTTRPSDI 225

Query: 473  DPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPL 294
            DPSR+RAW+ LEHQ+E GD+VSAIRSA+PGA+V+GDYVIE+GD   G+R    G+ NGP 
Sbjct: 226  DPSRERAWQRLEHQRELGDVVSAIRSAIPGAVVVGDYVIENGDRLGGERA--GGDANGPW 283

Query: 293  LTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDD--D 120
             T+  L+QMI     + ++   R   RAW R+ RS G  S+RR +WGENLLGL +DD  D
Sbjct: 284  WTTLFLFQMI---GNMDNAGDHRARPRAWTRHRRSGGANSDRRLIWGENLLGLQDDDDED 340

Query: 119  DWNLASDMAED 87
            D  + SD  ED
Sbjct: 341  DLRILSDNGED 351


>ref|XP_006443643.1| hypothetical protein CICLE_v10020351mg [Citrus clementina]
            gi|557545905|gb|ESR56883.1| hypothetical protein
            CICLE_v10020351mg [Citrus clementina]
          Length = 381

 Score =  336 bits (862), Expect = 1e-89
 Identities = 185/405 (45%), Positives = 243/405 (60%), Gaps = 14/405 (3%)
 Frame = -3

Query: 1196 APKMTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSY 1017
            A KM   K     ++++H LHKE DE  CPICMDHPHNAVLL+CSS+DKGCR YICD+SY
Sbjct: 24   ACKMAGVKRRMYTDSDIHALHKELDEISCPICMDHPHNAVLLICSSHDKGCRSYICDTSY 83

Query: 1016 RHSNCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEAD 837
            RHSNCLDR+KK   +S                                    RNN   + 
Sbjct: 84   RHSNCLDRYKKLRTSS------------------------------------RNNTTLSH 107

Query: 836  EHNSLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVL--ETGGLEPYSEM--IGGGN 669
               S  +++ G          +NN ++  R LE +G   L  E G  + + E   + G +
Sbjct: 108  SSPSHPQHNKGPG--------ENNIQQADRLLEREGEGNLNPEAGNSQTFHERTELEGLD 159

Query: 668  LTDSN-----LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARR 504
            + +S+     LKCP+CRG ++GW++V+EAR+YL+LK+R+C  +SC F GNY ELRRHARR
Sbjct: 160  VDNSSESILTLKCPMCRGAILGWEVVEEARKYLNLKRRTCSRESCSFVGNYQELRRHARR 219

Query: 503  IHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRE 324
             HPTTRP+ +DPSR+RAWR LEHQ+EY DIVSAIRS+MPGA+V+GDYVIE+GD F   RE
Sbjct: 220  AHPTTRPSDIDPSRERAWRRLEHQREYSDIVSAIRSSMPGAVVVGDYVIENGDRFSAGRE 279

Query: 323  NNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRN-LWGEN 147
            + +GE N P  T+F L+ MI     +  +  SR  SRAW R+ R++G  SERR  LWGEN
Sbjct: 280  SGNGEVNAPWWTTFFLFHMI---GSMDGTGESRARSRAWTRHRRTAGALSERRRFLWGEN 336

Query: 146  LLGLH----EDDDDWNLASDMAEDVXXXXXXXXXXXXXXPDEDLP 24
            LLGL     +++DD ++ SD+ ED                DED P
Sbjct: 337  LLGLQDEEDDEEDDLHIFSDVGEDTSPIPRRRRRLTQSRSDEDQP 381


>ref|XP_004975029.1| PREDICTED: uncharacterized protein LOC101762232 isoform X1 [Setaria
            italica] gi|514800198|ref|XP_004975030.1| PREDICTED:
            uncharacterized protein LOC101762232 isoform X2 [Setaria
            italica]
          Length = 378

 Score =  333 bits (855), Expect = 7e-89
 Identities = 185/351 (52%), Positives = 224/351 (63%), Gaps = 4/351 (1%)
 Frame = -3

Query: 1157 NTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKKYF 978
            +T+   LHKEWD+ALCPICMDHPHNAVLLLCSS+DKGCRPYICD+SYRHSNCLDRFKK  
Sbjct: 16   DTDTAALHKEWDDALCPICMDHPHNAVLLLCSSHDKGCRPYICDTSYRHSNCLDRFKKMK 75

Query: 977  RN---SPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHNSLNENSN 807
             N   SP Q            ++V+R     T ES   +  + +  AEA +H   +    
Sbjct: 76   VNDGDSPSQPSSSVPRGTRNQNVVRRSRFGVTRESPRLLIDI-SEPAEASDHQDASHRP- 133

Query: 806  GSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLKCPLCRGI 627
             +  GR E  E+N NE P   LE+Q         +E    ++     + + L CPLCRG 
Sbjct: 134  AAIAGRQE--ENNYNEGPDLTLESQE--------VEISGPLVSSDVSSSNQLLCPLCRGT 183

Query: 626  VIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDPSRQRAWR 447
            V GWKI++EARQYLD K R+C  ++C FSGNY E+RRHAR +HPTTRPA  DPSR+ AW 
Sbjct: 184  VSGWKIIKEARQYLDEKSRACSREACTFSGNYREIRRHARSVHPTTRPADEDPSRRCAWH 243

Query: 446  NLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNS-GEGNGPLLTSFILYQ 270
             LEHQ+EYGDIVSAIRSAMPGA+VLGDY IE G+ F  DRE +   E +G LLT+F L+ 
Sbjct: 244  RLEHQREYGDIVSAIRSAMPGAVVLGDYAIEGGEMFSHDRETSGPSEPSGSLLTTFFLFH 303

Query: 269  MIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDD 117
            M+   +P+  S   RG SR  RR          RR LWGENLLGL  DDDD
Sbjct: 304  MLSS-SPIRSSDDPRGASRGLRR--------RRRRYLWGENLLGLQYDDDD 345


>ref|XP_002447452.1| hypothetical protein SORBIDRAFT_06g001230 [Sorghum bicolor]
            gi|241938635|gb|EES11780.1| hypothetical protein
            SORBIDRAFT_06g001230 [Sorghum bicolor]
          Length = 382

 Score =  330 bits (845), Expect = 1e-87
 Identities = 183/353 (51%), Positives = 223/353 (63%), Gaps = 4/353 (1%)
 Frame = -3

Query: 1157 NTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKKYF 978
            +T+   LHKEWD+ LCPICMDHPHNAVLLLCSS+DKGCR YICD+SYRHSNCLDRFKK  
Sbjct: 14   DTDTAALHKEWDDVLCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRFKKMK 73

Query: 977  RN---SPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHNSLNENSN 807
             N   SP +            ++VQR     T ES      L  +++E DE ++  + S+
Sbjct: 74   VNDGDSPSESSSSMPRGTRNQNVVQRSRFGLTGESP----RLHIDISEPDEASNHQDASH 129

Query: 806  GSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLKCPLCRGI 627
              +    E  E+N NE P        N  LE   +E           + + L CPLCRG 
Sbjct: 130  RPAAIAGEQEENNYNEGP--------NLTLEAHEVEMNGPSESSDVSSLNQLLCPLCRGG 181

Query: 626  VIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDPSRQRAWR 447
            V GWKI++EARQYLD K R+C  ++C FSGNY E+RRHARR+HPTTRPA VDPSR+RAW 
Sbjct: 182  VSGWKIIKEARQYLDEKSRACSREACTFSGNYREIRRHARRVHPTTRPADVDPSRRRAWH 241

Query: 446  NLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNS-GEGNGPLLTSFILYQ 270
            +LEHQ+EY DIVSAIRSAMPGA+VLGDY IE G+ F  DRE +   E +G LLT+F L+ 
Sbjct: 242  HLEHQREYADIVSAIRSAMPGAVVLGDYAIEGGEMFSHDRETSGPSEPSGSLLTTFFLFH 301

Query: 269  MIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDDWN 111
            M+   +P+      RG SR  RR          RR LWGENLLGL  DDD+ N
Sbjct: 302  MLSS-SPIRSGDEPRGASRGLRR--------QRRRYLWGENLLGLQYDDDNDN 345


>ref|XP_004290228.1| PREDICTED: uncharacterized protein LOC101300301 isoform 1 [Fragaria
            vesca subsp. vesca]
          Length = 439

 Score =  328 bits (842), Expect = 2e-87
 Identities = 176/370 (47%), Positives = 231/370 (62%), Gaps = 12/370 (3%)
 Frame = -3

Query: 1190 KMTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRH 1011
            KM   K      +E+  L+KE D   CPICMDHPHNAVLLLCSS+DKGCR YICD+SYRH
Sbjct: 54   KMAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRH 113

Query: 1010 SNCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEH 831
            SNCLDRFKK   N+                +   P + + + +T P  +   +L EA+  
Sbjct: 114  SNCLDRFKKLRENNT----------NSQSLVSSLPTNHHGSHNT-PDMAFGTDLNEANGS 162

Query: 830  NSLNENSNGSSTG-----RIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNL 666
             +L E +  +S       +  V++D N   P+   E  G         E + E +  G L
Sbjct: 163  PNLIEGNAVTSANIPGQPQERVIQDLNM--PLLPEELMG-----VADSESFQERVEHGEL 215

Query: 665  TDSN-------LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHAR 507
               N       LKCPLCRG ++GW++V++ R+YL+LK+RSC  ++C FSGNY ELRRHAR
Sbjct: 216  DVENSSESNLSLKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHAR 275

Query: 506  RIHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDR 327
            R+HP TRP+ +DPSR+RAWR+LEHQ+E+GD+VSAI SA+PGA+V+GDYVIE+GD   G  
Sbjct: 276  RVHPATRPSDIDPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGG 335

Query: 326  ENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGEN 147
            E+ +GE NGP  T+  L+QMI            R  +RAW R+ RS+G  SERR LWGEN
Sbjct: 336  ESGTGEANGPWWTTMFLFQMI---GSADRGGEPRARARAWPRHRRSAGALSERRLLWGEN 392

Query: 146  LLGLHEDDDD 117
            LLGL +DD+D
Sbjct: 393  LLGLQDDDED 402


>gb|ACG33117.1| hypothetical protein [Zea mays]
          Length = 375

 Score =  328 bits (842), Expect = 2e-87
 Identities = 184/362 (50%), Positives = 228/362 (62%), Gaps = 3/362 (0%)
 Frame = -3

Query: 1163 SINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKK 984
            S +T+   LHKEWD+ LCPICMDHPHNAVLLLCSS+DKGCR YICD+SYRHSNCLDRFKK
Sbjct: 11   SADTDTAALHKEWDDVLCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRFKK 70

Query: 983  YFRN---SPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHNSLNEN 813
               N   SP Q            ++VQR     T ES      L  +++  DE +   + 
Sbjct: 71   MKVNDEDSPSQSSSSMPRGTGNQNVVQRSRFGPTRESP----RLHIDISVPDETSDHQDA 126

Query: 812  SNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLKCPLCR 633
            S+  +    E  E+  NE P   LET    +  +      S +        + L CPLCR
Sbjct: 127  SHRPAAIVGEQEENIXNEGPDLTLETHEVGINGSSVSSDVSSL--------NQLLCPLCR 178

Query: 632  GIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDPSRQRA 453
            G+V GWKI++EARQYLD K R+C  ++C+FSGNY E+RRHARR+HPTTRPA VDPSR+RA
Sbjct: 179  GVVSGWKIIKEARQYLDGKSRACSREACMFSGNYREIRRHARRVHPTTRPADVDPSRRRA 238

Query: 452  WRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPLLTSFILY 273
            W +LEHQ++Y DIVSAIRSAMPGA+VLGDY IE G+ F  DRE  + E +G LLT+F L+
Sbjct: 239  WHHLEHQRDYADIVSAIRSAMPGAVVLGDYAIEGGEIFSHDRE--TSEPSGSLLTTFFLF 296

Query: 272  QMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDDWNLASDMA 93
             M+   +P+      RG SR  RR          RR LWGENLLGL  DDDD +  ++  
Sbjct: 297  HMLSS-SPIRSGDEPRGTSRGLRR--------QRRRYLWGENLLGLQYDDDDDDDDNEGE 347

Query: 92   ED 87
            ED
Sbjct: 348  ED 349


>ref|XP_006298026.1| hypothetical protein CARUB_v10014073mg [Capsella rubella]
            gi|565480774|ref|XP_006298027.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566735|gb|EOA30924.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
            gi|482566736|gb|EOA30925.1| hypothetical protein
            CARUB_v10014073mg [Capsella rubella]
          Length = 353

 Score =  328 bits (841), Expect = 3e-87
 Identities = 183/377 (48%), Positives = 228/377 (60%), Gaps = 11/377 (2%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K   S  +++H LHKE DE  CP+CMDHPHNAVLLLCSS+DKGCR YICD+SYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHN 828
            NCLDRFKK    SP                        T E+   +AS   N    +EH 
Sbjct: 61   NCLDRFKKLHSESPNDP---------------------TPEAN--LASRETNNESQNEHG 97

Query: 827  SLNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLK 648
            + + ++  S +G    + D  +    R +E +  S       E ++           NLK
Sbjct: 98   TTSRSNFHSGSGNRGSVGDYESLRRRRRVEDEEQS-------EDFT-----------NLK 139

Query: 647  CPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDP 468
            CPLCRG V+GWK+V+E R YLDLK RSC  +SC F+GNY +LRRHARR HPTTRP+  DP
Sbjct: 140  CPLCRGTVLGWKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTDP 199

Query: 467  SRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPLLT 288
            SR+RAWR LE+Q+EYGDIVSAIRSAMPGA+V+GDYVIE+GD FPG+RE  +G G   L T
Sbjct: 200  SRERAWRRLENQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFPGERE--AGNGGSDLWT 257

Query: 287  SFILYQMI------RPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHE- 129
            + +L+QMI       P    S S      SRAWR + RS    S+RR LWGENLLGL + 
Sbjct: 258  TLVLFQMIGSLDSGGPSGSGSGSGSRSHRSRAWRNHRRS----SDRRYLWGENLLGLQDE 313

Query: 128  ----DDDDWNLASDMAE 90
                DD++  L +D  +
Sbjct: 314  HNNNDDEELRLQNDAGD 330


>ref|NP_001143747.1| uncharacterized protein LOC100276502 [Zea mays]
            gi|195626164|gb|ACG34912.1| hypothetical protein [Zea
            mays]
          Length = 375

 Score =  328 bits (841), Expect = 3e-87
 Identities = 184/362 (50%), Positives = 228/362 (62%), Gaps = 3/362 (0%)
 Frame = -3

Query: 1163 SINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKK 984
            S +T+   LHKEWD+ LCPICMDHPHNAVLLLCSS+DKGCR YICD+SYRHSNCLDRFKK
Sbjct: 11   SADTDTAALHKEWDDVLCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRFKK 70

Query: 983  YFRN---SPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHNSLNEN 813
               N   SP Q            ++VQR     T ES      L  +++  DE +   + 
Sbjct: 71   MKVNDEDSPSQSSSSMPRGTGNQNVVQRSRFGPTRESP----RLHIDISVPDETSDHQDA 126

Query: 812  SNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLKCPLCR 633
            S+  +    E  E+  NE P   LET    +  +      S +        + L CPLCR
Sbjct: 127  SHRPAAIVGEQEENIYNEGPDLTLETHEVGINGSSVSSDVSSL--------NQLLCPLCR 178

Query: 632  GIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDPSRQRA 453
            G+V GWKI++EARQYLD K R+C  ++C+FSGNY E+RRHARR+HPTTRPA VDPSR+RA
Sbjct: 179  GVVSGWKIIKEARQYLDGKSRACSREACMFSGNYREIRRHARRVHPTTRPADVDPSRRRA 238

Query: 452  WRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPLLTSFILY 273
            W +LEHQ++Y DIVSAIRSAMPGA+VLGDY IE G+ F  DRE  + E +G LLT+F L+
Sbjct: 239  WHHLEHQRDYADIVSAIRSAMPGAVVLGDYAIEGGEIFSHDRE--TSEPSGSLLTTFFLF 296

Query: 272  QMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDDWNLASDMA 93
             M+   +P+      RG SR  RR          RR LWGENLLGL  DDDD +  ++  
Sbjct: 297  HMLSS-SPIRSGDEPRGTSRGLRR--------QRRRYLWGENLLGLQYDDDDDDDDNEGE 347

Query: 92   ED 87
            ED
Sbjct: 348  ED 349


>ref|XP_002883554.1| hypothetical protein ARALYDRAFT_479993 [Arabidopsis lyrata subsp.
            lyrata] gi|297329394|gb|EFH59813.1| hypothetical protein
            ARALYDRAFT_479993 [Arabidopsis lyrata subsp. lyrata]
          Length = 354

 Score =  327 bits (839), Expect = 5e-87
 Identities = 186/375 (49%), Positives = 227/375 (60%), Gaps = 12/375 (3%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K   S  +++H LHKE DE  CP+CMDHPHNAVLLLCSS+DKGCR YICD+SYRHS
Sbjct: 1    MAGVKRKLSTESDVHALHKELDEVSCPVCMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHN 828
            NCLDRFKK    SP                        T E    +AS  NN    +EH 
Sbjct: 61   NCLDRFKKLHSESPNDP---------------------TPEGN--LASRENNNESLNEHG 97

Query: 827  SLNENS-NGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNL 651
            + + +S +  ST R    +  +     R  E            E  SE I       +NL
Sbjct: 98   TASRSSFHRESTNRGSAWDSESLRRRRRVDE------------EEQSEDI-------TNL 138

Query: 650  KCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVD 471
            KCPLCRG V+GWK+V+E R YLDLK RSC  +SC F+GNY +LRRHARR HPTTRP+  D
Sbjct: 139  KCPLCRGTVLGWKVVEEVRTYLDLKNRSCSRESCSFTGNYQDLRRHARRTHPTTRPSDTD 198

Query: 470  PSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPLL 291
            PSR+RAWR+LE+Q+EYGDIVSAIRSAMPGA+V+GDYVIE+GD F G+RE  +G G   L 
Sbjct: 199  PSRERAWRHLENQREYGDIVSAIRSAMPGAVVVGDYVIENGDRFSGERE--TGNGGSDLW 256

Query: 290  TSFILYQMIRPFAPVSDSRPSRG------LSRAWRRYHRSSGTFSERRNLWGENLLGLHE 129
            T+ +L+QMI        S    G       SRAWR + RSS   S+RR LWGENLLGL E
Sbjct: 257  TTLVLFQMIGSLDNGGSSASGSGGGSRSHRSRAWRNHRRSS---SDRRYLWGENLLGLQE 313

Query: 128  -----DDDDWNLASD 99
                 DD++ ++ +D
Sbjct: 314  EHNNNDDEELHMQND 328


>ref|XP_004290229.1| PREDICTED: uncharacterized protein LOC101300301 isoform 2 [Fragaria
            vesca subsp. vesca]
          Length = 385

 Score =  327 bits (837), Expect = 8e-87
 Identities = 175/369 (47%), Positives = 230/369 (62%), Gaps = 12/369 (3%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K      +E+  L+KE D   CPICMDHPHNAVLLLCSS+DKGCR YICD+SYRHS
Sbjct: 1    MAGVKRRIDTGSEIRALYKELDAVSCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHN 828
            NCLDRFKK   N+                +   P + + + +T P  +   +L EA+   
Sbjct: 61   NCLDRFKKLRENNT----------NSQSLVSSLPTNHHGSHNT-PDMAFGTDLNEANGSP 109

Query: 827  SLNENSNGSSTG-----RIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLT 663
            +L E +  +S       +  V++D N   P+   E  G         E + E +  G L 
Sbjct: 110  NLIEGNAVTSANIPGQPQERVIQDLNM--PLLPEELMG-----VADSESFQERVEHGELD 162

Query: 662  DSN-------LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARR 504
              N       LKCPLCRG ++GW++V++ R+YL+LK+RSC  ++C FSGNY ELRRHARR
Sbjct: 163  VENSSESNLSLKCPLCRGAILGWEVVEDCRKYLNLKKRSCSREACSFSGNYQELRRHARR 222

Query: 503  IHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRE 324
            +HP TRP+ +DPSR+RAWR+LEHQ+E+GD+VSAI SA+PGA+V+GDYVIE+GD   G  E
Sbjct: 223  VHPATRPSDIDPSRERAWRHLEHQREFGDVVSAIHSAIPGAVVVGDYVIENGDRLGGGGE 282

Query: 323  NNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENL 144
            + +GE NGP  T+  L+QMI            R  +RAW R+ RS+G  SERR LWGENL
Sbjct: 283  SGTGEANGPWWTTMFLFQMI---GSADRGGEPRARARAWPRHRRSAGALSERRLLWGENL 339

Query: 143  LGLHEDDDD 117
            LGL +DD+D
Sbjct: 340  LGLQDDDED 348


>gb|AFW57825.1| hypothetical protein ZEAMMB73_396034 [Zea mays]
            gi|413917894|gb|AFW57826.1| hypothetical protein
            ZEAMMB73_396034 [Zea mays]
          Length = 375

 Score =  326 bits (836), Expect = 1e-86
 Identities = 183/360 (50%), Positives = 226/360 (62%), Gaps = 3/360 (0%)
 Frame = -3

Query: 1157 NTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHSNCLDRFKKYF 978
            +T+   LHKEWD+ LCPICMDHPHNAVLLLCSS+DKGCR YICD+SYRHSNCLDRFKK  
Sbjct: 13   HTDTAALHKEWDDVLCPICMDHPHNAVLLLCSSHDKGCRSYICDTSYRHSNCLDRFKKMK 72

Query: 977  RN---SPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHNSLNENSN 807
             N   SP Q            ++VQR     T ES      L  +++  DE +   + S+
Sbjct: 73   VNDEDSPSQPSSSMPRGTGNQNVVQRSRFGLTRESP----RLHIDISVPDETSDHQDASH 128

Query: 806  GSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSNLKCPLCRGI 627
              +    E  E+  NE P   LET    +  +      S +        + L CPLCRG 
Sbjct: 129  RPAAIVGEQEENIYNEGPDLTLETHEVGINGSSVSSDVSSL--------NQLLCPLCRGA 180

Query: 626  VIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVVDPSRQRAWR 447
            V GWKI++EARQYLD K R+C  ++C+FSGNY E+RRHARR+HPTTRPA VDPSR+RAW 
Sbjct: 181  VSGWKIIKEARQYLDGKSRACSREACMFSGNYREIRRHARRVHPTTRPADVDPSRRRAWH 240

Query: 446  NLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPLLTSFILYQM 267
            +LEHQ++Y DIVSAIRSAMPGA+VLGDY IE G+ F  DRE  + E +G LLT+F L+ M
Sbjct: 241  HLEHQRDYADIVSAIRSAMPGAVVLGDYAIEGGEIFSHDRE--TSEPSGSLLTTFFLFHM 298

Query: 266  IRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDDWNLASDMAED 87
            +   +P+      RG SR  RR          RR LWGENLLGL  DDDD +  ++  ED
Sbjct: 299  LSS-SPIRSGDEPRGTSRGLRR--------QRRRYLWGENLLGLQYDDDDDDDDNEGEED 349


>ref|XP_007140563.1| hypothetical protein PHAVU_008G123200g [Phaseolus vulgaris]
            gi|561013696|gb|ESW12557.1| hypothetical protein
            PHAVU_008G123200g [Phaseolus vulgaris]
          Length = 385

 Score =  325 bits (833), Expect = 2e-86
 Identities = 181/385 (47%), Positives = 239/385 (62%), Gaps = 18/385 (4%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K     ++++H LHKE DE  CPICMDHPHNAVLLLCSS++KGCR YICD+SYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNES--TFPVASLRNNLAEADE 834
            NCLDRFKK   NS                       V TN S  +F +    N   ++D 
Sbjct: 61   NCLDRFKKMRDNSKENENLPSSL-------------VNTNNSGNSFDI----NITMQSDM 103

Query: 833  H--NSLNENSNGS--STGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYS-----EMI 681
            H  N L+EN   +  S G  +     + ++P R+L+     +LET   E        E +
Sbjct: 104  HDVNELHENEINTLLSVGLAQGSRQGDAQDPSRHLDPHDEGILETADSETLQDRAVLEDL 163

Query: 680  GGGNLTDSNLK--CPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHAR 507
            G  N ++S LK  CPLCRG V+ W++ +EAR YL++K+RSC  DSC F G Y+ELRRHAR
Sbjct: 164  GADNSSESKLKLKCPLCRGAVLSWEVDEEARNYLNVKKRSCSRDSCSFVGGYLELRRHAR 223

Query: 506  RIHPTTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDG---FP 336
            R+HPT+RP+ +DP+R+RAWR+ E Q+EYGDI+SAI+SAMPGA+++GDYV+E+GDG     
Sbjct: 224  RVHPTSRPSDIDPTRERAWRHFERQREYGDIMSAIQSAMPGAVLVGDYVLENGDGIGRLS 283

Query: 335  GDRENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLW 156
             +RE N    NGP LT+ IL+Q++   + +   R  R  +  W R+ RSS     RR LW
Sbjct: 284  DEREGNISNANGPWLTTTILFQVMD--STIEIVREPRAHASTWSRHRRSS---ERRRYLW 338

Query: 155  GENLLGLHEDD--DDWNLASDMAED 87
            GENLLGL+E+D  DD  + SD  ED
Sbjct: 339  GENLLGLNENDIEDDLRIFSDAGED 363


>ref|XP_002301572.1| hypothetical protein POPTR_0002s22380g [Populus trichocarpa]
            gi|566159410|ref|XP_006386811.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159412|ref|XP_006386812.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|566159414|ref|XP_006386813.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|222843298|gb|EEE80845.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345588|gb|ERP64608.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345589|gb|ERP64609.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
            gi|550345590|gb|ERP64610.1| hypothetical protein
            POPTR_0002s22380g [Populus trichocarpa]
          Length = 368

 Score =  324 bits (830), Expect = 5e-86
 Identities = 169/359 (47%), Positives = 231/359 (64%), Gaps = 2/359 (0%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K   + ++++H LHKE DE  CPIC+D PHNAVLLLCSS +KGC+ YICD+SYRHS
Sbjct: 1    MAALKRRLNTDSDIHALHKELDEVSCPICLDRPHNAVLLLCSSNEKGCKSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVA-SLRNNLAEADEH 831
            NCLD+FKK   NS                    P++  ++ +T   + +LR +  + +E+
Sbjct: 61   NCLDQFKKSRGNSRSNATLQS----------SMPINSVSSSTTTDASMTLRTHAFDGNEN 110

Query: 830  NSLNENSNGSSTGRIEVLEDNNN-EEPVRYLETQGNSVLETGGLEPYSEMIGGGNLTDSN 654
            ++LNE SN +     E L D+ + +E + +     NS        P   +  G       
Sbjct: 111  HNLNEISNDTFVRLPEELVDSESVQERIEHEGVNANS--------PELSLSPG------- 155

Query: 653  LKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHPTTRPAVV 474
              CPLCRG ++GW++V EAR+YL+LK+RSC  +SC FSGNY ELRRHARR+HPT RP+ +
Sbjct: 156  --CPLCRGTILGWEVVDEARKYLNLKKRSCSRESCSFSGNYQELRRHARRVHPTIRPSDI 213

Query: 473  DPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDGFPGDRENNSGEGNGPL 294
            DPSR+RAWR LEHQ+EYGDIVSA+ SAMPGA+V+GDY+IE+GD    +RE+ + E N P 
Sbjct: 214  DPSRERAWRCLEHQREYGDIVSAVHSAMPGAVVVGDYIIENGDRLSVERESRTNEVNAPW 273

Query: 293  LTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGENLLGLHEDDDD 117
             T+F  +QMI     +  +   R  SRAW R+ +S+ T ++RR LWGENLLGLH++D D
Sbjct: 274  WTTFFFFQMI---GSIDGAAEPRTWSRAWTRHRQSAETLADRRFLWGENLLGLHDNDAD 329


>ref|XP_006586264.1| PREDICTED: uncharacterized protein LOC100791202 isoform X1 [Glycine
            max] gi|571474560|ref|XP_006586265.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X2 [Glycine
            max] gi|571474562|ref|XP_006586266.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X3 [Glycine
            max] gi|571474564|ref|XP_006586267.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X4 [Glycine
            max] gi|571474566|ref|XP_006586268.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X5 [Glycine
            max] gi|571474568|ref|XP_006586269.1| PREDICTED:
            uncharacterized protein LOC100791202 isoform X6 [Glycine
            max]
          Length = 350

 Score =  323 bits (829), Expect = 7e-86
 Identities = 183/403 (45%), Positives = 236/403 (58%), Gaps = 15/403 (3%)
 Frame = -3

Query: 1187 MTDTKGSTSINTEMHNLHKEWDEALCPICMDHPHNAVLLLCSSYDKGCRPYICDSSYRHS 1008
            M   K     ++++H LHKE DE  CPICMDHPHNAVLLLCSS++KGCR YICD+SYRHS
Sbjct: 1    MAGVKRRLCSDSDIHALHKELDEVSCPICMDHPHNAVLLLCSSHEKGCRSYICDTSYRHS 60

Query: 1007 NCLDRFKKYFRNSPLQXXXXXXXXXXXXSIVQRPLHVYTNESTFPVASLRNNLAEADEHN 828
            NCLDRFKK                                        +R+N  E     
Sbjct: 61   NCLDRFKK----------------------------------------MRDNFKENQNLP 80

Query: 827  S--LNENSNGSSTGRIEVLEDNNNEEPVRYLETQGNSVLETGGLEPYS-----EMIGGGN 669
            S  +N N++GS  G        + ++P R L+     +LET   E        E +   N
Sbjct: 81   SSLVNTNNSGSRQG--------DAQDPNRLLDQHDEGILETADSENLQDRAVIEDLNADN 132

Query: 668  LTDS--NLKCPLCRGIVIGWKIVQEARQYLDLKQRSCYSDSCLFSGNYVELRRHARRIHP 495
             ++S  NLKCPLCRG V+ WK+V+EAR YL++K+RSC  DSC F G+Y+ELRRHARR+HP
Sbjct: 133  SSESKLNLKCPLCRGAVLNWKVVEEARNYLNMKKRSCSRDSCSFVGDYLELRRHARRVHP 192

Query: 494  TTRPAVVDPSRQRAWRNLEHQQEYGDIVSAIRSAMPGAIVLGDYVIESGDG---FPGDR- 327
            T+RP+ +DP+R+RAWR+ E Q+EYGDIVSAI+SA+PGA+++GDYV+E+GDG    P +R 
Sbjct: 193  TSRPSNIDPTRERAWRHFEDQREYGDIVSAIQSAVPGAVLVGDYVLENGDGIGRLPDERA 252

Query: 326  ENNSGEGNGPLLTSFILYQMIRPFAPVSDSRPSRGLSRAWRRYHRSSGTFSERRNLWGEN 147
            E N G  NGP LT+ IL+QM+   + V   R  R  S AW R+ RS      RR LWGEN
Sbjct: 253  EGNIGNANGPWLTTTILFQMMD--STVEIVREPRAHSSAWTRHRRSD---ERRRYLWGEN 307

Query: 146  LLGLHEDD--DDWNLASDMAEDVXXXXXXXXXXXXXXPDEDLP 24
            LLGLH++D  DD  +  D  ED                +ED P
Sbjct: 308  LLGLHDNDIEDDLRIFRDAGEDASPVPRRRRRLTRTRSNEDQP 350

