BLASTX nr result

ID: Cinnamomum23_contig00001622 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00001622
         (3680 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010274478.1| PREDICTED: uncharacterized protein LOC104609...   458   e-125
ref|XP_010277001.1| PREDICTED: uncharacterized protein LOC104611...   410   e-111
ref|XP_010277003.1| PREDICTED: uncharacterized protein LOC104611...   407   e-110
ref|XP_010920169.1| PREDICTED: uncharacterized protein LOC105044...   313   5e-82
ref|XP_008789379.1| PREDICTED: uncharacterized protein LOC103706...   310   6e-81
emb|CAN81695.1| hypothetical protein VITISV_042576 [Vitis vinifera]   306   6e-80
ref|XP_002276750.3| PREDICTED: uncharacterized protein LOC100245...   305   2e-79
ref|XP_010104398.1| hypothetical protein L484_010350 [Morus nota...   294   4e-76
ref|XP_007048701.1| Uncharacterized protein isoform 1 [Theobroma...   280   5e-72
ref|XP_006852401.1| PREDICTED: uncharacterized protein LOC184421...   279   1e-71
ref|XP_007048702.1| Uncharacterized protein isoform 2 [Theobroma...   272   1e-69
ref|XP_007025362.1| Transcription initiation factor TFIID subuni...   272   2e-69
ref|XP_012091781.1| PREDICTED: uncharacterized protein LOC105649...   267   4e-68
gb|KDP21102.1| hypothetical protein JCGZ_21573 [Jatropha curcas]      266   7e-68
ref|XP_012437373.1| PREDICTED: uncharacterized protein LOC105763...   257   6e-65
gb|KJB49044.1| hypothetical protein B456_008G099200 [Gossypium r...   257   6e-65
gb|KHG24123.1| Protein arginine N-methyltransferase 7 [Gossypium...   254   5e-64
ref|XP_008229123.1| PREDICTED: uncharacterized protein LOC103328...   250   5e-63
ref|XP_008229122.1| PREDICTED: uncharacterized protein LOC103328...   250   5e-63
ref|XP_002533963.1| conserved hypothetical protein [Ricinus comm...   244   5e-61

>ref|XP_010274478.1| PREDICTED: uncharacterized protein LOC104609787 [Nelumbo nucifera]
            gi|720059112|ref|XP_010274479.1| PREDICTED:
            uncharacterized protein LOC104609787 [Nelumbo nucifera]
          Length = 684

 Score =  458 bits (1178), Expect = e-125
 Identities = 295/702 (42%), Positives = 379/702 (53%), Gaps = 10/702 (1%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFASTAGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPMRHPGAV 2508
            MEEKQL+FN PLLSVRRF+S   S+ E+++R+EK Q K PS   +K DLKSGP+R+PGAV
Sbjct: 1    MEEKQLDFNAPLLSVRRFSSITASSGEESKRIEKSQRKIPSFPYHKSDLKSGPVRNPGAV 60

Query: 2507 PFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALTVFK 2328
            PF WEQIPGRP+DG     R            PGR   VKQ SS K  ED N+      +
Sbjct: 61   PFLWEQIPGRPRDGDALQAR----PIEPPKLPPGRAFGVKQQSSNKEPEDPNA-----IR 111

Query: 2327 PPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSRTES 2151
            P  ND   H S  +SS+  N T L+  K   +E+  A+              +TLSRTES
Sbjct: 112  PQAND--IHPSYKISSLDENVTALDNLKESLKEKRDADT-EEDVDEAFTDALETLSRTES 168

Query: 2150 FFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASRRQLVA 1983
            F +NCSV+GLS  DG    PS + S D Q RDFM+ RFLPAA AMA E PQ+ SR+Q + 
Sbjct: 169  FLLNCSVTGLSALDGPNMRPSGTFSTDPQTRDFMLGRFLPAAKAMAEETPQHTSRKQPLP 228

Query: 1982 REPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHV---QXXXXXXXXXXXXXXDNTG 1812
            RE  R+V +V + RR         P +Y  +PY  P     Q              D+TG
Sbjct: 229  REQQRQVKVVSEDRR---------PPQYHYKPYMLPQFPMDQGEEESEDEDEEDGYDDTG 279

Query: 1811 NLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDP 1632
            NL+AKACG+ PRF LKNSFCLLNP+PGMK+++ +P+SSV RKV T +KT  S    E + 
Sbjct: 280  NLSAKACGLFPRFGLKNSFCLLNPIPGMKVRNHVPISSV-RKVGTRVKTGHSRPHMEIND 338

Query: 1631 EQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISP 1452
            E TW+AVYKHKL   L+P G  ++ +K TSESN LT  SDSQTP+GSSPYR      I P
Sbjct: 339  EHTWDAVYKHKLASRLKPSGVLEDETKLTSESNHLTYSSDSQTPDGSSPYR------ILP 392

Query: 1451 YRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPA 1272
            YRNEAP+S FHEG GFLG+P++  +R+ +GL +Y    N  R +L            SP 
Sbjct: 393  YRNEAPRSPFHEGSGFLGIPREVKDRKANGLDSYNKGGNCLRDILFHQNNKQELGSLSPM 452

Query: 1271 IEKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACLREDKQ 1092
            +EKTLYVDSV  +E                                + L   +C+   K 
Sbjct: 453  VEKTLYVDSVHTVETPNSKSSSANSRTLMDTKDKDSEVMGESMMEEENLATGSCIENIKN 512

Query: 1091 VKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLECSKVL 912
            +K L ++ +L PK+     S+L   +E SI+G  +   EGS+       E++S  C +V 
Sbjct: 513  LKILEDKRILDPKIFGVVDSDLPCSAERSILGGQIDRTEGSRQDTFLDQESRSALCKEVP 572

Query: 911  INTSPGSD--MPERLDVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXS 738
             +     D   P   D D G+S                     L R LP          S
Sbjct: 573  TDAKLDFDNSQPLSADNDDGNSCTSSLSALLPPPLPKSPSESWLLRALPSIPSRNPSLRS 632

Query: 737  YLGVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEEL 612
            Y G +F  RKQL + SS DPKWETI K++N    +LR+SEEL
Sbjct: 633  YQGTRFNLRKQLSETSSNDPKWETIDKSSNTNADYLRYSEEL 674


>ref|XP_010277001.1| PREDICTED: uncharacterized protein LOC104611577 isoform X1 [Nelumbo
            nucifera] gi|720068103|ref|XP_010277002.1| PREDICTED:
            uncharacterized protein LOC104611577 isoform X1 [Nelumbo
            nucifera] gi|720068111|ref|XP_010277004.1| PREDICTED:
            uncharacterized protein LOC104611577 isoform X1 [Nelumbo
            nucifera]
          Length = 689

 Score =  410 bits (1053), Expect = e-111
 Identities = 276/705 (39%), Positives = 375/705 (53%), Gaps = 8/705 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFASTAGSNTEDNR-RVEKFQPKRPSLGSYKPDLKSGPM 2526
            MLKNLMEEKQL+FN PLLSVRRFAS + S+  D R R+ K QPK PSL  YK +LKSGP+
Sbjct: 2    MLKNLMEEKQLDFNAPLLSVRRFASASPSSEGDERKRIVKSQPKIPSLPYYKSELKSGPV 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
             +PGAVPF WEQIPGRPKDG G  PR            PGR++DVKQ SS K  ED ++ 
Sbjct: 62   SNPGAVPFLWEQIPGRPKDGGGAQPRATERPPVAPKLPPGRVLDVKQQSSNKEPEDQSAI 121

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTL 2166
               +     +  +++    + ++  +    + KGDA  E  A+              +TL
Sbjct: 122  KAQMDNDCPDHKISYLDNNLIALEKSKESLKKKGDADTEEDAD-------EAFTDALETL 174

Query: 2165 SRTESFFMNCSVSGLSGFDGPSRSTS----MDAQARDFMMDRFLPAATAMASEAPQYASR 1998
            SRTES F+NCSV+G+S +DGP+  +S     D Q RDFM+ RFLPAA A+A+E PQYASR
Sbjct: 175  SRTES-FLNCSVTGMSAWDGPNTRSSGTFLTDPQTRDFMLGRFLPAAKAVAAEMPQYASR 233

Query: 1997 RQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHV--QXXXXXXXXXXXXXX 1824
            +Q +  E  R+   V          G+ +P +Y+ RP                       
Sbjct: 234  KQPLPYEQPRETKKV--------VSGDTRPPQYKYRPNMIQQFPQDEGEEESEDEDEDDY 285

Query: 1823 DNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLS 1644
             +TGNL+A ACG+ PRFCLK SFCLLNPVPGMK+++R+P+SSV RKV   +KTT +    
Sbjct: 286  GDTGNLSANACGLFPRFCLKGSFCLLNPVPGMKVRTRVPVSSV-RKVGKQVKTTYARSHK 344

Query: 1643 EPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGD 1464
            E   E +W+AVYKHKL   ++  G  ++ SK TS+SN+LT WSDSQTP+ S P R     
Sbjct: 345  ESKDEHSWDAVYKHKLASRIQRTGVLEDESKLTSQSNRLTYWSDSQTPDESPPNR----- 399

Query: 1463 GISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXX 1284
             ISP R    QS F EG GFLG+P++  N + +G+ ++       R +L           
Sbjct: 400  -ISPCRVGTRQSSFREGSGFLGIPEEVKNLKANGIDSHNKDHKSLREILFHQNSQIESGS 458

Query: 1283 XSPAIEKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACLR 1104
             SP +EKTLYVDSV ++E                                + L  E+  +
Sbjct: 459  VSPTVEKTLYVDSVHIVETSNSKSSSPDAKLLMNSSGKDFETLVEGLVVEENLATESYTK 518

Query: 1103 EDKQVKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLEC 924
                +K  +++ +L P++   A S+L + S+ S +G ++   EG + +D  + + + + C
Sbjct: 519  NINHLKIPDDKGILEPQIFRAADSDLPT-SDRSNLGGNIDRIEGFR-QDSVLDQERFVLC 576

Query: 923  SKVLINTSPGSDMPERL-DVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXX 747
             K LI+     D P+ L   D G SY                    L RTLP        
Sbjct: 577  PKGLIDEKLDFDNPQPLKSEDKGISYTSSFRSPLAPPLPKSPSESWLSRTLP-SIPFRNP 635

Query: 746  XXSYLGVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEEL 612
               Y   +F   KQ +  +SVDPKWETIVK++N    HL FSEEL
Sbjct: 636  SSRYQSTRFNVTKQ-IPETSVDPKWETIVKSSNVNTGHLWFSEEL 679


>ref|XP_010277003.1| PREDICTED: uncharacterized protein LOC104611577 isoform X2 [Nelumbo
            nucifera]
          Length = 681

 Score =  407 bits (1046), Expect = e-110
 Identities = 274/704 (38%), Positives = 374/704 (53%), Gaps = 8/704 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFASTAGSNTEDNR-RVEKFQPKRPSLGSYKPDLKSGPM 2526
            MLKNLMEEKQL+FN PLLSVRRFAS + S+  D R R+ K QPK PSL  YK +LKSGP+
Sbjct: 2    MLKNLMEEKQLDFNAPLLSVRRFASASPSSEGDERKRIVKSQPKIPSLPYYKSELKSGPV 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
             +PGAVPF WEQIPGRPKDG G  PR            PGR++DVKQ SS K  ED ++ 
Sbjct: 62   SNPGAVPFLWEQIPGRPKDGGGAQPRATERPPVAPKLPPGRVLDVKQQSSNKEPEDQSAI 121

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTL 2166
               +     +  +++    + ++  +    + KGDA  E  A+              +TL
Sbjct: 122  KAQMDNDCPDHKISYLDNNLIALEKSKESLKKKGDADTEEDAD-------EAFTDALETL 174

Query: 2165 SRTESFFMNCSVSGLSGFDGPSRSTS----MDAQARDFMMDRFLPAATAMASEAPQYASR 1998
            SRTES F+NCSV+G+S +DGP+  +S     D Q RDFM+ RFLPAA A+A+E PQYASR
Sbjct: 175  SRTES-FLNCSVTGMSAWDGPNTRSSGTFLTDPQTRDFMLGRFLPAAKAVAAEMPQYASR 233

Query: 1997 RQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHV--QXXXXXXXXXXXXXX 1824
            +Q +  E  R+   V          G+ +P +Y+ RP                       
Sbjct: 234  KQPLPYEQPRETKKV--------VSGDTRPPQYKYRPNMIQQFPQDEGEEESEDEDEDDY 285

Query: 1823 DNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLS 1644
             +TGNL+A ACG+ PRFCLK SFCLLNPVPGMK+++R+P+SSV RKV   +KTT +    
Sbjct: 286  GDTGNLSANACGLFPRFCLKGSFCLLNPVPGMKVRTRVPVSSV-RKVGKQVKTTYARSHK 344

Query: 1643 EPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGD 1464
            E   E +W+AVYKHKL   ++  G  ++ SK TS+SN+LT WSDSQTP+ S P R     
Sbjct: 345  ESKDEHSWDAVYKHKLASRIQRTGVLEDESKLTSQSNRLTYWSDSQTPDESPPNR----- 399

Query: 1463 GISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXX 1284
             ISP R    QS F EG GFLG+P++  N + +G+ ++       R +L           
Sbjct: 400  -ISPCRVGTRQSSFREGSGFLGIPEEVKNLKANGIDSHNKDHKSLREILFHQNSQIESGS 458

Query: 1283 XSPAIEKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACLR 1104
             SP +EKTLYVDSV ++E                                + L  E+  +
Sbjct: 459  VSPTVEKTLYVDSVHIVETSNSKSSSPDAKLLMNSSGKDFETLVEGLVVEENLATESYTK 518

Query: 1103 EDKQVKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLEC 924
                +K  +++ +L P++   A S+L + S+ S +G ++   EG + +D  + + + + C
Sbjct: 519  NINHLKIPDDKGILEPQIFRAADSDLPT-SDRSNLGGNIDRIEGFR-QDSVLDQERFVLC 576

Query: 923  SKVLINTSPGSDMPERL-DVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXX 747
             K LI+     D P+ L   D G SY                    L RTLP        
Sbjct: 577  PKGLIDEKLDFDNPQPLKSEDKGISYTSSFRSPLAPPLPKSPSESWLSRTLP-SIPFRNP 635

Query: 746  XXSYLGVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEE 615
               Y   +F   KQ +  +SVDPKWETIVK++N    HL FSE+
Sbjct: 636  SSRYQSTRFNVTKQ-IPETSVDPKWETIVKSSNVNTGHLWFSEQ 678


>ref|XP_010920169.1| PREDICTED: uncharacterized protein LOC105044068 [Elaeis guineensis]
            gi|743757080|ref|XP_010920176.1| PREDICTED:
            uncharacterized protein LOC105044068 [Elaeis guineensis]
          Length = 720

 Score =  313 bits (803), Expect = 5e-82
 Identities = 265/739 (35%), Positives = 348/739 (47%), Gaps = 42/739 (5%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRF---ASTAGSNT--------EDNRRVEKFQP--KRPSL 2562
            ML+NLME+K+L+F+ PLLSVRR    A+ AG++T        + +R+    QP  +R SL
Sbjct: 1    MLRNLMEDKRLDFDAPLLSVRRLSAGAAAAGASTAPSTSKLEDGHRKAAAGQPPTRRSSL 60

Query: 2561 GSYKPDLKSGPMRHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQP 2382
              +K DLKSGP+ +PG +PF WEQ PG+PKD                     RI+  K+ 
Sbjct: 61   PFHKSDLKSGPVGNPGVIPFVWEQTPGQPKDEVSSSSVAVGRLPMALKLPADRILKEKEA 120

Query: 2381 SSAKSVEDGNSKALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGD---AREEHSA--- 2220
               ++    + +  T  +  +       + + S        E PKG     +EE      
Sbjct: 121  DGPRATAGASVRVGTSVRTQKAA-----TQIASDESTEKAPEIPKGGEEKVKEEEMKQKP 175

Query: 2219 ------NVXXXXXXXXXXXXXDTLSRTESFFMNCSVSGLSGFDG---PSRSTSMDAQARD 2067
                  N              DTLSRTESFFMNCSVSGLSG      PS S S D Q RD
Sbjct: 176  VPADRRNDEDDDEDEAFSDALDTLSRTESFFMNCSVSGLSGIPESAMPSGSFSTDPQVRD 235

Query: 2066 FMMDRFLPAATAMASEAPQYASRRQLV-AREPVRKVN---IVDQYRRPRPFGGNKQPMRY 1899
            FMM RFLPAA AMA+ +PQY  R+    AREP  +     +   +RRP P    K+P   
Sbjct: 236  FMMGRFLPAAQAMATGSPQYTFRKGTPPAREPPTRPAERVVSRDHRRPLPLPYQKRPNFV 295

Query: 1898 QSRPYKAPHVQXXXXXXXXXXXXXXDNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQ 1719
            Q   Y   H +              D T  L +KACG+LPRFC+K+SFCLLNPVPGMK++
Sbjct: 296  QQ--YAQEH-EGGDSYDDEEEEEDCDETDRLPSKACGLLPRFCVKSSFCLLNPVPGMKVR 352

Query: 1718 SRMPLSSVRRKVPTHIKTTGSEHLSEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSE 1539
             R+P    RR     IKT     L E   E +WEAVYKHKL    +P    D  SK TSE
Sbjct: 353  PRLPAPLGRRIGNPRIKTFHHGSLGEAGDEDSWEAVYKHKLGQRYQPQ-VEDGRSKSTSE 411

Query: 1538 SNQLTLWSDSQTPEGSSPYRHSTGDGISPYRNEAPQSLFHEGIGFLGVPKQG-NNRRTDG 1362
            S QLT WSDS T +GSSP R STG GISP  NEAP   F EG GFLGVPK+G  + +TDG
Sbjct: 412  SKQLTYWSDSPTADGSSPCRRSTGGGISPNPNEAPPLPF-EGKGFLGVPKRGRKSSKTDG 470

Query: 1361 LVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIEKTLYVDSVCMLEXXXXXXXXXXXXXXXX 1182
              +      +   M             SPA+EKTLYVDSV MLE                
Sbjct: 471  SDSCERDGENYWEMTPPQSSQQGSGSRSPALEKTLYVDSVNMLETSDSNSSSLYIATDTR 530

Query: 1181 XXXXXXXXXXXXXXXXKRLTLEACLREDKQVKNLNERDLLPPKMPDTAKSNLLSFSEDSI 1002
                            +R      ++E+  VK+ +E + L PK  +  +  L   SE S 
Sbjct: 531  VTLNSSEKDSEVGRDTQR------MQENSAVKS-HEENALQPKDSEVVELGLPFCSEKSD 583

Query: 1001 VGA--SLHSREGSKDKDGFVPEAKSLECSKVLIN-TSPGSDMP------ERLDVDGGDSY 849
             G     ++ + + D+DG +P  +      +L N  + G  +P       + DV    S 
Sbjct: 584  HGEMDGNNNIKHNADRDGPLPSGE----GDILKNDVNDGGPLPLEEGALHKTDVSSLQSL 639

Query: 848  AXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSYLGVQFRSRKQLLKASSVDPKWE 669
                                  RTLP          S+LG+QF+ RKQ  +ASS + K +
Sbjct: 640  LPPPLPKSPSESWLF-------RTLP-SVSSKNPPQSFLGLQFQPRKQAFQASSTNQKRD 691

Query: 668  TIVKTNNAQQRHLRFSEEL 612
            +  K + +  R  +F+E L
Sbjct: 692  SNAKPSVSHHRRRQFAEVL 710


>ref|XP_008789379.1| PREDICTED: uncharacterized protein LOC103706890 [Phoenix dactylifera]
            gi|672131636|ref|XP_008789380.1| PREDICTED:
            uncharacterized protein LOC103706890 [Phoenix
            dactylifera] gi|672131638|ref|XP_008789381.1| PREDICTED:
            uncharacterized protein LOC103706890 [Phoenix
            dactylifera]
          Length = 719

 Score =  310 bits (794), Expect = 6e-81
 Identities = 267/759 (35%), Positives = 343/759 (45%), Gaps = 62/759 (8%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRF---ASTAGSNT-------EDNRR---VEKFQPKRPSL 2562
            ML+NLME+K+L+F+ PLLSVRR    A+ AG++T       ED+ R     K  P+R SL
Sbjct: 1    MLRNLMEDKRLDFDAPLLSVRRLSAGAAAAGASTAPCTSKSEDSDRKAAAGKPPPRRSSL 60

Query: 2561 GSYKPDLKSGPMRHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQP 2382
              +K DLKSGP+ +PG +PF WEQ PG+PKDG                    RI++ ++ 
Sbjct: 61   PFHKSDLKSGPVGNPGVIPFVWEQTPGQPKDGVSSGSIAVGRPPMVSKLPSDRILNERES 120

Query: 2381 SSAKSVEDGNSKALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGD---AREEHSA--- 2220
               ++    + +  T  +  +           S  G     E PKG     +EE      
Sbjct: 121  YRPRTTAGASVRVGTSIRTQKAVTFA------SDEGTKKAPESPKGGEEKVKEEEEKQKP 174

Query: 2219 ------NVXXXXXXXXXXXXXDTLSRTESFFMNCSVSGLSGFDG---PSRSTSMDAQARD 2067
                  N              DTLSRTESFFMNCSVSGLSG      PS S S D Q RD
Sbjct: 175  VPADRHNDGDDDEDEAFSDALDTLSRTESFFMNCSVSGLSGIPESAMPSGSFSTDPQVRD 234

Query: 2066 FMMDRFLPAATAMASEAPQYASRRQL-VAREPVRKVN---IVDQYRRPRPFGGNKQPMRY 1899
            FMM RFLPAA AMA+ +PQY  R+   +AREP  +     +   +RR         P+ Y
Sbjct: 235  FMMGRFLPAAQAMATGSPQYTFRKAASLAREPPMRPAERFVSGDHRR-------LLPLPY 287

Query: 1898 QSRP-----YKAPHVQXXXXXXXXXXXXXXDNTGNLTAKACGMLPRFCLKNSFCLLNPVP 1734
            Q RP     Y   H +                T +L +KACG+LPR CLK+SFCLLNPVP
Sbjct: 288  QKRPNFGLQYAQKHGEGDSYDDEEEAEDCD-ETDHLPSKACGLLPRLCLKSSFCLLNPVP 346

Query: 1733 GMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDPEQTWEAVYKHKLLCGLEPPGTYDNGS 1554
            GMK++ R+P    RR     IKT       +   E +WEAV+KHKL    +P    D  S
Sbjct: 347  GMKVRGRLPAPPGRRIGGPRIKTFHHGSFGQDGDEDSWEAVHKHKLGQRYQPQ-VEDGRS 405

Query: 1553 KPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISPYRNEAPQSLFHEGIGFLGVPKQG-NN 1377
            + TSES QLT WSDS T +GSSP RHS G GISPYRNEAP   F E  GFLGVPK+G  +
Sbjct: 406  RSTSESKQLTYWSDSPTADGSSPCRHSAGGGISPYRNEAPPFPF-ERKGFLGVPKRGRKS 464

Query: 1376 RRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIEKTLYVDSVCMLEXXXXXXXXXXX 1197
             +TDG         +   M             SPA+EKTLYVDSV M E           
Sbjct: 465  SKTDGSDLCERDGENYWEMTPSQSSQQGSGSRSPALEKTLYVDSVNMPETPDSNSSSLNI 524

Query: 1196 XXXXXXXXXXXXXXXXXXXXXKRLTLEACLREDKQVKNLNERDLLPPKMPDTAKSNLLSF 1017
                                 +R+       E+      +E + L PK+    +  L   
Sbjct: 525  ATGTRAMLNSTEKDYEVGRERQRM-------EENVAVKTHEENALQPKVSVVVEPGLPFC 577

Query: 1016 SEDSIVG---------------ASLHSREGSKDK-----DGFVP----EAKSLECSKVLI 909
            SE S  G                 LH+ EG   K     DG +P        ++ S +L 
Sbjct: 578  SERSDHGEMDGNNNIKHNADGDGPLHTEEGDIIKNDVNDDGPLPLEEGARHKIDVSSLL- 636

Query: 908  NTSPGSDMPERLDVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSYLG 729
                 S +P  L     +S+                      RTLP          S+LG
Sbjct: 637  -----SLLPPPLPKSPSESWLF--------------------RTLP-SVSSKNLPQSFLG 670

Query: 728  VQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEEL 612
            +QF+ RKQ  +ASS D K +T  K + +  R  RF+E L
Sbjct: 671  LQFQPRKQAFQASSTDQKQDTNAKPSVSHHRQRRFAEVL 709


>emb|CAN81695.1| hypothetical protein VITISV_042576 [Vitis vinifera]
          Length = 1185

 Score =  306 bits (785), Expect = 6e-80
 Identities = 250/706 (35%), Positives = 341/706 (48%), Gaps = 9/706 (1%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFAST-AGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPMRHPGA 2511
            ME+KQLNFN PLLSVRRF+ST A +  E  R+ +        L +YK +LKSGP+R+PGA
Sbjct: 1    MEDKQLNFNQPLLSVRRFSSTVASTEVESKRKNDSSLSNILPLPTYKSELKSGPVRNPGA 60

Query: 2510 VPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALTVF 2331
            VPF WEQ PGRPKD S                 PGRI++ KQ    K  +D       + 
Sbjct: 61   VPFIWEQTPGRPKDES-----KPQIPPTXPKLPPGRILNTKQRPPDKVSKD------PIV 109

Query: 2330 KPPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSRTE 2154
               Q  N+  +S  VSS+  N T LE  K    ++ S+               DTLSR+E
Sbjct: 110  AGTQTANILSNSRNVSSLDENVTKLENFKEGVEDKGSSG--SEDGDVAYLDALDTLSRSE 167

Query: 2153 SFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASRRQLV 1986
            SFF+NCSVSGLSG DG    PS + S D Q RDFMM RFLPAA AMASE P YASRRQ V
Sbjct: 168  SFFLNCSVSGLSGLDGPDVKPSGTFSTDPQTRDFMMGRFLPAAKAMASETPPYASRRQPV 227

Query: 1985 A-REPVRKVNIVDQYRRPR-PFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXDNTG 1812
            A R+PV +     Q R+ +    G+++P  YQ R   + H                  T 
Sbjct: 228  AQRQPVAQA----QPRQVKNVVSGDRRPPLYQYRLNVSSHYAQDKGREESEDEDNYVETE 283

Query: 1811 NLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDP 1632
             L+AK CG+ PRF LKNSFCL+NPV  M +Q+R+P SS+R    T  + + S+  +  + 
Sbjct: 284  LLSAKVCGLFPRFGLKNSFCLMNPVLRMGVQARVPASSLR---ATRARFSYSDASTLTEN 340

Query: 1631 EQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISP 1452
            + +   V   K   GL+     +   K  +ES++     DSQ P+GSS Y    G G+ P
Sbjct: 341  KHS-RNVVNEKKSGGLQRSKLQELKRKEENESSKTNYKXDSQKPDGSSLYMRLQGGGMLP 399

Query: 1451 YRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPA 1272
            YR+++  S F+E  GF G+ +   +   DG  ++       R +L            SP 
Sbjct: 400  YRSDSLLSHFNEEKGFHGIHEXPMSLGVDGFGSHQQGQKIFRELL-ASSPQRESGLESPT 458

Query: 1271 IEKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACLREDKQ 1092
            +EKTLY+DSV ++E                                   ++E+ L++ K 
Sbjct: 459  VEKTLYIDSVHIVEPRNSNSSRSDMKGLSDTRSDFEILGKSSTP-----SMESSLQDIKH 513

Query: 1091 VKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLECSKVL 912
            +   +E     PK+ D+  SNLL     S     +  R+G    D  + ++ +L+  +VL
Sbjct: 514  LSIADEEGKSQPKILDSMGSNLLFSCVKSDQEVQMDQRKGFSSSDPIL-DSMTLDSPEVL 572

Query: 911  INTSPGSDMPERLDVDGGDSY-AXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSY 735
             N +   +     + D  + +                     L RTLP          S+
Sbjct: 573  DNRNLDDENHRPSEADSLEKFHDSHSELPLPPPLPKSPSESWLSRTLP--SASSRNSQSH 630

Query: 734  LGVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEELKIHSH 597
                   R Q  K SS DPKWETIVKT+NA + HLRFSEE +IH H
Sbjct: 631  FATWTSPRNQASKTSSPDPKWETIVKTSNAHKGHLRFSEETEIHIH 676


>ref|XP_002276750.3| PREDICTED: uncharacterized protein LOC100245463 [Vitis vinifera]
            gi|731409014|ref|XP_010657043.1| PREDICTED:
            uncharacterized protein LOC100245463 [Vitis vinifera]
            gi|731409016|ref|XP_010657044.1| PREDICTED:
            uncharacterized protein LOC100245463 [Vitis vinifera]
            gi|731409018|ref|XP_010657045.1| PREDICTED:
            uncharacterized protein LOC100245463 [Vitis vinifera]
          Length = 684

 Score =  305 bits (781), Expect = 2e-79
 Identities = 249/706 (35%), Positives = 341/706 (48%), Gaps = 9/706 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFAST-AGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            +L N ME+KQLNFN PLLSVRRF+ST A +  E  R+ +        L +YK +LKSGP+
Sbjct: 2    LLNNPMEDKQLNFNQPLLSVRRFSSTVASTEVESKRKNDSSLSNILPLPTYKSELKSGPV 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PGAVPF WEQ PGRPKD S                 PGRI++ KQ    K  +D    
Sbjct: 62   RNPGAVPFIWEQTPGRPKDES-----KPQIPPTTPKLPPGRILNTKQRPPDKVSKD---- 112

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDT 2169
               +    Q  N+  +S  VSS+  N T LE  K    ++ S+               DT
Sbjct: 113  --PIVAGTQTANILSNSRNVSSLDENVTKLENFKEGVEDKGSSG--SEDGDVAYLDALDT 168

Query: 2168 LSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYAS 2001
            LSR+ESFF+NCSVSGLSG DG    PS + S D Q RDFMM RFLPAA AMASE P YAS
Sbjct: 169  LSRSESFFLNCSVSGLSGLDGPDVKPSGTFSTDPQTRDFMMGRFLPAAKAMASETPPYAS 228

Query: 2000 RRQLVA-REPVRKVNIVDQYRRPR-PFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXX 1827
            RRQ VA R+PV +     Q R+ +    G+++P  YQ R   + H               
Sbjct: 229  RRQPVAQRQPVAQA----QPRQVKNVVSGDRRPPLYQYRLNVSSHYAQDKGREESEDEDN 284

Query: 1826 XDNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHL 1647
               T  L+AK CG+ PRF LKNSFCL+NPV  M +Q+R+P SS+R    T  + + S+  
Sbjct: 285  YVETELLSAKVCGLFPRFGLKNSFCLMNPVLRMGVQARVPASSLR---ATRARFSYSDAS 341

Query: 1646 SEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTG 1467
            +  + + +   V   K   GL+     +   K  +ES++    SDSQ P+GSS Y    G
Sbjct: 342  TLTENKHS-RNVVNEKKSGGLQRSKLQELKRKEENESSKTNYKSDSQKPDGSSLYMRLQG 400

Query: 1466 DGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXX 1287
             G+ PYR+++  S F+E  GF G+ +   +   DG  ++       R +L          
Sbjct: 401  GGMLPYRSDSLLSHFNEEKGFHGIHEAPMSLGVDGFGSHQQGQKIFRELL-ASSPQRESG 459

Query: 1286 XXSPAIEKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACL 1107
              SP +EKTLY+DSV ++E                                   ++E+ L
Sbjct: 460  LESPTVEKTLYIDSVHIVEPRNSNSSRSDMKGLSDTRSDFEILGKSSTP-----SMESSL 514

Query: 1106 REDKQVKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLE 927
            ++ K +   +E     PK+ D+  SNLL     S     +  R+G    D  + ++ +L+
Sbjct: 515  QDIKHLSIADEEGKSQPKILDSMGSNLLFSCVKSDQEVQMDQRKGFSSSDPIL-DSMTLD 573

Query: 926  CSKVLINTSPGSDMPERLDVDGGDSY-AXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXX 750
              +VL N +   +     + D  + +                     L RTLP       
Sbjct: 574  SPEVLDNRNLDDENHRPSEADSLEKFHDSHSELPLPPPLPKSPSESWLSRTLP--SASSR 631

Query: 749  XXXSYLGVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEEL 612
               S+       R Q  K SS DPKWETIVKT+NA + HLRFSE+L
Sbjct: 632  NSQSHFATWTSPRNQASKTSSPDPKWETIVKTSNAHKGHLRFSEKL 677


>ref|XP_010104398.1| hypothetical protein L484_010350 [Morus notabilis]
            gi|587912410|gb|EXC00243.1| hypothetical protein
            L484_010350 [Morus notabilis]
          Length = 775

 Score =  294 bits (752), Expect = 4e-76
 Identities = 210/601 (34%), Positives = 294/601 (48%), Gaps = 5/601 (0%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFASTAGSNTEDNRR-VEKFQPKRPSLGSYKPDLKSGPMRHPGA 2511
            ME+KQL+FN PLLSVRRF+S A     DN+R  +K  PK P L  YK +LKSGP+R+PG 
Sbjct: 1    MEDKQLDFNQPLLSVRRFSSPAVPPEADNKRKTDKPLPKLPPLPVYKSELKSGPVRNPGT 60

Query: 2510 VPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALTVF 2331
            VPF WE+ PG+PKD     P+            PGR+++V+Q +S     D  SK  T+ 
Sbjct: 61   VPFVWERTPGKPKDEKTSRPQAPEQPPIAPKLPPGRVLNVRQEAS-----DKGSKG-TIA 114

Query: 2330 KPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSRTES 2151
               Q  ++   S  VS +   +  E        E  ++              DTLSR+ES
Sbjct: 115  TQSQTRSILSSSKDVSDLDKRSFTEDKISKLETEDKSSSGSGDGDETYLDALDTLSRSES 174

Query: 2150 FFMNCSVSGLSGFDGP----SRSTSMDAQARDFMMDRFLPAATAMASEAPQYASRRQLVA 1983
            FF+NCS+SG+SG D P    S + S D Q RDFMM RFLPAA  MAS+  QYA R+  V 
Sbjct: 175  FFLNCSISGVSGLDDPDVKPSGTFSTDQQTRDFMMGRFLPAAKVMASDTHQYALRKPQVV 234

Query: 1982 REPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXDNTGNLT 1803
            RE  R++N V    + RP   NK P R        P+ Q              + +  L+
Sbjct: 235  REQPRQINKVVSGDKRRPLNLNK-PNRLP------PYAQELGGEESEDESVTYEGSDILS 287

Query: 1802 AKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDPEQT 1623
             K CG+ PRFCLKNSFCLLNPVPGMK+QS+ P+SSVRR VP +  ++ +    E   E  
Sbjct: 288  DKVCGLFPRFCLKNSFCLLNPVPGMKMQSQFPISSVRR-VPAN--SSSASTCRETKVEHA 344

Query: 1622 WEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISPYRN 1443
               VY+ K +   +         K   +SN +   SDSQ  + SS YRH  G+G+S Y +
Sbjct: 345  EHLVYEQKSMVREQTAELNKGKIKLKYKSNGIEDKSDSQKVDQSSLYRHQQGNGLSLYHS 404

Query: 1442 EAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIEK 1263
               Q    E  GFLG+ ++  N R  G   + +R ++ R +L            SP +EK
Sbjct: 405  GHSQLKLPEQKGFLGIREKKRNSRERGFDIHKSRRSNFRELLNNENTKLEVGSGSPVVEK 464

Query: 1262 TLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKRLTLEACLREDKQVKN 1083
            TLY+DSV  ++                                   ++++ L++ K +  
Sbjct: 465  TLYIDSVHTVKPPSSNSSASDMKSFTDCRGNDVEIPEKSSDMEDTHSVDSSLQDIKCLSV 524

Query: 1082 LNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLECSKVLINT 903
            ++E+    PK   +  S   S S  S +   +H   GS   +  +P++ +L  SKV    
Sbjct: 525  VDEKATTTPKSLQSVDSCFQSCSNKSTLEKQMHMTNGSIQDEYLIPDSFTLMSSKVAAQE 584

Query: 902  S 900
            S
Sbjct: 585  S 585


>ref|XP_007048701.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700962|gb|EOX92858.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 759

 Score =  280 bits (717), Expect = 5e-72
 Identities = 201/502 (40%), Positives = 263/502 (52%), Gaps = 15/502 (2%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFAST-AGSNTEDNRRVEKFQPK--RPSLGSYKPDLKSG 2532
            +LKNLME+KQL+FN PLLSVRRF S  A S++E  ++ +   PK  RP +  YK +LKSG
Sbjct: 32   LLKNLMEDKQLDFNQPLLSVRRFTSPGAASDSECKKKTDTSLPKILRPPI--YKSELKSG 89

Query: 2531 PMRHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGN 2352
            P+R+PG VPF WE+ PGRPK+ S    +            PGRI++ KQ SS K     N
Sbjct: 90   PVRNPGTVPFVWEKTPGRPKEESNSQAQALEQPLLAPRLPPGRILNDKQHSSRKGF---N 146

Query: 2351 SKALTVFKPPQNDNLTHHSLVVSSI-GNATPLERPKGDAREEHSANVXXXXXXXXXXXXX 2175
             K    F P Q   +   S  VSS+  N T  E   GD  E  S+               
Sbjct: 147  GK---TFTPSQTGTVPSCSQKVSSLKRNETKYESSSGDMEETGSSG--SKDSDEAYVDAL 201

Query: 2174 DTLSRTESFFMNCSVSGLSGFDGPSRSTS----MDAQARDFMMDRFLPAATAMASEAPQY 2007
            DT SRTESFF+NCS+SG+SGFDGP    S     D Q RDFMM RFLPAA A+ASE P Y
Sbjct: 202  DTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAAKAVASEIPPY 261

Query: 2006 ASRRQLVAREP---VRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXX 1836
            ASR+Q VAREP   V+KV IVD           KQ   Y S P K P+            
Sbjct: 262  ASRKQPVAREPQRQVKKVVIVD-----------KQQPLYVSSPNKFPNHAQDDWLEESEG 310

Query: 1835 XXXXDNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPL----SSVRRKVPTHIK 1668
                  + N +AK CG+ P+F LK+SFCLLNPVPGMKIQ++ P     S  RR+  +   
Sbjct: 311  EDDYSGSQNSSAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQKPAKPAHSVRRRQAKSSYL 370

Query: 1667 TTGSEHLSEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSS 1488
             +G+E  SE     T + + +             ++ +   S S+ ++  SD Q P+ +S
Sbjct: 371  RSGNETESEYAKAATEKGLTRIS-----RTEELIEDKNNLKSGSSHMSYRSDCQNPDAAS 425

Query: 1487 PYRHSTGDGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXX 1308
              RH  G+ +S Y ++  Q L H+  GFLG+P++  N     +       N+ + +L   
Sbjct: 426  LSRHLQGNVVSSYPSQISQ-LVHQEKGFLGIPEKAKNYGVSSIDPLKKGSNNFQELLALQ 484

Query: 1307 XXXXXXXXXSPAIEKTLYVDSV 1242
                     SP +EKTLYVDSV
Sbjct: 485  SKYQESGLDSPVVEKTLYVDSV 506


>ref|XP_006852401.1| PREDICTED: uncharacterized protein LOC18442121 [Amborella trichopoda]
            gi|548856012|gb|ERN13868.1| hypothetical protein
            AMTR_s00021p00026070 [Amborella trichopoda]
          Length = 758

 Score =  279 bits (714), Expect = 1e-71
 Identities = 200/528 (37%), Positives = 270/528 (51%), Gaps = 46/528 (8%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFASTA-GSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPMRHPGA 2511
            MEEKQL+FN PLLSVRRF+ T+  S   DN+R EK   +  +  +YK DLKSGP+R+PG 
Sbjct: 1    MEEKQLDFNAPLLSVRRFSGTSVTSEVGDNKRSEKLAVQNLTPPTYKSDLKSGPVRNPGT 60

Query: 2510 VPFEWEQIPGRPKDGSGQ-HPRNXXXXXXXXXXXPGRIVDVKQP-SSAKSVEDGNSKALT 2337
            +PF WEQIPGRPKDG     P++           PGR  + K+P    +  E+ +    T
Sbjct: 61   IPFVWEQIPGRPKDGGNDGSPKSLERPPLAPKLPPGRKFNAKKPPKDDEKPENKDIMNAT 120

Query: 2336 VFKPPQNDNLTHHSLVVSSI----------GNATPLERPKGDAREEHSANVXXXXXXXXX 2187
              +P +    ++ S + ++I           ++    +  G++ +E + ++         
Sbjct: 121  RLQPIETSTGSYGSSLKTNIRSFSTSGYHGASSKTNMKSFGNSYKESTNSMALLERKFSN 180

Query: 2186 XXXXD-------------TLSRTESFFMNCSVSGLSGFDGPSRST----SMDAQARDFMM 2058
                              TLS+TES F+NCS+SG+S  DG    T     +D   R FM+
Sbjct: 181  EGGSSDIEDDDVFADALDTLSQTESCFLNCSISGVSALDGQDLKTLDNGGLDLSTRKFMI 240

Query: 2057 DRFLPAATAMASEAPQYA-SRRQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYK 1881
            DRFLPAA AMASE+PQYA SR+  V  EPVR+V  + +        G+    R  +    
Sbjct: 241  DRFLPAARAMASESPQYAPSRKPQVGNEPVRQVTNISR-------DGSPLVTRVPNHYLI 293

Query: 1880 APHVQXXXXXXXXXXXXXXDNTGNLTA----KACGMLPRFCLKNSFCLLNPV---PGMKI 1722
              H+Q              D+ G+ +     K CG+ P + LKNS CLLNPV   P  K 
Sbjct: 294  QKHIQEQQAGYEEEDDDDDDDDGDYSVDSSRKVCGLFP-WRLKNSICLLNPVIHAPRAKT 352

Query: 1721 QSRMPLSSVRRKVPTHIKTTGSEHLSEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTS 1542
              +MPL    R     IKT+    L++ + E TWEAVY+HKL+ G +     ++ SKPTS
Sbjct: 353  SKQMPLRDTSRPADYQIKTSSPVTLTQREQE-TWEAVYRHKLVNGSQTHEVVEDASKPTS 411

Query: 1541 ES--------NQLTLWSDSQTPEGSSPYRHSTGDGISPYRNEAPQSLFHEGIGFLGVPKQ 1386
            +S         Q    SDSQTP+  SPYRHS G GISPYRNEAP+S FHEG+GFLG PK 
Sbjct: 412  DSASTPSVYGKQPNYSSDSQTPDDMSPYRHSMG-GISPYRNEAPRSPFHEGMGFLGFPKT 470

Query: 1385 GNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIEKTLYVDSV 1242
                + D    Y +     RG              SPA EKT+Y+DSV
Sbjct: 471  EKTFKVD---KYSSSTTSHRG------SDRRSGSLSPAAEKTVYIDSV 509


>ref|XP_007048702.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508700963|gb|EOX92859.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 723

 Score =  272 bits (696), Expect = 1e-69
 Identities = 197/497 (39%), Positives = 258/497 (51%), Gaps = 15/497 (3%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFAST-AGSNTEDNRRVEKFQPK--RPSLGSYKPDLKSGPMRHP 2517
            ME+KQL+FN PLLSVRRF S  A S++E  ++ +   PK  RP +  YK +LKSGP+R+P
Sbjct: 1    MEDKQLDFNQPLLSVRRFTSPGAASDSECKKKTDTSLPKILRPPI--YKSELKSGPVRNP 58

Query: 2516 GAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALT 2337
            G VPF WE+ PGRPK+ S    +            PGRI++ KQ SS K     N K   
Sbjct: 59   GTVPFVWEKTPGRPKEESNSQAQALEQPLLAPRLPPGRILNDKQHSSRKGF---NGK--- 112

Query: 2336 VFKPPQNDNLTHHSLVVSSI-GNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSR 2160
             F P Q   +   S  VSS+  N T  E   GD  E  S+               DT SR
Sbjct: 113  TFTPSQTGTVPSCSQKVSSLKRNETKYESSSGDMEETGSSG--SKDSDEAYVDALDTFSR 170

Query: 2159 TESFFMNCSVSGLSGFDGPSRSTS----MDAQARDFMMDRFLPAATAMASEAPQYASRRQ 1992
            TESFF+NCS+SG+SGFDGP    S     D Q RDFMM RFLPAA A+ASE P YASR+Q
Sbjct: 171  TESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAAKAVASEIPPYASRKQ 230

Query: 1991 LVAREP---VRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXD 1821
             VAREP   V+KV IVD           KQ   Y S P K P+                 
Sbjct: 231  PVAREPQRQVKKVVIVD-----------KQQPLYVSSPNKFPNHAQDDWLEESEGEDDYS 279

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPL----SSVRRKVPTHIKTTGSE 1653
             + N +AK CG+ P+F LK+SFCLLNPVPGMKIQ++ P     S  RR+  +    +G+E
Sbjct: 280  GSQNSSAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQKPAKPAHSVRRRQAKSSYLRSGNE 339

Query: 1652 HLSEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHS 1473
              SE     T + + +             ++ +   S S+ ++  SD Q P+ +S  RH 
Sbjct: 340  TESEYAKAATEKGLTRIS-----RTEELIEDKNNLKSGSSHMSYRSDCQNPDAASLSRHL 394

Query: 1472 TGDGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXX 1293
             G+ +S Y ++  Q L H+  GFLG+P++  N     +       N+ + +L        
Sbjct: 395  QGNVVSSYPSQISQ-LVHQEKGFLGIPEKAKNYGVSSIDPLKKGSNNFQELLALQSKYQE 453

Query: 1292 XXXXSPAIEKTLYVDSV 1242
                SP +EKTLYVDSV
Sbjct: 454  SGLDSPVVEKTLYVDSV 470


>ref|XP_007025362.1| Transcription initiation factor TFIID subunit 11, putative [Theobroma
            cacao] gi|508780728|gb|EOY27984.1| Transcription
            initiation factor TFIID subunit 11, putative [Theobroma
            cacao]
          Length = 710

 Score =  272 bits (695), Expect = 2e-69
 Identities = 233/746 (31%), Positives = 324/746 (43%), Gaps = 44/746 (5%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFASTAGSNTEDNRR-VEKFQP-KRPSLGSYKPDLKSGPMRHPG 2514
            MEE++LNFN PLLSVRRF++T+  +  D ++ VE   P +R +L  Y  D+    +  P 
Sbjct: 1    MEERKLNFNAPLLSVRRFSATSAFSDRDKQKIVENPCPNRRHTLPFYNSDVSLDQVTEPV 60

Query: 2513 AVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALTV 2334
            AVPF WEQIPG+ K G     +            PGR++D+ + +  K  E+ N     V
Sbjct: 61   AVPFVWEQIPGKAKGGIEHESQPNKEASGTPRLPPGRVLDIMKYTVEKEFENQN-----V 115

Query: 2333 FKPPQ-----NDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDT 2169
             +P       NDN+T        I      E    D     + +               T
Sbjct: 116  VRPQSEIYSLNDNVTKLDSSNKGINEKCISESETDDDAYSDALD---------------T 160

Query: 2168 LSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYAS 2001
            LS T+S  MNCS+SGLSG  G    PS + S D Q RDFMM RFLPAA AM  E PQYAS
Sbjct: 161  LSPTDSLSMNCSISGLSGSSGLVAKPSGTFSSDPQTRDFMMSRFLPAAKAMTLEMPQYAS 220

Query: 2000 RRQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHV-QXXXXXXXXXXXXXX 1824
            R+Q VA    R+   V          G+++P   Q      PH  Q              
Sbjct: 221  RKQSVAPALPREDKKV--------VVGDRKPPVNQYESVIIPHYNQDVDGEETEDEYDDY 272

Query: 1823 DNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLS 1644
            +++GNL+ KACG+LPR   KNS CLLNPVPG+K+++   + S  R+V    K T  +  S
Sbjct: 273  EDSGNLSRKACGLLPRLSFKNSLCLLNPVPGLKVRTHSSMPST-REVAKPSKATYMKSHS 331

Query: 1643 EPDPEQTWEAVYKHKLLCGLEPPGTYDN----------------------------GSKP 1548
            +   +  W+AV+K+K   G++ P   +N                            G K 
Sbjct: 332  QIIEKHAWDAVHKNKSDSGVQSPQPQENKSDTGVQSPRLPENKLSGGVQSPRLPEIGKKM 391

Query: 1547 TSESNQLTLWSDSQTPEGSSPYRHSTGDGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRT 1368
            T  SNQ T   D Q    S P R      ISPYR E PQS F  G GFLG+PK+      
Sbjct: 392  TCGSNQFTNSGDQQIVNRSPPKRLPGSARISPYRRERPQSPFRGG-GFLGMPKEAEKFNA 450

Query: 1367 DGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIEKTLYVDSVCMLEXXXXXXXXXXXXXX 1188
            + L+ Y    N+ + ++            SPA+EKTLYVD+V   E              
Sbjct: 451  NMLIKYTKSNNNSQELVPYQSTRQGSGALSPAVEKTLYVDTVNFAEIASSNSDSSDTKAP 510

Query: 1187 XXXXXXXXXXXXXXXXXXKRLTLEACLREDKQVKNLNERDLLPPKMPDTAKSNLLSFSED 1008
                              +  T+E+ L++ K +  L+ +D+   ++  +  S+  SFS+ 
Sbjct: 511  MDSMGKHSDTLLVNRMLEESATVESSLQDIKCLNLLDGKDISKYEITGSVYSSRSSFSDK 570

Query: 1007 SIVGASLHSREGSKDKDGFVPEAKSLECSKV----LINTSPGSDMPERLDVDGGDSYAXX 840
              +       +  +   G     KSL   KV     +  S   D+ E    D  ++ A  
Sbjct: 571  PDLKGQAEMMDCFRQNGGL---NKSLGRIKVRADRSLTLSANGDVRE---ADQEENNAGS 624

Query: 839  XXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSYLGVQFRSRKQLLKASSVDPKWETIV 660
                             L   LP          SY G +F  +K+  K S+ D KWETIV
Sbjct: 625  DCSPLPPPLPKTPSESWLWCALPSVTSRNSFSQSYNGTRFYPKKEEPKVSATDTKWETIV 684

Query: 659  KTNNAQQRHLRFSEELKIHSHHGSET 582
            KT+     H+R+SEEL  H    S+T
Sbjct: 685  KTSYLHHDHVRYSEELVTHFSQQSKT 710


>ref|XP_012091781.1| PREDICTED: uncharacterized protein LOC105649674 [Jatropha curcas]
            gi|802786884|ref|XP_012091782.1| PREDICTED:
            uncharacterized protein LOC105649674 [Jatropha curcas]
          Length = 669

 Score =  267 bits (683), Expect = 4e-68
 Identities = 229/704 (32%), Positives = 313/704 (44%), Gaps = 7/704 (0%)
 Frame = -1

Query: 2690 LMEEKQLNFNVPLLSVRRFAS-TAGSNTEDNRRVEKFQ-PKRPSLGSYKPDLKSGPMRHP 2517
            +MEE++LNFN PL+SVRR ++ T  SN    ++ E  Q  KR +L SYK D     +  P
Sbjct: 1    MMEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEP 60

Query: 2516 GAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALT 2337
             AVPF WEQIPGR KDGS   PR            P R +DV      K +ED   +   
Sbjct: 61   VAVPFHWEQIPGRRKDGSKPDPRGCEEASVTPRFTPRRALDV-----VKHIEDKKPEDQV 115

Query: 2336 VFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSRT 2157
             F+P    N        + I N   L+  K    E+   N              DTLS  
Sbjct: 116  AFRPQIQSNS------FNDIANG--LDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGM 167

Query: 2156 ESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASRRQL 1989
            +SF ++CSVSG+SGFD     PS + + D Q RDFMM RFLPAA AM  EAPQYASR+Q 
Sbjct: 168  DSFSVDCSVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQP 227

Query: 1988 VAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXDNTGN 1809
            V+ E  R++  V Q  R  P        R +S    + H Q               N G 
Sbjct: 228  VSGEQPRQIVQVVQRDRTPPVN------RKESFNVPSYH-QDLVDEESEDECDQYVNYGK 280

Query: 1808 LTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDPE 1629
            +  K CG+LP  C+KNS  L+NPVPGMK++++ P+S+  R +    K+  S   S    +
Sbjct: 281  IMTKGCGLLPLLCVKNSLRLVNPVPGMKVRNQSPMSAA-RDIKRMTKSVYSRSQSPTINK 339

Query: 1628 QTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISPY 1449
               + V+K +    ++ P      +K T  SN+ T   D Q    +SP+R S    ISPY
Sbjct: 340  PAKDPVHKKEPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRS--GAISPY 397

Query: 1448 RNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAI 1269
            RNEAPQS F  G GFLGVPK   N + + L  YG   +  + ++            SP  
Sbjct: 398  RNEAPQSPFPIG-GFLGVPKDLENFKANKLNLYGKCYSKSQELVPYHGLRHGSRPLSPTT 456

Query: 1268 EKTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKR-LTLEACLREDKQ 1092
            EKTLYVD+V +                                  +   T+E+     K 
Sbjct: 457  EKTLYVDTVNVAGLLCSNAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIES---TSKD 513

Query: 1091 VKNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLECSKVL 912
            V +LN     P +    A  +LLS         S H  +    +D    E+ +L C    
Sbjct: 514  VTSLN----FPEQKSGDADLSLLS-------DMSTHRDQWDTGED-LSQESLALVCVSTT 561

Query: 911  INTSPGSDMPERLDVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSYL 732
               +   +  +  ++D G++                     L RTLP           Y 
Sbjct: 562  TEGNLNIENDQISNMDIGNAKTGFAQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYR 621

Query: 731  GVQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEELKIHS 600
            G  FRS++Q  K +S   KWE IVK++     H+R+SEEL  H+
Sbjct: 622  GTNFRSKRQDSKTTSTSTKWENIVKSSYLHNDHVRYSEELFPHA 665


>gb|KDP21102.1| hypothetical protein JCGZ_21573 [Jatropha curcas]
          Length = 668

 Score =  266 bits (681), Expect = 7e-68
 Identities = 229/703 (32%), Positives = 312/703 (44%), Gaps = 7/703 (0%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRFAS-TAGSNTEDNRRVEKFQ-PKRPSLGSYKPDLKSGPMRHPG 2514
            MEE++LNFN PL+SVRR ++ T  SN    ++ E  Q  KR +L SYK D     +  P 
Sbjct: 1    MEERKLNFNAPLMSVRRSSTATKPSNVTKGKKFENAQLVKRNTLPSYKSDFNLDQVTEPV 60

Query: 2513 AVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSKALTV 2334
            AVPF WEQIPGR KDGS   PR            P R +DV      K +ED   +    
Sbjct: 61   AVPFHWEQIPGRRKDGSKPDPRGCEEASVTPRFTPRRALDV-----VKHIEDKKPEDQVA 115

Query: 2333 FKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTLSRTE 2154
            F+P    N        + I N   L+  K    E+   N              DTLS  +
Sbjct: 116  FRPQIQSNS------FNDIANG--LDCSKEGVNEKSDFNSENDDDDDLYSDARDTLSGMD 167

Query: 2153 SFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASRRQLV 1986
            SF ++CSVSG+SGFD     PS + + D Q RDFMM RFLPAA AM  EAPQYASR+Q V
Sbjct: 168  SFSVDCSVSGVSGFDSLAVKPSGTFNADPQTRDFMMSRFLPAAKAMTLEAPQYASRKQPV 227

Query: 1985 AREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXDNTGNL 1806
            + E  R++  V Q  R  P        R +S    + H Q               N G +
Sbjct: 228  SGEQPRQIVQVVQRDRTPPVN------RKESFNVPSYH-QDLVDEESEDECDQYVNYGKI 280

Query: 1805 TAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSEPDPEQ 1626
              K CG+LP  C+KNS  L+NPVPGMK++++ P+S+  R +    K+  S   S    + 
Sbjct: 281  MTKGCGLLPLLCVKNSLRLVNPVPGMKVRNQSPMSAA-RDIKRMTKSVYSRSQSPTINKP 339

Query: 1625 TWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDGISPYR 1446
              + V+K +    ++ P      +K T  SN+ T   D Q    +SP+R S    ISPYR
Sbjct: 340  AKDPVHKKEPDNEVQSPRLVGVDNKLTGGSNRFTYARDRQMISRTSPFRRS--GAISPYR 397

Query: 1445 NEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXXXXSPAIE 1266
            NEAPQS F  G GFLGVPK   N + + L  YG   +  + ++            SP  E
Sbjct: 398  NEAPQSPFPIG-GFLGVPKDLENFKANKLNLYGKCYSKSQELVPYHGLRHGSRPLSPTTE 456

Query: 1265 KTLYVDSVCMLEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKR-LTLEACLREDKQV 1089
            KTLYVD+V +                                  +   T+E+     K V
Sbjct: 457  KTLYVDTVNVAGLLCSNAGSSDIKKGGMGPAEKDIKSLLSSREIQETYTIES---TSKDV 513

Query: 1088 KNLNERDLLPPKMPDTAKSNLLSFSEDSIVGASLHSREGSKDKDGFVPEAKSLECSKVLI 909
             +LN     P +    A  +LLS         S H  +    +D    E+ +L C     
Sbjct: 514  TSLN----FPEQKSGDADLSLLS-------DMSTHRDQWDTGED-LSQESLALVCVSTTT 561

Query: 908  NTSPGSDMPERLDVDGGDSYAXXXXXXXXXXXXXXXXXXXLCRTLPXXXXXXXXXXSYLG 729
              +   +  +  ++D G++                     L RTLP           Y G
Sbjct: 562  EGNLNIENDQISNMDIGNAKTGFAQCSLPPSLPKTPSESWLSRTLPTVSSQNPSSHLYRG 621

Query: 728  VQFRSRKQLLKASSVDPKWETIVKTNNAQQRHLRFSEELKIHS 600
              FRS++Q  K +S   KWE IVK++     H+R+SEEL  H+
Sbjct: 622  TNFRSKRQDSKTTSTSTKWENIVKSSYLHNDHVRYSEELFPHA 664


>ref|XP_012437373.1| PREDICTED: uncharacterized protein LOC105763636 [Gossypium raimondii]
            gi|823207534|ref|XP_012437375.1| PREDICTED:
            uncharacterized protein LOC105763636 [Gossypium
            raimondii] gi|823207537|ref|XP_012437376.1| PREDICTED:
            uncharacterized protein LOC105763636 [Gossypium
            raimondii] gi|823207540|ref|XP_012437377.1| PREDICTED:
            uncharacterized protein LOC105763636 [Gossypium
            raimondii] gi|763781974|gb|KJB49045.1| hypothetical
            protein B456_008G099200 [Gossypium raimondii]
          Length = 708

 Score =  257 bits (656), Expect = 6e-65
 Identities = 184/496 (37%), Positives = 251/496 (50%), Gaps = 9/496 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFAS-TAGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            +LKNLME+K+L+FN PLLSVRRF S  AGS +E N++ +    K P    YK +LKSGP+
Sbjct: 2    LLKNLMEDKKLDFNRPLLSVRRFTSQAAGSESEGNKKTDNSLKKVPHPPVYKSELKSGPL 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PG VPF WE+ PGRPK+ S                 PGR +  KQ S        N  
Sbjct: 62   RNPGTVPFVWEKTPGRPKEESNIQTDALDRPPIAPKLPPGRALRDKQQSPR------NGS 115

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDT 2169
                F P Q D     S  V S+  N T  E   G+  E  S+               DT
Sbjct: 116  DAKTFAPYQTDMAPSSSQNVPSLALNETTYECANGEMEETGSSG--SKDSGEAYVDALDT 173

Query: 2168 LSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYAS 2001
            LSR+ESFF+NCS+SG+SG DG    PS + S D Q RDFMM RFLPAA A+ASE P YA+
Sbjct: 174  LSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAAKAVASETPPYAT 233

Query: 2000 RRQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXD 1821
            ++Q +AREP R++         +    +KQ   Y S P K PH Q               
Sbjct: 234  KKQPIAREPPRQIK--------KLVIADKQQPLYASSPNKFPHAQ--DDWSEESEDDCYS 283

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHL-- 1647
            ++ N +   CG+ P+F LKNS CLLNP+P +K Q      SV+     H +   S +L  
Sbjct: 284  DSQNYSVNVCGLFPQFLLKNSLCLLNPIPRVKAQ-----KSVKTAYSDHRREAKSSYLRS 338

Query: 1646 -SEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHST 1470
             +E + E T EA  K +L    +     ++ +   S S++ +  SD + P+G+S +RH  
Sbjct: 339  CNETETEHT-EAAGKKRLTGIAQTEEAIEDKNNLKSGSSKKSYRSDCRNPDGASLFRHFQ 397

Query: 1469 GDGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXX 1290
            G+ +S Y ++    L H+   FLG+P +  N R   +  +     + +  L         
Sbjct: 398  GNNVSSYPSQI-SWLGHQEKRFLGIPDKAKNYRVSSIDPHKQGSKNLQECLASESISQES 456

Query: 1289 XXXSPAIEKTLYVDSV 1242
               SP +EKTLYVDSV
Sbjct: 457  GSASP-VEKTLYVDSV 471


>gb|KJB49044.1| hypothetical protein B456_008G099200 [Gossypium raimondii]
          Length = 717

 Score =  257 bits (656), Expect = 6e-65
 Identities = 184/496 (37%), Positives = 251/496 (50%), Gaps = 9/496 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFAS-TAGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            +LKNLME+K+L+FN PLLSVRRF S  AGS +E N++ +    K P    YK +LKSGP+
Sbjct: 11   LLKNLMEDKKLDFNRPLLSVRRFTSQAAGSESEGNKKTDNSLKKVPHPPVYKSELKSGPL 70

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PG VPF WE+ PGRPK+ S                 PGR +  KQ S        N  
Sbjct: 71   RNPGTVPFVWEKTPGRPKEESNIQTDALDRPPIAPKLPPGRALRDKQQSPR------NGS 124

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDT 2169
                F P Q D     S  V S+  N T  E   G+  E  S+               DT
Sbjct: 125  DAKTFAPYQTDMAPSSSQNVPSLALNETTYECANGEMEETGSSG--SKDSGEAYVDALDT 182

Query: 2168 LSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYAS 2001
            LSR+ESFF+NCS+SG+SG DG    PS + S D Q RDFMM RFLPAA A+ASE P YA+
Sbjct: 183  LSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAAKAVASETPPYAT 242

Query: 2000 RRQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXD 1821
            ++Q +AREP R++         +    +KQ   Y S P K PH Q               
Sbjct: 243  KKQPIAREPPRQIK--------KLVIADKQQPLYASSPNKFPHAQ--DDWSEESEDDCYS 292

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHL-- 1647
            ++ N +   CG+ P+F LKNS CLLNP+P +K Q      SV+     H +   S +L  
Sbjct: 293  DSQNYSVNVCGLFPQFLLKNSLCLLNPIPRVKAQ-----KSVKTAYSDHRREAKSSYLRS 347

Query: 1646 -SEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHST 1470
             +E + E T EA  K +L    +     ++ +   S S++ +  SD + P+G+S +RH  
Sbjct: 348  CNETETEHT-EAAGKKRLTGIAQTEEAIEDKNNLKSGSSKKSYRSDCRNPDGASLFRHFQ 406

Query: 1469 GDGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXX 1290
            G+ +S Y ++    L H+   FLG+P +  N R   +  +     + +  L         
Sbjct: 407  GNNVSSYPSQI-SWLGHQEKRFLGIPDKAKNYRVSSIDPHKQGSKNLQECLASESISQES 465

Query: 1289 XXXSPAIEKTLYVDSV 1242
               SP +EKTLYVDSV
Sbjct: 466  GSASP-VEKTLYVDSV 480


>gb|KHG24123.1| Protein arginine N-methyltransferase 7 [Gossypium arboreum]
          Length = 708

 Score =  254 bits (648), Expect = 5e-64
 Identities = 184/495 (37%), Positives = 251/495 (50%), Gaps = 8/495 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFAS-TAGSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            +LKNLME+KQL+FN PLLSVRRF S  AGS +E N++ +    K P+   YK +LKSGP+
Sbjct: 2    LLKNLMEDKQLDFNRPLLSVRRFTSQVAGSESEGNKKTDNSLNKVPNPPVYKSELKSGPL 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PG VPF WE+ PGRPK+ S                 PGR +  KQ S  K      S 
Sbjct: 62   RNPGTVPFVWEKTPGRPKEESNIQTDALDRPPIAPKLPPGRALRDKQQSPRK-----GSD 116

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIG-NATPLERPKGDAREEHSANVXXXXXXXXXXXXXDT 2169
            A T F P Q + +   S  V S+  N T  E   G+  E  S+               DT
Sbjct: 117  AKT-FAPYQTEMVASSSQNVPSLALNETTYECANGEMEETGSSG--SKDSGEAYVDALDT 173

Query: 2168 LSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYAS 2001
            LSR+ESFF+NCS+SG+SG DG    PS + S D Q RDFMM RFLPAA A+ASE P YA+
Sbjct: 174  LSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAAKAVASETPPYAT 233

Query: 2000 RRQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXXXD 1821
            ++Q +AREP R++         +    +KQ   Y S P K  H Q               
Sbjct: 234  KKQPIAREPPRQIK--------KLVIADKQQPLYASSPNKFTHAQ--DDWSEESEDDCYS 283

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSE 1641
            ++ N +   CG+ P+F LKNS CLLNP+PG+K Q      S +     H +   S +L  
Sbjct: 284  DSQNFSVNVCGLFPQFLLKNSLCLLNPIPGVKAQ-----KSAQTAYSDHRREAKSSYLRS 338

Query: 1640 PDPEQT--WEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTG 1467
             +  +T   EA  K +L    +     ++ +   S S++ +  SD + PEG+S +RH  G
Sbjct: 339  CNETETEHSEAAGKKRLTGIAQTEEAIEDKNNLKSGSSKKSYRSDCRNPEGASLFRHFQG 398

Query: 1466 DGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXX 1287
            + +S Y ++      H+   FLG+P +  N R      +     + +  L          
Sbjct: 399  NNVSSYPSQISWP-GHQEKRFLGIPDKAKNYRVSSFDPHKPGSKNLQECLASECISQESG 457

Query: 1286 XXSPAIEKTLYVDSV 1242
              SP +EKTLYVDSV
Sbjct: 458  SASP-VEKTLYVDSV 471


>ref|XP_008229123.1| PREDICTED: uncharacterized protein LOC103328503 isoform X2 [Prunus
            mume]
          Length = 739

 Score =  250 bits (639), Expect = 5e-63
 Identities = 191/495 (38%), Positives = 240/495 (48%), Gaps = 8/495 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFASTA-GSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            MLKNLMEEKQLNFN PLLSVRRF++T   S  ++ R+ EK  PK P L  YK +LKSGP+
Sbjct: 2    MLKNLMEEKQLNFNQPLLSVRRFSATVVSSEADEKRKTEKSLPKLPPLPVYKSELKSGPV 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PG VPF WEQIPGRPKD      R            PGR+  VK     K   D  SK
Sbjct: 62   RNPGTVPFVWEQIPGRPKDERKSPNRALEWLPTAPKLPPGRVSKVK-----KQATDKGSK 116

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTL 2166
              T  + P   N+  +S  VS++      E  K D+ +    +              D L
Sbjct: 117  CTTAAQSP-TGNVPSNSQNVSTLDTK---EATKYDSSKVEMEDKGIAGSDDGDETYLDAL 172

Query: 2165 SRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASR 1998
            SR+ESFFMNCSVSGLSG DG    PS + S D Q RDFMM RFLPAA AMASE PQYASR
Sbjct: 173  SRSESFFMNCSVSGLSGLDGLDIKPSGTFSTDPQTRDFMMGRFLPAAKAMASETPQYASR 232

Query: 1997 RQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPH-VQXXXXXXXXXXXXXXD 1821
            +Q VARE  + +         +   G+KQ    Q RP   PH VQ               
Sbjct: 233  KQPVARE--QPLLQEQPSGMKKVVSGDKQHPLNQHRPKDLPHYVQDIAGDK--------- 281

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSE 1641
                                     N   GM++Q+++P+SSVRR      K++ +    E
Sbjct: 282  -------------------------NEDEGMRVQAQLPISSVRR---VRAKSSYAISYRE 313

Query: 1640 PDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDG 1461
               E +     + +L+ G       ++ +    ESNQ+T   D Q  +GS  YR   G G
Sbjct: 314  AKKEHSGGDSCEKRLMSGHPEARVPEDKNDLIHESNQITNRIDCQKLDGSPMYRRLQGSG 373

Query: 1460 ISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRC--NDDRGMLXXXXXXXXXX 1287
            ISPYRNE  Q   HE   FLG+P++  N R         +C  N    +           
Sbjct: 374  ISPYRNECAQ---HEQKCFLGIPEKAKNYREAISSGKYRKCHNNFQELLAAENVAELEMG 430

Query: 1286 XXSPAIEKTLYVDSV 1242
              SP +EKTLY+DSV
Sbjct: 431  PGSPVVEKTLYIDSV 445


>ref|XP_008229122.1| PREDICTED: uncharacterized protein LOC103328503 isoform X1 [Prunus
            mume]
          Length = 740

 Score =  250 bits (639), Expect = 5e-63
 Identities = 191/495 (38%), Positives = 240/495 (48%), Gaps = 8/495 (1%)
 Frame = -1

Query: 2702 MLKNLMEEKQLNFNVPLLSVRRFASTA-GSNTEDNRRVEKFQPKRPSLGSYKPDLKSGPM 2526
            MLKNLMEEKQLNFN PLLSVRRF++T   S  ++ R+ EK  PK P L  YK +LKSGP+
Sbjct: 2    MLKNLMEEKQLNFNQPLLSVRRFSATVVSSEADEKRKTEKSLPKLPPLPVYKSELKSGPV 61

Query: 2525 RHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGNSK 2346
            R+PG VPF WEQIPGRPKD      R            PGR+  VK     K   D  SK
Sbjct: 62   RNPGTVPFVWEQIPGRPKDERKSPNRALEWLPTAPKLPPGRVSKVK-----KQATDKGSK 116

Query: 2345 ALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXDTL 2166
              T  + P   N+  +S  VS++      E  K D+ +    +              D L
Sbjct: 117  CTTAAQSP-TGNVPSNSQNVSTLDTK---EATKYDSSKVEMEDKGIAGSDDGDETYLDAL 172

Query: 2165 SRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYASR 1998
            SR+ESFFMNCSVSGLSG DG    PS + S D Q RDFMM RFLPAA AMASE PQYASR
Sbjct: 173  SRSESFFMNCSVSGLSGLDGLDIKPSGTFSTDPQTRDFMMGRFLPAAKAMASETPQYASR 232

Query: 1997 RQLVAREPVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPH-VQXXXXXXXXXXXXXXD 1821
            +Q VARE  + +         +   G+KQ    Q RP   PH VQ               
Sbjct: 233  KQPVARE--QPLLQEQPSGMKKVVSGDKQHPLNQHRPKDLPHYVQDIAGDK--------- 281

Query: 1820 NTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHLSE 1641
                                     N   GM++Q+++P+SSVRR      K++ +    E
Sbjct: 282  -------------------------NEDEGMRVQAQLPISSVRR---VRAKSSYAISYRE 313

Query: 1640 PDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTGDG 1461
               E +     + +L+ G       ++ +    ESNQ+T   D Q  +GS  YR   G G
Sbjct: 314  AKKEHSGGDSCEKRLMSGHPEARVPEDKNDLIHESNQITNRIDCQKLDGSPMYRRLQGSG 373

Query: 1460 ISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRC--NDDRGMLXXXXXXXXXX 1287
            ISPYRNE  Q   HE   FLG+P++  N R         +C  N    +           
Sbjct: 374  ISPYRNECAQ---HEQKCFLGIPEKAKNYREAISSGKYRKCHNNFQELLAAENVAELEMG 430

Query: 1286 XXSPAIEKTLYVDSV 1242
              SP +EKTLY+DSV
Sbjct: 431  PGSPVVEKTLYIDSV 445


>ref|XP_002533963.1| conserved hypothetical protein [Ricinus communis]
            gi|223526060|gb|EEF28419.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 612

 Score =  244 bits (622), Expect = 5e-61
 Identities = 181/495 (36%), Positives = 244/495 (49%), Gaps = 13/495 (2%)
 Frame = -1

Query: 2687 MEEKQLNFNVPLLSVRRF-------ASTAGSNTEDNRRVEKFQP-KRPSLGSYKPDLKSG 2532
            MEE++LNFN+PLLSVRR        A T  S+ E  ++ + F P +R +L S KP     
Sbjct: 1    MEERKLNFNIPLLSVRRSSTPTRSSAPTKSSSGEKGKKNDNFHPDRRRTLPSCKPAYILD 60

Query: 2531 PMRHPGAVPFEWEQIPGRPKDGSGQHPRNXXXXXXXXXXXPGRIVDVKQPSSAKSVEDGN 2352
             +  P AVPF+WEQIPGRPKDG+   P+            P R++DV +    K      
Sbjct: 61   QVTEPVAVPFQWEQIPGRPKDGAVPDPQGHEEVSVTPRIPPRRVLDVVKHIDNK------ 114

Query: 2351 SKALTVFKPPQNDNLTHHSLVVSSIGNATPLERPKGDAREEHSANVXXXXXXXXXXXXXD 2172
                   KP   D LT      S       L+  K    E+    +             D
Sbjct: 115  -------KPEDQDALTPQIEAKSFTNIVGRLDCSKEGVDEKAIIILENDDDEDVYSDALD 167

Query: 2171 TLSRTESFFMNCSVSGLSGFDG----PSRSTSMDAQARDFMMDRFLPAATAMASEAPQYA 2004
            TLS T+SF +NCS+SG+SGFD     PS + S+D QA+DFMM RFLPAA AM  E PQYA
Sbjct: 168  TLSPTDSFSVNCSLSGVSGFDNLAVKPSGTFSIDQQAQDFMMSRFLPAAKAMTLEPPQYA 227

Query: 2003 SRRQLVARE-PVRKVNIVDQYRRPRPFGGNKQPMRYQSRPYKAPHVQXXXXXXXXXXXXX 1827
            SR+Q V+ E P +    V++ R P        P+         P+ Q             
Sbjct: 228  SRKQPVSGEQPRQTTKAVNRDRTP--------PVIRNRSCNIPPYHQDKEDEESEDECDD 279

Query: 1826 XDNTGNLTAKACGMLPRFCLKNSFCLLNPVPGMKIQSRMPLSSVRRKVPTHIKTTGSEHL 1647
              ++GN+TAK CG LPR C+KNS CLLNPVPGMKI+++  +SS  + +    K   S   
Sbjct: 280  YSDSGNITAKGCGFLPRLCIKNSLCLLNPVPGMKIRTQTSMSST-KDIKKLTKAVFSRSQ 338

Query: 1646 SEPDPEQTWEAVYKHKLLCGLEPPGTYDNGSKPTSESNQLTLWSDSQTPEGSSPYRHSTG 1467
            S    +    AV K K    +  P      +K T  SN+ T  +D Q    +SP+R S  
Sbjct: 339  SPTVKKPARNAVSKQKQDSEVPSPRMVGVENKLTGGSNRFTYATDRQMISRTSPFRRS-- 396

Query: 1466 DGISPYRNEAPQSLFHEGIGFLGVPKQGNNRRTDGLVTYGNRCNDDRGMLXXXXXXXXXX 1287
              ISP+RNEAPQS F  G G  G+PKQ  N +++   ++    +  + ++          
Sbjct: 397  GCISPHRNEAPQSPF-RGRGSQGIPKQLENLKSNQFNSFNRGYSKSQELVSYNGIRRGSR 455

Query: 1286 XXSPAIEKTLYVDSV 1242
              SP +EKTLYVD+V
Sbjct: 456  PASPTVEKTLYVDTV 470

