BLASTX nr result

ID: Cinnamomum23_contig00018313 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00018313
         (1251 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010269520.1| PREDICTED: uncharacterized protein LOC104606...   108   8e-21
ref|XP_010649723.1| PREDICTED: uncharacterized protein LOC100244...    83   5e-13
ref|XP_010649722.1| PREDICTED: uncharacterized protein LOC100244...    83   5e-13
emb|CBI26057.3| unnamed protein product [Vitis vinifera]               83   5e-13
ref|XP_001770705.1| predicted protein [Physcomitrella patens] gi...    74   2e-10
ref|XP_009766253.1| PREDICTED: uncharacterized protein LOC104217...    73   4e-10
ref|XP_009766252.1| PREDICTED: uncharacterized protein LOC104217...    73   4e-10
ref|XP_010042399.1| PREDICTED: atherin-like, partial [Eucalyptus...    73   5e-10
ref|XP_010037544.1| PREDICTED: uncharacterized protein LOC104426...    71   1e-09
ref|XP_008458451.1| PREDICTED: uncharacterized protein LOC103497...    70   3e-09
ref|XP_010098273.1| hypothetical protein L484_023519 [Morus nota...    70   3e-09
gb|KHN03357.1| hypothetical protein glysoja_004383 [Glycine soja]      66   5e-08
ref|XP_003550288.2| PREDICTED: uncharacterized protein LOC100786...    66   5e-08
ref|XP_011658433.1| PREDICTED: uncharacterized protein LOC105436...    65   8e-08
gb|KGN47368.1| hypothetical protein Csa_6G306300 [Cucumis sativus]     65   8e-08
ref|XP_007035593.1| Uncharacterized protein isoform 5 [Theobroma...    65   1e-07
ref|XP_007035591.1| Uncharacterized protein isoform 3 [Theobroma...    65   1e-07
ref|XP_007035590.1| Uncharacterized protein isoform 2 [Theobroma...    65   1e-07
ref|XP_007035589.1| Uncharacterized protein isoform 1 [Theobroma...    65   1e-07
ref|XP_006840533.2| PREDICTED: uncharacterized protein LOC184303...    64   2e-07
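
Note: the summary table above can be read programmatically. The following is a minimal sketch, assuming each row ends with a bit score followed by an E-value written with a leading digit (as every row in this report is). The names `ROW` and `parse_summary_row` are hypothetical helpers introduced for illustration and are not part of BLAST.

```python
import re

# One row of the "Sequences producing significant alignments" table:
# identifier, (possibly truncated) description, bit score, E-value.
# Assumption: the E-value starts with a digit, e.g. "8e-21" or "0.0".
ROW = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s*$"
)

def parse_summary_row(line):
    """Return a dict for a hit row, or None if the line is not a hit row."""
    m = ROW.match(line)
    if m is None:
        return None
    seq_id, description, bits, evalue = m.groups()
    return {
        "id": seq_id,
        "description": description,
        "bits": float(bits),
        "evalue": float(evalue),
    }

if __name__ == "__main__":
    row = "emb|CBI26057.3| unnamed protein product [Vitis vinifera]               83   5e-13"
    print(parse_summary_row(row))
```

Descriptions that end in "..." are truncated in the table itself; the full definition line appears in the per-hit records below.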

>ref|XP_010269520.1| PREDICTED: uncharacterized protein LOC104606151 [Nelumbo nucifera]
            gi|720043315|ref|XP_010269521.1| PREDICTED:
            uncharacterized protein LOC104606151 [Nelumbo nucifera]
          Length = 694

 Score =  108 bits (270), Expect = 8e-21
 Identities = 88/328 (26%), Positives = 140/328 (42%), Gaps = 65/328 (19%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADIDEISGDSVE------------------- 1126
            S  S + L F ++G+FV QTIC +W+LG       + +  E                   
Sbjct: 166  SSRSVLELAFYVVGLFVLQTICAVWVLGPTSFGRETTNGEEEAEAARLGVSESEVRNGKR 225

Query: 1125 MEGGEKNXXXXXXXXXXXXE-------------KRVSEIQILAREARASEQRKARDEASA 985
            MEG   N            +             +++ EI+ +A+EAR SE R+ R    A
Sbjct: 226  MEGYSLNRNNGVLENLFGSKPSSVIHIDKSQILEKIVEIRAMAKEARESEARELRASGLA 285

Query: 984  SSSTAVGNDGDDFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSP----------- 838
            SSS  +G+  D    ETASS   T+++KEVD R+  LQK + + R +SP           
Sbjct: 286  SSSDVIGDGADSVPAETASSTVKTNIQKEVDGRIFKLQKRLHSLRENSPLASFSYLTSSD 345

Query: 837  --------RRSNSGTKGANLALRKRQAVRVSLMKGRNDPKGFQSSKSNKNLPVG------ 700
                      SN+  K  N   +K++  R S     N+P+GFQ  K N ++P G      
Sbjct: 346  KVKDMTNSEASNANKKSGNFVFKKKRRFRSSPSSPGNNPQGFQGPKDN-SVPTGIGRKVA 404

Query: 699  -----GESGLLDNGEQRADLSDGDDMEEVDSHL--LNDEVLKRIMLKFQANEEGGREPLN 541
                   +  L  GEQ+ D+++G   E    ++  ++   L   + + + N+E  REP N
Sbjct: 405  TIDPLSNALDLSGGEQKIDITNGASQESTSMNIEKMHYNALGETIHEVKENKEARREPFN 464

Query: 540  GLGSE-EEMNFFLALQRKFEKEGMDNAK 460
               S  + +   L   R+    G++N+K
Sbjct: 465  NESSSMQSVRKKLENSRREMAMGIENSK 492
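
Note: the figures in the record above can be cross-checked with a short script. This is a sketch under two assumptions drawn from this report only: the printed percentages match integer (truncating) division of the counts, and each translated query line spans three nucleotides per aligned residue. `parse_hsp_stats` and `query_line_is_consistent` are hypothetical helper names.

```python
import re

# "Identities = 88/328 (26%), Positives = 140/328 (42%), Gaps = 65/328 (19%)"
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")
# "Query: 1248 SPGSF...----- 1126" (start coordinate, aligned text, end coordinate)
QUERY = re.compile(r"^Query:\s+(\d+)\s+(\S+)\s+(\d+)\s*$")

def parse_hsp_stats(line):
    """Map each statistic name to (count, alignment length, printed percent)."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STATS.findall(line)
    }

def query_line_is_consistent(line):
    """Check that the nucleotide span equals 3 x the number of non-gap residues."""
    m = QUERY.match(line)
    if m is None:
        return False
    start, aligned, end = int(m.group(1)), m.group(2), int(m.group(3))
    residues = len(aligned.replace("-", ""))
    return abs(start - end) + 1 == 3 * residues

if __name__ == "__main__":
    stats = parse_hsp_stats(
        " Identities = 88/328 (26%), Positives = 140/328 (42%), Gaps = 65/328 (19%)"
    )
    for name, (num, den, pct) in stats.items():
        # Compare the printed percentage with truncating integer division.
        print(name, num, den, pct, 100 * num // den == pct)
```

With Frame = -1 the query coordinates decrease along the alignment (1248 down to 460 here), which is why the span check uses the absolute difference.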


>ref|XP_010649723.1| PREDICTED: uncharacterized protein LOC100244229 isoform X2 [Vitis
            vinifera]
          Length = 605

 Score = 82.8 bits (203), Expect = 5e-13
 Identities = 66/218 (30%), Positives = 96/218 (44%), Gaps = 46/218 (21%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADID---EISGDSV---EMEGGEKN------ 1105
            S GS + L  C++G+FVFQTIC +W+LGSAD D   EIS       ++   E+N      
Sbjct: 131  STGSLLKLGLCLVGIFVFQTICAVWVLGSADSDQEHEISDSEAKGSQLGANERNKGKFLL 190

Query: 1104 --------------XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASAS-SSTA 970
                                      E+++ EI+ +A+EAR SE +K ++    S    A
Sbjct: 191  NFGGKFFGEKIGNKSSHAVYLNESELEEKIVEIRAMAKEARESEGKKLKNNGMNSYLEEA 250

Query: 969  VGNDGDDFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSP---------------- 838
             G D D+ V+ +  S     +++EVD RLL LQK +  +R  SP                
Sbjct: 251  GGGDADEDVISSIRS----GIQEEVDTRLLKLQKRLNATREKSPLPLVSHLNKFGKVENR 306

Query: 837  ---RRSNSGTKGANLALRKRQAVRVSLMKGRNDPKGFQ 733
                 S+       L  +K+   R +    RNDPKGFQ
Sbjct: 307  VNGDHSDVAELNRTLMFKKKMKFRNASSMPRNDPKGFQ 344


>ref|XP_010649722.1| PREDICTED: uncharacterized protein LOC100244229 isoform X1 [Vitis
            vinifera]
          Length = 620

 Score = 82.8 bits (203), Expect = 5e-13
 Identities = 66/218 (30%), Positives = 96/218 (44%), Gaps = 46/218 (21%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADID---EISGDSV---EMEGGEKN------ 1105
            S GS + L  C++G+FVFQTIC +W+LGSAD D   EIS       ++   E+N      
Sbjct: 131  STGSLLKLGLCLVGIFVFQTICAVWVLGSADSDQEHEISDSEAKGSQLGANERNKGKFLL 190

Query: 1104 --------------XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASAS-SSTA 970
                                      E+++ EI+ +A+EAR SE +K ++    S    A
Sbjct: 191  NFGGKFFGEKIGNKSSHAVYLNESELEEKIVEIRAMAKEARESEGKKLKNNGMNSYLEEA 250

Query: 969  VGNDGDDFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSP---------------- 838
             G D D+ V+ +  S     +++EVD RLL LQK +  +R  SP                
Sbjct: 251  GGGDADEDVISSIRS----GIQEEVDTRLLKLQKRLNATREKSPLPLVSHLNKFGKVENR 306

Query: 837  ---RRSNSGTKGANLALRKRQAVRVSLMKGRNDPKGFQ 733
                 S+       L  +K+   R +    RNDPKGFQ
Sbjct: 307  VNGDHSDVAELNRTLMFKKKMKFRNASSMPRNDPKGFQ 344


>emb|CBI26057.3| unnamed protein product [Vitis vinifera]
          Length = 637

 Score = 82.8 bits (203), Expect = 5e-13
 Identities = 66/218 (30%), Positives = 96/218 (44%), Gaps = 46/218 (21%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADID---EISGDSV---EMEGGEKN------ 1105
            S GS + L  C++G+FVFQTIC +W+LGSAD D   EIS       ++   E+N      
Sbjct: 163  STGSLLKLGLCLVGIFVFQTICAVWVLGSADSDQEHEISDSEAKGSQLGANERNKGKFLL 222

Query: 1104 --------------XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASAS-SSTA 970
                                      E+++ EI+ +A+EAR SE +K ++    S    A
Sbjct: 223  NFGGKFFGEKIGNKSSHAVYLNESELEEKIVEIRAMAKEARESEGKKLKNNGMNSYLEEA 282

Query: 969  VGNDGDDFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSP---------------- 838
             G D D+ V+ +  S     +++EVD RLL LQK +  +R  SP                
Sbjct: 283  GGGDADEDVISSIRS----GIQEEVDTRLLKLQKRLNATREKSPLPLVSHLNKFGKVENR 338

Query: 837  ---RRSNSGTKGANLALRKRQAVRVSLMKGRNDPKGFQ 733
                 S+       L  +K+   R +    RNDPKGFQ
Sbjct: 339  VNGDHSDVAELNRTLMFKKKMKFRNASSMPRNDPKGFQ 376


>ref|XP_001770705.1| predicted protein [Physcomitrella patens]
           gi|162678066|gb|EDQ64529.1| predicted protein
           [Physcomitrella patens]
          Length = 1011

 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 43/111 (38%), Positives = 64/111 (57%), Gaps = 1/111 (0%)
 Frame = -1

Query: 756 RNDPKGFQSSKSNKNLPVGGESGLLDNG-EQRADLSDGDDMEEVDSHLLNDEVLKRIMLK 580
           ++  KG QSS      P  G  G L +   Q    +   + ++ +   + DEVL+RI+LK
Sbjct: 521 KSKDKGSQSSG-----PTDGNRGSLKSSLSQEESPNPKSEEQQEEEGWMRDEVLRRIVLK 575

Query: 579 FQANEEGGREPLNGLGSEEEMNFFLALQRKFEKEGMDNAKQWMEKRMEGID 427
            + NEE GR+  +GL SEEE  FF  L+RKFE+EG +  K W++ R+E +D
Sbjct: 576 VRDNEEAGRDSFHGLNSEEEQLFFKGLERKFEREG-EAVKTWIQDRVENLD 625


>ref|XP_009766253.1| PREDICTED: uncharacterized protein LOC104217651 isoform X2 [Nicotiana
            sylvestris]
          Length = 586

 Score = 73.2 bits (178), Expect = 4e-10
 Identities = 71/259 (27%), Positives = 107/259 (41%), Gaps = 52/259 (20%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEISGDS---------VEMEGGEKN------ 1105
            S +     ++G FVFQT+C +W+LGSAD    +G S         +++EG  K+      
Sbjct: 167  SLLKFGLWVVGAFVFQTVCAVWVLGSADYSGNNGTSDRNGYKNEVLDLEGKSKHKLRMFV 226

Query: 1104 ---------XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGD 952
                                 EK++ EI+ +AREAR     K R E   S+      DG+
Sbjct: 227  NGDGKQNGENGGIVFVDETEMEKKIKEIRHMAREAR----EKERLETKGSNVDEESEDGE 282

Query: 951  DFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSPRRS----------NSGTKGAN- 805
            D  V+         ++KEVD RL+ ++K ++      P  S          N    G N 
Sbjct: 283  DSDVKIG-------IKKEVDERLIKMRKRLEKGSDKQPANSVNYPRVDVDKNGRKHGVNL 335

Query: 804  --------LALRKRQAVRVSLMKGRNDPKGF-----QSSKSNKNLPVGGESGLL--DNGE 670
                    L  +++  +R    K  N PKGF      S+ +N    V G + +L   NGE
Sbjct: 336  DEKELNAALIFKRKHKIRDFGSKPSNKPKGFVLPEHPSAGTNGEKTVEGNTEVLKNGNGE 395

Query: 669  QRADLSDGD--DMEEVDSH 619
               D+S  D  D+  +DSH
Sbjct: 396  GGVDVSGDDEVDLFTLDSH 414


>ref|XP_009766252.1| PREDICTED: uncharacterized protein LOC104217651 isoform X1 [Nicotiana
            sylvestris]
          Length = 708

 Score = 73.2 bits (178), Expect = 4e-10
 Identities = 71/259 (27%), Positives = 107/259 (41%), Gaps = 52/259 (20%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEISGDS---------VEMEGGEKN------ 1105
            S +     ++G FVFQT+C +W+LGSAD    +G S         +++EG  K+      
Sbjct: 167  SLLKFGLWVVGAFVFQTVCAVWVLGSADYSGNNGTSDRNGYKNEVLDLEGKSKHKLRMFV 226

Query: 1104 ---------XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGD 952
                                 EK++ EI+ +AREAR     K R E   S+      DG+
Sbjct: 227  NGDGKQNGENGGIVFVDETEMEKKIKEIRHMAREAR----EKERLETKGSNVDEESEDGE 282

Query: 951  DFVVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSPRRS----------NSGTKGAN- 805
            D  V+         ++KEVD RL+ ++K ++      P  S          N    G N 
Sbjct: 283  DSDVKIG-------IKKEVDERLIKMRKRLEKGSDKQPANSVNYPRVDVDKNGRKHGVNL 335

Query: 804  --------LALRKRQAVRVSLMKGRNDPKGF-----QSSKSNKNLPVGGESGLL--DNGE 670
                    L  +++  +R    K  N PKGF      S+ +N    V G + +L   NGE
Sbjct: 336  DEKELNAALIFKRKHKIRDFGSKPSNKPKGFVLPEHPSAGTNGEKTVEGNTEVLKNGNGE 395

Query: 669  QRADLSDGD--DMEEVDSH 619
               D+S  D  D+  +DSH
Sbjct: 396  GGVDVSGDDEVDLFTLDSH 414


>ref|XP_010042399.1| PREDICTED: atherin-like, partial [Eucalyptus grandis]
          Length = 402

 Score = 72.8 bits (177), Expect = 5e-10
 Identities = 74/230 (32%), Positives = 106/230 (46%), Gaps = 23/230 (10%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADIDEISGDSVE--MEGGE----------KN 1105
            S GS I     +  +F+FQTIC +W+ GS   D  +G+S E  +  GE            
Sbjct: 123  SAGSVIKFGAYLAVVFLFQTICAVWVFGSNGSDS-NGESEERNLVNGEGRAVAGKKLLSR 181

Query: 1104 XXXXXXXXXXXXEKRVSEIQILAREARASEQR-KARDEASASSSTAVGNDGDDFVVETAS 928
                        E+R+ EI  +AREAR SE+R   RD+A        G+DGDD     A 
Sbjct: 182  TRNVGYLDESDLERRIEEISSMAREARRSEKRGLKRDDAGDGE----GDDGDDDDDGEAI 237

Query: 927  SRHATDVRKEVDRRLLNLQKSIQTSRGDSPRRSNSGTKGA-NLALRKRQAVRVSLM-KGR 754
                  + KE+  RL  LQK ++  R       N   +GA +L  +K+   R  L+ K  
Sbjct: 238  PDSRMAIEKEIGSRLDKLQKKLRPVRKSPGLSENVPLEGASSLMFKKKLKFRSPLVEKPS 297

Query: 753  NDPKGFQ-------SSKSNKNLPVG-GESGLLDNGEQRADLSDGDDMEEV 628
              PKGFQ       + K +KN   G G++G LD+G    + S+G+  EE+
Sbjct: 298  TAPKGFQRLQDNGKTKKKSKNEVNGEGKNGGLDHGV--TESSNGEKWEEL 345


>ref|XP_010037544.1| PREDICTED: uncharacterized protein LOC104426246 [Eucalyptus grandis]
          Length = 601

 Score = 71.2 bits (173), Expect = 1e-09
 Identities = 72/229 (31%), Positives = 106/229 (46%), Gaps = 22/229 (9%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADIDEISGDSVEM-----EG----GEK---N 1105
            S  S I     ++ +F+FQTIC +W+ GS   D  +G+S E      EG    G+K    
Sbjct: 121  SAKSVIKFGAYLVAVFLFQTICAVWVFGSNGSDS-NGESEERISVNGEGRAVSGKKLLSR 179

Query: 1104 XXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASS 925
                        E+R+ EI  +AREAR SE+R  + + +       GND DD     A  
Sbjct: 180  TRNVVYLDESDLERRIEEISSMAREARRSEKRGLKRDDAGDGEGDDGNDDDD---GEAIP 236

Query: 924  RHATDVRKEVDRRLLNLQKSIQTSRGDSPRRSNSGTKGA-NLALRKRQAVRVSLM-KGRN 751
                 + KE+  RL  LQK ++  R       N   +GA +L  +K+   R  L+ K   
Sbjct: 237  DSRVAIEKEIGSRLDKLQKKLRPVRKSPGLSENVPLEGASSLMFKKKLKFRSPLVEKPST 296

Query: 750  DPKGFQ-------SSKSNKNLPVG-GESGLLDNGEQRADLSDGDDMEEV 628
             PKGFQ       + K +KN   G G++G LD+G    + S+G+  EE+
Sbjct: 297  VPKGFQRLQDNGKTKKKSKNEVNGEGKNGGLDHGV--TESSNGEKWEEL 343


>ref|XP_008458451.1| PREDICTED: uncharacterized protein LOC103497853 [Cucumis melo]
          Length = 599

 Score = 70.5 bits (171), Expect = 3e-09
 Identities = 68/262 (25%), Positives = 104/262 (39%), Gaps = 49/262 (18%)
 Frame = -1

Query: 1242 GSFINLVFCILGMFVFQTICTIWIL--GSADIDEI-----------SGDSVEMEGGEK-- 1108
            GSF+ L    L +F FQTICT+W+L  GS+  ++            SG  V + G E+  
Sbjct: 129  GSFVKLGVYFLAVFAFQTICTVWVLEYGSSSKEDTSSNEDLSVRRNSGREVLLNGNERIG 188

Query: 1107 ------NXXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDF 946
                                +++ EI+ +AR AR  E+ K  D+             DD 
Sbjct: 189  LGNVGSKRNKLVYLEETKMREKIEEIRSMARAARIEEKNKRSDDFGE----------DDM 238

Query: 945  VVETASSRHATDVRKEVDRRLLNLQKSIQTSRGDSPRRS-------------------NS 823
                A SR   D+ KEVD RL+ L+K + +S+   P  S                   N 
Sbjct: 239  EGGNAISRARIDIEKEVDARLVKLEKRLNSSKEKIPGSSMNYLLKSENVEDAVERNSFNG 298

Query: 822  GTKGANLALRKRQAVRVSLMKGRNDPKGFQSSKSN--------KNLPVGGESGLLDN-GE 670
              +  +L  +K+   R S       PKGFQ   SN        K   VGG + ++D  G 
Sbjct: 299  EERDKSLMFKKKMRYRNSSSHRIKKPKGFQGFVSNGKKSGSNGKGTTVGGANFVVDKMGV 358

Query: 669  QRADLSDGDDMEEVDSHLLNDE 604
            +  +   G+ + +  S +  D+
Sbjct: 359  KDTEKRVGNKIMDSVSEMFEDD 380


>ref|XP_010098273.1| hypothetical protein L484_023519 [Morus notabilis]
            gi|587885939|gb|EXB74777.1| hypothetical protein
            L484_023519 [Morus notabilis]
          Length = 559

 Score = 70.1 bits (170), Expect = 3e-09
 Identities = 52/198 (26%), Positives = 88/198 (44%), Gaps = 23/198 (11%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEISGDSVEMEGGE--KNXXXXXXXXXXXXE 1066
            SF+     ++G+FVFQTI ++W+LG+A+ +E  GD   ++ G+   N            E
Sbjct: 109  SFVRYGVYLIGVFVFQTILSVWVLGTANSEEKDGDFDSLDNGKVLLNGNEKILRSNVELE 168

Query: 1065 KRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATDVRKEVDRR 886
            +++ +I+ +AR+AR  E+ K     S  S T +G                  + KE+++R
Sbjct: 169  EKIEKIRAMARKARKVEKNKGE---SLKSGTKIG------------------IEKEIEKR 207

Query: 885  LLNLQKSIQTSRGDSPRR---------------------SNSGTKGANLALRKRQAVRVS 769
            LL LQK + ++R   PR                       + G +   L  +K+   R  
Sbjct: 208  LLKLQKGLNSTREKLPRSYVNYLSKYGKVEDEVTKKKAGLDVGKENETLMFKKKLKFRSP 267

Query: 768  LMKGRNDPKGFQSSKSNK 715
            L +    PKGF  S+ +K
Sbjct: 268  LTEPSKGPKGFGDSEKHK 285


>gb|KHN03357.1| hypothetical protein glysoja_004383 [Glycine soja]
          Length = 536

 Score = 66.2 bits (160), Expect = 5e-08
 Identities = 68/250 (27%), Positives = 101/250 (40%), Gaps = 30/250 (12%)
 Frame = -1

Query: 1212 LGMFVFQTICTIWILGSADIDEISGDSVEMEGGEKN--------XXXXXXXXXXXXEKRV 1057
            LG FV QTI T+WI+G    ++   D +E++  EK                     +K++
Sbjct: 123  LGFFVLQTIYTVWIVGIFKFNQKDRD-LEIDRDEKTVSWPVHGASNVFLSEEQVLMDKKI 181

Query: 1056 SEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATDVRKEVDRRLLN 877
             EI+++AREAR  E  K + +        V +D DD   E A S +   + KE+  RLL 
Sbjct: 182  EEIKLMAREARRIESEK-KGKEDEDEDFEVDDDDDD---EGAVSSNRLGIEKEIGERLLK 237

Query: 876  LQKSIQTSRGDSP------RRSNSGT---KGAN----------LALRKRQAVRVSLMKGR 754
            +Q  I  S  D         R NS     +G N          L  +K+   +    K  
Sbjct: 238  VQNRINGSAKDISAALQINSRGNSAAGVDRGVNKNVKNKGNEALVFKKKFKFKSPSTKDT 297

Query: 753  NDPKGFQSS---KSNKNLPVGGESGLLDNGEQRADLSDGDDMEEVDSHLLNDEVLKRIML 583
              PKGF  +   K +K    G ES          D+SD   M   D      E++ +  +
Sbjct: 298  KTPKGFPGNRDWKESKATKRGSESKKAAQDHGSDDVSDHAQMLHEDKRANQPELVTQKSV 357

Query: 582  KFQANEEGGR 553
                +EEGG+
Sbjct: 358  SSVPSEEGGK 367


>ref|XP_003550288.2| PREDICTED: uncharacterized protein LOC100786970 [Glycine max]
          Length = 537

 Score = 66.2 bits (160), Expect = 5e-08
 Identities = 68/250 (27%), Positives = 101/250 (40%), Gaps = 30/250 (12%)
 Frame = -1

Query: 1212 LGMFVFQTICTIWILGSADIDEISGDSVEMEGGEKN--------XXXXXXXXXXXXEKRV 1057
            LG FV QTI T+WI+G    ++   D +E++  EK                     +K++
Sbjct: 124  LGFFVLQTIYTVWIVGIFKFNQKDRD-LEIDRDEKTVSWPVHGASNVFLSEEQVLMDKKI 182

Query: 1056 SEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATDVRKEVDRRLLN 877
             EI+++AREAR  E  K + +        V +D DD   E A S +   + KE+  RLL 
Sbjct: 183  EEIKLMAREARRIESEK-KGKEDEDEDFEVDDDDDD---EGAVSSNRLGIEKEIGERLLK 238

Query: 876  LQKSIQTSRGDSP------RRSNSGT---KGAN----------LALRKRQAVRVSLMKGR 754
            +Q  I  S  D         R NS     +G N          L  +K+   +    K  
Sbjct: 239  VQNRINGSAKDISAALQINSRGNSAAGVDRGVNKNVKNKGNEALVFKKKFKFKSPSTKDT 298

Query: 753  NDPKGFQSS---KSNKNLPVGGESGLLDNGEQRADLSDGDDMEEVDSHLLNDEVLKRIML 583
              PKGF  +   K +K    G ES          D+SD   M   D      E++ +  +
Sbjct: 299  KTPKGFPGNRDWKESKATKRGSESKKAAQDHGSDDVSDHAQMLHEDKRANQPELVTQKSV 358

Query: 582  KFQANEEGGR 553
                +EEGG+
Sbjct: 359  SSVPSEEGGK 368


>ref|XP_011658433.1| PREDICTED: uncharacterized protein LOC105436003 [Cucumis sativus]
          Length = 650

 Score = 65.5 bits (158), Expect = 8e-08
 Identities = 63/254 (24%), Positives = 101/254 (39%), Gaps = 40/254 (15%)
 Frame = -1

Query: 1242 GSFINLVFCILGMFVFQTICTIWIL--GSADIDEISGD---SVEMEGGEK---------- 1108
            GSF+ L   +L +F FQTICT+W+L  GS+  ++ S +   SV  +GG +          
Sbjct: 182  GSFVKLGVYLLAVFAFQTICTVWVLEYGSSIKEDKSSNEDLSVRRKGGREVLLNGNEGNV 241

Query: 1107 ------NXXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDF 946
                                +++ EI+++AR AR  E+ K  D+           + DD 
Sbjct: 242  LGNFGSKRNKSVYLEETKMREKIEEIRLMARAARIEEKNKMSDDF----------EDDDM 291

Query: 945  VVETASSRHATDVRKEVDRRLLNLQKSIQTSR-----------------GDSPRRS--NS 823
                A SR    + KEVD RL+ L+K + +++                  D+  R+  N 
Sbjct: 292  EGGNAISRARIGIEKEVDARLVKLEKRLNSAKEKISGSSMNYLLKSEHVEDAVERNSFNG 351

Query: 822  GTKGANLALRKRQAVRVSLMKGRNDPKGFQSSKSNKNLPVGGESGLLDNGEQRADLSDGD 643
              +  +L  +K+   R S       P+GFQ   SN       + G    G    D     
Sbjct: 352  EERNESLMYKKKMKYRDSSSHRIKKPEGFQGFVSNGRKSGSNDKGATVEGANIVDKMGVK 411

Query: 642  DMEEVDSHLLNDEV 601
            D E+   + + D V
Sbjct: 412  DTEKRVGNKIMDSV 425


>gb|KGN47368.1| hypothetical protein Csa_6G306300 [Cucumis sativus]
          Length = 597

 Score = 65.5 bits (158), Expect = 8e-08
 Identities = 63/254 (24%), Positives = 101/254 (39%), Gaps = 40/254 (15%)
 Frame = -1

Query: 1242 GSFINLVFCILGMFVFQTICTIWIL--GSADIDEISGD---SVEMEGGEK---------- 1108
            GSF+ L   +L +F FQTICT+W+L  GS+  ++ S +   SV  +GG +          
Sbjct: 129  GSFVKLGVYLLAVFAFQTICTVWVLEYGSSIKEDKSSNEDLSVRRKGGREVLLNGNEGNV 188

Query: 1107 ------NXXXXXXXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDF 946
                                +++ EI+++AR AR  E+ K  D+           + DD 
Sbjct: 189  LGNFGSKRNKSVYLEETKMREKIEEIRLMARAARIEEKNKMSDDF----------EDDDM 238

Query: 945  VVETASSRHATDVRKEVDRRLLNLQKSIQTSR-----------------GDSPRRS--NS 823
                A SR    + KEVD RL+ L+K + +++                  D+  R+  N 
Sbjct: 239  EGGNAISRARIGIEKEVDARLVKLEKRLNSAKEKISGSSMNYLLKSEHVEDAVERNSFNG 298

Query: 822  GTKGANLALRKRQAVRVSLMKGRNDPKGFQSSKSNKNLPVGGESGLLDNGEQRADLSDGD 643
              +  +L  +K+   R S       P+GFQ   SN       + G    G    D     
Sbjct: 299  EERNESLMYKKKMKYRDSSSHRIKKPEGFQGFVSNGRKSGSNDKGATVEGANIVDKMGVK 358

Query: 642  DMEEVDSHLLNDEV 601
            D E+   + + D V
Sbjct: 359  DTEKRVGNKIMDSV 372


>ref|XP_007035593.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508714622|gb|EOY06519.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 371

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 61/243 (25%), Positives = 105/243 (43%), Gaps = 34/243 (13%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEIS----------GDSVEMEGGEKNXXXXX 1090
            S +   F  +G+FVFQT+  +W+ G+ D  +            G  +     E +     
Sbjct: 124  SVLKFGFYFVGIFVFQTLVAVWVTGNGDSQDKDRNFQRKKSWHGKFLNNGKVESSSRNVF 183

Query: 1089 XXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATD 910
                   E++V EI+ +AREAR  E+++ ++          G++  D + E+ +S+    
Sbjct: 184  SWDNSELEEKVKEIRAMAREARKIEEKETKN----------GDEEGDMIAESLNSKARIG 233

Query: 909  VRKEVDRRLLNLQKSIQTSRGDSP---------RRSNSGTK--GANLALRKRQAVRVSLM 763
              KE+  RL  L+K + + R + P          R     K     L ++K+   R S  
Sbjct: 234  FEKEIGARLNKLEKKLNSKRENIPGSYINFLDKLRDGEDAKEMDKKLFIKKKFKFRASEK 293

Query: 762  KGRNDPKGFQSSKS-----NKNLPVGGESGL--LDNGE----QRADL--SDGDDMEEVDS 622
              R+D KGF S K      N+N      SG   ++NG+    Q  D   SDG+++E+++ 
Sbjct: 294  NSRSDVKGFPSLKDCSATRNENGMATSGSGTKEVENGKRVVSQNLDFLPSDGEEIEKIEE 353

Query: 621  HLL 613
              L
Sbjct: 354  EEL 356


>ref|XP_007035591.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508714620|gb|EOY06517.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 447

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 61/243 (25%), Positives = 105/243 (43%), Gaps = 34/243 (13%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEIS----------GDSVEMEGGEKNXXXXX 1090
            S +   F  +G+FVFQT+  +W+ G+ D  +            G  +     E +     
Sbjct: 124  SVLKFGFYFVGIFVFQTLVAVWVTGNGDSQDKDRNFQRKKSWHGKFLNNGKVESSSRNVF 183

Query: 1089 XXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATD 910
                   E++V EI+ +AREAR  E+++ ++          G++  D + E+ +S+    
Sbjct: 184  SWDNSELEEKVKEIRAMAREARKIEEKETKN----------GDEEGDMIAESLNSKARIG 233

Query: 909  VRKEVDRRLLNLQKSIQTSRGDSP---------RRSNSGTK--GANLALRKRQAVRVSLM 763
              KE+  RL  L+K + + R + P          R     K     L ++K+   R S  
Sbjct: 234  FEKEIGARLNKLEKKLNSKRENIPGSYINFLDKLRDGEDAKEMDKKLFIKKKFKFRASEK 293

Query: 762  KGRNDPKGFQSSKS-----NKNLPVGGESGL--LDNGE----QRADL--SDGDDMEEVDS 622
              R+D KGF S K      N+N      SG   ++NG+    Q  D   SDG+++E+++ 
Sbjct: 294  NSRSDVKGFPSLKDCSATRNENGMATSGSGTKEVENGKRVVSQNLDFLPSDGEEIEKIEE 353

Query: 621  HLL 613
              L
Sbjct: 354  EEL 356


>ref|XP_007035590.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590661142|ref|XP_007035592.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508714619|gb|EOY06516.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508714621|gb|EOY06518.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 390

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 61/243 (25%), Positives = 105/243 (43%), Gaps = 34/243 (13%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEIS----------GDSVEMEGGEKNXXXXX 1090
            S +   F  +G+FVFQT+  +W+ G+ D  +            G  +     E +     
Sbjct: 124  SVLKFGFYFVGIFVFQTLVAVWVTGNGDSQDKDRNFQRKKSWHGKFLNNGKVESSSRNVF 183

Query: 1089 XXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATD 910
                   E++V EI+ +AREAR  E+++ ++          G++  D + E+ +S+    
Sbjct: 184  SWDNSELEEKVKEIRAMAREARKIEEKETKN----------GDEEGDMIAESLNSKARIG 233

Query: 909  VRKEVDRRLLNLQKSIQTSRGDSP---------RRSNSGTK--GANLALRKRQAVRVSLM 763
              KE+  RL  L+K + + R + P          R     K     L ++K+   R S  
Sbjct: 234  FEKEIGARLNKLEKKLNSKRENIPGSYINFLDKLRDGEDAKEMDKKLFIKKKFKFRASEK 293

Query: 762  KGRNDPKGFQSSKS-----NKNLPVGGESGL--LDNGE----QRADL--SDGDDMEEVDS 622
              R+D KGF S K      N+N      SG   ++NG+    Q  D   SDG+++E+++ 
Sbjct: 294  NSRSDVKGFPSLKDCSATRNENGMATSGSGTKEVENGKRVVSQNLDFLPSDGEEIEKIEE 353

Query: 621  HLL 613
              L
Sbjct: 354  EEL 356


>ref|XP_007035589.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508714618|gb|EOY06515.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score = 65.1 bits (157), Expect = 1e-07
 Identities = 61/243 (25%), Positives = 105/243 (43%), Gaps = 34/243 (13%)
 Frame = -1

Query: 1239 SFINLVFCILGMFVFQTICTIWILGSADIDEIS----------GDSVEMEGGEKNXXXXX 1090
            S +   F  +G+FVFQT+  +W+ G+ D  +            G  +     E +     
Sbjct: 124  SVLKFGFYFVGIFVFQTLVAVWVTGNGDSQDKDRNFQRKKSWHGKFLNNGKVESSSRNVF 183

Query: 1089 XXXXXXXEKRVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFVVETASSRHATD 910
                   E++V EI+ +AREAR  E+++ ++          G++  D + E+ +S+    
Sbjct: 184  SWDNSELEEKVKEIRAMAREARKIEEKETKN----------GDEEGDMIAESLNSKARIG 233

Query: 909  VRKEVDRRLLNLQKSIQTSRGDSP---------RRSNSGTK--GANLALRKRQAVRVSLM 763
              KE+  RL  L+K + + R + P          R     K     L ++K+   R S  
Sbjct: 234  FEKEIGARLNKLEKKLNSKRENIPGSYINFLDKLRDGEDAKEMDKKLFIKKKFKFRASEK 293

Query: 762  KGRNDPKGFQSSKS-----NKNLPVGGESGL--LDNGE----QRADL--SDGDDMEEVDS 622
              R+D KGF S K      N+N      SG   ++NG+    Q  D   SDG+++E+++ 
Sbjct: 294  NSRSDVKGFPSLKDCSATRNENGMATSGSGTKEVENGKRVVSQNLDFLPSDGEEIEKIEE 353

Query: 621  HLL 613
              L
Sbjct: 354  EEL 356


>ref|XP_006840533.2| PREDICTED: uncharacterized protein LOC18430311 [Amborella trichopoda]
          Length = 695

 Score = 64.3 bits (155), Expect = 2e-07
 Identities = 90/333 (27%), Positives = 134/333 (40%), Gaps = 58/333 (17%)
 Frame = -1

Query: 1248 SPGSFINLVFCILGMFVFQTICTIWILGSADIDEISGDSVEMEGGE--------KNXXXX 1093
            S  S I+LV  ++G+FVFQT C +W+LGSA+ D+  G   E   G         KN    
Sbjct: 115  SSRSVIHLVLSLVGVFVFQTACAVWVLGSANFDDKLGKLEENSDGSSSSSSPNIKNGLFS 174

Query: 1092 XXXXXXXXEK----------RVSEIQILAREARASEQRKARDEASASSSTAVGNDGDDFV 943
                     K          R+S I+ +AREARA+E+++ +++        V  + +D  
Sbjct: 175  SGRKDGYFAKLSTGEAELGERISLIRSMAREARANERKRLKED-----DPFVSLEENDTF 229

Query: 942  VETASSRHA-----TDVRKEVDRRL-------------------------LNLQKSIQTS 853
            VET  +  A     T + KEVD+ L                         L++Q  I  S
Sbjct: 230  VETTKNLSAPVKFQTPIEKEVDKHLEILPRLVPKRLKDSTELPVKSLTKVLDVQNLIGKS 289

Query: 852  RGDSPRRSNSGTK---GANLALRKRQAVRVSLMKGRNDPKGFQSSKSNKNLPVGGESGLL 682
            R     R NS  K     +  +++    R   ++ R  P G + S S+KN    GE    
Sbjct: 290  RSRKASRKNSPYKYRQNLDGIVKEADKNRSKSVQAR-APWGSRGSSSDKN----GE---- 340

Query: 681  DNGEQRADLSDGDDMEEVDSHLLNDEVLKRIMLKFQANEEGGREPLNGLGSEEEMNFFLA 502
              G +R   S   D    +SH     VL             GRE  N    E+E+ F   
Sbjct: 341  --GSKRKATSADVDGRNGNSH----SVL-------------GREKHN---MEDEIGFSKV 378

Query: 501  LQRKFEK--EGM-----DNAKQWMEKRMEGIDL 424
            L  K E+  +G      +  K+ ++ R+E IDL
Sbjct: 379  LDGKLERMLDGKLERKDEGTKERIQPRIEAIDL 411

