BLASTX nr result

ID: Cornus23_contig00007163 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00007163
         (3842 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP07372.1| unnamed protein product [Coffea canephora]            161   4e-36
ref|XP_010266830.1| PREDICTED: glutamic acid-rich protein-like [...   155   3e-34
ref|XP_010265994.1| PREDICTED: protein FAM133 isoform X2 [Nelumb...   150   9e-33
ref|XP_010265992.1| PREDICTED: splicing regulatory glutamine/lys...   150   9e-33
ref|XP_012073759.1| PREDICTED: uncharacterized protein LOC105635...   139   2e-29
ref|XP_010648991.1| PREDICTED: uncharacterized protein LOC100254...   134   5e-28
ref|XP_002262760.2| PREDICTED: uncharacterized protein LOC100254...   134   5e-28
gb|KHG14510.1| Heat shock factor 4 [Gossypium arboreum]               131   4e-27
ref|XP_012469303.1| PREDICTED: splicing regulatory glutamine/lys...   130   1e-26
gb|KJB17613.1| hypothetical protein B456_003G007900 [Gossypium r...   130   1e-26
ref|XP_006342067.1| PREDICTED: muscle M-line assembly protein un...   127   1e-25
ref|XP_010029657.1| PREDICTED: uncharacterized protein LOC104419...   126   2e-25
ref|XP_011083480.1| PREDICTED: uncharacterized protein LOC105166...   125   3e-25
ref|XP_011074998.1| PREDICTED: uncharacterized protein LOC105159...   124   7e-25
ref|XP_009589462.1| PREDICTED: uncharacterized protein LOC104086...   122   3e-24
ref|XP_002510569.1| conserved hypothetical protein [Ricinus comm...   122   3e-24
ref|XP_009772830.1| PREDICTED: glutamic acid-rich protein-like [...   120   1e-23
ref|XP_010113316.1| hypothetical protein L484_026647 [Morus nota...   115   4e-22
ref|XP_007017860.1| JHL20J20.12 protein, putative [Theobroma cac...   114   7e-22
ref|XP_004238373.1| PREDICTED: exocyst complex component 6 [Sola...   113   1e-21

>emb|CDP07372.1| unnamed protein product [Coffea canephora]
          Length = 319

 Score =  161 bits (408), Expect = 4e-36
 Identities = 103/253 (40%), Positives = 140/253 (55%), Gaps = 7/253 (2%)
 Frame = -2

Query: 1075 DLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHSDGSH 896
            ++KG   +K+RK+++EQLERS ITEEH+QP  SQNP YS DSTQNSNKR+RH P  + + 
Sbjct: 78   NVKGGFLQKERKDDSEQLERSSITEEHEQPVCSQNPSYSSDSTQNSNKRKRHDPPLNATR 137

Query: 895  NHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESGVAHGPSGEEPCSTSTET 716
              G+I+RIRLP QK  +  +   +E   STS R   PA+H+   A     ++ CSTS  +
Sbjct: 138  VQGNILRIRLPSQKHIQHDSKDRDELLCSTSGRTDIPAEHKDARA---DPDKSCSTSLGS 194

Query: 715  GIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIKSFGNGMTRKESL------YKDL 554
             +I      HG         AR  S  Q  + S  ++ +  +G  R   L      Y DL
Sbjct: 195  DLI-----LHGLPLRSDQGLARGNSSQQPDVTSQEIVHT-DSGSKRHRKLKRAVKRYTDL 248

Query: 553  IETLLPPLQ-SHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXTLWWPRAQYLA 377
            IE   PP + S   E DD+ WLFG K  + Q +K+              +L WPRA +L 
Sbjct: 249  IENWTPPSRLSEHTEIDDEGWLFGSKHAEKQPEKK--VRCSSDISCSSSSLLWPRACHLH 306

Query: 376  EADIYALPFTVPY 338
            +ADIYALP+TVP+
Sbjct: 307  DADIYALPYTVPF 319


>ref|XP_010266830.1| PREDICTED: glutamic acid-rich protein-like [Nelumbo nucifera]
            gi|720034790|ref|XP_010266831.1| PREDICTED: glutamic
            acid-rich protein-like [Nelumbo nucifera]
          Length = 323

 Score =  155 bits (392), Expect = 3e-34
 Identities = 102/264 (38%), Positives = 138/264 (52%), Gaps = 5/264 (1%)
 Frame = -2

Query: 1114 KLNDKKIHKKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSN 935
            K ND+K HKK   D KG    K  ++++EQLE+S +TE+H Q   SQN   S DST NS+
Sbjct: 62   KHNDEKKHKKSKVDQKGGEHPKNIQDDSEQLEKSVLTEDHGQAAVSQNVYDSSDSTGNSH 121

Query: 934  KRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESGVAHG 755
            KR+R S  SDGS N+GSI+RIRLPL K KE  AL S+  ++  S      AQ    V H 
Sbjct: 122  KRKRLSSPSDGSQNNGSILRIRLPLMKHKEPEALPSKAVSSCNSIGSDVVAQGRCEVTHR 181

Query: 754  PSGEEPCSTS----TETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIKSFGNG 587
               E+ CS+S    T T + N          E   +++     AQD   S++      + 
Sbjct: 182  SGNEQACSSSCRIETATAVQNSHVKASSSSIELPCSTSSIGIVAQDKAPSSTCGVPKRDK 241

Query: 586  MTRKESLYKDLIETLL-PPLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXX 410
            + ++   Y+DL+E  + PP+Q   AEFDD +WLF  K       KR              
Sbjct: 242  IKKELQKYRDLVENWVPPPIQREYAEFDDQDWLFEVKPHGRHEAKRVKVDSDSLCRGSSD 301

Query: 409  TLWWPRAQYLAEADIYALPFTVPY 338
               WP+A YL E D+YALP+TVP+
Sbjct: 302  L--WPQACYLPEVDVYALPYTVPF 323


>ref|XP_010265994.1| PREDICTED: protein FAM133 isoform X2 [Nelumbo nucifera]
          Length = 325

 Score =  150 bits (379), Expect = 9e-33
 Identities = 104/263 (39%), Positives = 140/263 (53%), Gaps = 6/263 (2%)
 Frame = -2

Query: 1108 NDKKIHKKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKR 929
            ++K+   K   D KG    +   +E+ QLE+S +TEEH Q   SQNP  S DSTQNS+KR
Sbjct: 68   HEKRHKDKSKVDNKGGEHPRNSHDESGQLEKSGLTEEHGQAAVSQNPDDSSDSTQNSHKR 127

Query: 928  RRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEE-RANSTSRRIVSPAQHESGVAHGP 752
            ++HS  SD SHNH SI+RIRLPL K K+   L S+E  A S+S RIV  +Q +      P
Sbjct: 128  KKHSTSSDVSHNHASILRIRLPLVKHKDPEMLPSKEVAACSSSGRIVISSQGKCEATPEP 187

Query: 751  SGEEPCSTSTETGIINLRGVTHGPDKE---RSSASARTISFAQDSMQS-TSVIKSFGNGM 584
              EE CSTS  + I       + P      R S S R    A+D   + TS   +  + M
Sbjct: 188  KREEVCSTSDRSEIAIQDKHANVPCSSIGVRCSTSRRNEVVAEDRTGTCTSSFPAESDEM 247

Query: 583  TRKESLYKDLIETLLPP-LQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXT 407
              +   Y+DLI+  +PP +QS   EFD+ +WLF     + + + +              +
Sbjct: 248  KIELRKYRDLIQNWVPPAIQSEYNEFDNQDWLF-----EVRHESKKVKVDGGSSSHGTSS 302

Query: 406  LWWPRAQYLAEADIYALPFTVPY 338
              WPR  YL E DIYALPFTVP+
Sbjct: 303  DPWPRCCYLREVDIYALPFTVPF 325


>ref|XP_010265992.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1
            isoform X1 [Nelumbo nucifera]
            gi|720032033|ref|XP_010265993.1| PREDICTED: splicing
            regulatory glutamine/lysine-rich protein 1 isoform X1
            [Nelumbo nucifera]
          Length = 327

 Score =  150 bits (379), Expect = 9e-33
 Identities = 104/263 (39%), Positives = 140/263 (53%), Gaps = 6/263 (2%)
 Frame = -2

Query: 1108 NDKKIHKKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKR 929
            ++K+   K   D KG    +   +E+ QLE+S +TEEH Q   SQNP  S DSTQNS+KR
Sbjct: 70   HEKRHKDKSKVDNKGGEHPRNSHDESGQLEKSGLTEEHGQAAVSQNPDDSSDSTQNSHKR 129

Query: 928  RRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEE-RANSTSRRIVSPAQHESGVAHGP 752
            ++HS  SD SHNH SI+RIRLPL K K+   L S+E  A S+S RIV  +Q +      P
Sbjct: 130  KKHSTSSDVSHNHASILRIRLPLVKHKDPEMLPSKEVAACSSSGRIVISSQGKCEATPEP 189

Query: 751  SGEEPCSTSTETGIINLRGVTHGPDKE---RSSASARTISFAQDSMQS-TSVIKSFGNGM 584
              EE CSTS  + I       + P      R S S R    A+D   + TS   +  + M
Sbjct: 190  KREEVCSTSDRSEIAIQDKHANVPCSSIGVRCSTSRRNEVVAEDRTGTCTSSFPAESDEM 249

Query: 583  TRKESLYKDLIETLLPP-LQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXT 407
              +   Y+DLI+  +PP +QS   EFD+ +WLF     + + + +              +
Sbjct: 250  KIELRKYRDLIQNWVPPAIQSEYNEFDNQDWLF-----EVRHESKKVKVDGGSSSHGTSS 304

Query: 406  LWWPRAQYLAEADIYALPFTVPY 338
              WPR  YL E DIYALPFTVP+
Sbjct: 305  DPWPRCCYLREVDIYALPFTVPF 327


>ref|XP_012073759.1| PREDICTED: uncharacterized protein LOC105635312 [Jatropha curcas]
            gi|317106597|dbj|BAJ53105.1| JHL20J20.12 [Jatropha
            curcas] gi|643728958|gb|KDP36895.1| hypothetical protein
            JCGZ_08186 [Jatropha curcas]
          Length = 307

 Score =  139 bits (351), Expect = 2e-29
 Identities = 95/237 (40%), Positives = 131/237 (55%), Gaps = 2/237 (0%)
 Frame = -2

Query: 1042 KNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHSDGSHNHGSIIRIRLP 863
            K + E+ ERS +TEEHDQP  SQ+ CYS DST++S+KR+R     + + + G+IIRIRLP
Sbjct: 83   KVQEEEAERSGLTEEHDQPVCSQSLCYSPDSTRSSDKRKRDDLSYNITKSSGNIIRIRLP 142

Query: 862  LQKTKESVALASEERANSTSRRIVSPAQHESGVAHGPSGEEPCSTSTETGI-INLRGVTH 686
            LQK +E  A  S E   S+SR+    AQ +  +   P  E+P S +++TGI I+   VT 
Sbjct: 143  LQKHREVDASTSGEHVRSSSRKSDFLAQKQ--IITVPDKEQPSSINSKTGINISDPIVTP 200

Query: 685  GPDKERSSASARTISFAQDSMQSTSVIKSFGNGMTRKESLYKDLIETLLP-PLQSHQAEF 509
              + E    S R        + + S + S   G+   ESLYKDL+E  +P PL   Q   
Sbjct: 201  CANLEADKDSVR------KRVITASGVSSRVRGVQNAESLYKDLLEDWVPLPLGCDQNNI 254

Query: 508  DDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
             D EWLFG K Q    +K               +  WP A+YL EA++YALP+TVP+
Sbjct: 255  GDQEWLFGTKKQ----EKHKRLKSQCDEPCHGSSTLWPCARYLPEAEVYALPYTVPF 307


>ref|XP_010648991.1| PREDICTED: uncharacterized protein LOC100254073 isoform X2 [Vitis
            vinifera]
          Length = 329

 Score =  134 bits (338), Expect = 5e-28
 Identities = 97/269 (36%), Positives = 131/269 (48%), Gaps = 12/269 (4%)
 Frame = -2

Query: 1108 NDKKIHKKHTGD------LKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDST 947
            + KK HKK   D       KG + +  RKNE +  E+S +TEEH  P GS+N CYS D +
Sbjct: 69   HQKKGHKKRLKDDMSQENQKGGNHQNSRKNETDHFEKSTLTEEHGHPIGSENICYSSDGS 128

Query: 946  QNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESG 767
             NSNKR+++S   +G HN G+I RIRLPLQ+ K+   L S+ +  S   R     Q    
Sbjct: 129  LNSNKRQKYSSPPNGKHNSGNIFRIRLPLQRHKDLEVLPSKGQPCSALGRTDVFVQEMCD 188

Query: 766  VAHGP---SGEEPCSTSTETGIINLRGVTH--GPDKERSSASARTISFAQDSMQSTSVIK 602
            +A  P    GE  C  S  TG    +G+ H  G      S++A  I   +  +   S+  
Sbjct: 189  LAPKPGRREGEHLCFASWITG----QGLDHKLGRKNPCPSSAAHEIFGQKPEIAPASI-- 242

Query: 601  SFGNGMTRKESLYKDLIETLLPPL-QSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXX 425
            S G+  +  E   KDL++  +PPL QS     D+ +WL   K   N   +R         
Sbjct: 243  SSGSDSSLLELRIKDLLDYSVPPLMQSQFPASDNQDWLLETKQNHNLAPERCETNHDGGS 302

Query: 424  XXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                    W R  YL E DIYALPFTVP+
Sbjct: 303  YGNSAQ--WSRVCYLPEVDIYALPFTVPF 329


>ref|XP_002262760.2| PREDICTED: uncharacterized protein LOC100254073 isoform X1 [Vitis
            vinifera]
          Length = 331

 Score =  134 bits (338), Expect = 5e-28
 Identities = 97/269 (36%), Positives = 131/269 (48%), Gaps = 12/269 (4%)
 Frame = -2

Query: 1108 NDKKIHKKHTGD------LKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDST 947
            + KK HKK   D       KG + +  RKNE +  E+S +TEEH  P GS+N CYS D +
Sbjct: 71   HQKKGHKKRLKDDMSQENQKGGNHQNSRKNETDHFEKSTLTEEHGHPIGSENICYSSDGS 130

Query: 946  QNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESG 767
             NSNKR+++S   +G HN G+I RIRLPLQ+ K+   L S+ +  S   R     Q    
Sbjct: 131  LNSNKRQKYSSPPNGKHNSGNIFRIRLPLQRHKDLEVLPSKGQPCSALGRTDVFVQEMCD 190

Query: 766  VAHGP---SGEEPCSTSTETGIINLRGVTH--GPDKERSSASARTISFAQDSMQSTSVIK 602
            +A  P    GE  C  S  TG    +G+ H  G      S++A  I   +  +   S+  
Sbjct: 191  LAPKPGRREGEHLCFASWITG----QGLDHKLGRKNPCPSSAAHEIFGQKPEIAPASI-- 244

Query: 601  SFGNGMTRKESLYKDLIETLLPPL-QSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXX 425
            S G+  +  E   KDL++  +PPL QS     D+ +WL   K   N   +R         
Sbjct: 245  SSGSDSSLLELRIKDLLDYSVPPLMQSQFPASDNQDWLLETKQNHNLAPERCETNHDGGS 304

Query: 424  XXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                    W R  YL E DIYALPFTVP+
Sbjct: 305  YGNSAQ--WSRVCYLPEVDIYALPFTVPF 331


>gb|KHG14510.1| Heat shock factor 4 [Gossypium arboreum]
          Length = 325

 Score =  131 bits (330), Expect = 4e-27
 Identities = 103/273 (37%), Positives = 134/273 (49%), Gaps = 11/273 (4%)
 Frame = -2

Query: 1123 EEVKLNDKKIHKKHTG--DLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDS 950
            E  K   KK HK      D KG   +KKR+NE E  E+S +TEEH Q  G QN   S DS
Sbjct: 66   ESKKRGHKKRHKDERSKEDQKGGDRQKKRENEVECFEKSTLTEEHGQAVGPQN---SSDS 122

Query: 949  TQNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTS---RRIVSPAQ 779
            T NS+KR++ S   D   N GSIIRIRLP Q+ K+   L S+E+  STS           
Sbjct: 123  TLNSSKRQKLSSPPDSGQNPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRV 182

Query: 778  HESGVAHGPSGEE-PCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIK 602
            HE     G   EE PCSTS     I    +T    KE++ +S    S   +++   + + 
Sbjct: 183  HEHAPRPGKELEEQPCSTSD----IKRPELTFKLGKEKACSS----SLTSETLAHNAKVP 234

Query: 601  SFGNGMT----RKESLYKDLIET-LLPPLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXX 437
            +  N  T    +    +K+L+E  ++P LQS      DD+WL  +K   N   K      
Sbjct: 235  TLSNLCTTCPPKLALQFKNLVEDWVMPTLQSESTSSGDDDWLVQKKQNLNTEVKTHKDGN 294

Query: 436  XXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                     T  WPRA +L EADIYALPFTVP+
Sbjct: 295  LNSNQMSSAT--WPRACFLPEADIYALPFTVPF 325


>ref|XP_012469303.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1
            [Gossypium raimondii] gi|763750226|gb|KJB17614.1|
            hypothetical protein B456_003G007900 [Gossypium
            raimondii]
          Length = 325

 Score =  130 bits (326), Expect = 1e-26
 Identities = 101/269 (37%), Positives = 132/269 (49%), Gaps = 7/269 (2%)
 Frame = -2

Query: 1123 EEVKLNDKKIHKKHTG--DLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDS 950
            E  K   KK HK      D KG   +KKR+ E E  E+S +TEEH Q  G QN   S DS
Sbjct: 66   ESKKHGHKKRHKDEGSKEDQKGGDRQKKREYEVECFEKSTLTEEHGQAVGPQN---SSDS 122

Query: 949  TQNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTS---RRIVSPAQ 779
            T NS+KR++ S   D   N GSIIRIRLP Q+ K+   L S+E+  STS           
Sbjct: 123  TLNSSKRQKLSSPPDSGQNPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRV 182

Query: 778  HESGVAHGPSGEE-PCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIK 602
            HE     G   EE PCSTS     I    +T    KE++ +S+RT      + ++ ++  
Sbjct: 183  HEHAPRPGKELEEQPCSTSD----IKRPELTFKLGKEKACSSSRTSETLAHNTKAPTLSN 238

Query: 601  SFGNGMTRKESLYKDLIET-LLPPLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXX 425
                   +    +K+L+E  ++P  QS      DD+WLF +K   N   K          
Sbjct: 239  LCTTCPPKLALQFKNLVEDWVMPTPQSELTSSGDDDWLFQKKQNLNTEVKTHKDGNLNSN 298

Query: 424  XXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                 T  WPRA +L EADIYALPFTVP+
Sbjct: 299  QMSSAT--WPRACFLPEADIYALPFTVPF 325


>gb|KJB17613.1| hypothetical protein B456_003G007900 [Gossypium raimondii]
          Length = 323

 Score =  130 bits (326), Expect = 1e-26
 Identities = 101/269 (37%), Positives = 132/269 (49%), Gaps = 7/269 (2%)
 Frame = -2

Query: 1123 EEVKLNDKKIHKKHTG--DLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDS 950
            E  K   KK HK      D KG   +KKR+ E E  E+S +TEEH Q  G QN   S DS
Sbjct: 64   ESKKHGHKKRHKDEGSKEDQKGGDRQKKREYEVECFEKSTLTEEHGQAVGPQN---SSDS 120

Query: 949  TQNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTS---RRIVSPAQ 779
            T NS+KR++ S   D   N GSIIRIRLP Q+ K+   L S+E+  STS           
Sbjct: 121  TLNSSKRQKLSSPPDSGQNPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRV 180

Query: 778  HESGVAHGPSGEE-PCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIK 602
            HE     G   EE PCSTS     I    +T    KE++ +S+RT      + ++ ++  
Sbjct: 181  HEHAPRPGKELEEQPCSTSD----IKRPELTFKLGKEKACSSSRTSETLAHNTKAPTLSN 236

Query: 601  SFGNGMTRKESLYKDLIET-LLPPLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXX 425
                   +    +K+L+E  ++P  QS      DD+WLF +K   N   K          
Sbjct: 237  LCTTCPPKLALQFKNLVEDWVMPTPQSELTSSGDDDWLFQKKQNLNTEVKTHKDGNLNSN 296

Query: 424  XXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                 T  WPRA +L EADIYALPFTVP+
Sbjct: 297  QMSSAT--WPRACFLPEADIYALPFTVPF 323


>ref|XP_006342067.1| PREDICTED: muscle M-line assembly protein unc-89-like [Solanum
            tuberosum]
          Length = 308

 Score =  127 bits (318), Expect = 1e-25
 Identities = 100/275 (36%), Positives = 135/275 (49%), Gaps = 13/275 (4%)
 Frame = -2

Query: 1123 EEVKLNDKKIHK--------KHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNP 968
            +E K  +KK  K        K T + KG+   K  ++EAEQLERS++TEEH+    SQN 
Sbjct: 47   KEKKREEKKAKKEKSNLGFDKATHESKGKYLFKCLEDEAEQLERSNLTEEHEPAVCSQNS 106

Query: 967  CYSCDSTQNSNKRRR-HSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIV 791
              S DSTQNSNKR+R  SP   G   HGSIIRIRL          +  E  A+S  + + 
Sbjct: 107  SCSSDSTQNSNKRKRPASPSRGGIQAHGSIIRIRL------SKKGMQGEISASSKEKHLP 160

Query: 790  SPAQHESGVAHGPSGE--EPCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQS 617
             PAQ  + V    S E   P   +T         V   P    S+++   +        +
Sbjct: 161  KPAQQVAEVTVRASAERANPLLKTTNKRSCPPPVVVSEP----STSNCGWVDRVAVDNAT 216

Query: 616  TSVIKSFGNGMTRKESLYKDLIETLLPP-LQSHQAEFDDDE-WLFGRKLQDNQRDKRXXX 443
             S  K   N +   E  YK+LIE  LPP L S   + DDD+ WLF RK +  + +++   
Sbjct: 217  PSCSKVHENSI---EFQYKNLIENWLPPSLPSDNLDLDDDQSWLFQRKPKQARVEEKNVG 273

Query: 442  XXXXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                       +LW PRAQYL + D+YALP+TVP+
Sbjct: 274  SSNDKTCGSCSSLWQPRAQYLPDVDLYALPYTVPF 308


>ref|XP_010029657.1| PREDICTED: uncharacterized protein LOC104419639 [Eucalyptus grandis]
            gi|629090353|gb|KCW56606.1| hypothetical protein
            EUGRSUZ_I02328 [Eucalyptus grandis]
          Length = 315

 Score =  126 bits (316), Expect = 2e-25
 Identities = 91/259 (35%), Positives = 123/259 (47%), Gaps = 8/259 (3%)
 Frame = -2

Query: 1090 KKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPH 911
            K+   D K     KKRK++ E LE+S++TEEH +P  S N   S DST NS+K+++    
Sbjct: 71   KRTQADQKAGDHRKKRKHDTEHLEKSNLTEEHGKPVNSLN---STDSTMNSSKKQKQILP 127

Query: 910  SDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRR--IVSPAQHESGVAHGPSG--- 746
             DG  N  SIIRIRLPLQ+ K+   L S E+  S   R  +V   +HE    H P     
Sbjct: 128  PDGGLNPASIIRIRLPLQRHKDLEMLPSGEQPCSAPVRTDVVDHEKHE----HAPRSSTD 183

Query: 745  --EEPCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIKSFGNGMTRKE 572
              E  CSTS+  G     G            S+     +     ++S+      G++R E
Sbjct: 184  RREHLCSTSSSAG----EGTASKLGLMEQCPSSGVAEASSQKNGTSSLPSLDDRGLSRSE 239

Query: 571  SLYKDLIET-LLPPLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXTLWWP 395
              Y++LIE  + P   S  A+ DD +WLFGRK  +                       WP
Sbjct: 240  IKYRNLIENWVAPSFHSGCADLDDQDWLFGRKQLNCDAGNCKADYDGSTYGSPSP---WP 296

Query: 394  RAQYLAEADIYALPFTVPY 338
            R  YL E D+YALP+TVPY
Sbjct: 297  RMHYLPEVDMYALPYTVPY 315


>ref|XP_011083480.1| PREDICTED: uncharacterized protein LOC105166005 [Sesamum indicum]
          Length = 750

 Score =  125 bits (314), Expect = 3e-25
 Identities = 92/267 (34%), Positives = 131/267 (49%), Gaps = 14/267 (5%)
 Frame = -2

Query: 1096 IHKKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHS 917
            I KK   D K     +  + E EQLERS +TEEH QP     P  S DST+NSNKR+R S
Sbjct: 491  IGKKIWPDAKVELLYRGSQAEPEQLERSSLTEEHGQPVCLCAPSTSSDSTENSNKRKRQS 550

Query: 916  PHSDGSHNHGSII---------RIRLPLQKTKESVALASEERANSTSRRIVSPAQHESGV 764
              +D +  H  I+         RIRLP++K  ES      +R  STS     P+Q++  +
Sbjct: 551  SPADVARGHSKIVFPRLPLINYRIRLPVKKPNES---GDTDRICSTSGSTPFPSQNKDDI 607

Query: 763  AHGPSGEEPCSTSTETGIINLRGVTHGPDKER---SSASARTISFAQDSMQSTSVIKSFG 593
            +   +    C T  ET  I  +G++   D+E+   +S     ++  +  + S S   +  
Sbjct: 608  SLKYNRGNVCCTLQETSNI-AQGLSRRTDREQICSTSGQINPVTAVKTGIPSAS--NTVM 664

Query: 592  NGMTRKESLYKDLIETLLPP--LQSHQAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXX 419
              M R E  YK+LIE  +PP  L S     D+ +WLF  K +  + +KR           
Sbjct: 665  TPMQRMELQYKNLIENWIPPKWLDSSLNSDDEQDWLFLGKNEGQRAEKRQKAGNDSLPCS 724

Query: 418  XXXTLWWPRAQYLAEADIYALPFTVPY 338
                + WP AQYL + D+YALPFTVP+
Sbjct: 725  SSSAI-WPHAQYLQDVDVYALPFTVPF 750


>ref|XP_011074998.1| PREDICTED: uncharacterized protein LOC105159585, partial [Sesamum
            indicum]
          Length = 310

 Score =  124 bits (311), Expect = 7e-25
 Identities = 93/275 (33%), Positives = 129/275 (46%), Gaps = 29/275 (10%)
 Frame = -2

Query: 1075 DLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHS--DG 902
            D KG+      + E EQLERS +TE+H QP     P  S DST+N+NKR+RHS  S  DG
Sbjct: 37   DGKGKVHHYSEEAETEQLERSSLTEDHGQPISFPVPSSSSDSTENTNKRKRHSSVSPMDG 96

Query: 901  SHNHGSIIRIRLPLQKTKESVAL-----ASEERANSTSRRIVS----------------- 788
            S +HG +I IRLP +K  E  AL     AS E+  S   +  +                 
Sbjct: 97   SRSHGKVILIRLPSKKQNEFDALKKVGSASLEKDLSVQSKDDAGLKDRSENSGCAFFQTD 156

Query: 787  -PAQHESGVAHGPSGEEPCSTSTETGIINLRGVTHGPDKER-SSASARTISFAQDSMQST 614
             PAQ +  +      E   ST   T  I  +G+T   + E+  S S +  + A       
Sbjct: 157  FPAQSKDDIGRRNRLENTHSTMKGTSNIK-QGITLRTNSEQVCSTSGQIEAVAPGKTGIK 215

Query: 613  SVIKSFGNGMTRKESLYKDLIETLLPPLQSHQAEFDDD---EWLFGRKLQDNQRDKRXXX 443
            SV K+    + ++E  YK+L+E  + P       + DD   +WLF  K ++N   K+   
Sbjct: 216  SVNKAVLKSVQKRELQYKNLLEKWVAPQPEDGCLYADDPDSDWLFDCKDKNNTHAKKRQR 275

Query: 442  XXXXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                       + WWP  +YL E D+YALPFTVPY
Sbjct: 276  RGSESISCSRSSTWWPHTEYLHEIDVYALPFTVPY 310


>ref|XP_009589462.1| PREDICTED: uncharacterized protein LOC104086823 [Nicotiana
            tomentosiformis]
          Length = 311

 Score =  122 bits (306), Expect = 3e-24
 Identities = 94/255 (36%), Positives = 127/255 (49%), Gaps = 5/255 (1%)
 Frame = -2

Query: 1087 KHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHS 908
            K T + KG    K  ++EAEQLERS++TEEH Q   SQN   S DSTQNSNKR+R +  S
Sbjct: 71   KATHESKGMYLFKCLEDEAEQLERSNLTEEHGQAVCSQNSSCSSDSTQNSNKRKRPASPS 130

Query: 907  DGS-HNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESGVAHGPSGE--EP 737
             G+   HGSIIRIRL  +  +  ++ A E       +++  PAQ ++ V      E   P
Sbjct: 131  HGNIQAHGSIIRIRLSKKGGQGEMSTAKE-------KQLRKPAQKDAEVTVRTIAERANP 183

Query: 736  CSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQSTSVIKSFGNGMTRKESLYKD 557
               +T         V     +   S S  T   A D   +    K   N +   E  YK+
Sbjct: 184  LQKATNNQCCPSLSVL----EPSPSTSGWTDCVAVDKTATALCSKEHENSI---EFQYKN 236

Query: 556  LIETLLPP-LQSHQAEFDDDEWLFGRKLQDNQ-RDKRXXXXXXXXXXXXXXTLWWPRAQY 383
            LIE  LPP LQ+     DD+ WLF RK +  +  +K               +  WPRAQY
Sbjct: 237  LIENWLPPSLQTEHLGVDDESWLFQRKPKHTRVGEKSVVSKEVSNDSTCGSSALWPRAQY 296

Query: 382  LAEADIYALPFTVPY 338
            + +A++YALPFTVP+
Sbjct: 297  IHDAELYALPFTVPF 311


>ref|XP_002510569.1| conserved hypothetical protein [Ricinus communis]
            gi|223551270|gb|EEF52756.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 301

 Score =  122 bits (305), Expect = 3e-24
 Identities = 85/241 (35%), Positives = 121/241 (50%), Gaps = 6/241 (2%)
 Frame = -2

Query: 1042 KNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHSDGSHNHGSIIRIRLP 863
            K + E+ ERS +TEEH+ P  SQ+ CYS DST++S KR+      + +  HG++IRIRLP
Sbjct: 77   KGKEEEAERSSLTEEHEPPVCSQSLCYSPDSTRSSKKRKGDDSVYNATKTHGNVIRIRLP 136

Query: 862  LQKTKESVALASEERANSTSRRIVSPAQHESGVAHGPSGEEPCSTSTETGIINLRGVTHG 683
            LQ+  E +A A+ E++ STS + +S  +    +    S  E CST       ++      
Sbjct: 137  LQRHIEPIASANGEQSCSTSGKNLSEQEQVITI----SRREHCSTINFKAAEDITSAPIK 192

Query: 682  P----DKERSSASARTISFAQDSMQSTSVIKSFGNGMTRKESLYKDLIE--TLLPPLQSH 521
            P    D ER   SAR  S  +   +           + + ES YK L E    LP   + 
Sbjct: 193  PILTADLERKEKSARLSSKTEKKEKK----------LYKAESRYKALFEDWAPLPVGFAQ 242

Query: 520  QAEFDDDEWLFGRKLQDNQRDKRXXXXXXXXXXXXXXTLWWPRAQYLAEADIYALPFTVP 341
            Q  FDD +WL   K Q+  RDKR                +WP A++L  ADIYALP+T+P
Sbjct: 243  QNNFDDCDWLCCSKRQERSRDKRLQISHDEPANEGLG--FWPCARFLPHADIYALPYTIP 300

Query: 340  Y 338
            +
Sbjct: 301  F 301


>ref|XP_009772830.1| PREDICTED: glutamic acid-rich protein-like [Nicotiana sylvestris]
          Length = 358

 Score =  120 bits (301), Expect = 1e-23
 Identities = 95/254 (37%), Positives = 128/254 (50%), Gaps = 4/254 (1%)
 Frame = -2

Query: 1087 KHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDSTQNSNKRRRHSPHS 908
            K T + KG    K  ++EAEQLERS++TEEH Q   SQN   S DSTQNSNKR+R +  S
Sbjct: 118  KATHESKGMYLFKCLEDEAEQLERSNLTEEHGQAVCSQNSSCSSDSTQNSNKRKRPASPS 177

Query: 907  DGS-HNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVSPAQHESGVAHGPSGEEPCS 731
             G+   HGSIIRIRL  +  +   + A E       +++  PAQ +  V    S E    
Sbjct: 178  HGNIQAHGSIIRIRLSKKDGQGKTSTAKE-------KQLRKPAQKDVEVTVRTSVERANP 230

Query: 730  TSTETGIINLRGVTHGPDKERS-SASARTISFAQDSMQSTSVIKSFGNGMTRKESLYKDL 554
                T   N +G       E S S S      A D   + S  K+  N +   E  Y++L
Sbjct: 231  LLKAT---NNQGCPSPSVLEPSPSTSGWRDCVAVDKAATASCSKAHENSI---EFQYRNL 284

Query: 553  IETLLPP-LQSHQAEFDDDEWLFGRKLQDNQ-RDKRXXXXXXXXXXXXXXTLWWPRAQYL 380
            IE  LPP LQ+   + D + WLF RK +  +  +K               +  WPRAQY+
Sbjct: 285  IENWLPPSLQTEHLDVDGEAWLFQRKPKHTRVGEKSAVSKEVSNDSTCGSSALWPRAQYI 344

Query: 379  AEADIYALPFTVPY 338
             +A++YALPFTVP+
Sbjct: 345  HDAELYALPFTVPF 358


>ref|XP_010113316.1| hypothetical protein L484_026647 [Morus notabilis]
            gi|587949121|gb|EXC35323.1| hypothetical protein
            L484_026647 [Morus notabilis]
          Length = 394

 Score =  115 bits (287), Expect = 4e-22
 Identities = 104/331 (31%), Positives = 143/331 (43%), Gaps = 69/331 (20%)
 Frame = -2

Query: 1126 GEEVKLNDKKIHKKHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNPCYSCDST 947
            G+E + + K+  ++     KGR  +KKRK E E LERS++TEEH QP GSQN   S DST
Sbjct: 72   GKEKQSHKKRHREEDQESKKGRDHDKKRKLETENLERSNLTEEHGQPVGSQN---SSDST 128

Query: 946  QNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIVS------P 785
             NSNKRR     ++  HN GSIIRIRLPLQ+ K+   L S+E++ S S R  +      P
Sbjct: 129  VNSNKRRNPCSPAESCHNSGSIIRIRLPLQRHKDPEILPSKEQSCSASGRTHNAFVQGRP 188

Query: 784  AQHESGVAHGPSGEEPCSTSTETGIINL-------------RGVTHGPD--------KER 668
            ++  S          PCSTST     NL             R  T   D        KE 
Sbjct: 189  SEPASRQGKEQGEHHPCSTSTR----NLSQVAKNSRLSKEHRSTTKSVDLSQNSRLIKEN 244

Query: 667  SSASARTISFAQDS----------------MQSTSVIKSFGNGMTRK-------ESLYKD 557
              ++ +++  +Q+S                 Q++ +IK      T+        ES+   
Sbjct: 245  HCSTTKSVDLSQNSRLIKEKHCPTTKSVDLSQNSRLIKEKHCPTTKSVDISHKAESIPML 304

Query: 556  LIETLLPPLQSHQAEFDD-------------------DEWLFGRKLQDNQRDKRXXXXXX 434
                  PPL    +++ D                   + WLF  K QD++          
Sbjct: 305  STSPHFPPLPPMVSQYRDLFENWVPPPMQDDCMELGVETWLFKSK-QDHKNGVERCKDGG 363

Query: 433  XXXXXXXXTLWWPRAQYLAEADIYALPFTVP 341
                    TL WPRA YL   DI+ALP+ VP
Sbjct: 364  DILSHEPSTL-WPRAHYLPSVDIFALPYAVP 393


>ref|XP_007017860.1| JHL20J20.12 protein, putative [Theobroma cacao]
            gi|508723188|gb|EOY15085.1| JHL20J20.12 protein, putative
            [Theobroma cacao]
          Length = 289

 Score =  114 bits (285), Expect = 7e-22
 Identities = 90/274 (32%), Positives = 128/274 (46%), Gaps = 9/274 (3%)
 Frame = -2

Query: 1132 LHGEEVKLNDKKIHKKHTGDLKGRSTEKKRKNEA------EQLERSDITEEHDQPTGSQN 971
            +H    K   KK  +K      G S + K+ ++       EQL  SD+TEEH+ P     
Sbjct: 36   IHKRREKKEKKKKERKEKEKTHGISKKLKKLDDDLNGDKDEQLGNSDLTEEHEPPV---- 91

Query: 970  PCYSCDSTQNSNKRRRHSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRRIV 791
             CY  D TQNSNKR+R +P S     +GS I+IR   +K +ES A   EER  STS R  
Sbjct: 92   -CYLSDGTQNSNKRKRETPSSSECRVNGS-IKIRFSFKKPRESDASLCEERVCSTSGRAD 149

Query: 790  SPAQHESGVAHGPSGEEPCSTSTETGIINLRGVTHGPDKERSSASARTISFAQDSMQS-- 617
               Q        P  +E    S +   I    +TH P+++ ++   + +    +  Q   
Sbjct: 150  CSTQ--------PIAQEQPDPSNQKENI----ITHVPEQKITTVLEQKLWRDNERKQQIP 197

Query: 616  TSVIKSFGNGMTRKESLYKDLIETLLP-PLQSHQAEFDDDEWLFGRKLQDNQRDKRXXXX 440
            +S    FGN M +    YK L+E L+P PLQ    +  DD+WLF  K Q     +R    
Sbjct: 198  SSGTSVFGNKMKKAALQYKTLLEDLMPLPLQLQNHDDYDDDWLFKSKQQGKHAGERSKVD 257

Query: 439  XXXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                      +   PRA +L + +IYALP+TVP+
Sbjct: 258  DDVRCPTIATSC--PRAHFLPDVEIYALPYTVPF 289


>ref|XP_004238373.1| PREDICTED: exocyst complex component 6 [Solanum lycopersicum]
          Length = 309

 Score =  113 bits (283), Expect = 1e-21
 Identities = 93/276 (33%), Positives = 133/276 (48%), Gaps = 14/276 (5%)
 Frame = -2

Query: 1123 EEVKLNDKKIHK--------KHTGDLKGRSTEKKRKNEAEQLERSDITEEHDQPTGSQNP 968
            +E K  +KK  K        K T + KG+   K  ++E EQLERS++TEEH+    SQN 
Sbjct: 47   KEKKREEKKAKKEKSNLGFGKATHESKGKYLFKCFEDEPEQLERSNLTEEHEPAVCSQNS 106

Query: 967  CYSCDSTQNSNKRRR---HSPHSDGSHNHGSIIRIRLPLQKTKESVALASEERANSTSRR 797
              S DSTQNSNKR+R    SP   G   HGSIIRIRL  +  +  ++++ E       + 
Sbjct: 107  SCSSDSTQNSNKRKRPTSPSPSRGGIQAHGSIIRIRLSKKGVQGEISVSKE-------KH 159

Query: 796  IVSPAQHESGVAHGPSGEEPCSTSTETGIINLRGVTHG-PDKERSSASARTISFAQDSMQ 620
            +  PAQ  + V    S E        T   N R         E S+++   +    +   
Sbjct: 160  LPKPAQQVAEVTVRTSAERANPLLKTT---NKRSCPPPVAVSEPSTSNCGWVDRVAEDNA 216

Query: 619  STSVIKSFGNGMTRKESLYKDLIETLLPP-LQSHQAEFDDDE-WLFGRKLQDNQRDKRXX 446
            + S  K   N +   E  YK+LIE  LPP L S   + +DD+ WLF RK +  + +++  
Sbjct: 217  TPSCSKVHENSI---EFQYKNLIENWLPPSLPSDNLDLEDDQSWLFQRKPKQARVEEKNL 273

Query: 445  XXXXXXXXXXXXTLWWPRAQYLAEADIYALPFTVPY 338
                            PRAQYL + ++YALP+TVP+
Sbjct: 274  GGGDKTCGSCSSLWQQPRAQYLPDVELYALPYTVPF 309

