BLASTX nr result

ID: Mentha26_contig00004915

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00004915
         (555 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus...   147   2e-33
gb|EPS66078.1| hypothetical protein M569_08699 [Genlisea aurea]       103   3e-20
ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283...   100   4e-19
ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283...   100   4e-19
ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257...    99   7e-19
ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citr...    99   9e-19
ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citr...    99   9e-19
ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prun...    95   1e-17
ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218...    94   2e-17
ref|XP_006348038.1| PREDICTED: pre-mRNA-splicing factor 38B-like...    92   9e-17
ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma...    87   3e-15
ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma...    87   3e-15
ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [...    87   3e-15
ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma...    87   3e-15
ref|XP_002522170.1| conserved hypothetical protein [Ricinus comm...    86   8e-15
ref|XP_004252010.1| PREDICTED: uncharacterized protein LOC101247...    84   2e-14
ref|XP_004252009.1| PREDICTED: uncharacterized protein LOC101247...    84   2e-14
ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Popu...    80   4e-13
ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1...    80   4e-13
ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6...    77   3e-12

>gb|EYU18618.1| hypothetical protein MIMGU_mgv1a007462mg [Mimulus guttatus]
          Length = 406

 Score =  147 bits (370), Expect = 2e-33
 Identities = 88/190 (46%), Positives = 113/190 (59%), Gaps = 6/190 (3%)
 Frame = +3

Query: 3   RDKDSYDRAGSGRRHAAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEESKGI 182
           RDK+ Y+RAGSGR                  D++VKTD +K               S G 
Sbjct: 165 RDKEKYERAGSGRG-----------------DQYVKTDRRK---------------SLGD 192

Query: 183 RNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESKEHSDDK 362
           ++DS +R+D SG R+KE +WR+GKEL++E+  N+EKR+ D+R  YKE+ N E+KEHSDDK
Sbjct: 193 QSDSSSRKDTSGHRLKETSWREGKELNAEKYVNDEKRKFDNRSIYKEEGNGEAKEHSDDK 252

Query: 363 GQNY-----KKTKFVR-GSTSPGTDAGTSEPAVTDSDINXXXXXXXXXXELVNKNLVGTG 524
              +     KK KF    S +P TD  + +P VTDSDI+          ELVNKNLVGTG
Sbjct: 253 SIKFTETVTKKPKFSSLDSKAPVTDGTSEQPYVTDSDIDAAKIAAMKAAELVNKNLVGTG 312

Query: 525 YMSTDQKKKL 554
           YMSTDQKKKL
Sbjct: 313 YMSTDQKKKL 322


>gb|EPS66078.1| hypothetical protein M569_08699 [Genlisea aurea]
          Length = 420

 Score =  103 bits (257), Expect = 3e-20
 Identities = 75/192 (39%), Positives = 104/192 (54%), Gaps = 9/192 (4%)
 Frame = +3

Query: 6   DKDSYDRAGSGRRHAAFEER-----DRERNREGKVDRFVKTDIKKXXXXXXXXXXPAY-- 164
           D+DSYDR+ SGRRH+  EER     ++E  R+ + +++ ++D +K          P++  
Sbjct: 159 DRDSYDRSVSGRRHSNVEERGGRVREKEGYRDDRAEKYGRSDHRKASRDHRTDHSPSHIE 218

Query: 165 EESKGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESK 344
           EES+  + D  ++   SG  +KEA+ +DG E  + +  NE+ R D               
Sbjct: 219 EESRTQQKDH-SQIGVSGNNLKEASRKDGHEPGAGKCQNEKTRSD--------------- 262

Query: 345 EHSDDK-GQNYKKTKFVR-GSTSPGTDAGTSEPAVTDSDINXXXXXXXXXXELVNKNLVG 518
           EHSDD      KK+KF    ST P  D  + +  VTDSDI+          ELVNKNLVG
Sbjct: 263 EHSDDVISVRPKKSKFSPLESTGPVKDGTSEQLYVTDSDIDAAKIAAMKAAELVNKNLVG 322

Query: 519 TGYMSTDQKKKL 554
           TGYMSTDQKKKL
Sbjct: 323 TGYMSTDQKKKL 334


>ref|XP_006482076.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X2
           [Citrus sinensis]
          Length = 482

 Score = 99.8 bits (247), Expect = 4e-19
 Identities = 75/214 (35%), Positives = 107/214 (50%), Gaps = 30/214 (14%)
 Frame = +3

Query: 3   RDKDSY-DRAGSGRRH--AAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGR+H  A  EE DR+ ++  +  R  K D ++            Y+ES
Sbjct: 181 KDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRNDRTVTYDES 240

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQ---RNRESK 344
           +G RN S + RD    R+KEA+  D KELD ++ ANEEK++ +D   Y+++      +  
Sbjct: 241 RGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYRDRDRYHRADKP 300

Query: 345 EHSDDKGQN-YKKTKFV---RGSTSPGTDAGTSEPAVTDS-------------------- 452
           + +  K +N  KK +F    +G+ +    AGT   +   S                    
Sbjct: 301 DFASGKQENPTKKQRFSNWDKGADNVKDAAGTMSSSSMQSQDIGDTDALAQSHANDAVAN 360

Query: 453 DINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
           D++          ELVNKNLVG  YMSTDQKKKL
Sbjct: 361 DLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKL 394


>ref|XP_006482075.1| PREDICTED: uncharacterized protein DDB_G0283697-like isoform X1
           [Citrus sinensis]
          Length = 538

 Score = 99.8 bits (247), Expect = 4e-19
 Identities = 75/214 (35%), Positives = 107/214 (50%), Gaps = 30/214 (14%)
 Frame = +3

Query: 3   RDKDSY-DRAGSGRRH--AAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGR+H  A  EE DR+ ++  +  R  K D ++            Y+ES
Sbjct: 181 KDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRNDRTVTYDES 240

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQ---RNRESK 344
           +G RN S + RD    R+KEA+  D KELD ++ ANEEK++ +D   Y+++      +  
Sbjct: 241 RGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETYRDRDRYHRADKP 300

Query: 345 EHSDDKGQN-YKKTKFV---RGSTSPGTDAGTSEPAVTDS-------------------- 452
           + +  K +N  KK +F    +G+ +    AGT   +   S                    
Sbjct: 301 DFASGKQENPTKKQRFSNWDKGADNVKDAAGTMSSSSMQSQDIGDTDALAQSHANDAVAN 360

Query: 453 DINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
           D++          ELVNKNLVG  YMSTDQKKKL
Sbjct: 361 DLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKL 394


>ref|XP_002266542.1| PREDICTED: uncharacterized protein LOC100257160 [Vitis vinifera]
           gi|297739954|emb|CBI30136.3| unnamed protein product
           [Vitis vinifera]
          Length = 510

 Score = 99.0 bits (245), Expect = 7e-19
 Identities = 78/235 (33%), Positives = 111/235 (47%), Gaps = 51/235 (21%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHAAFEERDRERNREGKVDRFVKT--------DIKKXXXXXXXXXX 155
           +DKD S DRAGSGRRH      + E ++ G+ D+ ++         D ++          
Sbjct: 197 KDKDLSSDRAGSGRRHT---NSNFEDSKAGEQDKHLRDGDGPDERKDYRRGLGDYKSDRS 253

Query: 156 PAYEESKGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDD--RGRYKEQR 329
            ++EES+G RNDS + RD+ G R KE +  + KE+D ++   +EK++ D+    R+K++ 
Sbjct: 254 ISHEESRGHRNDSTSGRDSGGYRSKEVHKNEPKEVDGQKQPKDEKKKYDEWKTDRHKDRY 313

Query: 330 NRESKEHSDDK----GQNY----KKTKFV--RGSTSPGTDAGTSEPAVTD---------- 449
           NRES+E  +DK     +N     KK K V    ST  G D      AV D          
Sbjct: 314 NRESREQFEDKTVVASENQESAAKKPKLVSLEKSTDYGKDVSRFSTAVADMKQSSSSKLA 373

Query: 450 --------------------SDINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                               +D+N          ELVN+NLVG GYMS DQKKKL
Sbjct: 374 QDIADKVTPEHAFLNNSEVANDLNAAKIAAMKAAELVNRNLVGVGYMSADQKKKL 428


>ref|XP_006430550.1| hypothetical protein CICLE_v10011438mg [Citrus clementina]
           gi|557532607|gb|ESR43790.1| hypothetical protein
           CICLE_v10011438mg [Citrus clementina]
          Length = 538

 Score = 98.6 bits (244), Expect = 9e-19
 Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 30/214 (14%)
 Frame = +3

Query: 3   RDKDSY-DRAGSGRRH--AAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGR+H  A  EE DR+ ++  +  R  K D ++            Y+ES
Sbjct: 181 KDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRNDRTVTYDES 240

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRR--DDDRGRYKEQRNRESK- 344
           +G RN S + RD    R+KEA+  D KELD ++ ANEEK++  D +  R +++ +R  K 
Sbjct: 241 RGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNRDRDRYHRADKP 300

Query: 345 EHSDDKGQN-YKKTKFV---RGSTSPGTDAGTSEPAVTDS-------------------- 452
           + +  K +N  KK +F    +G+ +    AGT   +   S                    
Sbjct: 301 DFASGKQENPTKKQRFSNWDKGADNVKDAAGTMSSSSMQSQDIGDTDALAQSHANDAVAN 360

Query: 453 DINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
           D++          ELVNKNLVG  YMSTDQKKKL
Sbjct: 361 DLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKL 394


>ref|XP_006430548.1| hypothetical protein CICLE_v10011438mg [Citrus clementina]
           gi|567875919|ref|XP_006430549.1| hypothetical protein
           CICLE_v10011438mg [Citrus clementina]
           gi|557532605|gb|ESR43788.1| hypothetical protein
           CICLE_v10011438mg [Citrus clementina]
           gi|557532606|gb|ESR43789.1| hypothetical protein
           CICLE_v10011438mg [Citrus clementina]
          Length = 482

 Score = 98.6 bits (244), Expect = 9e-19
 Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 30/214 (14%)
 Frame = +3

Query: 3   RDKDSY-DRAGSGRRH--AAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGR+H  A  EE DR+ ++  +  R  K D ++            Y+ES
Sbjct: 181 KDKDSSSDRAGSGRKHTVAYSEELDRDWHKRDRDGRDEKRDYRRSSGDHRNDRTVTYDES 240

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRR--DDDRGRYKEQRNRESK- 344
           +G RN S + RD    R+KEA+  D KELD ++ ANEEK++  D +  R +++ +R  K 
Sbjct: 241 RGHRNYSSSGRDYGSYRLKEAHRSDPKELDGQKLANEEKKKHNDSETNRDRDRYHRADKP 300

Query: 345 EHSDDKGQN-YKKTKFV---RGSTSPGTDAGTSEPAVTDS-------------------- 452
           + +  K +N  KK +F    +G+ +    AGT   +   S                    
Sbjct: 301 DFASGKQENPTKKQRFSNWDKGADNVKDAAGTMSSSSMQSQDIGDTDALAQSHANDAVAN 360

Query: 453 DINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
           D++          ELVNKNLVG  YMSTDQKKKL
Sbjct: 361 DLDAAKVAAMRAAELVNKNLVGGSYMSTDQKKKL 394


>ref|XP_007201961.1| hypothetical protein PRUPE_ppa004686mg [Prunus persica]
           gi|462397492|gb|EMJ03160.1| hypothetical protein
           PRUPE_ppa004686mg [Prunus persica]
          Length = 496

 Score = 94.7 bits (234), Expect = 1e-17
 Identities = 76/229 (33%), Positives = 110/229 (48%), Gaps = 45/229 (19%)
 Frame = +3

Query: 3   RDKDSYD-RAGSGRRHAAFEERDRERNREGKVDRFV---KTDIKKXXXXXXXXXXPAYEE 170
           +DKDS   R GSGRRH  FEE +RER+R   +DR V   K D ++           +YEE
Sbjct: 188 KDKDSSSQRVGSGRRHGHFEEMERERDRHA-LDRDVQDEKKDYRRNSGDYISERIFSYEE 246

Query: 171 SKGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDR-------------- 308
           SKG R+DS++RRD    R+KE    + KELD +  + E++++ DD+              
Sbjct: 247 SKGQRSDSISRRDEGKHRMKEGYKSELKELDDDNVSKEQRKKYDDKETSWGNRITRETSE 306

Query: 309 ----GRYKEQRNRES-----KEHSDDKGQNYKKTKFVRGSTSPGTDAGTSEPAVTD---- 449
                 Y +  N+ES     K  S +KG + +K      +T+ G ++ +S+    D    
Sbjct: 307 RSADKHYIKSENQESTAKRPKLFSSEKGIDGRKDVSKFTTTADGRESSSSKQVQEDEMTT 366

Query: 450 -----------SDINXXXXXXXXXXELVNKNLVG---TGYMSTDQKKKL 554
                      +DIN          ELVN+NL+G    G M+ DQKKKL
Sbjct: 367 EKTQANDAEAANDINAAKVAALKAAELVNRNLIGAGPVGCMTADQKKKL 415


>ref|XP_004144330.1| PREDICTED: uncharacterized protein LOC101218861 [Cucumis sativus]
          Length = 472

 Score = 94.4 bits (233), Expect = 2e-17
 Identities = 68/216 (31%), Positives = 96/216 (44%), Gaps = 32/216 (14%)
 Frame = +3

Query: 3   RDKDSY-DRAGSG-RRHAAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEESK 176
           RD DS  +R GSG RRHA+FEE ++ RN   +  +  K D  K           ++++ +
Sbjct: 169 RDGDSLSERHGSGSRRHASFEEMEKHRNARDRDGQDEKRDNIKHSGDYKNERVLSHDDGR 228

Query: 177 GIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESK---- 344
           G R DSL  RD S  R K+ N  D K+LD E+++ EE++ D     + + + +ESK    
Sbjct: 229 GNRYDSLLGRDESKHRTKDINKNDRKDLDDEKSSKEERKHDARETHWDKVQGKESKGKYD 288

Query: 345 --------------------------EHSDDKGQNYKKTKFVRGSTSPGTDAGTSEPAVT 446
                                      H +D  +N   T             G S  +  
Sbjct: 289 GKGVFVDENQGLPAKKPKLFSSGKEVNHEEDADENQSSTSKKEQDGKMSLGQGQSGDSDF 348

Query: 447 DSDINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
            +D +          ELVNKNLVG GYM+TDQKKKL
Sbjct: 349 AADFSAAKVAAMKAAELVNKNLVGGGYMTTDQKKKL 384


>ref|XP_006348038.1| PREDICTED: pre-mRNA-splicing factor 38B-like [Solanum tuberosum]
          Length = 457

 Score = 92.0 bits (227), Expect = 9e-17
 Identities = 74/208 (35%), Positives = 102/208 (49%), Gaps = 24/208 (11%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHAAFEERDRERNREGKVDRFVK-TDIKKXXXXXXXXXXPAYEESK 176
           +DKD S+DR GSGRR   +     + NR  + DR+ +  D +           PAYEES+
Sbjct: 182 KDKDLSFDRVGSGRR---YNNSSIDDNRSRESDRYKEYRDSRDEKGHRRSDRSPAYEESR 238

Query: 177 GIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESKEHSD 356
             RN+S +R+D   P+V      D  ELD E+   EE++  +DR +  E RN  S   S 
Sbjct: 239 SNRNESNSRKD---PQV------DAMELDGEKYTKEERKNYEDREKIFEDRNVAS---SK 286

Query: 357 DKGQNYKKTKF--VRGSTSPGTDAGTS--------------------EPAVTDSDINXXX 470
            +    KK+KF  +  S++ G DA  +                    E  V DSDI+   
Sbjct: 287 GRVSLSKKSKFSGMDESSAQGKDANAADGQLCSNSKQGQDPNNELSLEQGVKDSDIDAAK 346

Query: 471 XXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                  ELVN+NL+GTG M+TDQKKKL
Sbjct: 347 IAAMKAAELVNRNLIGTGIMTTDQKKKL 374


>ref|XP_007028352.1| Uncharacterized protein isoform 4 [Theobroma cacao]
           gi|508716957|gb|EOY08854.1| Uncharacterized protein
           isoform 4 [Theobroma cacao]
          Length = 462

 Score = 87.0 bits (214), Expect = 3e-15
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 40/224 (17%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAF--EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGRR  +   EE DR+R R G+  R  K D  +           +YEES
Sbjct: 194 KDKDSALDRAGSGRRQGSSFSEEMDRDRRRRGRDSRGEKGDYHRSSGDRKGDYTESYEES 253

Query: 174 KGIRNDSLA--RRDNSGPRVKEANWRDGKELDSERNANEEKRRDD-----DRGRY----K 320
           +G RNDS +   RDN   R KE      KE+D ++ A E  + D+     ++ RY    K
Sbjct: 254 RGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKPAKERMKHDEWETNMEKDRYGGVLK 313

Query: 321 EQ---------RNRES-----KEHSDDKGQNY------KKTKFVRGSTSPGT----DAGT 428
           EQ         +N+ES     K  S  KG  Y      K++   +   + G      A  
Sbjct: 314 EQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSLEQAEETDGRVTMGQAHG 373

Query: 429 SEPAVTDSDINXXXXXXXXXXELVNKNLVGTGY--MSTDQKKKL 554
           ++  +T +DIN          ELVN+NL+G G+  M+T+QKKKL
Sbjct: 374 NDVDIT-NDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKKKL 416


>ref|XP_007028351.1| Uncharacterized protein isoform 3 [Theobroma cacao]
           gi|508716956|gb|EOY08853.1| Uncharacterized protein
           isoform 3 [Theobroma cacao]
          Length = 464

 Score = 87.0 bits (214), Expect = 3e-15
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 40/224 (17%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAF--EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGRR  +   EE DR+R R G+  R  K D  +           +YEES
Sbjct: 194 KDKDSALDRAGSGRRQGSSFSEEMDRDRRRRGRDSRGEKGDYHRSSGDRKGDYTESYEES 253

Query: 174 KGIRNDSLA--RRDNSGPRVKEANWRDGKELDSERNANEEKRRDD-----DRGRY----K 320
           +G RNDS +   RDN   R KE      KE+D ++ A E  + D+     ++ RY    K
Sbjct: 254 RGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKPAKERMKHDEWETNMEKDRYGGVLK 313

Query: 321 EQ---------RNRES-----KEHSDDKGQNY------KKTKFVRGSTSPGT----DAGT 428
           EQ         +N+ES     K  S  KG  Y      K++   +   + G      A  
Sbjct: 314 EQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSLEQAEETDGRVTMGQAHG 373

Query: 429 SEPAVTDSDINXXXXXXXXXXELVNKNLVGTGY--MSTDQKKKL 554
           ++  +T +DIN          ELVN+NL+G G+  M+T+QKKKL
Sbjct: 374 NDVDIT-NDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKKKL 416


>ref|XP_007028350.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508716955|gb|EOY08852.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 473

 Score = 87.0 bits (214), Expect = 3e-15
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 40/224 (17%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAF--EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGRR  +   EE DR+R R G+  R  K D  +           +YEES
Sbjct: 194 KDKDSALDRAGSGRRQGSSFSEEMDRDRRRRGRDSRGEKGDYHRSSGDRKGDYTESYEES 253

Query: 174 KGIRNDSLA--RRDNSGPRVKEANWRDGKELDSERNANEEKRRDD-----DRGRY----K 320
           +G RNDS +   RDN   R KE      KE+D ++ A E  + D+     ++ RY    K
Sbjct: 254 RGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKPAKERMKHDEWETNMEKDRYGGVLK 313

Query: 321 EQ---------RNRES-----KEHSDDKGQNY------KKTKFVRGSTSPGT----DAGT 428
           EQ         +N+ES     K  S  KG  Y      K++   +   + G      A  
Sbjct: 314 EQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSLEQAEETDGRVTMGQAHG 373

Query: 429 SEPAVTDSDINXXXXXXXXXXELVNKNLVGTGY--MSTDQKKKL 554
           ++  +T +DIN          ELVN+NL+G G+  M+T+QKKKL
Sbjct: 374 NDVDIT-NDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKKKL 416


>ref|XP_007028349.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|590634353|ref|XP_007028353.1| Uncharacterized protein
           isoform 1 [Theobroma cacao] gi|508716954|gb|EOY08851.1|
           Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508716958|gb|EOY08855.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 504

 Score = 87.0 bits (214), Expect = 3e-15
 Identities = 78/224 (34%), Positives = 108/224 (48%), Gaps = 40/224 (17%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAF--EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKDS  DRAGSGRR  +   EE DR+R R G+  R  K D  +           +YEES
Sbjct: 194 KDKDSALDRAGSGRRQGSSFSEEMDRDRRRRGRDSRGEKGDYHRSSGDRKGDYTESYEES 253

Query: 174 KGIRNDSLA--RRDNSGPRVKEANWRDGKELDSERNANEEKRRDD-----DRGRY----K 320
           +G RNDS +   RDN   R KE      KE+D ++ A E  + D+     ++ RY    K
Sbjct: 254 RGHRNDSSSGRERDNDKYRRKEGYKSGLKEIDGQKPAKERMKHDEWETNMEKDRYGGVLK 313

Query: 321 EQ---------RNRES-----KEHSDDKGQNY------KKTKFVRGSTSPGT----DAGT 428
           EQ         +N+ES     K  S  KG  Y      K++   +   + G      A  
Sbjct: 314 EQCEEKSIFVGKNQESPAKKLKLFSSSKGNEYDKDADEKRSSLEQAEETDGRVTMGQAHG 373

Query: 429 SEPAVTDSDINXXXXXXXXXXELVNKNLVGTGY--MSTDQKKKL 554
           ++  +T +DIN          ELVN+NL+G G+  M+T+QKKKL
Sbjct: 374 NDVDIT-NDINSAKVAAMKAAELVNRNLIGAGHSNMTTEQKKKL 416


>ref|XP_002522170.1| conserved hypothetical protein [Ricinus communis]
           gi|223538608|gb|EEF40211.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 425

 Score = 85.5 bits (210), Expect = 8e-15
 Identities = 68/214 (31%), Positives = 102/214 (47%), Gaps = 30/214 (14%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHA--AFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           RDKD S DRAGSG +H    +E++DR R+R  +  R  K +  +           +YE+S
Sbjct: 139 RDKDDSPDRAGSGTKHTYTTYEDKDRNRHRWDRDGRDEKRNYHR-----------SYEDS 187

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQ--------- 326
           KG RND  + +DN G   +++   D KEL+ ++   +    D D+ +Y  +         
Sbjct: 188 KGYRNDP-SGKDNDGYHHRDSYKNDQKELNGQKERKKHGDWDTDKDKYNREPQAQNGDKP 246

Query: 327 ----RNRESK-------------EHSDDKGQNYKKTKFVRG-STSPGTDAGTSEPAVTDS 452
                N+ES              +H+ D  +  K+ + V G +T     A  SE A   +
Sbjct: 247 VFGSENQESLAKKPKLFSSDLDVDHNKDANERQKQVQEVDGKATGEQVHASISEAA---N 303

Query: 453 DINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
           D+N          ELVN+NL G G+MST+QKKKL
Sbjct: 304 DLNAAKVAAIRAAELVNRNLAGVGFMSTEQKKKL 337


>ref|XP_004252010.1| PREDICTED: uncharacterized protein LOC101247793 isoform 2 [Solanum
           lycopersicum]
          Length = 461

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 68/208 (32%), Positives = 97/208 (46%), Gaps = 24/208 (11%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHAAF---EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEE 170
           +DKD S+DR GSGRR+ +    + R RE +R  K  R  + +             PAYEE
Sbjct: 181 KDKDLSFDRVGSGRRYNSSSIDDNRSRESDRY-KEYRDSRDEKGNRSSDHKSDRSPAYEE 239

Query: 171 SKGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESKEH 350
           S+  RN+S +R++   P+V      D  ELD ++   EE++  +DR +    RN  S + 
Sbjct: 240 SRSNRNESNSRKE---PQV------DAMELDGKKYTKEERKNYEDREKIFADRNVASSKG 290

Query: 351 SDDKGQNYKKTKFVRGSTSPGTDAGTS--------------------EPAVTDSDINXXX 470
                    K   +  S++ G DA  +                    E  V DSDI+   
Sbjct: 291 RVSSPSKKSKFSGMDESSAQGKDANAADGKFSSNSKQGQDLNGELSLEQGVKDSDIDAAK 350

Query: 471 XXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                  ELVN+NL+GTG M+TDQKKKL
Sbjct: 351 IAAMKAAELVNRNLIGTGIMTTDQKKKL 378


>ref|XP_004252009.1| PREDICTED: uncharacterized protein LOC101247793 isoform 1 [Solanum
           lycopersicum]
          Length = 510

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 68/208 (32%), Positives = 97/208 (46%), Gaps = 24/208 (11%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHAAF---EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEE 170
           +DKD S+DR GSGRR+ +    + R RE +R  K  R  + +             PAYEE
Sbjct: 181 KDKDLSFDRVGSGRRYNSSSIDDNRSRESDRY-KEYRDSRDEKGNRSSDHKSDRSPAYEE 239

Query: 171 SKGIRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDDRGRYKEQRNRESKEH 350
           S+  RN+S +R++   P+V      D  ELD ++   EE++  +DR +    RN  S + 
Sbjct: 240 SRSNRNESNSRKE---PQV------DAMELDGKKYTKEERKNYEDREKIFADRNVASSKG 290

Query: 351 SDDKGQNYKKTKFVRGSTSPGTDAGTS--------------------EPAVTDSDINXXX 470
                    K   +  S++ G DA  +                    E  V DSDI+   
Sbjct: 291 RVSSPSKKSKFSGMDESSAQGKDANAADGKFSSNSKQGQDLNGELSLEQGVKDSDIDAAK 350

Query: 471 XXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                  ELVN+NL+GTG M+TDQKKKL
Sbjct: 351 IAAMKAAELVNRNLIGTGIMTTDQKKKL 378


>ref|XP_002307979.2| hypothetical protein POPTR_0006s03830g [Populus trichocarpa]
           gi|550335404|gb|EEE91502.2| hypothetical protein
           POPTR_0006s03830g [Populus trichocarpa]
          Length = 473

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 69/220 (31%), Positives = 100/220 (45%), Gaps = 36/220 (16%)
 Frame = +3

Query: 3   RDKD-SYDRAGSGRRHAAF--EERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEES 173
           +DKD S DR GSGR++ +   EE+DR+ +R  +  R  K D  +            YE++
Sbjct: 180 KDKDFSPDRVGSGRKYTSIVSEEKDRDWHRRDRDGRDEKRDYHRSSGDHKSDRSSYYEDT 239

Query: 174 KGIRNDSLARRDNSGPRVKEANWRDGKEL----DSERNANEEKRRDDDR-GRYKEQRNRE 338
           +G RNDS  R      R++E+   D KEL    + +++ N E  RD DR  +   ++N +
Sbjct: 240 RGYRNDSSGR-----DRLRESYKNDPKELNGLKEKKKHDNWETSRDKDRYSKAPGEKNDD 294

Query: 339 SKEHSDDKGQN-YKKTKFVRGSTSPG---------------------------TDAGTSE 434
                 +K ++  KK K    S  P                              A TSE
Sbjct: 295 KSAFGSEKPESPAKKPKLFSSSKDPDYSGDVNQKQSSSSMLAQEVDNKVNVGQAHANTSE 354

Query: 435 PAVTDSDINXXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
            A   +D++          ELVNKNLVG G+MST+QKKKL
Sbjct: 355 AA---NDLDAAKVAAMKAAELVNKNLVGVGFMSTEQKKKL 391


>ref|XP_003519025.1| PREDICTED: protein starmaker-like isoform X1 [Glycine max]
          Length = 479

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 60/210 (28%), Positives = 97/210 (46%), Gaps = 26/210 (12%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEESKG 179
           +D DS YD++ S +RHA ++E +RE +      R  + D ++            Y ES+ 
Sbjct: 182 KDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDYRSDQAVCYSESRN 241

Query: 180 IRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDD--RGRYKEQRNRESKEHS 353
            R++S  +RD     +KE    + KE + +    EEKR+ DD   G+ K+ + R++ E  
Sbjct: 242 QRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGKGKDWKTRQASEQC 301

Query: 354 D----------------DKGQNYKKTKFVRGSTSPGTDAGTSEPAVT-------DSDINX 464
                            DK  NY+K    + S+S  +    ++           D+D++ 
Sbjct: 302 GIEDKESSGKKLKLFDLDKDDNYRKDDESKTSSSKLSHESKADVRAAKTSGFDGDNDLDA 361

Query: 465 XXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                    ELVN+NLVG G ++TDQKKKL
Sbjct: 362 AKVAAMRAAELVNRNLVGAGCLTTDQKKKL 391


>ref|XP_006575187.1| PREDICTED: protein starmaker-like isoform X6 [Glycine max]
          Length = 438

 Score = 77.0 bits (188), Expect = 3e-12
 Identities = 61/211 (28%), Positives = 96/211 (45%), Gaps = 27/211 (12%)
 Frame = +3

Query: 3   RDKDS-YDRAGSGRRHAAFEERDRERNREGKVDRFVKTDIKKXXXXXXXXXXPAYEESKG 179
           +D DS YD++ S +RHA ++E +RE +      R  + D ++            Y ES+ 
Sbjct: 182 KDMDSSYDKSASSKRHALYDEVEREGHSRDWDGRNERRDSRRSSGDYRSDQAVCYSESRN 241

Query: 180 IRNDSLARRDNSGPRVKEANWRDGKELDSERNANEEKRRDDD--RGRYKEQRNRESKEHS 353
            R++S  +RD     +KE    + KE + +    EEKR+ DD   G+ K+ + R++ E  
Sbjct: 242 QRDESGPQRDCGKSSLKEGYKSEQKESNDQNLPWEEKRKHDDTETGKGKDWKTRQASEQC 301

Query: 354 D----------------DKGQNYKK----TKFVRGSTSPGTDAGTSEPAVT----DSDIN 461
                            DK  NY+K    +K      S  + A       +    D+D++
Sbjct: 302 GIEDKESSGKKLKLFDLDKDDNYRKDADESKTSSSKLSHESKADVRAAKTSGFDGDNDLD 361

Query: 462 XXXXXXXXXXELVNKNLVGTGYMSTDQKKKL 554
                     ELVN+NLVG G ++TDQKKKL
Sbjct: 362 AAKVAAMRAAELVNRNLVGAGCLTTDQKKKL 392

