BLASTX nr result

ID: Perilla23_contig00002239 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00002239
         (582 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011093021.1| PREDICTED: uncharacterized protein LOC105173...   216   7e-54
ref|XP_011093988.1| PREDICTED: uncharacterized protein LOC105173...   191   3e-46
emb|CDP08657.1| unnamed protein product [Coffea canephora]            176   6e-42
ref|XP_012835590.1| PREDICTED: uncharacterized protein LOC105956...   166   6e-39
ref|XP_009590542.1| PREDICTED: uncharacterized protein LOC104087...   150   4e-34
ref|XP_009775599.1| PREDICTED: uncharacterized protein LOC104225...   149   1e-33
ref|XP_011023158.1| PREDICTED: uncharacterized protein LOC105124...   141   3e-31
ref|XP_002302639.2| hypothetical protein POPTR_0002s17390g [Popu...   141   3e-31
ref|XP_012083184.1| PREDICTED: uncharacterized protein LOC105642...   139   1e-30
ref|XP_006339728.1| PREDICTED: uncharacterized protein LOC102587...   139   1e-30
ref|XP_002523322.1| conserved hypothetical protein [Ricinus comm...   138   2e-30
ref|XP_011038435.1| PREDICTED: uncharacterized protein LOC105135...   138   2e-30
ref|XP_007051518.1| Uncharacterized protein isoform 3, partial [...   137   4e-30
ref|XP_007051517.1| Uncharacterized protein isoform 2, partial [...   137   4e-30
ref|XP_007051516.1| Uncharacterized protein isoform 1 [Theobroma...   137   4e-30
ref|XP_008376151.1| PREDICTED: uncharacterized protein LOC103439...   137   5e-30
ref|XP_007217991.1| hypothetical protein PRUPE_ppa005611mg [Prun...   136   9e-30
gb|KHG18669.1| Alanine--tRNA ligase [Gossypium arboreum]              135   1e-29
ref|XP_008233124.1| PREDICTED: uncharacterized protein LOC103332...   135   1e-29
ref|XP_012083189.1| PREDICTED: uncharacterized protein LOC105642...   134   3e-29
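
The summary lines above can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself, and it assumes each summary line ends with two whitespace-separated fields (bit score, then E-value) after a free-text description. The function name `parse_hit_line` is hypothetical, and the two sample lines are copied from the table above.

```python
def parse_hit_line(line):
    """Split one summary line into (sequence id, description, bits, evalue).

    Assumption: the last two whitespace-separated fields are the bit score
    and the E-value; everything before them is the id plus description.
    """
    head, bits, evalue = line.rstrip().rsplit(None, 2)
    seq_id, _, desc = head.partition(" ")
    # Legacy BLAST can print very small E-values without a leading digit
    # (for example "e-100"); prefix "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, desc.strip(), float(bits), float(evalue)


sample = [
    "ref|XP_011093021.1| PREDICTED: uncharacterized protein LOC105173...   216   7e-54",
    "emb|CDP08657.1| unnamed protein product [Coffea canephora]            176   6e-42",
]
hits = [parse_hit_line(s) for s in sample]
for seq_id, desc, bits, evalue in hits:
    print(seq_id, bits, evalue)
```

Descriptions in this table are truncated with "..." by BLAST, so the full names have to be taken from the ">" header lines of the alignment blocks that follow.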

>ref|XP_011093021.1| PREDICTED: uncharacterized protein LOC105173071 [Sesamum indicum]
           gi|747090650|ref|XP_011093022.1| PREDICTED:
           uncharacterized protein LOC105173071 [Sesamum indicum]
          Length = 478

 Score =  216 bits (550), Expect = 7e-54
 Identities = 119/208 (57%), Positives = 139/208 (66%), Gaps = 15/208 (7%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFG-----GTRDSSPAT-SA 420
           IENCDLP PQN H  K+L  + SC SH +ISTS P+ +  G G     G  DS+P + +A
Sbjct: 226 IENCDLPRPQNTHFTKDLRINASCYSHDQISTS-PLDRKPGIGSHNHCGAHDSTPVSLTA 284

Query: 419 GSSCRNLRMLTEEQLLSSPKYPL---------RDSPTQERNREIHMQKDDAGKAQLLEAL 267
           GSSCR LRM   +QL+S    PL         RDSPT ER  E+    DDA KA+LLEAL
Sbjct: 285 GSSCRQLRMSARDQLVSVVGKPLSVVTCERKIRDSPTNERMPELPGLDDDASKARLLEAL 344

Query: 266 RHSQTRAREAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLL 87
           RHSQTRAREAE VA+Q   EK+HVVKLVFRQA QLFAYKQWV LLQLENMYFQ KN K  
Sbjct: 345 RHSQTRAREAEHVAEQACVEKDHVVKLVFRQASQLFAYKQWVHLLQLENMYFQFKNAKAQ 404

Query: 86  SAATTSPAISPWTVPSNRKMQRGWMKSS 3
           +++   P + PWT    RKM++ W KSS
Sbjct: 405 TSSAVIPVVLPWTPLRTRKMRKSWQKSS 432


>ref|XP_011093988.1| PREDICTED: uncharacterized protein LOC105173804 [Sesamum indicum]
           gi|747045438|ref|XP_011093996.1| PREDICTED:
           uncharacterized protein LOC105173804 [Sesamum indicum]
           gi|747045440|ref|XP_011094003.1| PREDICTED:
           uncharacterized protein LOC105173804 [Sesamum indicum]
          Length = 454

 Score =  191 bits (484), Expect = 3e-46
 Identities = 105/197 (53%), Positives = 127/197 (64%), Gaps = 5/197 (2%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSI--PVRKSSG---FGGTRDSSPATSAG 417
           IENCDLPSPQN  VKK++  +     H  I +S+  P  KS     F        + +A 
Sbjct: 215 IENCDLPSPQNACVKKDMDVNTCSFGHDRIPSSLLDPNLKSGSPHHFSTHLHPPGSLTAE 274

Query: 416 SSCRNLRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREA 237
           S+C+NL ML + QL  S   PLRD PT E+   +HM ++DA KAQL+EAL HSQTRAREA
Sbjct: 275 SACKNLSMLDQAQLAYSTNKPLRDMPTHEK---MHMLENDATKAQLMEALCHSQTRAREA 331

Query: 236 ETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAIS 57
           E  A Q  AEKEHVVKLVF+QA QLFAY+QW+QLLQLENMY   KNNK  S +   P + 
Sbjct: 332 ERAATQACAEKEHVVKLVFKQAYQLFAYRQWLQLLQLENMYLHFKNNKTQSVSIVFPIMF 391

Query: 56  PWTVPSNRKMQRGWMKS 6
           PWT   +RKM + W KS
Sbjct: 392 PWTPRRSRKMLKNWQKS 408


>emb|CDP08657.1| unnamed protein product [Coffea canephora]
          Length = 476

 Score =  176 bits (447), Expect = 6e-42
 Identities = 95/195 (48%), Positives = 117/195 (60%), Gaps = 2/195 (1%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSE--ISTSIPVRKSSGFGGTRDSSPATSAGSSC 408
           IENCDLP PQN  V++    D+ C  H    IS + P               + ++ S C
Sbjct: 237 IENCDLPQPQNTCVEREAFVDLCCFDHDRACISPTDPKHPGCHHDLIVHKQSSIASQSDC 296

Query: 407 RNLRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREAETV 228
           +  R+  +EQL SS    LRDS  Q+          D+ KAQLLEALRHSQTRAREAE  
Sbjct: 297 QKQRLSVDEQLQSSTITSLRDSSNQDAILRSFASDSDSSKAQLLEALRHSQTRAREAEKA 356

Query: 227 AKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAISPWT 48
           AKQ  AEKEHVVKLVFRQA QLFAYKQW QLLQLEN+ +Q+KNNK    +T  P + PW 
Sbjct: 357 AKQAYAEKEHVVKLVFRQASQLFAYKQWFQLLQLENLCYQIKNNKGQPISTLFPVMLPWV 416

Query: 47  VPSNRKMQRGWMKSS 3
               RK+++ W K++
Sbjct: 417 PQKTRKLRKNWQKAA 431


>ref|XP_012835590.1| PREDICTED: uncharacterized protein LOC105956292 [Erythranthe
           guttatus] gi|604334813|gb|EYU38879.1| hypothetical
           protein MIMGU_mgv1a007152mg [Erythranthe guttata]
          Length = 417

 Score =  166 bits (421), Expect = 6e-39
 Identities = 103/199 (51%), Positives = 119/199 (59%), Gaps = 7/199 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLPSPQN H+KK  S +IS        TS+ VRK              ++ +SC  
Sbjct: 197 IENCDLPSPQNTHLKKETSMNIS--------TSLTVRKPG----------IIASRNSCHK 238

Query: 401 LRMLTEEQLLSSPKYPLRDSPTQERNREIHMQK-------DDAGKAQLLEALRHSQTRAR 243
           L M  EEQ +     PLRD    ER  E+H ++       DDA KAQLL+ALR+SQTRAR
Sbjct: 239 LSMSAEEQSIPGAGKPLRDRAALERMPEMHEKEEEEDDGVDDASKAQLLQALRYSQTRAR 298

Query: 242 EAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPA 63
           EAE VAKQ  AEKEHVVKLVFRQA QLFAYKQW+QLLQLENMYFQ  N          P 
Sbjct: 299 EAEEVAKQACAEKEHVVKLVFRQASQLFAYKQWLQLLQLENMYFQSNNKSHHETVVLLPG 358

Query: 62  ISPWTVPSNRKMQRGWMKS 6
            S  T    RKM++G  +S
Sbjct: 359 KSVRT----RKMRKGSNRS 373


>ref|XP_009590542.1| PREDICTED: uncharacterized protein LOC104087700 [Nicotiana
           tomentosiformis] gi|697163439|ref|XP_009590543.1|
           PREDICTED: uncharacterized protein LOC104087700
           [Nicotiana tomentosiformis]
          Length = 474

 Score =  150 bits (379), Expect = 4e-34
 Identities = 96/193 (49%), Positives = 114/193 (59%), Gaps = 6/193 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGS-SCR 405
           IENCDLP PQN  VK++   D+    + + S S   R  S     RDS      G+ S +
Sbjct: 240 IENCDLPQPQNNFVKRDHDVDVDTKIY-DASVSPKSRDMS-----RDSENIHQRGNVSFK 293

Query: 404 NLRMLTEE---QLLSSPKYPLRDSPTQERNR--EIHMQKDDAGKAQLLEALRHSQTRARE 240
               L  E   QL +     LR+S T  +    E++   DD  KAQLLEALRHSQTRARE
Sbjct: 294 RPSQLEAEGHSQLHACKSTSLRNSDTSSQKLLPEMNTSSDDESKAQLLEALRHSQTRARE 353

Query: 239 AETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAI 60
           AE  AKQ  AEKEHVV+LVFRQA QLFAYKQW QLLQLEN+YFQ+KNN+    +   P +
Sbjct: 354 AENAAKQAFAEKEHVVQLVFRQASQLFAYKQWFQLLQLENLYFQIKNNRNQPISAILPVM 413

Query: 59  SPWTVPSNRKMQR 21
            PW VP   K  R
Sbjct: 414 LPW-VPQKTKRPR 425


>ref|XP_009775599.1| PREDICTED: uncharacterized protein LOC104225490 [Nicotiana
           sylvestris] gi|698574195|ref|XP_009775600.1| PREDICTED:
           uncharacterized protein LOC104225490 [Nicotiana
           sylvestris] gi|698574198|ref|XP_009775601.1| PREDICTED:
           uncharacterized protein LOC104225490 [Nicotiana
           sylvestris]
          Length = 474

 Score =  149 bits (375), Expect = 1e-33
 Identities = 91/192 (47%), Positives = 109/192 (56%), Gaps = 5/192 (2%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLP PQN  VK++   D+    ++       V   SG       +       S + 
Sbjct: 240 IENCDLPQPQNNFVKRDHDVDVDTKIYAS-----SVSPKSGDTSRHSENIHQRGNVSFKR 294

Query: 401 LRMLTEE---QLLSSPKYPLRDSPTQERNR--EIHMQKDDAGKAQLLEALRHSQTRAREA 237
              L  E   QL +     LR+S T  +    E++   DD  KAQLLEALRHSQTRAREA
Sbjct: 295 PSQLEAEGNSQLHACKSSSLRNSDTSSQKPVPEMNTSSDDESKAQLLEALRHSQTRAREA 354

Query: 236 ETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAIS 57
           E  AKQ  AEKEHVV+LVFRQA QLFAYKQW QLLQLEN+YFQ+KNN+    +   P + 
Sbjct: 355 ENAAKQAFAEKEHVVQLVFRQASQLFAYKQWFQLLQLENLYFQIKNNRNQPISAILPVML 414

Query: 56  PWTVPSNRKMQR 21
           PW VP   K  R
Sbjct: 415 PW-VPQKTKRPR 425


>ref|XP_011023158.1| PREDICTED: uncharacterized protein LOC105124744 [Populus
           euphratica]
          Length = 429

 Score =  141 bits (355), Expect = 3e-31
 Identities = 89/202 (44%), Positives = 110/202 (54%), Gaps = 9/202 (4%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSE-ISTSIPVRKSSGFGGTRDSSPATSAGSSCR 405
           I NCDLP+PQ VH++K          H   +++S+      G       S AT     C 
Sbjct: 190 IGNCDLPAPQKVHIRKYPCAHSGSFQHDNTLASSLDWEAQIGC-----FSSATGHVQGCP 244

Query: 404 NL-------RMLTEEQLLSSPKYPLRDSPTQERNREI-HMQKDDAGKAQLLEALRHSQTR 249
                    R  TE Q LS        + T +   EI  + + D  KAQLLEALRHSQTR
Sbjct: 245 KSEGMPGKQRGSTEGQSLSGSDKACSYAATIKEAAEIGQISESDPCKAQLLEALRHSQTR 304

Query: 248 AREAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTS 69
           AREAE VAKQ  AEKEH+VKL F+QA QLFAYKQW QLLQLE +Y+Q+KN+     +   
Sbjct: 305 AREAEQVAKQACAEKEHIVKLFFKQASQLFAYKQWFQLLQLETLYYQMKNSD-QPISNLL 363

Query: 68  PAISPWTVPSNRKMQRGWMKSS 3
           P + PW     RK+ + W KSS
Sbjct: 364 PVVLPWIPQKGRKLCKSWQKSS 385


>ref|XP_002302639.2| hypothetical protein POPTR_0002s17390g [Populus trichocarpa]
           gi|550345217|gb|EEE81912.2| hypothetical protein
           POPTR_0002s17390g [Populus trichocarpa]
          Length = 429

 Score =  141 bits (355), Expect = 3e-31
 Identities = 89/202 (44%), Positives = 110/202 (54%), Gaps = 9/202 (4%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSE-ISTSIPVRKSSGFGGTRDSSPATSAGSSCR 405
           I NCDLP PQ VH++K          H   +++S+  +   G       S AT     C 
Sbjct: 190 IGNCDLPPPQKVHIRKYPCAHSGSFQHDNTLASSLDWKAQIGC-----ISSATGHVQGCP 244

Query: 404 NL-------RMLTEEQLLSSPKYPLRDSPTQERNREI-HMQKDDAGKAQLLEALRHSQTR 249
                    R  TE Q LS        + T +   EI  + + D  KAQLLEALRHSQTR
Sbjct: 245 KSEGMPGKQRGSTEGQSLSGSDKACSYAATIKEAAEIGQISESDPCKAQLLEALRHSQTR 304

Query: 248 AREAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTS 69
           AREAE VAKQ  AEKEH+VKL F+QA QLFAYKQW QLLQLE +Y+Q+KN+     +   
Sbjct: 305 AREAEQVAKQACAEKEHIVKLFFKQASQLFAYKQWFQLLQLETLYYQMKNSD-QPISNLF 363

Query: 68  PAISPWTVPSNRKMQRGWMKSS 3
           P + PW     RK+ + W KSS
Sbjct: 364 PVVLPWIPQKGRKLCKSWQKSS 385


>ref|XP_012083184.1| PREDICTED: uncharacterized protein LOC105642829 isoform X1
           [Jatropha curcas] gi|802694482|ref|XP_012083185.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694486|ref|XP_012083186.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694489|ref|XP_012083187.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|802694493|ref|XP_012083188.1|
           PREDICTED: uncharacterized protein LOC105642829 isoform
           X1 [Jatropha curcas] gi|643716844|gb|KDP28470.1|
           hypothetical protein JCGZ_14241 [Jatropha curcas]
          Length = 480

 Score =  139 bits (350), Expect = 1e-30
 Identities = 85/200 (42%), Positives = 115/200 (57%), Gaps = 8/200 (4%)
 Frame = -2

Query: 578 ENCDLPSPQNVHVKKNLSTDISCLSHSE-ISTSIPVRKSSGFGGTRDSSPAT------SA 420
           +NCDLP PQ +HV++  S  +    H + + + +  +  SG+     SSP        S+
Sbjct: 242 QNCDLPPPQKMHVRRYPSVRVGSSDHDDTVPSCLNWKPQSGY----ISSPIVKAHGCPSS 297

Query: 419 GSSCRNLRMLTEEQLLSSPKYPLRDSPTQERNREI-HMQKDDAGKAQLLEALRHSQTRAR 243
            S     R  TE  L S    P   + T +   E   + + D  KAQLLEALRHSQTRAR
Sbjct: 298 ESMHGRHRTSTEGHLQSGSNKPFSYTKTSKDTIEFGQVPECDPCKAQLLEALRHSQTRAR 357

Query: 242 EAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPA 63
           EAE VAKQ   EKEH++KL F+QA QLFAYKQW QLLQLE++Y+Q+KN+     +T  P 
Sbjct: 358 EAEKVAKQACEEKEHIIKLFFKQASQLFAYKQWFQLLQLESLYYQVKNSD-QPVSTLFPV 416

Query: 62  ISPWTVPSNRKMQRGWMKSS 3
             PW     RK+++G  K++
Sbjct: 417 ALPWMPRKGRKLRKGLQKTT 436


>ref|XP_006339728.1| PREDICTED: uncharacterized protein LOC102587530 isoform X1 [Solanum
           tuberosum] gi|565345288|ref|XP_006339729.1| PREDICTED:
           uncharacterized protein LOC102587530 isoform X2 [Solanum
           tuberosum]
          Length = 470

 Score =  139 bits (350), Expect = 1e-30
 Identities = 86/189 (45%), Positives = 109/189 (57%), Gaps = 2/189 (1%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           +ENCDLP PQN  VK++   D+    ++  S++ P   S     T        +      
Sbjct: 240 MENCDLPQPQNNFVKQDRDVDVDSKIYA--SSTGPKAGSMHQQNTNIYKRGNLSFERPSQ 297

Query: 401 LRMLTEEQLLSSPKYPLR--DSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREAETV 228
           L    + QL +     L+  D+P+Q+   E++   DD  KAQLL+ALRHSQTRAREAE  
Sbjct: 298 LDAEGKLQLHTCKSSSLKNSDTPSQKVVPEMNTSGDDESKAQLLKALRHSQTRAREAENA 357

Query: 227 AKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAISPWT 48
           AKQ  AEKEHVV+LVFRQA QLFAYKQW QLLQLEN YFQ+KNNK    +   P +   T
Sbjct: 358 AKQAFAEKEHVVQLVFRQASQLFAYKQWFQLLQLENFYFQIKNNKKQPISAMLPRVPQKT 417

Query: 47  VPSNRKMQR 21
               +K  R
Sbjct: 418 KRPQKKSAR 426


>ref|XP_002523322.1| conserved hypothetical protein [Ricinus communis]
           gi|223537410|gb|EEF39038.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 481

 Score =  138 bits (348), Expect = 2e-30
 Identities = 88/201 (43%), Positives = 111/201 (55%), Gaps = 8/201 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSE-ISTSIPVRKSSGFGGTRDSSPAT------S 423
           I NCDLP PQ +H+++          H + I+ S+  +  SG      SSP        S
Sbjct: 242 IANCDLPPPQKLHLRRYPHGRPGASDHDDSIALSLDGKAQSGC----ISSPLVHAHGCPS 297

Query: 422 AGSSCRNLRMLTEEQLLSSPKYPLRDSPTQERNREI-HMQKDDAGKAQLLEALRHSQTRA 246
           + S     R   E  L S    P     T +   EI  + + D  KAQLLEALRHSQTRA
Sbjct: 298 SESMHGRHRASVEGHLQSGLNKPFSSIATHKEMIEIGQVPEGDPCKAQLLEALRHSQTRA 357

Query: 245 REAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSP 66
           REAE VAKQ  AE+EH++KL FRQA QLFAYKQW  LLQLE++Y+Q+KN      +T  P
Sbjct: 358 REAEKVAKQACAEREHIIKLFFRQASQLFAYKQWFHLLQLESLYYQVKNGG-QPMSTLFP 416

Query: 65  AISPWTVPSNRKMQRGWMKSS 3
              PW     RKM++ W KS+
Sbjct: 417 VALPWMPQKGRKMRKSWQKST 437


>ref|XP_011038435.1| PREDICTED: uncharacterized protein LOC105135311 [Populus
           euphratica]
          Length = 480

 Score =  138 bits (347), Expect = 2e-30
 Identities = 85/202 (42%), Positives = 110/202 (54%), Gaps = 9/202 (4%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEI-STSIPVRKSSGFGGTRDSSPATSAGSSCR 405
           I NCDLP PQ +++ K          H    ++S+  ++ SG       S AT     C 
Sbjct: 241 ITNCDLPPPQKMNIGKYPCARPGSFQHDNTPASSLDWKEQSGC-----ISSATDPVQGCP 295

Query: 404 NLRMLTEEQLLSSPKYPLRDSP-------TQERNREIHM-QKDDAGKAQLLEALRHSQTR 249
               +  +Q  S+ +    DS        T     EI +  + D  KAQLLEALRHSQTR
Sbjct: 296 KFEGMPGKQSASTDRLSQSDSDKACSFTKTDMETAEIGLVSQGDPCKAQLLEALRHSQTR 355

Query: 248 AREAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTS 69
           AREAE VAKQ  AEKEH +KL F+QA QLFAYKQW QLLQLE +Y+Q+KN+     +   
Sbjct: 356 AREAEKVAKQACAEKEHTIKLFFKQASQLFAYKQWFQLLQLETLYYQMKNSD-QPMSNIF 414

Query: 68  PAISPWTVPSNRKMQRGWMKSS 3
           P + PW     RK+++ W KSS
Sbjct: 415 PVVLPWIPRKGRKLRKSWQKSS 436


>ref|XP_007051518.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
           gi|508703779|gb|EOX95675.1| Uncharacterized protein
           isoform 3, partial [Theobroma cacao]
          Length = 366

 Score =  137 bits (345), Expect = 4e-30
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 7/199 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLP PQ +HV+++           E+S S+  +  +G        P     S    
Sbjct: 132 IENCDLPPPQKMHVRRSSHACSGSSDGDEVS-SLAWKSQTG------PIPRPIVNSRAFT 184

Query: 401 LRMLTEEQLLSSP-------KYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAR 243
             + T  +L+SS              S T+E   E  + + D  KAQLLEAL HSQTRAR
Sbjct: 185 DSVRTHGRLMSSVGEGKVQCASDTSFSTTKEDTVE-QVTESDPTKAQLLEALCHSQTRAR 243

Query: 242 EAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPA 63
           EAE  AKQ  AEKEH++KL F+QA QLFAYKQW Q+LQLE +Y Q+KNN+    +T  PA
Sbjct: 244 EAERAAKQAYAEKEHIIKLFFKQASQLFAYKQWFQMLQLEALYVQIKNNE-QPVSTLFPA 302

Query: 62  ISPWTVPSNRKMQRGWMKS 6
           + PWT  ++RK+++ W K+
Sbjct: 303 VLPWTPYNSRKLRKSWQKT 321


>ref|XP_007051517.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508703778|gb|EOX95674.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 324

 Score =  137 bits (345), Expect = 4e-30
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 7/199 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLP PQ +HV+++           E+S S+  +  +G        P     S    
Sbjct: 90  IENCDLPPPQKMHVRRSSHACSGSSDGDEVS-SLAWKSQTG------PIPRPIVNSRAFT 142

Query: 401 LRMLTEEQLLSSP-------KYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAR 243
             + T  +L+SS              S T+E   E  + + D  KAQLLEAL HSQTRAR
Sbjct: 143 DSVRTHGRLMSSVGEGKVQCASDTSFSTTKEDTVE-QVTESDPTKAQLLEALCHSQTRAR 201

Query: 242 EAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPA 63
           EAE  AKQ  AEKEH++KL F+QA QLFAYKQW Q+LQLE +Y Q+KNN+    +T  PA
Sbjct: 202 EAERAAKQAYAEKEHIIKLFFKQASQLFAYKQWFQMLQLEALYVQIKNNE-QPVSTLFPA 260

Query: 62  ISPWTVPSNRKMQRGWMKS 6
           + PWT  ++RK+++ W K+
Sbjct: 261 VLPWTPYNSRKLRKSWQKT 279


>ref|XP_007051516.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508703777|gb|EOX95673.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 396

 Score =  137 bits (345), Expect = 4e-30
 Identities = 85/199 (42%), Positives = 114/199 (57%), Gaps = 7/199 (3%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLP PQ +HV+++           E+S S+  +  +G        P     S    
Sbjct: 162 IENCDLPPPQKMHVRRSSHACSGSSDGDEVS-SLAWKSQTG------PIPRPIVNSRAFT 214

Query: 401 LRMLTEEQLLSSP-------KYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAR 243
             + T  +L+SS              S T+E   E  + + D  KAQLLEAL HSQTRAR
Sbjct: 215 DSVRTHGRLMSSVGEGKVQCASDTSFSTTKEDTVE-QVTESDPTKAQLLEALCHSQTRAR 273

Query: 242 EAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPA 63
           EAE  AKQ  AEKEH++KL F+QA QLFAYKQW Q+LQLE +Y Q+KNN+    +T  PA
Sbjct: 274 EAERAAKQAYAEKEHIIKLFFKQASQLFAYKQWFQMLQLEALYVQIKNNE-QPVSTLFPA 332

Query: 62  ISPWTVPSNRKMQRGWMKS 6
           + PWT  ++RK+++ W K+
Sbjct: 333 VLPWTPYNSRKLRKSWQKT 351


>ref|XP_008376151.1| PREDICTED: uncharacterized protein LOC103439371 [Malus domestica]
           gi|657968877|ref|XP_008376152.1| PREDICTED:
           uncharacterized protein LOC103439371 [Malus domestica]
          Length = 450

 Score =  137 bits (344), Expect = 5e-30
 Identities = 82/206 (39%), Positives = 104/206 (50%), Gaps = 13/206 (6%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           +ENCDLP PQ  + K++   DI C                       S P    G+S   
Sbjct: 223 MENCDLPPPQKTYHKRHPYADIGC-----------------------SDPNVILGTSLDA 259

Query: 401 LRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKD-------------DAGKAQLLEALRH 261
               T    +++P +   DS   E + E H  K              +  KAQL+EAL H
Sbjct: 260 KAQATSLSSMTTPAHGYPDSGRGEMSGEGHSDKSFRDITEIQQLSEGEPTKAQLMEALCH 319

Query: 260 SQTRAREAETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSA 81
           SQTRAREAE  AKQ  AEKEH+ KL F QA QLFAYKQW QLLQLE++Y Q+KNN+   +
Sbjct: 320 SQTRAREAEKAAKQAYAEKEHIFKLFFTQASQLFAYKQWFQLLQLESLYLQIKNNEQPPS 379

Query: 80  ATTSPAISPWTVPSNRKMQRGWMKSS 3
           AT  P   PW     +K++R W K +
Sbjct: 380 ATVFPEGLPWMPAKGKKLRRNWRKGA 405


>ref|XP_007217991.1| hypothetical protein PRUPE_ppa005611mg [Prunus persica]
           gi|462414453|gb|EMJ19190.1| hypothetical protein
           PRUPE_ppa005611mg [Prunus persica]
          Length = 451

 Score =  136 bits (342), Expect = 9e-30
 Identities = 81/192 (42%), Positives = 110/192 (57%), Gaps = 1/192 (0%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEI-STSIPVRKSSGFGGTRDSSPATSAGSSCR 405
           +ENCDLP PQ ++ K++   DI C  H+ I  TS+  +  +G  G  D     ++ + C 
Sbjct: 220 VENCDLPPPQKMYHKRHPYADIGCSDHNVILGTSLDGKAQTG--GLSD----LTSHARCY 273

Query: 404 NLRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREAETVA 225
           +   +T E+  ++ +    D    +      + + +  KAQL+EAL HSQTRAREAE  A
Sbjct: 274 SDPGITHERKGNAAEEGHSDKSFWDVTETQQLSEGEPTKAQLMEALCHSQTRAREAEMAA 333

Query: 224 KQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAISPWTV 45
           KQ  AEKEH+ KL FRQA QLFAYKQW QLLQLE +  Q+KNN    +A   P + PW  
Sbjct: 334 KQAYAEKEHIFKLFFRQASQLFAYKQWFQLLQLETICIQIKNNDQPGSAVV-PVVLPWMP 392

Query: 44  PSNRKMQRGWMK 9
              RK +R W K
Sbjct: 393 FKGRKPRRNWRK 404


>gb|KHG18669.1| Alanine--tRNA ligase [Gossypium arboreum]
          Length = 436

 Score =  135 bits (341), Expect = 1e-29
 Identities = 78/194 (40%), Positives = 108/194 (55%), Gaps = 2/194 (1%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEISTSIPVRKSSGFGGTRDSSPATSAGSSCRN 402
           IENCDLP PQ VHV+             E+S+     ++        +    S GS  ++
Sbjct: 205 IENCDLPPPQKVHVRGYSHVCSGSFDGGEVSSLAWKSRTVAIRSPMVNHAQMSPGSVRKH 264

Query: 401 LRMLTE--EQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREAETV 228
            R ++   E  +      L  + T E++    + + D  KAQLLEAL HSQTRAREAE  
Sbjct: 265 GRQMSSVGEGKMQCASNSLSSTSTTEKDMLEQVTESDPTKAQLLEALCHSQTRAREAEKA 324

Query: 227 AKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAISPWT 48
           A++   EKEHV+KL+F+QA QLFAYKQW QLLQLE  Y Q+KNN+        P + PWT
Sbjct: 325 AQKAYEEKEHVIKLLFKQASQLFAYKQWFQLLQLEPFYHQIKNNE-------QPVVFPWT 377

Query: 47  VPSNRKMQRGWMKS 6
              ++K ++ W+K+
Sbjct: 378 PYKSQKFRKSWLKT 391


>ref|XP_008233124.1| PREDICTED: uncharacterized protein LOC103332188 [Prunus mume]
          Length = 451

 Score =  135 bits (341), Expect = 1e-29
 Identities = 81/192 (42%), Positives = 110/192 (57%), Gaps = 1/192 (0%)
 Frame = -2

Query: 581 IENCDLPSPQNVHVKKNLSTDISCLSHSEI-STSIPVRKSSGFGGTRDSSPATSAGSSCR 405
           IENCDLP PQ ++  ++   DI C  H+ I  TS+  +  +G  G  D     ++ + C 
Sbjct: 220 IENCDLPPPQKMYHNRHPYADIGCSDHNVILGTSLDGKAQTG--GLSD----LTSHARCY 273

Query: 404 NLRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRAREAETVA 225
           +   +T E+  ++ +    D   ++      + + +  KAQL+EAL HSQTRAREAE  A
Sbjct: 274 SDPAITHERKGNAAEEGHSDKSFRDVTEIQQLSEGEPTKAQLMEALCHSQTRAREAEKAA 333

Query: 224 KQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAISPWTV 45
           KQ  AEKEH+ KL FRQA QLFAYKQW QLLQLE +  Q+KNN    +A   P + PW  
Sbjct: 334 KQAYAEKEHIFKLFFRQATQLFAYKQWFQLLQLETLCIQIKNNDQPGSAVV-PVVLPWMP 392

Query: 44  PSNRKMQRGWMK 9
              RK +R W K
Sbjct: 393 FKGRKPRRNWRK 404


>ref|XP_012083189.1| PREDICTED: uncharacterized protein LOC105642829 isoform X2
           [Jatropha curcas]
          Length = 475

 Score =  134 bits (338), Expect = 3e-29
 Identities = 83/199 (41%), Positives = 112/199 (56%), Gaps = 7/199 (3%)
 Frame = -2

Query: 578 ENCDLPSPQNVHVKKNLSTDISCLSHSE-ISTSIPVRKSSGFGGTRDSSPAT------SA 420
           +NCDLP PQ +HV++  S  +    H + + + +  +  SG+     SSP        S+
Sbjct: 242 QNCDLPPPQKMHVRRYPSVRVGSSDHDDTVPSCLNWKPQSGY----ISSPIVKAHGCPSS 297

Query: 419 GSSCRNLRMLTEEQLLSSPKYPLRDSPTQERNREIHMQKDDAGKAQLLEALRHSQTRARE 240
            S     R  TE  L S    P      +       + + D  KAQLLEALRHSQTRARE
Sbjct: 298 ESMHGRHRTSTEGHLQSGSNKPFSKDTIEFGQ----VPECDPCKAQLLEALRHSQTRARE 353

Query: 239 AETVAKQVSAEKEHVVKLVFRQAQQLFAYKQWVQLLQLENMYFQLKNNKLLSAATTSPAI 60
           AE VAKQ   EKEH++KL F+QA QLFAYKQW QLLQLE++Y+Q+KN+     +T  P  
Sbjct: 354 AEKVAKQACEEKEHIIKLFFKQASQLFAYKQWFQLLQLESLYYQVKNSD-QPVSTLFPVA 412

Query: 59  SPWTVPSNRKMQRGWMKSS 3
            PW     RK+++G  K++
Sbjct: 413 LPWMPRKGRKLRKGLQKTT 431

