BLASTX nr result
ID: Perilla23_contig00024459
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00024459 (817 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom... 283 1e-73 ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobrom... 283 1e-73 ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977... 282 2e-73 ref|XP_012857061.1| PREDICTED: uncharacterized protein LOC105976... 282 2e-73 ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949... 281 5e-73 ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969... 281 5e-73 ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964... 281 5e-73 ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974... 280 7e-73 ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969... 280 7e-73 ref|XP_012850129.1| PREDICTED: uncharacterized protein LOC105969... 278 3e-72 ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobrom... 278 3e-72 ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972... 278 3e-72 ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964... 278 3e-72 ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom... 278 3e-72 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 276 2e-71 ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966... 275 3e-71 ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167... 273 8e-71 ref|XP_007010390.1| Retrotransposon, unclassified-like protein [... 273 1e-70 ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom... 272 2e-70 ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom... 265 2e-68 >ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao] gi|508710339|gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao] Length = 1954 Score = 283 bits (724), Expect = 1e-73 Identities = 137/237 (57%), Positives = 170/237 (71%) Frame = -3 Query: 713 LCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRS 534 LC APS E VF I KDSV+ PDGFSSLF+Q CWDI+ D++ AVLDFF+G+P+P+ Sbjct: 1013 LCAAPSLKEIKEVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGTPMPQG 1072 Query: 533 FTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQG 354 T+TT++LLPK+P+ W+DFRPISLC V NKI++K + +NQSGFV G Sbjct: 1073 VTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKTLANRLSKILPSIISENQSGFVNG 1132 Query: 353 RLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGF 174 RLISDNILLAQEL+ L NV LKLDMAKAYDR+ W FL ++ + GF D+WI Sbjct: 1133 RLISDNILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGFNDRWISM 1192 Query: 173 IQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKLI*SH 3 I+ C+ NCWFS+LINGS G+F S RGLRQGD +SP LFVLAAD LSRG+++L H Sbjct: 1193 IKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFVLAADYLSRGINQLFNRH 1249 >ref|XP_007052625.1| Uncharacterized protein TCM_005953 [Theobroma cacao] gi|508704886|gb|EOX96782.1| Uncharacterized protein TCM_005953 [Theobroma cacao] Length = 1659 Score = 283 bits (723), Expect = 1e-73 Identities = 137/237 (57%), Positives = 170/237 (71%) Frame = -3 Query: 713 LCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRS 534 LC APS E VF I KDSV PDGFSSLF+Q CWDI+ D++ AVLDFF+G+P+P+ Sbjct: 910 LCAAPSLKEINEVVFNIDKDSVVGPDGFSSLFYQHCWDIIKQDLLEAVLDFFNGAPMPQG 969 Query: 533 FTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQG 354 T+TT++LLPK+P+ W+DFRPISLC V NKI++K+ + +NQSGFV G Sbjct: 970 VTSTTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKMLANRLSKILPSIISENQSGFVNG 1029 Query: 353 RLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGF 174 RLISDNILLAQELI L NV LKLDMAKAYDR+ W FL ++ + GF D+WI Sbjct: 1030 RLISDNILLAQELIGKLDAKARGGNVVLKLDMAKAYDRLNWDFLYLMMKQFGFNDRWISM 1089 Query: 173 IQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKLI*SH 3 I+ C+ NCWFS+LINGS G+F S RGLRQGD +SP LF+LAAD LSRG+++L H Sbjct: 1090 IKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAADYLSRGINQLFSHH 1146 >ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977287 [Erythranthe guttatus] Length = 1237 Score = 282 bits (722), Expect = 2e-73 Identities = 139/231 (60%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 272 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 331 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 332 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 391 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 392 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 451 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 452 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 502 >ref|XP_012857061.1| PREDICTED: uncharacterized protein LOC105976337 [Erythranthe guttatus] Length = 1169 Score = 282 bits (721), Expect = 2e-73 Identities = 139/231 (60%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 316 TRPSVEEIKDAVFGICQDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 375 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 376 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 435 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 436 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 495 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 496 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 546 >ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949732 [Erythranthe guttatus] Length = 1237 Score = 281 bits (718), Expect = 5e-73 Identities = 138/231 (59%), Positives = 167/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 272 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 331 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 332 ATTLVLIPKGDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 391 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 392 ISDNILLAQEIVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 451 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLA + SRG+D L Sbjct: 452 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLATEYFSRGLDAL 502 >ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969825 [Erythranthe guttatus] Length = 1331 Score = 281 bits (718), Expect = 5e-73 Identities = 139/231 (60%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 366 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 425 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 426 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 485 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 486 ISDNILLAQEMVHLISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 545 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 546 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 596 >ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964144 [Erythranthe guttatus] Length = 1237 Score = 281 bits (718), Expect = 5e-73 Identities = 138/231 (59%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 272 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 331 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 332 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 391 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL +GF + I+ Sbjct: 392 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELIGFSANLVDIIR 451 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 452 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 502 >ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974867 [Erythranthe guttatus] Length = 1393 Score = 280 bits (717), Expect = 7e-73 Identities = 138/231 (59%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 428 TRPSVEEIKDAVFGICQDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 487 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 488 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 547 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL +GF + I+ Sbjct: 548 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELIGFSANLVDIIR 607 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 608 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 658 >ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969824 [Erythranthe guttatus] Length = 1805 Score = 280 bits (717), Expect = 7e-73 Identities = 138/231 (59%), Positives = 168/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 840 TRPSVEEIKDAVFGICQDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 899 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 900 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 959 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL +GF + I+ Sbjct: 960 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELIGFSANLVDIIR 1019 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 1020 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 1070 >ref|XP_012850129.1| PREDICTED: uncharacterized protein LOC105969901 [Erythranthe guttatus] Length = 1153 Score = 278 bits (712), Expect = 3e-72 Identities = 138/231 (59%), Positives = 167/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 343 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 402 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISL NVTNKII+K+ + +QSGFVQGRL Sbjct: 403 ATTLVLIPKVDFPTAWTDFRPISLSNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 462 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 463 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 522 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 523 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 573 >ref|XP_007040948.1| Uncharacterized protein TCM_016755 [Theobroma cacao] gi|508778193|gb|EOY25449.1| Uncharacterized protein TCM_016755 [Theobroma cacao] Length = 1245 Score = 278 bits (712), Expect = 3e-72 Identities = 134/234 (57%), Positives = 168/234 (71%) Frame = -3 Query: 704 APSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFTA 525 APS E VF I KDSV+ PDGFSSLF+Q CWDI+ D++ AVLDFF G+P+PR T+ Sbjct: 765 APSLKEIKDVVFNIDKDSVAGPDGFSSLFYQHCWDIIKQDLLEAVLDFFKGTPMPRGVTS 824 Query: 524 TTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRLI 345 TT++LLPK+P+ W+DFRPISLC V NKI++K+ + +NQSGFV GRLI Sbjct: 825 TTLVLLPKKPNSCQWSDFRPISLCTVLNKIVTKLLANRLSKFLPSIISENQSGFVNGRLI 884 Query: 344 SDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQR 165 SDNILLAQEL+ L NV LKLDMAKAYDR+ W FL ++ + GF D+WI I+ Sbjct: 885 SDNILLAQELVGKLDAKARGGNVVLKLDMAKAYDRLSWDFLYLMMEQFGFNDRWISMIKA 944 Query: 164 CVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKLI*SH 3 C+ NCWFS+LINGS G+F S RGLRQGD +SP LF+LAA+ LSRG+++L H Sbjct: 945 CISNCWFSLLINGSLVGYFKSERGLRQGDSISPLLFILAAEYLSRGINQLFSDH 998 >ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972756 [Erythranthe guttatus] Length = 1285 Score = 278 bits (711), Expect = 3e-72 Identities = 137/231 (59%), Positives = 167/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 355 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 414 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 415 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 474 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL +GF + I+ Sbjct: 475 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELIGFSANLVDIIR 534 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+ FVLAA+ SRG+D L Sbjct: 535 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTPFVLAAEYFSRGLDAL 585 >ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964855 [Erythranthe guttatus] Length = 1237 Score = 278 bits (711), Expect = 3e-72 Identities = 137/231 (59%), Positives = 167/231 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 272 TRPSVEEIKDAVFGICRDSALGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 331 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 332 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 391 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL +GF + I+ Sbjct: 392 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELIGFSANLVDIIR 451 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAA+ SRG+D L Sbjct: 452 RCVSSCQFSLLINGELMGYFTSSRGLRQGDPLSPTLFVLAAEYFSRGLDAL 502 >ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao] gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao] Length = 2127 Score = 278 bits (711), Expect = 3e-72 Identities = 138/239 (57%), Positives = 168/239 (70%) Frame = -3 Query: 731 SVDR*QLCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDG 552 S D LC AP E AVF I KDSV+ PDGFSSLF+Q CWDI+ D++ AVLDFF G Sbjct: 1181 SADNEFLCAAPPLQEIKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKNDLLDAVLDFFRG 1240 Query: 551 SPLPRSFTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQ 372 SPLPR T+TT++LLPK+P+ W+++RPISLC V NKI++K+ + +NQ Sbjct: 1241 SPLPRGVTSTTLVLLPKKPNACHWSEYRPISLCTVLNKIVTKLLANRLSKILPSIISENQ 1300 Query: 371 SGFVQGRLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFP 192 SGFV GRLISDNILLAQELI + NV LKLDMAKAYDR+ W FL ++ GF Sbjct: 1301 SGFVNGRLISDNILLAQELIGKIDAKSRGGNVVLKLDMAKAYDRLNWDFLYLMMEHFGFN 1360 Query: 191 DKWIGFIQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 WI I+ C+ NCWFS+LINGS AG+F S RGLRQGD +SP LF+LAAD LSRG++ L Sbjct: 1361 AHWINMIKSCISNCWFSLLINGSLAGYFKSERGLRQGDSISPMLFILAADYLSRGLNHL 1419 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 276 bits (705), Expect = 2e-71 Identities = 134/233 (57%), Positives = 165/233 (70%) Frame = -3 Query: 713 LCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRS 534 LC PS E AVF I KDSV+ PDGFSSLF+Q CWDI+ D+ AVLDFF GSPLPR Sbjct: 1274 LCATPSLQEVKEAVFNINKDSVAGPDGFSSLFYQHCWDIIKQDLFEAVLDFFKGSPLPRG 1333 Query: 533 FTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQG 354 T+TT++LLPK + W++FRPISLC V NKI++K+ + +NQSGFV G Sbjct: 1334 ITSTTLVLLPKTQNVSQWSEFRPISLCTVLNKIVTKLLANRLSKILPSIISENQSGFVNG 1393 Query: 353 RLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGF 174 RLISDNILLAQEL+ + NV LKLDMAKAYDR+ W FL ++ + GF WI Sbjct: 1394 RLISDNILLAQELVDKINARSRGGNVVLKLDMAKAYDRLNWEFLYLMMEQFGFNALWINM 1453 Query: 173 IQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 I+ C+ NCWFS+LINGS G+F S RGLRQGD +SPSLF+LAA+ LSRG+++L Sbjct: 1454 IKACISNCWFSLLINGSLVGYFKSERGLRQGDSISPSLFILAAEYLSRGLNQL 1506 >ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966658 [Erythranthe guttatus] Length = 1233 Score = 275 bits (703), Expect = 3e-71 Identities = 135/222 (60%), Positives = 162/222 (72%) Frame = -3 Query: 707 TAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRSFT 528 T PS +E AVFGIC+DS S PDG+SSLF+Q CWD++ DV AV DFF+G +P SFT Sbjct: 358 TRPSVEEIKDAVFGICRDSASGPDGYSSLFYQHCWDLIQCDVCEAVWDFFEGGSMPASFT 417 Query: 527 ATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQGRL 348 ATT++L+PK P +W DFRPISLCNVTNKII+K+ + +QSGFVQGRL Sbjct: 418 ATTLVLIPKVDFPTAWTDFRPISLCNVTNKIITKVLTNRLAPHLPHIISPSQSGFVQGRL 477 Query: 347 ISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGFIQ 168 ISDNILLAQE++H + PN+ LKLDMAKAYDRV+W FL VL MGF + I+ Sbjct: 478 ISDNILLAQEMVHSISVRCRNPNLILKLDMAKAYDRVQWRFLFRVLELMGFSANLVDIIR 537 Query: 167 RCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAAD 42 RCV +C FS+LING G+F SSRGLRQGDPLSP+LFVLAAD Sbjct: 538 RCVSSCQFSLLINGELTGYFTSSRGLRQGDPLSPTLFVLAAD 579 >ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167219 [Sesamum indicum] Length = 1203 Score = 273 bits (699), Expect = 8e-71 Identities = 134/251 (53%), Positives = 172/251 (68%), Gaps = 3/251 (1%) Frame = -3 Query: 758 LDDMQNLPESV---DR*QLCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAP 588 +DD+ +P + DR QL P+ ++ + +F +C S + PDGFS+ FFQ CW+I+ Sbjct: 372 VDDLHWVPNILSEEDRHQLNATPTIEDVKTIIFDMCPHSTAGPDGFSAHFFQCCWEIIGQ 431 Query: 587 DVVAAVLDFFDGSPLPRSFTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXX 408 D+ AVLDF GS P++FT TTI+L+PK P +W DFRPISLCNVT KI+SK+ Sbjct: 432 DLYGAVLDFLSGSTPPKNFTTTTIVLIPKIEAPSTWKDFRPISLCNVTGKILSKVINNQM 491 Query: 407 XXXXXXXLMQNQSGFVQGRLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWP 228 + +QS FVQGR+ISDNILLAQEL H L K N K+DM KAYDRV W Sbjct: 492 AKLLPKIISPSQSSFVQGRMISDNILLAQELSHCLGKNGSLSNTIFKIDMEKAYDRVNWT 551 Query: 227 FLLAVLHRMGFPDKWIGFIQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLA 48 FL +L R+GFP WI I++ ++NCWFS+LING GFF S+RGLRQGDPLSP+LFV+A Sbjct: 552 FLYHMLMRVGFPTHWINMIKKLIENCWFSILINGEGVGFFKSTRGLRQGDPLSPTLFVIA 611 Query: 47 ADCLSRGVDKL 15 A+CLSRG+D L Sbjct: 612 AECLSRGLDWL 622 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 273 bits (698), Expect = 1e-70 Identities = 133/233 (57%), Positives = 163/233 (69%) Frame = -3 Query: 713 LCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAPDVVAAVLDFFDGSPLPRS 534 LC P E AVF I KDSV PDGFSS F+Q CW I+A D++AAV DFF G+ PR Sbjct: 394 LCAEPQLQEVKDAVFAIDKDSVVGPDGFSSFFYQQCWPIIAEDLLAAVRDFFKGAVFPRG 453 Query: 533 FTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXXXXXXXXXLMQNQSGFVQG 354 T+TT++LL K+P +W+DFRPISLC + NKI++K+ + +NQSGFV G Sbjct: 454 VTSTTLVLLAKKPDAATWSDFRPISLCTILNKIVTKLLANRLSKVLPSLISENQSGFVSG 513 Query: 353 RLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWPFLLAVLHRMGFPDKWIGF 174 RLI+DNILLAQELI + NV LKLDM KAYDR+ W FL+ VL R GF D WI Sbjct: 514 RLINDNILLAQELIGKIDYKARGGNVVLKLDMMKAYDRLNWDFLILVLERFGFNDMWIDM 573 Query: 173 IQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLAADCLSRGVDKL 15 I+RC+ NCWFSVLING AG+F S RGLRQGD +SP LF+LAA+ LSRG+++L Sbjct: 574 IRRCITNCWFSVLINGHSAGYFKSERGLRQGDSISPMLFILAAEYLSRGINEL 626 >ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao] gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao] Length = 2367 Score = 272 bits (696), Expect = 2e-70 Identities = 136/251 (54%), Positives = 172/251 (68%) Frame = -3 Query: 767 PSILDDMQNLPESVDR*QLCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAP 588 PSI+ + +N LC P+ E AVFGI +S + PDGFSS F+Q CW+I+A Sbjct: 1469 PSIISNSENE-------LLCAEPNLQEVKDAVFGIDPESAAGPDGFSSYFYQQCWNIIAH 1521 Query: 587 DVVAAVLDFFDGSPLPRSFTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXX 408 D++ AV DFF G+ +PR T+TT+ILLPK+P W+DFRPISLC V NKII+K+ Sbjct: 1522 DLLDAVRDFFHGANIPRGVTSTTLILLPKKPSASKWSDFRPISLCTVMNKIITKLLSNRL 1581 Query: 407 XXXXXXXLMQNQSGFVQGRLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWP 228 + +NQSGFV GRLISDNILLAQELI L N+ALKLDM KAYDR+ W Sbjct: 1582 AKILPSIITENQSGFVGGRLISDNILLAQELIGKLNTKSRGGNLALKLDMMKAYDRLDWS 1641 Query: 227 FLLAVLHRMGFPDKWIGFIQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLA 48 FL+ VL GF D+WIG IQ+C+ NCWFS+L+NG G+F RGLRQGDP+SP LF++A Sbjct: 1642 FLIKVLQHFGFNDQWIGMIQKCISNCWFSLLLNGRTEGYFKFERGLRQGDPISPQLFLIA 1701 Query: 47 ADCLSRGVDKL 15 A+ LSRG++ L Sbjct: 1702 AEYLSRGLNAL 1712 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 265 bits (678), Expect = 2e-68 Identities = 133/251 (52%), Positives = 168/251 (66%) Frame = -3 Query: 767 PSILDDMQNLPESVDR*QLCTAPSFDES*SAVFGICKDSVSEPDGFSSLFFQSCWDIVAP 588 PSI+ D D LC P+ E AVFGI +S + PDGFSS F+Q CWDI+A Sbjct: 1262 PSIISD-------TDNGFLCAEPTLQEVKEAVFGIDPESAAGPDGFSSHFYQQCWDIIAH 1314 Query: 587 DVVAAVLDFFDGSPLPRSFTATTIILLPKQPHPQSWADFRPISLCNVTNKIISKIXXXXX 408 D+ AV +FF G+ +P+ T+TT++L+PK W++FRPISLC V NKII+KI Sbjct: 1315 DLFEAVKEFFHGADIPQGMTSTTLVLIPKTTSASKWSEFRPISLCTVMNKIITKILANRL 1374 Query: 407 XXXXXXXLMQNQSGFVQGRLISDNILLAQELIHDLPKARPTPNVALKLDMAKAYDRVRWP 228 + +NQSGFV GRLISDNILLAQELI L + NVALKLDM KAYDR+ W Sbjct: 1375 AKILPSIITENQSGFVGGRLISDNILLAQELIGKLDQKNRGGNVALKLDMMKAYDRLDWS 1434 Query: 227 FLLAVLHRMGFPDKWIGFIQRCVQNCWFSVLINGSPAGFFHSSRGLRQGDPLSPSLFVLA 48 FL VL +GF +WIG IQ+C+ NCWFS+L+NG G+F S RGLRQGD +SP LF+LA Sbjct: 1435 FLFKVLQHLGFNAQWIGMIQKCISNCWFSLLLNGRTVGYFKSERGLRQGDSISPQLFILA 1494 Query: 47 ADCLSRGVDKL 15 A+ L+RG++ L Sbjct: 1495 AEYLARGLNAL 1505