BLASTX nr result
ID: Cinnamomum23_contig00034097
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00034097
         (492 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   220   2e-55
ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342...   215   1e-53
ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobrom...   213   5e-53
gb|AIG55302.1| gag-pol, partial [Camellia sinensis]                   211   1e-52
ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800...   210   3e-52
ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [The...   209   4e-52
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...   209   4e-52
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...   209   4e-52
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...   209   7e-52
ref|XP_007027952.1| DNA/RNA polymerases superfamily protein [The...   208   1e-51
ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] g...   208   1e-51
ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950...   208   1e-51
ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417...   208   1e-51
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...   208   1e-51
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...   207   2e-51
ref|XP_007099710.1| Retrotransposon protein, Ty3-gypsy subclass,...   207   2e-51
ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [The...   207   2e-51
ref|XP_007010873.1| Uncharacterized protein TCM_044868 [Theobrom...   207   3e-51
ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [The...
206 5e-51 ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prun... 206 5e-51 >ref|XP_008812481.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103723366 [Phoenix dactylifera] Length = 1246 Score = 220 bits (561), Expect = 2e-55 Identities = 103/164 (62%), Positives = 126/164 (76%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +L+ FWWN MKREI FV+RC+ CQ +KAEHQ+PAGLL+PLEI +WK EH+TMDF Sbjct: 889 YTDLREHFWWNGMKREIAGFVARCLVCQQVKAEHQRPAGLLEPLEIPEWKWEHITMDFVI 948 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T + N+A+WVI RLTKSA F+P + G + LAQ +I++IV HG P+SI SD Sbjct: 949 GLPRTVRRNDAVWVIVDRLTKSAHFLPFRVGTSLD--KLAQRYIDDIVRLHGAPVSIVSD 1006 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RFVS FW++ +MGT L+ STAYHPQTDGQSERTIQTLED Sbjct: 1007 RDPRFVSGFWRSFQTAMGTDLRLSTAYHPQTDGQSERTIQTLED 1050 >ref|XP_008244885.1| PREDICTED: uncharacterized protein LOC103342989 [Prunus mume] Length = 1162 Score = 215 bits (547), Expect = 1e-53 Identities = 98/164 (59%), Positives = 130/164 (79%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK FWWN MKR+I FV++C+TCQ +KAEHQKP+G LQPL +++WK +H+TMDF + Sbjct: 593 YLDLKRNFWWNGMKRDIEKFVAKCLTCQQVKAEHQKPSGSLQPLPVAEWKWDHITMDFVT 652 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP + KG +AIWVI RLTKSA F+P+KT + + +L + ++ EIV HG P+SI SD Sbjct: 653 GLPRSPKGRDAIWVIVDRLTKSAHFLPVKTTESTE--NLGKLYVREIVRLHGIPVSIVSD 710 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RDS+F S+FW +L +++GT+L FSTA+HPQTDGQSERTIQ LED Sbjct: 711 RDSKFTSKFWGSLQKALGTQLNFSTAFHPQTDGQSERTIQILED 754 >ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobroma cacao] gi|508722241|gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao] Length = 809 Score = 213 bits (541), Expect = 5e-53 Identities = 101/164 (61%), Positives = 124/164 (75%) Frame = -1 Query: 492 
YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +K +WW MKR+I FV++C+TCQ IKAEHQKP+G LQPL I +WK EH+TMDF Sbjct: 624 YRTIKESYWWPGMKRDIAEFVAKCLTCQQIKAEHQKPSGTLQPLLIPEWKWEHVTMDFVL 683 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T G +AIWVI RLTKSA F+ + + ++ LA+ +I+EIV HG P+SI SD Sbjct: 684 GLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIE--RLARLYIDEIVRLHGVPVSIVSD 741 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF SRFW HE++GTKL+FSTA+HPQTDGQSERTIQTLED Sbjct: 742 RDPRFTSRFWPKFHEALGTKLRFSTAFHPQTDGQSERTIQTLED 785 >gb|AIG55302.1| gag-pol, partial [Camellia sinensis] Length = 923 Score = 211 bits (538), Expect = 1e-52 Identities = 97/164 (59%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +L +FWW MKR++ FVS+C+TCQ +KAEHQ+PAGLLQPL I++WK EH+TMDF Sbjct: 521 YQDLGRQFWWRGMKRDVAVFVSKCLTCQQVKAEHQRPAGLLQPLPIAEWKWEHITMDFVV 580 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T +G++AIWV+ RLTKSA FIP++ M LA +I ++V HG P++I SD Sbjct: 581 GLPRTQRGSDAIWVVVDRLTKSAHFIPMRVRDSMD--HLADLYIRDVVRLHGVPVTIVSD 638 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD F +R W++L ++GTKL FSTAYHPQTDGQSERTIQ LED Sbjct: 639 RDPCFTARLWQSLQSALGTKLTFSTAYHPQTDGQSERTIQILED 682 >ref|XP_012487705.1| PREDICTED: uncharacterized protein LOC105800880, partial [Gossypium raimondii] Length = 1085 Score = 210 bits (535), Expect = 3e-52 Identities = 102/164 (62%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW MKREI +V+RC+ CQ +KAEHQ P GLLQP+ I +WK EH+TMDF S Sbjct: 671 YCDLKKMYWWPGMKREICEYVARCLICQQVKAEHQVPTGLLQPIMIPEWKWEHVTMDFVS 730 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T K ++IWVI RLTKSA FIP++T Q+ LA+ +++EIV HG PISI SD Sbjct: 731 GLPVTPKKKDSIWVIVDRLTKSAHFIPVRT--DYQLEKLAELYVSEIVRLHGVPISIISD 788 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD 
RF SRFW L E++GTKL FSTA+HPQTDGQSER IQ LED Sbjct: 789 RDPRFTSRFWSKLQEALGTKLNFSTAFHPQTDGQSERVIQILED 832 >ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508774422|gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 209 bits (533), Expect = 4e-52 Identities = 97/164 (59%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 47 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 106 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 107 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 164 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTIQTLED Sbjct: 165 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLED 208 >ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508774222|gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 209 bits (533), Expect = 4e-52 Identities = 97/164 (59%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 655 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 714 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 715 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 772 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTIQTLED Sbjct: 773 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLED 816 >ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] gi|508716770|gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma 
cacao] Length = 521 Score = 209 bits (533), Expect = 4e-52 Identities = 97/164 (59%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 120 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 179 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 180 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 237 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTIQTLED Sbjct: 238 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLED 281 >ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 209 bits (531), Expect = 7e-52 Identities = 99/164 (60%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +K +WW M+R+I FV++C+TCQ IKAEHQKP+G LQPL I +WK EH+TMDF Sbjct: 1119 YRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLSIPEWKWEHVTMDFVL 1178 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T G +AIWVI RLTKSA F+ + + ++ LA+ +I+EIV HG P+SI SD Sbjct: 1179 GLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIE--RLARLYIDEIVRLHGVPVSIVSD 1236 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF SRFW E++GTKL+FSTA+HPQTDGQSERTIQTLED Sbjct: 1237 RDLRFTSRFWPKFQEALGTKLRFSTAFHPQTDGQSERTIQTLED 1280 >ref|XP_007027952.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716557|gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1400 Score = 208 bits (530), Expect = 1e-51 Identities = 96/164 (58%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 1049 
YQDLKEVYWWEELKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 1108 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV QHG PISI D Sbjct: 1109 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRQHGIPISIVFD 1166 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F RFW L E++GTKL FSTA+HPQTDGQSERTIQTLED Sbjct: 1167 RGAQFTGRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLED 1210 >ref|XP_007049935.1| Gag protease polyprotein [Theobroma cacao] gi|508702196|gb|EOX94092.1| Gag protease polyprotein [Theobroma cacao] Length = 269 Score = 208 bits (530), Expect = 1e-51 Identities = 97/164 (59%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +K +WW MKR++ FV++C+ CQ +KAEHQ+PAG LQ L + +WK EH+TMDF Sbjct: 19 YRTIKENYWWPGMKRDVAEFVAKCLVCQQVKAEHQRPAGTLQSLPVPEWKWEHVTMDFVL 78 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T +GN+AIWVI RLTKSA F+ + + ++ LAQ +I+EIV HG P+SI SD Sbjct: 79 GLPRTQRGNDAIWVIVDRLTKSAHFLAVHSTYSIE--KLAQLYIDEIVRLHGVPVSIVSD 136 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF SRFW E++GTKL+FSTA+HPQTDGQSERTIQTLED Sbjct: 137 RDPRFTSRFWLKFQEALGTKLKFSTAFHPQTDGQSERTIQTLED 180 >ref|XP_012829796.1| PREDICTED: uncharacterized protein LOC105950954 [Erythranthe guttatus] Length = 1316 Score = 208 bits (529), Expect = 1e-51 Identities = 100/164 (60%), Positives = 122/164 (74%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW MK++I +VS C+ CQ IK EHQ+P GLLQ I +WK E +TMDF Sbjct: 843 YQDLKKLYWWPGMKKDIAKYVSECLICQQIKTEHQRPGGLLQSNHIPEWKWESVTMDFVQ 902 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 G P T KG+++IWVI RLTKSA F+P+KT ++ LA+ +I EIV HG PISI SD Sbjct: 903 GFPKTLKGSDSIWVIVDRLTKSAHFLPVKTTFSLE--KLAELYIGEIVRLHGVPISIISD 960 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF S+FWK LHE+MGT+L FSTAYHPQTDGQSERTI+TLED Sbjct: 961 
RDPRFTSKFWKRLHEAMGTRLSFSTAYHPQTDGQSERTIKTLED 1004 >ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417177 [Eucalyptus grandis] Length = 1753 Score = 208 bits (529), Expect = 1e-51 Identities = 93/164 (56%), Positives = 126/164 (76%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y NL+ +WW MK +I V++C+TCQ +KA+H KP GLL+PLEI +WK EH+TMDF + Sbjct: 1094 YQNLRQHYWWCGMKADIAKHVAKCLTCQQVKAQHCKPGGLLRPLEIPEWKWEHITMDFVT 1153 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP + +GN++IWV+ RLTKSA FI ++ + + + LA ++ ++V HG P++ITSD Sbjct: 1154 GLPRSQRGNDSIWVVVDRLTKSAHFIAVR--RDLSLDRLADLYVRQVVRMHGVPVTITSD 1211 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF + FWK+L ++GTKLQ+STAYHPQTDGQSERTIQTLED Sbjct: 1212 RDPRFTAAFWKSLQSALGTKLQYSTAYHPQTDGQSERTIQTLED 1255 >ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao] gi|508727367|gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 208 bits (529), Expect = 1e-51 Identities = 96/164 (58%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 489 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 548 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 549 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 606 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTI+TLED Sbjct: 607 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIKTLED 650 >ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508779195|gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 207 bits (528), Expect = 2e-51 Identities = 96/164 (58%), Positives = 124/164 (75%) Frame = -1 Query: 492 
YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 278 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 337 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ +LTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 338 GLPRTSGGYDSIWIVVDQLTKSAHFLPVKT--TYGAAHYARVYVDEIVRLHGIPISIVSD 395 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTIQTLED Sbjct: 396 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLED 439 >ref|XP_007099710.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] gi|508728428|gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 460 Score = 207 bits (527), Expect = 2e-51 Identities = 96/164 (58%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 241 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 300 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 301 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 358 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL F TA+HPQTDGQSERTIQTLED Sbjct: 359 RGAQFTSRFWGKLQEALGTKLDFITAFHPQTDGQSERTIQTLED 402 >ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508711429|gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 207 bits (527), Expect = 2e-51 Identities = 96/163 (58%), Positives = 123/163 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 1046 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 1105 Query: 312 
GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 1106 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 1163 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLE 4 R ++F SRFW L E++GTKL FSTA+HPQTDGQSERTIQTLE Sbjct: 1164 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLE 1206 >ref|XP_007010873.1| Uncharacterized protein TCM_044868 [Theobroma cacao] gi|508727786|gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao] Length = 403 Score = 207 bits (526), Expect = 3e-51 Identities = 96/164 (58%), Positives = 123/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +LK +WW +KR++ FVS+C+ CQ +KAEHQKPAGLLQPL + +WK EH+ MDF + Sbjct: 2 YQDLKEVYWWEGLKRDVAEFVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVT 61 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP TS G ++IW++ RLTKSA F+P+KT A+ +++EIV HG PISI SD Sbjct: 62 GLPRTSGGYDSIWIVVDRLTKSAHFLPVKT--TYGAAQYARVYVDEIVRLHGIPISIVSD 119 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 R ++F SRFW L E++GTKL FSTA+HPQT GQSERTIQTLED Sbjct: 120 RGAQFTSRFWGKLQEALGTKLDFSTAFHPQTGGQSERTIQTLED 163 >ref|XP_007049837.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702098|gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 811 Score = 206 bits (524), Expect = 5e-51 Identities = 98/164 (59%), Positives = 122/164 (74%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y +K +WW MKR+I FV++C+TCQ IKAEHQK +G LQPL I +WK EH+TMDF Sbjct: 520 YRTIKESYWWPGMKRDIAKFVAKCLTCQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVL 579 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 GLP T G +AIWVI RLTKSA F+ + + ++ LA+ +I+E+V HG PISI SD Sbjct: 580 GLPRTQSGKDAIWVIVDRLTKSAHFLAIHSTYSIE--RLARLYIDEVVRLHGVPISIVSD 637 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD RF SRFW E++GTKL+FST++HPQTDGQSERTIQTLED Sbjct: 
638 RDPRFTSRFWPKFQEALGTKLRFSTSFHPQTDGQSERTIQTLED 681 >ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] gi|462395665|gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] Length = 1493 Score = 206 bits (524), Expect = 5e-51 Identities = 99/164 (60%), Positives = 124/164 (75%) Frame = -1 Query: 492 YHNLKGEFWWNNMKREITAFVSRCMTCQLIKAEHQKPAGLLQPLEISQWK*EHLTMDFSS 313 Y L+ + W +MK +I +VSRC+ CQ +KAE QKP+GL+QPL I +WK E +TMDF Sbjct: 1092 YRTLREYYSWPHMKGDIAKYVSRCLICQQVKAERQKPSGLMQPLPIPEWKWERITMDFVF 1151 Query: 312 GLPTTSKGNNAIWVIGYRLTKSARFIPLKTGKKMQMLSLAQTFINEIVSQHGQPISITSD 133 LP TSKG++ IWVI RLTKS F+P+K + + LA+ F++EIV HG P+SI SD Sbjct: 1152 KLPRTSKGHDGIWVIVDRLTKSTHFLPIK--ETYSLTKLAKLFVDEIVRLHGAPVSIVSD 1209 Query: 132 RDSRFVSRFWKTLHESMGTKLQFSTAYHPQTDGQSERTIQTLED 1 RD+RF SRFWK L E+MGT+LQFSTA+HPQTDGQSERTIQTLED Sbjct: 1210 RDARFTSRFWKCLQEAMGTRLQFSTAFHPQTDGQSERTIQTLED 1253