BLASTX nr result
ID: Mentha23_contig00040235
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00040235
         (544 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                211   1e-52
ref|XP_004154269.1| PREDICTED: uncharacterized protein LOC101204...  206   3e-51
ref|XP_004154025.1| PREDICTED: uncharacterized protein LOC101211...  204   1e-50
ref|XP_007028311.1| Uncharacterized protein TCM_024023 [Theobrom...  203   2e-50
ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [The...  201   1e-49
ref|XP_007022406.1| Uncharacterized protein TCM_032736 [Theobrom...  201   1e-49
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...  200   2e-49
emb|CAN69016.1| hypothetical protein VITISV_016361 [Vitis vinifera]  199   3e-49
ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [The...  199   4e-49
ref|XP_007020066.1| Uncharacterized protein TCM_036439 [Theobrom...  198   8e-49
ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass,...  198   8e-49
ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [The...  198   8e-49
ref|XP_007023768.1| Uncharacterized protein TCM_027940 [Theobrom...  197   1e-48
ref|XP_007010873.1| Uncharacterized protein TCM_044868 [Theobrom...  197   1e-48
ref|XP_007014875.1| Uncharacterized protein TCM_040446 [Theobrom...  196   3e-48
ref|XP_007028011.1| Uncharacterized protein TCM_023017 [Theobrom...  196   3e-48
ref|XP_007036990.1| Retrotransposon protein, putative [Theobroma...  195   5e-48
ref|XP_007022529.1| Uncharacterized protein TCM_033221 [Theobrom...  195   5e-48
ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobrom...
194 9e-48 ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The... 194 1e-47 >gb|AAO45752.1| pol protein [Cucumis melo subsp. melo] Length = 923 Score = 211 bits (536), Expect = 1e-52 Identities = 102/168 (60%), Positives = 128/168 (76%) Frame = +3 Query: 12 DIIDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCPRFI 191 + I KI R+ AQ RQKSYAD R +L F+VGDKVFLKV+P KGV+RF ++GKL PRF+ Sbjct: 753 EAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMKGVLRFERRGKLSPRFV 812 Query: 192 GPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLSYEE 371 GPFEILER GPVAY+LALPP+L+ VH+VFHVS LRK + DP H++ Y+ L +D++LSY E Sbjct: 813 GPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVE 872 Query: 372 GPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 P +L R ++ LR+KQIPLVKV W H EEATWE E M+ YP+L Sbjct: 873 QPVEVLARGVKTLRNKQIPLVKVLWRNHRVEEATWEREDDMRSRYPEL 920 >ref|XP_004154269.1| PREDICTED: uncharacterized protein LOC101204584 [Cucumis sativus] Length = 207 Score = 206 bits (524), Expect = 3e-51 Identities = 101/168 (60%), Positives = 127/168 (75%) Frame = +3 Query: 12 DIIDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCPRFI 191 + I KI R++ AQ RQKSYAD R +L F VG+KVFLKV+P KGV+RF KKGKL PRF+ Sbjct: 37 EAIQKIRARMQTAQSRQKSYADVRRKDLKFDVGEKVFLKVAPMKGVMRFEKKGKLSPRFV 96 Query: 192 GPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLSYEE 371 GPFEILER G VAY+LALPP+L+ VHNVFHVS LRK + D H++ Y+ L +D+ LSY E Sbjct: 97 GPFEILERVGVVAYRLALPPSLSTVHNVFHVSMLRKYVADTSHVVDYEPLAIDEHLSYVE 156 Query: 372 GPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 P IL R++++LR++ IPLVKV W H EEATWE E +M+ YP+L Sbjct: 157 QPVEILAREVKMLRNRSIPLVKVLWRNHRIEEATWEREEEMRTRYPEL 204 >ref|XP_004154025.1| PREDICTED: uncharacterized protein LOC101211634 [Cucumis sativus] Length = 207 Score = 204 bits (519), Expect = 1e-50 Identities = 100/168 (59%), Positives = 126/168 (75%) Frame = +3 Query: 12 DIIDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCPRFI 191 + I KI ++ AQ RQKSYADE R +L F VGD +FLKV+P KGV+RF KKGKL PRF+ Sbjct: 37 
EAIQKIRALMQTAQSRQKSYADERRKDLKFDVGDMIFLKVAPMKGVMRFEKKGKLSPRFV 96 Query: 192 GPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLSYEE 371 GPFEILER G VAY+LALPP+L+ VHNVFHVS LRK + D H++ Y+ L +D+ LSY E Sbjct: 97 GPFEILERVGVVAYRLALPPSLSAVHNVFHVSMLRKYVADTSHVVDYEPLEIDEHLSYVE 156 Query: 372 GPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 P IL R++++LR++ IPLVKV W H EEATWE E +M+ YP+L Sbjct: 157 QPAEILAREVKMLRNRSIPLVKVLWRNHRIEEATWEREEEMRTRYPEL 204 >ref|XP_007028311.1| Uncharacterized protein TCM_024023 [Theobroma cacao] gi|508716916|gb|EOY08813.1| Uncharacterized protein TCM_024023 [Theobroma cacao] Length = 207 Score = 203 bits (517), Expect = 2e-50 Identities = 102/171 (59%), Positives = 125/171 (73%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 34 DATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSP 93 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 94 RYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLT 153 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + LVKV W H EE TWE E +M+ +P L Sbjct: 154 YEEQPVAILDRQVKKLRSKDVALVKVLWRNHTSEEVTWEAEDEMRTKHPHL 204 >ref|XP_007032400.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508711429|gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 201 bits (511), Expect = 1e-49 Identities = 101/171 (59%), Positives = 124/171 (72%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 1274 DATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSP 1333 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 1334 
RYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLT 1393 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 1394 YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 1444 >ref|XP_007022406.1| Uncharacterized protein TCM_032736 [Theobroma cacao] gi|508722034|gb|EOY13931.1| Uncharacterized protein TCM_032736 [Theobroma cacao] Length = 293 Score = 201 bits (510), Expect = 1e-49 Identities = 101/174 (58%), Positives = 126/174 (72%), Gaps = 3/174 (1%) Frame = +3 Query: 3 EMIDIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGK 173 E D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGK Sbjct: 117 EREDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGK 176 Query: 174 LCPRFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQ 353 L PR+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+I+Y+ + L Sbjct: 177 LSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIQYETIQLQN 236 Query: 354 DLSYEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 DL+YEE P IL R+++ LR K + VKV W H EE TWE+E +M+ +P L Sbjct: 237 DLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEVEDEMRTKHPHL 290 >ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508779195|gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 200 bits (509), Expect = 2e-49 Identities = 100/171 (58%), Positives = 123/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLK SP+KGV+RF KKGKL P Sbjct: 506 DATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKFSPTKGVMRFGKKGKLSP 565 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPF+ILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DLS Sbjct: 566 RYIGPFKILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNLDPSHVIRYETIQLQDDLS 625 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 626 
YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 676 >emb|CAN69016.1| hypothetical protein VITISV_016361 [Vitis vinifera] Length = 1043 Score = 199 bits (507), Expect = 3e-49 Identities = 100/171 (58%), Positives = 123/171 (71%) Frame = +3 Query: 3 EMIDIIDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 E D I I R+ AQ RQKSYAD R L FQ+GD VFL+V+P KGV RF K+GKL P Sbjct: 867 ETTDKIRVIRDRLLAAQSRQKSYADHRRRPLEFQIGDHVFLRVTPRKGVFRFGKRGKLAP 926 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R++GPFEIL++ G VAYKLALPP L+ +H+VFHVS LRK D H++ + DL L +D++ Sbjct: 927 RYVGPFEILQKIGEVAYKLALPPQLSGIHDVFHVSMLRKYEPDTTHVLDWQDLNLQEDVT 986 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEEGP+ IL +K +VLR K IPLVKV W HG E ATWELES M+ Y +L Sbjct: 987 YEEGPRQILDKKEKVLRTKIIPLVKVSWDHHGVEGATWELESDMRNKYXEL 1037 >ref|XP_007037177.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508774422|gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 199 bits (506), Expect = 4e-49 Identities = 100/171 (58%), Positives = 123/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R L FQVGD VFLKVSP+KG++RF KKGKL P Sbjct: 275 DATEKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFLKVSPTKGIMRFGKKGKLSP 334 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 335 RYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLT 394 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 395 YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 445 >ref|XP_007020066.1| Uncharacterized protein TCM_036439 [Theobroma cacao] gi|508725394|gb|EOY17291.1| Uncharacterized protein TCM_036439 [Theobroma cacao] Length = 240 Score = 198 bits (503), Expect = 8e-49 Identities = 100/171 (58%), Positives = 123/171 
(71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 67 DATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSP 126 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+I PFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 127 RYIRPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQNDLT 186 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 187 YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 237 >ref|XP_007028165.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] gi|508716770|gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 198 bits (503), Expect = 8e-49 Identities = 99/171 (57%), Positives = 123/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ R KSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 348 DATEKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSP 407 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEIL++ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 408 RYIGPFEILDKVGTVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLT 467 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 468 YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 518 >ref|XP_007049973.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702234|gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1401 Score = 198 bits (503), Expect = 8e-49 Identities = 99/171 (57%), Positives = 124/171 (72%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH Q++ AQ R+KSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 
1228 DATEKIHMIRQKMLTAQSREKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLNP 1287 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + DL+ Sbjct: 1288 RYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQSQNDLT 1347 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + LVKV W H EE TWE E +M+ +P L Sbjct: 1348 YEEQPVAILDRQVKKLRSKDVALVKVLWRNHTSEEVTWEAEDEMRTKHPHL 1398 >ref|XP_007023768.1| Uncharacterized protein TCM_027940 [Theobroma cacao] gi|508779134|gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao] Length = 1052 Score = 197 bits (502), Expect = 1e-48 Identities = 97/166 (58%), Positives = 121/166 (72%) Frame = +3 Query: 18 IDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCPRFIGP 197 I I Q++ AQ RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL PR+IGP Sbjct: 884 IRMIRQKMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGP 943 Query: 198 FEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLSYEEGP 377 FEIL++ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+YEE P Sbjct: 944 FEILDKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLTYEEQP 1003 Query: 378 KCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 1004 VAILDRQVKKLRSKDVASVKVLWQNHTSEEVTWEAEDEMRTKHPHL 1049 >ref|XP_007010873.1| Uncharacterized protein TCM_044868 [Theobroma cacao] gi|508727786|gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao] Length = 403 Score = 197 bits (502), Expect = 1e-48 Identities = 99/171 (57%), Positives = 123/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYAD R +L FQVGD VFLKV P+KGV+RF KKGKL P Sbjct: 230 DATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVLPTKGVMRFGKKGKLSP 289 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEIL++ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 290 
RYIGPFEILDKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQDDLT 349 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 350 YEEQPVAILDRQVKKLRSKDVASVKVLWWNHTSEEVTWEAEDEMRTKHPHL 400 >ref|XP_007014875.1| Uncharacterized protein TCM_040446 [Theobroma cacao] gi|508785238|gb|EOY32494.1| Uncharacterized protein TCM_040446 [Theobroma cacao] Length = 210 Score = 196 bits (498), Expect = 3e-48 Identities = 98/175 (56%), Positives = 127/175 (72%), Gaps = 4/175 (2%) Frame = +3 Query: 3 EMIDI----IDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKG 170 E+ID+ + I +R+K AQDRQK+Y+D+ R +L F+V DKVFLKVSP KGV+RF K+G Sbjct: 30 ELIDLTNDKVKVIRERLKTAQDRQKNYSDKRRKDLEFEVDDKVFLKVSPWKGVIRFAKRG 89 Query: 171 KLCPRFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLD 350 KL PR+IGPF I+ER GPVAY+L LPP L+ +HNVFHVS L+K + DP HI+ + L Sbjct: 90 KLNPRYIGPFRIIERIGPVAYRLELPPELDRIHNVFHVSMLKKYVPDPSHILETPPIELH 149 Query: 351 QDLSYEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 +DL +E P IL RK +VLR+K IP+VKV W EE TWE+ES+M+ YP L Sbjct: 150 EDLKFEVQPVRILDRKDRVLRNKSIPMVKVLWKNARMEEMTWEVESQMRNQYPHL 204 >ref|XP_007028011.1| Uncharacterized protein TCM_023017 [Theobroma cacao] gi|508716616|gb|EOY08513.1| Uncharacterized protein TCM_023017 [Theobroma cacao] Length = 243 Score = 196 bits (498), Expect = 3e-48 Identities = 98/175 (56%), Positives = 127/175 (72%), Gaps = 4/175 (2%) Frame = +3 Query: 3 EMIDI----IDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKG 170 E+ID+ + I +R+K AQDRQK+Y+D+ R +L F+V DKVFLKVSP KGV+RF K+G Sbjct: 63 ELIDLTNDKVKVIRERLKTAQDRQKNYSDKRRKDLEFEVDDKVFLKVSPWKGVIRFAKRG 122 Query: 171 KLCPRFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLD 350 KL PR+IGPF I+ER GPVAY+L LPP L+ +HNVFHVS L+K + DP HI+ + L Sbjct: 123 KLNPRYIGPFRIIERIGPVAYRLELPPELDRIHNVFHVSMLKKYVPDPSHILETPPIELH 182 Query: 351 QDLSYEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 +DL +E P IL RK +VLR+K IP+VKV W EE TWE+ES+M+ YP L Sbjct: 183 
EDLKFEVQPVRILDRKDRVLRNKSIPMVKVLWKNARMEEMTWEVESQMRNQYPHL 237 >ref|XP_007036990.1| Retrotransposon protein, putative [Theobroma cacao] gi|508774235|gb|EOY21491.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1145 Score = 195 bits (496), Expect = 5e-48 Identities = 97/166 (58%), Positives = 121/166 (72%) Frame = +3 Query: 18 IDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCPRFIGP 197 I I QR+ AQ RQKSY D R +L FQVGD VFLKVSP+KG++RF KKGKL P++IGP Sbjct: 977 IRMIRQRMLIAQSRQKSYVDNRRRDLEFQVGDHVFLKVSPTKGIMRFGKKGKLSPQYIGP 1036 Query: 198 FEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLSYEEGP 377 FEILER G VAY+LALPP+L+++H VFHVS LRK DP H+I Y+ + L+ DL+YE+ P Sbjct: 1037 FEILERVGAVAYRLALPPDLSNIHPVFHVSILRKYNSDPSHVIWYETIQLNNDLTYEKQP 1096 Query: 378 KCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 IL R+++ L K+I LVKV W H EE TWE E +M+ YP L Sbjct: 1097 VAILDRQVKKLHSKEIALVKVLWRNHTSEEVTWEAEEEMRTKYPHL 1142 >ref|XP_007022529.1| Uncharacterized protein TCM_033221 [Theobroma cacao] gi|508722157|gb|EOY14054.1| Uncharacterized protein TCM_033221 [Theobroma cacao] Length = 207 Score = 195 bits (496), Expect = 5e-48 Identities = 99/171 (57%), Positives = 122/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH QR+ AQ RQKSYA R +L FQVGD VFLKVSP+KGV+RF KK KL P Sbjct: 34 DATEKIHMIRQRMLTAQSRQKSYAHNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKRKLSP 93 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+IGPFEILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 94 RYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQNDLT 153 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 154 YEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAEDEMRTKHPHL 204 >ref|XP_007099735.1| Uncharacterized protein TCM_045699 [Theobroma cacao] gi|508728383|gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] Length = 415 Score = 194 bits (494), 
Expect = 9e-48 Identities = 97/171 (56%), Positives = 122/171 (71%), Gaps = 3/171 (1%) Frame = +3 Query: 12 DIIDKIH---QRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKGKLCP 182 D +KIH Q++ Q RQKSYAD R +L FQVGD VFLKVSP+KGV+RF KKGKL P Sbjct: 242 DATEKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSP 301 Query: 183 RFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLDQDLS 362 R+I PF+ILE+ G VAY+LALPP+L+++H VFHVS LRK DP H+IRY+ + L DL+ Sbjct: 302 RYIRPFDILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQLQNDLT 361 Query: 363 YEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 YEE P IL R+++ LR K + VKV W H EE TWE E +M+ +P L Sbjct: 362 YEEQPVAILDRQVKKLRSKDVASVKVLWQNHTSEEVTWEAEDEMRTKHPHL 412 >ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 194 bits (493), Expect = 1e-47 Identities = 97/175 (55%), Positives = 126/175 (72%), Gaps = 4/175 (2%) Frame = +3 Query: 3 EMIDI----IDKIHQRIKQAQDRQKSYADEHRTELSFQVGDKVFLKVSPSKGVVRFIKKG 170 E+ID+ + I +R+K AQDRQK+Y+D+ R +L F+V DKVFLKVSP KGV+RF K+G Sbjct: 1343 ELIDLTNDKVKVIRERLKTAQDRQKNYSDKRRKDLEFEVDDKVFLKVSPWKGVIRFAKRG 1402 Query: 171 KLCPRFIGPFEILERNGPVAYKLALPPNLNDVHNVFHVSQLRKCMFDPKHIIRYDDLVLD 350 KL PR+IGPF I+ER GPVAY+L LPP L+ +HN FHVS L+K + DP HI+ + L Sbjct: 1403 KLNPRYIGPFHIIERIGPVAYRLELPPELDRIHNAFHVSMLKKYVPDPSHILETPPIELH 1462 Query: 351 QDLSYEEGPKCILYRKLQVLRDKQIPLVKVKWMRHGKEEATWELESKMKELYPDL 515 +DL +E P IL RK +VLR+K IP+VKV W EE TWE+ES+M+ YP L Sbjct: 1463 EDLKFEVQPIRILDRKDRVLRNKSIPMVKVLWKNARMEEMTWEVESQMRNQYPHL 1517