BLASTX nr result
ID: Atropa21_contig00042782
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00042782
         (830 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]   249   6e-68
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]     245   1e-66
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   213   3e-57
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   213   3e-57
gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobrom...   217   3e-56
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   207   5e-55
gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]    206   5e-55
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           207   1e-54
gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobrom...   213   2e-54
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]   209   2e-54
gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobrom...   210   3e-54
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   209   7e-54
gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobrom...   208   9e-54
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   208   9e-54
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...   208   9e-54
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...   208   9e-54
ref|XP_004243173.1| PREDICTED: uncharacterized protein LOC101250...   216   1e-53
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]   206   2e-53
emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]   207   5e-53
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...   206   5e-53

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum] Length = 1475 Score = 249 bits (637), Expect(2) = 6e-68 Identities = 135/234 (57%), Positives = 161/234 (68%) Frame = -3 Query: 801 EAILNSEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMK 622 +A L + VLR R+C+ RV D I +I + H S YSI+ G TKMYR LRQHY W M+ Sbjct: 1164 QASLYPDGVLRFAGRICVPRVGDLIQLILSEGHESRYSIHPGTTKMYRDLRQHYWWSGMR 1223 Query: 621 RDIVNFMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*V 442 RDI +F+ + QQVK EH R GV +R+ I E KWERITM+ +VGL T DSI V Sbjct: 1224 RDIADFVSRCLCCQQVKAEHLRPGGVFKRLPIPEWKWERITMDFIVGLPRTPRGVDSIWV 1283 Query: 441 IVDRLTKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PE 262 IVDRLTKS HF+ V S + +LA+IYI E+V+ H VP+ II DRG QFTS FW E Sbjct: 1284 IVDRLTKSAHFLPVQCSFSAERLARIYIREVVRLHGVPVSIISDRGSQFTSNFWRTFQDE 1343 Query: 261 LGTQLDLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 LGT++DLST FHP TD QSERTIQVL D LR CV+DFGG DQFLP EF+YNN Sbjct: 1344 LGTRVDLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLAEFAYNN 1397 Score = 35.4 bits (80), Expect(2) = 6e-68 Identities = 17/37 (45%), Positives = 24/37 (64%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFEV*P 3 ++ Y+SSI MA +EALY R RSP+ F++ E P Sbjct: 1394 AYNNSYHSSIQMAPFEALYGRRCRSPVGWFESTEPRP 1430
>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum] Length = 1771 Score = 245 bits (626), Expect(2) = 1e-66 Identities = 132/234 (56%), Positives = 160/234 (68%) Frame = -3 Query: 801 EAILNSEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMK 622 +A L+ + VL+ R+C+ RV D I +I +AH S YSI+ G KMYR LRQHY W M+ Sbjct: 1320 QATLDPDGVLKFAGRICVPRVGDLIQLILSEAHESRYSIHPGTAKMYRDLRQHYWWSGMR 1379 Query: 621 RDIVNFMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*V 442 RDI +F+ + QQVK EH R G QR+ I
E KWERITM+ VVGL T DSI V Sbjct: 1380 RDIADFVSRCLCCQQVKAEHLRPGGEFQRLPIPEWKWERITMDFVVGLPRTSRGVDSIWV 1439 Query: 441 IVDRLTKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PE 262 IVDRLTKS HF+ V + + +LA+IYI E+V+ H VP+ II DRG QFTS FW E Sbjct: 1440 IVDRLTKSAHFLPVHTTFSAERLARIYIREVVRLHGVPVSIISDRGSQFTSSFWRAFQEE 1499 Query: 261 LGTQLDLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 LGT++ LST FHP TD QSERTIQVL D LR CV+DFGG +QFLP EF+YNN Sbjct: 1500 LGTRVHLSTSFHPQTDGQSERTIQVLEDMLRACVMDFGGQWEQFLPLAEFAYNN 1553 Score = 35.4 bits (80), Expect(2) = 1e-66 Identities = 17/37 (45%), Positives = 24/37 (64%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFEV*P 3 ++ Y+SSI MA +EALY R RSP+ F++ E P Sbjct: 1550 AYNNSYHSSIQMAPFEALYGRRCRSPVGWFESTEPRP 1586 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 213 bits (542), Expect(2) = 3e-57 Identities = 115/226 (50%), Positives = 146/226 (64%) Frame = -3 Query: 777 VLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMV 598 VLR + RLC+ VD + E+AHSS YS++ G+TKMYR LR+ Y W MK+ I F+ Sbjct: 1158 VLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEFVA 1217 Query: 597 QYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKS 418 + N QQVK EH R G+ Q + + E KWE I M+ + GL + + DSI VIVDR+TKS Sbjct: 1218 KCPNCQQVKVEHQRPGGLAQNIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVDRMTKS 1277 Query: 417 THFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLS 238 HF+ V + + AK+YI EIV+ H VPI II DRG QFT+ FW LG+++ LS Sbjct: 1278 AHFLPVKTTHSAEDYAKLYIQEIVRLHGVPISIISDRGAQFTAQFWKSFQKGLGSKVSLS 1337 Query: 237 TDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 T FHP TD Q+ERTIQ L D LR CVIDF + D LP EF+YNN Sbjct: 1338 TAFHPQTDGQAERTIQTLEDMLRACVIDFKSNWDDHLPLIEFAYNN 1383 Score = 36.2 bits (82), Expect(2) = 3e-57 Identities = 18/34 (52%), Positives = 22/34 (64%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFE 12 ++ Y+SSI MA YEALY R RSPI F+ E Sbjct: 1380 AYNNSYHSSIQMAPYEALYGRRCRSPIGWFEVGE 1413 >gb|AAT38744.1| Putative 
gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 213 bits (542), Expect(2) = 3e-57 Identities = 115/226 (50%), Positives = 146/226 (64%) Frame = -3 Query: 777 VLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMV 598 VLR + RLC+ VD + E+AHSS YS++ G+TKMYR LR+ Y W MK+ I F+ Sbjct: 1152 VLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEFVA 1211 Query: 597 QYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKS 418 + N QQVK EH R G+ Q + + E KWE I M+ + GL + + DSI VIVDR+TKS Sbjct: 1212 KCPNCQQVKVEHQRPGGLAQNIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVDRMTKS 1271 Query: 417 THFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLS 238 HF+ V + + AK+YI EIV+ H VPI II DRG QFT+ FW LG+++ LS Sbjct: 1272 AHFLPVRTTHSAEDYAKLYIQEIVRLHGVPISIISDRGAQFTAQFWKSFQKGLGSKVSLS 1331 Query: 237 TDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 T FHP TD Q+ERTIQ L D LR CVIDF + D LP EF+YNN Sbjct: 1332 TAFHPQTDGQAERTIQTLEDMLRACVIDFKSNWDDHLPLIEFAYNN 1377 Score = 36.2 bits (82), Expect(2) = 3e-57 Identities = 18/34 (52%), Positives = 22/34 (64%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFE 12 ++ Y+SSI MA YEALY R RSPI F+ E Sbjct: 1374 AYNNSYHSSIQMAPYEALYGRRCRSPIGWFEVGE 1407 >gb|EOX93994.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 811 Score = 217 bits (552), Expect(2) = 3e-56 Identities = 115/235 (48%), Positives = 153/235 (65%) Frame = -3 Query: 804 SEAILNSEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRM 625 SE L+ + L ++ R+C+ + D I E+AHSS Y+++ G+TKMYR +++ Y W M Sbjct: 473 SEFRLSDDGTLMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYRTIKESYWWPGM 532 Query: 624 KRDIVNFMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI* 445 KRDI F+ + QQ+K EH + G LQ + I E KWE +TM+ V+GL T D+I Sbjct: 533 KRDIAKFVAKCLTCQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFVLGLPRTQSGKDAIW 592 Query: 444 VIVDRLTKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*P 265 VIVDRLTKS HF+ + + + +LA++YI E+V+ H VPI I+ DR P+FTS FW Sbjct: 593 
VIVDRLTKSAHFLAIHSTYSIERLARLYIDEVVRLHGVPISIVSDRDPRFTSRFWPKFQE 652 Query: 264 ELGTQLDLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 LGT+L ST FHP TD QSERTIQ L D LR CVIDF G D+ LP EF+YNN Sbjct: 653 ALGTKLRFSTSFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 707 Score = 29.3 bits (64), Expect(2) = 3e-56 Identities = 12/27 (44%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + SSI MA YEALY + R+P+ Sbjct: 704 AYNNSFQSSIGMAPYEALYGRKCRTPL 730 >ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum] Length = 1946 Score = 207 bits (528), Expect(2) = 5e-55 Identities = 113/225 (50%), Positives = 144/225 (64%) Frame = -3 Query: 774 LRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMVQ 595 LR + RLC+ VD I E+AH+S YSI+ G+TKMYR LR Y WG MK+DI F+ Sbjct: 1504 LRYQGRLCVPCVDGLREKILEEAHNSSYSIHPGSTKMYRDLRDVYWWGGMKKDIAKFVSG 1563 Query: 594 YRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKST 415 + QQVK EH R G+ Q + I KWE I M+ VVGL T F SI V+VDR+TKS Sbjct: 1564 CHSCQQVKAEHQRPGGLTQDIEIPTWKWEEINMDFVVGLPKTRKGFGSIWVVVDRMTKSA 1623 Query: 414 HFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLST 235 HF+ V + A++YIH++V+ H +P+ II DRG QFTS+FW LGT++ L+T Sbjct: 1624 HFLPVKTTYGAEDYARLYIHDLVRLHGIPLSIISDRGTQFTSHFWKSFQRGLGTRVKLTT 1683 Query: 234 DFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 FHP TD Q+ERTIQ L D LR CV++ G + LP EFSYNN Sbjct: 1684 AFHPQTDGQAERTIQTLEDMLRACVLELKGSWEDHLPLIEFSYNN 1728 Score = 34.3 bits (77), Expect(2) = 5e-55 Identities = 17/35 (48%), Positives = 22/35 (62%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFEV 9 S+ Y+SSI MA +EALY R RS + F+ EV Sbjct: 1725 SYNNSYHSSIGMAPFEALYGRRCRSSVGLFEVGEV 1759 >gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum] Length = 1487 Score = 206 bits (523), Expect(2) = 5e-55 Identities = 115/226 (50%), Positives = 145/226 (64%) Frame = -3 Query: 777 VLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMV 598 VLR + RLC+ VD I E+AHSS YSI+ G TKMYR 
LR+ Y W MK+ I F+ Sbjct: 1043 VLRYQGRLCVPMVDGLQKRIMEEAHSSRYSIHPGFTKMYRDLREVYWWNGMKKGIAEFVA 1102 Query: 597 QYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKS 418 + N QQVK EH R G+ QR+ + E KWE I M+ + GL + + DSI VIVDR+TKS Sbjct: 1103 KCPNCQQVKVEHQRLGGLAQRIELLELKWEMINMDFITGLPRSRRQHDSIWVIVDRMTKS 1162 Query: 417 THFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLS 238 HF+ V +++ AK+YI E+V+ H VPI II +RG QF +F LG ++LS Sbjct: 1163 AHFLPVKTTNSAEDYAKLYIQEVVRLHGVPISIISNRGAQFWKFFQ----KGLGLNVNLS 1218 Query: 237 TDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 T FHP TD Q+ERTIQ L D LR CVIDF G+ D LP EF+YNN Sbjct: 1219 TAFHPQTDGQAERTIQTLEDMLRACVIDFKGNWDDHLPLIEFAYNN 1264 Score = 36.2 bits (82), Expect(2) = 5e-55 Identities = 18/34 (52%), Positives = 22/34 (64%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFE 12 ++ Y+SSI MA YEALY R RSPI F+ E Sbjct: 1261 AYNNSYHSSIQMAPYEALYGRRCRSPIGWFEVGE 1294 >gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum] Length = 1554 Score = 207 bits (528), Expect(2) = 1e-54 Identities = 113/226 (50%), Positives = 147/226 (65%) Frame = -3 Query: 777 VLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMV 598 VLR + RLC+ VD I E+AHSS YSI+ G+TKMY LR+ Y W MK+ I F+ Sbjct: 1158 VLRYQGRLCVPMVDGLQERIMEEAHSSRYSIHPGSTKMYHDLREVYWWNGMKKGIAEFVA 1217 Query: 597 QYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKS 418 + N QQVK EH R G+ QR+ + E KWE I M+ + GL + + DSI VIVD++TKS Sbjct: 1218 KCPNCQQVKVEHQRPVGLAQRIKLPEWKWEMINMDFITGLPKSHRQHDSIWVIVDQMTKS 1277 Query: 417 THFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLS 238 HF+ V ++ AK+Y+ EIV+ H +PI II DRG QFT+ FW LG++++LS Sbjct: 1278 AHFLPVRTTNIAEDYAKLYVQEIVRLHGIPISIISDRGAQFTAQFWKSFKKGLGSKVNLS 1337 Query: 237 TDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 T F+P TD Q+ERTI L D LR CVIDF G+ D LP EF+YNN Sbjct: 1338 TAFYPQTDGQAERTIHTLEDMLRACVIDFKGNWDDHLPLIEFAYNN 1383 Score = 32.7 bits (73), Expect(2) = 1e-54 Identities = 17/34 (50%), Positives = 21/34 (61%) Frame = -1 Query: 113 
SHTTIYYSSIAMASYEALYNMRYRSPIR*FDTFE 12 ++ Y+SSI MA YEALY R SPI F+ E Sbjct: 1380 AYNNSYHSSIHMAPYEALYGRRCISPIGWFEVGE 1413 >gb|EOY08653.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1110 Score = 213 bits (542), Expect(2) = 2e-54 Identities = 116/235 (49%), Positives = 152/235 (64%) Frame = -3 Query: 804 SEAILNSEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRM 625 SE L+ + L ++ R+C+ + D I E+AHSS Y+++L +TKMYR +++ Y W M Sbjct: 767 SEFRLSDDGTLMLRDRICVLKDDQLRRAILEEAHSSAYALHLESTKMYRTIKESYWWPGM 826 Query: 624 KRDIVNFMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI* 445 KRDI F+ + QQ+K EH + G LQ + I E KWE +TM+ V+GL T D+I Sbjct: 827 KRDIAEFVAKCLTCQQIKAEHQKLSGTLQPLPIPEWKWEHVTMDFVLGLLRTQSGKDAIW 886 Query: 444 VIVDRLTKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*P 265 VIVDRLTKS HF+ + + + KL K+YI EIV+ + VPI I+ DR P+FTS FW Sbjct: 887 VIVDRLTKSAHFLAIHNTYSIEKLVKLYIDEIVRLYGVPISIVSDRDPRFTSRFWSKFQE 946 Query: 264 ELGTQLDLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 LGT+L ST FHP TD QSERTIQ L D LR CVIDF G D+ LP EF+YNN Sbjct: 947 ALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 1001 Score = 26.9 bits (58), Expect(2) = 2e-54 Identities = 11/26 (42%), Positives = 17/26 (65%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSP 36 ++ + SSI MA YEALY + ++P Sbjct: 998 AYNNSFQSSIGMAPYEALYGRKCQTP 1023 >emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera] Length = 1573 Score = 209 bits (531), Expect(2) = 2e-54 Identities = 109/225 (48%), Positives = 148/225 (65%) Frame = -3 Query: 774 LRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMVQ 595 +R K RLC+ + + + + AH + Y+I+ G TKMY+ L++ + W MKRDI F+ Sbjct: 1180 VRFKGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFXWSGMKRDIAQFVAN 1239 Query: 594 YRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKST 415 + QQVK EH R +LQ + I + KW+ ITM+ V+GL T K + + VIVDRLTKS Sbjct: 1240 CQICQQVKAEHQRPAELLQPLPIPKWKWDNITMDFVIGLPRTRSKKNGVWVIVDRLTKSA 1299 Query: 414 
HFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLST 235 HF+ + +D+ LAK+YI EIV+ H +P+ I+ DR P+FTS FW L LGTQL+ ST Sbjct: 1300 HFLAMKTTDSMNSLAKLYIQEIVRLHGIPVSIVSDRDPKFTSQFWQSLQRALGTQLNFST 1359 Query: 234 DFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 FHP TD QSER IQ+L D LR CV+DFGG+ +LP EF+YNN Sbjct: 1360 VFHPQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNN 1404 Score = 30.8 bits (68), Expect(2) = 2e-54 Identities = 14/27 (51%), Positives = 17/27 (62%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ Y SSI MA YEALY RSP+ Sbjct: 1401 AYNNXYQSSIGMAPYEALYGRPCRSPL 1427 >gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 210 bits (534), Expect(2) = 3e-54 Identities = 113/235 (48%), Positives = 152/235 (64%) Frame = -3 Query: 804 SEAILNSEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRM 625 SE L+ + L ++ R+C+ + D I E+AH S Y+++ G+TKMYR +++ Y W M Sbjct: 1072 SEFRLSDDGTLMLRDRICVPKDDQLRRAILEEAHYSAYALHPGSTKMYRTIKESYWWPGM 1131 Query: 624 KRDIVNFMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI* 445 +RDI F+ + QQ+K EH + G LQ +SI E KWE +TM+ V+GL T D+I Sbjct: 1132 ERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLSIPEWKWEHVTMDFVLGLPRTQSGKDAIW 1191 Query: 444 VIVDRLTKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*P 265 VIVDRLTKS HF+ + + + +LA++YI EIV+ H VP+ I+ DR +FTS FW Sbjct: 1192 VIVDRLTKSAHFLAIHSTYSIERLARLYIDEIVRLHGVPVSIVSDRDLRFTSRFWPKFQE 1251 Query: 264 ELGTQLDLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 LGT+L ST FHP TD QSERTIQ L D LR CVIDF G D+ LP EF+YNN Sbjct: 1252 ALGTKLRFSTAFHPQTDGQSERTIQTLEDMLRACVIDFIGSWDRHLPLVEFAYNN 1306 Score = 29.3 bits (64), Expect(2) = 3e-54 Identities = 12/27 (44%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + SSI MA YEALY + R+P+ Sbjct: 1303 AYNNSFQSSIGMAPYEALYGRKCRTPL 1329 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 209 bits (531), Expect(2) = 7e-54 Identities = 112/229 (48%), Positives = 147/229 (64%) Frame = -3 
Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 331 TDGVLRYGTRLYVPDGDGLRRKILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 390 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 391 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 450 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 451 TKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 510 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP EF+YNN Sbjct: 511 DFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNN 559 Score = 29.3 bits (64), Expect(2) = 7e-54 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 556 AYNNSFQTSIQMAPFEALYGRRCRSPI 582 >gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1400 Score = 208 bits (530), Expect(2) = 9e-54 Identities = 112/229 (48%), Positives = 146/229 (63%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 1008 TDGVLRYGTRLYVPDGDGLRREILEEAHMVAYVVHPGATKMYQDLKEVYWWEELKRDVAE 1067 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 1068 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 1127 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+FDRG QFT FW L LGT+L Sbjct: 1128 TKSAHFLPVKTTYGAAQYARVYVDEIVRQHGIPISIVFDRGAQFTGRFWGKLQEALGTKL 1187 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP EF+YNN Sbjct: 1188 DFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNN 1236 
Score = 29.3 bits (64), Expect(2) = 9e-54 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 1233 AYNNSFQTSIQMAPFEALYGRRCRSPI 1259 >gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 208 bits (530), Expect(2) = 9e-54 Identities = 112/229 (48%), Positives = 147/229 (64%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 614 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 673 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 674 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 733 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 734 TKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 793 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP EF+YNN Sbjct: 794 DFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNN 842 Score = 29.3 bits (64), Expect(2) = 9e-54 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 839 AYNNSFQTSIQMAPFEALYGRRCRSPI 865 >gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 208 bits (530), Expect(2) = 9e-54 Identities = 112/229 (48%), Positives = 147/229 (64%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 79 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 138 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 139 
FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 198 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 199 TKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 258 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP EF+YNN Sbjct: 259 DFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNN 307 Score = 29.3 bits (64), Expect(2) = 9e-54 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 304 AYNNSFQTSIQMAPFEALYGRRCRSPI 330 >gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 208 bits (530), Expect(2) = 9e-54 Identities = 112/229 (48%), Positives = 147/229 (64%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 6 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 65 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 66 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 125 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 126 TKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 185 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP EF+YNN Sbjct: 186 DFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNN 234 Score = 29.3 bits (64), Expect(2) = 9e-54 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 231 AYNNSFQTSIQMAPFEALYGRRCRSPI 257 >ref|XP_004243173.1| PREDICTED: uncharacterized protein LOC101250031 [Solanum lycopersicum] Length = 609 Score = 216 bits (549), Expect = 
1e-53 Identities = 114/177 (64%), Positives = 131/177 (74%) Frame = -3 Query: 759 RLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMVQYRNYQ 580 R+C+ RVD+ IH I +AHSS YSI+ GATKMYR L+QH+ W RMKRDIV+F+ Q N Q Sbjct: 423 RVCVPRVDELIHTILTEAHSSRYSIHPGATKMYRDLKQHFWWSRMKRDIVDFVAQCPNCQ 482 Query: 579 QVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKSTHFILV 400 QVKYEH R G LQRM I E KWERI M+ VVGL TLGKFDSI VIVDRLTKS HFI V Sbjct: 483 QVKYEHQRPGGTLQRMPIPEWKWERIAMDFVVGLPKTLGKFDSIWVIVDRLTKSAHFIPV 542 Query: 399 *VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLSTDF 229 V+ N KL ++YI E+V H VP+ II DRG QFTS W L ELGT+LDLST+F Sbjct: 543 KVTYNAEKLVRLYISEVVWLHGVPLSIISDRGTQFTSKLWRTLHAELGTRLDLSTNF 599 >gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 206 bits (525), Expect(2) = 2e-53 Identities = 111/229 (48%), Positives = 147/229 (64%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 448 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 507 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VDRL Sbjct: 508 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRL 567 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 568 TKSAHFLPVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 627 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTI+ L D LR CVID G +Q+LP EF+YNN Sbjct: 628 DFSTAFHPQTDGQSERTIKTLEDMLRACVIDLGVKWEQYLPLVEFAYNN 676 Score = 30.0 bits (66), Expect(2) = 2e-53 Identities = 13/27 (48%), Positives = 19/27 (70%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA++EALY R RSPI Sbjct: 673 AYNNSFQTSIQMAAFEALYGRRCRSPI 699 >emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera] Length = 1313 Score = 207 bits (528), 
Expect(2) = 5e-53 Identities = 108/225 (48%), Positives = 148/225 (65%) Frame = -3 Query: 774 LRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVNFMVQ 595 +R K RLC+ + + + + AH + Y+I+ G TKMY+ L++ + W MKRDI F+ Sbjct: 874 VRFKGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFWWSGMKRDIAQFVAN 933 Query: 594 YRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRLTKST 415 ++ QQVK EH R G+LQ + I E KW+ ITM+ V+GL T K + + VIVD LTKS Sbjct: 934 FQICQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFVIGLPRTRSKKNGVWVIVDCLTKSA 993 Query: 414 HFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQLDLST 235 HF+ + +D+ LAK+YI EIV+ H + + I+ DR P+FTS FW L LGTQL+ +T Sbjct: 994 HFLAMKTTDSMNSLAKLYIQEIVRLHGILVSIVSDRDPKFTSQFWQSLQRALGTQLNFNT 1053 Query: 234 DFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 FHP TD QSER IQ+L D LR CV+DFGG+ +LP EF+YNN Sbjct: 1054 AFHPQTDGQSERVIQILEDMLRACVLDFGGNWADYLPLAEFAYNN 1098 Score = 27.7 bits (60), Expect(2) = 5e-53 Identities = 13/27 (48%), Positives = 16/27 (59%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ Y SSI A YEALY RSP+ Sbjct: 1095 AYNNSYQSSIXXAPYEALYGRPCRSPL 1121 >gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 206 bits (524), Expect(2) = 5e-53 Identities = 111/229 (48%), Positives = 146/229 (63%) Frame = -3 Query: 786 SEWVLRIKKRLCISRVDDFIHVIFEKAHSSCYSIYLGATKMYRHLRQHY*WGRMKRDIVN 607 ++ VLR RL + D I E+AH + Y ++ GATKMY+ L++ Y W +KRD+ Sbjct: 237 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAE 296 Query: 606 FMVQYRNYQQVKYEH*RHWGVLQRMSIFE*KWERITMNLVVGLSMTLGKFDSI*VIVDRL 427 F+ + QQVK EH + G+LQ + + E KWE I M+ V GL T G +DSI ++VD+L Sbjct: 297 FVSKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDQL 356 Query: 426 TKSTHFILV*VSDNT*KLAKIYIHEIVQFHKVPIFIIFDRGPQFTSYFWWIL*PELGTQL 247 TKS HF+ V + A++Y+ EIV+ H +PI I+ DRG QFTS FW L LGT+L Sbjct: 357 TKSAHFLPVKTTYGAAHYARVYVDEIVRLHGIPISIVSDRGAQFTSRFWGKLQEALGTKL 416 Query: 246 DLSTDFHP*TDEQSERTIQVLVDRLR*CVIDFGGH*DQFLPFPEFSYNN 100 D ST FHP TD QSERTIQ L D LR CVID G +Q+LP 
EF+YNN Sbjct: 417 DFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNN 465 Score = 29.3 bits (64), Expect(2) = 5e-53 Identities = 13/27 (48%), Positives = 18/27 (66%) Frame = -1 Query: 113 SHTTIYYSSIAMASYEALYNMRYRSPI 33 ++ + +SI MA +EALY R RSPI Sbjct: 462 AYNNSFQTSIQMAPFEALYGRRCRSPI 488