BLASTX nr result
ID: Cheilocostus21_contig00053900
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00053900
         (2740 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF
excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOX99807.1| CCHC-type integrase [Theobroma cacao]                  146   7e-37
gb|EOY17292.1| CCHC-type integrase [Theobroma cacao]                  145   1e-36
gb|EOY03103.1| CCHC-type integrase [Theobroma cacao]                  146   3e-36
ref|XP_008222092.1| PREDICTED: uncharacterized protein LOC103322...   146   7e-36
ref|XP_020963827.1| uncharacterized protein LOC107611884 [Arachi...   143   1e-35
gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]              144   5e-35
gb|EOY32249.1| CCHC-type integrase [Theobroma cacao]                  144   1e-34
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    149   2e-34
gb|EOY14001.1| CCHC-type integrase, putative [Theobroma cacao]        143   2e-34
gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]                  146   3e-34
gb|OTG11525.1| putative reverse transcriptase domain-containing ...   134   3e-34
gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]    145   4e-34
emb|CAJ65807.1| polyprotein, partial [Citrus sinensis]                135   4e-34
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   148   5e-34
gb|PNX92676.1| putative retrotransposon Ty3-gypsy subclass prote...   130   6e-34
dbj|GAV84963.1| hypothetical protein CFOL_v3_28404, partial [Cep...   137   1e-33
gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]        147   1e-33
gb|PNX70480.1| retrotransposon-related protein, partial [Trifoli...
139 1e-33 emb|CAA73042.1| polyprotein, partial [Ananas comosus] 130 2e-33 ref|XP_017981007.1| PREDICTED: uncharacterized protein LOC108663... 141 2e-33 >gb|EOX99807.1| CCHC-type integrase [Theobroma cacao] Length = 165 Score = 146 bits (369), Expect = 7e-37 Identities = 66/109 (60%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 15 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 74 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 75 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 123 >gb|EOY17292.1| CCHC-type integrase [Theobroma cacao] Length = 136 Score = 145 bits (365), Expect = 1e-36 Identities = 65/109 (59%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+L++A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 15 KRHEQNYPIHDLKMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 74 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 75 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 123 >gb|EOY03103.1| CCHC-type integrase [Theobroma cacao] Length = 214 Score = 146 bits (369), Expect = 3e-36 Identities = 66/109 (60%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 15 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 74 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 75 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 123 >ref|XP_008222092.1| PREDICTED: uncharacterized protein LOC103322015 [Prunus mume] Length = 244 Score = 146 bits (369), Expect = 7e-36 Identities = 71/119 (59%), Positives = 90/119 
(75%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 KKHE+NY VH+LELA VVFALKIWR+YLYG T IFT+H+SLKY F+ +LNMRQ RW+E Sbjct: 15 KKHERNYPVHDLELAAVVFALKIWRHYLYGETCQIFTDHKSLKYFFTQKELNMRQRRWLE 74 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMVHYTENG 2560 L+KDYDCTI YH G+A+VV DALSRK + H++ A L+ ++ + GVEM + G Sbjct: 75 LIKDYDCTIEYHPGRANVVADALSRKAPANLAHIKAAYLPLLVELRKEGVEMEMTQQGG 133 >ref|XP_020963827.1| uncharacterized protein LOC107611884 [Arachis ipaensis] Length = 2309 Score = 143 bits (360), Expect(2) = 1e-35 Identities = 66/108 (61%), Positives = 86/108 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 KKHE+NY H+LE+A V+FALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 758 KKHEQNYPTHDLEMAAVIFALKIWRHYLYGETCEIYTDHKSLKYIFQQKDLNLRQRRWME 817 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTEL 2527 LLKDYDCTI YH GKA+VV DALSRK++G + H+ +A +V +V +L Sbjct: 818 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHITLARRPIVEEVHQL 865 Score = 38.5 bits (88), Expect(2) = 1e-35 Identities = 24/68 (35%), Positives = 40/68 (58%), Gaps = 5/68 (7%) Frame = +1 Query: 2548 H*EWMSPLVQKIIEKQPDDLYLQSIIT---KGKRHEFT*DSEGVIRCWNRLCVPEL--VK 2712 H S L+++I Q DD L+ +I G+ +F+ D + V+RC RLCVP+ +K Sbjct: 883 HVRAQSSLIEQIKAAQRDDPKLRKLIEDVRNGRNSKFSLDQD-VLRCGQRLCVPDNHDLK 941 Query: 2713 EELMDEAH 2736 + +++EAH Sbjct: 942 KAILEEAH 949 >gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa] Length = 2037 Score = 144 bits (364), Expect(2) = 5e-35 Identities = 68/115 (59%), Positives = 88/115 (76%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 KKHE+NY H+LE+ V+FALKIWR+YLYG T IFT+H+SLKY+F LN+RQ RW+E Sbjct: 1818 KKHEQNYPTHDLEMTAVIFALKIWRHYLYGETCEIFTDHKSLKYIFQQRDLNLRQRRWME 1877 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMVHY 2548 LLKDYDCTI+YH GKA+VV DALSRK+ G + H++ L+R++ EL E V + Sbjct: 1878 LLKDYDCTIHYHPGKANVVADALSRKSSGSLAHIQEVRRPLIRELHELVDEGVRF 1932 Score = 35.0 bits 
(79), Expect(2) = 5e-35 Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 5/69 (7%) Frame = +1 Query: 2548 H*EWMSPLVQKIIEKQPDD---LYLQSIITKGKRHEFT*DSEGVIRCWNRLCVPEL--VK 2712 H + S L KI Q D L +++ + +GK F + V+R +RLCVP++ ++ Sbjct: 1943 HFQVKSDLFDKIKAAQKKDDSLLRIRNEVEQGKAAGFVIGDDDVLRYKDRLCVPDVDDLR 2002 Query: 2713 EELMDEAHR 2739 ELM EAH+ Sbjct: 2003 RELMVEAHQ 2011 >gb|EOY32249.1| CCHC-type integrase [Theobroma cacao] Length = 282 Score = 144 bits (364), Expect = 1e-34 Identities = 65/109 (59%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F L++RQ RW+E Sbjct: 53 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLDLRQRRWME 112 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 113 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 161 >gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 149 bits (377), Expect(2) = 2e-34 Identities = 67/109 (61%), Positives = 88/109 (80%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQHRW+E Sbjct: 969 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQHRWME 1028 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 1029 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 1077 Score = 28.1 bits (61), Expect(2) = 2e-34 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 7/63 (11%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSIIT-----KGKRHEFT*DSEGVIRCWNRLCVP--ELVKEELMD 2727 L+ +I E Q D ++ + KGK FT ++GV+R RL VP + ++ E+++ Sbjct: 1101 LMDRIKEAQSKDEFVIKALEDPQGRKGKM--FTKGTDGVLRYGTRLYVPDGDGLRREILE 1158 Query: 2728 EAH 2736 EAH Sbjct: 1159 EAH 1161 >gb|EOY14001.1| CCHC-type integrase, putative [Theobroma cacao] Length = 268 Score = 143 bits (360), Expect = 2e-34 
Identities = 65/109 (59%), Positives = 86/109 (78%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 11 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 70 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKD DCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 71 LLKDCDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 119 >gb|EOY31663.1| CCHC-type integrase [Theobroma cacao] Length = 395 Score = 146 bits (369), Expect = 3e-34 Identities = 66/109 (60%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 75 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 134 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 135 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 183 >gb|OTG11525.1| putative reverse transcriptase domain-containing protein [Helianthus annuus] Length = 1278 Score = 134 bits (337), Expect(2) = 3e-34 Identities = 64/110 (58%), Positives = 87/110 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K +E NYL H+LELA V+FALKIWR+YLYG T IFT+H+SLKY+F+ +LNMRQ RW+E Sbjct: 942 KPYEVNYLTHDLELAAVIFALKIWRHYLYGETCDIFTDHKSLKYIFTQKELNMRQRRWLE 1001 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGV 2533 LLKDYD I YH G+A+VV DALSRK+ G ++ L++ P +++ + +LG+ Sbjct: 1002 LLKDYDANIQYHPGRANVVADALSRKSSGSISSLQLQP-QILTDLDKLGI 1050 Score = 42.7 bits (99), Expect(2) = 3e-34 Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSIITK---GKRHEFT*DSEGVIRCWNRLCVP--ELVKEELMDEA 2733 L+ +I Q DD L +II GK+ EF D GV+ C RLCVP ++E L+ EA Sbjct: 1070 LISRIKSAQQDDGELWAIIQNLEVGKQSEFRIDENGVVWCGKRLCVPNDSTLRESLLAEA 1129 Query: 2734 H 2736 H Sbjct: 1130 H 1130 >gb|EOY03078.1| 
Retrotransposon protein, putative [Theobroma cacao] Length = 1263 Score = 145 bits (367), Expect(2) = 4e-34 Identities = 67/114 (58%), Positives = 87/114 (76%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +HNLE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 681 KRHEQNYPIHNLEIAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 740 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMVH 2545 LLKDYDCTI YH GKA+VV DA SRK++G + H+ LV+++ LG VH Sbjct: 741 LLKDYDCTILYHPGKANVVADAFSRKSMGSLAHISTGRRSLVKEIHSLGDIGVH 794 Score = 30.8 bits (68), Expect(2) = 4e-34 Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 7/63 (11%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSIIT-----KGKRHEFT*DSEGVIRCWNRLCVP--ELVKEELMD 2727 L+ KI E Q D ++ I KGK FT ++GV+R RL VP + ++ E+++ Sbjct: 813 LMDKIKEAQSKDEFVTKAIEDPQGRKGKM--FTKGTDGVLRYGTRLYVPDGDGLRREILE 870 Query: 2728 EAH 2736 EAH Sbjct: 871 EAH 873 >emb|CAJ65807.1| polyprotein, partial [Citrus sinensis] Length = 533 Score = 135 bits (340), Expect(2) = 4e-34 Identities = 65/112 (58%), Positives = 85/112 (75%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY H+L+LA VVFALKIWR+YLY T FT+H+SLKYL + +LN RQ RW+E Sbjct: 176 KEHEQNYPTHDLKLAAVVFALKIWRHYLYRATCQNFTDHKSLKYLVTQKELNSRQRRWIE 235 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEM 2539 L+KDYDCTI++H GKA+VV DALSRK+ + HLR L+ ++ LGVE+ Sbjct: 236 LIKDYDCTIDFHPGKANVVADALSRKSFSSIAHLRGTYMPLLIELRSLGVEL 287 Score = 41.2 bits (95), Expect(2) = 4e-34 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 5/61 (8%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSI---ITKGKRHEFT*DSEGVIRCWNRLCVPEL--VKEELMDEA 2733 L+ K+ + Q DL L + + K R +F GV+ NRLCVP++ +K+E+M+EA Sbjct: 305 LIDKVHQMQDQDLQLLKLKENVQKDLRTDFAVRDNGVLVMGNRLCVPDIKELKKEIMEEA 364 Query: 2734 H 2736 H Sbjct: 365 H 365 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 148 bits (374), Expect(2) = 5e-34 
Identities = 67/109 (61%), Positives = 87/109 (79%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +HNLE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 166 KRHEQNYPIHNLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 225 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELG 2530 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Sbjct: 226 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLG 274 Score = 27.7 bits (60), Expect(2) = 5e-34 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 7/63 (11%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSIIT-----KGKRHEFT*DSEGVIRCWNRLCVP--ELVKEELMD 2727 L+ KI E Q D ++ + KGK FT ++GV+R RL VP + ++ ++++ Sbjct: 298 LMDKIKEAQSKDEFVIKALEDPQGRKGKM--FTKGTDGVLRYGTRLYVPDGDGLRRKILE 355 Query: 2728 EAH 2736 EAH Sbjct: 356 EAH 358 >gb|PNX92676.1| putative retrotransposon Ty3-gypsy subclass protein, partial [Trifolium pratense] gb|PNY12677.1| putative retrotransposon Ty3-gypsy subclass protein, partial [Trifolium pratense] Length = 740 Score = 130 bits (326), Expect(2) = 6e-34 Identities = 63/110 (57%), Positives = 83/110 (75%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K HE+NY H+LELA VVF LKIWR+YLYG F +F++H+SLKYLF +LNM Q RW+E Sbjct: 129 KVHERNYPTHDLELAAVVFVLKIWRHYLYGSRFEVFSDHKSLKYLFDQKELNMWQRRWLE 188 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGV 2533 LLKDYD ++YH GKA+VV+DALSRK++ M+ L EL+ + +LG+ Sbjct: 189 LLKDYDFELSYHPGKANVVVDALSRKSLH-MSSLMAKELELIEEFRDLGL 237 Score = 45.8 bits (107), Expect(2) = 6e-34 Identities = 22/61 (36%), Positives = 39/61 (63%), Gaps = 5/61 (8%) Frame = +1 Query: 2572 VQKIIEKQPDD---LYLQSIITKGKRHEFT*DSEGVIRCWNRLCVPEL--VKEELMDEAH 2736 +++++EKQ D L +++I KGK + D GV+RC R+CVP++ +K +++E H Sbjct: 258 LEEVVEKQKTDTRLLKFKALIEKGKELDIKIDENGVMRCRGRVCVPDIPELKRMILEEGH 317 Query: 2737 R 2739 R Sbjct: 318 R 318 >dbj|GAV84963.1| hypothetical protein CFOL_v3_28404, partial [Cephalotus follicularis] Length = 148 Score = 137 
bits (344), Expect = 1e-33 Identities = 63/103 (61%), Positives = 81/103 (78%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K HE+NY H+LELA V+FALKIWR+YLYG IFT+H+SLKY+F+ +LNMRQ RW+E Sbjct: 45 KTHEQNYPTHDLELAAVIFALKIWRHYLYGAKCEIFTDHKSLKYIFTQKELNMRQRRWLE 104 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVR 2512 L+KDYDCTI YH GKA+VV DALSRK++G + + +L+R Sbjct: 105 LIKDYDCTIQYHPGKANVVADALSRKSMGNLAMMITTQEDLIR 147 >gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao] Length = 508 Score = 147 bits (371), Expect = 1e-33 Identities = 69/118 (58%), Positives = 90/118 (76%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY +H+LE+A +VFALKIWR+YLYG T I+T+H+SLKY+F LN+RQ RW+E Sbjct: 386 KRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWME 445 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMVHYTEN 2557 LLKDYDCTI YH GKA+VV DALSRK++G + H+ + LVR++ LG Y EN Sbjct: 446 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREIHSLGDIGQVYGEN 503 >gb|PNX70480.1| retrotransposon-related protein, partial [Trifolium pratense] Length = 219 Score = 139 bits (350), Expect = 1e-33 Identities = 63/108 (58%), Positives = 83/108 (76%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HE+NY H+LE+A V+FALKIWR+YLYG T I+T+H+SL+Y+F LN+RQ RW+E Sbjct: 75 KRHEQNYPTHDLEMAAVIFALKIWRHYLYGETLEIYTDHKSLQYIFKQRDLNLRQRRWME 134 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTEL 2527 LLKDYDCTI YH GKA+VV DALSRK++G + HL ++ + EL Sbjct: 135 LLKDYDCTILYHPGKANVVADALSRKSMGSLAHLAAIKRPIINEFQEL 182 >emb|CAA73042.1| polyprotein, partial [Ananas comosus] Length = 871 Score = 130 bits (327), Expect(2) = 2e-33 Identities = 64/113 (56%), Positives = 85/113 (75%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K++EKNY H+LELA VVFALK+WR+YLYG ++T+H+SLKYLF+ +LN+RQ RW+E Sbjct: 343 
KEYEKNYPTHDLELAAVVFALKLWRHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLE 402 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMV 2542 LLKDYD TI YH GKA+VV DALSRK++ + V L+ Q+ L +E+V Sbjct: 403 LLKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQMKRLELEIV 455 Score = 43.9 bits (102), Expect(2) = 2e-33 Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 5/62 (8%) Frame = +1 Query: 2569 LVQKIIEKQPDDLYLQSIITK---GKRHEFT*DSEGVIRCWNRLCVP--ELVKEELMDEA 2733 L+ +I EKQ D+ LQ I K G +FT D +G++R R+CVP +KE+++ EA Sbjct: 472 LLDRIKEKQASDVELQKIKGKMVDGCTGDFTLDGDGLMRFRGRICVPADSGIKEDILQEA 531 Query: 2734 HR 2739 HR Sbjct: 532 HR 533 >ref|XP_017981007.1| PREDICTED: uncharacterized protein LOC108663032 [Theobroma cacao] Length = 296 Score = 141 bits (355), Expect = 2e-33 Identities = 65/114 (57%), Positives = 85/114 (74%) Frame = +2 Query: 2204 KKHEKNYLVHNLELAPVVFALKIWRYYLYGVTF*IFTNHQSLKYLFS*NQLNMRQHRWVE 2383 K+HEK Y VHNLE+ +VFALKIWR+YLYG T+ I+TNH+SLKY+F LN+ Q RW+E Sbjct: 64 KRHEKKYPVHNLEMEAIVFALKIWRHYLYGETYEIYTNHKSLKYIFQQRDLNLWQRRWME 123 Query: 2384 LLKDYDCTINYHLGKASVVIDALSRKTIGVMTHLRVAPTELVRQVTELGVEMVH 2545 LLKDYDCTI YH K +VV DALSRK++G + H+ + L++++ LG VH Sbjct: 124 LLKDYDCTILYHPSKVNVVADALSRKSMGSLAHISIDMRPLIKEMHSLGDVGVH 177
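
Note on the statistics fields: the percentages printed after Identities and Positives can be re-derived from the fractions beside them. In the records above they match truncation rather than rounding (66/109 is reported as 60%, 87/109 as 79%). The following is a minimal Python sketch for extracting and re-checking those fields, assuming the statistics text has the shape seen in this report. The names `HSP_STATS` and `parse_hsp_stats` are illustrative only and are not part of BLAST.

```python
import re

# Matches one HSP statistics block as it appears in this report, e.g.
# "Score = 146 bits (369), Expect = 7e-37 Identities = 66/109 (60%), Positives = 87/109 (79%)".
# \s+ is used between fields so the pattern works whether or not the
# original line breaks survived.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>[^\s,]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\((?P<ident_pct>\d+)%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\((?P<pos_pct>\d+)%\)"
)


def parse_hsp_stats(text):
    """Return one dict per HSP statistics block found in `text`."""
    hsps = []
    for m in HSP_STATS.finditer(text):
        evalue = m.group("evalue")
        # Assumption about other reports (not seen in this one): a value
        # printed without a leading digit, such as "e-105", means 1e-105.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        pos = int(m.group("pos"))
        length = int(m.group("length"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "positives": pos,
            "align_length": length,
            # Floor division reproduces the truncated percentages in the report.
            "identity_pct": 100 * ident // length,
            "positive_pct": 100 * pos // length,
            "reported_identity_pct": int(m.group("ident_pct")),
            "reported_positive_pct": int(m.group("pos_pct")),
        })
    return hsps


if __name__ == "__main__":
    sample = (
        "Score = 146 bits (369), Expect = 7e-37 "
        "Identities = 66/109 (60%), Positives = 87/109 (79%) Frame = +2"
    )
    for hsp in parse_hsp_stats(sample):
        print(hsp)
```

The sketch reads only the statistics fields; it does not parse the alignment rows themselves, and hits with two HSPs (marked Expect(2)) simply yield two entries.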