BLASTX nr result
ID: Paeonia24_contig00012057
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00012057
         (621 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]  148   3e-43
ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobrom...  147   1e-42
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]  145   2e-42
ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [The...  148   3e-42
emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera]  145   6e-42
emb|CAN68955.1| hypothetical protein VITISV_014191 [Vitis vinifera]  147   1e-41
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]  135   7e-41
emb|CAA73042.1| polyprotein [Ananas comosus]                         145   9e-41
ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [The...  145   1e-40
ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [The...  143   4e-40
ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [The...  143   4e-40
ref|XP_002268718.2| PREDICTED: HIPL1 protein-like [Vitis vinifera]   150   7e-40
ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobrom...  143   7e-40
gb|AAP43915.1| integrase [Gossypium herbaceum]                       141   7e-40
ref|XP_007032149.1| CCHC-type integrase [Theobroma cacao] gi|508...  142   7e-40
gb|AAD20658.1| putative retroelement pol polyprotein [Arabidopsi...  144   1e-39
ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [The...  143   1e-39
gb|ADB85337.1| putative retrotransposon protein [Phyllostachys e...  137   2e-39
ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [The...  142   3e-39
gb|ABG66286.1| retrotransposon protein, putative, Ty3-gypsy subc...  140   4e-39

>emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]
          Length = 1495

 Score = 148 bits (373), Expect(2) = 3e-43
 Identities = 61/98 (62%), Positives = 78/98 (79%)
 Frame = -3

Query: 295  GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
            G +RF GRLCVP D +LR E+L +AH+AKYT+HPG+TKM+ DLKR +WW GMKRD+ +FV
Sbjct: 1082 GSVRFKGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFWWSGMKRDIAQFV 1141

Query: 115  ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
            C CQ VKA+HQ+P GLL L IP+WKW+ +TMDF+
Sbjct: 1142 XNCQICQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFV 1179

 Score = 53.9 bits (128), Expect(2) = 3e-43
 Identities = 32/84 (38%), Positives = 50/84 (59%), Gaps = 1/84 (1%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRKQ-GRMSLILMQKWWSDLEFLSQHCVCVSSTGDV 445
            E L+DY F L Y PGK N VADALSRK G++S + ++++ + +C+S G
Sbjct: 989  ETLEDYDFALHYHPGKANVVADALSRKSYGQLSNLGLREFEMH-AVIEDFELCLSQEGRG 1047

Query: 444  RMMGNMSVQPTLISRIIATQQNDE 373
            + ++S +P +I RI+ Q +DE
Sbjct: 1048 PCLYSISARPMVIQRIVEAQVHDE 1071

>ref|XP_007022613.1| Uncharacterized protein TCM_033423 [Theobroma cacao]
 gi|508722241|gb|EOY14138.1| Uncharacterized protein TCM_033423 [Theobroma cacao]
          Length = 809

 Score = 147 bits (371), Expect(2) = 1e-42
 Identities = 61/106 (57%), Positives = 81/106 (76%)
 Frame = -3

Query: 319 SDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGM 140
           S++ ++D G R+CVP D++LR +LEEAH + Y +HPGSTKM+ +K +YWW GM
Sbjct: 577 SEFRLNDDGIFMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYRTIKESYWWPGM 636

Query: 139 KRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           KRD+ EFVA+CLTCQ +KA+HQKP G L L IP+WKWE+VTMDF+
Sbjct: 637 KRDIAEFVAKCLTCQQIKAEHQKPSGTLQPLLIPEWKWEHVTMDFV 682

 Score = 52.8 bits (125), Expect(2) = 1e-42
 Identities = 28/83 (33%), Positives = 48/83 (57%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
           EL+KDY + Y PGK N VADALSRK L ++S L + + +++ D
Sbjct: 480 ELIKDYDLVIDYHPGKENVVADALSRKSSSSLATLQSSYFSMLLEMKSLGIQLNNGEDGT 539

Query: 441 MMGNMSVQPTLISRIIATQQNDE 373
           ++ + V+P+L+++I Q++D+
Sbjct: 540 LLASFVVRPSLLNQIRELQKSDD 562

>emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]
          Length = 1573

 Score = 145 bits (365), Expect(2) = 2e-42
 Identities = 62/109 (56%), Positives = 84/109 (77%)
 Frame = -3

Query: 328  ERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWW 149
            E + ++S+ + G +RF GRLCVP D +LR E+L +AH+AKYT+HPG+TKM+ DLKR + W
Sbjct: 1167 EIDENWSMYEDGSVRFKGRLCVPKDVELRNELLADAHRAKYTIHPGNTKMYQDLKRQFXW 1226

Query: 148  DGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
            GMKRD+ +FVA C CQ VKA+HQ+P LL L IP+WKW+ +TMDF+
Sbjct: 1227 SGMKRDIAQFVANCQICQQVKAEHQRPAELLQPLPIPKWKWDNITMDFV 1275

 Score = 53.9 bits (128), Expect(2) = 2e-42
 Identities = 32/92 (34%), Positives = 46/92 (50%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
            E L+DY F L Y PGK N VADALSRK L + + + +C+ G
Sbjct: 1072 ETLEDYDFALHYHPGKANVVADALSRKSYGQLFSLGLREFEMYAVIEDFELCLVQEGRGP 1131

Query: 441  MMGNMSVQPTLISRIIATQQNDETILNKKVSL 346
            + ++S +P +I RI+ Q +DE + K L
Sbjct: 1132 CLYSISARPMVIQRIVEAQVHDEFLEKVKAQL 1163

>ref|XP_007044383.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508708318|gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1537

 Score = 148 bits (374), Expect(2) = 3e-42
 Identities = 65/124 (52%), Positives = 89/124 (71%)
 Frame = -3

Query: 373  DYFEQEGKFVFGKPEERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHP 194
            D+ +QE V + + S++ + D G L R+CVP D++LR +LEEAH + Y +HP
Sbjct: 1057 DWLKQE---VQKLQDGKASEFRLSDDGTLMLRDRICVPKDDQLRRAILEEAHYSAYALHP 1113

Query: 193  GSTKMFMDLKRTYWWDGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVT 14
            GSTKM+ +K +YWW GM+RD+ EFVA+CLTCQ +KA+HQKP G L L IP+WKWE+VT
Sbjct: 1114 GSTKMYRTIKESYWWPGMERDIAEFVAKCLTCQQIKAEHQKPSGTLQPLSIPEWKWEHVT 1173

Query: 13   MDFI 2
            MDF+
Sbjct: 1174 MDFV 1177

 Score = 50.1 bits (118), Expect(2) = 3e-42
 Identities = 27/83 (32%), Positives = 47/83 (56%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
            EL+KDY + Y P K N VADALSRK L ++S L + + +++ D
Sbjct: 975  ELIKDYDLVIDYHPRKANVVADALSRKSSSSLATLRSSYFSMLLEMKSLGIQLNNGEDGT 1034

Query: 441  MMGNMSVQPTLISRIIATQQNDE 373
            ++ + V+P+L+++I Q++D+
Sbjct: 1035 LLASFVVRPSLLNQIRELQKSDD 1057

>emb|CAN59997.1| hypothetical protein VITISV_020888 [Vitis vinifera]
          Length = 893

 Score = 145 bits (366), Expect(2) = 6e-42
 Identities = 63/109 (57%), Positives = 84/109 (77%)
 Frame = -3

Query: 328 ERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWW 149
           E + ++S+ + G + F GRLCVP D LR E+L +AHKAKYT+HPG+TKM+ DLKR +W
Sbjct: 473 EIDENWSMYEDGSVWFKGRLCVPKDVGLRNELLADAHKAKYTIHPGNTKMYQDLKRQFWC 532

Query: 148 DGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           +GMKRD+ +FVA C CQ VKA+HQ+P GLL L IP+WKW+ +TMDF+
Sbjct: 533 NGMKRDIAQFVANCQICQQVKAEHQRPAGLLQPLPIPEWKWDNITMDFV 581

 Score = 52.0 bits (123), Expect(2) = 6e-42
 Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ-GRMSLILMQKWWSDLEFLSQHCVCVSSTGDV 445
           E L+DY F L Y PGK N VADALSRK G++S + ++++ + +C+ G
Sbjct: 378 ETLEDYDFALHYHPGKANVVADALSRKNVGQLSSLELREFEMH-AVIEDFELCLGLEGHG 436

Query: 444 RMMGNMSVQPTLISRIIATQQNDETILNKKVSL 346
           + ++ +P +I RI+ Q +DE + K L
Sbjct: 437 PCLYSILARPMVIQRIVEAQVHDEFLEKVKAQL 469

>emb|CAN68955.1| hypothetical protein VITISV_014191 [Vitis vinifera]
          Length = 480

 Score = 147 bits (370), Expect(2) = 1e-41
 Identities = 63/109 (57%), Positives = 84/109 (77%)
 Frame = -3

Query: 328 ERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWW 149
           E + ++S+ G +RF GRLCVP D +LR E+L AH+AKY +H GSTKM+ DLKR +WW
Sbjct: 97  EVDENWSMHVDGSVRFRGRLCVPRDVZLRNELLTYAHRAKYIIHLGSTKMYQDLKRXFWW 156

Query: 148 DGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           GMKRD+V++VA C TCQ VK +HQ+P GLL L IP+WKW+++TMDF+
Sbjct: 157 SGMKRDIVQYVANCQTCQQVKTEHQRPVGLLQPLPIPEWKWDHITMDFV 205

 Score = 49.3 bits (116), Expect(2) = 1e-41
 Identities = 31/93 (33%), Positives = 50/93 (53%), Gaps = 1/93 (1%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ-GRMSLILMQKWWSDLEFLSQHCVCVSSTGDV 445
           E L+DY F Y PGK N V DALSRK G++S + ++++ + + +C+S G
Sbjct: 2   ETLEDYDFAPHYHPGKANVVVDALSRKSYGQLSSLGLREFEMH-AVIEDYELCLSWEGQG 60

Query: 444 RMMGNMSVQPTLISRIIATQQNDETILNKKVSL 346
           + ++ +P I RI+ Q +DE + K L
Sbjct: 61  PCLYSILARPMFIQRIVEAQVHDEFLEKVKARL 93

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score = 135 bits (340), Expect(2) = 7e-41
 Identities = 55/98 (56%), Positives = 75/98 (76%)
 Frame = -3

Query: 295  GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
            G LRF GR+CVP L +L E H+++Y++HPG+TKM+ DL++ YWW GM+RD+ +FV
Sbjct: 1171 GVLRFAGRICVPRVGDLIQLILSEGHESRYSIHPGTTKMYRDLRQHYWWSGMRRDIADFV 1230

Query: 115  ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
            +RCL CQ VKA+H +PGG+ L IP+WKWE +TMDFI
Sbjct: 1231 SRCLCCQQVKAEHLRPGGVFKRLPIPEWKWERITMDFI 1268

 Score = 58.5 bits (140), Expect(2) = 7e-41
 Identities = 34/91 (37%), Positives = 53/91 (58%), Gaps = 5/91 (5%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRK---QGRMSLILMQK--WWSDLEFLSQHCVCVSS 457
            ELLKDY + Y PGK N VADALSRK G ++ + +++ D++FL+ V +
Sbjct: 1061 ELLKDYDLSILYHPGKANVVADALSRKAVSMGSLAFLSVEERPLAMDIQFLANSMVRLDI 1120

Query: 456  TGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
            + R++ +M VQ +L+ RI Q DE ++
Sbjct: 1121 SDSRRVLAHMGVQSSLLDRIRGCQFEDEALV 1151

>emb|CAA73042.1| polyprotein [Ananas comosus]
          Length = 871

 Score = 145 bits (365), Expect(2) = 9e-41
 Identities = 64/119 (53%), Positives = 86/119 (72%)
 Frame = -3

Query: 358 EGKFVFGKPEERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKM 179
           +GK V G D+++D G +RF GR+CVP D ++ ++L+EAH+A Y +HPG TKM
Sbjct: 490 KGKMVDGC----TGDFTLDGDGLMRFRGRICVPADSGIKEDILQEAHRAPYAIHPGGTKM 545

Query: 178 FMDLKRTYWWDGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           + DLK YWW G+K+DV EFVA+CLTCQ VKA+H+ P G L +L IP WKWE +TMDF+
Sbjct: 546 YKDLKLLYWWPGIKKDVGEFVAKCLTCQQVKAEHRVPAGKLQSLPIPVWKWEKITMDFV 604

 Score = 48.5 bits (114), Expect(2) = 9e-41
 Identities = 37/99 (37%), Positives = 50/99 (50%), Gaps = 6/99 (6%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQGR---MSLILMQKWWSDLEFLSQHCVCVSSTG 451
           ELLKDY + Y PGK N VADALSRK M ++ + ++ L V +
Sbjct: 402 ELLKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQMKRLELEIVTPDT-- 459

Query: 450 DVRMMGNMSVQPTLISRIIATQQND---ETILNKKVSLC 343
           +R+M + VQPTL+ RI Q +D + I K V C
Sbjct: 460 PMRLM-TLVVQPTLLDRIKEKQASDVELQKIKGKMVDGC 497

>ref|XP_007023888.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508779254|gb|EOY26510.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1290

 Score = 145 bits (365), Expect(2) = 1e-40
 Identities = 60/106 (56%), Positives = 79/106 (74%)
 Frame = -3

Query: 319 SDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGM 140
           S++ + D G L R+CVP D++LR +LEEAH + Y +HPGSTKM+ +K +YWW GM
Sbjct: 861 SEFRLSDDGTLMLRDRICVPKDDQLRRAILEEAHSSAYALHPGSTKMYQTIKESYWWPGM 920

Query: 139 KRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           KRD+ EFVA+CL CQ +KA+HQK G L L IP+WKWE+VTMDF+
Sbjct: 921 KRDIAEFVAKCLICQQIKAEHQKSSGTLQPLPIPEWKWEHVTMDFV 966

 Score = 48.1 bits (113), Expect(2) = 1e-40
 Identities = 26/83 (31%), Positives = 45/83 (54%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
           EL+KDY + Y PGK N V DALSRK L ++ L + + +++ D
Sbjct: 764 ELIKDYDLVIDYHPGKANVVTDALSRKSSSSLATLRSSYFPMLLEMKSLGIQLNNGEDGT 823

Query: 441 MMGNMSVQPTLISRIIATQQNDE 373
           ++ + V+P+L+++I Q+ D+
Sbjct: 824 LLASFVVRPSLLNQIRELQKFDD 846

>ref|XP_007036977.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508774222|gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score = 143 bits (361), Expect(2) = 4e-40
 Identities = 61/98 (62%), Positives = 77/98 (78%)
 Frame = -3

Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
           G LR+ RL VP+ + LR E+LEEAH A Y VHPG+TKM+ DLK YWW+G+KRDV EFV
Sbjct: 616 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 675

Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           ++CL CQ VKA+HQKP GLL L +P+WKWE++ MDF+
Sbjct: 676 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFV 713

 Score = 47.8 bits (112), Expect(2) = 4e-40
 Identities = 33/95 (34%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469
           ELLKDY + Y PGK N VADALSRK GR SL+ ++ L V
Sbjct: 508 ELLKDYDCTILYHPGKANVVADALSRKSMGSLAHIFIGRRSLV------REIHSLGDIGV 561

Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
           + ++ + V+P L+ RI Q DE ++
Sbjct: 562 RLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVI 596

>ref|XP_007023829.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508779195|gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score = 143 bits (361), Expect(2) = 4e-40
 Identities = 61/98 (62%), Positives = 77/98 (78%)
 Frame = -3

Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
           G LR+ RL VP+ + LR E+LEEAH A Y VHPG+TKM+ DLK YWW+G+KRDV EFV
Sbjct: 239 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 298

Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           ++CL CQ VKA+HQKP GLL L +P+WKWE++ MDF+
Sbjct: 299 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFV 336

 Score = 47.8 bits (112), Expect(2) = 4e-40
 Identities = 33/95 (34%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469
           ELLKDY + Y PGK N VADALSRK GR SL+ ++ L V
Sbjct: 131 ELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV------REIHSLGDIGV 184

Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
           + ++ + V+P L+ RI Q DE ++
Sbjct: 185 RLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVI 219

>ref|XP_002268718.2| PREDICTED: HIPL1 protein-like [Vitis vinifera]
          Length = 937

 Score = 150 bits (378), Expect(2) = 7e-40
 Identities = 64/105 (60%), Positives = 81/105 (77%)
 Frame = -3

Query: 316 DYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMK 137
           D+ + D G LRF+ RLCVPND LR E LEEAH ++ +HPG TKM+ DL++ YWW GMK
Sbjct: 102 DFVLSDDGILRFMTRLCVPNDGDLRREFLEEAHCSRLAIHPGGTKMYKDLRQNYWWSGMK 161

Query: 136 RDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           RD+ +FVARCL CQ VKA+HQ+P G L L IP+WKWE++TMDF+
Sbjct: 162 RDIAQFVARCLVCQQVKAEHQQPVGSLQPLAIPEWKWEHITMDFV 206

 Score = 40.4 bits (93), Expect(2) = 7e-40
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
 Frame = -1

Query: 588 YRPGKVNKVADALSRKQ-GRMSLI--LMQKWWSDLEFLSQHCVCVSSTGDVRMMGNMSVQ 418
           Y GK N VADALS+K G ++ I ++ DL + H + S ++ N VQ
Sbjct: 15  YHLGKANAVADALSKKSVGSLAAIRGCQRQLLEDLRSVQVHMRVLDSGA---LVANFRVQ 71

Query: 417 PTLISRIIATQQND 376
           P L+ RI A Q+ND
Sbjct: 72  PNLVGRIKALQKND 85

>ref|XP_007010454.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
 gi|508727367|gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
          Length = 860

 Score = 143 bits (361), Expect(2) = 7e-40
 Identities = 61/98 (62%), Positives = 77/98 (78%)
 Frame = -3

Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
           G LR+ RL VP+ + LR E+LEEAH A Y VHPG+TKM+ DLK YWW+G+KRDV EFV
Sbjct: 450 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 509

Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           ++CL CQ VKA+HQKP GLL L +P+WKWE++ MDF+
Sbjct: 510 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFV 547

 Score = 47.0 bits (110), Expect(2) = 7e-40
 Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469
           ELLKDY + Y PGK N VADALSRK GR SL+ ++ L V
Sbjct: 342 ELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV------REIHSLGDIGV 395

Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
           + ++ + V+P L+ +I Q DE ++
Sbjct: 396 RLEVAETSALLAHFRVRPILMDKIKEAQSKDEFVI 430

>gb|AAP43915.1| integrase [Gossypium herbaceum]
          Length = 350

 Score = 141 bits (355), Expect(2) = 7e-40
 Identities = 60/108 (55%), Positives = 80/108 (74%)
 Frame = -3

Query: 325 RNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWD 146
           + S++ +DD LRF RLCVP + +L +L EAH ++ +HPGSTKM+ DLKR +WW
Sbjct: 155 KESEFQIDDDDCLRFRSRLCVPKNSELILIILNEAHCSRMAIHPGSTKMYNDLKRRFWWH 214

Query: 145 GMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           GMKRD+ +FV+RCL CQ VKA+HQ P GLL + IP+WKW+ VTMDF+
Sbjct: 215 GMKRDIFDFVSRCLICQQVKAEHQVPSGLLQPITIPEWKWDRVTMDFV 262

 Score = 49.3 bits (116), Expect(2) = 7e-40
 Identities = 36/103 (34%), Positives = 49/103 (47%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
           ELLKDY + Y PGK N VADALSRK L L V +S D
Sbjct: 74  ELLKDYELVIDYHPGKANMVADALSRK--------------SLFALRAMNVYLSILPDNV 119

Query: 441 MMGNMSVQPTLISRIIATQQNDETILNKKVSLCLVSLRSGIQI 313
           ++ + +P L +I Q+ DE +L K+ C+++ S QI
Sbjct: 120 LVAELKAKPLLTHQIREAQKVDEELLAKRAE-CVLNKESEFQI 161

>ref|XP_007032149.1| CCHC-type integrase [Theobroma cacao]
 gi|508711178|gb|EOY03075.1| CCHC-type integrase [Theobroma cacao]
          Length = 246

 Score = 142 bits (357), Expect(2) = 7e-40
 Identities = 60/98 (61%), Positives = 76/98 (77%)
 Frame = -3

Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
           G LR+ RL VP+ + LR E+LEEAH A Y VHPG+TKM+ DLK YWW+G+KRDV EFV
Sbjct: 110 GVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 169

Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           ++CL CQ VK +HQKP GLL L +P+WKWE++ MDF+
Sbjct: 170 SKCLVCQQVKVEHQKPAGLLQPLPVPEWKWEHIAMDFV 207

 Score = 48.5 bits (114), Expect(2) = 7e-40
 Identities = 32/95 (33%), Positives = 46/95 (48%), Gaps = 9/95 (9%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469
           ELLKDY + + Y PGK N VADALSRK GR SL+ ++ L V
Sbjct: 2   ELLKDYDYTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV------REIHSLGDIGV 55

Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
           + ++ + V+P L+ +I Q DE ++
Sbjct: 56  RLEVAETNALLAHFRVRPILMDKIKEAQSKDEFVI 90

>gb|AAD20658.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1611

 Score = 144 bits (362), Expect(2) = 1e-39
 Identities = 60/121 (49%), Positives = 88/121 (72%)
 Frame = -3

Query: 364  EQEGKFVFGKPEERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGST 185
            ++ + + G + ++Y + G + GR+CVPND L+ E+L EAH++K+++HPGS
Sbjct: 1120 QERDEEIKGWAQNNKTEYQTSNNGTIVVNGRVCVPNDRALKEEILREAHQSKFSIHPGSN 1179

Query: 184  KMFMDLKRTYWWDGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDF 5
            KM+ DLKR Y W GMK+DV +VA+C TCQ+VKA+HQ P GLL NL IP+WKW+++TMDF
Sbjct: 1180 KMYRDLKRYYHWVGMKKDVARWVAKCPTCQLVKAEHQVPSGLLQNLPIPEWKWDHITMDF 1239

Query: 4    I 2
            +
Sbjct: 1240 V 1240

 Score = 45.8 bits (107), Expect(2) = 1e-39
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 1/86 (1%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSDLEFLSQHCVCVSSTGDVR 442
            EL+ DY D+ Y PGK N+VADALSR R S + ++ DL + + + +V
Sbjct: 1044 ELVADYNLDIAYHPGKANQVADALSR---RRSEVEAERSQVDLVNMMGTLHVNALSKEVE 1100

Query: 441  MMG-NMSVQPTLISRIIATQQNDETI 367
            +G + Q L+SRI Q+ DE I
Sbjct: 1101 PLGLGAADQADLLSRIRLAQERDEEI 1126

>ref|XP_007028157.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
 gi|508716762|gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 937

 Score = 143 bits (361), Expect(2) = 1e-39
 Identities = 60/98 (61%), Positives = 77/98 (78%)
 Frame = -3

Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116
           G LR+ RL VP+ + LR E+LEEAH A Y +HPG+TKM+ DLK YWW+G+KRDV EFV
Sbjct: 469 GVLRYGTRLYVPDSDGLRREILEEAHMAAYVIHPGATKMYQDLKEVYWWEGLKRDVAEFV 528

Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
           ++CL CQ VKA+HQKP GLL L +P+WKWE++ MDF+
Sbjct: 529 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFV 566

 Score = 46.2 bits (108), Expect(2) = 1e-39
 Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
 Frame = -1

Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469
           ELLKDY + + PGK N VADALSRK GR SL+ ++ L V
Sbjct: 361 ELLKDYDCTILHHPGKANVVADALSRKSMGSLAHISIGRRSLV------KEIHSLGDIGV 414

Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364
           + ++ + V+P L+ RI Q DE ++
Sbjct: 415 RLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVI 449

>gb|ADB85337.1| putative retrotransposon protein [Phyllostachys edulis]
          Length = 1053

 Score = 137 bits (346), Expect(2) = 2e-39
 Identities = 58/104 (55%), Positives = 78/104 (75%)
Frame = -3 Query: 313 YSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKR 134 +S D+ G + F R+CVPN ++L+ +L+EAH++ Y++HPGSTKM+ DLK YWW MKR Sbjct: 605 FSEDEQGTVWFGNRICVPNQQELKQSILKEAHESPYSIHPGSTKMYQDLKEKYWWVSMKR 664 Query: 133 DVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2 ++ EFVA C CQ VKA+HQ+P GLL L IP+WKWE + MDFI Sbjct: 665 EIAEFVAHCDICQRVKAEHQRPAGLLQPLPIPEWKWEEIGMDFI 708 Score = 51.6 bits (122), Expect(2) = 2e-39 Identities = 39/101 (38%), Positives = 50/101 (49%), Gaps = 9/101 (8%) Frame = -1 Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQGRMSLILMQKWWSD---------LEFLSQHCV 469 EL+KDY + Y PGK N VADALSRK + IL+QK + LE ++Q CV Sbjct: 509 ELIKDYDLGIHYHPGKANVVADALSRK-AYCNTILVQKNQPELYEELKHLNLEIVNQGCV 567 Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETILNKKVSL 346 + VQPTL S+I Q DE I K ++ Sbjct: 568 -----------NALEVQPTLQSQIREKQLEDEDIKEIKKNM 597 >ref|XP_007028176.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716781|gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 142 bits (357), Expect(2) = 3e-39 Identities = 60/98 (61%), Positives = 77/98 (78%) Frame = -3 Query: 295 GGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYWWDGMKRDVVEFV 116 G LR+ RL VP+ + LR ++LEEAH A Y VHPG+TKM+ DLK YWW+G+KRDV EFV Sbjct: 333 GVLRYGTRLYVPDGDGLRRKILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFV 392 Query: 115 ARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2 ++CL CQ VKA+HQKP GLL L +P+WKWE++ MDF+ Sbjct: 393 SKCLVCQQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFV 430 Score = 46.6 bits (109), Expect(2) = 3e-39 Identities = 32/95 (33%), Positives = 45/95 (47%), Gaps = 9/95 (9%) Frame = -1 Query: 621 ELLKDYTFDLQYRPGKVNKVADALSRKQ---------GRMSLILMQKWWSDLEFLSQHCV 469 ELLKDY + Y PGK N VADALSRK GR SL+ ++ L V Sbjct: 225 ELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLV------REIHSLGDIGV 278 Query: 468 CVSSTGDVRMMGNMSVQPTLISRIIATQQNDETIL 364 + ++ + V+P L+ +I Q DE ++ Sbjct: 279 RLEVAETNALLAHFRVRPILMDKIKEAQSKDEFVI 313 >gb|ABG66286.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza 
 sativa Japonica Group]
          Length = 1759

 Score = 140 bits (354), Expect(2) = 4e-39
 Identities = 60/110 (54%), Positives = 85/110 (77%)
 Frame = -3

Query: 331  EERNSDYSVDDAGGLRFLGRLCVPNDEKLRAEVLEEAHKAKYTVHPGSTKMFMDLKRTYW 152
            E++++D+S+DD G + + R+CVP ++LR +L+EAH++ Y++HPGSTKM+ D+K +W
Sbjct: 1308 EKKDTDFSIDDQGTVWYGPRICVPAKKELRDLILKEAHESAYSIHPGSTKMYQDIKAYFW 1367

Query: 151  WDGMKRDVVEFVARCLTCQMVKAQHQKPGGLL*NLEIPQWKWEYVTMDFI 2
            W GMKRDV E+VA C CQ VKA+HQ+P GLL L IP+WKWE + MDFI
Sbjct: 1368 WAGMKRDVAEYVALCDICQRVKAEHQRPAGLLQPLPIPEWKWEEIGMDFI 1417

 Score = 47.4 bits (111), Expect(2) = 4e-39
 Identities = 37/91 (40%), Positives = 49/91 (53%), Gaps = 6/91 (6%)
 Frame = -1

Query: 621  ELLKDYTFDLQYRPGKVNKVADALSRK------QGRMSLILMQKWWSDLEFLSQHCVCVS 460
            EL+KDY + Y PGK N VADALSRK Q R + DLE L + V
Sbjct: 1218 ELIKDYDLGIHYHPGKANVVADALSRKTYCNVDQIRPD---QDRLCRDLEKLR---LTVV 1271

Query: 459  STGDVRMMGNMSVQPTLISRIIATQQNDETI 367
            +G + +++VQPTL S+I Q++DE I
Sbjct: 1272 QSG---VAASLTVQPTLESQIRKAQKDDEGI 1299