BLASTX nr result
ID: Catharanthus22_contig00002398
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00002398
         (1547 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...   269   2e-69
gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus pe...   267   1e-68
gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [...   267   1e-68
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...   262   3e-67
gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana]            249   3e-63
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]             247   1e-62
ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...   246   2e-62
ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, part...   243   1e-61
gb|EOY19559.1| T6D22.19, putative [Theobroma cacao]                   242   3e-61
gb|AAD48963.1|AF147263_5 contains similarity to transposases [Ar...   240   1e-60
ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Caps...   232   3e-58
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...   232   3e-58
ref|XP_006279274.1| hypothetical protein CARUB_v100165480mg, par...   227   1e-56
gb|AAF79806.1|AC020646_29 T32E20.13 [Arabidopsis thaliana]            224   1e-55
dbj|BAB02100.1| unnamed protein product [Arabidopsis thaliana]        214   6e-53
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...   213   2e-52
gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]        211   7e-52
gb|AAF78383.1|AC069551_16 T10O22.20 [Arabidopsis thaliana]            209   3e-51
gb|EOY16831.1| BED zinc finger,hAT family dimerization domain [T...
197 1e-47 gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p... 194 1e-46 >gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] Length = 696 Score = 269 bits (688), Expect = 2e-69 Identities = 131/266 (49%), Positives = 181/266 (68%), Gaps = 3/266 (1%) Frame = +2 Query: 758 VWTYARSIG-EKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDML 934 VWT + +++ + RA+C+ C + Y+ S +GT LK HI+ C+ + RD+G +L Sbjct: 51 VWTQFEILPIDENNEQRAKCMKCGQKYLCD-SRYGTGNLKRHIESCV--KTDTRDLGQLL 107 Query: 935 IDNTTGK--ARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRN 1108 + + G R K RE + +AII HDLPF FVEY GIR +F Y+ D+K +SRN Sbjct: 108 LSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVEYAGIRQLFNYVCADIKLVSRN 167 Query: 1109 TAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSC 1288 TA +DV +Y +K LK+ L VP R+CLT D+WT+ T++GY+CLT HF+D WK+ Sbjct: 168 TAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSITTDGYLCLTVHFIDVNWKLQKR 227 Query: 1289 ILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNS 1468 IL FS PPPH+GV L K+Y LL +WG++KK+FS+TLDNASSND ++L+ QL L+++ Sbjct: 228 ILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLDNASSNDTFVELLKGQLNLKDA 287 Query: 1469 LLYNCEFFHIRCSAHILNLIVQEGLK 1546 LL N +FFHIRC AHILNLIVQ+GLK Sbjct: 288 LLMNGKFFHIRCCAHILNLIVQDGLK 313 >gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus persica] Length = 567 Score = 267 bits (682), Expect = 1e-68 Identities = 130/266 (48%), Positives = 181/266 (68%), Gaps = 3/266 (1%) Frame = +2 Query: 758 VWTYARSIG-EKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDML 934 VWT+ + +++ + RA+C+ C + Y+ S +GT LK HI+ C+ + D+G +L Sbjct: 51 VWTHFEILHIDENNEQRAKCMKCGQKYLFD-SRYGTGNLKRHIESCVKIDTC--DLGQLL 107 Query: 935 IDNTTGK--ARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRN 1108 + + G R K RE + +AII HDLPF FVEY GIR +F Y+ D+K +SRN Sbjct: 108 LSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVEYSGIRQLFNYVCADIKLVSRN 167 Query: 1109 TAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSC 1288 TA +DV +Y +K LK+ L VP R+CLT D+WT+ T++GY+CLT HF+D WK+ Sbjct: 168 
TAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSITTDGYLCLTVHFIDVNWKLQKR 227 Query: 1289 ILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNS 1468 IL FS PPPH+GV L K+Y LL +WG++KK+FS+TLDNASSND ++L+ QL L+++ Sbjct: 228 ILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLDNASSNDTFVELLKGQLNLKDA 287 Query: 1469 LLYNCEFFHIRCSAHILNLIVQEGLK 1546 LL N +FFHIRC AHILNLIVQ+GLK Sbjct: 288 LLMNGKFFHIRCCAHILNLIVQDGLK 313 >gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [Prunus persica] Length = 613 Score = 267 bits (682), Expect = 1e-68 Identities = 131/266 (49%), Positives = 178/266 (66%), Gaps = 3/266 (1%) Frame = +2 Query: 758 VWTYARSIG-EKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDML 934 VWT + +++ + RA+C+ C + Y+ S +GT LK HI+ C+ + RD+G +L Sbjct: 52 VWTQFEILPIDENNEQRAKCMKCGQKYLCD-SRYGTGNLKRHIESCV--KTDTRDLGQLL 108 Query: 935 IDNTTGK--ARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRN 1108 + G R K RE + +AII HDLPF FVEY GIR +F Y+ D+K +SRN Sbjct: 109 LSKYDGAILTRSSKFDPMKFRELLLMAIIMHDLPFQFVEYAGIRQLFNYVCADIKLVSRN 168 Query: 1109 TAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSC 1288 A +DV +Y +K LK+ L VP R+CLTFD+WT+ T++GY+CLT HF+D WK Sbjct: 169 IAKADVLSLYNREKAKLKEILGSVPGRVCLTFDLWTSITTDGYLCLTVHFIDVNWKWEKI 228 Query: 1289 ILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNS 1468 IL FS PPPH+GV L K+Y LL +WG+ KK+FS+TLDNASSND ++L+ QL L+++ Sbjct: 229 ILNFSFMPPPHTGVALCEKIYRLLTDWGVKKKLFSMTLDNASSNDTFVELLKGQLNLKDA 288 Query: 1469 LLYNCEFFHIRCSAHILNLIVQEGLK 1546 LL N +FFHIRC AHILNLIVQ+GLK Sbjct: 289 LLMNGKFFHIRCCAHILNLIVQDGLK 314 >gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica] Length = 697 Score = 262 bits (669), Expect = 3e-67 Identities = 128/266 (48%), Positives = 179/266 (67%), Gaps = 3/266 (1%) Frame = +2 Query: 758 VWTYARSIG-EKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDML 934 VWT + +++ + RA+C+ C + Y+ S +GT LK HI+ C+ + RD+G +L Sbjct: 52 VWTQFEILPIDENNEQRAKCMKCGQKYLCD-SRYGTRNLKRHIESCV--KTDTRDLGQLL 108 Query: 935 
IDNTTGK--ARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRN 1108 + + G R K RE + +AII HDLPF FVEY GIR +F Y+ D+K +SRN Sbjct: 109 LSKSDGAILTRSSKFDPMKFRELLVMAIITHDLPFQFVEYSGIRQLFNYVCADIKLVSRN 168 Query: 1109 TAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSC 1288 TA +DV +Y +K LK+ L VP R+CL D+WT+ T++GY+CLT HF+D WK+ Sbjct: 169 TAKADVLSLYNREKAKLKEILDSVPGRVCLASDLWTSITTDGYLCLTVHFIDVNWKLQKR 228 Query: 1289 ILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNS 1468 IL FS PPPH+GV L K+Y LL +WG++KK+FS+TLDNASSND ++L+ Q L+++ Sbjct: 229 ILNFSFMPPPHTGVTLCEKIYKLLTDWGVEKKLFSMTLDNASSNDTFVELLKGQPNLKDA 288 Query: 1469 LLYNCEFFHIRCSAHILNLIVQEGLK 1546 LL N +FF+IRC AHILNLIVQ+GLK Sbjct: 289 LLMNGKFFYIRCCAHILNLIVQDGLK 314 >gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana] Length = 633 Score = 249 bits (635), Expect = 3e-63 Identities = 126/254 (49%), Positives = 172/254 (67%), Gaps = 2/254 (0%) Frame = +2 Query: 791 DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGKARK-- 964 +GK C C++ Y +GT+T+ H+ C + T G + Sbjct: 68 NGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC---------------EKTPGSTPRIS 112 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 RK+ V RE IA+A+++H+LP+SFVEYE IR+ FTY N ++ SRNTAASDV+K+Y Sbjct: 113 RKVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPSIEFWSRNTAASDVYKIYER 172 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +K LK+ LA +P RICLT D+W A T E YICLTAH+VD + + IL+FS FPPPHS Sbjct: 173 EKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFSAFPPPHS 232 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRC 1504 GV ++ K+ +LLK+WGI+KKIF++T+DNAS+ND MQ IL+ + LQ L+ + EFFH+RC Sbjct: 233 GVAIAMKLSELLKDWGIEKKIFTLTVDNASANDTMQSILKRK--LQKDLVCSGEFFHVRC 290 Query: 1505 SAHILNLIVQEGLK 1546 SAHILNLIVQ+GL+ Sbjct: 291 SAHILNLIVQDGLE 304 >gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana] Length = 745 Score = 247 bits (630), Expect = 1e-62 Identities = 124/254 (48%), Positives = 172/254 (67%), Gaps = 2/254 (0%) Frame 
= +2 Query: 791 DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGKARK-- 964 +GK C C++ Y +GT+T+ H+ C + T G + Sbjct: 158 NGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC---------------EKTPGSTPRIS 202 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 RK+ V RE IA+A+++H+LP+SFVEYE IR+ FTY+N ++ SRNTAASDV+K+Y Sbjct: 203 RKVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYVNPSIEFWSRNTAASDVYKIYER 262 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +K LK+ LA +P RICLT D+W A T E YICLTAH+VD + + IL+F FPPPHS Sbjct: 263 EKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFCAFPPPHS 322 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRC 1504 GV ++ K+ +LLK+WGI+KK+F++T+DNAS+ND MQ IL+ + LQ L+ + EFFH+RC Sbjct: 323 GVAIAMKLSELLKDWGIEKKVFTLTVDNASANDTMQSILKRK--LQKHLVCSGEFFHVRC 380 Query: 1505 SAHILNLIVQEGLK 1546 SAHILNLIVQ+GL+ Sbjct: 381 SAHILNLIVQDGLE 394 >ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] gi|482560944|gb|EOA25135.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] Length = 547 Score = 246 bits (628), Expect = 2e-62 Identities = 122/194 (62%), Positives = 149/194 (76%) Frame = +2 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 RKI V+RE I L II HDLPFSFVEY +R++ YLN + K ISRNTA +DV K + Sbjct: 6 RKIDHSVVRELITLVIICHDLPFSFVEYPRVRELLKYLNPEYKTISRNTAVADVLKFHGI 65 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +K +KQ LA V +RICLT DVW + + EGYICLTAH+VD +WK+ S IL+F PPPHS Sbjct: 66 RKEQMKQELAGVGNRICLTCDVWRSISIEGYICLTAHYVDDSWKLKSKILSFCAMPPPHS 125 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRC 1504 G EL++KV L++WGI+KKIFS+TLDNASSNDNMQ ILR+QL ++ LL + EFFHIRC Sbjct: 126 GFELAKKVLSCLEDWGIEKKIFSLTLDNASSNDNMQSILRDQLSSRHGLLCDGEFFHIRC 185 Query: 1505 SAHILNLIVQEGLK 1546 SAH+LNLIVQ GLK Sbjct: 186 SAHVLNLIVQVGLK 199 >ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, partial [Capsella rubella] 
gi|482548132|gb|EOA12330.1| hypothetical protein CARUB_v10007925mg, partial [Capsella rubella] Length = 539 Score = 243 bits (621), Expect = 1e-61 Identities = 125/265 (47%), Positives = 173/265 (65%), Gaps = 4/265 (1%) Frame = +2 Query: 761 WTYARSIGEKDGK----PRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGD 928 W + I +K+ K RA+C CK Y +GT + H++ C L +K DV Sbjct: 128 WEHFTVIKKKNNKGEIVERAQCNHCKHDYAYHSHKNGTKSYNRHMETCKVLISKV-DVSK 186 Query: 929 MLIDNTTGKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRN 1108 M++ N K + +KI V RE +A II+HDLPF++VEYE + ISRN Sbjct: 187 MML-NAEAKLQAKKIDHMVFREMVAKCIIQHDLPFAYVEYE-------------RFISRN 232 Query: 1109 TAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSC 1288 TAA+DV+K Y N+ + LK+ LA++P RI T D+WTA T EGY+CLTAH+VD WK+N+ Sbjct: 233 TAAADVYKFYENEADNLKRELANLPGRISFTSDLWTAITQEGYMCLTAHYVDRNWKLNNK 292 Query: 1289 ILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNS 1468 I+AF F PPHSG+ ++ K+ + ++WG+ KK+FSIT DNASSND+ Q+IL+ QL L N+ Sbjct: 293 IIAFFAFAPPHSGMHIAMKILEKWEDWGVQKKVFSITFDNASSNDSSQEILKSQLVLHNN 352 Query: 1469 LLYNCEFFHIRCSAHILNLIVQEGL 1543 LL E+FH+RC+AHILN+IVQ GL Sbjct: 353 LLCGGEYFHVRCAAHILNIIVQIGL 377 >gb|EOY19559.1| T6D22.19, putative [Theobroma cacao] Length = 559 Score = 242 bits (618), Expect = 3e-61 Identities = 126/233 (54%), Positives = 160/233 (68%), Gaps = 5/233 (2%) Frame = +2 Query: 755 DVWTYARSIGEK-DGKPRAECLGCKKVYIAG----GSTHGTSTLKHHIDKCLPLRAKFRD 919 +VW Y IG+K DG RA C GCK Y G GS +GTS L+ HID C + + + Sbjct: 45 NVWNYFTKIGKKQDGVERATCNGCKTEYKVGPKPGGSNYGTSHLRRHIDTCKFI--SYFN 102 Query: 920 VGDMLIDNTTGKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHI 1099 MLID GK + RK ++ R+ +A AIIKHDLP++FVEY+ IR Y+N DV Sbjct: 103 PHQMLIDYE-GKVKARKFDPRISRDMLAEAIIKHDLPYAFVEYDKIRAWAKYVNPDVVMP 161 Query: 1100 SRNTAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKM 1279 SRNT SDV +++ +K LKQ +A VP+RICLT DVWTA TSEGYICLTAHFV+ WK+ Sbjct: 162 SRNTTVSDVQRIHLREKEKLKQAMAKVPNRICLTSDVWTASTSEGYICLTAHFVNKNWKL 221 Query: 1280 
NSCILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDI 1438 S +L F PPPH+GVEL+ ++D LKEWGID+K+FS+TLDNAS+NDNMQ + Sbjct: 222 CSKLLNFCRMPPPHTGVELAATIFDCLKEWGIDRKVFSLTLDNASANDNMQGV 274 >gb|AAD48963.1|AF147263_5 contains similarity to transposases [Arabidopsis thaliana] gi|7267311|emb|CAB81093.1| AT4g05510 [Arabidopsis thaliana] Length = 604 Score = 240 bits (612), Expect = 1e-60 Identities = 129/263 (49%), Positives = 170/263 (64%) Frame = +2 Query: 755 DVWTYARSIGEKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDML 934 D+W Y E DGK A C C K Y +T GTS L H KC + DVG Sbjct: 39 DMWDYFTLEDENDGKI-AYCKKCLKPYPILPTT-GTSNLIRHHRKC----SMGLDVG--- 89 Query: 935 IDNTTGKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTA 1114 + KI KV+REK + II+HDLPF VEYE +RD +Y+N D K +RNTA Sbjct: 90 -------RKTTKIDHKVVREKFSRVIIRHDLPFLCVEYEELRDFISYMNPDYKCYTRNTA 142 Query: 1115 ASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCIL 1294 A+DV K + +K LK L +PSRICLT D WT+ +GYI LTAH+VD W +NS IL Sbjct: 143 AADVVKTWEKEKQILKSELERIPSRICLTSDCWTSLGGDGYIVLTAHYVDTRWILNSKIL 202 Query: 1295 AFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLL 1474 +FSD PPH+G L+ K+++ LKEWGI+KK+F++TLDNA++N++MQ++L ++L L N+L+ Sbjct: 203 SFSDMLPPHTGDALASKIHECLKEWGIEKKVFTLTLDNATANNSMQEVLIDRLKLDNNLM 262 Query: 1475 YNCEFFHIRCSAHILNLIVQEGL 1543 EFFH+RC AH+LN IVQ GL Sbjct: 263 CKGEFFHVRCCAHVLNRIVQNGL 285 >ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Capsella rubella] gi|482549037|gb|EOA13231.1| hypothetical protein CARUB_v10026257mg [Capsella rubella] Length = 508 Score = 232 bits (592), Expect = 3e-58 Identities = 111/198 (56%), Positives = 148/198 (74%) Frame = +2 Query: 953 KARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWK 1132 K R +KI QK++REK + +I+HDLPFS VEYE +RD Y+N D +RNTAASDV K Sbjct: 8 KLRAKKIDQKIVREKFSRVLIRHDLPFSAVEYEELRDFLKYMNPDYISYTRNTAASDVIK 67 Query: 1133 VYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFP 1312 + +K LK L ++PSRICLT D WTA + EGYI L AH+VD +N+ IL+F D Sbjct: 68 
TWKTEKEKLKLELENIPSRICLTSDCWTAVSGEGYISLMAHYVDEKGLLNNKILSFCDIL 127 Query: 1313 PPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFF 1492 PPH+G L+ K+++ L++WGI+KK+F++TLDNA++ND MQDIL+E+L L ++LL EFF Sbjct: 128 PPHTGEALATKIHECLRDWGIEKKVFTLTLDNATANDTMQDILKERLNLDHNLLCEGEFF 187 Query: 1493 HIRCSAHILNLIVQEGLK 1546 H+RC AHILNLIVQ+GLK Sbjct: 188 HVRCCAHILNLIVQDGLK 205 >gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana] Length = 577 Score = 232 bits (592), Expect = 3e-58 Identities = 111/194 (57%), Positives = 149/194 (76%) Frame = +2 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 RK+ V RE IA+A+++H+LP+SFVEYE IR+ FTY N ++ SRNTAA DV+K+Y Sbjct: 20 RKVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPSIEFWSRNTAAFDVYKIYER 79 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +K LK+ LA +P RICLT D+W A T E YICLTAH+VD + + IL+F FPPPHS Sbjct: 80 EKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFCAFPPPHS 139 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRC 1504 GV ++ K+ +LLK+WGI+KK+F++T+DNAS+ND MQ IL+ + LQ L+ + EFFH+RC Sbjct: 140 GVAIAMKLSELLKDWGIEKKVFTLTVDNASANDTMQSILKRK--LQKDLVCSGEFFHVRC 197 Query: 1505 SAHILNLIVQEGLK 1546 SAHILNLIVQ+GL+ Sbjct: 198 SAHILNLIVQDGLE 211 >ref|XP_006279274.1| hypothetical protein CARUB_v100165480mg, partial [Capsella rubella] gi|482547953|gb|EOA12172.1| hypothetical protein CARUB_v100165480mg, partial [Capsella rubella] Length = 288 Score = 227 bits (578), Expect = 1e-56 Identities = 118/256 (46%), Positives = 168/256 (65%), Gaps = 2/256 (0%) Frame = +2 Query: 782 GEK--DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGK 955 GEK DGK C C K+Y +GT+TL+ H C + +DVGDM++ GK Sbjct: 45 GEKGPDGKEDVRCNYCGKIYHWNLHRNGTTTLERHWKAC-KRSPRDQDVGDMMM-TYEGK 102 Query: 956 ARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKV 1135 + RK+ QK+ RE IA+AI++H LP+SFV+Y IR+ TY N D+KH SRNT +SD +K+ Sbjct: 103 LKARKLDQKIFREMIAMAIVEHGLPYSFVDYRRIRNALTYANPDIKHWSRNTTSSDCYKL 162 Query: 1136 
YTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPP 1315 + +K LK LA++P+ H++D+ WK+ + IL+FS FPP Sbjct: 163 FEKEKASLKMELANIPT----------------------HYIDSDWKLQNKILSFSAFPP 200 Query: 1316 PHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFH 1495 PH+G ++ K+ +LL+EWG++KK+FS+T+DNA++ND+MQ IL++ LQ LL N EFFH Sbjct: 201 PHTGYAIAMKLIELLREWGLEKKVFSLTVDNATANDSMQTILKKN--LQRDLLCNGEFFH 258 Query: 1496 IRCSAHILNLIVQEGL 1543 +RCSAHILNLIVQ+GL Sbjct: 259 VRCSAHILNLIVQDGL 274 >gb|AAF79806.1|AC020646_29 T32E20.13 [Arabidopsis thaliana] Length = 1335 Score = 224 bits (570), Expect = 1e-55 Identities = 108/199 (54%), Positives = 141/199 (70%) Frame = +2 Query: 950 GKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVW 1129 G K ++ EK+ II HDLPFS VEYE +R F YLN D + +RNT A+DV Sbjct: 63 GDDGKNRVKCNYCGEKLVRVIIWHDLPFSVVEYEELRRFFQYLNPDYSYYTRNTEATDVV 122 Query: 1130 KVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDF 1309 K + ++KN LK LA + SRICL FD WTA EGYI LTAH+VD +W +NS IL+F D Sbjct: 123 KTWDSEKNKLKMDLAKIQSRICLAFDCWTAIAGEGYITLTAHYVDESWTLNSKILSFCDI 182 Query: 1310 PPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEF 1489 PPPH+ L+ K+++ LKEWGI++ IF++TLDNA +ND MQ+IL+E+L L ++LL E Sbjct: 183 PPPHTSDALATKIHECLKEWGIEENIFTLTLDNALANDTMQEILKERLNLDDNLLCGGEL 242 Query: 1490 FHIRCSAHILNLIVQEGLK 1546 FH++C AHILNLIVQ+GLK Sbjct: 243 FHVQCCAHILNLIVQDGLK 261 >dbj|BAB02100.1| unnamed protein product [Arabidopsis thaliana] Length = 463 Score = 214 bits (546), Expect = 6e-53 Identities = 124/274 (45%), Positives = 163/274 (59%), Gaps = 11/274 (4%) Frame = +2 Query: 755 DVWTYARSIGE--KDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGD 928 DVW R I E +DGK R C+ K I S GTS LK H+ C Sbjct: 61 DVWKEFRPILELEEDGKQRGRCIHYDKKLIIENS-QGTSALKRHLQIC------------ 107 Query: 929 MLIDNTTGKARKRKISQKVL------REKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDV 1090 + R + +S+K++ RE ++ I+ HDLPF +VEYE +R YLN + Sbjct: 108 --------QKRPQVLSEKIVYDHKVDREMVSEIIVYHDLPFRYVEYEKVRARDKYLNPNC 159 Query: 1091 
KHISRNTAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTAC-TSEGYICLTAHFVDA 1267 + I R TA +DV+K Y +K LK+ R+C T D+WTA GYICLTAH+VD Sbjct: 160 QPICRQTAGNDVFKRYELEKGKLKKFFEQFRGRVCCTADLWTARGIVTGYICLTAHYVDD 219 Query: 1268 TWKMNSCILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDIL-- 1441 W++N+ ILAF D PPH+G EL+ K+ LKEWG++KKIFS+TLDNA +ND+MQ IL Sbjct: 220 EWRLNNKILAFCDMKPPHTGEELANKILSCLKEWGLEKKIFSLTLDNARNNDSMQSILKH 279 Query: 1442 REQLCLQNSLLYNCEFFHIRCSAHILNLIVQEGL 1543 R Q+ N LL + +FFH+RC AH+LNLIVQEGL Sbjct: 280 RLQMISGNGLLCDGKFFHVRCCAHVLNLIVQEGL 313 >gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana] Length = 659 Score = 213 bits (542), Expect = 2e-52 Identities = 118/264 (44%), Positives = 162/264 (61%), Gaps = 2/264 (0%) Frame = +2 Query: 761 WTYARSIG-EKDGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLI 937 W S+G E+DGK RA C C + + ++GTST+ H+ C P R + Sbjct: 37 WDEFTSVGIEEDGKERARCHHCG-IKLVVEKSYGTSTMNRHLTLC-PERPQ--------- 85 Query: 938 DNTTGKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAA 1117 + K KV RE + II HD+PF +VEYE +R +LN D K I R TAA Sbjct: 86 -----PETRPKYDHKVDREMTSEIIIYHDMPFRYVEYEKVRARDKFLNPDCKPICRQTAA 140 Query: 1118 SDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTA-CTSEGYICLTAHFVDATWKMNSCIL 1294 DV+K + +K L A ++CLT D+W++ T GYIC+T+H++D +W++N+ IL Sbjct: 141 LDVFKRFEIEKAKLIDVFAKHNGQVCLTADLWSSRSTVTGYICVTSHYIDESWRLNNKIL 200 Query: 1295 AFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLL 1474 AF D PPH+G E+++KVYD LKEWG++KKI +ITLDNAS+N +MQ IL+ +L N LL Sbjct: 201 AFCDLKPPHNGEEIAKKVYDCLKEWGLEKKILTITLDNASANTSMQTILKHRLQSGNGLL 260 Query: 1475 YNCEFFHIRCSAHILNLIVQEGLK 1546 F H+RC AHILNLIVQ GL+ Sbjct: 261 CGGNFLHVRCCAHILNLIVQAGLE 284 >gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] Length = 682 Score = 211 bits (537), Expect = 7e-52 Identities = 114/252 (45%), Positives = 153/252 (60%) Frame = +2 Query: 791 DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGKARKRK 970 DG RA C C S +GTS K H + C P R V + D + K K Sbjct: 74 
DGIARAICKYCDGGPTLAYSGNGTSNFKRHTETC-PKRPLL-GVAHLTSDGSFIK----K 127 Query: 971 ISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTNKK 1150 + V +E++ALA+I+H PFS+ EY+G R + LN K ISRNT + K++ +K Sbjct: 128 MDPLVYKERVALAVIRHAFPFSYAEYDGNRWLHEGLNESYKPISRNTLRNYCMKIHKREK 187 Query: 1151 NCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHSGV 1330 LK++L+++P +ICLT D+WTA GYI LTAH++D+ W ++S IL F PPH Sbjct: 188 QILKESLSNLPGKICLTTDMWTAFVGMGYISLTAHYIDSEWNLHSKILNFCHLEPPHDAP 247 Query: 1331 ELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRCSA 1510 L +Y LKEW I KIF+ITLDNA NDNMQD+L L L + +L + E+FH+RC+A Sbjct: 248 SLHDSIYAKLKEWDIRSKIFTITLDNARCNDNMQDLLMNSLSLHSPILCDGEYFHVRCAA 307 Query: 1511 HILNLIVQEGLK 1546 HILNLIVQ+GLK Sbjct: 308 HILNLIVQDGLK 319 >gb|AAF78383.1|AC069551_16 T10O22.20 [Arabidopsis thaliana] Length = 876 Score = 209 bits (532), Expect = 3e-51 Identities = 106/231 (45%), Positives = 149/231 (64%), Gaps = 2/231 (0%) Frame = +2 Query: 791 DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGKARK-- 964 +GK C C++ Y +GT+T+ H+ C + T G + Sbjct: 242 NGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC---------------EKTPGSTPRIS 286 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 RK+ V RE IA+A+++H+LP+SFVEYE IR+ FTY N ++ SRNTAASDV+K+Y Sbjct: 287 RKVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPYIEFWSRNTAASDVYKIYER 346 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +K LK+ LA +P RICLT D+W A T E YICLTAH+VD + + IL+F FPPPHS Sbjct: 347 EKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFCAFPPPHS 406 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLY 1477 GV ++ K+ +LLK+WGI+KK+F++T+DNAS+ND MQ IL+ +N+ LY Sbjct: 407 GVAIAMKLNELLKDWGIEKKVFTLTVDNASANDTMQSILKHN-ASENAFLY 456 >gb|EOY16831.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao] Length = 495 Score = 197 bits (500), Expect = 1e-47 Identities = 113/268 (42%), Positives = 149/268 (55%), Gaps = 5/268 (1%) Frame = +2 Query: 758 
VWTYARSIGEK---DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGD 928 +WT+ + EK DGK + +C C + + S +G LK H D C+ R RD+G Sbjct: 49 LWTFFERLPEKNSSDGKSKVKCKLCGYI-LNYESKYGIGNLKRHNDNCV--RKDTRDIGQ 105 Query: 929 MLI--DNTTGKARKRKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHIS 1102 M+ ++ + R K + RE + AI+ H+LP SFVEY GI+ + YL DV IS Sbjct: 106 MIFSKEHNSMLMRSSKFDLEKFRELVVAAIVMHNLPLSFVEYTGIKSMLPYLREDVVLIS 165 Query: 1103 RNTAASDVWKVYTNKKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMN 1282 RNT +D+ K Y+CLTAHFV+ W + Sbjct: 166 RNTVKADIIK----------------------------------YLCLTAHFVNKNWVLQ 191 Query: 1283 SCILAFSDFPPPHSGVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQ 1462 IL FS PPPH+GV LS K+Y LL EWGI+ K+FSITLDNAS+ND D+L+ QL ++ Sbjct: 192 KRILNFSFMPPPHNGVALSEKIYALLVEWGIESKLFSITLDNASANDTFVDLLKVQLIMR 251 Query: 1463 NSLLYNCEFFHIRCSAHILNLIVQEGLK 1546 LL +FFHIRC AHILNLIVQ+GLK Sbjct: 252 KQLLGRGKFFHIRCCAHILNLIVQDGLK 279 >gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] Length = 678 Score = 194 bits (492), Expect = 1e-46 Identities = 103/254 (40%), Positives = 153/254 (60%), Gaps = 2/254 (0%) Frame = +2 Query: 791 DGKPRAECLGCKKVYIAGGSTHGTSTLKHHIDKCLPLRAKFRDVGDMLIDNTTGKA--RK 964 DGK A+C C + + S H LK + + C+ R++G M+ N G R Sbjct: 56 DGKAIAKCKHCG-IVLNCDSKHEIDNLKRYSENCVG--GDTREIGQMISSNQHGSTLTRS 112 Query: 965 RKISQKVLREKIALAIIKHDLPFSFVEYEGIRDIFTYLNLDVKHISRNTAASDVWKVYTN 1144 + + RE + AI H+LP SFVEY G R + +YL+ DV ISRNT + + K++ Sbjct: 113 SNLDPEKFRELVIGAIFMHNLPLSFVEYRGSRALSSYLHEDVTLISRNTLKAYMIKMHRA 172 Query: 1145 KKNCLKQTLADVPSRICLTFDVWTACTSEGYICLTAHFVDATWKMNSCILAFSDFPPPHS 1324 +++ +K L + P RI LTFD+W + T++ YICL AHFVD W + +L FS PPP++ 
Sbjct: 173 ERSKIKCLLEETPGRINLTFDLWNSITTDTYICLIAHFVDKNWVLQKRVLNFSFMPPPYN 232 Query: 1325 GVELSRKVYDLLKEWGIDKKIFSITLDNASSNDNMQDILREQLCLQNSLLYNCEFFHIRC 1504 V L KVY LL EWGI+ K+FS+TLDN +++ ++L++ L ++ + L +FFH+RC Sbjct: 233 CVALIEKVYALLAEWGIESKLFSVTLDNVLASNAFVELLKKNLNVRKTFLVGGKFFHLRC 292 Query: 1505 SAHILNLIVQEGLK 1546 A +LNLIVQ+ LK Sbjct: 293 FAQVLNLIVQDSLK 306