BLASTX nr result
ID: Mentha24_contig00041767
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00041767 (630 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...  212  8e-53
pir||H85073 probable transposon protein [imported] - Arabidopsis...  207  3e-51
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...  195  8e-48
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...  194  2e-47
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]            193  3e-47
ref|XP_007213385.1| hypothetical protein PRUPE_ppa026473mg [Prun...  189  8e-46
ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [...  184  2e-44
gb|AAD48963.1|AF147263_5 contains similarity to transposases [Ar...  183  4e-44
ref|XP_007221311.1| hypothetical protein PRUPE_ppa025777mg, part...  181  2e-43
ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, part...  180  4e-43
ref|XP_003591130.1| DNA (cytosine-5)-methyltransferase 3B [Medic...  180  4e-43
gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia]       179  8e-43
ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [S...  169  6e-40
ref|XP_007028994.1| Ac-like transposase THELMA13 [Theobroma caca...  169  8e-40
ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Caps...  166  7e-39
ref|XP_007200665.1| hypothetical protein PRUPE_ppa015215mg, part...  164  3e-38
ref|XP_007022882.1| BED zinc finger,hAT family dimerization doma...  163  4e-38
gb|AEF33496.1| putative transposase [Saccharum hybrid cultivar R...  163  4e-38
ref|XP_007219124.1| hypothetical protein PRUPE_ppa015847mg, part...
160 2e-37 dbj|BAB02100.1| unnamed protein product [Arabidopsis thaliana] 159 5e-37 >ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] gi|482560944|gb|EOA25135.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] Length = 547 Score = 212 bits (539), Expect = 8e-53 Identities = 103/204 (50%), Positives = 138/204 (67%) Frame = +3 Query: 18 LKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMI 197 L+ +L ++ LLC+G+ FHI+C AH+LNLIVQ GLK + L+KIR ++K++K SE R Sbjct: 164 LRDQLSSRHGLLCDGEFFHIRCSAHVLNLIVQVGLKFVESPLHKIRETVKWIKWSEGRKD 223 Query: 198 KFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTXXX 377 FKEC GI+ T+ +++DVSTRWNSTY ML S +KYRRAFS+L + YK CP+ Sbjct: 224 LFKECVIDVGIKYTAGLKMDVSTRWNSTYLMLGSVIKYRRAFSLLERAERNYKFCPSDEE 283 Query: 378 XXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTVL 557 FL PF ++T L SGTSYPT+NLYF ++WKI LL S D +++M Sbjct: 284 WNKAEKIYTFLEPFYDITKLFSGTSYPTANLYFAQIWKIECLLNSYSNDGDMELQNMANE 343 Query: 558 MKGKFKKYWEEYSEILAMGAVFDP 629 M+ KF KYWEEYS IL++GA+ DP Sbjct: 344 MRTKFDKYWEEYSIILSIGAILDP 367 >pir||H85073 probable transposon protein [imported] - Arabidopsis thaliana gi|5032279|gb|AAD38227.1|AF147264_10 may be a pseudogene [Arabidopsis thaliana] gi|7267351|emb|CAB81124.1| putative transposon protein [Arabidopsis thaliana] Length = 483 Score = 207 bits (526), Expect = 3e-51 Identities = 106/209 (50%), Positives = 136/209 (65%), Gaps = 1/209 (0%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M + +K++L L++ LLC G+ FH++C HILN+IVQ GLK D L KIR SIKY+KGSE Sbjct: 136 MQEIVKSQLVLRDDLLCKGEFFHVRCATHILNIIVQIGLKGIGDTLEKIRESIKYVKGSE 195 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSML-AFDDLAYKLC 362 R I F +C + GI + + + LDV+ RWNST+ ML A+KYR AF L D YK Sbjct: 196 HREILFAKCMENVGINLKAGLLLDVANRWNSTFKMLDRALKYRAAFGNLKVIDAKNYKFH 255 Query: 363 PTXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVR 542 PT FL F+++T LISG+ YPTSNLYFM+VWK L N ++QDEV+R 
Sbjct: 256 PTDAEWHRLQQMSDFLESFDQITNLISGSIYPTSNLYFMQVWKFQNWLTVNESNQDEVIR 315 Query: 543 SMTVLMKGKFKKYWEEYSEILAMGAVFDP 629 +M VLMK +F KYW E S I A+ VFDP Sbjct: 316 NMIVLMKERFDKYWAEVSNIFAIATVFDP 344 >gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana] Length = 659 Score = 195 bits (496), Expect = 8e-48 Identities = 99/204 (48%), Positives = 125/204 (61%) Frame = +3 Query: 18 LKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMI 197 LK RL N LLC G H++CCAHILNLIVQ GL++A L I S+K++K SE+R Sbjct: 249 LKHRLQSGNGLLCGGNFLHVRCCAHILNLIVQAGLELASGLLENITESVKFVKASESRKD 308 Query: 198 KFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTXXX 377 F C + GI+ + + LDVSTRWNSTY ML A+K+R+AF++L + Y PT Sbjct: 309 SFATCLECVGIKSGAGLSLDVSTRWNSTYEMLARALKFRKAFAILNLYERGYCSLPTEEE 368 Query: 378 XXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTVL 557 L PFN +TT SG YPT+N+YF++VWKI LL + + D VR M Sbjct: 369 CDRGEKICDLLKPFNTITTYFSGVKYPTANIYFIQVWKIELLLMKYANCDDVDVREMAKK 428 Query: 558 MKGKFKKYWEEYSEILAMGAVFDP 629 M+ KF KYW EYS ILAMGA DP Sbjct: 429 MQKKFAKYWNEYSVILAMGAALDP 452 >gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana] Length = 577 Score = 194 bits (493), Expect = 2e-47 Identities = 97/198 (48%), Positives = 128/198 (64%) Frame = +3 Query: 36 LQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMIKFKECA 215 LQ L+C+G+ FH++C AHILNLIVQ+GL+V AL KIR ++KY+KGSE R F+ C Sbjct: 182 LQKDLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCM 241 Query: 216 KKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTXXXXXXXXX 395 GI+ +++ LDVSTRWNSTY ML A++++ LA D YK P+ Sbjct: 242 DTIGIQTEANLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVDRGYKSFPSAVEWERAEL 301 Query: 396 XXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTVLMKGKFK 575 L PF E+T LISG+SYPT+N+YFM+VW I L + S D V+R M M K+ Sbjct: 302 ICDLLKPFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRVIREMVEDMTEKYD 361 Query: 576 KYWEEYSEILAMGAVFDP 629 KYWE++S+ILAM AV DP Sbjct: 362 KYWEDFSDILAMAAVLDP 379 >gb|AAF79835.1|AC026875_15 T6D22.19 
[Arabidopsis thaliana] Length = 745 Score = 193 bits (491), Expect = 3e-47 Identities = 97/198 (48%), Positives = 127/198 (64%) Frame = +3 Query: 36 LQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMIKFKECA 215 LQ L+C+G+ FH++C AHILNLIVQ+GL+V AL KIR ++KY+KGSE R F+ C Sbjct: 365 LQKHLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCM 424 Query: 216 KKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTXXXXXXXXX 395 GI+ +S+ LDVSTRWNSTY ML A++++ LA D YK P+ Sbjct: 425 DTIGIQTEASLVLDVSTRWNSTYHMLSRAIQFKDVLHSLAEVDRGYKSFPSAVEWERAEL 484 Query: 396 XXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTVLMKGKFK 575 L PF E+T LISG+SYPT+N+YFM+VW I L + S D +R M M K+ Sbjct: 485 ICDLLKPFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRAIREMVEDMTEKYD 544 Query: 576 KYWEEYSEILAMGAVFDP 629 KYWE++S+ILAM AV DP Sbjct: 545 KYWEDFSDILAMAAVLDP 562 >ref|XP_007213385.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] gi|462409250|gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] Length = 696 Score = 189 bits (479), Expect = 8e-46 Identities = 96/207 (46%), Positives = 130/207 (62%) Frame = +3 Query: 9 VDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEA 188 V+ LK +L+L+++LL NG+ FHI+CCAHILNLIVQ+GLK DD++ KIR SIKY++GS+ Sbjct: 275 VELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESIKYVRGSQG 334 Query: 189 RMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPT 368 R KF C + +E +R DV TRWNST+ M+ SA+ Y+RAF L D YK + Sbjct: 335 RKQKFLNCDARVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDSNYKHSLS 394 Query: 369 XXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSM 548 FL F ++T L SGT YPT+NLYF +V+ + L + D ++SM Sbjct: 395 QDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDSDSFMKSM 454 Query: 549 TVLMKGKFKKYWEEYSEILAMGAVFDP 629 M KF KYW+EYS ILA+ + DP Sbjct: 455 ATQMMEKFDKYWKEYSLILAIAVILDP 481 >ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] gi|355504225|gb|AES85428.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] Length = 
555 Score = 184 bits (467), Expect = 2e-44 Identities = 91/210 (43%), Positives = 136/210 (64%), Gaps = 1/210 (0%) Frame = +3 Query: 3 LMVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGS 182 ++ ++LKT+L LQN LLC+G+ FH+ C A +LN IV+E LK+ ++KIR SI +++ S Sbjct: 164 ILQEQLKTQLVLQNGLLCDGEFFHVNCFARVLNQIVEEALKLVSCGVHKIRESIMFVRHS 223 Query: 183 EARMIKFKECAKKTG-IEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKL 359 ++R KFKEC +K G ++ + + LD+S +STY +L A+KYR AF D +Y L Sbjct: 224 KSRREKFKECFEKVGGVDSSVHLHLDISMSLSSTYMLLERALKYRCAFESFHLYDDSYDL 283 Query: 360 CPTXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVV 539 CP+ FL PF E +I+ T++PTSNLYF++VWK+ +L + +DE + Sbjct: 284 CPSAEEWKRVEKICAFLLPFCETANMINSTTHPTSNLYFLQVWKVQCVLVDSLGDEDEDI 343 Query: 540 RSMTVLMKGKFKKYWEEYSEILAMGAVFDP 629 + M M KF+KYW+EYS +LA+GAV DP Sbjct: 344 KKMAERMMSKFEKYWDEYSVVLALGAVLDP 373 >gb|AAD48963.1|AF147263_5 contains similarity to transposases [Arabidopsis thaliana] gi|7267311|emb|CAB81093.1| AT4g05510 [Arabidopsis thaliana] Length = 604 Score = 183 bits (464), Expect = 4e-44 Identities = 100/208 (48%), Positives = 124/208 (59%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M + L RL L N+L+C G+ FH++CCAH+LN IVQ GL V DAL+KIR ++KY+KGS Sbjct: 247 MQEVLIDRLKLDNNLMCKGEFFHVRCCAHVLNRIVQNGLDVISDALSKIRETVKYVKGST 306 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCP 365 +R + EC + G + S LDV TRWNSTY ML A+KY+RA + D YK CP Sbjct: 307 SRRLALAECVEGKGEVLLS---LDVQTRWNSTYLMLHKALKYQRALNRFKIVDKNYKNCP 363 Query: 366 TXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRS 545 + L PF ++T L+SG SY TSNLYF VWKI LLE Sbjct: 364 SSEEWKRAKTIHEILMPFYKITNLMSGRSYSTSNLYFGHVWKIQCLLE------------ 411 Query: 546 MTVLMKGKFKKYWEEYSEILAMGAVFDP 629 M+ KF KYW+EYS ILAM AV DP Sbjct: 412 ----MRLKFDKYWKEYSVILAMRAVLDP 435 >ref|XP_007221311.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica] gi|462417945|gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus 
persica] Length = 697 Score = 181 bits (458), Expect = 2e-43 Identities = 93/207 (44%), Positives = 128/207 (61%) Frame = +3 Query: 9 VDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEA 188 V+ LK + +L+++LL NG+ F+I+CCAHILNLIVQ+GLK DD++ KIR SIKY++GS+ Sbjct: 276 VELLKGQPNLKDALLMNGKFFYIRCCAHILNLIVQDGLKHIDDSVGKIRESIKYVRGSQG 335 Query: 189 RMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPT 368 R KF CA + +E +R DV TRWNST+ M+ SA+ Y+RAF L D YK + Sbjct: 336 RKQKFLNCAAQVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDSNYKHSLS 395 Query: 369 XXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSM 548 FL F ++T L SGT YPT+NLYF +V+ + L + D ++SM Sbjct: 396 QDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDSDSFMKSM 455 Query: 549 TVLMKGKFKKYWEEYSEILAMGAVFDP 629 M F KYW+EYS I A+ + DP Sbjct: 456 ATQMMEMFDKYWKEYSLIPAIAVILDP 482 >ref|XP_006279432.1| hypothetical protein CARUB_v10007925mg, partial [Capsella rubella] gi|482548132|gb|EOA12330.1| hypothetical protein CARUB_v10007925mg, partial [Capsella rubella] Length = 539 Score = 180 bits (456), Expect = 4e-43 Identities = 93/188 (49%), Positives = 123/188 (65%), Gaps = 1/188 (0%) Frame = +3 Query: 18 LKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMI 197 LK++L L N+LLC G++FH++C AHILN+IVQ GL D L+KIR SIKY++ S R + Sbjct: 343 LKSQLVLHNNLLCGGEYFHVRCAAHILNIIVQIGLDEIVDTLHKIRESIKYVRASRKREM 402 Query: 198 KFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAF-SMLAFDDLAYKLCPTXX 374 F +C + GI++ + + LDV TRWNSTY ML A+KYR AF + D Y PT Sbjct: 403 LFAKCVEAFGIKMKAGLILDVKTRWNSTYKMLDRALKYRAAFGNFKVIDGRNYNFHPTED 462 Query: 375 XXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTV 554 FL PF+ +T LISG++YPT NLYFM+VWKI L NS +QDEV+R+M V Sbjct: 463 EWHRLKLICEFLEPFDHITNLISGSTYPTFNLYFMQVWKINEWLISNSENQDEVIRNMIV 522 Query: 555 LMKGKFKK 578 M+ +F K Sbjct: 523 PMRERFDK 530 >ref|XP_003591130.1| DNA (cytosine-5)-methyltransferase 3B [Medicago truncatula] gi|355480178|gb|AES61381.1| DNA (cytosine-5)-methyltransferase 3B [Medicago truncatula] Length = 
722 Score = 180 bits (456), Expect = 4e-43 Identities = 92/208 (44%), Positives = 127/208 (61%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M + LK L L NSLL +G+ FHI+C HILNLIVQ+ LKV DAL+KIR S+ Y++ + Sbjct: 459 MQNFLKEYLGLSNSLLFDGEFFHIRCSTHILNLIVQDRLKVVSDALHKIRQSVAYVR-EQ 517 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCP 365 +R ++F EC + G+ + + D TRWNST+ ML SA+ YRRAF L+ + +K CP Sbjct: 518 SRTLQFFECVRNVGMLVILIQQSDCVTRWNSTFRMLQSAINYRRAFYSLSLRNSNFKCCP 577 Query: 366 TXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRS 545 T L PF +T LI +SYP SNLYF E+WK+ L+ T++D ++++ Sbjct: 578 TSDEWRRAETMCDILKPFYNITNLICDSSYPPSNLYFGEIWKLECLIRSYLTNEDLLIQN 637 Query: 546 MTVLMKGKFKKYWEEYSEILAMGAVFDP 629 M MK F KYW Y + A GA+ DP Sbjct: 638 MAGSMKETFDKYWINYGVVFAFGAILDP 665 >gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] Length = 682 Score = 179 bits (453), Expect = 8e-43 Identities = 95/214 (44%), Positives = 126/214 (58%), Gaps = 6/214 (2%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M D L L L + +LC+G++FH++C AHILNLIVQ+GLKV D + K+R + ++ GSE Sbjct: 280 MQDLLMNSLSLHSPILCDGEYFHVRCAAHILNLIVQDGLKVIDSGVRKLRMVVAHIVGSE 339 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAF------SMLAFDDL 347 R+IKFK A G++ + + LD TRWNSTY ML A+ YR F M FD Sbjct: 340 RRLIKFKGNASALGVDTSKKLCLDCVTRWNSTYNMLERAMIYRNVFPTMRGPEMKKFDP- 398 Query: 348 AYKLCPTXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQ 527 + P+ L PF+ +TTLISG YPT+NLYF VWKI LL + + Sbjct: 399 HFPEPPSEAEWIRIVKIVELLKPFDHITTLISGRKYPTANLYFKSVWKIQYLLTRYAKCN 458 Query: 528 DEVVRSMTVLMKGKFKKYWEEYSEILAMGAVFDP 629 D ++ M LM+ KF KYWE YS IL+ A+ DP Sbjct: 459 DTHLKDMADLMRIKFDKYWENYSMILSFAAILDP 492 >ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [Sorghum bicolor] gi|241931317|gb|EES04462.1| hypothetical protein SORBIDRAFT_04g002725 [Sorghum bicolor] Length = 604 Score = 169 bits (428), Expect = 6e-40 Identities = 77/193 (39%), 
Positives = 123/193 (63%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M+D LK L + L C+G FH++C AH++NLIV++GL+ D +N IR S+KY++GS+ Sbjct: 232 MMDILKANLLKMDLLHCDGDLFHVRCAAHVINLIVKDGLQAIDGVINNIRESVKYIRGSQ 291 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCP 365 +R KF++ ++ GI S+ ++DV+ RWNSTY M+ SA+ ++ AF L D Y CP Sbjct: 292 SRKEKFEDIIEELGIRCRSAPQIDVANRWNSTYDMIQSAMPFKDAFLELKVKDSNYTYCP 351 Query: 366 TXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRS 545 + L F + T ++SG++YPTSNLYF ++W + ++LE+ + S +E + + Sbjct: 352 SSQDWQRANAVCKLLKVFKKATKVVSGSTYPTSNLYFHQIWSVRQVLEEEAFSPNETIAA 411 Query: 546 MTVLMKGKFKKYW 584 M + M+ KF KYW Sbjct: 412 MVLEMQAKFDKYW 424 >ref|XP_007028994.1| Ac-like transposase THELMA13 [Theobroma cacao] gi|508717599|gb|EOY09496.1| Ac-like transposase THELMA13 [Theobroma cacao] Length = 373 Score = 169 bits (427), Expect = 8e-40 Identities = 85/198 (42%), Positives = 119/198 (60%) Frame = +3 Query: 36 LQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEARMIKFKECA 215 ++ LL G+ FHI+C AHILNLIVQ+GLK D A+ K R SIKY+KGS+ R KF EC Sbjct: 1 MRKQLLRGGKFFHIRCYAHILNLIVQDGLKEVDSAIQKGRESIKYVKGSQGRKQKFLECV 60 Query: 216 KKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTXXXXXXXXX 395 + ++ DV TRWNST+ ML SA+ +R FS L D +K P+ Sbjct: 61 SLVNLNAKRDLKQDVPTRWNSTFLMLESALYFRLGFSHLEISDSNFKHSPSRDEWDRIEK 120 Query: 396 XXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSMTVLMKGKFK 575 FLS F E+T + SGT YPT++L+F ++ +LE++ + D +++M M KFK Sbjct: 121 LSKFLSVFYEITCVFSGTKYPTADLHFPSIFMARMILEEHMSGDDVYLKNMATQMFVKFK 180 Query: 576 KYWEEYSEILAMGAVFDP 629 KYW ++S IL + +FDP Sbjct: 181 KYWSQFSLILTIAVIFDP 198 >ref|XP_006280333.1| hypothetical protein CARUB_v10026257mg [Capsella rubella] gi|482549037|gb|EOA13231.1| hypothetical protein CARUB_v10026257mg [Capsella rubella] Length = 508 Score = 166 bits (419), Expect = 7e-39 Identities = 91/208 (43%), Positives = 117/208 (56%) Frame = +3 Query: 6 
MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M D LK RL+L ++LLC G+ FH++CCAHILNLIVQ+GLKV AL+KIR S+KY+K ++ Sbjct: 166 MQDILKERLNLDHNLLCEGEFFHVRCCAHILNLIVQDGLKVIGGALSKIRDSVKYVKATK 225 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCP 365 AR I F+ CA F L D +YK CP Sbjct: 226 ARGIAFETCA-----------------------------------FKRLKVVDKSYKHCP 250 Query: 366 TXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRS 545 + L PF ++T L+ G SY TSNLYF+ VWKI LL++N D+ +R Sbjct: 251 SNDDWCKAKNILEILKPFYKITVLMLGRSYSTSNLYFVNVWKIECLLKENERHSDKDIRD 310 Query: 546 MTVLMKGKFKKYWEEYSEILAMGAVFDP 629 M M+ KFKKYW++YS LAMGAV DP Sbjct: 311 MAGRMRIKFKKYWDQYSVSLAMGAVLDP 338 >ref|XP_007200665.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica] gi|462396065|gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica] Length = 478 Score = 164 bits (414), Expect = 3e-38 Identities = 89/207 (42%), Positives = 121/207 (58%) Frame = +3 Query: 9 VDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEA 188 V+ LK +L+L+++LL NG+ FH++CCAHILNLIVQ+GLK DD + KIR SIKY++GS+ Sbjct: 122 VELLKGQLNLKDALLMNGKFFHVRCCAHILNLIVQDGLKHIDDYVGKIRESIKYVRGSQG 181 Query: 189 RMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPT 368 KF +CA + +E +R DV TRWNST+ M+ SA+ Y+RAF L D YK + Sbjct: 182 TKQKFLDCAAQVSLECKRGLRQDVPTRWNSTFLMINSALYYQRAFLHLQLSDSNYKHSLS 241 Query: 369 XXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSM 548 FL F ++T L GT YPT+NLYF +V+ + L Sbjct: 242 QDEWGKLEKLSKFLKVFYDVTCLFFGTKYPTANLYFPQVFVVEDTL-------------- 287 Query: 549 TVLMKGKFKKYWEEYSEILAMGAVFDP 629 K KYW+EYS ILA+ + DP Sbjct: 288 ------KKAKYWKEYSLILAIAVILDP 308 >ref|XP_007022882.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|590614243|ref|XP_007022883.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|590614248|ref|XP_007022884.1| BED zinc finger,hAT family dimerization domain, putative 
isoform 1 [Theobroma cacao] gi|590614254|ref|XP_007022885.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778248|gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] Length = 678 Score = 163 bits (412), Expect = 4e-38 Identities = 80/207 (38%), Positives = 124/207 (59%) Frame = +3 Query: 9 VDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEA 188 V+ LK L+++ + L G+ FH++C A +LNLIVQ+ LK D + K+R S+KY+KGS+ Sbjct: 268 VELLKKNLNVRKTFLVGGKFFHLRCFAQVLNLIVQDSLKEVDCVVQKVRESVKYVKGSQV 327 Query: 189 RMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPT 368 R KF EC + +R DVST+WNST+ ML A+ +R+AFS L D Y+ CP+ Sbjct: 328 RKQKFLECVTLMKLNAKGGLRQDVSTKWNSTFLMLKRALYFRKAFSHLEIRDSNYRYCPS 387 Query: 369 XXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSM 548 L+ F ++T + S T YPT+NL+F ++ L+++ + QD +++M Sbjct: 388 EDEWERVEKLYKLLAVFYDVTCVFSRTKYPTANLFFPSMFIAHSTLQEHMSGQDVYMKNM 447 Query: 549 TVLMKGKFKKYWEEYSEILAMGAVFDP 629 + M KF KYW ++S ILA+ + DP Sbjct: 448 STQMLVKFVKYWSDFSLILAIAVILDP 474 >gb|AEF33496.1| putative transposase [Saccharum hybrid cultivar R570] Length = 607 Score = 163 bits (412), Expect = 4e-38 Identities = 79/193 (40%), Positives = 116/193 (60%) Frame = +3 Query: 6 MVDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSE 185 M+D LK L + L C+G FHI+C AH++NLIV++GL+ D +N IR S+KY++ S+ Sbjct: 236 MMDILKANLLKMDMLHCDGDLFHIRCAAHVINLIVKDGLQAIDGVINNIRESVKYVRASQ 295 Query: 186 ARMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCP 365 +R KF++ + GI S ++DV RWNST M+ SA+ ++ AF L D Y CP Sbjct: 296 
SRKEKFEDIVVELGIRCRSVPKIDVENRWNSTCDMIESAMPFKEAFLELKVKDSNYSYCP 355 Query: 366 TXXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRS 545 + L F + ++SGTSYPTSNLYF E+W I ++LE+ + S +E + + Sbjct: 356 SSQDWERANAVCKLLKVFKKAMEVVSGTSYPTSNLYFHEIWSIKQVLEEEAFSPNETIVT 415 Query: 546 MTVLMKGKFKKYW 584 M M+ KF KYW Sbjct: 416 MVSEMQAKFDKYW 428 >ref|XP_007219124.1| hypothetical protein PRUPE_ppa015847mg, partial [Prunus persica] gi|462415586|gb|EMJ20323.1| hypothetical protein PRUPE_ppa015847mg, partial [Prunus persica] Length = 458 Score = 160 bits (406), Expect = 2e-37 Identities = 85/206 (41%), Positives = 118/206 (57%) Frame = +3 Query: 9 VDKLKTRLDLQNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEA 188 V+ LK +L+L+++LL NG+ FH++CCAHILNLIVQ+G IKY++GS+ Sbjct: 88 VELLKGQLNLKDALLMNGKFFHVRCCAHILNLIVQDG--------------IKYVRGSQG 133 Query: 189 RMIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPT 368 R KF +C + +E + +R DV TRWNST+ M+ SA+ Y+ AF L D YK + Sbjct: 134 RKHKFLDCTAQVSLECKTGLRQDVPTRWNSTFLMIGSALCYQHAFLHLQLSDSNYKHSLS 193 Query: 369 XXXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQDEVVRSM 548 FL F ++T L SGT YPT NLYF +V+ + L D ++SM Sbjct: 194 QDEWGKLKKLSKFLKVFYDVTCLFSGTKYPTENLYFPQVFMVDDTLRNVKVDSDSFMKSM 253 Query: 549 TVLMKGKFKKYWEEYSEILAMGAVFD 626 M KF KYW+EYS ILA+ + D Sbjct: 254 ATEMMEKFDKYWKEYSLILAIAVILD 279 >dbj|BAB02100.1| unnamed protein product [Arabidopsis thaliana] Length = 463 Score = 159 bits (403), Expect = 5e-37 Identities = 79/173 (45%), Positives = 108/173 (62%), Gaps = 2/173 (1%) Frame = +3 Query: 18 LKTRLDL--QNSLLCNGQHFHIKCCAHILNLIVQEGLKVADDALNKIRGSIKYLKGSEAR 191 LK RL + N LLC+G+ FH++CCAH+LNLIVQEGL +A + L IR S++++K SE+R Sbjct: 277 LKHRLQMISGNGLLCDGKFFHVRCCAHVLNLIVQEGLSIATELLENIRESVRFVKASESR 336 Query: 192 MIKFKECAKKTGIEITSSVRLDVSTRWNSTYTMLVSAVKYRRAFSMLAFDDLAYKLCPTX 371 F C + GI + + LDV TRWNSTY ML A+K+R+AF+ L D YK + Sbjct: 337 KDAFAACVESVGIRSGAGLSLDVPTRWNSTYDMLARALKFRKAFASLKECDRNYKSLTSE 396 Query: 372 XXXXXXXXXXXFLSPFNEMTTLISGTSYPTSNLYFMEVWKIARLLEQNSTSQD 530 
L PF+ +TT SG YPT+N+YF++VWKI RLL+ + D Sbjct: 397 NEWDRGERICDLLKPFSTITTYFSGVKYPTANVYFLQVWKIERLLKDYAVCGD 449