BLASTX nr result
ID: Rehmannia23_contig00028060
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00028060
         (716 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EOY19559.1| T6D22.19, putative [Theobroma cacao]                  213   4e-53
ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...  196   5e-48
gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...  186   7e-45
ref|XP_003591130.1| DNA (cytosine-5)-methyltransferase 3B [Medic...  184   2e-44
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...  182   7e-44
ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [...  181   2e-43
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...  177   1e-42
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]            176   4e-42
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...  173   6e-41
pir||H85073 probable transposon protein [imported] - Arabidopsis...  172   7e-41
gb|AAD48963.1|AF147263_5 contains similarity to transposases [Ar...  168   1e-39
gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p...  167   3e-39
gb|EMJ20323.1| hypothetical protein PRUPE_ppa015847mg, partial [...  167   3e-39
gb|EOY09496.1| Ac-like transposase THELMA13 [Theobroma cacao]        164   3e-38
gb|EOX99652.1| BED zinc finger,hAT family dimerization domain [T...  156   6e-36
gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [...  156   6e-36
ref|XP_002450498.1| hypothetical protein SORBIDRAFT_05g006263 [S...  155   1e-35
ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [S...  153   5e-35
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...
150 4e-34 ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [A... 149 7e-34 >gb|EOY19559.1| T6D22.19, putative [Theobroma cacao] Length = 559 Score = 213 bits (543), Expect = 4e-53 Identities = 106/174 (60%), Positives = 125/174 (71%) Frame = -1 Query: 581 NVGGGFLRLDVATRWNSTYMMLESAIKYRRAFASLSYHDKNYKLCPTSEEWERAERVCAF 402 N+ G LRLD +TRWNSTY+M ESAIKY++AFASL + D+ YK P+ +EW RA +C F Sbjct: 270 NMQGVGLRLDASTRWNSTYLMFESAIKYQKAFASLQFVDRTYKYNPSDKEWGRAMIICEF 329 Query: 401 LVPFYHITNLVSGSSYPTSNLYFMQVASIEMKLNENLACEDEVMKDMLIRMKGKFDKYWN 222 L PFY NL+SGSSYPTSNLYFMQV IE LNENL EDEV+KDM RMK KFDKYW Sbjct: 330 LEPFYETINLISGSSYPTSNLYFMQVWKIESILNENLHNEDEVIKDMSQRMKMKFDKYWK 389 Query: 221 EYCVTLAFGCILDPKAKLQFLNFCYKRLYLLDY*EKVNRVKEALYKLFGEYKHN 60 +Y V LAFG ILDP+ KL FL FCY ++ EK+ VK LY+LF +Y N Sbjct: 390 DYSVVLAFGAILDPRMKLDFLRFCYSKIDASTCHEKLENVKTKLYELFEQYASN 443 >ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] gi|482560944|gb|EOA25135.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] Length = 547 Score = 196 bits (499), Expect = 5e-48 Identities = 97/201 (48%), Positives = 133/201 (66%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RE+VK+++ SEGR F+ C+ +V + G L++DV+TRWNSTY+ML S IKYRRA Sbjct: 207 KIRETVKWIKWSEGRKDLFKECVIDVGIKYTAG--LKMDVSTRWNSTYLMLGSVIKYRRA 264 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F+ L ++NYK CP+ EEW +AE++ FL PFY IT L SG+SYPT+NLYF Q+ IE Sbjct: 265 FSLLERAERNYKFCPSDEEWNKAEKIYTFLEPFYDITKLFSGTSYPTANLYFAQIWKIEC 324 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 LN D +++M M+ KFDKYW EY + L+ G ILDP+ K++ L +C+ +L Sbjct: 325 LLNSYSNDGDMELQNMANEMRTKFDKYWEEYSIILSIGAILDPRMKVEILTYCFDKLDPS 384 Query: 128 DY*EKVNRVKEALYKLFGEYK 66 KV VK+ L LF +YK Sbjct: 385 TTKAKVEVVKQKLNLLFDQYK 405 >gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] Length = 696 Score = 186 bits (472), Expect = 7e-45 Identities 
= 97/200 (48%), Positives = 128/200 (64%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RES+KYVRGS+GR Q+F C V L G LR DV TRWNST++M++SA+ Y+RA Sbjct: 321 KIRESIKYVRGSQGRKQKFLNCDARVSLECKRG--LRQDVPTRWNSTFLMIDSALYYQRA 378 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F L D NYK + +EW + E++ FL FY +T L SG+ YPT+NLYF QV +E Sbjct: 379 FLHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVED 438 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 L + D MK M +M KFDKYW EY + LA ILDP+ K+QF+ FCYKRLY Sbjct: 439 TLRKAKVDSDSFMKSMATQMMEKFDKYWKEYSLILAIAVILDPRYKIQFVEFCYKRLYGY 498 Query: 128 DY*EKVNRVKEALYKLFGEY 69 + E++ +V++ L+ LF Y Sbjct: 499 NS-EEMTKVRDMLFSLFDLY 517 >ref|XP_003591130.1| DNA (cytosine-5)-methyltransferase 3B [Medicago truncatula] gi|355480178|gb|AES61381.1| DNA (cytosine-5)-methyltransferase 3B [Medicago truncatula] Length = 722 Score = 184 bits (468), Expect = 2e-44 Identities = 99/205 (48%), Positives = 126/205 (61%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K R+SV YVR LQ F+ C+ V + + + D TRWNST+ ML+SAI YRRA Sbjct: 506 KIRQSVAYVREQSRTLQFFE-CVRNVGMLVIL--IQQSDCVTRWNSTFRMLQSAINYRRA 562 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F SLS + N+K CPTS+EW RAE +C L PFY+ITNL+ SSYP SNLYF ++ +E Sbjct: 563 FYSLSLRNSNFKCCPTSDEWRRAETMCDILKPFYNITNLICDSSYPPSNLYFGEIWKLEC 622 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 + L ED ++++M MK FDKYW Y V AFG ILDP K FL F Y++L L Sbjct: 623 LIRSYLTNEDLLIQNMAGSMKETFDKYWINYGVVFAFGAILDPTKKFNFLKFAYQKLDPL 682 Query: 128 DY*EKVNRVKEALYKLFGEYKHNGL 54 EK+ V+ L LF EY NG+ Sbjct: 683 TGEEKLKMVRMTLENLFAEYVKNGI 707 >gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica] Length = 697 Score = 182 bits (463), Expect = 7e-44 Identities = 95/200 (47%), Positives = 127/200 (63%) Frame = -1 Query: 668 
KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RES+KYVRGS+GR Q+F C +V L G LR DV TRWNST++M++SA+ Y+RA Sbjct: 322 KIRESIKYVRGSQGRKQKFLNCAAQVSLECKRG--LRQDVPTRWNSTFLMIDSALYYQRA 379 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F L D NYK + +EW + E++ FL FY +T L SG+ YPT+NLYF QV +E Sbjct: 380 FLHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVED 439 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 L + D MK M +M FDKYW EY + A ILDP+ K+QF+ FCYKRLY Sbjct: 440 TLRKAKVDSDSFMKSMATQMMEMFDKYWKEYSLIPAIAVILDPRYKIQFVEFCYKRLYGY 499 Query: 128 DY*EKVNRVKEALYKLFGEY 69 + E++ +V++ L+ LF Y Sbjct: 500 NS-EEMTKVRDMLFSLFDLY 518 >ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] gi|355504225|gb|AES85428.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] Length = 555 Score = 181 bits (459), Expect = 2e-43 Identities = 91/208 (43%), Positives = 133/208 (63%), Gaps = 5/208 (2%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGF-----LRLDVATRWNSTYMMLESAI 504 K RES+ +VR S+ R ++F+ C E+V GG L LD++ +STYM+LE A+ Sbjct: 212 KIRESIMFVRHSKSRREKFKECFEKV------GGVDSSVHLHLDISMSLSSTYMLLERAL 265 Query: 503 KYRRAFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQV 324 KYR AF S +D +Y LCP++EEW+R E++CAFL+PF N+++ +++PTSNLYF+QV Sbjct: 266 KYRCAFESFHLYDDSYDLCPSAEEWKRVEKICAFLLPFCETANMINSTTHPTSNLYFLQV 325 Query: 323 ASIEMKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYK 144 ++ L ++L EDE +K M RM KF+KYW+EY V LA G +LDP+ K L +CY Sbjct: 326 WKVQCVLVDSLGDEDEDIKKMAERMMSKFEKYWDEYSVVLALGAVLDPRMKFTTLAYCYS 385 Query: 143 RLYLLDY*EKVNRVKEALYKLFGEYKHN 60 +L K+ +VK L LF ++ N Sbjct: 386 KLDASTCERKLQQVKRKLCMLFEKHSGN 413 >gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana] Length = 577 Score = 177 bits (450), Expect(2) = 1e-42 Identities = 90/202 (44%), Positives = 128/202 (63%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K 
RE+VKYV+GSE R FQ C++ + + L LDV+TRWNSTY ML AI+++ Sbjct: 218 EKIRETVKYVKGSETRENLFQNCMDTIGIQTEAN--LVLDVSTRWNSTYHMLSRAIQFKD 275 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 SL+ D+ YK P++ EWERAE +C L PF IT L+SGSSYPT+N+YFMQV +I+ Sbjct: 276 VLRSLAEVDRGYKSFPSAVEWERAELICDLLKPFAEITKLISGSSYPTANVYFMQVWAIK 335 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 L ++ D V+++M+ M K+DKYW ++ LA +LDP+ K L +CY L Sbjct: 336 CWLGDHDDSHDRVIREMVEDMTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNILNP 395 Query: 131 LDY*EKVNRVKEALYKLFGEYK 66 L E + V++ + +LFG YK Sbjct: 396 LTSKENLTHVRDKMVQLFGAYK 417 Score = 22.7 bits (47), Expect(2) = 1e-42 Identities = 10/17 (58%), Positives = 14/17 (82%) Frame = -3 Query: 714 IIVQEGLKVTCKSLEKI 664 +IVQ+GL+V +LEKI Sbjct: 204 LIVQDGLEVISGALEKI 220 >gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana] Length = 745 Score = 176 bits (445), Expect(2) = 4e-42 Identities = 89/202 (44%), Positives = 127/202 (62%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RE+VKYV+GSE R FQ C++ + + L LDV+TRWNSTY ML AI+++ Sbjct: 401 EKIRETVKYVKGSETRENLFQNCMDTIGIQTEAS--LVLDVSTRWNSTYHMLSRAIQFKD 458 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 SL+ D+ YK P++ EWERAE +C L PF IT L+SGSSYPT+N+YFMQV +I+ Sbjct: 459 VLHSLAEVDRGYKSFPSAVEWERAELICDLLKPFAEITKLISGSSYPTANVYFMQVWAIK 518 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 L ++ D +++M+ M K+DKYW ++ LA +LDP+ K L +CY L Sbjct: 519 CWLGDHDDSHDRAIREMVEDMTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNILNP 578 Query: 131 LDY*EKVNRVKEALYKLFGEYK 66 L E + V++ + +LFG YK Sbjct: 579 LTSKENLTHVRDKMVQLFGAYK 600 Score = 22.7 bits (47), Expect(2) = 4e-42 Identities = 10/17 (58%), Positives = 14/17 (82%) Frame = -3 Query: 714 IIVQEGLKVTCKSLEKI 664 +IVQ+GL+V +LEKI Sbjct: 387 LIVQDGLEVISGALEKI 403 >gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana] Length = 659 Score = 173 bits (438), Expect = 6e-41 
Identities = 91/198 (45%), Positives = 125/198 (63%) Frame = -1 Query: 659 ESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRAFAS 480 ESVK+V+ SE R F C+E V + + G L LDV+TRWNSTY ML A+K+R+AFA Sbjct: 295 ESVKFVKASESRKDSFATCLECVGIKSGAG--LSLDVSTRWNSTYEMLARALKFRKAFAI 352 Query: 479 LSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEMKLN 300 L+ +++ Y PT EE +R E++C L PF IT SG YPT+N+YF+QV IE+ L Sbjct: 353 LNLYERGYCSLPTEEECDRGEKICDLLKPFNTITTYFSGVKYPTANIYFIQVWKIELLLM 412 Query: 299 ENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLLDY* 120 + C+D +++M +M+ KF KYWNEY V LA G LDP+ KLQ L Y ++ + Sbjct: 413 KYANCDDVDVREMAKKMQKKFAKYWNEYSVILAMGAALDPRLKLQILRSAYNKVDPVTAE 472 Query: 119 EKVNRVKEALYKLFGEYK 66 KV+ V+ L L+ EYK Sbjct: 473 GKVDIVRNNLILLYEEYK 490 >pir||H85073 probable transposon protein [imported] - Arabidopsis thaliana gi|5032279|gb|AAD38227.1|AF147264_10 may be a pseudogene [Arabidopsis thaliana] gi|7267351|emb|CAB81124.1| putative transposon protein [Arabidopsis thaliana] Length = 483 Score = 172 bits (437), Expect = 7e-41 Identities = 93/208 (44%), Positives = 129/208 (62%), Gaps = 1/208 (0%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RES+KYV+GSE R F C+E V ++ G L LDVA RWNST+ ML+ A+KYR Sbjct: 182 EKIRESIKYVKGSEHREILFAKCMENVGINLKAG--LLLDVANRWNSTFKMLDRALKYRA 239 Query: 491 AFASLSYHD-KNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASI 315 AF +L D KNYK PT EW R +++ FL F ITNL+SGS YPTSNLYFMQV Sbjct: 240 AFGNLKVIDAKNYKFHPTDAEWHRLQQMSDFLESFDQITNLISGSIYPTSNLYFMQVWKF 299 Query: 314 EMKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLY 135 + L N + +DEV+++M++ MK +FDKYW E A + DP+ KL ++C+ +L Sbjct: 300 QNWLTVNESNQDEVIRNMIVLMKERFDKYWAEVSNIFAIATVFDPRLKLTLADYCFAKLD 359 Query: 134 LLDY*EKVNRVKEALYKLFGEYKHNGLA 51 + + + ++ L KLF Y++ A Sbjct: 360 ISTREKGMKHLRAQLRKLFEVYENKSNA 387 >gb|AAD48963.1|AF147263_5 contains similarity to transposases [Arabidopsis thaliana] gi|7267311|emb|CAB81093.1| AT4g05510 [Arabidopsis thaliana] Length = 
604 Score = 168 bits (426), Expect = 1e-39 Identities = 92/201 (45%), Positives = 117/201 (58%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RE+VKYV+GS R C+E G L LDV TRWNSTY+ML A+KY+RA Sbjct: 294 KIRETVKYVKGSTSRRLALAECVE-----GKGEVLLSLDVQTRWNSTYLMLHKALKYQRA 348 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 DKNYK CP+SEEW+RA+ + L+PFY ITNL+SG SY TSNLYF V I+ Sbjct: 349 LNRFKIVDKNYKNCPSSEEWKRAKTIHEILMPFYKITNLMSGRSYSTSNLYFGHVWKIQ- 407 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 L+ M+ KFDKYW EY V LA +LDP+ K + L CY L Sbjct: 408 ---------------CLLEMRLKFDKYWKEYSVILAMRAVLDPRMKFKLLKRCYDELDPT 452 Query: 128 DY*EKVNRVKEALYKLFGEYK 66 EK++ ++ + +LFGEY+ Sbjct: 453 TSQEKIDFLETKITELFGEYR 473 >gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] Length = 678 Score = 167 bits (423), Expect = 3e-39 Identities = 83/201 (41%), Positives = 130/201 (64%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RESVKYV+GS+ R Q+F C+ + L+ GG LR DV+T+WNST++ML+ A+ +R+ Sbjct: 313 QKVRESVKYVKGSQVRKQKFLECVTLMKLNAKGG--LRQDVSTKWNSTFLMLKRALYFRK 370 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 AF+ L D NY+ CP+ +EWER E++ L FY +T + S + YPT+NL+F + Sbjct: 371 AFSHLEIRDSNYRYCPSEDEWERVEKLYKLLAVFYDVTCVFSRTKYPTANLFFPSMFIAH 430 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 L E+++ +D MK+M +M KF KYW+++ + LA ILDP+ K+ F+ + Y +LY Sbjct: 431 STLQEHMSGQDVYMKNMSTQMLVKFVKYWSDFSLILAIAVILDPRYKIHFVEWSYGKLYG 490 Query: 131 LDY*EKVNRVKEALYKLFGEY 69 D + 
V++ L+ L+ EY Sbjct: 491 NDS-TQFKNVRDWLFSLYNEY 510 >gb|EMJ20323.1| hypothetical protein PRUPE_ppa015847mg, partial [Prunus persica] Length = 458 Score = 167 bits (423), Expect = 3e-39 Identities = 88/199 (44%), Positives = 119/199 (59%) Frame = -1 Query: 662 RESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRAFA 483 ++ +KYVRGS+GR +F C +V L G LR DV TRWNST++M+ SA+ Y+ AF Sbjct: 122 QDGIKYVRGSQGRKHKFLDCTAQVSLECKTG--LRQDVPTRWNSTFLMIGSALCYQHAFL 179 Query: 482 SLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEMKL 303 L D NYK + +EW + +++ FL FY +T L SG+ YPT NLYF QV ++ L Sbjct: 180 HLQLSDSNYKHSLSQDEWGKLKKLSKFLKVFYDVTCLFSGTKYPTENLYFPQVFMVDDTL 239 Query: 302 NENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLLDY 123 D MK M M KFDKYW EY + LA ILD + K+QF+ FCYKRLY + Sbjct: 240 RNVKVDSDSFMKSMATEMMEKFDKYWKEYSLILAIAVILDARYKIQFVEFCYKRLYGYNS 299 Query: 122 *EKVNRVKEALYKLFGEYK 66 E++ V + L+ LF Y+ Sbjct: 300 -EEMTEVPDMLFSLFDLYE 317 >gb|EOY09496.1| Ac-like transposase THELMA13 [Theobroma cacao] Length = 373 Score = 164 bits (415), Expect = 3e-38 Identities = 83/201 (41%), Positives = 130/201 (64%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RES+KYV+GS+GR Q+F C+ V+L+ L+ DV TRWNST++MLESA+ +R Sbjct: 37 QKGRESIKYVKGSQGRKQKFLECVSLVNLNAKRD--LKQDVPTRWNSTFLMLESALYFRL 94 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 F+ L D N+K P+ +EW+R E++ FL FY IT + SG+ YPT++L+F + Sbjct: 95 GFSHLEISDSNFKHSPSRDEWDRIEKLSKFLSVFYEITCVFSGTKYPTADLHFPSIFMAR 154 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 M L E+++ +D +K+M +M KF KYW+++ + L I DP+ K+QF+ + Y +LY Sbjct: 155 MILEEHMSGDDVYLKNMATQMFVKFKKYWSQFSLILTIAVIFDPRYKIQFMEWSYTKLYG 214 Query: 131 LDY*EKVNRVKEALYKLFGEY 69 + E +VK+ L+ L+ EY Sbjct: 215 SNSAE-FKKVKDHLFALYDEY 234 >gb|EOX99652.1| BED zinc finger,hAT family dimerization domain [Theobroma cacao] Length = 528 Score = 156 bits (395), Expect = 6e-36 Identities = 82/201 (40%), Positives = 126/201 
(62%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RES+KYV+GS+GR Q+F C+ V+L+ L+ DV T WNST+ MLESA+ +R Sbjct: 192 QKVRESIKYVKGSQGRKQKFLECVSLVNLNAKRS--LKQDVPTWWNSTFPMLESALYFRL 249 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 AF+ L D N+K P+ +W+R E++ FL FY IT + S + YPT++LYF + Sbjct: 250 AFSYLEISDSNFKHSPSRNKWDRIEKLSKFLSVFYEITCVFSETKYPTTDLYFPSIFMAR 309 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 M L E+++ +D +K+M +M KF+KYW+E + LA I D + K+QF+ + Y + Y Sbjct: 310 MTLEEHMSGDDVYLKNMATQMFFKFEKYWSEISLILAIAVIFDYRYKIQFVEWSYAKFYG 369 Query: 131 LDY*EKVNRVKEALYKLFGEY 69 D E +V++ L+ L+ EY Sbjct: 370 SDSAE-FKKVQDHLFSLYDEY 389 >gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica] Length = 478 Score = 156 bits (395), Expect = 6e-36 Identities = 87/200 (43%), Positives = 117/200 (58%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RES+KYVRGS+G Q+F C +V L G LR DV TRWNST++M+ SA+ Y+RA Sbjct: 168 KIRESIKYVRGSQGTKQKFLDCAAQVSLECKRG--LRQDVPTRWNSTFLMINSALYYQRA 225 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F L D NYK + +EW + E++ FL FY +T L G+ YPT+NLYF QV +E Sbjct: 226 FLHLQLSDSNYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFFGTKYPTANLYFPQVFVVED 285 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 L K KYW EY + LA ILDP+ K+QF+ FCYKRLY Sbjct: 286 TL--------------------KKAKYWKEYSLILAIAVILDPRYKIQFVKFCYKRLYGY 325 Query: 128 DY*EKVNRVKEALYKLFGEY 69 + +++ +V++ L+ LF Y Sbjct: 326 NS-KEMTKVRDMLFSLFDLY 344 >ref|XP_002450498.1| hypothetical protein SORBIDRAFT_05g006263 [Sorghum bicolor] gi|241936341|gb|EES09486.1| hypothetical protein SORBIDRAFT_05g006263 [Sorghum bicolor] Length = 521 Score = 155 bits (392), Expect = 1e-35 Identities = 77/201 (38%), Positives = 126/201 (62%), Gaps = 1/201 (0%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVH-LSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 K RESVKY++GS R 
Q+F+ I++++ ++ ++D+ TRWNSTY+ML+ + + +R Sbjct: 309 KVRESVKYIQGSTSRKQKFEEIIQQLYPTADESPTLPKVDICTRWNSTYLMLKDSFELKR 368 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 AF SL+ D+ Y PTSEEWE+A +VC L F+ T ++SGS YPT+NL+F ++ I Sbjct: 369 AFESLTQQDQEYIFAPTSEEWEKARKVCRLLKVFFDATVVISGSLYPTANLHFHEIWEIR 428 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 + L + DE + + + M+ KF +YW + + F I DP+ KL F++F K+ + Sbjct: 429 LVLENQVPEADEELTETIQYMQRKFRRYWKLTWLQIIFPVIFDPRFKLGFVDFRLKQAFG 488 Query: 131 LDY*EKVNRVKEALYKLFGEY 69 ++ K+ VK+ L +LF +Y Sbjct: 489 IEAESKIEIVKKTLLELFKDY 509 >ref|XP_002451486.1| hypothetical protein SORBIDRAFT_04g002725 [Sorghum bicolor] gi|241931317|gb|EES04462.1| hypothetical protein SORBIDRAFT_04g002725 [Sorghum bicolor] Length = 604 Score = 153 bits (387), Expect = 5e-35 Identities = 81/199 (40%), Positives = 119/199 (59%), Gaps = 1/199 (0%) Frame = -1 Query: 662 RESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRAFA 483 RESVKY+RGS+ R ++F+ IEE+ + ++DVA RWNSTY M++SA+ ++ AF Sbjct: 281 RESVKYIRGSQSRKEKFEDIIEELGIRCRSAP--QIDVANRWNSTYDMIQSAMPFKDAFL 338 Query: 482 SLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEMKL 303 L D NY CP+S++W+RA VC L F T +VSGS+YPTSNLYF Q+ S+ L Sbjct: 339 ELKVKDSNYTYCPSSQDWQRANAVCKLLKVFKKATKVVSGSTYPTSNLYFHQIWSVRQVL 398 Query: 302 NENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLY-LLD 126 E +E + M++ M+ KFDKYW +T +LDP+ K F+ F K+ + Sbjct: 399 EEEAFSPNETIAAMVLEMQAKFDKYWMISYLTNCVPVVLDPRFKFGFIEFRLKQAFGQHG 458 Query: 125 Y*EKVNRVKEALYKLFGEY 69 +++V +A+ LF Y Sbjct: 459 SVHHLDKVDQAIRGLFNAY 477 >ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] gi|548861481|gb|ERN18855.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] Length = 685 Score = 150 bits (379), Expect = 4e-34 Identities = 77/202 (38%), Positives = 119/202 (58%) Frame = -1 Query: 671 KKFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRR 492 +K RES+KYV+ S R ++F I ++ + + 
FL DV TRWNSTY ML+ ++ R Sbjct: 338 QKIRESIKYVKTSHVRQERFNEIINQLGIQSKQNIFL--DVPTRWNSTYHMLDVTLELRE 395 Query: 491 AFASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIE 312 AF+ + D + P+ +EWER + +C L FY ITN GS YPT+NLYF +V + Sbjct: 396 AFSCFAQCDSMCNMVPSEDEWERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMH 455 Query: 311 MKLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYL 132 ++L E ++ + M I+MK KFDKYW + LA ++DP+ KL+F+ + Y ++Y Sbjct: 456 LRLVEWSMSLNKHISSMAIKMKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYG 515 Query: 131 LDY*EKVNRVKEALYKLFGEYK 66 D + V++ +Y L EY+ Sbjct: 516 NDAEHHIRMVRQGVYDLCNEYE 537 >ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] gi|548854912|gb|ERN12810.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] Length = 841 Score = 149 bits (377), Expect = 7e-34 Identities = 72/203 (35%), Positives = 121/203 (59%) Frame = -1 Query: 668 KFRESVKYVRGSEGRLQQFQICIEEVHLSNVGGGFLRLDVATRWNSTYMMLESAIKYRRA 489 K RESVKYV+ S+ Q F +++ + + L LDV WN+T++MLE+A+++++A Sbjct: 304 KIRESVKYVKASQAHEQNFSKLFQQLEIPSKKD--LCLDVQGEWNTTFLMLEAALEFKQA 361 Query: 488 FASLSYHDKNYKLCPTSEEWERAERVCAFLVPFYHITNLVSGSSYPTSNLYFMQVASIEM 309 F+ L HD NY+ P+ +EW++ E +C +L FY + S ++PT+NLYF ++ I M Sbjct: 362 FSCLGSHDSNYEGAPSEDEWKKVEVLCIYLKVFYDVLRAFSEVTHPTANLYFHELWKIHM 421 Query: 308 KLNENLACEDEVMKDMLIRMKGKFDKYWNEYCVTLAFGCILDPKAKLQFLNFCYKRLYLL 129 LN + D V+ ++ ++ KFDKYW EY + LA +DP+ K++F+ F + ++Y Sbjct: 422 HLNHTVTSPDIVIIPVIRNLQDKFDKYWREYSLVLAIAVSMDPRFKMKFVEFSFSKVYGT 481 Query: 128 DY*EKVNRVKEALYKLFGEYKHN 60 + V EA+ L+ +Y N Sbjct: 482 NAFMYTRVVIEAIRDLYSQYARN 504