BLASTX nr result
ID: Ephedra27_contig00017339
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00017339
         (2180 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...   715   0.0
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...   706   0.0
gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p...   585   e-164
gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [...   568   e-159
gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus pe...   503   e-139
gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [...   484   e-134
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]             439   e-120
gb|AAD48963.1|AF147263_5 contains similarity to transposases [Ar...   430   e-117
ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [A...   421   e-115
ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...   418   e-114
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...   417   e-114
emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera]   417   e-114
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...   417   e-113
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...   415   e-113
gb|EOY04304.1| BED zinc finger,hAT family dimerization domain is...   414   e-112
gb|EOY04303.1| BED zinc finger,hAT family dimerization domain is...   414   e-112
gb|EOY04302.1| BED zinc finger,hAT family dimerization domain is...
414 e-112 gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] 412 e-112 gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana] 405 e-110 ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [... 403 e-109 >gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] Length = 696 Score = 715 bits (1845), Expect = 0.0 Identities = 352/652 (53%), Positives = 476/652 (73%), Gaps = 11/652 (1%) Frame = -3 Query: 1947 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 1780 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 28 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTGN 87 Query: 1779 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 1600 L RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 88 LKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVEY 147 Query: 1599 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 1420 GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ LT DLWTSI+T Sbjct: 148 AGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSITT 207 Query: 1419 DEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 1240 D YL LT HF+D NWKLQK++LNF FM PPHT +AL EKIY LL +WG+E+KLFS+TLD+ Sbjct: 208 DGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLDN 267 Query: 1239 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1060 AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 268 ASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 327 Query: 1059 YLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 880 Y++GS R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SALYY+RAF+H + D Sbjct: 328 YVRGSQGRKQKFLNCDARVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDS 387 Query: 879 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 700 +YKH S +EW ++EKL KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 388 NYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 447 Query: 699 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVS 520 D F+ MA +M +KF+ YW + 
+ILAIAV+LDPRYK++FVE+ Y+RLYG S ++ +V Sbjct: 448 DSFMKSMATQMMEKFDKYWKEYSLILAIAVILDPRYKIQFVEFCYKRLYGYNSEEMTKVR 507 Query: 519 EKLYSLFETYKQ--NSTNSVERTSKNLQKDVSHGTHRLP----DFMEEFDTFSAENXXXX 358 + L+SLF+ Y + +S+ SV TS SH + D M+EFD F +E Sbjct: 508 DMLFSLFDLYFRIYSSSESVSGTSSASNGARSHVDDMVSKECLDVMKEFDNFESEEFTTS 567 Query: 357 XXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNI 178 ++L LYLDE + D + +L+VLDFW+ N++R+P LS++ARD+LSIPISTV+SE AF++ Sbjct: 568 AQKTQLQLYLDEPKIDRKTKLNVLDFWKVNQFRYPELSILARDLLSIPISTVASESAFSV 627 Query: 177 GRRVVNRSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 25 G RV+++ RS LKPE VE+ C+R+WIFG++ + P++ E+ ED++K+ IN Sbjct: 628 GGRVLDQYRSALKPENVEALVCTRDWIFGEENCTLAPNLEELTEDISKMEIN 679 >gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica] Length = 697 Score = 706 bits (1822), Expect = 0.0 Identities = 348/652 (53%), Positives = 471/652 (72%), Gaps = 11/652 (1%) Frame = -3 Query: 1947 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 1780 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 29 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTRN 88 Query: 1779 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 1600 L RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 89 LKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIITHDLPFQFVEY 148 Query: 1599 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 1420 GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ L DLWTSI+T Sbjct: 149 SGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILDSVPGRVCLASDLWTSITT 208 Query: 1419 DEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 1240 D YL LT HF+D NWKLQK++LNF FM PPHT + L EKIY LL +WG+E+KLFS+TLD+ Sbjct: 209 DGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVTLCEKIYKLLTDWGVEKKLFSMTLDN 268 Query: 1239 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1060 AS+ND FV L+ + NLK+ LL NG+FF++ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 269 ASSNDTFVELLKGQPNLKDALLMNGKFFYIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 328 Query: 1059 
YLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 880 Y++GS R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SALYY+RAF+H + D Sbjct: 329 YVRGSQGRKQKFLNCAAQVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDS 388 Query: 879 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 700 +YKH S +EW ++EKL KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 389 NYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 448 Query: 699 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVS 520 D F+ MA +M + F+ YW + +I AIAV+LDPRYK++FVE+ Y+RLYG S ++ +V Sbjct: 449 DSFMKSMATQMMEMFDKYWKEYSLIPAIAVILDPRYKIQFVEFCYKRLYGYNSEEMTKVR 508 Query: 519 EKLYSLFETYKQ--NSTNSVERTSKNLQKDVSHGTHRLP----DFMEEFDTFSAENXXXX 358 + L+SLF+ Y Q +S+ SV TS SH + D M+EFD F +E Sbjct: 509 DMLFSLFDLYFQIYSSSESVSGTSSASNGARSHVDDMVSKECLDVMKEFDNFESEEFTTS 568 Query: 357 XXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNI 178 ++L LYLDE + D + +L+VLDFW+ N++R+P LS++ARD+LSIPISTV+SE AF++ Sbjct: 569 AQKTQLQLYLDEPKIDRKTKLNVLDFWKVNQFRYPELSILARDLLSIPISTVASESAFSV 628 Query: 177 GRRVVNRSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 25 G RV+++ RS LKPE VE+ C+R+WIFGK+ + P++ E+ ED++K+ IN Sbjct: 629 GGRVLDQYRSALKPENVEALVCTRDWIFGKENCTLAPNLEELTEDISKMEIN 680 >gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] Length = 678 Score = 585 bits (1508), Expect = e-164 Identities = 299/668 (44%), Positives = 426/668 (63%), Gaps = 1/668 (0%) Frame = -3 Query: 2046 MDSLDNETKAESDIQS-QDGQETRVTQELTPTSMNPASSSGCSITSKRRRTSNVWNHFEM 1870 M+S +N ES+ ++ E + T E SS S S+ HF Sbjct: 1 MESENNGISLESNAHPLEENDEIQQTDEKMQGRQKRKLSSQVSTFSE---------HFPK 51 Query: 1869 
LSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKNPDLGQMFLSQSGSLMSM 1690 S + D K A+C+ CG + +CDS++ NL R+ C+ ++GQM S Sbjct: 52 KS-SIDGKAIAKCKHCGIVLNCDSKHEIDNLKRYSENCVGGDTREIGQMISSNQHGSTLT 110 Query: 1689 KPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIY 1510 + + FRE ++ AI H+LP FVEY G R Y++E ++R+T++ ++K++ Sbjct: 111 RSSNLDPEKFRELVIGAIFMHNLPLSFVEYRGSRALSSYLHEDVTLISRNTLKAYMIKMH 170 Query: 1509 HREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPP 1330 E++K++ LL PGRI+LTFDLW SI+TD Y+ L AHF+D+NW LQK+VLNF FM PP Sbjct: 171 RAERSKIKCLLEETPGRINLTFDLWNSITTDTYICLIAHFVDKNWVLQKRVLNFSFMPPP 230 Query: 1329 HTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHV 1150 + +AL EK+YALL EWGIE KLFSVTLD+ ++ FV L+ LN++ L G+FFH+ Sbjct: 231 YNCVALIEKVYALLAEWGIESKLFSVTLDNVLASNAFVELLKKNLNVRKTFLVGGKFFHL 290 Query: 1149 CCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQD 970 C A +LNLIVQD LKE+D V +RESVKY+KGS R++KFLECV L+ L K LRQD Sbjct: 291 RCFAQVLNLIVQDSLKEVDCVVQKVRESVKYVKGSQVRKQKFLECVTLMKLNAKGGLRQD 350 Query: 969 IPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDM 790 + T+WNSTFLML+ ALY+R+AF H + D +Y++CPS +EWERVEKL+K L VFY VT + Sbjct: 351 VSTKWNSTFLMLKRALYFRKAFSHLEIRDSNYRYCPSEDEWERVEKLYKLLAVFYDVTCV 410 Query: 789 FSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAV 610 FS T+YPT+NL+FP + + TL + M D ++ M+ +M KF YWS +ILAIAV Sbjct: 411 FSRTKYPTANLFFPSMFIAHSTLQEHMSGQDVYMKNMSTQMLVKFVKYWSDFSLILAIAV 470 Query: 609 VLDPRYKLRFVEWSYERLYGSGSSQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVS 430 +LDPRYK+ FVEWSY +LYG+ S+Q V + L+SL+ Y + S +S N D Sbjct: 471 ILDPRYKIHFVEWSYGKLYGNDSTQFKNVRDWLFSLYNEYAVKA--SPTPSSFNNTSDEH 528 Query: 429 HGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPC 250 T DF EEFD+++ S+L+ YL E + KEL++L FW++N+YR+P Sbjct: 529 TLTEGKRDFFEEFDSYATVKFGAATQKSQLEWYLSEPMVERTKELNILQFWKENQYRYPE 588 Query: 249 LSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWIFGKQYDMEP 70 L+ MARD+LSIPIS +SE AF++G +++++ RS LKP+I+E+T C ++W+FG+ + Sbjct: 589 
LAAMARDVLSIPISATASEFAFSVGGKILDQHRSSLKPDILEATVCCKDWLFGEVEHEDM 648 Query: 69 DVNEMCED 46 D+N + ED Sbjct: 649 DLNVVIED 656 >gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [Prunus persica] Length = 613 Score = 568 bits (1464), Expect = e-159 Identities = 299/647 (46%), Positives = 408/647 (63%), Gaps = 6/647 (0%) Frame = -3 Query: 1947 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 1780 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 29 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTGN 88 Query: 1779 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 1600 L RH+ C++ DLGQ+ LS+ + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 89 LKRHIESCVKTDTRDLGQLLLSKYDGAILTRSSKFDPMKFRELLLMAIIMHDLPFQFVEY 148 Query: 1599 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 1420 GIR Y+ K V+R+ + DVL +Y+REKAK++ +L S+PGR+ LTFDLWTSI+T Sbjct: 149 AGIRQLFNYVCADIKLVSRNIAKADVLSLYNREKAKLKEILGSVPGRVCLTFDLWTSITT 208 Query: 1419 DEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 1240 D YL LT HF+D NWK +K +LNF FM PPHT +AL EKIY LL +WG+++KLFS+TLD+ Sbjct: 209 DGYLCLTVHFIDVNWKWEKIILNFSFMPPPHTGVALCEKIYRLLTDWGVKKKLFSMTLDN 268 Query: 1239 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1060 AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 269 ASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 328 Query: 1059 YLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 880 Y +GS R++KFL C VSLE KK Sbjct: 329 YARGSQGRKQKFLNCAAQVSLECKKG---------------------------------- 354 Query: 879 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 700 C KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 355 ---DCVK----------IKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 401 Query: 699 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGS-SQINEV 523 D F+ MA +M KKF+ W + +ILAIAV+L+PRYK++FVE+ Y+R +G+ S ++++ Sbjct: 402 DSFMKSMATQMMKKFDKNWKEYSLILAIAVILNPRYKIQFVEFCYKRFASNGARSYVDDM 461 Query: 522 
SEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSE 343 K D M+EFD F +E ++ Sbjct: 462 VSK--------------------------------ECLDVMKEFDNFESEEFTTSAQKTQ 489 Query: 342 LDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVV 163 L LYLDE + D + +L+VLDFW+ N++R+P LS++ARD+LSIPISTV+SE F++ RV+ Sbjct: 490 LQLYLDEAKIDRKTKLNVLDFWKVNQFRYPGLSILARDLLSIPISTVASESTFSVDGRVL 549 Query: 162 NRSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 25 ++ RS LKPE VE+ C+ +WIFG++ + P++ E+ ED++K+ IN Sbjct: 550 DQYRSALKPENVEALVCTLDWIFGEENCTLAPNLEELTEDISKMEIN 596 >gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus persica] Length = 567 Score = 503 bits (1295), Expect = e-139 Identities = 244/424 (57%), Positives = 315/424 (74%), Gaps = 4/424 (0%) Frame = -3 Query: 1950 MNPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTS 1783 ++P++++ +T KRRR TS VW HFE+L + +N+ RA+C +CG Y DSR GT Sbjct: 27 LDPSNNNNAVVTQIGKRRRKLTSAVWTHFEILHIDENNEQRAKCMKCGQKYLFDSRYGTG 86 Query: 1782 NLNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVE 1603 NL RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVE Sbjct: 87 NLKRHIESCVKIDTCDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVE 146 Query: 1602 YEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIS 1423 Y GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ LT DLWTSI+ Sbjct: 147 YSGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSIT 206 Query: 1422 TDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLD 1243 TD YL LT HF+D NWKLQK++LNF FM PPHT +AL EKIY LL +WG+E+KLFS+TLD Sbjct: 207 TDGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLD 266 Query: 1242 DASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESV 1063 +AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+ Sbjct: 267 NASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESI 326 Query: 1062 KYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVID 883 KY++GS R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SAL+Y+RAF+H + D Sbjct: 327 KYVRGSQGRKQKFLNCAAQVSLECKRGLRQDVPTRWNSTFLMIDSALHYQRAFLHLQLSD 
386 Query: 882 PDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKE 703 +YKH EW +++KL KFL VFY VT +F GT+YP +NLYFPQV VV+DTL KA KE Sbjct: 387 SNYKHSLPQNEWGKLKKLSKFLKVFYDVTCLFFGTKYPIANLYFPQVFVVEDTLRKA-KE 445 Query: 702 GDGF 691 D F Sbjct: 446 FDNF 449 Score = 107 bits (268), Expect = 2e-20 Identities = 50/104 (48%), Positives = 77/104 (74%) Frame = -3 Query: 399 EEFDTFSAENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILS 220 +EFD F +E ++L LYL+E + D + +L+VL+FW+ N++R+P LS++ARD+LS Sbjct: 444 KEFDNFESEEFTTSAQKTQLQLYLNEPKIDRKTKLNVLNFWKVNQFRYPELSILARDLLS 503 Query: 219 IPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWIFGK 88 IPISTV+ E AF++G RV+++ S LKPE VE+ C+ +WIFG+ Sbjct: 504 IPISTVAYESAFSVGGRVLDQYHSALKPENVEALVCTHDWIFGE 547 >gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica] Length = 478 Score = 484 bits (1245), Expect = e-134 Identities = 268/571 (46%), Positives = 350/571 (61%), Gaps = 3/571 (0%) Frame = -3 Query: 1728 QMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFV 1549 Q+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY GIR Sbjct: 1 QLLLSKSDGAILTRSSKFDPIKFRELLVMAIIMHDLPFQFVEYAGIRQT----------- 49 Query: 1548 TRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKL 1369 TSI+TD YL LT +F+D NWKL Sbjct: 50 --------------------------------------TSITTDGYLCLTVYFIDVNWKL 71 Query: 1368 QKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNL 1189 QK++LNF FM P HT +AL EKIY LL WG+E+KLFS+TLD+AS+ND FV L+ +LNL Sbjct: 72 QKRILNFSFMPPLHTGVALCEKIYRLLTNWGVEKKLFSLTLDNASSNDTFVELLKGQLNL 131 Query: 1188 KNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVR 1009 K+ LL NG+FFHV CCAHILNLIVQDGLK ID V IRES+KY++GS ++KFL+C Sbjct: 132 KDALLMNGKFFHVRCCAHILNLIVQDGLKHIDDYVGKIRESIKYVRGSQGTKQKFLDCAA 191 Query: 1008 LVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKL 829 VSLE K+ LRQD+PTRWNSTFLM+ SALYY+RAF+H + D +YKH S +EW ++EKL Sbjct: 192 QVSLECKRGLRQDVPTRWNSTFLMINSALYYQRAFLHLQLSDSNYKHSLSQDEWGKLEKL 251 Query: 828 
FKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFEN 649 KFL VFY VT +F GT+YPT+NLYFPQV VV+DTL KA Sbjct: 252 SKFLKVFYDVTCLFFGTKYPTANLYFPQVFVVEDTLKKA--------------------K 291 Query: 648 YWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVSEKLYSLFETYKQ--NST 475 YW + +ILAIAV+LDPRYK++FV++ Y+RLYG S ++ +V + L+SLF+ Y + S+ Sbjct: 292 YWKEYSLILAIAVILDPRYKIQFVKFCYKRLYGYNSKEMTKVRDMLFSLFDLYVRIYTSS 351 Query: 474 NSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDWRKEL 295 SV TS VS G D M EFD F Sbjct: 352 ESVSGTS-----SVSIGARSHVDDM-EFDNFEM--------------------------- 378 Query: 294 DVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTF 115 N++R+P LS++ RD+LSIPISTV+SE AF++G R++++ RS LKP+ VE Sbjct: 379 --------NQFRYPELSILVRDLLSIPISTVASESAFSVGGRMLDQYRSALKPKNVEVLV 430 Query: 114 CSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 25 C+R+WIFGK+ Y + P++ E+ ED++K+ IN Sbjct: 431 CTRDWIFGKENYTLAPNLEELTEDISKMEIN 461 >gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana] Length = 745 Score = 439 bits (1130), Expect = e-120 Identities = 261/676 (38%), Positives = 389/676 (57%), Gaps = 7/676 (1%) Frame = -3 Query: 2052 MEMDS--LDNETKAESDIQSQDGQETRVTQELT-PTSMNPASSSGCSITSKRRRTSNVWN 1882 ME+D+ L +E + Q D ++ + Q L T+ + G S S+ R W Sbjct: 91 MELDTQNLVDEDNFNLEDQEMDDEDPEMDQILPHDTASSGTVERGKSSVSRFRAAC--WK 148 Query: 1881 HFEMLSLTADNKPRARCRQCGAIYSCD-SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSG 1705 +F+ + K C+ C Y + RNGT+ +NRH+R C +K P G Sbjct: 149 NFDRGQKYPNGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC--EKTP----------G 196 Query: 1704 SLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEED 1525 S + K + VFRE + +A+++H+LP+ FVEYE IR+A Y+N +F +R+T D Sbjct: 197 STPRISR-KVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYVNPSIEFWSRNTAASD 255 Query: 1524 VLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKLQKKVLNFH 1345 V KIY REK K++ L IPGRI LT DLW +++ + Y+ LTAH++D + L+ K+L+F Sbjct: 256 VYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFC 315 Query: 1344 FMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNG 1165 PPH+ +A++ K+ LL 
+WGIE+K+F++T+D+AS ND L+ +L + L+C+G Sbjct: 316 AFPPPHSGVAIAMKLSELLKDWGIEKKVFTLTVDNASANDTMQSILKRKL--QKHLVCSG 373 Query: 1164 EFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVRLVSLETKK 985 EFFHV C AHILNLIVQDGL+ I ++ IRE+VKY+KGS R F C+ + ++T+ Sbjct: 374 EFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEA 433 Query: 984 ALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFY 805 +L D+ TRWNST+ ML A+ ++ A +D YK PS EWER E + L F Sbjct: 434 SLVLDVSTRWNSTYHMLSRAIQFKDVLHSLAEVDRGYKSFPSAVEWERAELICDLLKPFA 493 Query: 804 KVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVI 625 ++T + SG+ YPT+N+YF QV ++ L D + M +M +K++ YW I Sbjct: 494 EITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRAIREMVEDMTEKYDKYWEDFSDI 553 Query: 624 LAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEVSEKLYSLFETYKQNSTNSVERTSKN 448 LA+A VLDPR K +E+ Y L S + + V +K+ LF YK+ + N TS++ Sbjct: 554 LAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAYKRTTCNVAASTSQS 613 Query: 447 LQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDW--RKELDVLDFWR 274 +KD+ G + + FS N S LD+YL+E D +++DV+ +W+ Sbjct: 614 SRKDIPFG------YDGFYSYFSQRN---GTGKSPLDMYLEEPVLDMVSFRDMDVIAYWK 664 Query: 273 DNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWIF 94 +N RF LS MA DILSI I+TV+SE F+IG RV+N+ RS L P V++ C+RNW Sbjct: 665 NNVSRFKELSSMACDILSISITTVASESTFSIGSRVLNKYRSCLLPTNVQALLCTRNWFR 724 Query: 93 GKQYDMEPDVNEMCED 46 G Q D+E D + ED Sbjct: 725 GFQ-DVETDEIQGQED 739 >gb|AAD48963.1|AF147263_5 contains similarity to transposases [Arabidopsis thaliana] gi|7267311|emb|CAB81093.1| AT4g05510 [Arabidopsis thaliana] Length = 604 Score = 430 bits (1105), Expect = e-117 Identities = 251/617 (40%), Positives = 355/617 (57%), Gaps = 4/617 (0%) Frame = -3 Query: 1911 KRRRTSNVWNHFEMLSLTADNKPR-ARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKNPD 1735 KR RTS++W++F +L +N + A C++C Y GTSNL RH R C Sbjct: 33 KRSRTSDMWDYF---TLEDENDGKIAYCKKCLKPYPILPTTGTSNLIRHHRKC------- 82 Query: 1734 LGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAK 1555 G + K K KV REK I++HDLPF VEYE +RD + Y+N K Sbjct: 83 
-------SMGLDVGRKTTKIDHKVVREKFSRVIIRHDLPFLCVEYEELRDFISYMNPDYK 135 Query: 1554 FVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNW 1375 TR+T DV+K + +EK +++ L IP RI LT D WTS+ D Y+ LTAH++D W Sbjct: 136 CYTRNTAAADVVKTWEKEKQILKSELERIPSRICLTSDCWTSLGGDGYIVLTAHYVDTRW 195 Query: 1374 KLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNEL 1195 L K+L+F M PPHT AL+ KI+ L EWGIE+K+F++TLD+A+ N+ L + L Sbjct: 196 ILNSKILSFSDMLPPHTGDALASKIHECLKEWGIEKKVFTLTLDNATANNSMQEVLIDRL 255 Query: 1194 NLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLEC 1015 L N L+C GEFFHV CCAH+LN IVQ+GL I ++ IRE+VKY+KGS RR EC Sbjct: 256 KLDNNLMCKGEFFHVRCCAHVLNRIVQNGLDVISDALSKIRETVKYVKGSTSRRLALAEC 315 Query: 1014 VRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVE 835 V + + L D+ TRWNST+LML AL Y+RA F ++D +YK+CPS EEW+R + Sbjct: 316 ---VEGKGEVLLSLDVQTRWNSTYLMLHKALKYQRALNRFKIVDKNYKNCPSSEEWKRAK 372 Query: 834 KLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKF 655 + + L FYK+T++ SG Y TSNLYF V +Q L EM KF Sbjct: 373 TIHEILMPFYKITNLMSGRSYSTSNLYFGHVWKIQCLL----------------EMRLKF 416 Query: 654 ENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEVSEKLYSLFETYKQNS 478 + YW + VILA+ VLDPR K + ++ Y+ L + S + I+ + K+ LF Y++ Sbjct: 417 DKYWKEYSVILAMRAVLDPRMKFKLLKRCYDELDPTTSQEKIDFLETKITELFGEYRK-- 474 Query: 477 TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDWRK- 301 + T +L L D E + SA LD+YL++ + + + Sbjct: 475 --AFPVTPVDL--------FDLDDVPEVEEGKSA-----------LDMYLEDPKLEMKNH 513 Query: 300 -ELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVE 124 L+VL +W++NR RF L+ MA D+LSIPI++V+SE +F+IG V+N+ RS L P V+ Sbjct: 514 PNLNVLQYWKENRLRFGALAYMAMDVLSIPITSVASESSFSIGSHVLNKYRSRLLPTNVQ 573 Query: 123 STFCSRNWIFGKQYDME 73 + C+R+W++G D E Sbjct: 574 ALLCTRSWLYGFVSDEE 590 >ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] gi|548854912|gb|ERN12810.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] Length = 841 Score = 421 bits (1082), Expect = e-115 Identities 
= 240/648 (37%), Positives = 378/648 (58%), Gaps = 10/648 (1%) Frame = -3 Query: 1923 SITSKRRRTSN-VWNHFEMLSLTADNKPRARCRQCGAIYSCDS---RNGTSNLNRHVRIC 1756 S +SKRR+T++ VW HF M + + +ARC+ C ++ + + GTS+L RH+ IC Sbjct: 15 SSSSKRRKTTSIVWEHFTMETFIGGCR-KARCKYCLHTFAFGNGAKQLGTSHLKRHLGIC 73 Query: 1755 IRKKNPDLGQMFLS-----QSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGI 1591 + +N D Q L+ ++ S+ PKF RE + I+ H+ P VE+ Sbjct: 74 PKNRNSDRKQELLTLTPKDKNEGNTSLSNPKFDQSRSREDLARMIILHEYPLSVVEHPAF 133 Query: 1590 RDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEY 1411 + + + K V + TV +D L IY +EK + LL +IPGRISL+ D WT+ T EY Sbjct: 134 INFVQSLQPRFKMVNQATVRDDCLAIYQKEKQSLMQLLQTIPGRISLSLDKWTTEETLEY 193 Query: 1410 LSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDAST 1231 + +T HF+D ++KLQK+VLNF + P T LS+ I L +W I KL +VTLD T Sbjct: 194 MRITGHFVDCDFKLQKRVLNFTMLPYPFTRNDLSDVILTCLTDWNILTKLSTVTLDRHHT 253 Query: 1230 NDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLK 1051 +D L++ L+ KN LL +G F+VCCCA +LNLIVQDGL+ I+ + IRESVKY+K Sbjct: 254 DDCIGSNLKDCLSSKNMLLLSGRVFNVCCCADVLNLIVQDGLEAINDVIHKIRESVKYVK 313 Query: 1050 GSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYK 871 S + F + + + + +KK L D+ WN+TFLMLE+AL +++AF D +Y+ Sbjct: 314 ASQAHEQNFSKLFQQLEIPSKKDLCLDVQGEWNTTFLMLEAALEFKQAFSCLGSHDSNYE 373 Query: 870 HCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGF 691 PS +EW++VE L +L VFY V FS +PT+NLYF ++ + L + D Sbjct: 374 GAPSEDEWKKVEVLCIYLKVFYDVLRAFSEVTHPTANLYFHELWKIHMHLNHTVTSPDIV 433 Query: 690 VHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINE-VSEK 514 + + + KF+ YW + ++LAIAV +DPR+K++FVE+S+ ++YG+ + V E Sbjct: 434 IIPVIRNLQDKFDKYWREYSLVLAIAVSMDPRFKMKFVEFSFSKVYGTNAFMYTRVVIEA 493 Query: 513 LYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDL 334 + L+ Y +N V + N + S+ + ++ D +++FD F +E SELD Sbjct: 494 IRDLYSQYARNIPGPVPLATYNGDQSSSNNSFQINDGLQDFDQFLSELSGSQQTKSELDQ 553 Query: 333 YLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRS 154 YL+E +E D+L +W+ + ++P LS MARDIL+I 
++TV SE FN G +V+++ Sbjct: 554 YLEEPLFPRNQEFDILRWWKMSAPKYPVLSEMARDILAIRVTTVDSESMFNTGGKVLDQY 613 Query: 153 RSVLKPEIVESTFCSRNWIFGKQYDMEPDVNEMCEDVTKLNINDSLIS 10 +S L PE +E+ C+R+W+ +++E ++ T LN++DS +S Sbjct: 614 QSSLSPETIEALICARDWL---HHELETSLD------TVLNMSDSTLS 652 >ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] gi|482560944|gb|EOA25135.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] Length = 547 Score = 418 bits (1074), Expect = e-114 Identities = 217/523 (41%), Positives = 334/523 (63%), Gaps = 3/523 (0%) Frame = -3 Query: 1680 KFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHRE 1501 K V RE + + I+ HDLPF FVEY +R+ L Y+N K ++R+T DVLK + Sbjct: 7 KIDHSVVRELITLVIICHDLPFSFVEYPRVRELLKYLNPEYKTISRNTAVADVLKFHGIR 66 Query: 1500 KAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTA 1321 K +++ L + RI LT D+W SIS + Y+ LTAH++D +WKL+ K+L+F M PPH+ Sbjct: 67 KEQMKQELAGVGNRICLTCDVWRSISIEGYICLTAHYVDDSWKLKSKILSFCAMPPPHSG 126 Query: 1320 IALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCC 1141 L++K+ + L +WGIE+K+FS+TLD+AS+ND LR++L+ ++GLLC+GEFFH+ C Sbjct: 127 FELAKKVLSCLEDWGIEKKIFSLTLDNASSNDNMQSILRDQLSSRHGLLCDGEFFHIRCS 186 Query: 1140 AHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPT 961 AH+LNLIVQ GLK ++S + IRE+VK++K S R+ F ECV V ++ L+ D+ T Sbjct: 187 AHVLNLIVQVGLKFVESPLHKIRETVKWIKWSEGRKDLFKECVIDVGIKYTAGLKMDVST 246 Query: 960 RWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSG 781 RWNST+LML S + YRRAF + +YK CPS EEW + EK++ FL FY +T +FSG Sbjct: 247 RWNSTYLMLGSVIKYRRAFSLLERAERNYKFCPSDEEWNKAEKIYTFLEPFYDITKLFSG 306 Query: 780 TQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLD 601 T YPT+NLYF Q+ ++ L +GD + MA EM KF+ YW + +IL+I +LD Sbjct: 307 TSYPTANLYFAQIWKIECLLNSYSNDGDMELQNMANEMRTKFDKYWEEYSIILSIGAILD 366 Query: 600 PRYKLRFVEWSYERLYGSGS-SQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSHG 424 PR K+ + + +++L S + +++ V +KL LF+ YK T++ +S S G Sbjct: 367 
PRMKVEILTYCFDKLDPSTTKAKVEVVKQKLNLLFDQYKSTPTSTNVSSS-------SRG 419 Query: 423 THRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSD--WRKELDVLDFWRDNRYRFPC 250 T + +F + + S+L +YL++ R + + +++DVL++W++ R+ Sbjct: 420 TDFIAKTHSDFKAYE-KRTILEEGKSKLAVYLEDDRLEMTFYEDMDVLEWWKNQTQRYGE 478 Query: 249 LSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVES 121 L+ MA D+LSIPI++V++E +F+IG V+N+ RS L P VE+ Sbjct: 479 LARMACDVLSIPITSVAAESSFSIGAHVLNKYRSRLLPRHVEA 521 >ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] gi|548861481|gb|ERN18855.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] Length = 685 Score = 417 bits (1073), Expect = e-114 Identities = 231/622 (37%), Positives = 371/622 (59%), Gaps = 11/622 (1%) Frame = -3 Query: 1920 ITSKRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 1741 + SKR+ S+VW+ FE + + D +A C+ C S +GTS+L RH+ C ++ + Sbjct: 59 LPSKRKTISSVWDEFEKVR-SEDGSVKAACKHCHRNLVGSSAHGTSHLKRHLGRCAKRVH 117 Query: 1740 PDLGQMFLS---QSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYI 1570 GQ + + G S+ KF R + IL H+ P VE+ R + + Sbjct: 118 IGSGQQLVVTCIKKGEASSVNF-KFDQGRSRYDLAKMILLHEYPSSMVEHTTFRTFVRNL 176 Query: 1569 NEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHF 1390 V+ T+E D+++IY +EK K+ L IP RISL+ ++W+S EYL L AH+ Sbjct: 177 QPLFSMVSPSTIESDIIEIYKKEKKKLYEELEKIPSRISLSANIWSSCQNLEYLCLIAHY 236 Query: 1389 LDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGE 1210 +D W LQK++L+F + P T A++E + LL++W +++KLFS+TL+ AS ND+ Sbjct: 237 IDDAWVLQKQILSFVNL-PSRTGGAIAEVLLDLLSQWNVDKKLFSITLNSASYNDVAASS 295 Query: 1209 LRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRR 1030 LR+ L+ + L G+ FH+CCC+H++NL+VQDGL+ I + IRES+KY+K S+ R+ Sbjct: 296 LRSRLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQE 355 Query: 1029 KFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEE 850 +F E + + +++K+ + D+PTRWNST+ ML+ L R AF FA D PS +E Sbjct: 356 RFNEIINQLGIQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSCFAQCDSMCNMVPSEDE 415 Query: 849 
WERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVE 670 WERV+++ L +FY +T+ F G++YPT+NLYFP+V + L + + + MA++ Sbjct: 416 WERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIK 475 Query: 669 MNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS-QINEVSEKLYSLFET 493 M +KF+ YW S ++LAIAVV+DPR+KL+FVE+SY ++YG+ + I V + +Y L Sbjct: 476 MKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRMVRQGVYDLCNE 535 Query: 492 YK------QNSTNSVERTSKNLQKDV-SHGTHRLPDFMEEFDTFSAENXXXXXXXSELDL 334 Y+ NS +S+ ++ V +HG + EF+ F E+ SELD Sbjct: 536 YESKEPLASNSESSLAVSASTSSGGVDTHGKL----WAMEFEKFVRESSSNQARKSELDR 591 Query: 333 YLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRS 154 YL+E + ++ ++W+ N RFP LS MARDIL IP+STV+S+ F+IG +V+++ Sbjct: 592 YLEEPIFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVSTVTSDSTFDIGGQVLDQY 651 Query: 153 RSVLKPEIVESTFCSRNWIFGK 88 RS L PE +++ C+++W++ + Sbjct: 652 RSSLLPETIQALMCAQDWLWNE 673 >emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera] Length = 1266 Score = 417 bits (1073), Expect = e-114 Identities = 227/632 (35%), Positives = 363/632 (57%), Gaps = 12/632 (1%) Frame = -3 Query: 1956 TSMNPASSS-----GCSITSKRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRN 1792 T NPA+ S G KR+ TS VWN FE + + D + A C+ C + DS+N Sbjct: 92 TGSNPATGSTSTTDGSLTCKKRKLTSIVWNEFEKVII--DGQDYAICKHCKSKLKADSKN 149 Query: 1791 GTSNLNRHVRICIRKKNPDLGQMFLS---QSGSLMSMKPPKFSLKVFREKMMMAILKHDL 1621 GT +L+ H+ CI+++N D+ Q FL+ + + + F + REK+ AI+ H+ Sbjct: 150 GTKHLHVHLDRCIKRRNVDIKQQFLAIERKGYGKVQIGGFTFDQDISREKLARAIILHEY 209 Query: 1620 PFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFD 1441 P V++ G RD + K V+R+T+++D++KIY EK K+ + L + R+++T D Sbjct: 210 PLSIVDHAGFRDFASSLQPLFKMVSRNTIKDDIMKIYEFEKGKMSSYLEKLETRMAITTD 269 Query: 1440 LWTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKL 1261 +WTS Y+++T H++D++W L ++ F ++ PPHT LS+ + L +W ++RKL Sbjct: 270 MWTSNQKKGYMAITVHYIDESWLLHHHIVRFVYVPPPHTKEVLSDVLLDFLLDWNMDRKL 329 Query: 1260 FSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVV 1081 ++T+D+ S+ND + L 
+L+ LL NG+ FH+ C AH+LNLIV++GL I + Sbjct: 330 STITVDNCSSNDGMIDILSEKLSSSGSLLLNGKIFHMRCAAHVLNLIVKEGLDVIRVEIE 389 Query: 1080 DIRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFM 901 IRESV Y + R KF + R + L K L D TRWNST+LML A+ Y+ F Sbjct: 390 KIRESVAYWSATPSRVEKFEDAARQLRLPCNKKLCLDCKTRWNSTYLMLSIAITYKDVFP 449 Query: 900 HFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTL 721 + Y PS EEW ++ + L +FY +T +FSG YPT+N +F +V +++ L Sbjct: 450 RLKQREKLYTTVPSEEEWNLAREICERLKLFYNITKLFSGRNYPTANTFFIKVCEIKEAL 509 Query: 720 TKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGS-G 544 + + V MA M +KF+ YWS +++AIAVVLDPRYK++ +E+ + +YGS Sbjct: 510 YDWLICSNEVVSTMASSMLEKFDKYWSGCHIVMAIAVVLDPRYKMKILEFYFPIMYGSEA 569 Query: 543 SSQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSH---GTHRLPDFMEEFDTFSAE 373 SS+I ++ + Y L Y Q+ + ++TS + VS+ T+ D + +FD F Sbjct: 570 SSEIGKIRQLCYDLLSEY-QSKSKMGQQTSSHGASSVSNLFELTYDEQDPLSKFDLFVHS 628 Query: 372 NXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSE 193 SELD YL+E + DVL +W+ N ++P L ++ RDI +IP+STV+SE Sbjct: 629 TSEEGHAKSELDYYLEETVLPRISDFDVLSWWKTNGIKYPTLQMIVRDIYAIPVSTVASE 688 Query: 192 VAFNIGRRVVNRSRSVLKPEIVESTFCSRNWI 97 AF+ G R+V++ RS L P +E+ C+++W+ Sbjct: 689 SAFSTGGRMVSKHRSRLHPNTLEALMCAQSWL 720 >gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana] Length = 577 Score = 417 bits (1072), Expect = e-113 Identities = 234/568 (41%), Positives = 342/568 (60%), Gaps = 3/568 (0%) Frame = -3 Query: 1779 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 1600 +NRH+R C +K P GS + K + VFRE + +A+++H+LP+ FVEY Sbjct: 1 MNRHMRSC--EKTP----------GSTPRISR-KVDMMVFREMIAVALVQHNLPYSFVEY 47 Query: 1599 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 1420 E IR+A Y N +F +R+T DV KIY REK K++ L IPGRI LT DLW +++ Sbjct: 48 ERIREAFTYANPSIEFWSRNTAAFDVYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTV 107 Query: 1419 DEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 1240 + Y+ LTAH++D + L+ K+L+F PPH+ +A++ K+ LL +WGIE+K+F++T+D+ Sbjct: 108 
ESYICLTAHYVDVDGVLKTKILSFCAFPPPHSGVAIAMKLSELLKDWGIEKKVFTLTVDN 167 Query: 1239 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1060 AS ND L+ +L + L+C+GEFFHV C AHILNLIVQDGL+ I ++ IRE+VK Sbjct: 168 ASANDTMQSILKRKL--QKDLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVK 225 Query: 1059 YLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 880 Y+KGS R F C+ + ++T+ L D+ TRWNST+ ML A+ ++ A +D Sbjct: 226 YVKGSETRENLFQNCMDTIGIQTEANLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVDR 285 Query: 879 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 700 YK PS EWER E + L F ++T + SG+ YPT+N+YF QV ++ L Sbjct: 286 GYKSFPSAVEWERAELICDLLKPFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSH 345 Query: 699 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEV 523 D + M +M +K++ YW ILA+A VLDPR K +E+ Y L S + + V Sbjct: 346 DRVIREMVEDMTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHV 405 Query: 522 SEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSE 343 +K+ LF YK+ + N TS++ +KD+ G + + FS N S Sbjct: 406 RDKMVQLFGAYKRTTCNVAASTSQSSRKDIPFG------YDGFYSYFSQRN---GTGKSP 456 Query: 342 LDLYLDEQRSDW--RKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRR 169 LD+YL+E D +++DV+ +W++N RF LS MA DILSIPI+TV+SE AF+IG R Sbjct: 457 LDMYLEEPVLDMVSFRDMDVIAYWKNNVSRFKELSSMACDILSIPITTVASESAFSIGSR 516 Query: 168 VVNRSRSVLKPEIVESTFCSRNWIFGKQ 85 V+N+ RS L P V++ C+RNW G Q Sbjct: 517 VLNKYRSCLLPTNVQALLCTRNWFRGFQ 544 >gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana] Length = 659 Score = 415 bits (1066), Expect = e-113 Identities = 232/636 (36%), Positives = 364/636 (57%), Gaps = 10/636 (1%) Frame = -3 Query: 1911 KRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKNPDL 1732 ++++ + W+ F + + D K RARC CG + GTS +NRH+ +C + P+ Sbjct: 29 RKKQRALCWDEFTSVGIEEDGKERARCHHCGIKLVVEKSYGTSTMNRHLTLCPERPQPET 88 Query: 1731 GQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKF 1552 PK+ KV RE I+ HD+PF++VEYE +R ++N K Sbjct: 89 R---------------PKYDHKVDREMTSEIIIYHDMPFRYVEYEKVRARDKFLNPDCKP 133 Query: 1551 
VTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST-DEYLSLTAHFLDQNW 1375 + R T DV K + EKAK+ ++ G++ LT DLW+S ST Y+ +T+H++D++W Sbjct: 134 ICRQTAALDVFKRFEIEKAKLIDVFAKHNGQVCLTADLWSSRSTVTGYICVTSHYIDESW 193 Query: 1374 KLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNEL 1195 +L K+L F + PPH +++K+Y L EWG+E+K+ ++TLD+AS N L++ L Sbjct: 194 RLNNKILAFCDLKPPHNGEEIAKKVYDCLKEWGLEKKILTITLDNASANTSMQTILKHRL 253 Query: 1194 NLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLEC 1015 NGLLC G F HV CCAHILNLIVQ GL+ + +I ESVK++K S R+ F C Sbjct: 254 QSGNGLLCGGNFLHVRCCAHILNLIVQAGLELASGLLENITESVKFVKASESRKDSFATC 313 Query: 1014 VRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVE 835 + V +++ L D+ TRWNST+ ML AL +R+AF + + Y P+ EE +R E Sbjct: 314 LECVGIKSGAGLSLDVSTRWNSTYEMLARALKFRKAFAILNLYERGYCSLPTEEECDRGE 373 Query: 834 KLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKF 655 K+ L F +T FSG +YPT+N+YF QV ++ L K D V MA +M KKF Sbjct: 374 KICDLLKPFNTITTYFSGVKYPTANIYFIQVWKIELLLMKYANCDDVDVREMAKKMQKKF 433 Query: 654 ENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYG-SGSSQINEVSEKLYSLFETYKQNS 478 YW++ VILA+ LDPR KL+ + +Y ++ + +++ V L L+E YK S Sbjct: 434 AKYWNEYSVILAMGAALDPRLKLQILRSAYNKVDPVTAEGKVDIVRNNLILLYEEYKTKS 493 Query: 477 TNSVERTSKNLQKDVSHGTHRLPDFMEE-FDTFSAENXXXXXXXSELDLYL-DEQRSDWR 304 +S ++ ++ + + D ++ F+ S+ S L++YL DE R + + Sbjct: 494 ASSSNSSTTLTPHELLNESPLEADVNDDLFELESSLISASKSTKSTLEIYLDDEPRLEMK 553 Query: 303 --KELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEI 130 ++++L FW++N++R+ L+ MA D+LSIPI+TV+SE AF++G RV+N R+ L P+ Sbjct: 554 TFSDMEILSFWKENQHRYGDLASMASDLLSIPITTVASESAFSVGGRVLNPFRNRLLPQN 613 Query: 129 VESTFCSRNWIFGKQYDMEPDVNEMC----EDVTKL 34 V++ C+RNW+ G D+E D+ E+ D TK+ Sbjct: 614 VQALICTRNWLLG-YADLEGDIEELFAEEDNDATKM 648 >gb|EOY04304.1| BED zinc finger,hAT family dimerization domain isoform 3, partial [Theobroma cacao] Length = 680 Score = 414 bits (1063), Expect = e-112 Identities = 234/635 (36%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = -3 Query: 1917 
TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 1741 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 1740 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 1618 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 1617 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 1438 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 1437 WTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLF 1258 W S D Y +TAHF+D W+LQK++L F + PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 1257 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1078 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1077 IRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 898 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 897 FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 718 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 717 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 538 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 537 QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 382 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 381 SAENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 202 + S+LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 201 
SSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWI 97 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 >gb|EOY04303.1| BED zinc finger,hAT family dimerization domain isoform 2 [Theobroma cacao] Length = 689 Score = 414 bits (1063), Expect = e-112 Identities = 234/635 (36%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = -3 Query: 1917 TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 1741 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 1740 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 1618 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 1617 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 1438 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 1437 WTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLF 1258 W S D Y +TAHF+D W+LQK++L F + PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 1257 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1078 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1077 IRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 898 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 897 FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 718 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 717 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 538 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 537 
QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 382 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 381 SAENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 202 + S+LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 201 SSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWI 97 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 >gb|EOY04302.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] Length = 692 Score = 414 bits (1063), Expect = e-112 Identities = 234/635 (36%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = -3 Query: 1917 TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 1741 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 1740 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 1618 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 1617 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 1438 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 1437 WTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKLF 1258 W S D Y +TAHF+D W+LQK++L F + PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 1257 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1078 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1077 IRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 898 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 897 
FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 718 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 717 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 538 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 537 QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 382 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 381 SAENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 202 + S+LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 201 SSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWI 97 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 >gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] Length = 682 Score = 412 bits (1060), Expect = e-112 Identities = 245/619 (39%), Positives = 348/619 (56%), Gaps = 12/619 (1%) Frame = -3 Query: 1962 TPTSMN---PASSSGCSITSKRRRTSNVWNHFEML--SLTADNKPRARCRQC-GAIYSCD 1801 TP+S N PA S S T R+ TS VW H+++ SL D RA C+ C G Sbjct: 34 TPSSQNDNIPAPSVS-SETRNRKWTSPVWQHYKLFDASLFPDGIARAICKYCDGGPTLAY 92 Query: 1800 SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDL 1621 S NGTSN RH C K P LG L+ GS + P V++E++ +A+++H Sbjct: 93 SGNGTSNFKRHTETC--PKRPLLGVAHLTSDGSFIKKMDPL----VYKERVALAVIRHAF 146 Query: 1620 PFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFD 1441 PF + EY+G R +NE K ++R+T+ +KI+ REK ++ L ++PG+I LT D Sbjct: 147 PFSYAEYDGNRWLHEGLNESYKPISRNTLRNYCMKIHKREKQILKESLSNLPGKICLTTD 206 Query: 1440 LWTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEKIYALLNEWGIERKL 1261 +WT+ Y+SLTAH++D W L K+LNF + PPH A +L + IYA L EW I K+ Sbjct: 207 MWTAFVGMGYISLTAHYIDSEWNLHSKILNFCHLEPPHDAPSLHDSIYAKLKEWDIRSKI 266 Query: 1260 FSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVV 
1081 F++TLD+A ND L N L+L + +LC+GE+FHV C AHILNLIVQDGLK IDS V Sbjct: 267 FTITLDNARCNDNMQDLLMNSLSLHSPILCDGEYFHVRCAAHILNLIVQDGLKVIDSGVR 326 Query: 1080 DIRESVKYLKGSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAF- 904 +R V ++ GS +R KF + ++T K L D TRWNST+ MLE A+ YR F Sbjct: 327 KLRMVVAHIVGSERRLIKFKGNASALGVDTSKKLCLDCVTRWNSTYNMLERAMIYRNVFP 386 Query: 903 ----MHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLV 736 DP + PS EW R+ K+ + L F +T + SG +YPT+NLYF V Sbjct: 387 TMRGPEMKKFDPHFPEPPSEAEWIRIVKIVELLKPFDHITTLISGRKYPTANLYFKSVWK 446 Query: 735 VQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERL 556 +Q LT+ K D + MA M KF+ YW +IL+ A +LDPRYKL F+++ + +L Sbjct: 447 IQYLLTRYAKCNDTHLKDMADLMRIKFDKYWENYSMILSFAAILDPRYKLPFIKYCFHKL 506 Query: 555 -YGSGSSQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFS 379 S + V +K Y L+E Y + S + ++ TS + +PD + F F Sbjct: 507 DPESAELKTKVVKDKFYKLYEEYVKYSPHVLKETSVQM----------IPDELPGFANF- 555 Query: 378 AENXXXXXXXSELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVS 199 + S LD YLD+ R D +DVL +W++N ++ L+ MA DIL+I I+TV+ Sbjct: 556 -DGGAVIGGLSYLDTYLDDARLDHTLNIDVLKWWKENESKYLVLAEMAIDILTIQINTVA 614 Query: 198 SEVAFNIGRRVVNRSRSVL 142 SE AF + RV+ + R+ L Sbjct: 615 SESAFRMESRVLMKWRTTL 633 >gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana] Length = 633 Score = 405 bits (1041), Expect = e-110 Identities = 247/663 (37%), Positives = 366/663 (55%), Gaps = 7/663 (1%) Frame = -3 Query: 2052 MEMDS--LDNETKAESDIQSQDGQETRVTQELT-PTSMNPASSSGCSITSKRRRTSNVWN 1882 ME+D+ L +E + Q D ++ + Q L T+ + + G S S+ R W Sbjct: 1 MELDTQNLVDEDNFNLEDQEMDHEDPEMDQILPHETASSGTAERGNSSVSRFRAAC--WK 58 Query: 1881 HFEMLSLTADNKPRARCRQCGAIYSCD-SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSG 1705 +F+ + K C+ C Y + RNGT+ +NRH+R C +K P G Sbjct: 59 NFDRGQKYPNGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC--EKTP----------G 106 Query: 1704 SLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEED 1525 S + K + VFRE + +A+++H+LP+ FVEYE IR+A Y N +F +R+T D Sbjct: 107 
STPRISR-KVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPSIEFWSRNTAASD 165 Query: 1524 VLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKLQKKVLNFH 1345 V KIY REK K++ L IPGRI LT DLW +++ + Y+ LTAH++D + L+ K+L+F Sbjct: 166 VYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFS 225 Query: 1344 FMSPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNG 1165 PPH+ +A++ K+ LL +WGIE+K+F++T+D+AS ND L+ + L+ L+C+G Sbjct: 226 AFPPPHSGVAIAMKLSELLKDWGIEKKIFTLTVDNASANDTMQSILKRK--LQKDLVCSG 283 Query: 1164 EFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVRLVSLETKK 985 EFFHV C AHILNLIVQDGL+ I ++ IRE+VKY+KGS R F C+ + ++T+ Sbjct: 284 EFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEA 343 Query: 984 ALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFY 805 +L D+ TRWNST+ ML A+ ++ A +D YK PS EWER E + L F Sbjct: 344 SLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVDRVYKSFPSAVEWERAELICDLLKPFA 403 Query: 804 KVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVI 625 ++T + S +M +K++ YW I Sbjct: 404 EITKLIS-------------------------------------DMTEKYDKYWEDFSDI 426 Query: 624 LAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEVSEKLYSLFETYKQNSTNSVERTSKN 448 LA+A VLDPR K +E+ Y L S + + V +K+ LF YK+ + N TS++ Sbjct: 427 LAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAYKRTTCNVAASTSQS 486 Query: 447 LQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDW--RKELDVLDFWR 274 +KD+ G + + FS N S LD+YL+E D K++DV+ +W+ Sbjct: 487 SRKDIPFG------YDGFYSYFSQRN---GTGKSPLDMYLEEPVLDMVSFKDMDVIAYWK 537 Query: 273 DNRYRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWIF 94 +N RF LS MA DILSIPI+TV+SE AF+IG RV+N+ RS L P V++ C+RNW Sbjct: 538 NNVSRFKELSSMACDILSIPITTVASESAFSIGSRVLNKYRSCLLPTNVQALLCTRNWFR 597 Query: 93 GKQ 85 G Q Sbjct: 598 GFQ 600 >ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] gi|355504225|gb|AES85428.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] Length = 555 Score = 403 bits (1036), Expect = e-109 Identities = 226/537 (42%), Positives = 325/537 (60%), Gaps = 14/537 
(2%) Frame = -3 Query: 1662 FREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRN 1483 F E IL HDLPF F E EG+R ++N R+ +E V +Y +EK K++ Sbjct: 19 FVEICASTILAHDLPFHFFELEGMRKYSEFLNPNIPIPPRNVIEAYVSHLYTKEKPKLKQ 78 Query: 1482 LLHSIPGRISLTFDLWTSISTDEYLSLTAHFLDQNWKLQKKVLNFHFMSPPHTAIALSEK 1303 L +IP RISL+FDLW S +T+ Y+ LTAHF+D NWKL KV+NF + PP T+ + E+ Sbjct: 79 QLTTIPNRISLSFDLWESNTTETYICLTAHFVDANWKLNSKVINFRLVYPP-TSGEICER 137 Query: 1302 IYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNL 1123 + LLN+WGIE+K+FS+T+DD+S N++ +L+ +L L+NGLLC+GEFFHV C A +LN Sbjct: 138 MVELLNDWGIEKKIFSLTIDDSSENEILQEQLKTQLVLQNGLLCDGEFFHVNCFARVLNQ 197 Query: 1122 IVQDGLKEIDSSVVDIRESVKYLKGSNQRRRKFLECVRLVS-LETKKALRQDIPTRWNST 946 IV++ LK + V IRES+ +++ S RR KF EC V +++ L DI +ST Sbjct: 198 IVEEALKLVSCGVHKIRESIMFVRHSKSRREKFKECFEKVGGVDSSVHLHLDISMSLSST 257 Query: 945 FLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPT 766 +++LE AL YR AF F + D Y CPS EEW+RVEK+ FL F + +M + T +PT Sbjct: 258 YMLLERALKYRCAFESFHLYDDSYDLCPSAEEWKRVEKICAFLLPFCETANMINSTTHPT 317 Query: 765 SNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKL 586 SNLYF QV VQ L ++ + D + MA M KFE YW + V+LA+ VLDPR K Sbjct: 318 SNLYFLQVWKVQCVLVDSLGDEDEDIKKMAERMMSKFEKYWDEYSVVLALGAVLDPRMKF 377 Query: 585 RFVEWSYERLYGSG-SSQINEVSEKLYSLFETYKQNSTNS-VERTSKN---------LQK 439 + + Y +L S ++ +V KL LFE + NST + V+RT K LQK Sbjct: 378 TTLAYCYSKLDASTCERKLQQVKRKLCMLFEKHSGNSTTAGVQRTIKENQDQSSSMPLQK 437 Query: 438 DVSHGTHRLPDFMEEFDTFSAENXXXXXXXSELDLYLDEQRSDWR--KELDVLDFWRDNR 265 + +H L D ++ + S+LD+YLDE D+R E+DVL +W+ N Sbjct: 438 KLKSLSHGLFDELK----VHHQQLVTKTGKSQLDVYLDESVLDFRCYAEMDVLQWWKSNN 493 Query: 264 YRFPCLSLMARDILSIPISTVSSEVAFNIGRRVVNRSRSVLKPEIVESTFCSRNWIF 94 RFP LS++A D+LS+PI+ V+S+ F +G RV N+ + + P VE+ C+R+W++ Sbjct: 494 DRFPDLSILACDLLSVPIAAVASDSEFCMGSRVFNKYKDRMLPMNVEARICTRSWLY 550