BLASTX nr result
ID: Ephedra25_contig00012072
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00012072
         (2212 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus pe...  716   0.0
gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [...  707   0.0
gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, p...  584   e-164
gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [...  569   e-159
gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus pe...  503   e-139
gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [...  485   e-134
gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana]            442   e-121
gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thali...  424   e-116
ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [A...  424   e-115
ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [A...  422   e-115
ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, part...  422   e-115
gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thal...  421   e-115
emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera]  421   e-115
gb|EOY04304.1| BED zinc finger,hAT family dimerization domain is...  420   e-114
gb|EOY04303.1| BED zinc finger,hAT family dimerization domain is...  420   e-114
gb|EOY04302.1| BED zinc finger,hAT family dimerization domain is...  420   e-114
gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana]           411   e-112
ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [... 
410 e-112 gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] 410 e-112 gb|ACX85638.1| putative transposase [Cucumis melo] 391 e-106 >gb|EMJ14584.1| hypothetical protein PRUPE_ppa026473mg [Prunus persica] Length = 696 Score = 716 bits (1848), Expect = 0.0 Identities = 351/652 (53%), Positives = 476/652 (73%), Gaps = 11/652 (1%) Frame = +1 Query: 223 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 390 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 28 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTGN 87 Query: 391 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 570 L RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 88 LKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVEY 147 Query: 571 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 750 GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ LT DLWTSI+T Sbjct: 148 AGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSITT 207 Query: 751 DEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 930 D YLCLT HF+D NWKLQK++LNF F+ PPHT +AL EKIY LL +WG+E+KLFS+TLD+ Sbjct: 208 DGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLDN 267 Query: 931 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1110 AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 268 ASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 327 Query: 1111 YLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 1290 Y++ S R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SALYY+RAF+H + D Sbjct: 328 YVRGSQGRKQKFLNCDARVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDS 387 Query: 1291 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 1470 +YKH S +EW ++EKL KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 388 NYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 447 Query: 1471 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVS 1650 D F+ MA +M +KF+ YW + +ILAIAV+LDPRYK++FVE+ Y+RLYG S ++ +V Sbjct: 448 
DSFMKSMATQMMEKFDKYWKEYSLILAIAVILDPRYKIQFVEFCYKRLYGYNSEEMTKVR 507 Query: 1651 EKLYSLFETYKQ--NSTNSVERTSKNLQKDVSHGTHRLP----DFMEEFDTFSAENXXXX 1812 + L+SLF+ Y + +S+ SV TS SH + D M+EFD F +E Sbjct: 508 DMLFSLFDLYFRIYSSSESVSGTSSASNGARSHVDDMVSKECLDVMKEFDNFESEEFTTS 567 Query: 1813 XXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNI 1992 +L LYLDE + D + +L+VLDFW+ N++R+P LS++ARD+LSIPISTV+SE+AF++ Sbjct: 568 AQKTQLQLYLDEPKIDRKTKLNVLDFWKVNQFRYPELSILARDLLSIPISTVASESAFSV 627 Query: 1993 GRRVVNKSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 2145 G RV+++ RS LKPE VE+ C+R+WIFG++ + P++ E+ ED++K+ IN Sbjct: 628 GGRVLDQYRSALKPENVEALVCTRDWIFGEENCTLAPNLEELTEDISKMEIN 679 >gb|EMJ22510.1| hypothetical protein PRUPE_ppa025777mg, partial [Prunus persica] Length = 697 Score = 707 bits (1825), Expect = 0.0 Identities = 347/652 (53%), Positives = 471/652 (72%), Gaps = 11/652 (1%) Frame = +1 Query: 223 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 390 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 29 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTRN 88 Query: 391 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 570 L RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 89 LKRHIESCVKTDTRDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIITHDLPFQFVEY 148 Query: 571 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 750 GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ L DLWTSI+T Sbjct: 149 SGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILDSVPGRVCLASDLWTSITT 208 Query: 751 DEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 930 D YLCLT HF+D NWKLQK++LNF F+ PPHT + L EKIY LL +WG+E+KLFS+TLD+ Sbjct: 209 DGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVTLCEKIYKLLTDWGVEKKLFSMTLDN 268 Query: 931 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1110 AS+ND FV L+ + NLK+ LL NG+FF++ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 269 ASSNDTFVELLKGQPNLKDALLMNGKFFYIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 328 Query: 1111 YLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 1290 Y++ S 
R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SALYY+RAF+H + D Sbjct: 329 YVRGSQGRKQKFLNCAAQVSLECKRGLRQDVPTRWNSTFLMIDSALYYQRAFLHLQLSDS 388 Query: 1291 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 1470 +YKH S +EW ++EKL KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 389 NYKHSLSQDEWGKLEKLSKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 448 Query: 1471 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVS 1650 D F+ MA +M + F+ YW + +I AIAV+LDPRYK++FVE+ Y+RLYG S ++ +V Sbjct: 449 DSFMKSMATQMMEMFDKYWKEYSLIPAIAVILDPRYKIQFVEFCYKRLYGYNSEEMTKVR 508 Query: 1651 EKLYSLFETYKQ--NSTNSVERTSKNLQKDVSHGTHRLP----DFMEEFDTFSAENXXXX 1812 + L+SLF+ Y Q +S+ SV TS SH + D M+EFD F +E Sbjct: 509 DMLFSLFDLYFQIYSSSESVSGTSSASNGARSHVDDMVSKECLDVMKEFDNFESEEFTTS 568 Query: 1813 XXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNI 1992 +L LYLDE + D + +L+VLDFW+ N++R+P LS++ARD+LSIPISTV+SE+AF++ Sbjct: 569 AQKTQLQLYLDEPKIDRKTKLNVLDFWKVNQFRYPELSILARDLLSIPISTVASESAFSV 628 Query: 1993 GRRVVNKSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 2145 G RV+++ RS LKPE VE+ C+R+WIFGK+ + P++ E+ ED++K+ IN Sbjct: 629 GGRVLDQYRSALKPENVEALVCTRDWIFGKENCTLAPNLEELTEDISKMEIN 680 >gb|EOY25504.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778249|gb|EOY25505.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778250|gb|EOY25506.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] gi|508778251|gb|EOY25507.1| BED zinc finger,hAT family dimerization domain, putative isoform 1 [Theobroma cacao] Length = 678 Score = 584 bits (1506), Expect = e-164 Identities = 289/625 (46%), Positives = 412/625 (65%), Gaps = 3/625 (0%) Frame = +1 Query: 259 KRRRTSNVWN---HFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 429 KR+ +S V HF S + D K A+C+ CG + +CDS++ NL R+ C+ Sbjct: 35 KRKLSSQVSTFSEHFPKKS-SIDGKAIAKCKHCGIVLNCDSKHEIDNLKRYSENCVGGDT 93 Query: 430 PDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEG 609 ++GQM S 
+ + FRE ++ AI H+LP FVEY G R Y++E Sbjct: 94 REIGQMISSNQHGSTLTRSSNLDPEKFRELVIGAIFMHNLPLSFVEYRGSRALSSYLHED 153 Query: 610 AKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQ 789 ++R+T++ ++K++ E++K++ LL PGRI+LTFDLW SI+TD Y+CL AHF+D+ Sbjct: 154 VTLISRNTLKAYMIKMHRAERSKIKCLLEETPGRINLTFDLWNSITTDTYICLIAHFVDK 213 Query: 790 NWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRN 969 NW LQK+VLNF F+ PP+ +AL EK+YALL EWGIE KLFSVTLD+ ++ FV L+ Sbjct: 214 NWVLQKRVLNFSFMPPPYNCVALIEKVYALLAEWGIESKLFSVTLDNVLASNAFVELLKK 273 Query: 970 ELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFL 1149 LN++ L G+FFH+ C A +LNLIVQD LKE+D V +RESVKY+K S R++KFL Sbjct: 274 NLNVRKTFLVGGKFFHLRCFAQVLNLIVQDSLKEVDCVVQKVRESVKYVKGSQVRKQKFL 333 Query: 1150 ECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWER 1329 ECV L+ L K LRQD+ T+WNSTFLML+ ALY+R+AF H + D +Y++CPS +EWER Sbjct: 334 ECVTLMKLNAKGGLRQDVSTKWNSTFLMLKRALYFRKAFSHLEIRDSNYRYCPSEDEWER 393 Query: 1330 VEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNK 1509 VEKL+K L VFY VT +FS T+YPT+NL+FP + + TL + M D ++ M+ +M Sbjct: 394 VEKLYKLLAVFYDVTCVFSRTKYPTANLFFPSMFIAHSTLQEHMSGQDVYMKNMSTQMLV 453 Query: 1510 KFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVSEKLYSLFETYKQN 1689 KF YWS +ILAIAV+LDPRYK+ FVEWSY +LYG+ S+Q V + L+SL+ Y Sbjct: 454 KFVKYWSDFSLILAIAVILDPRYKIHFVEWSYGKLYGNDSTQFKNVRDWLFSLYNEYAVK 513 Query: 1690 STNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSDWRK 1869 + S +S N D T DF EEFD+++ +L+ YL E + K Sbjct: 514 A--SPTPSSFNNTSDEHTLTEGKRDFFEEFDSYATVKFGAATQKSQLEWYLSEPMVERTK 571 Query: 1870 ELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVES 2049 EL++L FW++N+YR+P L+ MARD+LSIPIS +SE AF++G +++++ RS LKP+I+E+ Sbjct: 572 ELNILQFWKENQYRYPELAAMARDVLSIPISATASEFAFSVGGKILDQHRSSLKPDILEA 631 Query: 2050 TFCSRNWIFGKQYDMEPDVNEMCED 2124 T C ++W+FG+ + D+N + ED Sbjct: 632 TVCCKDWLFGEVEHEDMDLNVVIED 656 >gb|EMJ02729.1| hypothetical protein PRUPE_ppa016152mg, partial [Prunus persica] Length = 613 Score = 569 bits (1467), Expect = 
e-159 Identities = 298/647 (46%), Positives = 408/647 (63%), Gaps = 6/647 (0%) Frame = +1 Query: 223 NPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSN 390 +P++++ +T KRRR TS VW FE+L + +N+ RA+C +CG Y CDSR GT N Sbjct: 29 DPSNNNNAVVTQIGKRRRKLTSAVWTQFEILPIDENNEQRAKCMKCGQKYLCDSRYGTGN 88 Query: 391 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 570 L RH+ C++ DLGQ+ LS+ + + KF FRE ++MAI+ HDLPFQFVEY Sbjct: 89 LKRHIESCVKTDTRDLGQLLLSKYDGAILTRSSKFDPMKFRELLLMAIIMHDLPFQFVEY 148 Query: 571 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 750 GIR Y+ K V+R+ + DVL +Y+REKAK++ +L S+PGR+ LTFDLWTSI+T Sbjct: 149 AGIRQLFNYVCADIKLVSRNIAKADVLSLYNREKAKLKEILGSVPGRVCLTFDLWTSITT 208 Query: 751 DEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 930 D YLCLT HF+D NWK +K +LNF F+ PPHT +AL EKIY LL +WG+++KLFS+TLD+ Sbjct: 209 DGYLCLTVHFIDVNWKWEKIILNFSFMPPPHTGVALCEKIYRLLTDWGVKKKLFSMTLDN 268 Query: 931 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1110 AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+K Sbjct: 269 ASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESIK 328 Query: 1111 YLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 1290 Y + S R++KFL C VSLE KK Sbjct: 329 YARGSQGRKQKFLNCAAQVSLECKKG---------------------------------- 354 Query: 1291 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 1470 C KFL VFY VT +FSGT+YPT+NLYFPQV VV+DTL KA + Sbjct: 355 ---DCVK----------IKFLKVFYDVTCLFSGTKYPTANLYFPQVFVVEDTLRKAKVDS 401 Query: 1471 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGS-SQINEV 1647 D F+ MA +M KKF+ W + +ILAIAV+L+PRYK++FVE+ Y+R +G+ S ++++ Sbjct: 402 DSFMKSMATQMMKKFDKNWKEYSLILAIAVILNPRYKIQFVEFCYKRFASNGARSYVDDM 461 Query: 1648 SEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXE 1827 K D M+EFD F +E + Sbjct: 462 VSK--------------------------------ECLDVMKEFDNFESEEFTTSAQKTQ 489 Query: 1828 LDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVV 2007 L LYLDE + D + +L+VLDFW+ N++R+P 
LS++ARD+LSIPISTV+SE+ F++ RV+ Sbjct: 490 LQLYLDEAKIDRKTKLNVLDFWKVNQFRYPGLSILARDLLSIPISTVASESTFSVDGRVL 549 Query: 2008 NKSRSVLKPEIVESTFCSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 2145 ++ RS LKPE VE+ C+ +WIFG++ + P++ E+ ED++K+ IN Sbjct: 550 DQYRSALKPENVEALVCTLDWIFGEENCTLAPNLEELTEDISKMEIN 596 >gb|EMJ28015.1| hypothetical protein PRUPE_ppa017701mg [Prunus persica] Length = 567 Score = 503 bits (1295), Expect = e-139 Identities = 243/424 (57%), Positives = 315/424 (74%), Gaps = 4/424 (0%) Frame = +1 Query: 220 MNPASSSGCSITS--KRRR--TSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTS 387 ++P++++ +T KRRR TS VW HFE+L + +N+ RA+C +CG Y DSR GT Sbjct: 27 LDPSNNNNAVVTQIGKRRRKLTSAVWTHFEILHIDENNEQRAKCMKCGQKYLFDSRYGTG 86 Query: 388 NLNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVE 567 NL RH+ C++ DLGQ+ LS+S + + KF FRE ++MAI+ HDLPFQFVE Sbjct: 87 NLKRHIESCVKIDTCDLGQLLLSKSDGAILTRSSKFDPMKFRELLVMAIIMHDLPFQFVE 146 Query: 568 YEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIS 747 Y GIR Y+ K V+R+T + DVL +Y+REKAK++ +L S+PGR+ LT DLWTSI+ Sbjct: 147 YSGIRQLFNYVCADIKLVSRNTAKADVLSLYNREKAKLKEILGSVPGRVCLTSDLWTSIT 206 Query: 748 TDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLD 927 TD YLCLT HF+D NWKLQK++LNF F+ PPHT +AL EKIY LL +WG+E+KLFS+TLD Sbjct: 207 TDGYLCLTVHFIDVNWKLQKRILNFSFMPPPHTGVALCEKIYRLLTDWGVEKKLFSMTLD 266 Query: 928 DASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESV 1107 +AS+ND FV L+ +LNLK+ LL NG+FFH+ CCAHILNLIVQDGLK ID SV IRES+ Sbjct: 267 NASSNDTFVELLKGQLNLKDALLMNGKFFHIRCCAHILNLIVQDGLKHIDDSVGKIRESI 326 Query: 1108 KYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVID 1287 KY++ S R++KFL C VSLE K+ LRQD+PTRWNSTFLM++SAL+Y+RAF+H + D Sbjct: 327 KYVRGSQGRKQKFLNCAAQVSLECKRGLRQDVPTRWNSTFLMIDSALHYQRAFLHLQLSD 386 Query: 1288 PDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKE 1467 +YKH EW +++KL KFL VFY VT +F GT+YP +NLYFPQV VV+DTL KA KE Sbjct: 387 SNYKHSLPQNEWGKLKKLSKFLKVFYDVTCLFFGTKYPIANLYFPQVFVVEDTLRKA-KE 445 Query: 1468 GDGF 1479 D F Sbjct: 446 FDNF 449 Score = 
108 bits (271), Expect = 7e-21 Identities = 50/104 (48%), Positives = 77/104 (74%) Frame = +1 Query: 1771 EEFDTFSAENXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILS 1950 +EFD F +E +L LYL+E + D + +L+VL+FW+ N++R+P LS++ARD+LS Sbjct: 444 KEFDNFESEEFTTSAQKTQLQLYLNEPKIDRKTKLNVLNFWKVNQFRYPELSILARDLLS 503 Query: 1951 IPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWIFGK 2082 IPISTV+ E+AF++G RV+++ S LKPE VE+ C+ +WIFG+ Sbjct: 504 IPISTVAYESAFSVGGRVLDQYHSALKPENVEALVCTHDWIFGE 547 >gb|EMJ01864.1| hypothetical protein PRUPE_ppa015215mg, partial [Prunus persica] Length = 478 Score = 485 bits (1248), Expect = e-134 Identities = 267/571 (46%), Positives = 351/571 (61%), Gaps = 3/571 (0%) Frame = +1 Query: 442 QMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFV 621 Q+ LS+S + + KF FRE ++MAI+ HDLPFQFVEY GIR Sbjct: 1 QLLLSKSDGAILTRSSKFDPIKFRELLVMAIIMHDLPFQFVEYAGIRQT----------- 49 Query: 622 TRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQNWKL 801 TSI+TD YLCLT +F+D NWKL Sbjct: 50 --------------------------------------TSITTDGYLCLTVYFIDVNWKL 71 Query: 802 QKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNL 981 QK++LNF F+ P HT +AL EKIY LL WG+E+KLFS+TLD+AS+ND FV L+ +LNL Sbjct: 72 QKRILNFSFMPPLHTGVALCEKIYRLLTNWGVEKKLFSLTLDNASSNDTFVELLKGQLNL 131 Query: 982 KNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLECVR 1161 K+ LL NG+FFHV CCAHILNLIVQDGLK ID V IRES+KY++ S ++KFL+C Sbjct: 132 KDALLMNGKFFHVRCCAHILNLIVQDGLKHIDDYVGKIRESIKYVRGSQGTKQKFLDCAA 191 Query: 1162 LVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKL 1341 VSLE K+ LRQD+PTRWNSTFLM+ SALYY+RAF+H + D +YKH S +EW ++EKL Sbjct: 192 QVSLECKRGLRQDVPTRWNSTFLMINSALYYQRAFLHLQLSDSNYKHSLSQDEWGKLEKL 251 Query: 1342 FKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFEN 1521 KFL VFY VT +F GT+YPT+NLYFPQV VV+DTL KA Sbjct: 252 SKFLKVFYDVTCLFFGTKYPTANLYFPQVFVVEDTLKKA--------------------K 291 Query: 1522 YWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINEVSEKLYSLFETYKQ--NST 1695 YW + +ILAIAV+LDPRYK++FV++ Y+RLYG S ++ +V + L+SLF+ Y 
+ S+ Sbjct: 292 YWKEYSLILAIAVILDPRYKIQFVKFCYKRLYGYNSKEMTKVRDMLFSLFDLYVRIYTSS 351 Query: 1696 NSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSDWRKEL 1875 SV TS VS G D M EFD F Sbjct: 352 ESVSGTS-----SVSIGARSHVDDM-EFDNFEM--------------------------- 378 Query: 1876 DVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTF 2055 N++R+P LS++ RD+LSIPISTV+SE+AF++G R++++ RS LKP+ VE Sbjct: 379 --------NQFRYPELSILVRDLLSIPISTVASESAFSVGGRMLDQYRSALKPKNVEVLV 430 Query: 2056 CSRNWIFGKQ-YDMEPDVNEMCEDVTKLNIN 2145 C+R+WIFGK+ Y + P++ E+ ED++K+ IN Sbjct: 431 CTRDWIFGKENYTLAPNLEELTEDISKMEIN 461 >gb|AAF79835.1|AC026875_15 T6D22.19 [Arabidopsis thaliana] Length = 745 Score = 442 bits (1137), Expect = e-121 Identities = 261/676 (38%), Positives = 388/676 (57%), Gaps = 7/676 (1%) Frame = +1 Query: 118 MEMDF--LDNETKAESDIQSQDGQETRVTQELT-PTSMNPASSSGCSITSKRRRTSNVWN 288 ME+D L +E + Q D ++ + Q L T+ + G S S+ R W Sbjct: 91 MELDTQNLVDEDNFNLEDQEMDDEDPEMDQILPHDTASSGTVERGKSSVSRFRAAC--WK 148 Query: 289 HFEMLSLTADNKPRARCRQCGAIYSCD-SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSG 465 +F+ + K C+ C Y + RNGT+ +NRH+R C +K P G Sbjct: 149 NFDRGQKYPNGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC--EKTP----------G 196 Query: 466 SLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEED 645 S + K + VFRE + +A+++H+LP+ FVEYE IR+A Y+N +F +R+T D Sbjct: 197 STPRISR-KVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYVNPSIEFWSRNTAASD 255 Query: 646 VLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQNWKLQKKVLNFH 825 V KIY REK K++ L IPGRI LT DLW +++ + Y+CLTAH++D + L+ K+L+F Sbjct: 256 VYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFC 315 Query: 826 FISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNG 1005 PPH+ +A++ K+ LL +WGIE+K+F++T+D+AS ND L+ +L + L+C+G Sbjct: 316 AFPPPHSGVAIAMKLSELLKDWGIEKKVFTLTVDNASANDTMQSILKRKL--QKHLVCSG 373 Query: 1006 EFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLECVRLVSLETKK 1185 EFFHV C AHILNLIVQDGL+ I ++ IRE+VKY+K S R F C+ + ++T+ Sbjct: 374 EFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEA 433 Query: 
1186 ALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFY 1365 +L D+ TRWNST+ ML A+ ++ A +D YK PS EWER E + L F Sbjct: 434 SLVLDVSTRWNSTYHMLSRAIQFKDVLHSLAEVDRGYKSFPSAVEWERAELICDLLKPFA 493 Query: 1366 KVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVI 1545 ++T + SG+ YPT+N+YF QV ++ L D + M +M +K++ YW I Sbjct: 494 EITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSHDRAIREMVEDMTEKYDKYWEDFSDI 553 Query: 1546 LAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEVSEKLYSLFETYKQNSTNSVERTSKN 1722 LA+A VLDPR K +E+ Y L S + + V +K+ LF YK+ + N TS++ Sbjct: 554 LAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAYKRTTCNVAASTSQS 613 Query: 1723 LQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSDW--RKELDVLDFWR 1896 +KD+ G + + FS N LD+YL+E D +++DV+ +W+ Sbjct: 614 SRKDIPFG------YDGFYSYFSQRN---GTGKSPLDMYLEEPVLDMVSFRDMDVIAYWK 664 Query: 1897 DNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWIF 2076 +N RF LS MA DILSI I+TV+SE+ F+IG RV+NK RS L P V++ C+RNW Sbjct: 665 NNVSRFKELSSMACDILSISITTVASESTFSIGSRVLNKYRSCLLPTNVQALLCTRNWFR 724 Query: 2077 GKQYDMEPDVNEMCED 2124 G Q D+E D + ED Sbjct: 725 GFQ-DVETDEIQGQED 739 >gb|AAD24567.1|AF120335_1 putative transposase [Arabidopsis thaliana] Length = 577 Score = 424 bits (1091), Expect = e-116 Identities = 244/609 (40%), Positives = 359/609 (58%), Gaps = 3/609 (0%) Frame = +1 Query: 391 LNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEY 570 +NRH+R C +K P GS + K + VFRE + +A+++H+LP+ FVEY Sbjct: 1 MNRHMRSC--EKTP----------GSTPRISR-KVDMMVFREMIAVALVQHNLPYSFVEY 47 Query: 571 EGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST 750 E IR+A Y N +F +R+T DV KIY REK K++ L IPGRI LT DLW +++ Sbjct: 48 ERIREAFTYANPSIEFWSRNTAAFDVYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTV 107 Query: 751 DEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDD 930 + Y+CLTAH++D + L+ K+L+F PPH+ +A++ K+ LL +WGIE+K+F++T+D+ Sbjct: 108 ESYICLTAHYVDVDGVLKTKILSFCAFPPPHSGVAIAMKLSELLKDWGIEKKVFTLTVDN 167 Query: 931 ASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVK 1110 AS ND L+ +L + L+C+GEFFHV C 
AHILNLIVQDGL+ I ++ IRE+VK Sbjct: 168 ASANDTMQSILKRKL--QKDLVCSGEFFHVRCSAHILNLIVQDGLEVISGALEKIRETVK 225 Query: 1111 YLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDP 1290 Y+K S R F C+ + ++T+ L D+ TRWNST+ ML A+ ++ A +D Sbjct: 226 YVKGSETRENLFQNCMDTIGIQTEANLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVDR 285 Query: 1291 DYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEG 1470 YK PS EWER E + L F ++T + SG+ YPT+N+YF QV ++ L Sbjct: 286 GYKSFPSAVEWERAELICDLLKPFAEITKLISGSSYPTANVYFMQVWAIKCWLGDHDDSH 345 Query: 1471 DGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEV 1647 D + M +M +K++ YW ILA+A VLDPR K +E+ Y L S + + V Sbjct: 346 DRVIREMVEDMTEKYDKYWEDFSDILAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHV 405 Query: 1648 SEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXE 1827 +K+ LF YK+ + N TS++ +KD+ G + + FS N Sbjct: 406 RDKMVQLFGAYKRTTCNVAASTSQSSRKDIPFG------YDGFYSYFSQRN---GTGKSP 456 Query: 1828 LDLYLDEQRSDW--RKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRR 2001 LD+YL+E D +++DV+ +W++N RF LS MA DILSIPI+TV+SE+AF+IG R Sbjct: 457 LDMYLEEPVLDMVSFRDMDVIAYWKNNVSRFKELSSMACDILSIPITTVASESAFSIGSR 516 Query: 2002 VVNKSRSVLKPEIVESTFCSRNWIFGKQYDMEPDVNEMCEDVTKLNINDSLISVGES*WE 2181 V+NK RS L P V++ C+RNW G Q +V ++I +S I V + + Sbjct: 517 VLNKYRSCLLPTNVQALLCTRNWFRGFQ------------EVGNIHILNSFIIV--NLYG 562 Query: 2182 SWTLQFRLL 2208 WT FRL+ Sbjct: 563 FWTAAFRLI 571 >ref|XP_006857388.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] gi|548861481|gb|ERN18855.1| hypothetical protein AMTR_s00067p00136180 [Amborella trichopoda] Length = 685 Score = 424 bits (1089), Expect = e-115 Identities = 231/622 (37%), Positives = 373/622 (59%), Gaps = 11/622 (1%) Frame = +1 Query: 250 ITSKRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 429 + SKR+ S+VW+ FE + + D +A C+ C S +GTS+L RH+ C ++ + Sbjct: 59 LPSKRKTISSVWDEFEKVR-SEDGSVKAACKHCHRNLVGSSAHGTSHLKRHLGRCAKRVH 117 Query: 430 PDLGQMFLS---QSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYI 600 GQ + + G S+ KF R + IL H+ P VE+ R + + Sbjct: 
118 IGSGQQLVVTCIKKGEASSVNF-KFDQGRSRYDLAKMILLHEYPSSMVEHTTFRTFVRNL 176 Query: 601 NEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHF 780 V+ T+E D+++IY +EK K+ L IP RISL+ ++W+S EYLCL AH+ Sbjct: 177 QPLFSMVSPSTIESDIIEIYKKEKKKLYEELEKIPSRISLSANIWSSCQNLEYLCLIAHY 236 Query: 781 LDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGE 960 +D W LQK++L+F + P T A++E + LL++W +++KLFS+TL+ AS ND+ Sbjct: 237 IDDAWVLQKQILSFVNL-PSRTGGAIAEVLLDLLSQWNVDKKLFSITLNSASYNDVAASS 295 Query: 961 LRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRR 1140 LR+ L+ + L G+ FH+CCC+H++NL+VQDGL+ I + IRES+KY+K+S+ R+ Sbjct: 296 LRSRLSRNSSLPLEGKIFHLCCCSHVVNLMVQDGLEVIQEVLQKIRESIKYVKTSHVRQE 355 Query: 1141 KFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEE 1320 +F E + + +++K+ + D+PTRWNST+ ML+ L R AF FA D PS +E Sbjct: 356 RFNEIINQLGIQSKQNIFLDVPTRWNSTYHMLDVTLELREAFSCFAQCDSMCNMVPSEDE 415 Query: 1321 WERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVE 1500 WERV+++ L +FY +T+ F G++YPT+NLYFP+V + L + + + MA++ Sbjct: 416 WERVKEICDCLKLFYDITNTFLGSKYPTANLYFPEVYQMHLRLVEWSMSLNKHISSMAIK 475 Query: 1501 MNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS-QINEVSEKLYSLFET 1677 M +KF+ YW S ++LAIAVV+DPR+KL+FVE+SY ++YG+ + I V + +Y L Sbjct: 476 MKEKFDKYWKISNLVLAIAVVIDPRFKLKFVEYSYSQIYGNDAEHHIRMVRQGVYDLCNE 535 Query: 1678 YK------QNSTNSVERTSKNLQKDV-SHGTHRLPDFMEEFDTFSAENXXXXXXXXELDL 1836 Y+ NS +S+ ++ V +HG + EF+ F E+ ELD Sbjct: 536 YESKEPLASNSESSLAVSASTSSGGVDTHGKL----WAMEFEKFVRESSSNQARKSELDR 591 Query: 1837 YLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKS 2016 YL+E + ++ ++W+ N RFP LS MARDIL IP+STV+S++ F+IG +V+++ Sbjct: 592 YLEEPIFPRNLDFNIRNWWQLNAPRFPTLSKMARDILGIPVSTVTSDSTFDIGGQVLDQY 651 Query: 2017 RSVLKPEIVESTFCSRNWIFGK 2082 RS L PE +++ C+++W++ + Sbjct: 652 RSSLLPETIQALMCAQDWLWNE 673 >ref|XP_006851229.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] gi|548854912|gb|ERN12810.1| hypothetical protein AMTR_s00180p00017340 [Amborella trichopoda] Length = 841 Score = 422 bits 
(1085), Expect = e-115 Identities = 240/657 (36%), Positives = 381/657 (57%), Gaps = 10/657 (1%) Frame = +1 Query: 247 SITSKRRRTSN-VWNHFEMLSLTADNKPRARCRQCGAIYSCDS---RNGTSNLNRHVRIC 414 S +SKRR+T++ VW HF M + + +ARC+ C ++ + + GTS+L RH+ IC Sbjct: 15 SSSSKRRKTTSIVWEHFTMETFIGGCR-KARCKYCLHTFAFGNGAKQLGTSHLKRHLGIC 73 Query: 415 IRKKNPDLGQMFLS-----QSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGI 579 + +N D Q L+ ++ S+ PKF RE + I+ H+ P VE+ Sbjct: 74 PKNRNSDRKQELLTLTPKDKNEGNTSLSNPKFDQSRSREDLARMIILHEYPLSVVEHPAF 133 Query: 580 RDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEY 759 + + + K V + TV +D L IY +EK + LL +IPGRISL+ D WT+ T EY Sbjct: 134 INFVQSLQPRFKMVNQATVRDDCLAIYQKEKQSLMQLLQTIPGRISLSLDKWTTEETLEY 193 Query: 760 LCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDAST 939 + +T HF+D ++KLQK+VLNF + P T LS+ I L +W I KL +VTLD T Sbjct: 194 MRITGHFVDCDFKLQKRVLNFTMLPYPFTRNDLSDVILTCLTDWNILTKLSTVTLDRHHT 253 Query: 940 NDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLK 1119 +D L++ L+ KN LL +G F+VCCCA +LNLIVQDGL+ I+ + IRESVKY+K Sbjct: 254 DDCIGSNLKDCLSSKNMLLLSGRVFNVCCCADVLNLIVQDGLEAINDVIHKIRESVKYVK 313 Query: 1120 SSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYK 1299 +S + F + + + + +KK L D+ WN+TFLMLE+AL +++AF D +Y+ Sbjct: 314 ASQAHEQNFSKLFQQLEIPSKKDLCLDVQGEWNTTFLMLEAALEFKQAFSCLGSHDSNYE 373 Query: 1300 HCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGF 1479 PS +EW++VE L +L VFY V FS +PT+NLYF ++ + L + D Sbjct: 374 GAPSEDEWKKVEVLCIYLKVFYDVLRAFSEVTHPTANLYFHELWKIHMHLNHTVTSPDIV 433 Query: 1480 VHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSSQINE-VSEK 1656 + + + KF+ YW + ++LAIAV +DPR+K++FVE+S+ ++YG+ + V E Sbjct: 434 IIPVIRNLQDKFDKYWREYSLVLAIAVSMDPRFKMKFVEFSFSKVYGTNAFMYTRVVIEA 493 Query: 1657 LYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDL 1836 + L+ Y +N V + N + S+ + ++ D +++FD F +E ELD Sbjct: 494 IRDLYSQYARNIPGPVPLATYNGDQSSSNNSFQINDGLQDFDQFLSELSGSQQTKSELDQ 553 Query: 1837 YLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKS 2016 
YL+E +E D+L +W+ + ++P LS MARDIL+I ++TV SE+ FN G +V+++ Sbjct: 554 YLEEPLFPRNQEFDILRWWKMSAPKYPVLSEMARDILAIRVTTVDSESMFNTGGKVLDQY 613 Query: 2017 RSVLKPEIVESTFCSRNWIFGKQYDMEPDVNEMCEDVTKLNINDSLISVGES*WESW 2187 +S L PE +E+ C+R+W+ +++E ++ T LN++DS +S +W Sbjct: 614 QSSLSPETIEALICARDWL---HHELETSLD------TVLNMSDSTLSTAPLKLGAW 661 >ref|XP_006292237.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] gi|482560944|gb|EOA25135.1| hypothetical protein CARUB_v10018444mg, partial [Capsella rubella] Length = 547 Score = 422 bits (1085), Expect = e-115 Identities = 217/523 (41%), Positives = 335/523 (64%), Gaps = 3/523 (0%) Frame = +1 Query: 490 KFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHRE 669 K V RE + + I+ HDLPF FVEY +R+ L Y+N K ++R+T DVLK + Sbjct: 7 KIDHSVVRELITLVIICHDLPFSFVEYPRVRELLKYLNPEYKTISRNTAVADVLKFHGIR 66 Query: 670 KAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTA 849 K +++ L + RI LT D+W SIS + Y+CLTAH++D +WKL+ K+L+F + PPH+ Sbjct: 67 KEQMKQELAGVGNRICLTCDVWRSISIEGYICLTAHYVDDSWKLKSKILSFCAMPPPHSG 126 Query: 850 IALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCC 1029 L++K+ + L +WGIE+K+FS+TLD+AS+ND LR++L+ ++GLLC+GEFFH+ C Sbjct: 127 FELAKKVLSCLEDWGIEKKIFSLTLDNASSNDNMQSILRDQLSSRHGLLCDGEFFHIRCS 186 Query: 1030 AHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPT 1209 AH+LNLIVQ GLK ++S + IRE+VK++K S R+ F ECV V ++ L+ D+ T Sbjct: 187 AHVLNLIVQVGLKFVESPLHKIRETVKWIKWSEGRKDLFKECVIDVGIKYTAGLKMDVST 246 Query: 1210 RWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSG 1389 RWNST+LML S + YRRAF + +YK CPS EEW + EK++ FL FY +T +FSG Sbjct: 247 RWNSTYLMLGSVIKYRRAFSLLERAERNYKFCPSDEEWNKAEKIYTFLEPFYDITKLFSG 306 Query: 1390 TQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLD 1569 T YPT+NLYF Q+ ++ L +GD + MA EM KF+ YW + +IL+I +LD Sbjct: 307 TSYPTANLYFAQIWKIECLLNSYSNDGDMELQNMANEMRTKFDKYWEEYSIILSIGAILD 366 Query: 1570 PRYKLRFVEWSYERLYGSGS-SQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSHG 1746 PR K+ + + +++L S + +++ V +KL LF+ YK T++ +S S G 
Sbjct: 367 PRMKVEILTYCFDKLDPSTTKAKVEVVKQKLNLLFDQYKSTPTSTNVSSS-------SRG 419 Query: 1747 THRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSD--WRKELDVLDFWRDNRYRFPC 1920 T + +F + + +L +YL++ R + + +++DVL++W++ R+ Sbjct: 420 TDFIAKTHSDFKAYE-KRTILEEGKSKLAVYLEDDRLEMTFYEDMDVLEWWKNQTQRYGE 478 Query: 1921 LSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVES 2049 L+ MA D+LSIPI++V++E++F+IG V+NK RS L P VE+ Sbjct: 479 LARMACDVLSIPITSVAAESSFSIGAHVLNKYRSRLLPRHVEA 521 >gb|AAG50652.1|AC073433_4 transposase, putative [Arabidopsis thaliana] Length = 659 Score = 421 bits (1083), Expect = e-115 Identities = 236/648 (36%), Positives = 371/648 (57%), Gaps = 10/648 (1%) Frame = +1 Query: 259 KRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKNPDL 438 ++++ + W+ F + + D K RARC CG + GTS +NRH+ +C + P+ Sbjct: 29 RKKQRALCWDEFTSVGIEEDGKERARCHHCGIKLVVEKSYGTSTMNRHLTLCPERPQPET 88 Query: 439 GQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKF 618 PK+ KV RE I+ HD+PF++VEYE +R ++N K Sbjct: 89 R---------------PKYDHKVDREMTSEIIIYHDMPFRYVEYEKVRARDKFLNPDCKP 133 Query: 619 VTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDLWTSIST-DEYLCLTAHFLDQNW 795 + R T DV K + EKAK+ ++ G++ LT DLW+S ST Y+C+T+H++D++W Sbjct: 134 ICRQTAALDVFKRFEIEKAKLIDVFAKHNGQVCLTADLWSSRSTVTGYICVTSHYIDESW 193 Query: 796 KLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNEL 975 +L K+L F + PPH +++K+Y L EWG+E+K+ ++TLD+AS N L++ L Sbjct: 194 RLNNKILAFCDLKPPHNGEEIAKKVYDCLKEWGLEKKILTITLDNASANTSMQTILKHRL 253 Query: 976 NLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLEC 1155 NGLLC G F HV CCAHILNLIVQ GL+ + +I ESVK++K+S R+ F C Sbjct: 254 QSGNGLLCGGNFLHVRCCAHILNLIVQAGLELASGLLENITESVKFVKASESRKDSFATC 313 Query: 1156 VRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVE 1335 + V +++ L D+ TRWNST+ ML AL +R+AF + + Y P+ EE +R E Sbjct: 314 LECVGIKSGAGLSLDVSTRWNSTYEMLARALKFRKAFAILNLYERGYCSLPTEEECDRGE 373 Query: 1336 KLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKF 1515 K+ L F +T FSG +YPT+N+YF QV ++ L K D V MA +M KKF Sbjct: 374 
KICDLLKPFNTITTYFSGVKYPTANIYFIQVWKIELLLMKYANCDDVDVREMAKKMQKKF 433 Query: 1516 ENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYG-SGSSQINEVSEKLYSLFETYKQNS 1692 YW++ VILA+ LDPR KL+ + +Y ++ + +++ V L L+E YK S Sbjct: 434 AKYWNEYSVILAMGAALDPRLKLQILRSAYNKVDPVTAEGKVDIVRNNLILLYEEYKTKS 493 Query: 1693 TNSVERTSKNLQKDVSHGTHRLPDFMEE-FDTFSAENXXXXXXXXELDLYL-DEQRSDWR 1866 +S ++ ++ + + D ++ F+ S+ L++YL DE R + + Sbjct: 494 ASSSNSSTTLTPHELLNESPLEADVNDDLFELESSLISASKSTKSTLEIYLDDEPRLEMK 553 Query: 1867 --KELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEI 2040 ++++L FW++N++R+ L+ MA D+LSIPI+TV+SE+AF++G RV+N R+ L P+ Sbjct: 554 TFSDMEILSFWKENQHRYGDLASMASDLLSIPITTVASESAFSVGGRVLNPFRNRLLPQN 613 Query: 2041 VESTFCSRNWIFGKQYDMEPDVNEMC----EDVTKLNINDSLISVGES 2172 V++ C+RNW+ G D+E D+ E+ D TK+ S VG+S Sbjct: 614 VQALICTRNWLLG-YADLEGDIEELFAEEDNDATKMT---SSSGVGDS 657 >emb|CAN80126.1| hypothetical protein VITISV_013417 [Vitis vinifera] Length = 1266 Score = 421 bits (1081), Expect = e-115 Identities = 227/632 (35%), Positives = 363/632 (57%), Gaps = 12/632 (1%) Frame = +1 Query: 214 TSMNPASSS-----GCSITSKRRRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRN 378 T NPA+ S G KR+ TS VWN FE + + D + A C+ C + DS+N Sbjct: 92 TGSNPATGSTSTTDGSLTCKKRKLTSIVWNEFEKVII--DGQDYAICKHCKSKLKADSKN 149 Query: 379 GTSNLNRHVRICIRKKNPDLGQMFLS---QSGSLMSMKPPKFSLKVFREKMMMAILKHDL 549 GT +L+ H+ CI+++N D+ Q FL+ + + + F + REK+ AI+ H+ Sbjct: 150 GTKHLHVHLDRCIKRRNVDIKQQFLAIERKGYGKVQIGGFTFDQDISREKLARAIILHEY 209 Query: 550 PFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFD 729 P V++ G RD + K V+R+T+++D++KIY EK K+ + L + R+++T D Sbjct: 210 PLSIVDHAGFRDFASSLQPLFKMVSRNTIKDDIMKIYEFEKGKMSSYLEKLETRMAITTD 269 Query: 730 LWTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKL 909 +WTS Y+ +T H++D++W L ++ F ++ PPHT LS+ + L +W ++RKL Sbjct: 270 MWTSNQKKGYMAITVHYIDESWLLHHHIVRFVYVPPPHTKEVLSDVLLDFLLDWNMDRKL 329 Query: 910 FSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVV 1089 ++T+D+ S+ND + L +L+ LL NG+ FH+ C AH+LNLIV++GL I + Sbjct: 330 
STITVDNCSSNDGMIDILSEKLSSSGSLLLNGKIFHMRCAAHVLNLIVKEGLDVIRVEIE 389 Query: 1090 DIRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFM 1269 IRESV Y ++ R KF + R + L K L D TRWNST+LML A+ Y+ F Sbjct: 390 KIRESVAYWSATPSRVEKFEDAARQLRLPCNKKLCLDCKTRWNSTYLMLSIAITYKDVFP 449 Query: 1270 HFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTL 1449 + Y PS EEW ++ + L +FY +T +FSG YPT+N +F +V +++ L Sbjct: 450 RLKQREKLYTTVPSEEEWNLAREICERLKLFYNITKLFSGRNYPTANTFFIKVCEIKEAL 509 Query: 1450 TKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGS-G 1626 + + V MA M +KF+ YWS +++AIAVVLDPRYK++ +E+ + +YGS Sbjct: 510 YDWLICSNEVVSTMASSMLEKFDKYWSGCHIVMAIAVVLDPRYKMKILEFYFPIMYGSEA 569 Query: 1627 SSQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSH---GTHRLPDFMEEFDTFSAE 1797 SS+I ++ + Y L Y Q+ + ++TS + VS+ T+ D + +FD F Sbjct: 570 SSEIGKIRQLCYDLLSEY-QSKSKMGQQTSSHGASSVSNLFELTYDEQDPLSKFDLFVHS 628 Query: 1798 NXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVSSE 1977 ELD YL+E + DVL +W+ N ++P L ++ RDI +IP+STV+SE Sbjct: 629 TSEEGHAKSELDYYLEETVLPRISDFDVLSWWKTNGIKYPTLQMIVRDIYAIPVSTVASE 688 Query: 1978 AAFNIGRRVVNKSRSVLKPEIVESTFCSRNWI 2073 +AF+ G R+V+K RS L P +E+ C+++W+ Sbjct: 689 SAFSTGGRMVSKHRSRLHPNTLEALMCAQSWL 720 >gb|EOY04304.1| BED zinc finger,hAT family dimerization domain isoform 3, partial [Theobroma cacao] Length = 680 Score = 420 bits (1079), Expect = e-114 Identities = 235/635 (37%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = +1 Query: 253 TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 429 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 430 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 552 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 553 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 732 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 
FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 733 WTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLF 912 W S D Y C+TAHF+D W+LQK++L F I PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 913 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1092 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1093 IRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 1272 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 1273 FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 1452 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 1453 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 1632 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 1633 QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 1788 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 1789 SAENXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 1968 + +LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 1969 SSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWI 2073 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 >gb|EOY04303.1| BED zinc finger,hAT family dimerization domain isoform 2 [Theobroma cacao] Length = 689 Score = 420 bits (1079), Expect = e-114 Identities = 235/635 (37%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = +1 Query: 253 TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 429 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 
SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 430 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 552 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 553 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 732 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 733 WTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLF 912 W S D Y C+TAHF+D W+LQK++L F I PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 913 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1092 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1093 IRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 1272 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 1273 FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 1452 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 1453 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 1632 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 1633 QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 1788 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 1789 SAENXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 1968 + +LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 1969 SSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWI 2073 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 
>gb|EOY04302.1| BED zinc finger,hAT family dimerization domain isoform 1 [Theobroma cacao] Length = 692 Score = 420 bits (1079), Expect = e-114 Identities = 235/635 (37%), Positives = 359/635 (56%), Gaps = 28/635 (4%) Frame = +1 Query: 253 TSKR-RRTSNVWNHFEMLSLTADNKPRARCRQCGAIYSCDSRNGTSNLNRHVRICIRKKN 429 +SKR + TS VW+ FE L + +A C+ C IY+ + +GTS+L RH+ C+++ N Sbjct: 36 SSKRPKTTSKVWDVFEKLPAQQGDS-KAICKLCRRIYTAKTTSGTSHLRRHIEACLKRGN 94 Query: 430 PDLGQMFLS---------------QSGSLMSMKPPKFSLKV----FREKMMMAILKHDLP 552 DL Q G+L+ P S K+ R + M I+ P Sbjct: 95 HDLDQRSTEACFKPVNRDANRHTVSQGTLIDATTPLKSYKLDVDEIRRAIAMMIIVDAQP 154 Query: 553 FQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFDL 732 F+ VE G R L ++R ++ D++ IY RE+ +R LL + PGRI LT Sbjct: 155 FRVVEDTGFRHVLNVACPEFPLLSRKAIKRDIISIYVRERENIRELLGACPGRICLTSST 214 Query: 733 WTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKLF 912 W S D Y C+TAHF+D W+LQK++L F I PP+ +++++++I + +W IE K+F Sbjct: 215 WKSNCDDHYNCVTAHFIDHEWRLQKRILRFKLIPPPYDSLSIADEIGLCMVQWNIEHKVF 274 Query: 913 SVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVVD 1092 SVTL++ S++D L+ L+ K G FF++ C ILNLIVQ G I + Sbjct: 275 SVTLENLSSDDCVADILKTRLDAKKYHPFKGVFFNMSCSTRILNLIVQAGFNLIIDIIGK 334 Query: 1093 IRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAFMH 1272 +R +KY++ S R++ F + ++L+T+K L D P+RWNST+ M+E AL Y+ AF++ Sbjct: 335 LRLGIKYVQQSPHRKKNFYIIAKTLNLDTQKKLCLDSPSRWNSTYNMIEVALCYKNAFLY 394 Query: 1273 FAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVVQDTLT 1452 A D ++ H S +EWE+V +KFL V ++V +F + PTSNLYF + V L+ Sbjct: 395 LAEQDKNFIHKLSEDEWEKVSVSYKFLKVIFEVACIFFRNRQPTSNLYFKALWKVHRRLS 454 Query: 1453 KAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERLYGSGSS 1632 ++ + F+ M EM KF YWS+ +IL+ A +LDPRYK++FVE+ Y +LYGSG+ Sbjct: 455 DMVRGPENFMTRMVKEMQSKFNQYWSEYNLILSCAAILDPRYKIKFVEYCYTKLYGSGAQ 514 Query: 1633 QINEVS-EKLYSLFETYKQNS-------TNSVERTSKNLQKDVSHGTHRLPDFMEEFDTF 1788 Q S LY LF Y QNS T SV T + KD + G E+++TF Sbjct: 515 
QYVSASVNTLYGLFHDYMQNSACPSHTATLSVLTTKISNDKDDNDG-------FEDYETF 567 Query: 1789 SAENXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTV 1968 + +LDLYLDE D E+DVL++W R+P LS MARD+L+IP+ST+ Sbjct: 568 QSARFQTQVEKSQLDLYLDEPSHDLNSEIDVLEYWTLCSLRYPELSRMARDVLTIPVSTI 627 Query: 1969 SSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWI 2073 +S+ AF+IG +V++ RS LK +++++ C ++W+ Sbjct: 628 ASDNAFDIGPQVISTDRSSLKSKMIQALVCLQDWM 662 >gb|AAF19546.1|AC007190_14 F23N19.13 [Arabidopsis thaliana] Length = 633 Score = 411 bits (1057), Expect = e-112 Identities = 257/704 (36%), Positives = 382/704 (54%), Gaps = 7/704 (0%) Frame = +1 Query: 118 MEMDF--LDNETKAESDIQSQDGQETRVTQELT-PTSMNPASSSGCSITSKRRRTSNVWN 288 ME+D L +E + Q D ++ + Q L T+ + + G S S+ R W Sbjct: 1 MELDTQNLVDEDNFNLEDQEMDHEDPEMDQILPHETASSGTAERGNSSVSRFRAAC--WK 58 Query: 289 HFEMLSLTADNKPRARCRQCGAIYSCD-SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSG 465 +F+ + K C+ C Y + RNGT+ +NRH+R C +K P G Sbjct: 59 NFDRGQKYPNGKTEVTCKYCEQTYHLNLRRNGTNTMNRHMRSC--EKTP----------G 106 Query: 466 SLMSMKPPKFSLKVFREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEED 645 S + K + VFRE + +A+++H+LP+ FVEYE IR+A Y N +F +R+T D Sbjct: 107 STPRISR-KVDMMVFREMIAVALVQHNLPYSFVEYERIREAFTYANPSIEFWSRNTAASD 165 Query: 646 VLKIYHREKAKVRNLLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQNWKLQKKVLNFH 825 V KIY REK K++ L IPGRI LT DLW +++ + Y+CLTAH++D + L+ K+L+F Sbjct: 166 VYKIYEREKIKLKEKLAIIPGRICLTTDLWRALTVESYICLTAHYVDVDGVLKTKILSFS 225 Query: 826 FISPPHTAIALSEKIYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNG 1005 PPH+ +A++ K+ LL +WGIE+K+F++T+D+AS ND L+ + L+ L+C+G Sbjct: 226 AFPPPHSGVAIAMKLSELLKDWGIEKKIFTLTVDNASANDTMQSILKRK--LQKDLVCSG 283 Query: 1006 EFFHVCCCAHILNLIVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLECVRLVSLETKK 1185 EFFHV C AHILNLIVQDGL+ I ++ IRE+VKY+K S R F C+ + ++T+ Sbjct: 284 EFFHVRCSAHILNLIVQDGLEVISGALEKIRETVKYVKGSETRENLFQNCMDTIGIQTEA 343 Query: 1186 ALRQDIPTRWNSTFLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFY 1365 +L D+ TRWNST+ ML A+ ++ A +D YK PS EWER E + L F Sbjct: 344 SLVLDVSTRWNSTYHMLSRAIQFKDVLRSLAEVDRVYKSFPSAVEWERAELICDLLKPFA 
403 Query: 1366 KVTDMFSGTQYPTSNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVI 1545 ++T + S +M +K++ YW I Sbjct: 404 EITKLIS-------------------------------------DMTEKYDKYWEDFSDI 426 Query: 1546 LAIAVVLDPRYKLRFVEWSYERLYGSGSSQ-INEVSEKLYSLFETYKQNSTNSVERTSKN 1722 LA+A VLDPR K +E+ Y L S + + V +K+ LF YK+ + N TS++ Sbjct: 427 LAMAAVLDPRLKFSALEYCYNILNPLTSKENLTHVRDKMVQLFGAYKRTTCNVAASTSQS 486 Query: 1723 LQKDVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSDW--RKELDVLDFWR 1896 +KD+ G + + FS N LD+YL+E D K++DV+ +W+ Sbjct: 487 SRKDIPFG------YDGFYSYFSQRN---GTGKSPLDMYLEEPVLDMVSFKDMDVIAYWK 537 Query: 1897 DNRYRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWIF 2076 +N RF LS MA DILSIPI+TV+SE+AF+IG RV+NK RS L P V++ C+RNW Sbjct: 538 NNVSRFKELSSMACDILSIPITTVASESAFSIGSRVLNKYRSCLLPTNVQALLCTRNWFR 597 Query: 2077 GKQYDMEPDVNEMCEDVTKLNINDSLISVGES*WESWTLQFRLL 2208 G Q +V ++I +S I V + + WT FRL+ Sbjct: 598 GFQ------------EVGNIHILNSFIIV--NLYGFWTAAFRLI 627 >ref|XP_003638290.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] gi|355504225|gb|AES85428.1| hypothetical protein MTR_126s0001, partial [Medicago truncatula] Length = 555 Score = 410 bits (1055), Expect = e-112 Identities = 227/537 (42%), Positives = 326/537 (60%), Gaps = 14/537 (2%) Frame = +1 Query: 508 FREKMMMAILKHDLPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRN 687 F E IL HDLPF F E EG+R ++N R+ +E V +Y +EK K++ Sbjct: 19 FVEICASTILAHDLPFHFFELEGMRKYSEFLNPNIPIPPRNVIEAYVSHLYTKEKPKLKQ 78 Query: 688 LLHSIPGRISLTFDLWTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEK 867 L +IP RISL+FDLW S +T+ Y+CLTAHF+D NWKL KV+NF + PP T+ + E+ Sbjct: 79 QLTTIPNRISLSFDLWESNTTETYICLTAHFVDANWKLNSKVINFRLVYPP-TSGEICER 137 Query: 868 IYALLNEWGIERKLFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNL 1047 + LLN+WGIE+K+FS+T+DD+S N++ +L+ +L L+NGLLC+GEFFHV C A +LN Sbjct: 138 MVELLNDWGIEKKIFSLTIDDSSENEILQEQLKTQLVLQNGLLCDGEFFHVNCFARVLNQ 197 Query: 1048 IVQDGLKEIDSSVVDIRESVKYLKSSNQRRRKFLECVRLVS-LETKKALRQDIPTRWNST 1224 IV++ LK + V IRES+ +++ S RR KF EC V +++ L DI +ST Sbjct: 
198 IVEEALKLVSCGVHKIRESIMFVRHSKSRREKFKECFEKVGGVDSSVHLHLDISMSLSST 257 Query: 1225 FLMLESALYYRRAFMHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPT 1404 +++LE AL YR AF F + D Y CPS EEW+RVEK+ FL F + +M + T +PT Sbjct: 258 YMLLERALKYRCAFESFHLYDDSYDLCPSAEEWKRVEKICAFLLPFCETANMINSTTHPT 317 Query: 1405 SNLYFPQVLVVQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKL 1584 SNLYF QV VQ L ++ + D + MA M KFE YW + V+LA+ VLDPR K Sbjct: 318 SNLYFLQVWKVQCVLVDSLGDEDEDIKKMAERMMSKFEKYWDEYSVVLALGAVLDPRMKF 377 Query: 1585 RFVEWSYERLYGSG-SSQINEVSEKLYSLFETYKQNSTNS-VERTSKN---------LQK 1731 + + Y +L S ++ +V KL LFE + NST + V+RT K LQK Sbjct: 378 TTLAYCYSKLDASTCERKLQQVKRKLCMLFEKHSGNSTTAGVQRTIKENQDQSSSMPLQK 437 Query: 1732 DVSHGTHRLPDFMEEFDTFSAENXXXXXXXXELDLYLDEQRSDWR--KELDVLDFWRDNR 1905 + +H L D ++ + +LD+YLDE D+R E+DVL +W+ N Sbjct: 438 KLKSLSHGLFDELK----VHHQQLVTKTGKSQLDVYLDESVLDFRCYAEMDVLQWWKSNN 493 Query: 1906 YRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWIF 2076 RFP LS++A D+LS+PI+ V+S++ F +G RV NK + + P VE+ C+R+W++ Sbjct: 494 DRFPDLSILACDLLSVPIAAVASDSEFCMGSRVFNKYKDRMLPMNVEARICTRSWLY 550 >gb|AAP59878.1| Ac-like transposase THELMA13 [Silene latifolia] Length = 682 Score = 410 bits (1055), Expect = e-112 Identities = 243/619 (39%), Positives = 346/619 (55%), Gaps = 12/619 (1%) Frame = +1 Query: 208 TPTSMN---PASSSGCSITSKRRRTSNVWNHFEML--SLTADNKPRARCRQC-GAIYSCD 369 TP+S N PA S S T R+ TS VW H+++ SL D RA C+ C G Sbjct: 34 TPSSQNDNIPAPSVS-SETRNRKWTSPVWQHYKLFDASLFPDGIARAICKYCDGGPTLAY 92 Query: 370 SRNGTSNLNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHDL 549 S NGTSN RH C K P LG L+ GS + P V++E++ +A+++H Sbjct: 93 SGNGTSNFKRHTETC--PKRPLLGVAHLTSDGSFIKKMDPL----VYKERVALAVIRHAF 146 Query: 550 PFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTFD 729 PF + EY+G R +NE K ++R+T+ +KI+ REK ++ L ++PG+I LT D Sbjct: 147 PFSYAEYDGNRWLHEGLNESYKPISRNTLRNYCMKIHKREKQILKESLSNLPGKICLTTD 206 Query: 730 LWTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERKL 909 +WT+ Y+ LTAH++D W L K+LNF + PPH A +L + IYA L EW I K+ 
Sbjct: 207 MWTAFVGMGYISLTAHYIDSEWNLHSKILNFCHLEPPHDAPSLHDSIYAKLKEWDIRSKI 266 Query: 910 FSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSVV 1089 F++TLD+A ND L N L+L + +LC+GE+FHV C AHILNLIVQDGLK IDS V Sbjct: 267 FTITLDNARCNDNMQDLLMNSLSLHSPILCDGEYFHVRCAAHILNLIVQDGLKVIDSGVR 326 Query: 1090 DIRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAF- 1266 +R V ++ S +R KF + ++T K L D TRWNST+ MLE A+ YR F Sbjct: 327 KLRMVVAHIVGSERRLIKFKGNASALGVDTSKKLCLDCVTRWNSTYNMLERAMIYRNVFP 386 Query: 1267 ----MHFAVIDPDYKHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLV 1434 DP + PS EW R+ K+ + L F +T + SG +YPT+NLYF V Sbjct: 387 TMRGPEMKKFDPHFPEPPSEAEWIRIVKIVELLKPFDHITTLISGRKYPTANLYFKSVWK 446 Query: 1435 VQDTLTKAMKEGDGFVHGMAVEMNKKFENYWSKSCVILAIAVVLDPRYKLRFVEWSYERL 1614 +Q LT+ K D + MA M KF+ YW +IL+ A +LDPRYKL F+++ + +L Sbjct: 447 IQYLLTRYAKCNDTHLKDMADLMRIKFDKYWENYSMILSFAAILDPRYKLPFIKYCFHKL 506 Query: 1615 -YGSGSSQINEVSEKLYSLFETYKQNSTNSVERTSKNLQKDVSHGTHRLPDFMEEFDTFS 1791 S + V +K Y L+E Y + S + ++ TS + +PD + F F Sbjct: 507 DPESAELKTKVVKDKFYKLYEEYVKYSPHVLKETSVQM----------IPDELPGFANF- 555 Query: 1792 AENXXXXXXXXELDLYLDEQRSDWRKELDVLDFWRDNRYRFPCLSLMARDILSIPISTVS 1971 + LD YLD+ R D +DVL +W++N ++ L+ MA DIL+I I+TV+ Sbjct: 556 -DGGAVIGGLSYLDTYLDDARLDHTLNIDVLKWWKENESKYLVLAEMAIDILTIQINTVA 614 Query: 1972 SEAAFNIGRRVVNKSRSVL 2028 SE+AF + RV+ K R+ L Sbjct: 615 SESAFRMESRVLMKWRTTL 633 >gb|ACX85638.1| putative transposase [Cucumis melo] Length = 680 Score = 391 bits (1004), Expect = e-106 Identities = 239/679 (35%), Positives = 367/679 (54%), Gaps = 33/679 (4%) Frame = +1 Query: 205 LTPTSMNPASSSGCS--ITSKRRRT---SNVWNHFEMLSLTADNKPRARCRQCGAIYSCD 369 +T S++ S+ CS + KR+ S+VW HF + PRA C+ CGA Y+CD Sbjct: 1 MTSFSVDETSNQSCSSPVLGKRKPVKPPSSVWEHFIKVEGCDPKYPRAACKHCGASYACD 60 Query: 370 S-RNGTSNLNRHVRICIRKKNPDLGQMFLSQSGSLMSMKPPKFSLKVFREKMMMAILKHD 546 S RNGT+NL RH+ C NP L + S ++ F+ + R+ + ++ + Sbjct: 61 SKRNGTTNLKRHLEKCKMYVNP-LEDNVEGEGDSESNLMTASFTQENCRKMLARMVILDE 119 Query: 547 
LPFQFVEYEGIRDALGYINEGAKFVTRDTVEEDVLKIYHREKAKVRNLLHSIPGRISLTF 726 LPF+FVE EG +N +R TV +D ++Y +EK K++N L R+ LT Sbjct: 120 LPFKFVESEGFHQFCRALNPKFVIPSRVTVAKDCFQMYMKEKKKLKNALTRSGQRVCLTT 179 Query: 727 DLWTSISTDEYLCLTAHFLDQNWKLQKKVLNFHFISPPHTAIALSEKIYALLNEWGIERK 906 D WTS+ Y+ +TAHF+D +W L K++LNF ++ H + I L WGI+R Sbjct: 180 DTWTSVQNINYMVITAHFIDDDWNLHKRILNFCQVAN-HKGDTIGRAIEKCLEGWGIDR- 237 Query: 907 LFSVTLDDASTNDLFVGELRNELNLKNGLLCNGEFFHVCCCAHILNLIVQDGLKEIDSSV 1086 LF+VT+D+AS+ND+ + L + +NGL+ +GEF H+ CCAHILNLIV D LK++ S+ Sbjct: 238 LFTVTVDNASSNDVAIAYLVKKFKGRNGLVLDGEFIHIRCCAHILNLIVSDALKDLHVSI 297 Query: 1087 VDIRESVKYLKSSNQRRRKFLECVRLVSLETKKALRQDIPTRWNSTFLMLESALYYRRAF 1266 + IR +VKY++SS R + F + + + TK L D+PTRWNSTF ML+ A+ ++ F Sbjct: 298 IRIRNAVKYVRSSPARLQIFKDFAKEDKMSTKNCLTMDVPTRWNSTFTMLDGAIKCQKTF 357 Query: 1267 MHFAVIDPDY---KHCPSHEEWERVEKLFKFLGVFYKVTDMFSGTQYPTSNLYFPQVLVV 1437 DP Y P+ E+W+ + KFL F +VT FS + TSN++F ++ ++ Sbjct: 358 ERLEEHDPSYLPKDDIPTTEDWDNAKVFVKFLKTFSEVTMKFSASMSVTSNIFFHELCLI 417 Query: 1438 QDTLTKAMKEGDGFVHGMAVEMNKKFENYW-----SKSCVILAIAVVLDPRYKLRFVEWS 1602 Q+ + + + + M + M KF YW K+ ++L ++VVLDPRYKL +V + Sbjct: 418 QEIIREYSSYENALLSQMTLSMQTKFNKYWGITTSEKTNLLLYVSVVLDPRYKLAYVNYC 477 Query: 1603 YERLYGSGSSQI--NEVSEKLYSLFETY-------KQNSTNS---VERTSKNLQKDV--- 1737 + ++I N+V E L + Y K + T S +E Q ++ Sbjct: 478 FNEFLEEDCAKIWTNKVEEAFRRLCDDYYMRMSKEKYSQTQSCTPIEGFGFQSQSEIPSI 537 Query: 1738 -SHGTHRLPDFMEEFDTFSAEN-XXXXXXXXELDLYLDEQRSDWRKE--LDVLDFWRDNR 1905 S G+++ + D F N E+ YLDE R D + LD+L +W+ N Sbjct: 538 SSSGSYKARATVH--DRFKQSNKTCLDDAKTEVTRYLDEARIDCMGDEYLDLLTWWKVNA 595 Query: 1906 YRFPCLSLMARDILSIPISTVSSEAAFNIGRRVVNKSRSVLKPEIVESTFCSRNWIFGKQ 2085 RF +S +ARDI SIPISTV SE+AF+ G RV++ RS L P+ E+ C++NWI K Sbjct: 596 SRFKIISQVARDIYSIPISTVPSESAFSTGGRVLDSFRSSLTPQTAEALICAQNWIQSKP 655 Query: 2086 YDMEPDVNEMCEDVTKLNI 2142 D + + E++ + NI Sbjct: 656 LDDMTEEIDGAEEIDEGNI 674