BLASTX nr result
ID: Akebia27_contig00016080
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00016080
         (1150 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006434757.1| hypothetical protein CICLE_v10002377mg [Citr...   254   4e-65
ref|XP_007017215.1| Uncharacterized protein isoform 1 [Theobroma...   254   6e-65
gb|EXC32856.1| hypothetical protein L484_009556 [Morus notabilis]     253   8e-65
ref|XP_002282478.1| PREDICTED: protein DCL, chloroplastic [Vitis...   253   1e-64
ref|XP_006473317.1| PREDICTED: protein DCL, chloroplastic-like [...   252   2e-64
ref|XP_004141259.1| PREDICTED: protein DCL, chloroplastic-like [...   247   8e-63
ref|XP_004291102.1| PREDICTED: protein DCL, chloroplastic-like [...   243   1e-61
ref|XP_007017218.1| Uncharacterized protein isoform 4 [Theobroma...   240   9e-61
ref|XP_002284778.1| PREDICTED: protein DCL, chloroplastic-like i...   239   2e-60
gb|EYU36304.1| hypothetical protein MIMGU_mgv1a013445mg [Mimulus...   237   6e-60
ref|XP_006304196.1| hypothetical protein CARUB_v10010275mg [Caps...   237   6e-60
gb|AFK33859.1| unknown [Lotus japonicus]                              237   6e-60
ref|NP_683398.1| uncharacterized protein [Arabidopsis thaliana] ...   237   8e-60
ref|XP_007225808.1| hypothetical protein PRUPE_ppa011355mg [Prun...   236   1e-59
emb|CAD12248.1| DCL protein [Coffea arabica]                          236   1e-59
ref|XP_002894020.1| hypothetical protein ARALYDRAFT_473852 [Arab...   235   2e-59
ref|NP_001235407.1| uncharacterized protein LOC100306718 [Glycin...   235   3e-59
ref|XP_006393639.1| hypothetical protein EUTSA_v10011763mg [Eutr...   234   4e-59
gb|ABC47855.1| defective chloroplast and leaves protein [Glycine...
234 5e-59 ref|NP_001031151.1| uncharacterized protein [Arabidopsis thalian... 234 7e-59 >ref|XP_006434757.1| hypothetical protein CICLE_v10002377mg [Citrus clementina] gi|557536879|gb|ESR47997.1| hypothetical protein CICLE_v10002377mg [Citrus clementina] Length = 227 Score = 254 bits (650), Expect = 4e-65 Identities = 129/211 (61%), Positives = 153/211 (72%), Gaps = 5/211 (2%) Frame = +2 Query: 89 TNPTSLHFSASPFYFHYHIIPLRSQFRALKAS----SEGINVGNQDGNGPDLLRRPAISP 256 +NPTS SP + + S F L+ S S+G +G+Q+ G +LLR+P +SP Sbjct: 21 SNPTSTSLFLSPVILSFPFYRMTS-FLHLRVSAALRSDGDKIGSQESQGFNLLRKPVVSP 79 Query: 257 IYKT-DGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMILHSG 433 + DG+ +K W+DWED+ILEDTVPLVGFVRMILHSG Sbjct: 80 ASRDLDGNSEKDEGESEDDEG-------------WIDWEDKILEDTVPLVGFVRMILHSG 126 Query: 434 KYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGE 613 +Y+SG +LSPEHE+ ILERLLPYHPEFEKKIGCGID+IT+GYHPDFE+SRCLFIVRKDGE Sbjct: 127 RYESGVRLSPEHERTILERLLPYHPEFEKKIGCGIDYITIGYHPDFESSRCLFIVRKDGE 186 Query: 614 LVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 LVDFS+WKCIKGLIRKNYPLYADSFILRHFR Sbjct: 187 LVDFSYWKCIKGLIRKNYPLYADSFILRHFR 217 >ref|XP_007017215.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590592209|ref|XP_007017216.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590592213|ref|XP_007017217.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590592219|ref|XP_007017219.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508722543|gb|EOY14440.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508722544|gb|EOY14441.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508722545|gb|EOY14442.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508722547|gb|EOY14444.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 215 Score = 254 bits (648), Expect = 6e-65 Identities = 128/225 (56%), Positives = 157/225 (69%), Gaps = 3/225 (1%) Frame = +2 Query: 50 MSCISKSPSFLFRTNPTSLHFSASPFYFH---YHIIPLRSQFRALKASSEGINVGNQDGN 220 M+ + K P + F N S+ S+SP L+ + 
AL+ S+G +G+Q+ Sbjct: 1 MASVLKPPPY-FHRNCISISSSSSPVILSSPSQRTTSLQVRSCALRTGSDGGRIGSQESY 59 Query: 221 GPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPL 400 G D+LR+P+I + G+ ++ W+DWED+ILEDTVPL Sbjct: 60 GADMLRKPSILTPKDSGGTSEQEEGSEGKRKRGK-----------WIDWEDRILEDTVPL 108 Query: 401 VGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENS 580 VGFVRMI+HSGKY+SGD+LSPEHEK IL+RLLPYHPE EKKIGCGID+ITVGYHPDFE S Sbjct: 109 VGFVRMIIHSGKYESGDRLSPEHEKTILDRLLPYHPECEKKIGCGIDYITVGYHPDFEGS 168 Query: 581 RCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 RCLFIVRKDGEL+DFS+WKCIKGLIRKNYPLYADSFILRHFR + Sbjct: 169 RCLFIVRKDGELIDFSYWKCIKGLIRKNYPLYADSFILRHFRRRR 213 >gb|EXC32856.1| hypothetical protein L484_009556 [Morus notabilis] Length = 233 Score = 253 bits (647), Expect = 8e-65 Identities = 135/235 (57%), Positives = 158/235 (67%), Gaps = 13/235 (5%) Frame = +2 Query: 50 MSCISKSPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRA----------LKASSEG-- 193 M+ IS P L + +L + HY + L FRA LK S+G Sbjct: 1 MAFISNPPPLLNNLHRHTLS------HHHYSPVTLSFPFRATTSFPARVGALKTGSDGGG 54 Query: 194 INVGNQDGNGPDLLRRPAISPIYKTDG-SKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWE 370 +G+Q+ GPDLLR+P +SP G S+++ WVDWE Sbjct: 55 SRIGSQELFGPDLLRKPVVSPRKDLAGISEEEKEIERKRNYGGGGGGDDDEEEDKWVDWE 114 Query: 371 DQILEDTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFIT 550 D+ILEDTVPLVGFVRMILHS KY+SGD+LSPEHEK ILERLLP+HPEFEKKIGCGID+IT Sbjct: 115 DKILEDTVPLVGFVRMILHSEKYESGDRLSPEHEKTILERLLPFHPEFEKKIGCGIDYIT 174 Query: 551 VGYHPDFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 VGYHPDFE SRCLFIV+KDG+LVDFS+WKCIKGLIRKNYPLYADSFILRHFR + Sbjct: 175 VGYHPDFERSRCLFIVQKDGKLVDFSYWKCIKGLIRKNYPLYADSFILRHFRQRR 229 >ref|XP_002282478.1| PREDICTED: protein DCL, chloroplastic [Vitis vinifera] gi|147773590|emb|CAN69898.1| hypothetical protein VITISV_032063 [Vitis vinifera] Length = 205 Score = 253 bits (646), Expect = 1e-64 Identities = 131/209 (62%), Positives = 151/209 (72%) Frame = +2 Query: 89 TNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRRPAISPIYKT 268 TNP SLH S +F H P 
ALK +S+G +Q+ GPDLLR+P +S Sbjct: 18 TNPISLHPSHPILHFATHRTP---SVPALKTASDG----SQEVYGPDLLRKPHVSAP-DD 69 Query: 269 DGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMILHSGKYKSG 448 +G +K+ WVDWEDQILEDTVPLVGFVRMILHSGKY+SG Sbjct: 70 EGDRKRTKRKGGE----------------WVDWEDQILEDTVPLVGFVRMILHSGKYRSG 113 Query: 449 DKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGELVDFS 628 ++LSPEHEK ILERLLPYHP +E+KIG GID+ITVGYHP+FE+SRCLFIVRKDGELVDFS Sbjct: 114 ERLSPEHEKIILERLLPYHPGYERKIGSGIDYITVGYHPEFESSRCLFIVRKDGELVDFS 173 Query: 629 FWKCIKGLIRKNYPLYADSFILRHFRMHK 715 +WKCIKG IRKNYPLYADSFILRHFR H+ Sbjct: 174 YWKCIKGFIRKNYPLYADSFILRHFRQHR 202 >ref|XP_006473317.1| PREDICTED: protein DCL, chloroplastic-like [Citrus sinensis] Length = 227 Score = 252 bits (643), Expect = 2e-64 Identities = 128/211 (60%), Positives = 152/211 (72%), Gaps = 5/211 (2%) Frame = +2 Query: 89 TNPTSLHFSASPFYFHYHIIPLRSQFRALKAS----SEGINVGNQDGNGPDLLRRPAISP 256 +NPTS SP + S F L+ S S+G +G+Q+ G +LLR+P +SP Sbjct: 21 SNPTSTSLFLSPVILSFPFYRTTS-FLHLRVSAALRSDGDKIGSQESQGFNLLRKPVVSP 79 Query: 257 IYKT-DGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMILHSG 433 + DG+ +K W+DWED+ILEDTVPLVGFVRMILHSG Sbjct: 80 ASRDLDGNSEKDEGESEDDEG-------------WIDWEDKILEDTVPLVGFVRMILHSG 126 Query: 434 KYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGE 613 +Y+SG +LSPEHE+ ILERLLPYHPEF+KKIGCGID+IT+GYHPDFE+SRCLFIVRKDGE Sbjct: 127 RYESGVRLSPEHERTILERLLPYHPEFKKKIGCGIDYITIGYHPDFESSRCLFIVRKDGE 186 Query: 614 LVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 LVDFS+WKCIKGLIRKNYPLYADSFILRHFR Sbjct: 187 LVDFSYWKCIKGLIRKNYPLYADSFILRHFR 217 >ref|XP_004141259.1| PREDICTED: protein DCL, chloroplastic-like [Cucumis sativus] gi|449519344|ref|XP_004166695.1| PREDICTED: protein DCL, chloroplastic-like [Cucumis sativus] Length = 225 Score = 247 bits (630), Expect = 8e-63 Identities = 130/228 (57%), Positives = 152/228 (66%), Gaps = 5/228 (2%) Frame = +2 Query: 47 AMSCISKSPSF--LFRTNPTSLHFSASPFYFHYHIIPLRS---QFRALKASSEGINVGNQ 211 AM+ I K P F NP F++SP + P+ S RALK EGI + + Sbjct: 2 
AMASILKPPPFPPFHSLNPN--FFNSSPLILCFPTHPINSFHPSTRALKTGPEGIRLRSH 59 Query: 212 DGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDT 391 DLLR+P + P S K WVDWED+ILEDT Sbjct: 60 QEYSSDLLRKP-VGP------SAKDLAGPSEDDDSSEESGNEDEEEVEWVDWEDKILEDT 112 Query: 392 VPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDF 571 VPLVGFVRM+LH+GKY++GD+L PEHEK ILERLLPYHPE EKKIGCG+D+ITVGYHPDF Sbjct: 113 VPLVGFVRMVLHTGKYENGDRLRPEHEKTILERLLPYHPESEKKIGCGVDYITVGYHPDF 172 Query: 572 ENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 E+SRCLFIVRKDGE+VDFS+WKCIKGLIRKNYPLYA+SFILRHFR + Sbjct: 173 ESSRCLFIVRKDGEMVDFSYWKCIKGLIRKNYPLYAESFILRHFRRRR 220 >ref|XP_004291102.1| PREDICTED: protein DCL, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 204 Score = 243 bits (620), Expect = 1e-61 Identities = 125/224 (55%), Positives = 153/224 (68%), Gaps = 2/224 (0%) Frame = +2 Query: 50 MSCISKSPSFLFRTNPTSLHFSASP--FYFHYHIIPLRSQFRALKASSEGINVGNQDGNG 223 M+ +SK P + +S H P F LR++ ALK + G Q+ +G Sbjct: 1 MASLSKPPLLVHNHLTSSSHALIGPATLSFSKTTSLLRARLCALKTGA-----GRQESDG 55 Query: 224 PDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLV 403 P+LLR+P + + DG+ ++ WVDWED+ILEDTVPLV Sbjct: 56 PELLRKPVVKDL---DGNSEEDEGEDGK----------------WVDWEDKILEDTVPLV 96 Query: 404 GFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSR 583 GFVRMILHSGKY+SGD+LSPEH+K +LERLLP+HPE KKIGCGID++TVGYHPDFE+SR Sbjct: 97 GFVRMILHSGKYESGDRLSPEHQKTVLERLLPFHPEAAKKIGCGIDYVTVGYHPDFESSR 156 Query: 584 CLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 CLFIV+KDG LVDFS+WKCIKGLIRKNYPLYADSFILRHFR + Sbjct: 157 CLFIVQKDGTLVDFSYWKCIKGLIRKNYPLYADSFILRHFRKRR 200 >ref|XP_007017218.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508722546|gb|EOY14443.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 209 Score = 240 bits (612), Expect = 9e-61 Identities = 124/221 (56%), Positives = 151/221 (68%), Gaps = 3/221 (1%) Frame = +2 Query: 50 MSCISKSPSFLFRTNPTSLHFSASPFYFH---YHIIPLRSQFRALKASSEGINVGNQDGN 220 M+ + K P + F N S+ S+SP L+ 
+ AL+ S+G +G+Q+ Sbjct: 1 MASVLKPPPY-FHRNCISISSSSSPVILSSPSQRTTSLQVRSCALRTGSDGGRIGSQESY 59 Query: 221 GPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPL 400 G D+LR+P+I + G+ ++ W+DWED+ILEDTVPL Sbjct: 60 GADMLRKPSILTPKDSGGTSEQEEGSEGKRKRGK-----------WIDWEDRILEDTVPL 108 Query: 401 VGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENS 580 VGFVRMI+HSGKY+SGD+LSPEHEK IL+RLLPYHPE EKKIGCGID+ITVGYHPDFE Sbjct: 109 VGFVRMIIHSGKYESGDRLSPEHEKTILDRLLPYHPECEKKIGCGIDYITVGYHPDFEGL 168 Query: 581 RCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHF 703 RCLFIV KDGELV FS+WKCIKGLIRKNYPLYADSFILR F Sbjct: 169 RCLFIVWKDGELVVFSYWKCIKGLIRKNYPLYADSFILRQF 209 >ref|XP_002284778.1| PREDICTED: protein DCL, chloroplastic-like isoform 1 [Vitis vinifera] Length = 215 Score = 239 bits (610), Expect = 2e-60 Identities = 131/223 (58%), Positives = 148/223 (66%), Gaps = 3/223 (1%) Frame = +2 Query: 47 AMSCISKS--PSFLFRTNPTSLHFSASPFYF-HYHIIPLRSQFRALKASSEGINVGNQDG 217 AM+ ++ S P L R NP SLH S S F LR A K S G +G+QD Sbjct: 2 AMAYVASSLLPVRLHR-NPISLHLSPSLRSFPSRQTTSLRPVLCARKPRSPGGKLGSQDA 60 Query: 218 NGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVP 397 D LR+P ISP GS + WVDWEDQILEDTVP Sbjct: 61 RASDFLRKPTISPGDDGGGSSVREKSYKGREEEE------------WVDWEDQILEDTVP 108 Query: 398 LVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFEN 577 LVG+VRMI+HSGKY++GD+LS EHEK +LE+LL YHPE EKKIGCGID+ITVGYHPDFE Sbjct: 109 LVGYVRMIIHSGKYENGDRLSLEHEKFVLEKLLAYHPECEKKIGCGIDYITVGYHPDFEG 168 Query: 578 SRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 SRCLFIVR DGELVDFS+WKCIKGLIRK YP YADSFILRHF+ Sbjct: 169 SRCLFIVRNDGELVDFSYWKCIKGLIRKKYPQYADSFILRHFQ 211 >gb|EYU36304.1| hypothetical protein MIMGU_mgv1a013445mg [Mimulus guttatus] Length = 220 Score = 237 bits (605), Expect = 6e-60 Identities = 129/232 (55%), Positives = 151/232 (65%), Gaps = 9/232 (3%) Frame = +2 Query: 47 AMSCISKSPSFL-FRTNPTSLHFSASPFY--FHYHIIPLRSQFRALKASSEGINVGNQDG 217 A C+S SP F P S + SPF F +H PL A+K S+ ++ Sbjct: 2 
ASICVSNSPDIRSFHRKPISKNLILSPFSLSFPFHKAPLC----AVKTGSDDGGAAARNS 57 Query: 218 NGP---DLLRRPAIS---PIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQI 379 P DLLR+P S P+ + + K+ WVDWEDQI Sbjct: 58 QTPYAADLLRKPLASSPAPVEQEETVKE------------YSGEKKGGDGDTWVDWEDQI 105 Query: 380 LEDTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGY 559 LEDTVPLVGFVRMILHSGKY+SG +LSPEHE+ IL+RLL YHPE EKKIGCG+D+IT+GY Sbjct: 106 LEDTVPLVGFVRMILHSGKYESGTRLSPEHERTILDRLLAYHPESEKKIGCGVDYITIGY 165 Query: 560 HPDFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 HP+FE SRCLFIVRKDGELVDFS+WKCIKGLIR NYPLYADSFILRHFR + Sbjct: 166 HPNFETSRCLFIVRKDGELVDFSYWKCIKGLIRTNYPLYADSFILRHFRRRR 217 >ref|XP_006304196.1| hypothetical protein CARUB_v10010275mg [Capsella rubella] gi|482572907|gb|EOA37094.1| hypothetical protein CARUB_v10010275mg [Capsella rubella] Length = 216 Score = 237 bits (605), Expect = 6e-60 Identities = 123/230 (53%), Positives = 155/230 (67%), Gaps = 7/230 (3%) Frame = +2 Query: 47 AMSCISKSPSFLFRTNPTSLHFSAS----PFYFHY---HIIPLRSQFRALKASSEGINVG 205 +++ +S SP FR FS+S P ++ I LR + RAL+ S+G +G Sbjct: 2 SLASVSCSPPPCFRCGAYIFSFSSSSSSSPLCLYFPRGDSISLRPRVRALRTESDGARIG 61 Query: 206 NQDGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILE 385 N + G +LLRRP IS ++ ++ +VDWED+ILE Sbjct: 62 NTESYGSELLRRPHISSGESSEEEEESGEGDE------------------FVDWEDKILE 103 Query: 386 DTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHP 565 TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPEFEKKIGCGID+I VG+HP Sbjct: 104 VTVPLVGFVRMILHSGKYANQDRLSPEHERMIVEMLLPYHPEFEKKIGCGIDYIMVGHHP 163 Query: 566 DFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 DFE+SRC+FIVR+DGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR + Sbjct: 164 DFESSRCMFIVRRDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 213 >gb|AFK33859.1| unknown [Lotus japonicus] Length = 224 Score = 237 bits (605), Expect = 6e-60 Identities = 125/213 (58%), Positives = 144/213 (67%) Frame = +2 Query: 68 SPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRRPA 247 +PS FR+ P L F FY+ + PL ++ ALKA++ +LLR+P Sbjct: 21 
NPSSSFRSPPLILSFR---FYWSASL-PLHTRLSALKAAAAA-----SSDAADNLLRKPL 71 Query: 248 ISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMILH 427 I+P K WVDWEDQILEDTVPLVGFVR ILH Sbjct: 72 ITP------RKDPAGVLEEHGYAYEEEDDEEEEEDKWVDWEDQILEDTVPLVGFVRTILH 125 Query: 428 SGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKD 607 SG Y+SGD+LSPEHEK ILE+LLP+HPE EKKIGCGID+IT+GYHP F+ SRCLFIVRKD Sbjct: 126 SGHYESGDRLSPEHEKTILEKLLPFHPESEKKIGCGIDYITIGYHPQFDRSRCLFIVRKD 185 Query: 608 GELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 GELVDFS+WKCIKGLIRKNYPLYADSFILRHFR Sbjct: 186 GELVDFSYWKCIKGLIRKNYPLYADSFILRHFR 218 >ref|NP_683398.1| uncharacterized protein [Arabidopsis thaliana] gi|12321014|gb|AAG50632.1|AC083835_17 defective chloroplasts and leaves (DCL) protein, putative [Arabidopsis thaliana] gi|22135799|gb|AAM91086.1| At1g45261/F2G19.1 [Arabidopsis thaliana] gi|48310651|gb|AAT41860.1| At1g45261 [Arabidopsis thaliana] gi|62318604|dbj|BAD95026.1| defective chloroplasts and leaves (DCL) protein [Arabidopsis thaliana] gi|332193992|gb|AEE32113.1| uncharacterized protein AT1G45230 [Arabidopsis thaliana] Length = 219 Score = 237 bits (604), Expect = 8e-60 Identities = 124/229 (54%), Positives = 153/229 (66%), Gaps = 5/229 (2%) Frame = +2 Query: 44 SAMSCISKSP--SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 208 S S S SP S FR FS+SP ++ LR + RAL+ S+G +GN Sbjct: 2 SLASIPSSSPVASPYFRCRTYIFSFSSSPLCLYFPRGDSTSLRPRVRALRTESDGAKIGN 61 Query: 209 QDGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILED 388 + G +LLRRP I+ ++ +++ +VDWED+ILE Sbjct: 62 SESYGSELLRRPRIASEESSEEEEEEEEENSEGDE--------------FVDWEDKILEV 107 Query: 389 TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 568 TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I VG+HPD Sbjct: 108 TVPLVGFVRMILHSGKYANRDRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVGHHPD 167 Query: 569 FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR + Sbjct: 168 FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 216 
>ref|XP_007225808.1| hypothetical protein PRUPE_ppa011355mg [Prunus persica] gi|462422744|gb|EMJ27007.1| hypothetical protein PRUPE_ppa011355mg [Prunus persica] Length = 214 Score = 236 bits (603), Expect = 1e-59 Identities = 127/228 (55%), Positives = 152/228 (66%), Gaps = 6/228 (2%) Frame = +2 Query: 50 MSCISKSPSFLFRTNPTSLHFSASPFYFHYHIIP---LRSQFRALKASSEG-INVGNQDG 217 M+ +S+SP L SL + P + + L+++ ALK ++G G Sbjct: 1 MASLSRSPP-LVHHQINSLTLNPCPVTLSFPFLKTTALQARLCALKTGADGGSRTGRPGT 59 Query: 218 NGPD--LLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDT 391 GPD LLR+P +S DG + WVDWED+ILEDT Sbjct: 60 QGPDPGLLRKPVVSSGKDMDGISDEDEGEDGK----------------WVDWEDKILEDT 103 Query: 392 VPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDF 571 VPLVGFVRMILHSGKY+SGD+LSPEHEK +LERLLP+HPE +KKIG GID+ITVGYHPDF Sbjct: 104 VPLVGFVRMILHSGKYESGDRLSPEHEKTVLERLLPFHPEAQKKIGSGIDYITVGYHPDF 163 Query: 572 ENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 E+SRCLFIV+KDG LVDFS+WKCIKGLIRKNYPLYADSFILRHFR + Sbjct: 164 ESSRCLFIVQKDGTLVDFSYWKCIKGLIRKNYPLYADSFILRHFRKRR 211 >emb|CAD12248.1| DCL protein [Coffea arabica] Length = 224 Score = 236 bits (602), Expect = 1e-59 Identities = 122/215 (56%), Positives = 143/215 (66%) Frame = +2 Query: 62 SKSPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRR 241 S S S +P SL F P Y+ LR SSEG D G +LLR+ Sbjct: 18 SNSISSNLLLSPPSLSFQLYP----YNRSQLRRYAAVRTTSSEG---RGSDSFGAELLRK 70 Query: 242 PAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMI 421 P +SP ++G WVDWEDQIL+DTVPLV FVRMI Sbjct: 71 PVVSPAVVSEGEDS-------VVEEDDKYRSGGEEVEAWVDWEDQILQDTVPLVNFVRMI 123 Query: 422 LHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVR 601 LHSGKY+SGD+LSPEHE+ ILER+LPYHP+ EKKIG G+D+IT+GYHPDF+ SRCLFIVR Sbjct: 124 LHSGKYESGDRLSPEHERTILERVLPYHPQCEKKIGSGVDYITIGYHPDFDRSRCLFIVR 183 Query: 602 KDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 KDGELVDFS+WKCIKGLIRKNYPLYAD+FI+RHF+ Sbjct: 184 KDGELVDFSYWKCIKGLIRKNYPLYADTFIIRHFK 218 >ref|XP_002894020.1| hypothetical protein ARALYDRAFT_473852 
[Arabidopsis lyrata subsp. lyrata] gi|297339862|gb|EFH70279.1| hypothetical protein ARALYDRAFT_473852 [Arabidopsis lyrata subsp. lyrata] Length = 218 Score = 235 bits (600), Expect = 2e-59 Identities = 121/229 (52%), Positives = 155/229 (67%), Gaps = 6/229 (2%) Frame = +2 Query: 47 AMSCISKSP---SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 208 +++ IS SP S FR F++SP ++ L+ + RAL+ S+G +GN Sbjct: 2 SLASISSSPPVASPYFRCRAYIFSFASSPLCLYFPRGDSTSLKPRVRALRTESDGARIGN 61 Query: 209 QDGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILED 388 + G +LLRRP I+ ++ +++ +VDWED+ILE Sbjct: 62 TESYGSELLRRPRIASEESSEEEEEEEETGEGDE---------------FVDWEDKILEV 106 Query: 389 TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 568 TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPEFEKKIGCGID+I V +HPD Sbjct: 107 TVPLVGFVRMILHSGKYANRDRLSPEHERTIVEMLLPYHPEFEKKIGCGIDYIMVWHHPD 166 Query: 569 FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR + Sbjct: 167 FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 215 >ref|NP_001235407.1| uncharacterized protein LOC100306718 [Glycine max] gi|255629361|gb|ACU15025.1| unknown [Glycine max] Length = 212 Score = 235 bits (599), Expect = 3e-59 Identities = 128/230 (55%), Positives = 151/230 (65%), Gaps = 3/230 (1%) Frame = +2 Query: 26 VKTERSSAMSCISKS-PSFLFRTNPTSLHFSASPFY--FHYHIIPLRSQFRALKASSEGI 196 + T + S + K+ P L TNP++ ++PF F +H +P ALKASS G Sbjct: 4 ISTSTMALASLLPKTAPHVLSLTNPSA----STPFILPFSFHCLPHPPLLSALKASSSG- 58 Query: 197 NVGNQDGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQ 376 G DL +P +S G K WVDWEDQ Sbjct: 59 --------GDDLRGKPLLS-----QGIGK---------GILEEHAFEDDDDDKWVDWEDQ 96 Query: 377 ILEDTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVG 556 ILEDTVPLVGFVRMILHSG+Y +GD+LS EHEK I+E+LLP+HPEFEKKIG G+D+IT+G Sbjct: 97 ILEDTVPLVGFVRMILHSGQYDNGDRLSAEHEKTIIEKLLPFHPEFEKKIGSGVDYITIG 156 Query: 557 YHPDFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 YHPDFE SRCLFIVR+DGELVDFS+WKCIKGLIRKNYPLYADSFILRHFR Sbjct: 
157 YHPDFERSRCLFIVREDGELVDFSYWKCIKGLIRKNYPLYADSFILRHFR 206 >ref|XP_006393639.1| hypothetical protein EUTSA_v10011763mg [Eutrema salsugineum] gi|557090217|gb|ESQ30925.1| hypothetical protein EUTSA_v10011763mg [Eutrema salsugineum] Length = 221 Score = 234 bits (598), Expect = 4e-59 Identities = 119/218 (54%), Positives = 145/218 (66%), Gaps = 7/218 (3%) Frame = +2 Query: 83 FRTNPTSLHFSASPFYFHYH-------IIPLRSQFRALKASSEGINVGNQDGNGPDLLRR 241 FR FS SP ++ + LR + RAL+ S+G +GN + G DLLRR Sbjct: 17 FRCRAYIFSFSPSPLCLYFPRGRGDSASLTLRPKIRALRTESDGARIGNTESYGSDLLRR 76 Query: 242 PAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMI 421 P IS + ++ +VDWED+ILE TVPLVGFVRMI Sbjct: 77 PRISSEEEESSGEEDESGEGDE----------------FVDWEDKILEVTVPLVGFVRMI 120 Query: 422 LHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVR 601 LHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I VG+HP+FE+SRC+FIVR Sbjct: 121 LHSGKYANRDRLSPEHERTIIEMLLPYHPEVEKKIGCGIDYIMVGHHPEFESSRCMFIVR 180 Query: 602 KDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 KDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR + Sbjct: 181 KDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 218 >gb|ABC47855.1| defective chloroplast and leaves protein [Glycine max] Length = 204 Score = 234 bits (597), Expect = 5e-59 Identities = 125/215 (58%), Positives = 145/215 (67%), Gaps = 2/215 (0%) Frame = +2 Query: 68 SPSFLFRTNPTSLHFSASPFY--FHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRR 241 +P L TNP++ ++PF F +H +P ALKASS G G DL + Sbjct: 11 APHVLSLTNPSA----STPFILPFSFHCLPHPPLLSALKASSSG---------GDDLRGK 57 Query: 242 PAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILEDTVPLVGFVRMI 421 P +S G K WVDWEDQILEDTVPLVGFVRMI Sbjct: 58 PLLS-----QGIGK---------GILEEHAFEDDDDDKWVDWEDQILEDTVPLVGFVRMI 103 Query: 422 LHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVR 601 LHSG+Y +GD+LS EHEK I+E+LLP+HPEFEKKIG G+D+IT+GYHPDFE SRCLFIVR Sbjct: 104 LHSGQYDNGDRLSAEHEKTIIEKLLPFHPEFEKKIGSGVDYITIGYHPDFERSRCLFIVR 163 Query: 602 KDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 706 +DGELVDFS+WKCIKGLIRKNYPLYADSFILRHFR Sbjct: 164 
EDGELVDFSYWKCIKGLIRKNYPLYADSFILRHFR 198 >ref|NP_001031151.1| uncharacterized protein [Arabidopsis thaliana] gi|332193993|gb|AEE32114.1| uncharacterized protein AT1G45230 [Arabidopsis thaliana] Length = 219 Score = 234 bits (596), Expect = 7e-59 Identities = 123/229 (53%), Positives = 152/229 (66%), Gaps = 5/229 (2%) Frame = +2 Query: 44 SAMSCISKSP--SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 208 S S S SP S FR FS+SP ++ LR + RAL+ S+G +GN Sbjct: 2 SLASIPSSSPVASPYFRCRTYIFSFSSSPLCLYFPRGDSTSLRPRVRALRTESDGAKIGN 61 Query: 209 QDGNGPDLLRRPAISPIYKTDGSKKKXXXXXXXXXXXXXXXXXXXXXXXWVDWEDQILED 388 + G +LLRRP I+ ++ +++ +VDWED+ILE Sbjct: 62 SESYGSELLRRPRIASEESSEEEEEEEEENSEGDE--------------FVDWEDKILEV 107 Query: 389 TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 568 TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I V +HPD Sbjct: 108 TVPLVGFVRMILHSGKYANRDRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVWHHPD 167 Query: 569 FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 715 FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR + Sbjct: 168 FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 216