BLASTX nr result

ID: Akebia23_contig00023092

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00023092
         (1134 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007017215.1| Uncharacterized protein isoform 1 [Theobroma...   273   1e-70
ref|XP_002282478.1| PREDICTED: protein DCL, chloroplastic [Vitis...   268   2e-69
gb|EXC32856.1| hypothetical protein L484_009556 [Morus notabilis]     267   5e-69
ref|XP_006434757.1| hypothetical protein CICLE_v10002377mg [Citr...   264   5e-68
ref|XP_004141259.1| PREDICTED: protein DCL, chloroplastic-like [...   262   2e-67
ref|XP_006473317.1| PREDICTED: protein DCL, chloroplastic-like [...   261   3e-67
ref|XP_007017218.1| Uncharacterized protein isoform 4 [Theobroma...   259   1e-66
ref|XP_004291102.1| PREDICTED: protein DCL, chloroplastic-like [...   252   2e-64
gb|AFK33859.1| unknown [Lotus japonicus]                              252   2e-64
ref|XP_002284778.1| PREDICTED: protein DCL, chloroplastic-like i...   251   4e-64
ref|XP_002454603.1| hypothetical protein SORBIDRAFT_04g034170 [S...   250   7e-64
ref|NP_683398.1| uncharacterized protein [Arabidopsis thaliana] ...   249   1e-63
ref|XP_002894020.1| hypothetical protein ARALYDRAFT_473852 [Arab...   249   2e-63
gb|EYU36304.1| hypothetical protein MIMGU_mgv1a013445mg [Mimulus...   248   3e-63
ref|XP_006393639.1| hypothetical protein EUTSA_v10011763mg [Eutr...   248   3e-63
ref|XP_006304196.1| hypothetical protein CARUB_v10010275mg [Caps...   248   3e-63
ref|XP_007225808.1| hypothetical protein PRUPE_ppa011355mg [Prun...   248   3e-63
ref|XP_002510210.1| DCL protein, chloroplast precursor, putative...   248   3e-63
emb|CAD12248.1| DCL protein [Coffea arabica]                          247   8e-63
ref|NP_001031151.1| uncharacterized protein [Arabidopsis thalian...   246   1e-62

>ref|XP_007017215.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590592209|ref|XP_007017216.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590592213|ref|XP_007017217.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590592219|ref|XP_007017219.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508722543|gb|EOY14440.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508722544|gb|EOY14441.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508722545|gb|EOY14442.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508722547|gb|EOY14444.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 215

 Score =  273 bits (698), Expect = 1e-70
 Identities = 139/225 (61%), Positives = 167/225 (74%), Gaps = 3/225 (1%)
 Frame = -1

Query: 1059 MSCISKSPSFLFRTNPTSLHFSASPFYFH---YHIIPLRSQFRALKASSEGINVGNQDGN 889
            M+ + K P + F  N  S+  S+SP            L+ +  AL+  S+G  +G+Q+  
Sbjct: 1    MASVLKPPPY-FHRNCISISSSSSPVILSSPSQRTTSLQVRSCALRTGSDGGRIGSQESY 59

Query: 888  GPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPL 709
            G D+LR+P+I         K  GG SE    +EGS G+ KRG+  W+DWED+ILEDTVPL
Sbjct: 60   GADMLRKPSILT------PKDSGGTSE---QEEGSEGKRKRGK--WIDWEDRILEDTVPL 108

Query: 708  VGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENS 529
            VGFVRMI+HSGKY+SGD+LSPEHEK IL+RLLPYHPE EKKIGCGID+ITVGYHPDFE S
Sbjct: 109  VGFVRMIIHSGKYESGDRLSPEHEKTILDRLLPYHPECEKKIGCGIDYITVGYHPDFEGS 168

Query: 528  RCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            RCLFIVRKDGEL+DFS+WKCIKGLIRKNYPLYADSFILRHFR  +
Sbjct: 169  RCLFIVRKDGELIDFSYWKCIKGLIRKNYPLYADSFILRHFRRRR 213


>ref|XP_002282478.1| PREDICTED: protein DCL, chloroplastic [Vitis vinifera]
            gi|147773590|emb|CAN69898.1| hypothetical protein
            VITISV_032063 [Vitis vinifera]
          Length = 205

 Score =  268 bits (686), Expect = 2e-69
 Identities = 137/209 (65%), Positives = 155/209 (74%)
 Frame = -1

Query: 1020 TNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRRPAISPIYKT 841
            TNP SLH S    +F  H  P      ALK +S+G    +Q+  GPDLLR+P +S     
Sbjct: 18   TNPISLHPSHPILHFATHRTP---SVPALKTASDG----SQEVYGPDLLRKPHVS----- 65

Query: 840  DGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMILHSGKYKSG 661
                        + DDEG    TKR   EWVDWEDQILEDTVPLVGFVRMILHSGKY+SG
Sbjct: 66   ------------APDDEGDRKRTKRKGGEWVDWEDQILEDTVPLVGFVRMILHSGKYRSG 113

Query: 660  DKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGELVDFS 481
            ++LSPEHEK ILERLLPYHP +E+KIG GID+ITVGYHP+FE+SRCLFIVRKDGELVDFS
Sbjct: 114  ERLSPEHEKIILERLLPYHPGYERKIGSGIDYITVGYHPEFESSRCLFIVRKDGELVDFS 173

Query: 480  FWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            +WKCIKG IRKNYPLYADSFILRHFR H+
Sbjct: 174  YWKCIKGFIRKNYPLYADSFILRHFRQHR 202


>gb|EXC32856.1| hypothetical protein L484_009556 [Morus notabilis]
          Length = 233

 Score =  267 bits (683), Expect = 5e-69
 Identities = 139/235 (59%), Positives = 166/235 (70%), Gaps = 13/235 (5%)
 Frame = -1

Query: 1059 MSCISKSPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRA----------LKASSEG-- 916
            M+ IS  P  L   +  +L       + HY  + L   FRA          LK  S+G  
Sbjct: 1    MAFISNPPPLLNNLHRHTLS------HHHYSPVTLSFPFRATTSFPARVGALKTGSDGGG 54

Query: 915  INVGNQDGNGPDLLRRPAISPIYKTDG-SKKKGGLSELSNDDEGSSGETKRGEEEWVDWE 739
              +G+Q+  GPDLLR+P +SP     G S+++  +    N   G  G+    E++WVDWE
Sbjct: 55   SRIGSQELFGPDLLRKPVVSPRKDLAGISEEEKEIERKRNYGGGGGGDDDEEEDKWVDWE 114

Query: 738  DQILEDTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFIT 559
            D+ILEDTVPLVGFVRMILHS KY+SGD+LSPEHEK ILERLLP+HPEFEKKIGCGID+IT
Sbjct: 115  DKILEDTVPLVGFVRMILHSEKYESGDRLSPEHEKTILERLLPFHPEFEKKIGCGIDYIT 174

Query: 558  VGYHPDFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            VGYHPDFE SRCLFIV+KDG+LVDFS+WKCIKGLIRKNYPLYADSFILRHFR  +
Sbjct: 175  VGYHPDFERSRCLFIVQKDGKLVDFSYWKCIKGLIRKNYPLYADSFILRHFRQRR 229


>ref|XP_006434757.1| hypothetical protein CICLE_v10002377mg [Citrus clementina]
            gi|557536879|gb|ESR47997.1| hypothetical protein
            CICLE_v10002377mg [Citrus clementina]
          Length = 227

 Score =  264 bits (675), Expect = 5e-68
 Identities = 134/211 (63%), Positives = 160/211 (75%), Gaps = 5/211 (2%)
 Frame = -1

Query: 1020 TNPTSLHFSASPFYFHYHIIPLRSQFRALKAS----SEGINVGNQDGNGPDLLRRPAISP 853
            +NPTS     SP    +    + S F  L+ S    S+G  +G+Q+  G +LLR+P +SP
Sbjct: 21   SNPTSTSLFLSPVILSFPFYRMTS-FLHLRVSAALRSDGDKIGSQESQGFNLLRKPVVSP 79

Query: 852  IYKT-DGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMILHSG 676
              +  DG+ +K         DEG S +    +E W+DWED+ILEDTVPLVGFVRMILHSG
Sbjct: 80   ASRDLDGNSEK---------DEGESED----DEGWIDWEDKILEDTVPLVGFVRMILHSG 126

Query: 675  KYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGE 496
            +Y+SG +LSPEHE+ ILERLLPYHPEFEKKIGCGID+IT+GYHPDFE+SRCLFIVRKDGE
Sbjct: 127  RYESGVRLSPEHERTILERLLPYHPEFEKKIGCGIDYITIGYHPDFESSRCLFIVRKDGE 186

Query: 495  LVDFSFWKCIKGLIRKNYPLYADSFILRHFR 403
            LVDFS+WKCIKGLIRKNYPLYADSFILRHFR
Sbjct: 187  LVDFSYWKCIKGLIRKNYPLYADSFILRHFR 217


>ref|XP_004141259.1| PREDICTED: protein DCL, chloroplastic-like [Cucumis sativus]
            gi|449519344|ref|XP_004166695.1| PREDICTED: protein DCL,
            chloroplastic-like [Cucumis sativus]
          Length = 225

 Score =  262 bits (669), Expect = 2e-67
 Identities = 137/228 (60%), Positives = 161/228 (70%), Gaps = 5/228 (2%)
 Frame = -1

Query: 1062 AMSCISKSPSF--LFRTNPTSLHFSASPFYFHYHIIPLRS---QFRALKASSEGINVGNQ 898
            AM+ I K P F      NP    F++SP    +   P+ S     RALK   EGI + + 
Sbjct: 2    AMASILKPPPFPPFHSLNPN--FFNSSPLILCFPTHPINSFHPSTRALKTGPEGIRLRSH 59

Query: 897  DGNGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDT 718
                 DLLR+P + P      +K   G SE  +D    SG     E EWVDWED+ILEDT
Sbjct: 60   QEYSSDLLRKP-VGP-----SAKDLAGPSE-DDDSSEESGNEDEEEVEWVDWEDKILEDT 112

Query: 717  VPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDF 538
            VPLVGFVRM+LH+GKY++GD+L PEHEK ILERLLPYHPE EKKIGCG+D+ITVGYHPDF
Sbjct: 113  VPLVGFVRMVLHTGKYENGDRLRPEHEKTILERLLPYHPESEKKIGCGVDYITVGYHPDF 172

Query: 537  ENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            E+SRCLFIVRKDGE+VDFS+WKCIKGLIRKNYPLYA+SFILRHFR  +
Sbjct: 173  ESSRCLFIVRKDGEMVDFSYWKCIKGLIRKNYPLYAESFILRHFRRRR 220


>ref|XP_006473317.1| PREDICTED: protein DCL, chloroplastic-like [Citrus sinensis]
          Length = 227

 Score =  261 bits (668), Expect = 3e-67
 Identities = 133/211 (63%), Positives = 159/211 (75%), Gaps = 5/211 (2%)
 Frame = -1

Query: 1020 TNPTSLHFSASPFYFHYHIIPLRSQFRALKAS----SEGINVGNQDGNGPDLLRRPAISP 853
            +NPTS     SP    +      S F  L+ S    S+G  +G+Q+  G +LLR+P +SP
Sbjct: 21   SNPTSTSLFLSPVILSFPFYRTTS-FLHLRVSAALRSDGDKIGSQESQGFNLLRKPVVSP 79

Query: 852  IYKT-DGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMILHSG 676
              +  DG+ +K         DEG S +    +E W+DWED+ILEDTVPLVGFVRMILHSG
Sbjct: 80   ASRDLDGNSEK---------DEGESED----DEGWIDWEDKILEDTVPLVGFVRMILHSG 126

Query: 675  KYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGE 496
            +Y+SG +LSPEHE+ ILERLLPYHPEF+KKIGCGID+IT+GYHPDFE+SRCLFIVRKDGE
Sbjct: 127  RYESGVRLSPEHERTILERLLPYHPEFKKKIGCGIDYITIGYHPDFESSRCLFIVRKDGE 186

Query: 495  LVDFSFWKCIKGLIRKNYPLYADSFILRHFR 403
            LVDFS+WKCIKGLIRKNYPLYADSFILRHFR
Sbjct: 187  LVDFSYWKCIKGLIRKNYPLYADSFILRHFR 217


>ref|XP_007017218.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508722546|gb|EOY14443.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 209

 Score =  259 bits (662), Expect = 1e-66
 Identities = 135/221 (61%), Positives = 161/221 (72%), Gaps = 3/221 (1%)
 Frame = -1

Query: 1059 MSCISKSPSFLFRTNPTSLHFSASPFYFH---YHIIPLRSQFRALKASSEGINVGNQDGN 889
            M+ + K P + F  N  S+  S+SP            L+ +  AL+  S+G  +G+Q+  
Sbjct: 1    MASVLKPPPY-FHRNCISISSSSSPVILSSPSQRTTSLQVRSCALRTGSDGGRIGSQESY 59

Query: 888  GPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPL 709
            G D+LR+P+I         K  GG SE    +EGS G+ KRG+  W+DWED+ILEDTVPL
Sbjct: 60   GADMLRKPSILT------PKDSGGTSE---QEEGSEGKRKRGK--WIDWEDRILEDTVPL 108

Query: 708  VGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENS 529
            VGFVRMI+HSGKY+SGD+LSPEHEK IL+RLLPYHPE EKKIGCGID+ITVGYHPDFE  
Sbjct: 109  VGFVRMIIHSGKYESGDRLSPEHEKTILDRLLPYHPECEKKIGCGIDYITVGYHPDFEGL 168

Query: 528  RCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHF 406
            RCLFIV KDGELV FS+WKCIKGLIRKNYPLYADSFILR F
Sbjct: 169  RCLFIVWKDGELVVFSYWKCIKGLIRKNYPLYADSFILRQF 209


>ref|XP_004291102.1| PREDICTED: protein DCL, chloroplastic-like [Fragaria vesca subsp.
            vesca]
          Length = 204

 Score =  252 bits (644), Expect = 2e-64
 Identities = 130/224 (58%), Positives = 158/224 (70%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1059 MSCISKSPSFLFRTNPTSLHFSASP--FYFHYHIIPLRSQFRALKASSEGINVGNQDGNG 886
            M+ +SK P  +     +S H    P    F      LR++  ALK  +     G Q+ +G
Sbjct: 1    MASLSKPPLLVHNHLTSSSHALIGPATLSFSKTTSLLRARLCALKTGA-----GRQESDG 55

Query: 885  PDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLV 706
            P+LLR+P +  +   DG+         S +DEG  G+       WVDWED+ILEDTVPLV
Sbjct: 56   PELLRKPVVKDL---DGN---------SEEDEGEDGK-------WVDWEDKILEDTVPLV 96

Query: 705  GFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSR 526
            GFVRMILHSGKY+SGD+LSPEH+K +LERLLP+HPE  KKIGCGID++TVGYHPDFE+SR
Sbjct: 97   GFVRMILHSGKYESGDRLSPEHQKTVLERLLPFHPEAAKKIGCGIDYVTVGYHPDFESSR 156

Query: 525  CLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            CLFIV+KDG LVDFS+WKCIKGLIRKNYPLYADSFILRHFR  +
Sbjct: 157  CLFIVQKDGTLVDFSYWKCIKGLIRKNYPLYADSFILRHFRKRR 200


>gb|AFK33859.1| unknown [Lotus japonicus]
          Length = 224

 Score =  252 bits (644), Expect = 2e-64
 Identities = 130/213 (61%), Positives = 155/213 (72%)
 Frame = -1

Query: 1041 SPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRRPA 862
            +PS  FR+ P  L F    FY+   + PL ++  ALKA++             +LLR+P 
Sbjct: 21   NPSSSFRSPPLILSFR---FYWSASL-PLHTRLSALKAAAAA-----SSDAADNLLRKPL 71

Query: 861  ISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMILH 682
            I+P     G  ++ G +    DDE      +  E++WVDWEDQILEDTVPLVGFVR ILH
Sbjct: 72   ITPRKDPAGVLEEHGYAYEEEDDE------EEEEDKWVDWEDQILEDTVPLVGFVRTILH 125

Query: 681  SGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKD 502
            SG Y+SGD+LSPEHEK ILE+LLP+HPE EKKIGCGID+IT+GYHP F+ SRCLFIVRKD
Sbjct: 126  SGHYESGDRLSPEHEKTILEKLLPFHPESEKKIGCGIDYITIGYHPQFDRSRCLFIVRKD 185

Query: 501  GELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 403
            GELVDFS+WKCIKGLIRKNYPLYADSFILRHFR
Sbjct: 186  GELVDFSYWKCIKGLIRKNYPLYADSFILRHFR 218


>ref|XP_002284778.1| PREDICTED: protein DCL, chloroplastic-like isoform 1 [Vitis vinifera]
          Length = 215

 Score =  251 bits (641), Expect = 4e-64
 Identities = 139/223 (62%), Positives = 156/223 (69%), Gaps = 3/223 (1%)
 Frame = -1

Query: 1062 AMSCISKS--PSFLFRTNPTSLHFSASPFYF-HYHIIPLRSQFRALKASSEGINVGNQDG 892
            AM+ ++ S  P  L R NP SLH S S   F       LR    A K  S G  +G+QD 
Sbjct: 2    AMAYVASSLLPVRLHR-NPISLHLSPSLRSFPSRQTTSLRPVLCARKPRSPGGKLGSQDA 60

Query: 891  NGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVP 712
               D LR+P ISP    DG    GG S      +G      R EEEWVDWEDQILEDTVP
Sbjct: 61   RASDFLRKPTISP--GDDG----GGSSVREKSYKG------REEEEWVDWEDQILEDTVP 108

Query: 711  LVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFEN 532
            LVG+VRMI+HSGKY++GD+LS EHEK +LE+LL YHPE EKKIGCGID+ITVGYHPDFE 
Sbjct: 109  LVGYVRMIIHSGKYENGDRLSLEHEKFVLEKLLAYHPECEKKIGCGIDYITVGYHPDFEG 168

Query: 531  SRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 403
            SRCLFIVR DGELVDFS+WKCIKGLIRK YP YADSFILRHF+
Sbjct: 169  SRCLFIVRNDGELVDFSYWKCIKGLIRKKYPQYADSFILRHFQ 211


>ref|XP_002454603.1| hypothetical protein SORBIDRAFT_04g034170 [Sorghum bicolor]
           gi|241934434|gb|EES07579.1| hypothetical protein
           SORBIDRAFT_04g034170 [Sorghum bicolor]
          Length = 236

 Score =  250 bits (639), Expect = 7e-64
 Identities = 123/199 (61%), Positives = 144/199 (72%), Gaps = 11/199 (5%)
 Frame = -1

Query: 966 IIPLRSQFRALKASSEGINVGNQDGNGPD---LLRRPAISPIYKTDGSKKKG-------- 820
           ++ LR + R    +   +    Q    PD    LRRP +  +  T+  ++          
Sbjct: 33  LLVLRGRGRGHGRAVAAVRAREQGAAPPDPAAFLRRPEVVTVTSTEEERETDAESSFDGP 92

Query: 819 GLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMILHSGKYKSGDKLSPEH 640
           G  E   ++EG  G  K  E EWVDWED ILEDTVPLVGFVRMILHSGKY+SGD+LSPEH
Sbjct: 93  GEDEAPGEEEGVQGRRKATEREWVDWEDLILEDTVPLVGFVRMILHSGKYESGDRLSPEH 152

Query: 639 EKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVRKDGELVDFSFWKCIKG 460
           EKAILERLLPYHPE++KKIGCGID+ITVG HP+FENSRCLFIVRKDGE VDFSFWKC+KG
Sbjct: 153 EKAILERLLPYHPEYDKKIGCGIDYITVGLHPEFENSRCLFIVRKDGEQVDFSFWKCVKG 212

Query: 459 LIRKNYPLYADSFILRHFR 403
           LIR+ YP+YADSFILRHFR
Sbjct: 213 LIRQKYPMYADSFILRHFR 231


>ref|NP_683398.1| uncharacterized protein [Arabidopsis thaliana]
            gi|12321014|gb|AAG50632.1|AC083835_17 defective
            chloroplasts and leaves (DCL) protein, putative
            [Arabidopsis thaliana] gi|22135799|gb|AAM91086.1|
            At1g45261/F2G19.1 [Arabidopsis thaliana]
            gi|48310651|gb|AAT41860.1| At1g45261 [Arabidopsis
            thaliana] gi|62318604|dbj|BAD95026.1| defective
            chloroplasts and leaves (DCL) protein [Arabidopsis
            thaliana] gi|332193992|gb|AEE32113.1| uncharacterized
            protein AT1G45230 [Arabidopsis thaliana]
          Length = 219

 Score =  249 bits (637), Expect = 1e-63
 Identities = 130/229 (56%), Positives = 158/229 (68%), Gaps = 5/229 (2%)
 Frame = -1

Query: 1065 SAMSCISKSP--SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 901
            S  S  S SP  S  FR       FS+SP   ++       LR + RAL+  S+G  +GN
Sbjct: 2    SLASIPSSSPVASPYFRCRTYIFSFSSSPLCLYFPRGDSTSLRPRVRALRTESDGAKIGN 61

Query: 900  QDGNGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILED 721
             +  G +LLRRP I+              SE S+++E    E     +E+VDWED+ILE 
Sbjct: 62   SESYGSELLRRPRIA--------------SEESSEEEEEEEEENSEGDEFVDWEDKILEV 107

Query: 720  TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 541
            TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I VG+HPD
Sbjct: 108  TVPLVGFVRMILHSGKYANRDRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVGHHPD 167

Query: 540  FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR  +
Sbjct: 168  FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 216


>ref|XP_002894020.1| hypothetical protein ARALYDRAFT_473852 [Arabidopsis lyrata subsp.
            lyrata] gi|297339862|gb|EFH70279.1| hypothetical protein
            ARALYDRAFT_473852 [Arabidopsis lyrata subsp. lyrata]
          Length = 218

 Score =  249 bits (636), Expect = 2e-63
 Identities = 129/229 (56%), Positives = 162/229 (70%), Gaps = 6/229 (2%)
 Frame = -1

Query: 1062 AMSCISKSP---SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 901
            +++ IS SP   S  FR       F++SP   ++       L+ + RAL+  S+G  +GN
Sbjct: 2    SLASISSSPPVASPYFRCRAYIFSFASSPLCLYFPRGDSTSLKPRVRALRTESDGARIGN 61

Query: 900  QDGNGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILED 721
             +  G +LLRRP I+              SE S+++E    ET  G+E +VDWED+ILE 
Sbjct: 62   TESYGSELLRRPRIA--------------SEESSEEEEEEEETGEGDE-FVDWEDKILEV 106

Query: 720  TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 541
            TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPEFEKKIGCGID+I V +HPD
Sbjct: 107  TVPLVGFVRMILHSGKYANRDRLSPEHERTIVEMLLPYHPEFEKKIGCGIDYIMVWHHPD 166

Query: 540  FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR  +
Sbjct: 167  FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 215


>gb|EYU36304.1| hypothetical protein MIMGU_mgv1a013445mg [Mimulus guttatus]
          Length = 220

 Score =  248 bits (634), Expect = 3e-63
 Identities = 132/229 (57%), Positives = 156/229 (68%), Gaps = 6/229 (2%)
 Frame = -1

Query: 1062 AMSCISKSPSFL-FRTNPTSLHFSASPFY--FHYHIIPLRSQFRALKASSEGINVGNQDG 892
            A  C+S SP    F   P S +   SPF   F +H  PL     A+K  S+      ++ 
Sbjct: 2    ASICVSNSPDIRSFHRKPISKNLILSPFSLSFPFHKAPLC----AVKTGSDDGGAAARNS 57

Query: 891  NGP---DLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILED 721
              P   DLLR+P  S     +  +    + E S + +G  G+T      WVDWEDQILED
Sbjct: 58   QTPYAADLLRKPLASSPAPVEQEET---VKEYSGEKKGGDGDT------WVDWEDQILED 108

Query: 720  TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 541
            TVPLVGFVRMILHSGKY+SG +LSPEHE+ IL+RLL YHPE EKKIGCG+D+IT+GYHP+
Sbjct: 109  TVPLVGFVRMILHSGKYESGTRLSPEHERTILDRLLAYHPESEKKIGCGVDYITIGYHPN 168

Query: 540  FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            FE SRCLFIVRKDGELVDFS+WKCIKGLIR NYPLYADSFILRHFR  +
Sbjct: 169  FETSRCLFIVRKDGELVDFSYWKCIKGLIRTNYPLYADSFILRHFRRRR 217


>ref|XP_006393639.1| hypothetical protein EUTSA_v10011763mg [Eutrema salsugineum]
            gi|557090217|gb|ESQ30925.1| hypothetical protein
            EUTSA_v10011763mg [Eutrema salsugineum]
          Length = 221

 Score =  248 bits (634), Expect = 3e-63
 Identities = 126/219 (57%), Positives = 153/219 (69%), Gaps = 8/219 (3%)
 Frame = -1

Query: 1026 FRTNPTSLHFSASPFYFHYH-------IIPLRSQFRALKASSEGINVGNQDGNGPDLLRR 868
            FR       FS SP   ++         + LR + RAL+  S+G  +GN +  G DLLRR
Sbjct: 17   FRCRAYIFSFSPSPLCLYFPRGRGDSASLTLRPKIRALRTESDGARIGNTESYGSDLLRR 76

Query: 867  PAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGE-EEWVDWEDQILEDTVPLVGFVRM 691
            P IS                 S ++E S  E + GE +E+VDWED+ILE TVPLVGFVRM
Sbjct: 77   PRIS-----------------SEEEESSGEEDESGEGDEFVDWEDKILEVTVPLVGFVRM 119

Query: 690  ILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIV 511
            ILHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I VG+HP+FE+SRC+FIV
Sbjct: 120  ILHSGKYANRDRLSPEHERTIIEMLLPYHPEVEKKIGCGIDYIMVGHHPEFESSRCMFIV 179

Query: 510  RKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            RKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR  +
Sbjct: 180  RKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 218


>ref|XP_006304196.1| hypothetical protein CARUB_v10010275mg [Capsella rubella]
            gi|482572907|gb|EOA37094.1| hypothetical protein
            CARUB_v10010275mg [Capsella rubella]
          Length = 216

 Score =  248 bits (634), Expect = 3e-63
 Identities = 130/230 (56%), Positives = 162/230 (70%), Gaps = 7/230 (3%)
 Frame = -1

Query: 1062 AMSCISKSPSFLFRTNPTSLHFSAS----PFYFHY---HIIPLRSQFRALKASSEGINVG 904
            +++ +S SP   FR       FS+S    P   ++     I LR + RAL+  S+G  +G
Sbjct: 2    SLASVSCSPPPCFRCGAYIFSFSSSSSSSPLCLYFPRGDSISLRPRVRALRTESDGARIG 61

Query: 903  NQDGNGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILE 724
            N +  G +LLRRP IS              S  S+++E  SGE     +E+VDWED+ILE
Sbjct: 62   NTESYGSELLRRPHIS--------------SGESSEEEEESGEG----DEFVDWEDKILE 103

Query: 723  DTVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHP 544
             TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPEFEKKIGCGID+I VG+HP
Sbjct: 104  VTVPLVGFVRMILHSGKYANQDRLSPEHERMIVEMLLPYHPEFEKKIGCGIDYIMVGHHP 163

Query: 543  DFENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            DFE+SRC+FIVR+DGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR  +
Sbjct: 164  DFESSRCMFIVRRDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 213


>ref|XP_007225808.1| hypothetical protein PRUPE_ppa011355mg [Prunus persica]
            gi|462422744|gb|EMJ27007.1| hypothetical protein
            PRUPE_ppa011355mg [Prunus persica]
          Length = 214

 Score =  248 bits (634), Expect = 3e-63
 Identities = 132/228 (57%), Positives = 160/228 (70%), Gaps = 6/228 (2%)
 Frame = -1

Query: 1059 MSCISKSPSFLFRTNPTSLHFSASPFYFHYHIIP---LRSQFRALKASSEG-INVGNQDG 892
            M+ +S+SP  L      SL  +  P    +  +    L+++  ALK  ++G    G    
Sbjct: 1    MASLSRSPP-LVHHQINSLTLNPCPVTLSFPFLKTTALQARLCALKTGADGGSRTGRPGT 59

Query: 891  NGPD--LLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDT 718
             GPD  LLR+P +S     DG         +S++DEG  G+       WVDWED+ILEDT
Sbjct: 60   QGPDPGLLRKPVVSSGKDMDG---------ISDEDEGEDGK-------WVDWEDKILEDT 103

Query: 717  VPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDF 538
            VPLVGFVRMILHSGKY+SGD+LSPEHEK +LERLLP+HPE +KKIG GID+ITVGYHPDF
Sbjct: 104  VPLVGFVRMILHSGKYESGDRLSPEHEKTVLERLLPFHPEAQKKIGSGIDYITVGYHPDF 163

Query: 537  ENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            E+SRCLFIV+KDG LVDFS+WKCIKGLIRKNYPLYADSFILRHFR  +
Sbjct: 164  ESSRCLFIVQKDGTLVDFSYWKCIKGLIRKNYPLYADSFILRHFRKRR 211


>ref|XP_002510210.1| DCL protein, chloroplast precursor, putative [Ricinus communis]
            gi|223550911|gb|EEF52397.1| DCL protein, chloroplast
            precursor, putative [Ricinus communis]
          Length = 214

 Score =  248 bits (633), Expect = 3e-63
 Identities = 131/226 (57%), Positives = 157/226 (69%), Gaps = 4/226 (1%)
 Frame = -1

Query: 1059 MSCISKSPSFLFR--TNPTSLHFSAS--PFYFHYHIIPLRSQFRALKASSEGINVGNQDG 892
            M+ +SK P  L    +NP SL+FS     F FH       S+  ALK  S+G        
Sbjct: 3    MASLSKPPPCLHGHYSNPISLYFSPVILSFPFHRTTTSFNSRIFALKTGSDG-------- 54

Query: 891  NGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVP 712
               DLLR+P +         K+  G+S+   D         + ++E VDWEDQILEDTVP
Sbjct: 55   --SDLLRKPIVP------SEKELSGISDDEEDSNRKRDNKDKEDDELVDWEDQILEDTVP 106

Query: 711  LVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFEN 532
            LVGFVRMILHSGKY++GD+LSPEHE+ I+ERLLP+HPE EKKIG GID+ITVG+H +FEN
Sbjct: 107  LVGFVRMILHSGKYENGDRLSPEHERTIVERLLPFHPECEKKIGPGIDYITVGHHTEFEN 166

Query: 531  SRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            SRCLFIVRKDG+LVDFS+WKCIKGLIRKNYPLYADSFILRHFR  +
Sbjct: 167  SRCLFIVRKDGKLVDFSYWKCIKGLIRKNYPLYADSFILRHFRRRR 212


>emb|CAD12248.1| DCL protein [Coffea arabica]
          Length = 224

 Score =  247 bits (630), Expect = 8e-63
 Identities = 128/215 (59%), Positives = 152/215 (70%)
 Frame = -1

Query: 1047 SKSPSFLFRTNPTSLHFSASPFYFHYHIIPLRSQFRALKASSEGINVGNQDGNGPDLLRR 868
            S S S     +P SL F   P    Y+   LR        SSEG      D  G +LLR+
Sbjct: 18   SNSISSNLLLSPPSLSFQLYP----YNRSQLRRYAAVRTTSSEG---RGSDSFGAELLRK 70

Query: 867  PAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILEDTVPLVGFVRMI 688
            P +SP   ++G       S +  DD+  SG  +   E WVDWEDQIL+DTVPLV FVRMI
Sbjct: 71   PVVSPAVVSEGED-----SVVEEDDKYRSGGEE--VEAWVDWEDQILQDTVPLVNFVRMI 123

Query: 687  LHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPDFENSRCLFIVR 508
            LHSGKY+SGD+LSPEHE+ ILER+LPYHP+ EKKIG G+D+IT+GYHPDF+ SRCLFIVR
Sbjct: 124  LHSGKYESGDRLSPEHERTILERVLPYHPQCEKKIGSGVDYITIGYHPDFDRSRCLFIVR 183

Query: 507  KDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFR 403
            KDGELVDFS+WKCIKGLIRKNYPLYAD+FI+RHF+
Sbjct: 184  KDGELVDFSYWKCIKGLIRKNYPLYADTFIIRHFK 218


>ref|NP_001031151.1| uncharacterized protein [Arabidopsis thaliana]
            gi|332193993|gb|AEE32114.1| uncharacterized protein
            AT1G45230 [Arabidopsis thaliana]
          Length = 219

 Score =  246 bits (629), Expect = 1e-62
 Identities = 129/229 (56%), Positives = 157/229 (68%), Gaps = 5/229 (2%)
 Frame = -1

Query: 1065 SAMSCISKSP--SFLFRTNPTSLHFSASPFYFHY---HIIPLRSQFRALKASSEGINVGN 901
            S  S  S SP  S  FR       FS+SP   ++       LR + RAL+  S+G  +GN
Sbjct: 2    SLASIPSSSPVASPYFRCRTYIFSFSSSPLCLYFPRGDSTSLRPRVRALRTESDGAKIGN 61

Query: 900  QDGNGPDLLRRPAISPIYKTDGSKKKGGLSELSNDDEGSSGETKRGEEEWVDWEDQILED 721
             +  G +LLRRP I+              SE S+++E    E     +E+VDWED+ILE 
Sbjct: 62   SESYGSELLRRPRIA--------------SEESSEEEEEEEEENSEGDEFVDWEDKILEV 107

Query: 720  TVPLVGFVRMILHSGKYKSGDKLSPEHEKAILERLLPYHPEFEKKIGCGIDFITVGYHPD 541
            TVPLVGFVRMILHSGKY + D+LSPEHE+ I+E LLPYHPE EKKIGCGID+I V +HPD
Sbjct: 108  TVPLVGFVRMILHSGKYANRDRLSPEHERTIIEMLLPYHPECEKKIGCGIDYIMVWHHPD 167

Query: 540  FENSRCLFIVRKDGELVDFSFWKCIKGLIRKNYPLYADSFILRHFRMHK 394
            FE+SRC+FIVRKDGE+VDFS+WKCIKGLI+K YPLYADSFILRHFR  +
Sbjct: 168  FESSRCMFIVRKDGEVVDFSYWKCIKGLIKKKYPLYADSFILRHFRKRR 216

