BLASTX nr result

ID: Cornus23_contig00018087

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00018087
         (589 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_005645057.1| light-harvesting complex I protein [Coccomyx...   243   5e-62
ref|XP_002957416.1| light-harvesting protein of photosystem I [V...   205   1e-50
ref|XP_002945875.1| light harvesting complex a protein [Volvox c...   197   3e-48
dbj|BAD06924.1| light-harvesting chlorophyll-a/b protein of phot...   197   4e-48
ref|XP_001691959.1| light-harvesting protein of photosystem I [C...   197   4e-48
gb|AAO16495.1| light-harvesting complex I protein [Chlamydomonas...   197   4e-48
gb|ABA01130.1| chloroplast light harvesting complex I protein [C...   196   9e-48
tpg|DAA05901.1| TPA_inf: chloroplast light-harvesting complex I ...   189   9e-46
ref|XP_013892797.1| hypothetical protein MNEG_14186 [Monoraphidi...   184   4e-44
ref|XP_001696202.1| light-harvesting protein of photosystem I [C...   183   6e-44
ref|XP_013901241.1| Chlorophyll a-b binding protein 7, chloropla...   183   6e-44
gb|ABD37906.1| light-harvesting chlorophyll-a/b binding protein ...   179   1e-42
emb|CEF99235.1| Chlorophyll A-B binding protein, plant [Ostreoco...   176   8e-42
ref|XP_003081427.1| PSI-associated light-harvesting chlorophyll ...   176   8e-42
ref|XP_011397429.1| Chlorophyll a-b binding protein 7, chloropla...   174   3e-41
ref|XP_012085680.1| PREDICTED: chlorophyll a-b binding protein 7...   173   7e-41
gb|KFK36161.1| hypothetical protein AALP_AA4G086000 [Arabis alpina]   172   1e-40
ref|XP_006393637.1| hypothetical protein EUTSA_v10011725mg [Eutr...   172   1e-40
ref|XP_010538121.1| PREDICTED: chlorophyll a-b binding protein 4...   171   2e-40
tpg|DAA05902.1| TPA_inf: chloroplast light-harvesting complex I ...   171   2e-40

>ref|XP_005645057.1| light-harvesting complex I protein [Coccomyxa subellipsoidea C-169]
           gi|384247025|gb|EIE20513.1| light-harvesting complex I
           protein [Coccomyxa subellipsoidea C-169]
          Length = 259

 Score =  243 bits (620), Expect = 5e-62
 Identities = 114/172 (66%), Positives = 135/172 (78%), Gaps = 2/172 (1%)
 Frame = +3

Query: 78  AAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVA 257
           +AADRQ+W+PG    P +LDGSLPGD+GFDPL LGSD +LL+WF QAELVHGRTAM  VA
Sbjct: 41  SAADRQVWFPGNAPAP-HLDGSLPGDFGFDPLSLGSDPQLLKWFQQAELVHGRTAMTAVA 99

Query: 258 GILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNP 437
           GIL P +ATK G++N+P WYDAG VW++NN  FPF +LLF Q+ILTGWVE KRW D+KNP
Sbjct: 100 GILFPAVATKAGVVNIPQWYDAGSVWVQNNPNFPFAALLFIQIILTGWVETKRWLDFKNP 159

Query: 438 GSQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYEEYKVKEIKNG 587
           GSQ DGSF G+TDDF    NGYPGG LFDPF L+    G  ++Y+  EIKNG
Sbjct: 160 GSQADGSFLGVTDDFKGVANGYPGGKLFDPFGLSRGSEGQLQKYQENEIKNG 211


>ref|XP_002957416.1| light-harvesting protein of photosystem I [Volvox carteri f.
           nagariensis] gi|300257220|gb|EFJ41471.1|
           light-harvesting protein of photosystem I [Volvox
           carteri f. nagariensis]
          Length = 241

 Score =  205 bits (522), Expect = 1e-50
 Identities = 104/171 (60%), Positives = 121/171 (70%), Gaps = 2/171 (1%)
 Frame = +3

Query: 81  AADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVAG 260
           AA R +W+PG    P +LDGSL GDYGFDPL LG + + L+W+VQAELVHGR AM+G AG
Sbjct: 27  AATRPVWFPG-NPPPAHLDGSLAGDYGFDPLFLGQEPQTLKWYVQAELVHGRFAMLGAAG 85

Query: 261 ILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNPG 440
           I+L  +  K+G L  P WYDAGKV +E N    F +LL TQL L GW E KRW D+KNPG
Sbjct: 86  IILTSIGAKVG-LGFPEWYDAGKVVVEKNN-IDFPTLLITQLYLMGWAETKRWYDFKNPG 143

Query: 441 SQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYEEYKVKEIKNG 587
           SQGDGSF G TD+F   ENGYPGG  FDP  L+  D   Y+EYK KEIKNG
Sbjct: 144 SQGDGSFLGFTDEFKGLENGYPGGRFFDPMGLSRGDADKYKEYKQKEIKNG 194


>ref|XP_002945875.1| light harvesting complex a protein [Volvox carteri f. nagariensis]
           gi|5902596|gb|AAD55568.1|AF110786_1 light harvesting
           complex a protein [Volvox carteri f. nagariensis]
           gi|300268690|gb|EFJ52870.1| light harvesting complex a
           protein [Volvox carteri f. nagariensis]
          Length = 243

 Score =  197 bits (501), Expect = 3e-48
 Identities = 108/195 (55%), Positives = 133/195 (68%), Gaps = 7/195 (3%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGV-SCKAAADRQLWYPGIQKVPDYLDG----SLPGDYGFDPLRLGSD 188
           R A A+R  V RK V +C A   RQ W PG  ++P +LD     ++ G++GFDPL LG D
Sbjct: 7   RSAVAARPAVSRKAVVTCVA---RQSWLPG-SEIPKHLDSPAALAMAGNFGFDPLGLGKD 62

Query: 189 GELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWS 368
            E LRW+ QAELVH RTAM  VAGIL+PGL TK G LN+P WYDAGKV IE++ + PF +
Sbjct: 63  AEALRWYQQAELVHARTAMTAVAGILIPGLLTKAGALNVPEWYDAGKVAIESS-FAPFGA 121

Query: 369 LLFTQLILTGWVEAKRWADYKNPGSQGD-GSFFGLTDDFVAK-ENGYPGGLFDPFNLAAD 542
           LL  QL LTG+VEAKRW D+K PGSQ + GSF G    F    +NGYPGG+FDP  L+ D
Sbjct: 122 LLAVQLFLTGFVEAKRWQDFKKPGSQAEKGSFLGFETAFAGTGDNGYPGGIFDPLGLSKD 181

Query: 543 PGNYEEYKVKEIKNG 587
                ++K+KEIKNG
Sbjct: 182 ADKLADWKLKEIKNG 196


>dbj|BAD06924.1| light-harvesting chlorophyll-a/b protein of photosystem I
           [Chlamydomonas reinhardtii]
          Length = 241

 Score =  197 bits (500), Expect = 4e-48
 Identities = 103/190 (54%), Positives = 126/190 (66%), Gaps = 2/190 (1%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLR 203
           R A A  A+   + V  +AA  R +W+PG    P +LDGSL GDYGFDPL LG + + L+
Sbjct: 9   RRAGAFSARQAPRAVRAQAAV-RPVWFPG-NPPPAHLDGSLAGDYGFDPLFLGQEPQTLK 66

Query: 204 WFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQ 383
           W+VQAELVHGR AM+G AGI+L  +  K+G L  P WYDAGKV +E N    F +L+  Q
Sbjct: 67  WYVQAELVHGRFAMLGAAGIILTSIGAKVG-LGFPEWYDAGKVVVEKNN-IDFPTLMVIQ 124

Query: 384 LILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYE 557
             L GW E KRW D+KNPGSQ DGSF G T++F   ENGYPGG  FDP  L+  D   Y+
Sbjct: 125 FYLMGWAETKRWYDFKNPGSQADGSFLGFTEEFKGLENGYPGGRFFDPMGLSRGDAAKYQ 184

Query: 558 EYKVKEIKNG 587
           EYK KE+KNG
Sbjct: 185 EYKQKEVKNG 194


>ref|XP_001691959.1| light-harvesting protein of photosystem I [Chlamydomonas
           reinhardtii] gi|158278686|gb|EDP04449.1|
           light-harvesting protein of photosystem I [Chlamydomonas
           reinhardtii]
          Length = 288

 Score =  197 bits (500), Expect = 4e-48
 Identities = 103/190 (54%), Positives = 126/190 (66%), Gaps = 2/190 (1%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLR 203
           R A A  A+   + V  +AA  R +W+PG    P +LDGSL GDYGFDPL LG + + L+
Sbjct: 9   RRAGAFSARQAPRAVRAQAAV-RPVWFPG-NPPPAHLDGSLAGDYGFDPLFLGQEPQTLK 66

Query: 204 WFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQ 383
           W+VQAELVHGR AM+G AGI+L  +  K+G L  P WYDAGKV +E N    F +L+  Q
Sbjct: 67  WYVQAELVHGRFAMLGAAGIILTSIGAKVG-LGFPEWYDAGKVVVEKNN-IDFPTLMVIQ 124

Query: 384 LILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYE 557
             L GW E KRW D+KNPGSQ DGSF G T++F   ENGYPGG  FDP  L+  D   Y+
Sbjct: 125 FYLMGWAETKRWYDFKNPGSQADGSFLGFTEEFKGLENGYPGGRFFDPMGLSRGDAAKYQ 184

Query: 558 EYKVKEIKNG 587
           EYK KE+KNG
Sbjct: 185 EYKQKEVKNG 194


>gb|AAO16495.1| light-harvesting complex I protein [Chlamydomonas reinhardtii]
          Length = 241

 Score =  197 bits (500), Expect = 4e-48
 Identities = 103/190 (54%), Positives = 126/190 (66%), Gaps = 2/190 (1%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLR 203
           R A A  A+   + V  +AA  R +W+PG    P +LDGSL GDYGFDPL LG + + L+
Sbjct: 9   RRAGAFSARQAPRAVRAQAAV-RPVWFPG-NPPPAHLDGSLAGDYGFDPLFLGQEPQTLK 66

Query: 204 WFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQ 383
           W+VQAELVHGR AM+G AGI+L  +  K+G L  P WYDAGKV +E N    F +L+  Q
Sbjct: 67  WYVQAELVHGRFAMLGAAGIILTSIGAKVG-LGFPEWYDAGKVVVEKNN-IDFPTLMVIQ 124

Query: 384 LILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYE 557
             L GW E KRW D+KNPGSQ DGSF G T++F   ENGYPGG  FDP  L+  D   Y+
Sbjct: 125 FYLMGWAETKRWYDFKNPGSQADGSFLGFTEEFKGLENGYPGGRFFDPMGLSRGDAAKYQ 184

Query: 558 EYKVKEIKNG 587
           EYK KE+KNG
Sbjct: 185 EYKQKEVKNG 194


>gb|ABA01130.1| chloroplast light harvesting complex I protein [Chlamydomonas
           incerta]
          Length = 241

 Score =  196 bits (497), Expect = 9e-48
 Identities = 98/171 (57%), Positives = 116/171 (67%), Gaps = 2/171 (1%)
 Frame = +3

Query: 81  AADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVAG 260
           A  R +W+PG    P +LDG+L GDYGFDPL LG + E LRW+VQAELVHGR AM+G AG
Sbjct: 27  ALTRPVWFPG-NPAPAHLDGTLAGDYGFDPLFLGQEKETLRWYVQAELVHGRFAMLGAAG 85

Query: 261 ILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNPG 440
           I+L  +  K+G L  P WYDAGKV +E N    F +L+  Q  L GW E KRW D+KNPG
Sbjct: 86  IILTSIGAKVG-LGFPEWYDAGKVVVEKNN-IDFPTLIVIQFYLMGWAETKRWYDFKNPG 143

Query: 441 SQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYEEYKVKEIKNG 587
           SQ DGSF G T++F   ENGYPGG  FDP  L+  D   Y EYK KE+KNG
Sbjct: 144 SQADGSFLGFTEEFKGLENGYPGGRFFDPMGLSRGDAAKYAEYKQKEVKNG 194


>tpg|DAA05901.1| TPA_inf: chloroplast light-harvesting complex I protein precursor
           Lhca6 [Acetabularia acetabulum]
          Length = 201

 Score =  189 bits (480), Expect = 9e-46
 Identities = 98/168 (58%), Positives = 118/168 (70%), Gaps = 4/168 (2%)
 Frame = +3

Query: 96  LWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVAGILLPG 275
           LW+PG    P +LDGSLP DYGFDPL LGSD ++L W  QAEL+H R AM+GVAGIL+P 
Sbjct: 1   LWFPG-DTPPAHLDGSLPADYGFDPLSLGSDPDMLAWMRQAELMHCRWAMMGVAGILVPA 59

Query: 276 LATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNPGSQGDG 455
           L TKLG +N+P W++AGKV   +N   PF +LL  + +  G+VE KRW D K+PGSQGDG
Sbjct: 60  LLTKLGAMNVPVWFEAGKV-ANDNSSVPFSALLMVEFLAMGFVETKRWYDIKSPGSQGDG 118

Query: 456 SFFGLTDDFVAKENGYPGG-LFDPFNLAADPGNYEEYK---VKEIKNG 587
           S FG+TDDF  K  GYPGG  FDPF  +   G+ E YK   VKEI NG
Sbjct: 119 SIFGITDDFKGKSVGYPGGTFFDPFGFS--KGSEESYKTLQVKEIANG 164


>ref|XP_013892797.1| hypothetical protein MNEG_14186 [Monoraphidium neglectum]
           gi|761959801|gb|KIY93777.1| hypothetical protein
           MNEG_14186 [Monoraphidium neglectum]
          Length = 246

 Score =  184 bits (466), Expect = 4e-44
 Identities = 94/188 (50%), Positives = 121/188 (64%), Gaps = 2/188 (1%)
 Frame = +3

Query: 30  APASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWF 209
           A A R    R+ V  +AAA R+ W PG+   P +L G L GD+GFDPL LG D   LRW+
Sbjct: 9   ARAVRGAASRRSVRVEAAAARRSWAPGVA-APAHLTGELSGDFGFDPLNLGKDPAALRWY 67

Query: 210 VQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLI 389
           VQ+ELVHGRTAM  VAGIL+PG+ TK G+LN+P WYDA    +  N   P  +L   +L 
Sbjct: 68  VQSELVHGRTAMAAVAGILIPGILTKAGVLNVPEWYDATDAALAANG-IPVKALFMVELF 126

Query: 390 LTGWVEAKRWADYKNPGSQGD-GSFFGLTDDFVAKENGYPGGLFDPFNLAAD-PGNYEEY 563
           L G+VEAKRW D+  PGSQG+ GSF G        +NGYPGG+FDP  L  + P   +++
Sbjct: 127 LCGFVEAKRWVDFVKPGSQGEPGSFLGFESSLKGVKNGYPGGVFDPLGLTKESPEKTKDW 186

Query: 564 KVKEIKNG 587
           + KE++NG
Sbjct: 187 EEKELRNG 194


>ref|XP_001696202.1| light-harvesting protein of photosystem I [Chlamydomonas
           reinhardtii] gi|40714515|dbj|BAD06921.1|
           light-harvesting chlorophyll-a/b protein of photosystem
           I [Chlamydomonas reinhardtii]
           gi|158282427|gb|EDP08179.1| light-harvesting protein of
           photosystem I [Chlamydomonas reinhardtii]
          Length = 243

 Score =  183 bits (464), Expect = 6e-44
 Identities = 104/195 (53%), Positives = 129/195 (66%), Gaps = 7/195 (3%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGV-SCKAAADRQLWYPGIQKVPDYLDG----SLPGDYGFDPLRLGSD 188
           R   A+R+   RK V +C A   RQ W PG Q +P +LD     +L G++GFDPL LG D
Sbjct: 7   RSGVAARSASSRKSVVTCVA---RQSWLPGSQ-IPAHLDTPAAQALAGNFGFDPLGLGKD 62

Query: 189 GELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWS 368
              LRW+ QAEL+H RTAM GVAGIL+PGL TK G LN+P WYDAGKV IEN+ + P+ S
Sbjct: 63  PVALRWYQQAELIHCRTAMAGVAGILIPGLLTKAGALNVPEWYDAGKVAIENS-FAPWGS 121

Query: 369 LLFTQLILTGWVEAKRWADYKNPGSQGD-GSFFGLTDDFV-AKENGYPGGLFDPFNLAAD 542
           LL  QL L G+VEAKRW D + PGSQG+ GSF G         E GYPGG FDP  L+ +
Sbjct: 122 LLAVQLFLCGFVEAKRWQDIRKPGSQGEPGSFLGFEASLKGTSELGYPGGPFDPLGLSKE 181

Query: 543 PGNYEEYKVKEIKNG 587
              + ++K+KE+KNG
Sbjct: 182 ADKWADWKLKEVKNG 196


>ref|XP_013901241.1| Chlorophyll a-b binding protein 7, chloroplastic [Monoraphidium
           neglectum] gi|761971682|gb|KIZ02222.1| Chlorophyll a-b
           binding protein 7, chloroplastic [Monoraphidium
           neglectum]
          Length = 237

 Score =  183 bits (464), Expect = 6e-44
 Identities = 97/151 (64%), Positives = 109/151 (72%), Gaps = 2/151 (1%)
 Frame = +3

Query: 141 SLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYD 320
           SL GDYGFDPL+L  D    RW VQAEL +GR AM+GVAGIL   L  + G L+LP WYD
Sbjct: 41  SLAGDYGFDPLKLSEDPLTRRWMVQAELQNGRWAMLGVAGILFTALGAEAG-LDLPQWYD 99

Query: 321 AGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENG 500
           AGKV I N+  F F +LL  Q +L GWVE+KR AD+ NPGSQGDGSFFG+TDDF  KENG
Sbjct: 100 AGKVSIANSP-FSFQTLLGVQFLLFGWVESKRLADFLNPGSQGDGSFFGITDDFKGKENG 158

Query: 501 YPGG-LFDPFNLA-ADPGNYEEYKVKEIKNG 587
           YPGG  FDPF L+  D   Y EYK KEIKNG
Sbjct: 159 YPGGKYFDPFGLSRGDAAKYAEYKQKEIKNG 189


>gb|ABD37906.1| light-harvesting chlorophyll-a/b binding protein Lhca4
           [Chlamydomonas incerta]
          Length = 237

 Score =  179 bits (453), Expect = 1e-42
 Identities = 101/195 (51%), Positives = 127/195 (65%), Gaps = 7/195 (3%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGV-SCKAAADRQLWYPGIQKVPDYLDG----SLPGDYGFDPLRLGSD 188
           R   A+R+   RK V +C A   RQ W PG Q +P +LD     +L G++GFDPL LG D
Sbjct: 1   RSGVAARSASSRKSVVTCVA---RQSWLPGSQ-IPAHLDTPSAQALAGNFGFDPLGLGKD 56

Query: 189 GELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWS 368
              L W+ QAEL+H RTAM GVAGIL+PGL TK G LN+P WYDAGKV IEN+ + P+ +
Sbjct: 57  PVALAWYQQAELIHCRTAMTGVAGILIPGLLTKAGALNVPEWYDAGKVAIENS-FAPWGT 115

Query: 369 LLFTQLILTGWVEAKRWADYKNPGSQGD-GSFFGLTDDFV-AKENGYPGGLFDPFNLAAD 542
           LL  QL L G+VE KRW D + PGSQG+ GSF G         E GYPGG FDP  L+ +
Sbjct: 116 LLAVQLFLCGFVEVKRWQDIRKPGSQGEPGSFLGFESSLKGTSEVGYPGGPFDPLGLSKE 175

Query: 543 PGNYEEYKVKEIKNG 587
              + ++K+KE+KNG
Sbjct: 176 ADKWADWKLKEVKNG 190


>emb|CEF99235.1| Chlorophyll A-B binding protein, plant [Ostreococcus tauri]
          Length = 239

 Score =  176 bits (446), Expect = 8e-42
 Identities = 90/184 (48%), Positives = 112/184 (60%)
 Frame = +3

Query: 36  ASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQ 215
           ASR  V+   VS +A A+R +WYPG    P +LDGSLPGD+GFDPL L +D E+ +W VQ
Sbjct: 21  ASRRSVV---VSAEAGAERPVWYPGKAPAP-HLDGSLPGDFGFDPLSLSADPEMRKWMVQ 76

Query: 216 AELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILT 395
           AEL H R AM+GVAG + P L TK+G+ +LP W DAG        + P   L F Q+ + 
Sbjct: 77  AELQHARWAMLGVAGAVAPELLTKIGVADLPNWVDAGTY----QYWAPAGPLFFIQMAMF 132

Query: 396 GWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGGLFDPFNLAADPGNYEEYKVKE 575
            W E +RW D KNPGS      FG   +    + GYPGGLFD    A DP   +E K+KE
Sbjct: 133 NWAEVRRWQDMKNPGSMNTDPLFGYNSNDTNTDVGYPGGLFDKLGYAKDPAKAKELKLKE 192

Query: 576 IKNG 587
           IKNG
Sbjct: 193 IKNG 196


>ref|XP_003081427.1| PSI-associated light-harvesting chlorophyll a/b binding protein,
           (IC) [Ostreococcus tauri] gi|63029293|gb|AAY27545.1|
           chloroplast light-harvesting complex I protein precursor
           Lhca2 [Ostreococcus tauri]
          Length = 242

 Score =  176 bits (446), Expect = 8e-42
 Identities = 90/184 (48%), Positives = 112/184 (60%)
 Frame = +3

Query: 36  ASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQ 215
           ASR  V+   VS +A A+R +WYPG    P +LDGSLPGD+GFDPL L +D E+ +W VQ
Sbjct: 24  ASRRSVV---VSAEAGAERPVWYPGKAPAP-HLDGSLPGDFGFDPLSLSADPEMRKWMVQ 79

Query: 216 AELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILT 395
           AEL H R AM+GVAG + P L TK+G+ +LP W DAG        + P   L F Q+ + 
Sbjct: 80  AELQHARWAMLGVAGAVAPELLTKIGVADLPNWVDAGTY----QYWAPAGPLFFIQMAMF 135

Query: 396 GWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGGLFDPFNLAADPGNYEEYKVKE 575
            W E +RW D KNPGS      FG   +    + GYPGGLFD    A DP   +E K+KE
Sbjct: 136 NWAEVRRWQDMKNPGSMNTDPLFGYNSNDTNTDVGYPGGLFDKLGYAKDPAKAKELKLKE 195

Query: 576 IKNG 587
           IKNG
Sbjct: 196 IKNG 199


>ref|XP_011397429.1| Chlorophyll a-b binding protein 7, chloroplastic [Auxenochlorella
           protothecoides] gi|675352101|gb|KFM24541.1| Chlorophyll
           a-b binding protein 7, chloroplastic [Auxenochlorella
           protothecoides]
          Length = 691

 Score =  174 bits (441), Expect = 3e-41
 Identities = 90/174 (51%), Positives = 114/174 (65%), Gaps = 3/174 (1%)
 Frame = +3

Query: 75  KAAADRQLWYPGIQ-KVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIG 251
           K   DR +W+PG   ++PDYLDG+L GDYGFDPL LGS+ E LRW VQ+E+ H RTAM  
Sbjct: 51  KKVVDRPIWFPGNDPEIPDYLDGTLAGDYGFDPLGLGSEPEQLRWNVQSEVFHARTAMTA 110

Query: 252 VAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYK 431
           VAGIL   +        +P WYDAGKV++E N      +L++T + L+G+VE KR AD++
Sbjct: 111 VAGILAVSILHGANPA-VPQWYDAGKVYLEQNPNVSLGALIWTTIALSGFVEFKRLADWR 169

Query: 432 NPGSQGDGSFFGLTDDFVAKENGYPGG-LFDPFNLA-ADPGNYEEYKVKEIKNG 587
            PGSQ D  F G+  +F  KENGYPGG  FDP   +      Y+EYK KEIKNG
Sbjct: 170 KPGSQADQWFLGIQKEFKGKENGYPGGAFFDPLGYSRGSEAKYQEYKWKEIKNG 223


>ref|XP_012085680.1| PREDICTED: chlorophyll a-b binding protein 7, chloroplastic
           [Jatropha curcas] gi|643714139|gb|KDP26804.1|
           hypothetical protein JCGZ_17962 [Jatropha curcas]
          Length = 270

 Score =  173 bits (438), Expect = 7e-41
 Identities = 99/198 (50%), Positives = 122/198 (61%), Gaps = 5/198 (2%)
 Frame = +3

Query: 9   SGSNVRVAPASRAQVIRKGVSCKAAAD--RQLWYPGIQKVPDYLDGSLPGDYGFDPLRLG 182
           SG  +R+   + + V+ + V+  AAAD  R LW+PG    P +LDGSLPGD+GFDPL LG
Sbjct: 36  SGKKLRLRSNTSSPVVSRSVTVCAAADPDRPLWFPG-STPPPWLDGSLPGDFGFDPLGLG 94

Query: 183 SDGELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPF 362
           SD E LRW VQAELVH R AM+G AGI +P   TK+GILN P WYDAGK+     +YF  
Sbjct: 95  SDPETLRWNVQAELVHCRWAMLGAAGIFIPEFLTKIGILNTPSWYDAGKL-----EYFTD 149

Query: 363 WSLLF-TQLILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAKENGYPGGL-FDPFNL- 533
            + LF  +LIL GW E +RWAD   PGS      F   +     + GYPGGL FDP    
Sbjct: 150 TTTLFIIELILIGWAEGRRWADILKPGSVNTDPIFP-NNKLTGTDVGYPGGLWFDPLGWG 208

Query: 534 AADPGNYEEYKVKEIKNG 587
           +  P   +E + KEIKNG
Sbjct: 209 SGSPEKIKELRTKEIKNG 226


>gb|KFK36161.1| hypothetical protein AALP_AA4G086000 [Arabis alpina]
          Length = 257

 Score =  172 bits (435), Expect = 1e-40
 Identities = 92/172 (53%), Positives = 112/172 (65%), Gaps = 3/172 (1%)
 Frame = +3

Query: 81  AADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLRWFVQAELVHGRTAMIGVAG 260
           A +R  W PG+   P YLDG L GD+GFDPL LG D E L+W+VQAELVH R AM+GVAG
Sbjct: 45  ATERATWLPGLDP-PSYLDGKLAGDFGFDPLGLGEDPESLKWYVQAELVHARFAMLGVAG 103

Query: 261 ILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQLILTGWVEAKRWADYKNPG 440
           IL   L    GI NLP WY+AG V  +   +    +L+  QL+L G+ E KR+ D+ +PG
Sbjct: 104 ILFTDLLRTTGIRNLPVWYEAGAVKFD---FASTKTLIVVQLLLMGFAETKRYMDFVSPG 160

Query: 441 SQG--DGSFFGLTDDFVAKENGYPGG-LFDPFNLAADPGNYEEYKVKEIKNG 587
           SQ   DGSFFG+   F   E GYPGG L +P  LA D GN  E+K+KEIKNG
Sbjct: 161 SQAQEDGSFFGIEAAFEGLEPGYPGGPLLNPLGLAKDIGNAHEWKLKEIKNG 212


>ref|XP_006393637.1| hypothetical protein EUTSA_v10011725mg [Eutrema salsugineum]
           gi|557090215|gb|ESQ30923.1| hypothetical protein
           EUTSA_v10011725mg [Eutrema salsugineum]
          Length = 255

 Score =  172 bits (435), Expect = 1e-40
 Identities = 95/193 (49%), Positives = 120/193 (62%), Gaps = 10/193 (5%)
 Frame = +3

Query: 39  SRAQVIRKGVSCKAAA--------DRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGE 194
           S A + R+ +SC  A          R  W PG++  P YLDG LPGDYGFDPL LG D +
Sbjct: 21  SSAVITRRRISCIGATTGRNVAVEQRATWLPGLEP-PPYLDGKLPGDYGFDPLGLGEDPK 79

Query: 195 LLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLL 374
            L+W+VQAELVH R AM+GVAGIL   L    GI NLP WY+AG V  +   +    +L+
Sbjct: 80  SLKWYVQAELVHSRFAMLGVAGILFTDLLRTTGISNLPVWYEAGAVKFD---FASTKTLI 136

Query: 375 FTQLILTGWVEAKRWADYKNPGSQG-DGSFFGLTDDFVAKENGYPGG-LFDPFNLAADPG 548
           F Q +L G+ E KR+ D+ +PGSQ  +GSFFGL       E GYPGG L +P  LA D  
Sbjct: 137 FVQFLLMGFAETKRYMDFVSPGSQAIEGSFFGLEAALEGLEPGYPGGPLLNPLGLAKDIR 196

Query: 549 NYEEYKVKEIKNG 587
           N +++K+KEIKNG
Sbjct: 197 NADQWKLKEIKNG 209


>ref|XP_010538121.1| PREDICTED: chlorophyll a-b binding protein 4, chloroplastic
           [Tarenaya hassleriana]
          Length = 255

 Score =  171 bits (433), Expect = 2e-40
 Identities = 100/190 (52%), Positives = 119/190 (62%), Gaps = 2/190 (1%)
 Frame = +3

Query: 24  RVAPASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGSDGELLR 203
           R   ASRA+   + VS  AA +R  W PG+   P YLDG+L GDYGFDPL LG D E L+
Sbjct: 25  RTFSASRARSSGR-VSVGAAEERATWLPGLDP-PPYLDGTLTGDYGFDPLGLGEDPESLK 82

Query: 204 WFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKYFPFWSLLFTQ 383
           W+VQAELVH R AM+GVAGIL   L    GI +LP WY+AG    E   +    +LL  Q
Sbjct: 83  WYVQAELVHSRFAMLGVAGILFTDLLRVTGIRDLPVWYEAGATKFE---FASTRTLLTVQ 139

Query: 384 LILTGWVEAKRWADYKNPGSQG-DGSFFGLTDDFVAKENGYPGG-LFDPFNLAADPGNYE 557
            IL G+ E KR+ D+ NPGSQ  +GSFFGL       E GYPGG L +P  LA D  N  
Sbjct: 140 FILMGFAETKRYMDFINPGSQAKEGSFFGLEPALEGLEPGYPGGPLLNPLGLAKDIKNAH 199

Query: 558 EYKVKEIKNG 587
           E+K+KEIKNG
Sbjct: 200 EWKLKEIKNG 209


>tpg|DAA05902.1| TPA_inf: chloroplast light-harvesting complex I protein precursor
           Lhca7 [Acetabularia acetabulum]
          Length = 214

 Score =  171 bits (433), Expect = 2e-40
 Identities = 94/203 (46%), Positives = 123/203 (60%), Gaps = 9/203 (4%)
 Frame = +3

Query: 6   FSGSNVRVAPASRAQVIRKGVSCKAAADRQLWYPGIQKVPDYLDGSLPGDYGFDPLRLGS 185
           F G   RV P +     R+      A ++ +W PG+  VP +L+ +LP DYGFDPL LGS
Sbjct: 5   FLGQTSRVTPTNST---RRMSPVTQATEQPVWLPGVN-VPQHLNRNLPADYGFDPLGLGS 60

Query: 186 DGELLRWFVQAELVHGRTAMIGVAGILLPGLATKLGILNLPPWYDAGKVWIENNKY-FPF 362
           + E L+W+VQAELVH R AM+GVAGIL+PGL TK+GILN+P WY+AGKV IE+  +   F
Sbjct: 61  EPEALKWYVQAELVHCRFAMLGVAGILIPGLLTKVGILNVPQWYEAGKVAIESQPFGLNF 120

Query: 363 WSLLFTQLILTGWVEAKRWADYKNPGSQGDGSFFGLTDDFVAK--------ENGYPGGLF 518
            +LL  QL +  W E KRW ++KNPGSQ +   +   +    K        +  YPGGLF
Sbjct: 121 QTLLGFQLFMMTWAETKRWVEFKNPGSQSNPDSYPGAEMLPGKVFALGGSGDPNYPGGLF 180

Query: 519 DPFNLAADPGNYEEYKVKEIKNG 587
           +P  +  D       KVKEI NG
Sbjct: 181 NPMGMGDD-----SMKVKEIANG 198

