BLASTX nr result

ID: Cocculus23_contig00015944 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00015944
         (1113 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002532572.1| conserved hypothetical protein [Ricinus comm...   362   2e-97
ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g122...   357   5e-96
ref|XP_002303521.1| thylakoid lumenal family protein [Populus tr...   352   2e-94
ref|XP_006590551.1| PREDICTED: thylakoid lumenal protein At1g122...   352   2e-94
ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250...   350   6e-94
emb|CBI31881.3| unnamed protein product [Vitis vinifera]              350   6e-94
ref|XP_006355493.1| PREDICTED: thylakoid lumenal protein At1g122...   346   9e-93
ref|XP_004953270.1| PREDICTED: thylakoid lumenal protein At1g122...   346   9e-93
ref|XP_004298665.1| PREDICTED: thylakoid lumenal protein At1g122...   346   9e-93
ref|XP_004511661.1| PREDICTED: thylakoid lumenal protein At1g122...   346   1e-92
ref|XP_004245736.1| PREDICTED: thylakoid lumenal protein At1g122...   345   1e-92
ref|XP_007157331.1| hypothetical protein PHAVU_002G061100g [Phas...   345   2e-92
ref|XP_007039926.1| Pentapeptide repeat-containing protein isofo...   344   3e-92
ref|XP_007039925.1| Pentapeptide repeat-containing protein isofo...   344   3e-92
ref|XP_006477250.1| PREDICTED: thylakoid lumenal protein At1g122...   343   6e-92
ref|XP_006440376.1| hypothetical protein CICLE_v10021545mg [Citr...   343   6e-92
gb|AFK40674.1| unknown [Lotus japonicus]                              343   6e-92
ref|XP_006417253.1| hypothetical protein EUTSA_v10008422mg [Eutr...   343   1e-91
gb|AGV54388.1| thylakoid lumenal protein [Phaseolus vulgaris]         343   1e-91
dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]    342   1e-91

>ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
           gi|223527699|gb|EEF29806.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 280

 Score =  362 bits (929), Expect = 2e-97
 Identities = 196/290 (67%), Positives = 217/290 (74%), Gaps = 17/290 (5%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELS---------- 248
           MA  S+SP S K++                  +P +S  F    ++C+L+          
Sbjct: 1   MAFTSISPLSIKSVN------ISPSSSRSPYHLPSQSKPFH---ILCQLATEREDRILDC 51

Query: 249 -------ENAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQ 407
                   ++KPK WRT+VSTALAAAA ++  G G+PA ADLNKFEAELRGEFGIGSAAQ
Sbjct: 52  STTRYKVHHSKPKNWRTLVSTALAAAAAVNL-GFGLPAAADLNKFEAELRGEFGIGSAAQ 110

Query: 408 FGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTL 587
           FGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTL
Sbjct: 111 FGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTL 170

Query: 588 MDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLT 767
           MDRMVLNEAN TNAVLVRSVLTRSDLGGA IEGADFSDAV+DL QKQALCKYA+GTN +T
Sbjct: 171 MDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLTQKQALCKYANGTNSIT 230

Query: 768 GVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           GVSTR SLGCGN+RRNAYG               DRDGFCDE+TGLCDAK
Sbjct: 231 GVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280


>ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus] gi|449520611|ref|XP_004167327.1|
           PREDICTED: thylakoid lumenal protein At1g12250,
           chloroplastic-like [Cucumis sativus]
          Length = 279

 Score =  357 bits (916), Expect = 5e-96
 Identities = 190/278 (68%), Positives = 212/278 (76%), Gaps = 6/278 (2%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELSEN------AK 260
           MAL+S+S  S K L               R +I + S     +    + SE        +
Sbjct: 1   MALSSISSLSVKCLPLNSSKSRHPCSLQTRKQISMVSQINPQKDQTQDCSERKHIGKITE 60

Query: 261 PKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRKAVH 440
           PK+W+ +VSTALAAAAVI FS G MP++A+LNK+EA+ RGEFGIGSAAQ+GSADLRKAVH
Sbjct: 61  PKRWQKLVSTALAAAAVIGFSSG-MPSVAELNKYEADTRGEFGIGSAAQYGSADLRKAVH 119

Query: 441 VNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANF 620
           +NENFRRANFTSADMRESDFSG TFNGAYLEKAVAYK NFSGADLSDTLMDRMVLNEANF
Sbjct: 120 INENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLMDRMVLNEANF 179

Query: 621 TNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSLGCG 800
           TNAVLVRSVLTRSDLGGA I GADFSDAV+DLPQKQALCKYASGTNP+TGVSTR SLGCG
Sbjct: 180 TNAVLVRSVLTRSDLGGAIIVGADFSDAVIDLPQKQALCKYASGTNPVTGVSTRASLGCG 239

Query: 801 NNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDA 914
           N+RRNAYG               DRDGFCD+ TGLC+A
Sbjct: 240 NSRRNAYGTPSSPLLSAPPQQLLDRDGFCDQDTGLCEA 277


>ref|XP_002303521.1| thylakoid lumenal family protein [Populus trichocarpa]
           gi|222840953|gb|EEE78500.1| thylakoid lumenal family
           protein [Populus trichocarpa]
          Length = 275

 Score =  352 bits (903), Expect = 2e-94
 Identities = 179/223 (80%), Positives = 191/223 (85%)
 Frame = +3

Query: 249 ENAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLR 428
           E AK K W  +VST L AAA ISFS   +PA+ADLN+FEAE RGEFGIGSAAQFGSADLR
Sbjct: 54  ETAKAKNWARVVSTTLVAAA-ISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLR 112

Query: 429 KAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLN 608
           KAVH+NENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLN
Sbjct: 113 KAVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLN 172

Query: 609 EANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTS 788
           E+N TNAVLVRSVLTRSDLGGA I GADFSDAV+DLPQKQALCKYASGTNP+TGVSTR S
Sbjct: 173 ESNLTNAVLVRSVLTRSDLGGALIAGADFSDAVIDLPQKQALCKYASGTNPITGVSTRAS 232

Query: 789 LGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           LGCGN+RRNAYG               DRDGFCD+ TGLCDAK
Sbjct: 233 LGCGNSRRNAYGTPSSPLLSAPPQKLLDRDGFCDQGTGLCDAK 275


>ref|XP_006590551.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           isoform X1 [Glycine max]
          Length = 266

 Score =  352 bits (902), Expect = 2e-94
 Identities = 196/285 (68%), Positives = 213/285 (74%), Gaps = 11/285 (3%)
 Frame = +3

Query: 96  LMALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRS-------VVCELSEN 254
           +MAL SLSP S  +L                  +   S S  S S       VVC+++ N
Sbjct: 1   MMALNSLSPLSINSL-----------------HVSSSSTSKISHSHSKSFPVVVCQINSN 43

Query: 255 AKPKQ----WRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSAD 422
              +Q    W  +VS  LAAA VI+FS   M ALADLNKFEAE+RGEFGIGSAAQFGSAD
Sbjct: 44  RDHRQESTKWGKVVSATLAAA-VIAFSSD-MSALADLNKFEAEMRGEFGIGSAAQFGSAD 101

Query: 423 LRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 602
           LRKAVHVNENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV
Sbjct: 102 LRKAVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMV 161

Query: 603 LNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTR 782
           LNEAN TNA+L+R+VLTRSDLGGA IEGADFSDAVLDLPQKQALCKYASGTNP+TGVSTR
Sbjct: 162 LNEANLTNAILLRTVLTRSDLGGAIIEGADFSDAVLDLPQKQALCKYASGTNPVTGVSTR 221

Query: 783 TSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
            SLGCGN RRNAYG               DRDGFCD++TGLCDAK
Sbjct: 222 VSLGCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDDATGLCDAK 266


>ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
            vinifera]
          Length = 596

 Score =  350 bits (898), Expect = 6e-94
 Identities = 179/222 (80%), Positives = 191/222 (86%)
 Frame = +3

Query: 252  NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRK 431
            NA+ K+W+ +VSTALAAA V       MPA+ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 377  NAESKKWQRLVSTALAAAVVTL--SPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434

Query: 432  AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 611
            AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLNE
Sbjct: 435  AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 494

Query: 612  ANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSL 791
            AN TNAVL R+VLTRSDLGGA IEGADFSDAV+DLPQKQALCKYASGTNP+TGVSTR SL
Sbjct: 495  ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 554

Query: 792  GCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
            GCGN+RR+AYG               DRDGFCDE TGLCDAK
Sbjct: 555  GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 596



 Score =  167 bits (422), Expect = 1e-38
 Identities = 110/217 (50%), Positives = 135/217 (62%), Gaps = 12/217 (5%)
 Frame = +3

Query: 201 LKSLSFRSRSVVCELSE----------NAKPKQWRTMVSTALAAAAVISFSGGGMPALAD 350
           L+SLS +  +VVC +            NA+ K+W+ +VSTALAAA V       MPA+AD
Sbjct: 18  LRSLS-KPFTVVCRIERQRENNWRGEANAESKKWQRLVSTALAAAVVTL--SPVMPAVAD 74

Query: 351 LNKFEAELRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYL 530
           LNK+E E RGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNG YL
Sbjct: 75  LNKYEVETRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGEYL 134

Query: 531 EKAVAYKANFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRS-DLGGAT-IEGADFSDA 704
           EKAVAYKA+ +G D       +MVL+ +  T   ++R  L+    LG  T  E   ++  
Sbjct: 135 EKAVAYKASLTGPDAPHARPYKMVLHPS--TGLCVLRGSLSEPLKLGPCTESEAWGYTPQ 192

Query: 705 VLDLPQKQALCKYASGTNPLTGVSTRTSLGCGNNRRN 815
            + + +   LC  A G     G   + S+ C N   N
Sbjct: 193 KILIIKGTYLCLQAVG----LGKPAKLSVICSNPGSN 225


>emb|CBI31881.3| unnamed protein product [Vitis vinifera]
          Length = 261

 Score =  350 bits (898), Expect = 6e-94
 Identities = 179/222 (80%), Positives = 191/222 (86%)
 Frame = +3

Query: 252 NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRK 431
           NA+ K+W+ +VSTALAAA V       MPA+ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 42  NAESKKWQRLVSTALAAAVVTL--SPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99

Query: 432 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 611
           AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLNE
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNE 159

Query: 612 ANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSL 791
           AN TNAVL R+VLTRSDLGGA IEGADFSDAV+DLPQKQALCKYASGTNP+TGVSTR SL
Sbjct: 160 ANLTNAVLARTVLTRSDLGGAVIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRASL 219

Query: 792 GCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           GCGN+RR+AYG               DRDGFCDE TGLCDAK
Sbjct: 220 GCGNSRRSAYGSPSSPLLSAPPPKLLDRDGFCDEGTGLCDAK 261


>ref|XP_006355493.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Solanum tuberosum]
          Length = 264

 Score =  346 bits (888), Expect = 9e-93
 Identities = 180/235 (76%), Positives = 198/235 (84%), Gaps = 3/235 (1%)
 Frame = +3

Query: 219 RSRSVVCELSE---NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFG 389
           +S  +VC++ +   N + K+W+ +VSTALAAA VI+FS   M A+ADLNKFEAE RGEFG
Sbjct: 31  KSSKIVCQVEKSNSNREVKKWKAIVSTALAAA-VITFSSN-MTAMADLNKFEAETRGEFG 88

Query: 390 IGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA 569
           IGSAAQFGSADL+K VH NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA
Sbjct: 89  IGSAAQFGSADLKKTVHTNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA 148

Query: 570 DLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYAS 749
           DLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA +EGADFSDAV+DL QKQALCKYAS
Sbjct: 149 DLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIVEGADFSDAVIDLLQKQALCKYAS 208

Query: 750 GTNPLTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDA 914
           GTNP+TGVSTR SLGCGN+RRNAYG               DRDGFCD +TGLC+A
Sbjct: 209 GTNPVTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQQLLDRDGFCDPATGLCEA 263


>ref|XP_004953270.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Setaria italica]
          Length = 272

 Score =  346 bits (888), Expect = 9e-93
 Identities = 185/276 (67%), Positives = 202/276 (73%), Gaps = 3/276 (1%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSR---SVVCELSENAKPKQ 269
           M LAS SP +    +P              +R+  ++   R           S   +  +
Sbjct: 1   MTLASTSPIAAAAARPTKLPSFSRCPPRRLLRVSCQAAPDRPACGGGNASSASPAPQQPR 60

Query: 270 WRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRKAVHVNE 449
           WR  VS A+AAA V +     MPA ADLN+FEAE RGEFGIGSAAQFGSADL+KAVHVNE
Sbjct: 61  WRAAVSAAIAAAVVAA----AMPAYADLNRFEAEQRGEFGIGSAAQFGSADLKKAVHVNE 116

Query: 450 NFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANFTNA 629
           NFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEAN TNA
Sbjct: 117 NFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNA 176

Query: 630 VLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSLGCGNNR 809
           VLVRSVLTRSDLGGA IEGADFSDAV+DLPQKQALCKYASGTNP+TGVSTR SLGCGN+R
Sbjct: 177 VLVRSVLTRSDLGGAIIEGADFSDAVIDLPQKQALCKYASGTNPITGVSTRKSLGCGNSR 236

Query: 810 RNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           RNAYG               DRDGFCD  TG+CDAK
Sbjct: 237 RNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGMCDAK 272


>ref|XP_004298665.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Fragaria vesca subsp. vesca]
          Length = 233

 Score =  346 bits (888), Expect = 9e-93
 Identities = 179/218 (82%), Positives = 187/218 (85%)
 Frame = +3

Query: 264 KQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRKAVHV 443
           K W+ +VS ALA AAVISFS G MPA+ADLNKFEAE  GEFGIGSAAQFGSADLRKAVHV
Sbjct: 17  KCWKNIVSVALATAAVISFSSG-MPAIADLNKFEAETPGEFGIGSAAQFGSADLRKAVHV 75

Query: 444 NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANFT 623
           NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEAN  
Sbjct: 76  NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFAGADLSDTLMDRMVLNEANLK 135

Query: 624 NAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSLGCGN 803
           +AVLVRSVLTRSDLGGA IEGADFSDAVLDL QKQALCKYASGTNP TGVSTRTSLGCGN
Sbjct: 136 DAVLVRSVLTRSDLGGALIEGADFSDAVLDLSQKQALCKYASGTNPTTGVSTRTSLGCGN 195

Query: 804 NRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           +RRNAYG               +RDG CD+ TGLCD K
Sbjct: 196 SRRNAYGSPSSPLLSAPPQKLLNRDGICDQETGLCDVK 233


>ref|XP_004511661.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cicer arietinum]
          Length = 269

 Score =  346 bits (887), Expect = 1e-92
 Identities = 188/279 (67%), Positives = 210/279 (75%), Gaps = 6/279 (2%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRS--VVCELSENA----K 260
           MAL SLSP S  +L                      +L F S+S  +VC+++ N+    +
Sbjct: 1   MALNSLSPLSINSLHVSSNYRSTSKIS--------NTLHFHSKSSPIVCKINLNSDHPQE 52

Query: 261 PKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRKAVH 440
            K+W  + S  LAAA ++  S   + ALADLNKFEAE+RGEFGIGSAAQFGSADL+KAVH
Sbjct: 53  TKKWGKLFSATLAAAVIVFSSD--LSALADLNKFEAEIRGEFGIGSAAQFGSADLKKAVH 110

Query: 441 VNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANF 620
           VNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEAN 
Sbjct: 111 VNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANL 170

Query: 621 TNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSLGCG 800
           TNA+LVR+VLTRSDLG A IEGADFSDAVLDL QKQALCKYASGTNP+TGVSTR SLGCG
Sbjct: 171 TNAILVRTVLTRSDLGSAIIEGADFSDAVLDLTQKQALCKYASGTNPVTGVSTRVSLGCG 230

Query: 801 NNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           N RRNAYG               DRDGFCDE+TGLCD+K
Sbjct: 231 NKRRNAYGTPSSPLLSAPPQKLLDRDGFCDEATGLCDSK 269


>ref|XP_004245736.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           isoform 2 [Solanum lycopersicum]
          Length = 309

 Score =  345 bits (886), Expect = 1e-92
 Identities = 180/235 (76%), Positives = 198/235 (84%), Gaps = 3/235 (1%)
 Frame = +3

Query: 219 RSRSVVCELSE---NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFG 389
           +S  +VC++ +   N + K+W+ +VSTALAAA VI+FS   M A+ADLNKFEA+ RGEFG
Sbjct: 76  KSSKIVCQVEKSNSNIEIKKWKAIVSTALAAA-VITFSSN-MAAMADLNKFEADTRGEFG 133

Query: 390 IGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA 569
           IGSAAQFGSADL+K VH NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA
Sbjct: 134 IGSAAQFGSADLKKTVHTNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGA 193

Query: 570 DLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYAS 749
           DLSDTLMDRMVLNEAN TNAVLVRSVLTRSDLGGA IEGADFSDAV+DL QKQALCKYAS
Sbjct: 194 DLSDTLMDRMVLNEANLTNAVLVRSVLTRSDLGGAIIEGADFSDAVIDLLQKQALCKYAS 253

Query: 750 GTNPLTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDA 914
           GTNP+TGVSTR SLGCGN+RRNAYG               DRDGFCD +TGLC+A
Sbjct: 254 GTNPVTGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQQLLDRDGFCDSATGLCEA 308


>ref|XP_007157331.1| hypothetical protein PHAVU_002G061100g [Phaseolus vulgaris]
           gi|561030746|gb|ESW29325.1| hypothetical protein
           PHAVU_002G061100g [Phaseolus vulgaris]
          Length = 280

 Score =  345 bits (885), Expect = 2e-92
 Identities = 181/232 (78%), Positives = 197/232 (84%), Gaps = 1/232 (0%)
 Frame = +3

Query: 225 RSVVCE-LSENAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSA 401
           +SV C  ++ NA+ ++W  +VS  LAAA VI+FS   M ALADLNKFEAE+RGEFGIGSA
Sbjct: 51  KSVGCSAVAANAESRKWGKVVSATLAAA-VIAFSSD-MSALADLNKFEAEMRGEFGIGSA 108

Query: 402 AQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSD 581
           AQFGSADLRKAVHVNENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAY+ANFSGADLSD
Sbjct: 109 AQFGSADLRKAVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYRANFSGADLSD 168

Query: 582 TLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNP 761
           TLMDRMVLNEAN TNA+L+R+VLTRSDL GA IEGADFSDAVLDL QKQALCKYASGTNP
Sbjct: 169 TLMDRMVLNEANLTNAILLRTVLTRSDLAGAIIEGADFSDAVLDLLQKQALCKYASGTNP 228

Query: 762 LTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           +TGVSTR SLGCGN RRNAYG               DRDGFCDE+TGLCDAK
Sbjct: 229 VTGVSTRVSLGCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280


>ref|XP_007039926.1| Pentapeptide repeat-containing protein isoform 2 [Theobroma cacao]
           gi|508777171|gb|EOY24427.1| Pentapeptide
           repeat-containing protein isoform 2 [Theobroma cacao]
          Length = 280

 Score =  344 bits (883), Expect = 3e-92
 Identities = 176/222 (79%), Positives = 190/222 (85%)
 Frame = +3

Query: 252 NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRK 431
           +AK K WR +VSTALAAA V    G  M ALA+LNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 61  DAKFKSWRALVSTALAAAMVAF--GSDMSALAELNKYEAETRGEFGIGSAAQFGSADLRK 118

Query: 432 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 611
           AVH+NENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLN+
Sbjct: 119 AVHMNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLND 178

Query: 612 ANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSL 791
           AN TNAVLVRSVLTRSDLGGA IEGADFSDAV+DLPQKQALCKYA+G NP+TGVSTR SL
Sbjct: 179 ANLTNAVLVRSVLTRSDLGGALIEGADFSDAVIDLPQKQALCKYANGKNPITGVSTRASL 238

Query: 792 GCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           GCGN+RRNAYG               DRDGFCD+ TGLC+AK
Sbjct: 239 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDKDTGLCEAK 280


>ref|XP_007039925.1| Pentapeptide repeat-containing protein isoform 1 [Theobroma cacao]
           gi|508777170|gb|EOY24426.1| Pentapeptide
           repeat-containing protein isoform 1 [Theobroma cacao]
          Length = 287

 Score =  344 bits (883), Expect = 3e-92
 Identities = 176/222 (79%), Positives = 190/222 (85%)
 Frame = +3

Query: 252 NAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRK 431
           +AK K WR +VSTALAAA V    G  M ALA+LNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 68  DAKFKSWRALVSTALAAAMVAF--GSDMSALAELNKYEAETRGEFGIGSAAQFGSADLRK 125

Query: 432 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNE 611
           AVH+NENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAYKANF+GADLSDTLMDRMVLN+
Sbjct: 126 AVHMNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLMDRMVLND 185

Query: 612 ANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSL 791
           AN TNAVLVRSVLTRSDLGGA IEGADFSDAV+DLPQKQALCKYA+G NP+TGVSTR SL
Sbjct: 186 ANLTNAVLVRSVLTRSDLGGALIEGADFSDAVIDLPQKQALCKYANGKNPITGVSTRASL 245

Query: 792 GCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           GCGN+RRNAYG               DRDGFCD+ TGLC+AK
Sbjct: 246 GCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDKDTGLCEAK 287


>ref|XP_006477250.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Citrus sinensis]
          Length = 282

 Score =  343 bits (881), Expect = 6e-92
 Identities = 193/292 (66%), Positives = 209/292 (71%), Gaps = 19/292 (6%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELSEN-------- 254
           MAL+S+SP S K+L               +    L +LS +   V C++S          
Sbjct: 1   MALSSISPLSIKSLN--------FCSSSSKGPYQLHALS-KPLWVACQISSKTESDGQFP 51

Query: 255 -----------AKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSA 401
                      AK K WR  VSTALAAA V S S   + ALADLNK+EAE RGEFGIGSA
Sbjct: 52  DCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSN-ISALADLNKYEAETRGEFGIGSA 110

Query: 402 AQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSD 581
           AQFGSADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSD
Sbjct: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170

Query: 582 TLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNP 761
           TLMDRMVLNEAN TNAVLVR+VLTRSDLGGA IEGADFSDAV+DL QKQALCKYA+GTNP
Sbjct: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230

Query: 762 LTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           +TGVSTR SLGCGN+RRNAYG               DRDGFCD  TGLCDAK
Sbjct: 231 ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282


>ref|XP_006440376.1| hypothetical protein CICLE_v10021545mg [Citrus clementina]
           gi|557542638|gb|ESR53616.1| hypothetical protein
           CICLE_v10021545mg [Citrus clementina]
          Length = 282

 Score =  343 bits (881), Expect = 6e-92
 Identities = 193/292 (66%), Positives = 209/292 (71%), Gaps = 19/292 (6%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELSEN-------- 254
           MAL+S+SP S K+L               +    L +LS +   V C++S          
Sbjct: 1   MALSSISPLSIKSLN--------FCSSSSKGPYQLHALS-KPLWVACQISSKTESEGQFP 51

Query: 255 -----------AKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSA 401
                      AK K WR  VSTALAAA V S S   + ALADLNK+EAE RGEFGIGSA
Sbjct: 52  DCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSN-ISALADLNKYEAETRGEFGIGSA 110

Query: 402 AQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSD 581
           AQFGSADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+GADLSD
Sbjct: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSD 170

Query: 582 TLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNP 761
           TLMDRMVLNEAN TNAVLVR+VLTRSDLGGA IEGADFSDAV+DL QKQALCKYA+GTNP
Sbjct: 171 TLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNP 230

Query: 762 LTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           +TGVSTR SLGCGN+RRNAYG               DRDGFCD  TGLCDAK
Sbjct: 231 ITGVSTRKSLGCGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK 282


>gb|AFK40674.1| unknown [Lotus japonicus]
          Length = 273

 Score =  343 bits (881), Expect = 6e-92
 Identities = 183/243 (75%), Positives = 200/243 (82%), Gaps = 7/243 (2%)
 Frame = +3

Query: 207 SLSFRSRS---VVCELSENA----KPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFE 365
           SL F  +S   V+C+++ N     + K+W  +VS  LAAA VI+FS   M ALADLNKFE
Sbjct: 30  SLHFHPKSSPIVLCQMNSNRDHPQESKKWGKLVSATLAAA-VIAFSSD-MSALADLNKFE 87

Query: 366 AELRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVA 545
           AE+RGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVA
Sbjct: 88  AEIRGEFGIGSAAQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVA 147

Query: 546 YKANFSGADLSDTLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQK 725
           YKANFSGADLSDTLMDRMVLNEAN TNA+LVR+VLTRSDLGG+ IEGADFSDAVLDL QK
Sbjct: 148 YKANFSGADLSDTLMDRMVLNEANLTNAILVRTVLTRSDLGGSIIEGADFSDAVLDLTQK 207

Query: 726 QALCKYASGTNPLTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGL 905
            ALCKYASGTNP+TGVSTR SLGCGN RRNAYG               +RDGFCDE+TGL
Sbjct: 208 LALCKYASGTNPVTGVSTRVSLGCGNKRRNAYGTPSSPLLSAPPQKLLNRDGFCDEATGL 267

Query: 906 CDA 914
           CD+
Sbjct: 268 CDS 270


>ref|XP_006417253.1| hypothetical protein EUTSA_v10008422mg [Eutrema salsugineum]
           gi|557095024|gb|ESQ35606.1| hypothetical protein
           EUTSA_v10008422mg [Eutrema salsugineum]
          Length = 280

 Score =  343 bits (879), Expect = 1e-91
 Identities = 188/282 (66%), Positives = 207/282 (73%), Gaps = 10/282 (3%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELSENA------- 257
           MA +SLSP   K+L                 R PL+ L   S S   E+ +++       
Sbjct: 1   MAFSSLSPLPMKSLDVSRSSSSISRSLYHFPRYPLRRLQLSSHSNP-EIKDSSNHREDCC 59

Query: 258 ---KPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLR 428
              + K W+ ++S ALAAA + S SG  +PALA+LNKFEA+ RGEFGIGSAAQFGSADL 
Sbjct: 60  SGVESKTWKRILSAALAAALIASSSG--VPALAELNKFEADTRGEFGIGSAAQFGSADLS 117

Query: 429 KAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLN 608
           K VH NENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLN
Sbjct: 118 KTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLN 177

Query: 609 EANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTS 788
           EAN TNA+LVRSVLTRSDLGGA IEGADFSDAV+DL QKQALCKYA+GTNPLTGVSTR S
Sbjct: 178 EANLTNAILVRSVLTRSDLGGAKIEGADFSDAVIDLLQKQALCKYANGTNPLTGVSTRKS 237

Query: 789 LGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDA 914
           LGCGN+RRNAYG                RDGFCDE TGLCDA
Sbjct: 238 LGCGNSRRNAYGSPSSPLLSAPPQRLLGRDGFCDEKTGLCDA 279


>gb|AGV54388.1| thylakoid lumenal protein [Phaseolus vulgaris]
          Length = 280

 Score =  343 bits (879), Expect = 1e-91
 Identities = 180/232 (77%), Positives = 196/232 (84%), Gaps = 1/232 (0%)
 Frame = +3

Query: 225 RSVVCE-LSENAKPKQWRTMVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSA 401
           +SV C  ++ NA+ ++W  +VS  LAAA VI+FS   M ALADLNKFEAE+RGEFGIGSA
Sbjct: 51  KSVGCSAVAANAESRKWGKVVSATLAAA-VIAFSSD-MSALADLNKFEAEMRGEFGIGSA 108

Query: 402 AQFGSADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSD 581
           AQFGSADLRKAVHVNENFRRANFT+ADMRESDFSGSTFNGAYLEKAVAY+ANFSGADLSD
Sbjct: 109 AQFGSADLRKAVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYRANFSGADLSD 168

Query: 582 TLMDRMVLNEANFTNAVLVRSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNP 761
           TLMDRMVLNEAN TNA+L+R+VLTRSDL GA IEGADFSDAVLDL  KQALCKYASGTNP
Sbjct: 169 TLMDRMVLNEANLTNAILLRTVLTRSDLAGAIIEGADFSDAVLDLLPKQALCKYASGTNP 228

Query: 762 LTGVSTRTSLGCGNNRRNAYGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           +TGVSTR SLGCGN RRNAYG               DRDGFCDE+TGLCDAK
Sbjct: 229 VTGVSTRVSLGCGNKRRNAYGSPSSPLLSAPPQKLLDRDGFCDEATGLCDAK 280


>dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 267

 Score =  342 bits (878), Expect = 1e-91
 Identities = 182/273 (66%), Positives = 205/273 (75%)
 Frame = +3

Query: 99  MALASLSPFSTKTLKPXXXXXXXXXXXXXRVRIPLKSLSFRSRSVVCELSENAKPKQWRT 278
           MALAS SP +    +P               RI  ++ + RS       +  A P+ WR 
Sbjct: 1   MALASTSPLAATVARPKAPASLTRCRSRRLQRISCQATTDRSGGGNASNTSPAPPR-WRV 59

Query: 279 MVSTALAAAAVISFSGGGMPALADLNKFEAELRGEFGIGSAAQFGSADLRKAVHVNENFR 458
            VS ALAAA V++     MPA ADLNK+EA+ RGEFGIGSAAQFG+ADL+  VHVNENFR
Sbjct: 60  AVSAALAAAVVVA-----MPAHADLNKYEADQRGEFGIGSAAQFGNADLKNTVHVNENFR 114

Query: 459 RANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANFTNAVLV 638
           RANFTSADMRESDFSGSTFNGAY+EKAVA++ANF+GADLSDTLMDRMVLNEAN TNAVL 
Sbjct: 115 RANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLMDRMVLNEANLTNAVLS 174

Query: 639 RSVLTRSDLGGATIEGADFSDAVLDLPQKQALCKYASGTNPLTGVSTRTSLGCGNNRRNA 818
           R+VLTRSDLGGATIEGADFSDAV+DLPQK ALCKYASGTNP+TGVSTR SLGCGN+RRNA
Sbjct: 175 RTVLTRSDLGGATIEGADFSDAVIDLPQKLALCKYASGTNPITGVSTRKSLGCGNSRRNA 234

Query: 819 YGXXXXXXXXXXXXXXXDRDGFCDESTGLCDAK 917
           YG               DRDGFCDE++GLCDAK
Sbjct: 235 YGSPSSPLLSAPPPKLLDRDGFCDEASGLCDAK 267

