BLASTX nr result

ID: Cocculus23_contig00024201 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00024201
         (1125 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006433180.1| hypothetical protein CICLE_v10002268mg [Citr...   305   2e-80
emb|CAN69845.1| hypothetical protein VITISV_038348 [Vitis vinifera]   296   1e-77
ref|XP_002284943.1| PREDICTED: uncharacterized protein LOC100249...   296   1e-77
ref|XP_007205769.1| hypothetical protein PRUPE_ppa010398mg [Prun...   295   2e-77
ref|XP_004302459.1| PREDICTED: uncharacterized protein LOC101305...   292   2e-76
ref|XP_002319030.1| hypothetical protein POPTR_0013s02750g [Popu...   291   3e-76
ref|XP_006355261.1| PREDICTED: uncharacterized protein LOC102582...   289   1e-75
gb|EXC31825.1| hypothetical protein L484_020653 [Morus notabilis]     289   2e-75
ref|XP_004244954.1| PREDICTED: uncharacterized protein LOC101266...   288   2e-75
ref|XP_006471883.1| PREDICTED: uncharacterized protein LOC102627...   288   4e-75
ref|XP_002512336.1| conserved hypothetical protein [Ricinus comm...   288   4e-75
ref|XP_002872272.1| hypothetical protein ARALYDRAFT_910845 [Arab...   280   6e-73
ref|XP_004156434.1| PREDICTED: uncharacterized protein LOC101230...   278   2e-72
ref|XP_004139199.1| PREDICTED: uncharacterized protein LOC101214...   278   3e-72
gb|EYU26892.1| hypothetical protein MIMGU_mgv1a012366mg [Mimulus...   277   5e-72
ref|XP_006288497.1| hypothetical protein CARUB_v10001762mg [Caps...   277   5e-72
dbj|BAC43363.1| unknown protein [Arabidopsis thaliana]                276   1e-71
ref|NP_198091.2| Mog1/PsbP/DUF1795-like photosystem II reaction ...   276   1e-71
ref|XP_007030936.1| Mog1/PsbP/DUF1795-like photosystem II reacti...   275   3e-71
gb|EPS66196.1| hypothetical protein M569_08583, partial [Genlise...   272   2e-70

>ref|XP_006433180.1| hypothetical protein CICLE_v10002268mg [Citrus clementina]
            gi|567881255|ref|XP_006433186.1| hypothetical protein
            CICLE_v10002268mg [Citrus clementina]
            gi|568835666|ref|XP_006471884.1| PREDICTED:
            uncharacterized protein LOC102627917 isoform X2 [Citrus
            sinensis] gi|557535302|gb|ESR46420.1| hypothetical
            protein CICLE_v10002268mg [Citrus clementina]
            gi|557535308|gb|ESR46426.1| hypothetical protein
            CICLE_v10002268mg [Citrus clementina]
          Length = 250

 Score =  305 bits (781), Expect = 2e-80
 Identities = 160/243 (65%), Positives = 196/243 (80%), Gaps = 3/243 (1%)
 Frame = -3

Query: 1090 LSFSSLPKNPSNLQ-RKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLALR 914
            LSFS  P +P N + + PN  SL     + + +++  KR+F L    T+ S+ S+ L   
Sbjct: 5    LSFSLFPLHPPNPKPQNPNPISL----PTNIQTSTFTKRKFILK--TTSLSLFSIALLQP 58

Query: 913  ISLSAPAIAETKNSKSI--LSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSL 740
                + A+AET  S  +  L+GI+NTKSWFQF+GDGF+IRVPPQFEDI+EPEDY+AGLSL
Sbjct: 59   PFAQSLALAETSPSPPLKALTGIANTKSWFQFYGDGFSIRVPPQFEDISEPEDYNAGLSL 118

Query: 739  YGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGG 560
            YGDK KPKT+AARFA+PDG EV+SV+IRPSNQLKITFLEA+D+ D G+LKDAAKIFVPGG
Sbjct: 119  YGDKAKPKTFAARFATPDGSEVLSVVIRPSNQLKITFLEAQDITDFGTLKDAAKIFVPGG 178

Query: 559  STLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGV 380
            +TLYSARTIKI+E+EG ++YYFYEFG D+QHVALVAA+NSGKAYIAGA APESKW+DDGV
Sbjct: 179  ATLYSARTIKIREEEGFKSYYFYEFGRDEQHVALVAAINSGKAYIAGATAPESKWDDDGV 238

Query: 379  KLR 371
            KLR
Sbjct: 239  KLR 241


>emb|CAN69845.1| hypothetical protein VITISV_038348 [Vitis vinifera]
          Length = 242

 Score =  296 bits (758), Expect = 1e-77
 Identities = 157/249 (63%), Positives = 187/249 (75%), Gaps = 4/249 (1%)
 Frame = -3

Query: 1105 MASLHLSFSSLP----KNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASV 938
            MA LHLS S  P    +NP+   +KP            +  NSS KR F L      AS+
Sbjct: 1    MAILHLSLSPRPLKPPQNPNPTTQKPR-----------ILCNSS-KRDFIL----NTASL 44

Query: 937  ISLDLALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDY 758
             +  L+     +      + +SKSILS I+NTK+WFQFFGDGF+IRVPP+FEDI EPED+
Sbjct: 45   CAFSLSAHYPFAIAFADTSPSSKSILSAIANTKTWFQFFGDGFSIRVPPEFEDIMEPEDF 104

Query: 757  SAGLSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAK 578
             AGLSLYGDK KP+T+AARFAS DG EV+SV+IRP+NQLKITFLEAKD+ DLGSLK+AAK
Sbjct: 105  DAGLSLYGDKAKPRTFAARFASSDGSEVLSVVIRPTNQLKITFLEAKDITDLGSLKEAAK 164

Query: 577  IFVPGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESK 398
            IFVP  +TLYSARTIKIKED+G RTYYFYEFG D+QH+A+VAAVN GKA+IAGAAAP+ K
Sbjct: 165  IFVPAAATLYSARTIKIKEDDGFRTYYFYEFGRDEQHLAVVAAVNGGKAFIAGAAAPQFK 224

Query: 397  WEDDGVKLR 371
            W+DDGVKLR
Sbjct: 225  WDDDGVKLR 233


>ref|XP_002284943.1| PREDICTED: uncharacterized protein LOC100249532 isoform 1 [Vitis
            vinifera] gi|359487717|ref|XP_003633636.1| PREDICTED:
            uncharacterized protein LOC100249532 isoform 2 [Vitis
            vinifera]
          Length = 242

 Score =  296 bits (757), Expect = 1e-77
 Identities = 157/249 (63%), Positives = 188/249 (75%), Gaps = 4/249 (1%)
 Frame = -3

Query: 1105 MASLHLSFSSLP----KNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASV 938
            MA LHLS S  P    +NP+    KP            +  NSS KR F L      AS+
Sbjct: 1    MAILHLSLSPRPLKPPQNPNPTTPKPR-----------ILCNSS-KRDFIL----NTASL 44

Query: 937  ISLDLALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDY 758
             +  L+    ++      + +SKSILS I+NTK+WFQFFGDGF+IRVPP+FEDI EPED+
Sbjct: 45   CAFSLSAHYPVAIAFADTSPSSKSILSAIANTKTWFQFFGDGFSIRVPPEFEDIMEPEDF 104

Query: 757  SAGLSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAK 578
            +AGLSLYGDK KP+T+AARFAS DG EV+SV+IRP+NQLKITFLEAKD+ DLGSLK+AAK
Sbjct: 105  NAGLSLYGDKAKPRTFAARFASSDGSEVLSVVIRPTNQLKITFLEAKDITDLGSLKEAAK 164

Query: 577  IFVPGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESK 398
            IFVP  +TLYSARTIKIKED+G RTYYFYEFG D+QH+A+VAAVN GKA+IAGAAAP+ K
Sbjct: 165  IFVPAAATLYSARTIKIKEDDGFRTYYFYEFGRDEQHLAVVAAVNGGKAFIAGAAAPQFK 224

Query: 397  WEDDGVKLR 371
            W+DDGVKLR
Sbjct: 225  WDDDGVKLR 233


>ref|XP_007205769.1| hypothetical protein PRUPE_ppa010398mg [Prunus persica]
            gi|462401411|gb|EMJ06968.1| hypothetical protein
            PRUPE_ppa010398mg [Prunus persica]
          Length = 251

 Score =  295 bits (756), Expect = 2e-77
 Identities = 160/247 (64%), Positives = 189/247 (76%), Gaps = 2/247 (0%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSS-CLYSNSSPKRQFALIGIATAASVISL 929
            MA L LS    P  P N   KP    L   + +  L  +++ KR+F L   +T+  +ISL
Sbjct: 1    MAILLLSLPPHPPIPPN-NPKPTTPKLTPPQPTIALTDSNTSKRRFILE--STSLFLISL 57

Query: 928  D-LALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSA 752
                  ++ S+P  + T      LSGI+NTKSWFQFFGDGFAIRVPPQF+D++EPEDY+ 
Sbjct: 58   TPQQYPVAHSSPEASATPRPS--LSGIANTKSWFQFFGDGFAIRVPPQFQDVSEPEDYNT 115

Query: 751  GLSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIF 572
            GLSLYGDKVKPKT+AARFASPDG+EVVSV+IRPSN LKITFLEAKD+ DLGSLK+AAKIF
Sbjct: 116  GLSLYGDKVKPKTFAARFASPDGYEVVSVVIRPSNSLKITFLEAKDITDLGSLKEAAKIF 175

Query: 571  VPGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWE 392
            +P GST+YSARTIKIKE+EG RTYYFYEFG+ +QH ALVAAVN GK YIAGA AP+SKW 
Sbjct: 176  IPVGSTVYSARTIKIKEEEGFRTYYFYEFGIQEQHAALVAAVNGGKTYIAGATAPQSKWN 235

Query: 391  DDGVKLR 371
            DDGVKLR
Sbjct: 236  DDGVKLR 242


>ref|XP_004302459.1| PREDICTED: uncharacterized protein LOC101305085 [Fragaria vesca
            subsp. vesca]
          Length = 250

 Score =  292 bits (747), Expect = 2e-76
 Identities = 149/236 (63%), Positives = 185/236 (78%)
 Frame = -3

Query: 1078 SLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLALRISLSA 899
            SLP +PSN + K +  +  +  S    +N++P  +  LI   T+  +ISL +  +  ++ 
Sbjct: 7    SLPPHPSNPKPKISNPTTPKLTSKLCITNNTPTSRRDLILKTTSIFLISL-IPQQYPIAH 65

Query: 898  PAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKP 719
             +   +   KSILSGI+NTKSWFQF+GDGF+IRVPPQFEDI EPEDY+AGLSLYGDK KP
Sbjct: 66   SSTEASPPQKSILSGIANTKSWFQFYGDGFSIRVPPQFEDIMEPEDYNAGLSLYGDKAKP 125

Query: 718  KTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSAR 539
            K ++ARFAS DG EV++V+I+PSNQ KITFLEAKD+ DLGSLK+ AKIFVP GSTLYSAR
Sbjct: 126  KIFSARFASSDGSEVLNVVIKPSNQFKITFLEAKDITDLGSLKEVAKIFVPFGSTLYSAR 185

Query: 538  TIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
            T+KIKE+EG RTYYFYEFG + QH ALVAAV+SG+ YIAGA APESKW++DGVKLR
Sbjct: 186  TLKIKEEEGFRTYYFYEFGREGQHAALVAAVSSGRTYIAGATAPESKWDEDGVKLR 241


>ref|XP_002319030.1| hypothetical protein POPTR_0013s02750g [Populus trichocarpa]
            gi|222857406|gb|EEE94953.1| hypothetical protein
            POPTR_0013s02750g [Populus trichocarpa]
          Length = 250

 Score =  291 bits (745), Expect = 3e-76
 Identities = 149/231 (64%), Positives = 179/231 (77%)
 Frame = -3

Query: 1063 PSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLALRISLSAPAIAE 884
            P N ++ PN +S      +    ++  KRQF    I    S+  + LA +  L+      
Sbjct: 15   PLNPRQNPNPYSPKPTPHATSLPSTISKRQF----IFKTTSLCLISLATQHPLAQALAEP 70

Query: 883  TKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKPKTYAA 704
            +   KS+LS ++NTKSWFQF+GDGFAIRVPPQFEDI EPEDYSAGLSLYGDK KPKT+AA
Sbjct: 71   SPPLKSVLSILANTKSWFQFYGDGFAIRVPPQFEDIMEPEDYSAGLSLYGDKAKPKTFAA 130

Query: 703  RFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSARTIKIK 524
            RFAS DG EV++V++RPSNQLKITFLEAKD+ DLGSLK+AAK+FVPGG+TL+SART+KIK
Sbjct: 131  RFASSDGSEVLNVVVRPSNQLKITFLEAKDITDLGSLKEAAKLFVPGGTTLFSARTLKIK 190

Query: 523  EDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
            E+EG RTYYFYEFG D QH ALVA VNSGKA IAGA AP+SKW++DGVKLR
Sbjct: 191  EEEGYRTYYFYEFGRDDQHAALVAVVNSGKAIIAGATAPQSKWDEDGVKLR 241


>ref|XP_006355261.1| PREDICTED: uncharacterized protein LOC102582099 [Solanum tuberosum]
          Length = 239

 Score =  289 bits (740), Expect = 1e-75
 Identities = 156/242 (64%), Positives = 177/242 (73%)
 Frame = -3

Query: 1096 LHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLAL 917
            L LS S  P  P    +KPN        S C   NS  +R+  L G             L
Sbjct: 11   LSLSVSIHPPKPL---QKPN--------SMCAQPNSVSRRRVFLSGST-----------L 48

Query: 916  RISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLY 737
             +S   P      NS + LSGI+NTKSWFQF+GDGF+IRVPP+F+D+TEPEDY+AGLSLY
Sbjct: 49   FLSQLIPKSDAQTNSNTFLSGIANTKSWFQFYGDGFSIRVPPEFQDLTEPEDYNAGLSLY 108

Query: 736  GDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGS 557
            GDK KPK +AARFAS DG EV+SV+IRPSNQLKITFLEAKD+ DLGSLK+AAKIFVP GS
Sbjct: 109  GDKAKPKKFAARFASSDGSEVLSVIIRPSNQLKITFLEAKDITDLGSLKEAAKIFVPAGS 168

Query: 556  TLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVK 377
            TLYS R+IKIKEDEG RTYYFYEF  D+QHVALVAAVNSGKA IAGA APESKW +DG+K
Sbjct: 169  TLYSVRSIKIKEDEGFRTYYFYEFVRDEQHVALVAAVNSGKAVIAGATAPESKWAEDGLK 228

Query: 376  LR 371
            LR
Sbjct: 229  LR 230


>gb|EXC31825.1| hypothetical protein L484_020653 [Morus notabilis]
          Length = 248

 Score =  289 bits (739), Expect = 2e-75
 Identities = 159/239 (66%), Positives = 183/239 (76%), Gaps = 5/239 (2%)
 Frame = -3

Query: 1072 PKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISL-----DLALRIS 908
            PK PS L   PN         S   S ++  R+ +++ I T+  VIS       LA R S
Sbjct: 15   PKIPSLLTPNPN---------STFSSITTTSRRHSILKI-TSLFVISFAPQQFSLA-RSS 63

Query: 907  LSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLYGDK 728
              APA A  K S   LSGI NTKSWFQF G+GFAIR+PPQFEDI EPED+ AGLSLYGDK
Sbjct: 64   EQAPAPALAKPS---LSGIVNTKSWFQFVGNGFAIRIPPQFEDIMEPEDFDAGLSLYGDK 120

Query: 727  VKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGSTLY 548
             KPKT+AARFASPDG EV+SV++RP+NQLKITFLEA DV DLGSLK+AA+IF+PGG+TLY
Sbjct: 121  AKPKTFAARFASPDGSEVLSVIVRPTNQLKITFLEATDVTDLGSLKEAARIFIPGGATLY 180

Query: 547  SARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
            SART+KIKE+EG RTYYFYEFG D QHVALVAAVNSGKA IAGA AP+SKW+DDG+KLR
Sbjct: 181  SARTLKIKEEEGYRTYYFYEFGRDDQHVALVAAVNSGKAIIAGATAPQSKWDDDGMKLR 239


>ref|XP_004244954.1| PREDICTED: uncharacterized protein LOC101266541 [Solanum
            lycopersicum]
          Length = 235

 Score =  288 bits (738), Expect = 2e-75
 Identities = 156/242 (64%), Positives = 175/242 (72%)
 Frame = -3

Query: 1096 LHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLAL 917
            L LS S  P  P    +KPN        S C   NS  +RQ    G             L
Sbjct: 7    LSLSVSIHPPKPL---QKPN--------SMCTQPNSISRRQVFFTGSN-----------L 44

Query: 916  RISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLY 737
             +S   P      NS S LSGI+NTKSWFQF+GDGF+IRVPP+F+D+TEPEDY+AGLSLY
Sbjct: 45   LLSQLIPKSDAQTNSNSFLSGIANTKSWFQFYGDGFSIRVPPEFQDLTEPEDYNAGLSLY 104

Query: 736  GDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGS 557
            GDK KPK +AARFAS DG EV+SV+IRPSNQLKITFLEAKD+ DLGSLK+AAKIFVP GS
Sbjct: 105  GDKAKPKKFAARFASSDGSEVLSVIIRPSNQLKITFLEAKDITDLGSLKEAAKIFVPAGS 164

Query: 556  TLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVK 377
            TLYS RTIKIKEDEG RTYYFYEF  ++QHVALVA VNSGKA IAGA APESKW +DG+K
Sbjct: 165  TLYSVRTIKIKEDEGFRTYYFYEFVRNEQHVALVAGVNSGKAVIAGATAPESKWAEDGLK 224

Query: 376  LR 371
            LR
Sbjct: 225  LR 226


>ref|XP_006471883.1| PREDICTED: uncharacterized protein LOC102627917 isoform X1 [Citrus
            sinensis]
          Length = 284

 Score =  288 bits (736), Expect = 4e-75
 Identities = 160/277 (57%), Positives = 196/277 (70%), Gaps = 37/277 (13%)
 Frame = -3

Query: 1090 LSFSSLPKNPSNLQ-RKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLALR 914
            LSFS  P +P N + + PN  SL     + + +++  KR+F L    T+ S+ S+ L   
Sbjct: 5    LSFSLFPLHPPNPKPQNPNPISL----PTNIQTSTFTKRKFILK--TTSLSLFSIALLQP 58

Query: 913  ISLSAPAIAETKNSKSI--LSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSL 740
                + A+AET  S  +  L+GI+NTKSWFQF+GDGF+IRVPPQFEDI+EPEDY+AGLSL
Sbjct: 59   PFAQSLALAETSPSPPLKALTGIANTKSWFQFYGDGFSIRVPPQFEDISEPEDYNAGLSL 118

Query: 739  YGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGG 560
            YGDK KPKT+AARFA+PDG EV+SV+IRPSNQLKITFLEA+D+ D G+LKDAAKIFVPGG
Sbjct: 119  YGDKAKPKTFAARFATPDGSEVLSVVIRPSNQLKITFLEAQDITDFGTLKDAAKIFVPGG 178

Query: 559  STLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGK------------------ 434
            +TLYSARTIKI+E+EG ++YYFYEFG D+QHVALVAA+NSGK                  
Sbjct: 179  ATLYSARTIKIREEEGFKSYYFYEFGRDEQHVALVAAINSGKRHLDAPMCPSLNRRLLSG 238

Query: 433  ----------------AYIAGAAAPESKWEDDGVKLR 371
                            AYIAGA APESKW+DDGVKLR
Sbjct: 239  WLIHSAVGISLINVSEAYIAGATAPESKWDDDGVKLR 275


>ref|XP_002512336.1| conserved hypothetical protein [Ricinus communis]
            gi|223548297|gb|EEF49788.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 240

 Score =  288 bits (736), Expect = 4e-75
 Identities = 150/227 (66%), Positives = 180/227 (79%), Gaps = 1/227 (0%)
 Frame = -3

Query: 1048 RKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLDLALRISLSAPAIAETKN-S 872
            + PN       K +    ++  KRQF L    T+  VISL     +  S   +AE+ + S
Sbjct: 10   QNPNQSIFNTLKPNPTLRSNVSKRQFLLK--TTSLGVISLTTQNPLIQS---LAESSSPS 64

Query: 871  KSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKPKTYAARFAS 692
            K  LSGI+NTKSWFQF+GDGF+IRVPPQF+DI EPED++AGLSLYGDK KP+T+AARFAS
Sbjct: 65   KPGLSGIANTKSWFQFYGDGFSIRVPPQFQDIMEPEDFNAGLSLYGDKAKPRTFAARFAS 124

Query: 691  PDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSARTIKIKEDEG 512
             DG EV+SV+IRPSNQLKITFLEAKD+ DLGSLK+AAK+FVPGGSTLYSAR IK+KE+EG
Sbjct: 125  SDGSEVLSVVIRPSNQLKITFLEAKDITDLGSLKEAAKVFVPGGSTLYSARAIKVKEEEG 184

Query: 511  LRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
             RTYYFYEFG ++QHVALVAA+NSGKA IAGA AP+S+W+ DGVKLR
Sbjct: 185  FRTYYFYEFGREEQHVALVAAINSGKAIIAGATAPQSRWDTDGVKLR 231


>ref|XP_002872272.1| hypothetical protein ARALYDRAFT_910845 [Arabidopsis lyrata subsp.
            lyrata] gi|297318109|gb|EFH48531.1| hypothetical protein
            ARALYDRAFT_910845 [Arabidopsis lyrata subsp. lyrata]
          Length = 241

 Score =  280 bits (717), Expect = 6e-73
 Identities = 146/245 (59%), Positives = 177/245 (72%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLD 926
            MA L  S S  P NP    + PN   +            SP   F    +   AS+  + 
Sbjct: 1    MAILLHSLSLHPPNPK--PQNPNKPKIV-----------SPFATFRRDVVLRTASLCFVS 47

Query: 925  LALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGL 746
              ++  +        K+SK +  GI+NTKSWFQFFGDGFAIRVPP FED+ EPEDYSAGL
Sbjct: 48   FIIQNPIQESLADPLKSSKPLRLGIANTKSWFQFFGDGFAIRVPPDFEDVNEPEDYSAGL 107

Query: 745  SLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVP 566
            SLYGDK KP+T+AARF +PDG EV+SV+IRPSNQLKITFLEAKD++DLGSLK AA++FVP
Sbjct: 108  SLYGDKAKPQTFAARFQTPDGSEVLSVVIRPSNQLKITFLEAKDISDLGSLKAAARLFVP 167

Query: 565  GGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDD 386
            G +T+YSARTIK+KE+EGLR YYFYEFG D++ +ALVA+VN GK YIAGAAAPESKW+DD
Sbjct: 168  GAATIYSARTIKVKEEEGLRNYYFYEFGRDEERIALVASVNRGKVYIAGAAAPESKWKDD 227

Query: 385  GVKLR 371
             +KLR
Sbjct: 228  ELKLR 232


>ref|XP_004156434.1| PREDICTED: uncharacterized protein LOC101230696 isoform 2 [Cucumis
           sativus]
          Length = 249

 Score =  278 bits (712), Expect = 2e-72
 Identities = 142/211 (67%), Positives = 166/211 (78%), Gaps = 3/211 (1%)
 Frame = -3

Query: 994 NSSPKRQFALIGIATAASVISLDLALRISLSAPAIAETKNS---KSILSGISNTKSWFQF 824
           NS  KR F L          SL L   I    P +  ++NS   K  L  I+NTKSWFQF
Sbjct: 38  NSKSKRHFIL-------KTASLCLISFIP-KCPVVQSSENSPTSKPGLPAIANTKSWFQF 89

Query: 823 FGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQ 644
           +GDGF+IRVPPQFED+TEPEDYSAGLSLYGDK K KT+AARF SPDG EV+SV+ RP+NQ
Sbjct: 90  YGDGFSIRVPPQFEDLTEPEDYSAGLSLYGDKAKTKTFAARFGSPDGSEVLSVVTRPTNQ 149

Query: 643 LKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHV 464
           LKITFLEAKD+ D+GSL++AAKIFVPGGSTL+SART KIKEDEG RTYYFYEFG ++QHV
Sbjct: 150 LKITFLEAKDITDIGSLREAAKIFVPGGSTLFSARTFKIKEDEGFRTYYFYEFGKNEQHV 209

Query: 463 ALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
           ALVA VNSG+ ++AGA AP SKW++DG+KLR
Sbjct: 210 ALVATVNSGQVFVAGATAPVSKWDEDGIKLR 240


>ref|XP_004139199.1| PREDICTED: uncharacterized protein LOC101214471 isoform 2 [Cucumis
           sativus]
          Length = 249

 Score =  278 bits (711), Expect = 3e-72
 Identities = 142/211 (67%), Positives = 166/211 (78%), Gaps = 3/211 (1%)
 Frame = -3

Query: 994 NSSPKRQFALIGIATAASVISLDLALRISLSAPAIAETKNS---KSILSGISNTKSWFQF 824
           NS  KR F L          SL L   I    P +  ++NS   K  L  I+NTKSWFQF
Sbjct: 38  NSKSKRHFIL-------KTASLCLISFIP-KCPVVQSSENSPTSKPGLPAIANTKSWFQF 89

Query: 823 FGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQ 644
           +GDGF+IRVPPQFED+TEPEDYSAGLSLYGDK K KT+AARF SPDG EV+SV+ RP+NQ
Sbjct: 90  YGDGFSIRVPPQFEDLTEPEDYSAGLSLYGDKAKTKTFAARFGSPDGSEVLSVVTRPTNQ 149

Query: 643 LKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHV 464
           LKITFLEAKD+ D+GSL++AAKIFVPGGSTL+SART KIKEDEG RTYYFYEFG ++QHV
Sbjct: 150 LKITFLEAKDITDIGSLREAAKIFVPGGSTLFSARTFKIKEDEGFRTYYFYEFGKNEQHV 209

Query: 463 ALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
           ALVA VNSG+ ++AGA AP SKW++DG+KLR
Sbjct: 210 ALVATVNSGQVFVAGATAPLSKWDEDGIKLR 240


>gb|EYU26892.1| hypothetical protein MIMGU_mgv1a012366mg [Mimulus guttatus]
          Length = 252

 Score =  277 bits (709), Expect = 5e-72
 Identities = 149/246 (60%), Positives = 178/246 (72%), Gaps = 1/246 (0%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLD 926
            M+ + LS S  P  P     + + H  F F S   +S    +R F L     + S  S  
Sbjct: 3    MSRIVLSLSPTPPPPPTTIPESHPHPSF-FDSFFTFS----RRSFILTSSVLSLSSSSSS 57

Query: 925  LALRISL-SAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAG 749
            LA       +P    + ++KS LSGI +TKSWFQF+G GFAIRVPP FEDI EPEDY+AG
Sbjct: 58   LAQTPPPPKSPPPPSSSSAKSFLSGIGSTKSWFQFYGSGFAIRVPPNFEDIMEPEDYNAG 117

Query: 748  LSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFV 569
            LSLYGDK KPKT+AARFASPDG EV+SV++RPSNQLKITFLEAKD+ADLGSLK+A++IFV
Sbjct: 118  LSLYGDKAKPKTFAARFASPDGSEVLSVVVRPSNQLKITFLEAKDIADLGSLKEASRIFV 177

Query: 568  PGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWED 389
            P G TLYSAR IKIKE++G R YYFYEFG D+Q VALVAAVNSGK  IAG  AP++KW+D
Sbjct: 178  PVGVTLYSARIIKIKEEDGYRNYYFYEFGADEQRVALVAAVNSGKVIIAGVTAPQNKWDD 237

Query: 388  DGVKLR 371
             GV+LR
Sbjct: 238  HGVRLR 243


>ref|XP_006288497.1| hypothetical protein CARUB_v10001762mg [Capsella rubella]
            gi|482557203|gb|EOA21395.1| hypothetical protein
            CARUB_v10001762mg [Capsella rubella]
          Length = 258

 Score =  277 bits (709), Expect = 5e-72
 Identities = 146/245 (59%), Positives = 174/245 (71%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLD 926
            MA L  S S  P NP     KP   +  R  S C          F    +   AS+  + 
Sbjct: 18   MAILLHSLSLHPPNP-----KPQNQNRPRILSPCA--------TFRRDVVLRTASLCFVS 64

Query: 925  LALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGL 746
               +  +        K+SK +  GI+NTKSWFQFFGDGFAIRVPP FED+ EPEDYSAGL
Sbjct: 65   FIYQNQVPESLADPEKSSKPLRLGIANTKSWFQFFGDGFAIRVPPDFEDVNEPEDYSAGL 124

Query: 745  SLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVP 566
            SLYGDK KPKT+AARF +PDG EV+SV+IRPSNQLKITFLEA D++DLGSLK AA++FVP
Sbjct: 125  SLYGDKAKPKTFAARFQTPDGSEVLSVVIRPSNQLKITFLEATDISDLGSLKAAARLFVP 184

Query: 565  GGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDD 386
            G +T+Y+ARTIK+KE+EGLR YYFYEFG D++ +ALVAAVN GK YI GAAAPESKW+DD
Sbjct: 185  GAATIYAARTIKVKEEEGLRNYYFYEFGRDEERIALVAAVNRGKVYIVGAAAPESKWKDD 244

Query: 385  GVKLR 371
             +KLR
Sbjct: 245  ELKLR 249


>dbj|BAC43363.1| unknown protein [Arabidopsis thaliana]
          Length = 258

 Score =  276 bits (706), Expect = 1e-71
 Identities = 145/245 (59%), Positives = 179/245 (73%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLD 926
            MA L  S S  P NP     KP       +K   L S+++ +R   L      AS+  + 
Sbjct: 18   MAILLHSLSLHPPNP-----KPQNP----YKPKILSSSATFRRDVVL----RTASLCFVS 64

Query: 925  LALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGL 746
               +  +        K++K +  GI+NTKSWFQ+FG GFAIRVPP FED+ EPEDYSAGL
Sbjct: 65   FIFQNQIPESLADPLKSTKPLRLGIANTKSWFQYFGSGFAIRVPPDFEDVNEPEDYSAGL 124

Query: 745  SLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVP 566
            SLYGDK KP+T+AARF +PDG EV+SV+IRPSNQLKITFLEAKD++DLGSLK AA++FVP
Sbjct: 125  SLYGDKAKPQTFAARFQTPDGSEVLSVVIRPSNQLKITFLEAKDISDLGSLKAAARLFVP 184

Query: 565  GGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDD 386
            G +T+YSARTIK+KE+EGLR YYFYEFG D++ +ALVA+VN GK YIAGAAAPESKW+DD
Sbjct: 185  GAATIYSARTIKVKEEEGLRNYYFYEFGRDEERIALVASVNRGKVYIAGAAAPESKWKDD 244

Query: 385  GVKLR 371
             +KLR
Sbjct: 245  ELKLR 249


>ref|NP_198091.2| Mog1/PsbP/DUF1795-like photosystem II reaction center PsbP family
            protein [Arabidopsis thaliana]
            gi|332006298|gb|AED93681.1| Mog1/PsbP/DUF1795-like
            photosystem II reaction center PsbP family protein
            [Arabidopsis thaliana]
          Length = 241

 Score =  276 bits (706), Expect = 1e-71
 Identities = 145/245 (59%), Positives = 179/245 (73%)
 Frame = -3

Query: 1105 MASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISLD 926
            MA L  S S  P NP     KP       +K   L S+++ +R   L      AS+  + 
Sbjct: 1    MAILLHSLSLHPPNP-----KPQNP----YKPKILSSSATFRRDVVL----RTASLCFVS 47

Query: 925  LALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGL 746
               +  +        K++K +  GI+NTKSWFQ+FG GFAIRVPP FED+ EPEDYSAGL
Sbjct: 48   FIFQNQIPESLADPLKSTKPLRLGIANTKSWFQYFGSGFAIRVPPDFEDVNEPEDYSAGL 107

Query: 745  SLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVP 566
            SLYGDK KP+T+AARF +PDG EV+SV+IRPSNQLKITFLEAKD++DLGSLK AA++FVP
Sbjct: 108  SLYGDKAKPQTFAARFQTPDGSEVLSVVIRPSNQLKITFLEAKDISDLGSLKAAARLFVP 167

Query: 565  GGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDD 386
            G +T+YSARTIK+KE+EGLR YYFYEFG D++ +ALVA+VN GK YIAGAAAPESKW+DD
Sbjct: 168  GAATIYSARTIKVKEEEGLRNYYFYEFGRDEERIALVASVNRGKVYIAGAAAPESKWKDD 227

Query: 385  GVKLR 371
             +KLR
Sbjct: 228  ELKLR 232


>ref|XP_007030936.1| Mog1/PsbP/DUF1795-like photosystem II reaction center PsbP family
            protein isoform 1 [Theobroma cacao]
            gi|508719541|gb|EOY11438.1| Mog1/PsbP/DUF1795-like
            photosystem II reaction center PsbP family protein
            isoform 1 [Theobroma cacao]
          Length = 253

 Score =  275 bits (703), Expect = 3e-71
 Identities = 144/246 (58%), Positives = 182/246 (73%)
 Frame = -3

Query: 1108 FMASLHLSFSSLPKNPSNLQRKPNYHSLFRFKSSCLYSNSSPKRQFALIGIATAASVISL 929
            ++ SL     SL  +P    + P   SL     + +    + +RQF +   +    + + 
Sbjct: 4    YLPSLMALLLSLSLHPPKPPQNPKLTSLNPPVPTSVLKTLNIRRQFIINSTSLCIILWAP 63

Query: 928  DLALRISLSAPAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAG 749
               +  SL+ P+      SK  L+ I+NTKSWFQF+GDGFAIRVPP+FEDI EPED++AG
Sbjct: 64   QNPVPQSLAEPSTT----SKPALN-IANTKSWFQFYGDGFAIRVPPEFEDIMEPEDFNAG 118

Query: 748  LSLYGDKVKPKTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFV 569
             SLYGDK KP+T+AARFAS DG EV+SV+IR +NQLKITFLEA+D+ DLGS+K+AA+IFV
Sbjct: 119  ASLYGDKAKPRTFAARFASTDGSEVLSVVIRRTNQLKITFLEAQDITDLGSIKEAARIFV 178

Query: 568  PGGSTLYSARTIKIKEDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWED 389
            PGG+TLY+ARTIKIKEDEG +TYYFYEFG D+QH+ALVA VNSGKA IAGA AP+SKW+D
Sbjct: 179  PGGATLYNARTIKIKEDEGFKTYYFYEFGRDEQHIALVATVNSGKAVIAGATAPQSKWDD 238

Query: 388  DGVKLR 371
            DGVKLR
Sbjct: 239  DGVKLR 244


>gb|EPS66196.1| hypothetical protein M569_08583, partial [Genlisea aurea]
          Length = 186

 Score =  272 bits (696), Expect = 2e-70
 Identities = 135/177 (76%), Positives = 154/177 (87%), Gaps = 1/177 (0%)
 Frame = -3

Query: 898 PAIAETKNSKSILSGISNTKSWFQFFGDGFAIRVPPQFEDITEPEDYSAGLSLYGDKVKP 719
           PA+AE  NS   LSGI+NTKSWFQ++GDGFAIRVPP FEDI EPEDY AG SLYGDK KP
Sbjct: 3   PALAEPPNS--FLSGIANTKSWFQYYGDGFAIRVPPNFEDIMEPEDYDAGRSLYGDKAKP 60

Query: 718 KTYAARFASPDGFEVVSVLIRPSNQLKITFLEAKDVADLGSLKDAAKIFVPGGSTLYSAR 539
           KT+AARFASPDG EV+SV+IRP NQLKITFLEA+DV DLGSLK+AA IFVPGG+TL+SAR
Sbjct: 61  KTFAARFASPDGSEVISVVIRPCNQLKITFLEARDVVDLGSLKEAAGIFVPGGATLFSAR 120

Query: 538 TIKIK-EDEGLRTYYFYEFGLDKQHVALVAAVNSGKAYIAGAAAPESKWEDDGVKLR 371
            IKIK ED G R YYFYEFG +++HVALVA+VNSGKA IAGA AP+ KW++DGV+LR
Sbjct: 121 MIKIKEEDGGFRNYYFYEFGRNEEHVALVASVNSGKAIIAGATAPQKKWDEDGVRLR 177

