BLASTX nr result

ID: Sinomenium21_contig00023384 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00023384
         (1075 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   271   3e-70
ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prun...   271   3e-70
ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfam...   264   5e-68
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   261   3e-67
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   260   6e-67
ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   243   1e-61
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   240   8e-61
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                239   1e-60
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   239   1e-60
ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [A...   239   2e-60
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   238   2e-60
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   237   5e-60
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   236   9e-60
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   236   9e-60
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   236   1e-59
gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus...   228   4e-57
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     223   1e-55
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   219   2e-54
emb|CAB86037.1| putative protein [Arabidopsis thaliana]               217   6e-54
ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phas...   203   9e-50

>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  271 bits (694), Expect = 3e-70
 Identities = 140/244 (57%), Positives = 169/244 (69%)
 Frame = +2

Query: 344  YADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVEAV 523
            Y+DLA+KL  DGRFDDF  + E           +       +LVSAGISGL+ +GRV  V
Sbjct: 68   YSDLATKLVQDGRFDDFSTMAETLILSGVELSQL------VELVSAGISGLLREGRVYCV 121

Query: 524  IGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKEFV 703
            + V R+V  LGI P +LFDGS   LL  ECRR++  G+++ VVEL+EIL GFH PVK+ +
Sbjct: 122  VEVLRKVDKLGICPLELFDGSTLELLSKECRRILNCGQVEEVVELIEILDGFHFPVKKLL 181

Query: 704  DPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCNSN 883
            +P   IK+CV K +P++AVRYACILP + I F +II EFGKKRDL SAL  FEASK    
Sbjct: 182  EPLDFIKICVNKRNPNLAVRYACILPHAQILFCTIIHEFGKKRDLGSALTAFEASKQKLI 241

Query: 884  GPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLRVY 1063
            GPNMY  R ++DVCG C    KSRYIYEEL+ Q +TPN YVFNSLMNVN HDLSYT  VY
Sbjct: 242  GPNMYCYRTMIDVCGLCSHYQKSRYIYEELLAQKITPNIYVFNSLMNVNVHDLSYTFNVY 301

Query: 1064 KQMQ 1075
            K MQ
Sbjct: 302  KNMQ 305


>ref|XP_007208081.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
            gi|462403723|gb|EMJ09280.1| hypothetical protein
            PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  271 bits (693), Expect = 3e-70
 Identities = 134/245 (54%), Positives = 172/245 (70%)
 Frame = +2

Query: 341  YYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVEA 520
            Y+ADLASKLA DG+F DF M+ E            F   L  +LV+ GISGL+ +G+V +
Sbjct: 76   YFADLASKLARDGKFQDFAMVVESVVLSGVRGSE-FTAALKLELVAKGISGLLKEGKVRS 134

Query: 521  VIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKEF 700
            V+ V  +V  LG+ P  LFDG    LLG +C RL+K  +++ +VELME LAG+  P+KE 
Sbjct: 135  VVEVLGKVNELGVPPLKLFDGYAMELLGRQCSRLLKCKQVQELVELMEALAGYRFPIKEL 194

Query: 701  VDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCNS 880
            ++P ++IK+CV+KC P +A+RYACI P +HI F +II EFGK++ L  AL  +EASK N 
Sbjct: 195  LEPSEVIKLCVDKCCPKLAIRYACIFPHAHILFCNIIYEFGKRKALEPALAAYEASKENL 254

Query: 881  NGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLRV 1060
            NG NMY  R I+DVCG C D +KSRYIYE+L+ Q VTPN YVFNSLMNVNAHDL+YT  V
Sbjct: 255  NGSNMYVYRTIIDVCGLCKDYMKSRYIYEDLLKQKVTPNIYVFNSLMNVNAHDLNYTFHV 314

Query: 1061 YKQMQ 1075
            YK MQ
Sbjct: 315  YKSMQ 319


>ref|XP_007027210.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao] gi|508715815|gb|EOY07712.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein,
            putative [Theobroma cacao]
          Length = 858

 Score =  264 bits (674), Expect = 5e-68
 Identities = 136/246 (55%), Positives = 170/246 (69%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            KYYADLASKLA DGR +DF MI E           + +  L  + VS G++  + +G+V+
Sbjct: 84   KYYADLASKLAEDGRLEDFAMIVEMLVASGVNAPRI-VSMLSVQFVSKGVASNVQEGKVK 142

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+ V ++V  LGI+P  L DG     +  E +R+V  G ++  V+L+E L GF   +KE
Sbjct: 143  SVVEVLKKVEKLGIAPSKLVDGFGLVSMKREFQRIVGSGEVEQAVDLLEALRGFQFTIKE 202

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDP  IIK+CV+K +P++AVRYAC+LP + I F SII EFGKKRDL SAL  +EASK N
Sbjct: 203  LVDPSYIIKVCVDKRNPNLAVRYACLLPHAKILFCSIISEFGKKRDLASALTAYEASKKN 262

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             +GPNMY  RAI+D CG CGD LKSR IYE+LV Q VTPN YVFNSLMNVNAHDL YTL 
Sbjct: 263  LSGPNMYLYRAIIDACGLCGDYLKSRNIYEDLVNQRVTPNIYVFNSLMNVNAHDLGYTLD 322

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 323  VYKDMQ 328


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  261 bits (668), Expect = 3e-67
 Identities = 132/245 (53%), Positives = 166/245 (67%)
 Frame = +2

Query: 341  YYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVEA 520
            Y+ADLASKLA DG+  DF M+ E            F   L   +VS GISG++ DG+V  
Sbjct: 77   YFADLASKLARDGKLHDFSMLLESVVLSGVKPS-QFTAALQLDMVSRGISGILKDGKVGG 135

Query: 521  VIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKEF 700
            ++ V  +V  LG+ P +LFDG    LLG  C RL+K  +++ +VELME+L G H P++E 
Sbjct: 136  LVEVLVKVAELGVRPVELFDGYAMELLGAHCLRLLKFKQVQELVELMEVLYGLHFPIREL 195

Query: 701  VDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCNS 880
            VDP ++IK CVEK  P +A+RYACI P SH+ F +I+ EFGKKR L SAL  +EASK   
Sbjct: 196  VDPSEVIKACVEKRRPKLAIRYACIFPHSHMLFCNIMYEFGKKRALASALTAYEASKEKL 255

Query: 881  NGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLRV 1060
            +G NMY  R I+DVCG C D +KSRYIYE+L+ Q V PN YVFNSLMNVN+HDLSYT  V
Sbjct: 256  SGSNMYIYRTIIDVCGVCKDYMKSRYIYEDLLKQKVIPNIYVFNSLMNVNSHDLSYTFHV 315

Query: 1061 YKQMQ 1075
            YK MQ
Sbjct: 316  YKSMQ 320


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  260 bits (665), Expect = 6e-67
 Identities = 131/250 (52%), Positives = 176/250 (70%)
 Frame = +2

Query: 326  ANRFKYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICD 505
            ++R  YYAD+ASKLA DGR ++F MI E            F   L  ++V++GI   I +
Sbjct: 68   SSRNDYYADMASKLAKDGRLEEFAMIVESVVVSEGNVSK-FASMLSLEMVASGIVKSIGE 126

Query: 506  GRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHL 685
            GR++ V+GV +++  LG++P +LF GS   LL  EC+RL+  G ++  V LME+L  F L
Sbjct: 127  GRIDCVVGVLKKLNELGVAPLELFHGSGFKLLKNECQRLLDSGEVEMFVGLMEVLEEFRL 186

Query: 686  PVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEA 865
            PVKE  + F+I+++CV K D ++A+RYACI+PR+ I F + ++EFGKKRDLVSAL  +EA
Sbjct: 187  PVKELDEEFRIVQLCVNKPDVNLAIRYACIVPRADILFCNFVREFGKKRDLVSALRAYEA 246

Query: 866  SKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLS 1045
            SK + + PNMY CR I+DVCG CGD +KSR IYE+L +Q VT N YVFNSLMNVNAHDL 
Sbjct: 247  SKKHLSSPNMYICRTIIDVCGLCGDYMKSRAIYEDLRSQNVTLNIYVFNSLMNVNAHDLK 306

Query: 1046 YTLRVYKQMQ 1075
            +TL VYK MQ
Sbjct: 307  FTLEVYKNMQ 316


>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  243 bits (620), Expect = 1e-61
 Identities = 133/249 (53%), Positives = 160/249 (64%)
 Frame = +2

Query: 329  NRFKYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDG 508
            N  KYYA+LASKLA DGRFDD LMI E            F   L+ KLVS GI  L+ + 
Sbjct: 73   NGLKYYAELASKLAQDGRFDDSLMIAESVVVSGVNAAE-FAALLNVKLVSGGIVRLLEER 131

Query: 509  RVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLP 688
            +V +V+ +    + LGI P  L DG     L  ECRR +  G ++ VV LME L G  +P
Sbjct: 132  KVGSVVELLNGAQQLGIDPLKLLDGDALNALSRECRRTMGCGEIEEVVSLMETLKGCGMP 191

Query: 689  VKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEAS 868
            +K+ V P +I+++CV +  P+ AVRYA I P   I F +II EFGKK DLVSAL VFEAS
Sbjct: 192  IKDLVKPSEILRLCVSQRKPNAAVRYAHIFPHVDIMFCTIILEFGKKGDLVSALTVFEAS 251

Query: 869  KCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSY 1048
            K N + PN+Y  R  +DVCG CGD LKSR IYE L+    TPN YVFNSLMNVNA DLSY
Sbjct: 252  KQNQDTPNLYIYRTAIDVCGLCGDYLKSRSIYEGLIASKFTPNIYVFNSLMNVNACDLSY 311

Query: 1049 TLRVYKQMQ 1075
            TL +YKQMQ
Sbjct: 312  TLDIYKQMQ 320


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  240 bits (612), Expect = 8e-61
 Identities = 117/246 (47%), Positives = 160/246 (65%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR +D  +I E            F   +D  L+S GIS  +  G++E
Sbjct: 82   EYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDFDLLSKGISSNLRQGKIE 141

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+  + R +    +++  ++LMEILAG    +KE
Sbjct: 142  SVVYTLKRIEKVGIAPLDLVDESSVKLMRKQFRAMANSVQVEKAIDLMEILAGLRFKIKE 201

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF I+K CV+  +P++A+RYAC+LP + I    II  FGKK D+VS +  +EA K  
Sbjct: 202  LVDPFDIVKSCVDISNPELAIRYACLLPHTEILLCRIILGFGKKGDMVSVMTAYEACKQI 261

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY CR ++DVCG CGD +KSRYIYE+L+ + V PN YV NSLMNVN+HDL YTL+
Sbjct: 262  LDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENVKPNIYVMNSLMNVNSHDLGYTLK 321

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 322  VYKNMQ 327


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  239 bits (610), Expect = 1e-60
 Identities = 115/246 (46%), Positives = 159/246 (64%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR +D  +I E            F   +D  L+S GIS  +  G++E
Sbjct: 82   EYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYDLLSKGISSNLRQGKIE 141

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+  + R +    +++  ++LMEILAG    +KE
Sbjct: 142  SVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEKAIDLMEILAGLGFKIKE 201

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF ++K CVE  +P +A+RYAC+LP + +    II  FGKK D+VS +  +EA K  
Sbjct: 202  LVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVMTAYEACKQI 261

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY CR ++DVCG CGD +KSRYIYE+L+ + + PN YV NSLMNVN+HDL YTL+
Sbjct: 262  LDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMNVNSHDLGYTLK 321

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 322  VYKNMQ 327


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  239 bits (610), Expect = 1e-60
 Identities = 115/246 (46%), Positives = 159/246 (64%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR +D  +I E            F   +D  L+S GIS  +  G++E
Sbjct: 82   EYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYDLLSKGISSNLRQGKIE 141

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+  + R +    +++  ++LMEILAG    +KE
Sbjct: 142  SVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEKAIDLMEILAGLGFKIKE 201

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF ++K CVE  +P +A+RYAC+LP + +    II  FGKK D+VS +  +EA K  
Sbjct: 202  LVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVMTAYEACKQI 261

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY CR ++DVCG CGD +KSRYIYE+L+ + + PN YV NSLMNVN+HDL YTL+
Sbjct: 262  LDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMNVNSHDLGYTLK 321

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 322  VYKNMQ 327


>ref|XP_006828302.1| hypothetical protein AMTR_s00023p00232870 [Amborella trichopoda]
            gi|548832949|gb|ERM95718.1| hypothetical protein
            AMTR_s00023p00232870 [Amborella trichopoda]
          Length = 855

 Score =  239 bits (609), Expect = 2e-60
 Identities = 124/260 (47%), Positives = 167/260 (64%), Gaps = 3/260 (1%)
 Frame = +2

Query: 305  DLAPQSGANR---FKYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLV 475
            D+ P  G       K+YA +ASKLA +GR D+F M+ E            F++ L  K V
Sbjct: 55   DIRPDLGLQNPSSLKFYASMASKLAENGRLDEFSMLAESFIGSGMAPGH-FVEALSIKHV 113

Query: 476  SAGISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVE 655
            SAG +  + +G  + V+GV  +   LGI P  +FDGS + LL   CRR++    +   V 
Sbjct: 114  SAGFALCLKNGEFDTVLGVMEKFDKLGICPSLIFDGSARRLLLSACRRVLDGDNIGEFVR 173

Query: 656  LMEILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRD 835
            L+EI AG+   VK+ V P  I++ C+++ DP MA RYA ILP + +WF+ +I EFGKK+D
Sbjct: 174  LVEIFAGYRFSVKDVVKPTFILQACIDRHDPFMAGRYASILPHADVWFNFLICEFGKKKD 233

Query: 836  LVSALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNS 1015
            L SALV FE SK  S  PNMY  R+I+D CG+CGD+LKSR I+E+L+ Q +TPNT+VFNS
Sbjct: 234  LQSALVAFEVSKGKSVSPNMYIYRSIIDACGYCGDSLKSRSIFEDLLVQKITPNTFVFNS 293

Query: 1016 LMNVNAHDLSYTLRVYKQMQ 1075
            LMNVNAHD  Y L +YKQM+
Sbjct: 294  LMNVNAHDSHYALHIYKQMK 313


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  238 bits (608), Expect = 2e-60
 Identities = 127/259 (49%), Positives = 168/259 (64%), Gaps = 2/259 (0%)
 Frame = +2

Query: 305  DLAPQSGANRF--KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVS 478
            D+A  S   R   ++YA +ASKLA  G+ +DF M+ E            F   L  +LV+
Sbjct: 64   DIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPS-QFGAMLAVELVA 122

Query: 479  AGISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVEL 658
             GIS  + +G+V +V+ V R+V  LGIS  +L D      L  +CRR+ K G L+ +VEL
Sbjct: 123  KGISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDCRRMAKSGELEELVEL 182

Query: 659  MEILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDL 838
            ME+L+GF   V+E + P ++IK+CV+  +P MA+RYA ILP + I F + I EFGKKRDL
Sbjct: 183  MEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDL 242

Query: 839  VSALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSL 1018
             SA + +  SK N NG NMY  R I+DVCG CGD  KSR IY++LV Q VTPN +VFNSL
Sbjct: 243  KSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVTPNIFVFNSL 302

Query: 1019 MNVNAHDLSYTLRVYKQMQ 1075
            MNVNAHDL+YT ++YK MQ
Sbjct: 303  MNVNAHDLNYTFQLYKNMQ 321


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  237 bits (605), Expect = 5e-60
 Identities = 134/258 (51%), Positives = 163/258 (63%), Gaps = 1/258 (0%)
 Frame = +2

Query: 305  DLAPQSGA-NRFKYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSA 481
            D A  SG+ N  KYYA+LASKLA DGRFDD LMI E            F   L+ KLVS 
Sbjct: 66   DSASASGSCNGLKYYAELASKLAQDGRFDDSLMIAESVVVSGVNAEE-FTALLNVKLVSG 124

Query: 482  GISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELM 661
            GI  L+ + +V +V+ +    + LGI P  L D      L  ECRR ++   ++ VV LM
Sbjct: 125  GIVRLLEERKVGSVVELLNGAQQLGIDPSKLLDEDSINALSRECRRTMQCSEIEEVVSLM 184

Query: 662  EILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLV 841
            E L G  +P+K+ V P +I+++CV +  P+ AVRYA I P   I F +II EFGKK DL 
Sbjct: 185  ETLRGCGMPIKDLVKPSEILRLCVSQRKPNAAVRYAHIFPHVDIMFCTIILEFGKKGDLA 244

Query: 842  SALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLM 1021
            SAL VFEASK N + PN+Y  R  +DVCG CGD LKSR IYE L+    TPN YVFNSLM
Sbjct: 245  SALTVFEASKQNQDTPNLYIYRTAIDVCGLCGDYLKSRSIYEGLIASKFTPNIYVFNSLM 304

Query: 1022 NVNAHDLSYTLRVYKQMQ 1075
            NVNA DLSYTL +YKQMQ
Sbjct: 305  NVNACDLSYTLDIYKQMQ 322


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  236 bits (603), Expect = 9e-60
 Identities = 115/246 (46%), Positives = 159/246 (64%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR  D  +I E            F   +D+ L+S GIS  +  G++E
Sbjct: 80   EYYADFASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSDLLSKGISLNLRQGKIE 139

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+    R +    +++  ++LMEILAGF   +KE
Sbjct: 140  SVVYTLQRIEKVGIAPLDLVDESSVKLMRKHFRAMANSVQVEKAIDLMEILAGFRFKIKE 199

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF ++K+CV+  +P +A+RYAC+LP + +    II  FGKK D+VS L  +EA K  
Sbjct: 200  LVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVLTAYEACKQI 259

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY  R ++DVCG CGD +KSRYIYE+L+ + + PN YV NSLMNVN+HDL YTL+
Sbjct: 260  LDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVMNSLMNVNSHDLGYTLK 319

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 320  VYKNMQ 325


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  236 bits (603), Expect = 9e-60
 Identities = 115/246 (46%), Positives = 159/246 (64%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR  D  +I E            F   +D+ L+S GIS  +  G++E
Sbjct: 80   EYYADFASKLAEDGRIQDVALIAETLAAESGANVARFASMVDSDLLSKGISLNLRQGKIE 139

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+    R +    +++  ++LMEILAGF   +KE
Sbjct: 140  SVVYTLQRIEKVGIAPLDLVDESSVKLMRKHFRAMANSVQVEKAIDLMEILAGFRFKIKE 199

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF ++K+CV+  +P +A+RYAC+LP + +    II  FGKK D+VS L  +EA K  
Sbjct: 200  LVDPFDVVKICVDISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVSVLTAYEACKQI 259

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY  R ++DVCG CGD +KSRYIYE+L+ + + PN YV NSLMNVN+HDL YTL+
Sbjct: 260  LDNPNMYIYRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVMNSLMNVNSHDLGYTLK 319

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 320  VYKNMQ 325


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  236 bits (602), Expect = 1e-59
 Identities = 126/259 (48%), Positives = 167/259 (64%), Gaps = 2/259 (0%)
 Frame = +2

Query: 305  DLAPQSGANRF--KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVS 478
            D+A  S   R   ++YA +ASKLA  G+ +DF M+ E            F   L  +LV+
Sbjct: 64   DIAGASSGGRIPIQHYAGVASKLAEGGKLEDFAMVVESVVVAGVEPS-QFGAMLAVELVA 122

Query: 479  AGISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVEL 658
             GIS  + +G+V +V+ V R+V  LGIS  +L D      L  +CRR+ K G L+ +VEL
Sbjct: 123  KGISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDCRRMAKSGELEELVEL 182

Query: 659  MEILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDL 838
            ME+L+GF   V+E + P ++IK+CV+  +P MA+RYA ILP + I F + I EFGKKRDL
Sbjct: 183  MEVLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDL 242

Query: 839  VSALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSL 1018
             SA + +  SK N NG NMY  R I+DVCG CGD  KSR IY++LV Q V PN +VFNSL
Sbjct: 243  KSAYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVIPNIFVFNSL 302

Query: 1019 MNVNAHDLSYTLRVYKQMQ 1075
            MNVNAHDL+YT ++YK MQ
Sbjct: 303  MNVNAHDLNYTFQLYKNMQ 321


>gb|EYU41644.1| hypothetical protein MIMGU_mgv1a001284mg [Mimulus guttatus]
          Length = 847

 Score =  228 bits (580), Expect = 4e-57
 Identities = 119/246 (48%), Positives = 159/246 (64%), Gaps = 1/246 (0%)
 Frame = +2

Query: 341  YYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVEA 520
            Y  +LASKLA DG F+DFLMI E            F+  L+AK V+ G++ ++ +G + +
Sbjct: 72   YNTELASKLAEDGMFEDFLMISESVVASGVKPSE-FLALLNAKCVAIGVARVLDEGNLHS 130

Query: 521  VIGV-FRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            V+ + F  +  +GI P  +FD      L  ECRRL+K G ++ +V  ME LAGF   ++E
Sbjct: 131  VVKMLFNGLEKIGIDPVQMFDAVSTESLRRECRRLLKRGEVEQLVSFMETLAGFKFQIRE 190

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             V+P  +I +C+ + DP  A+RYA   P   I F SII EFGKKRDL SAL  FEA+K N
Sbjct: 191  LVEPSDVISLCISQRDPTAAIRYAQNFPHMEIMFCSIILEFGKKRDLASALTAFEAAKQN 250

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
            ++ PNM+A R I+DVCG CGD LKSR IYE L+   +TPN YVFNSLMNVN+ DL+Y L 
Sbjct: 251  TSTPNMHAYRTIIDVCGLCGDYLKSRTIYEGLLAGNITPNVYVFNSLMNVNSRDLNYALG 310

Query: 1058 VYKQMQ 1075
             YK+M+
Sbjct: 311  TYKKMK 316


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  223 bits (568), Expect = 1e-55
 Identities = 122/261 (46%), Positives = 164/261 (62%), Gaps = 4/261 (1%)
 Frame = +2

Query: 305  DLAPQSGANR--FKYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVS 478
            +L  +S A R   +++AD A     D +  D  ++ E           +    L A+L S
Sbjct: 41   NLPSRSSAVRSDLRHFADFAG----DAKLRDLSVVVESLAVSGVDASRLR-SALRAELAS 95

Query: 479  A--GISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVV 652
            A  GIS ++ DG+V +   +  ++  LG  P ++FDG    L+  ECRR+++  +++ +V
Sbjct: 96   AEKGISAVLRDGKVRSFARLLGKLDELGFPPVEIFDGWALELIRRECRRILRCEQVEELV 155

Query: 653  ELMEILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKR 832
            EL E+L+G+   +KE V P  +IK+CVEK +P MA+RYAC LP +HI F   + EFGKK 
Sbjct: 156  ELFEVLSGYGFSIKELVKPSDVIKICVEKRNPKMAIRYACTLPHAHIIFCDAVYEFGKKG 215

Query: 833  DLVSALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFN 1012
            DLVSAL+  EASK NS   NMY  R I+DVCG C D  KSRYIYE+L+ + VTPN YVFN
Sbjct: 216  DLVSALIAHEASKKNSTSTNMYLYRTIIDVCGRCHDYQKSRYIYEDLLNEKVTPNVYVFN 275

Query: 1013 SLMNVNAHDLSYTLRVYKQMQ 1075
            SLMNVNAHD SYTL VYK MQ
Sbjct: 276  SLMNVNAHDFSYTLNVYKDMQ 296


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336211|gb|ERP59304.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 828

 Score =  219 bits (558), Expect = 2e-54
 Identities = 122/246 (49%), Positives = 152/246 (61%), Gaps = 1/246 (0%)
 Frame = +2

Query: 341  YYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVEA 520
            Y+A+LASKLA DGR  DF+MI E            F+  L    V+ GIS  +  G V+ 
Sbjct: 73   YHANLASKLAEDGRLQDFVMIAESVIASGVEPSS-FVAALSVGPVAKGISKNLQQGNVDC 131

Query: 521  VIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKEF 700
            V+   ++   LG+S     DG    LL  E  R+V  G ++ VV +ME LAGF    KE 
Sbjct: 132  VVRFLKKTEELGVSTLKFLDGVAIDLLKKEFIRIVNCGDVEQVVYIMETLAGFCFSFKEL 191

Query: 701  VDPFKIIKMCVEKCDPDMAVRYACILP-RSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
            VDP  IIK+CV+K +P MAVRYA I P    I F +II EFG+K  L SALV ++ +K  
Sbjct: 192  VDPSYIIKICVDKLNPKMAVRYAAIFPGEGRILFCNIISEFGRKGHLDSALVAYDEAKHK 251

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY  R I+DVCG CGD +KSRYIYE+L+ + V PN YVFNSLMNVNAHDL YT  
Sbjct: 252  LSVPNMYLHRTIIDVCGLCGDYMKSRYIYEDLINRKVIPNVYVFNSLMNVNAHDLGYTFS 311

Query: 1058 VYKQMQ 1075
            V+K MQ
Sbjct: 312  VFKNMQ 317


>emb|CAB86037.1| putative protein [Arabidopsis thaliana]
          Length = 798

 Score =  217 bits (553), Expect = 6e-54
 Identities = 110/246 (44%), Positives = 151/246 (61%)
 Frame = +2

Query: 338  KYYADLASKLALDGRFDDFLMIFEXXXXXXXXXXXMFIDCLDAKLVSAGISGLICDGRVE 517
            +YYAD ASKLA DGR +D  +I E            F   +D  L+S GIS  +  G++E
Sbjct: 82   EYYADFASKLAEDGRIEDVALIAETLAAESGANVARFASMVDYDLLSKGISSNLRQGKIE 141

Query: 518  AVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLKAVVELMEILAGFHLPVKE 697
            +V+   +R+  +GI+P DL D S   L+  + R +    +++  ++LMEILAG    +KE
Sbjct: 142  SVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEKAIDLMEILAGLGFKIKE 201

Query: 698  FVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFGKKRDLVSALVVFEASKCN 877
             VDPF ++K CVE  +P +A+R              II  FGKK D+VS +  +EA K  
Sbjct: 202  LVDPFDVVKSCVEISNPQLAIR--------------IIHGFGKKGDMVSVMTAYEACKQI 247

Query: 878  SNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTYVFNSLMNVNAHDLSYTLR 1057
             + PNMY CR ++DVCG CGD +KSRYIYE+L+ + + PN YV NSLMNVN+HDL YTL+
Sbjct: 248  LDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMNVNSHDLGYTLK 307

Query: 1058 VYKQMQ 1075
            VYK MQ
Sbjct: 308  VYKNMQ 313


>ref|XP_007162713.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
            gi|561036177|gb|ESW34707.1| hypothetical protein
            PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  203 bits (517), Expect = 9e-50
 Identities = 105/204 (51%), Positives = 134/204 (65%)
 Frame = +2

Query: 464  AKLVSAGISGLICDGRVEAVIGVFRRVRSLGISPFDLFDGSVKTLLGLECRRLVKEGRLK 643
            AK+V  GI G      V +V+    RV+   +S     +GS    +  EC RLV  G ++
Sbjct: 82   AKMVLLGIQG----NSVRSVVHTLNRVQDHSVSLASHLNGSSIDAIAKECCRLVMCGHIE 137

Query: 644  AVVELMEILAGFHLPVKEFVDPFKIIKMCVEKCDPDMAVRYACILPRSHIWFSSIIQEFG 823
              VELME+L  F + ++ FV P  +IK CV   +P +AVRYAC+LP + I F SII EFG
Sbjct: 138  EAVELMEVLTRFKISIRGFVQPSDVIKRCVLSRNPILAVRYACLLPHAQILFCSIISEFG 197

Query: 824  KKRDLVSALVVFEASKCNSNGPNMYACRAIVDVCGFCGDNLKSRYIYEELVTQAVTPNTY 1003
            K+RDL+SA   +E SK + N PNMY  RAI+D CG C D +KSRYIYE+L+ Q +TPN Y
Sbjct: 198  KRRDLISAFKAYELSKKHMNIPNMYMYRAIIDACGLCRDYMKSRYIYEDLLNQKITPNIY 257

Query: 1004 VFNSLMNVNAHDLSYTLRVYKQMQ 1075
            VFNSLMNVNAHDLSYTL +Y+ MQ
Sbjct: 258  VFNSLMNVNAHDLSYTLNLYQNMQ 281

