BLASTX nr result

ID: Catharanthus23_contig00034332

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00034332
         (441 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004236339.1| PREDICTED: putative pentatricopeptide repeat...   229   2e-58
ref|XP_006447959.1| hypothetical protein CICLE_v10014595mg [Citr...   216   2e-54
ref|XP_006469338.1| PREDICTED: putative pentatricopeptide repeat...   216   3e-54
ref|XP_002303023.2| pentatricopeptide repeat-containing family p...   199   4e-49
ref|XP_003635064.1| PREDICTED: putative pentatricopeptide repeat...   199   4e-49
emb|CBI18728.3| unnamed protein product [Vitis vinifera]              199   4e-49
gb|EOX93714.1| Tetratricopeptide repeat-like superfamily protein...   197   9e-49
ref|XP_003635033.1| PREDICTED: putative pentatricopeptide repeat...   197   9e-49
emb|CBI38389.3| unnamed protein product [Vitis vinifera]              197   9e-49
gb|EXB61171.1| hypothetical protein L484_007437 [Morus notabilis]     197   1e-48
ref|XP_003540936.1| PREDICTED: putative pentatricopeptide repeat...   192   4e-47
gb|ESW23889.1| hypothetical protein PHAVU_004G084900g [Phaseolus...   190   1e-46
ref|XP_004292328.1| PREDICTED: putative pentatricopeptide repeat...   181   9e-44
ref|XP_006401000.1| hypothetical protein EUTSA_v10015737mg [Eutr...   172   5e-41
ref|NP_200728.2| protein ORGANELLE TRANSCRIPT PROCESSING 80 [Ara...   168   8e-40
ref|XP_006282116.1| hypothetical protein CARUB_v10028364mg [Caps...   166   4e-39
ref|XP_002527276.1| pentatricopeptide repeat-containing protein,...   165   7e-39
ref|XP_003550993.1| PREDICTED: pentatricopeptide repeat-containi...   159   3e-37
ref|XP_006380676.1| hypothetical protein POPTR_0007s10370g [Popu...   154   9e-36
ref|XP_004141894.1| PREDICTED: pentatricopeptide repeat-containi...   154   9e-36

>ref|XP_004236339.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Solanum lycopersicum]
          Length = 630

 Score =  229 bits (585), Expect = 2e-58
 Identities = 103/145 (71%), Positives = 130/145 (89%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+M G++PN+VTIVC+LSACAQLG LELGKW+HSYV KY+IE+NH+VGSAL+NMYSRCG
Sbjct: 248 EMQMAGLKPNEVTIVCVLSACAQLGALELGKWVHSYVEKYNIEVNHIVGSALVNMYSRCG 307

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +IDEA  +F  L+ RDVTTYNS+IVG++LNGKS+EA+++FQRM   G+KPT+ITF+ VLN
Sbjct: 308 DIDEAASLFEELKARDVTTYNSMIVGYALNGKSIEAIKVFQRMKREGVKPTSITFSGVLN 367

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIE 435
           ACS GGLVD+GF+IFE+M  EYGIE
Sbjct: 368 ACSHGGLVDIGFDIFESMETEYGIE 392



 Score = 74.3 bits (181), Expect = 2e-11
 Identities = 45/162 (27%), Positives = 75/162 (46%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM  D + P+   I  +L AC     L+ G+ IH  V K  + ++  V   L+ +Y +CG
Sbjct: 116 QMIKDFILPDIYIIPLVLKACGCGLDLKSGQQIHCQVMKLGLSLDRFVRVKLMELYGKCG 175

Query: 181 NIDEAEKIFCSLQQRDVTT-------------------------------YNSLIVGFSL 267
             ++A+K+F  + QRDV                                 + ++I G   
Sbjct: 176 EFNDAKKVFDEMPQRDVVASTVMISCYLDHGLVSKAMDEFRVVSTKDNVCWTAMIDGLVK 235

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           NG+   A+++F+ M   G+KP  +T   VL+AC+Q G ++LG
Sbjct: 236 NGEMNYALELFREMQMAGLKPNEVTIVCVLSACAQLGALELG 277


>ref|XP_006447959.1| hypothetical protein CICLE_v10014595mg [Citrus clementina]
           gi|557550570|gb|ESR61199.1| hypothetical protein
           CICLE_v10014595mg [Citrus clementina]
          Length = 631

 Score =  216 bits (550), Expect = 2e-54
 Identities = 95/147 (64%), Positives = 129/147 (87%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+ D VRPN+VTIVC+LSAC+QLG LELG+WIHSY+ K+ I++NH+VG ALINMYSRCG
Sbjct: 249 EMQRDNVRPNEVTIVCVLSACSQLGALELGRWIHSYMGKHRIDLNHIVGGALINMYSRCG 308

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +ID+A ++F  +++RDVTTYNSLI G +++G+S+EAV++F+ M++ GI+PT +TF  VLN
Sbjct: 309 DIDKALRVFEEMKERDVTTYNSLIAGLAMHGRSIEAVEMFREMINQGIRPTKVTFVGVLN 368

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGLVDLGFEIF++M+ +YGIEPQ
Sbjct: 369 ACSHGGLVDLGFEIFQSMTRDYGIEPQ 395



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 40/163 (24%), Positives = 68/163 (41%), Gaps = 32/163 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM  + V P+   +   L AC  L  L  G+ IH  V K  +  N      L+ +Y +CG
Sbjct: 116 QMVEESVLPDNYAVSSALKACGFLLGLREGREIHGQVLKLGLRSNRSTRLKLVELYGKCG 175

Query: 181 N--------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFS 264
                                            ++ A ++F  ++ +D   + ++I G  
Sbjct: 176 EFKDAMQLFDEMPECNDVVASTVMINCYVEHGLVENAFEVFSRVKVKDTVCWTAMIDGLV 235

Query: 265 LNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
            NG+   A+ +F+ M    ++P  +T   VL+ACSQ G ++LG
Sbjct: 236 RNGEMARALDLFREMQRDNVRPNEVTIVCVLSACSQLGALELG 278


>ref|XP_006469338.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Citrus sinensis]
          Length = 631

 Score =  216 bits (549), Expect = 3e-54
 Identities = 95/147 (64%), Positives = 129/147 (87%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+ D VRPN+VTIVC+LSAC+QLG LELG+WIHSY+ K+ I++NH+VG ALINMYSRCG
Sbjct: 249 EMQRDNVRPNEVTIVCVLSACSQLGALELGRWIHSYMGKHRIDLNHIVGGALINMYSRCG 308

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +ID+A ++F  +++RDVTTYNSLI G +++G+S+EAV++F+ M++ GI+PT +TF  VLN
Sbjct: 309 DIDKALQVFEEMKERDVTTYNSLIAGLAMHGRSIEAVEMFREMINQGIRPTKVTFVGVLN 368

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGLVDLGFEIF++M+ +YGIEPQ
Sbjct: 369 ACSHGGLVDLGFEIFQSMTRDYGIEPQ 395



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 40/163 (24%), Positives = 68/163 (41%), Gaps = 32/163 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM  + V P+   +   L AC  L  L  G+ IH  V K  +  N      L+ +Y +CG
Sbjct: 116 QMVEESVLPDNYAVSSALKACGFLLGLREGREIHGQVLKLGLRSNRSTRLKLVELYGKCG 175

Query: 181 N--------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFS 264
                                            ++ A ++F  ++ +D   + ++I G  
Sbjct: 176 EFKDAMQLFDEMPECNDVVASTVMINCYVEHGLVENAFEVFSRVKVKDTVCWTAMIDGLV 235

Query: 265 LNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
            NG+   A+ +F+ M    ++P  +T   VL+ACSQ G ++LG
Sbjct: 236 RNGEMARALDLFREMQRDNVRPNEVTIVCVLSACSQLGALELG 278


>ref|XP_002303023.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550345712|gb|EEE82296.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 621

 Score =  199 bits (505), Expect = 4e-49
 Identities = 86/146 (58%), Positives = 124/146 (84%)
 Frame = +1

Query: 4   MRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGN 183
           M+ + V PN+VTIVC+LSAC++LG L+LG+W+ SY++K+ IE+NH VG ALINMYSRCG+
Sbjct: 240 MQREDVMPNEVTIVCVLSACSELGALQLGRWVRSYMDKHRIELNHFVGGALINMYSRCGD 299

Query: 184 IDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNA 363
           IDEA+++F  +++++V TYNS+I+GF+L+GKSVEAV++F+ ++  G  P+++TF  VLNA
Sbjct: 300 IDEAQRVFEQMKEKNVITYNSMIMGFALHGKSVEAVELFRGLIKQGFTPSSVTFVGVLNA 359

Query: 364 CSQGGLVDLGFEIFETMSIEYGIEPQ 441
           CS GGL +LGFEIF +M+ +YGIEPQ
Sbjct: 360 CSHGGLAELGFEIFHSMAKDYGIEPQ 385



 Score = 74.3 bits (181), Expect = 2e-11
 Identities = 41/149 (27%), Positives = 71/149 (47%), Gaps = 18/149 (12%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    + P+   +  +L AC     L+ G+ +HS V K  +  N  +   LI +Y +CG
Sbjct: 120 QMINSSLVPDSYAVTSVLKACGCHLALKEGREVHSQVLKLGLSSNRSIRIKLIELYGKCG 179

Query: 181 NIDEAEKIFCSLQQRDVTT------------------YNSLIVGFSLNGKSVEAVQIFQR 306
             ++A ++F  + +RDV                    + ++I G   NG+S  A+++F+ 
Sbjct: 180 AFEDARRVFDEMPERDVVASTVMINYYFDHGIKDTVCWTAMIDGLVRNGESNRALEVFRN 239

Query: 307 MVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           M    + P  +T   VL+ACS+ G + LG
Sbjct: 240 MQREDVMPNEVTIVCVLSACSELGALQLG 268


>ref|XP_003635064.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Vitis vinifera]
          Length = 650

 Score =  199 bits (505), Expect = 4e-49
 Identities = 86/146 (58%), Positives = 121/146 (82%)
 Frame = +1

Query: 4   MRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGN 183
           M+ + VRPN+ TIVC+LSAC+QLG LE+G+W+HSY+ K++IE+N  VG+ALINMYSRCG+
Sbjct: 269 MQGENVRPNEFTIVCVLSACSQLGALEIGRWVHSYMRKFEIELNLFVGNALINMYSRCGS 328

Query: 184 IDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNA 363
           IDEA+ +F  ++ RDV TYN++I G S+NGKS +A+++F+ M+   ++PTN+TF  VLNA
Sbjct: 329 IDEAQTVFDEMKDRDVITYNTMISGLSMNGKSRQAIELFRVMIGRRLRPTNVTFVGVLNA 388

Query: 364 CSQGGLVDLGFEIFETMSIEYGIEPQ 441
           CS GGLVD GF+IF +M+ +YG+EPQ
Sbjct: 389 CSHGGLVDFGFKIFHSMTRDYGVEPQ 414



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 39/161 (24%), Positives = 70/161 (43%), Gaps = 30/161 (18%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M  D + P+   +  IL AC     L  G+ +HS   K  +  N LV   ++ +Y +CG
Sbjct: 137 RMLHDSILPDNYLMASILKACGSQLALREGREVHSRALKLGLSSNRLVRLRIMELYGKCG 196

Query: 181 N------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFSLN 270
                                          ++EA  +F  ++++D   + ++I GF  N
Sbjct: 197 ELGDARRVFEEMPEDVVASTVMISSYSDQGLVEEAGAVFSRVRRKDTVCWTAMIDGFVRN 256

Query: 271 GKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
            +   A++ F+ M    ++P   T   VL+ACSQ G +++G
Sbjct: 257 EEMNRALEAFRGMQGENVRPNEFTIVCVLSACSQLGALEIG 297


>emb|CBI18728.3| unnamed protein product [Vitis vinifera]
          Length = 607

 Score =  199 bits (505), Expect = 4e-49
 Identities = 86/146 (58%), Positives = 121/146 (82%)
 Frame = +1

Query: 4   MRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGN 183
           M+ + VRPN+ TIVC+LSAC+QLG LE+G+W+HSY+ K++IE+N  VG+ALINMYSRCG+
Sbjct: 226 MQGENVRPNEFTIVCVLSACSQLGALEIGRWVHSYMRKFEIELNLFVGNALINMYSRCGS 285

Query: 184 IDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNA 363
           IDEA+ +F  ++ RDV TYN++I G S+NGKS +A+++F+ M+   ++PTN+TF  VLNA
Sbjct: 286 IDEAQTVFDEMKDRDVITYNTMISGLSMNGKSRQAIELFRVMIGRRLRPTNVTFVGVLNA 345

Query: 364 CSQGGLVDLGFEIFETMSIEYGIEPQ 441
           CS GGLVD GF+IF +M+ +YG+EPQ
Sbjct: 346 CSHGGLVDFGFKIFHSMTRDYGVEPQ 371



 Score = 75.1 bits (183), Expect = 9e-12
 Identities = 38/135 (28%), Positives = 69/135 (51%), Gaps = 4/135 (2%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M  D + P+   +  IL AC     L  G+ +HS   K  +  N LV   ++ +Y +CG
Sbjct: 120 RMLHDSILPDNYLMASILKACGSQLALREGREVHSRALKLGLSSNRLVRLRIMELYGKCG 179

Query: 181 NIDEAEKIFCSLQQ----RDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFT 348
            + +A ++F  + +    +D   + ++I GF  N +   A++ F+ M    ++P   T  
Sbjct: 180 ELGDARRVFEEMPEDVVAKDTVCWTAMIDGFVRNEEMNRALEAFRGMQGENVRPNEFTIV 239

Query: 349 AVLNACSQGGLVDLG 393
            VL+ACSQ G +++G
Sbjct: 240 CVLSACSQLGALEIG 254


>gb|EOX93714.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
          Length = 632

 Score =  197 bits (502), Expect = 9e-49
 Identities = 88/148 (59%), Positives = 122/148 (82%), Gaps = 1/148 (0%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNK-YDIEINHLVGSALINMYSRC 177
           +M+ + VRPN++TIVC+LSAC+ LG LELG+W+HSY+ K + I ++H VG ALINMYSRC
Sbjct: 249 EMQKENVRPNEITIVCVLSACSHLGALELGRWVHSYMGKEHGIVLSHFVGGALINMYSRC 308

Query: 178 GNIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVL 357
           G+IDEAE++F  +++R+V TYN +I G +++GKS+EA++IFQ M+  G+ PT +TF AVL
Sbjct: 309 GDIDEAERVFAMMKERNVITYNLMISGLAMHGKSIEAIEIFQVMIKKGLLPTGVTFVAVL 368

Query: 358 NACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           NACS GGLVD GFEIF +M+ +YGI+PQ
Sbjct: 369 NACSHGGLVDFGFEIFLSMTRDYGIQPQ 396



 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 44/167 (26%), Positives = 70/167 (41%), Gaps = 31/167 (18%)
 Frame = +1

Query: 25  PNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDEAEKI 204
           P++  I  +L AC     L  GK  H    K  +  N  +   L+  Y +CG  D+A K+
Sbjct: 125 PDKYVITSVLKACGSHFALREGKEFHCQALKLGLSSNRSITMKLLEFYGKCGEFDDARKV 184

Query: 205 FCSLQQRDVTT-------------------------------YNSLIVGFSLNGKSVEAV 291
           F  + +RDV                                 + ++I G   NG+   A+
Sbjct: 185 FDEMVERDVVASTIMINCYLDHGLVEQAIEVFDRVRIKDTVCWTAMIDGLVRNGEMNRAL 244

Query: 292 QIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLGFEIFETMSIEYGI 432
           ++F+ M    ++P  IT   VL+ACS  G ++LG  +   M  E+GI
Sbjct: 245 EMFREMQKENVRPNEITIVCVLSACSHLGALELGRWVHSYMGKEHGI 291


>ref|XP_003635033.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Vitis vinifera]
          Length = 650

 Score =  197 bits (502), Expect = 9e-49
 Identities = 87/146 (59%), Positives = 120/146 (82%)
 Frame = +1

Query: 4   MRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGN 183
           M+ + VRPN+ TIVC+LSAC+QLG LE+G+W+HSY+ K++IE+N  VG+ALINMYSRCG+
Sbjct: 269 MQGENVRPNEFTIVCVLSACSQLGALEIGRWVHSYMRKFEIELNLFVGNALINMYSRCGS 328

Query: 184 IDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNA 363
           IDEA+ +F  ++ RDV TYN++I G S+NGKS +A+++F+ MV   ++PTN+TF  VLNA
Sbjct: 329 IDEAQTVFDEMKDRDVITYNTMISGLSMNGKSRQAIELFRVMVGRRLRPTNVTFVGVLNA 388

Query: 364 CSQGGLVDLGFEIFETMSIEYGIEPQ 441
           CS GGLVD GFEIF +M+ +Y +EPQ
Sbjct: 389 CSHGGLVDFGFEIFHSMARDYRVEPQ 414



 Score = 65.1 bits (157), Expect = 9e-09
 Identities = 38/161 (23%), Positives = 70/161 (43%), Gaps = 30/161 (18%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M  + + P+   +  IL AC     L  G+ +HS   K     N LV   ++ +Y +CG
Sbjct: 137 RMLHESILPDNYLMASILKACGSQLALREGREVHSRALKLGFSSNRLVRLRIMELYGKCG 196

Query: 181 N------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFSLN 270
                                          ++EA  +F  ++++D   + ++I GF  N
Sbjct: 197 ELGDARRVFEEMPEDVVASTVMISSYSDQGLVEEAGAVFSRVRRKDTVCWTAMIDGFVRN 256

Query: 271 GKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
            ++  A++ F+ M    ++P   T   VL+ACSQ G +++G
Sbjct: 257 EETNRALEAFRGMQGENVRPNEFTIVCVLSACSQLGALEIG 297


>emb|CBI38389.3| unnamed protein product [Vitis vinifera]
          Length = 614

 Score =  197 bits (502), Expect = 9e-49
 Identities = 87/146 (59%), Positives = 120/146 (82%)
 Frame = +1

Query: 4   MRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGN 183
           M+ + VRPN+ TIVC+LSAC+QLG LE+G+W+HSY+ K++IE+N  VG+ALINMYSRCG+
Sbjct: 233 MQGENVRPNEFTIVCVLSACSQLGALEIGRWVHSYMRKFEIELNLFVGNALINMYSRCGS 292

Query: 184 IDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNA 363
           IDEA+ +F  ++ RDV TYN++I G S+NGKS +A+++F+ MV   ++PTN+TF  VLNA
Sbjct: 293 IDEAQTVFDEMKDRDVITYNTMISGLSMNGKSRQAIELFRVMVGRRLRPTNVTFVGVLNA 352

Query: 364 CSQGGLVDLGFEIFETMSIEYGIEPQ 441
           CS GGLVD GFEIF +M+ +Y +EPQ
Sbjct: 353 CSHGGLVDFGFEIFHSMARDYRVEPQ 378



 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 37/135 (27%), Positives = 69/135 (51%), Gaps = 4/135 (2%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M  + + P+   +  IL AC     L  G+ +HS   K     N LV   ++ +Y +CG
Sbjct: 127 RMLHESILPDNYLMASILKACGSQLALREGREVHSRALKLGFSSNRLVRLRIMELYGKCG 186

Query: 181 NIDEAEKIFCSLQQ----RDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFT 348
            + +A ++F  + +    +D   + ++I GF  N ++  A++ F+ M    ++P   T  
Sbjct: 187 ELGDARRVFEEMPEDVVAKDTVCWTAMIDGFVRNEETNRALEAFRGMQGENVRPNEFTIV 246

Query: 349 AVLNACSQGGLVDLG 393
            VL+ACSQ G +++G
Sbjct: 247 CVLSACSQLGALEIG 261


>gb|EXB61171.1| hypothetical protein L484_007437 [Morus notabilis]
          Length = 631

 Score =  197 bits (501), Expect = 1e-48
 Identities = 82/147 (55%), Positives = 122/147 (82%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+M+ V+PN+ TIVC+LSAC+ LG LELG+W+HSY+ KY+I++NH+VG ALINMY+RCG
Sbjct: 248 EMQMENVKPNEATIVCVLSACSHLGALELGRWVHSYMGKYEIKLNHIVGGALINMYARCG 307

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +ID+ +K+F  + +RD++TYNS+I G  ++GKS+EA+++F+ M++ G +P +ITF  VLN
Sbjct: 308 DIDKVKKVFDEMNERDISTYNSIIAGLGMHGKSIEAIEMFRAMLNNGFRPNSITFVGVLN 367

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS  GLV+LGFE+F +M  +Y IEP+
Sbjct: 368 ACSHRGLVELGFEMFHSMVKDYKIEPR 394



 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 43/158 (27%), Positives = 72/158 (45%), Gaps = 31/158 (19%)
 Frame = +1

Query: 13  DGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDE 192
           D + P+   +V  L AC     L+LG+ IH  V K  +  N  V   L+ +Y +CG +++
Sbjct: 120 DSIVPDNYAVVSALKACGFQLALKLGREIHGQVMKLGLCSNRSVKMKLMELYGKCGELED 179

Query: 193 AEKIFCSLQQRD-------VTTY------------------------NSLIVGFSLNGKS 279
           A ++F  + +RD       +T+Y                         ++I G   NG+ 
Sbjct: 180 ARRVFDEMPERDFVASTVMMTSYLHHGFVREASAVFKQVRRKDTVCWTAMIDGLVKNGEM 239

Query: 280 VEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
             A+++F+ M    +KP   T   VL+ACS  G ++LG
Sbjct: 240 NWALEVFREMQMENVKPNEATIVCVLSACSHLGALELG 277



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 35/125 (28%), Positives = 61/125 (48%)
 Frame = +1

Query: 28  NQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDEAEKIF 207
           N+  ++ +L  C  +  ++     H+ + +   E +  +   L+ + S   +ID A KIF
Sbjct: 27  NRKEVIFLLQKCKHISHVQS---THAKIIRNGQEQDPFIVFELLRLCSNLNSIDYASKIF 83

Query: 208 CSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVD 387
             +Q  +V  Y +LI GF L+G   +A+ ++ RM+   I P N    + L AC     + 
Sbjct: 84  QRIQTPNVFLYTALIDGFVLHGSYFDAILLYCRMIDDSIVPDNYAVVSALKACGFQLALK 143

Query: 388 LGFEI 402
           LG EI
Sbjct: 144 LGREI 148


>ref|XP_003540936.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Glycine max]
          Length = 629

 Score =  192 bits (488), Expect = 4e-47
 Identities = 87/147 (59%), Positives = 117/147 (79%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M++ GV PN+VT VC+LSACAQLG LELG+WIH+Y+ K  +E+N  V  ALINMYSRCG
Sbjct: 247 EMQVKGVEPNEVTFVCVLSACAQLGALELGRWIHAYMRKCGVEVNRFVAGALINMYSRCG 306

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +IDEA+ +F  ++ +DV+TYNS+I G +L+GKS+EAV++F  M+   ++P  ITF  VLN
Sbjct: 307 DIDEAQALFDGVRVKDVSTYNSMIGGLALHGKSIEAVELFSEMLKERVRPNGITFVGVLN 366

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGLVDLG EIFE+M + +GIEP+
Sbjct: 367 ACSHGGLVDLGGEIFESMEMIHGIEPE 393



 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 40/162 (24%), Positives = 71/162 (43%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    V  +   +  +L AC     L  GK +H  V K  + ++  +   L+ +Y +CG
Sbjct: 115 QMVRKHVLADNYAVTAMLKACVLQRALGSGKEVHGLVLKSGLGLDRSIALKLVELYGKCG 174

Query: 181 NIDEAEKIFCSLQQRDVTT-------------------------------YNSLIVGFSL 267
            +++A K+F  + +RDV                                 +  +I G   
Sbjct: 175 VLEDARKMFDGMPERDVVACTVMIGSCFDCGMVEEAIEVFNEMGTRDTVCWTMVIDGLVR 234

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           NG+    +++F+ M   G++P  +TF  VL+AC+Q G ++LG
Sbjct: 235 NGEFNRGLEVFREMQVKGVEPNEVTFVCVLSACAQLGALELG 276


>gb|ESW23889.1| hypothetical protein PHAVU_004G084900g [Phaseolus vulgaris]
          Length = 632

 Score =  190 bits (483), Expect = 1e-46
 Identities = 85/147 (57%), Positives = 119/147 (80%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M++ GVRPN+VT VC+LSAC+QLG LELG+WIH+Y+ K D+E+N  V  ALINMYSRCG
Sbjct: 250 EMQVKGVRPNEVTFVCVLSACSQLGALELGRWIHAYLCKCDVEVNWFVAGALINMYSRCG 309

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +IDEA+ +F  ++ +DV+TYNS+I G +++GKS+EAV++F+ M+   ++P  ITF  VLN
Sbjct: 310 DIDEAQVLFDEVKVKDVSTYNSMIRGLAMHGKSMEAVELFREMLMQRVRPNGITFVGVLN 369

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
            CS GGLVDLG+EIF++M   +GIEP+
Sbjct: 370 GCSHGGLVDLGWEIFQSMKTVHGIEPE 396



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 40/162 (24%), Positives = 70/162 (43%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    V  +   +  +L AC     L  G+ +H  V K  + ++  +   L  +Y +CG
Sbjct: 118 QMVRGHVLADSYAVTAVLKACVLQRALGRGREVHGLVFKRGLCLDRSIALKLAELYGKCG 177

Query: 181 NIDEAEKIFCSLQQRDVTT-------------------------------YNSLIVGFSL 267
            +++A K+F  + +RDV                                 +  +I G   
Sbjct: 178 VLEDAWKVFDEMPERDVVACTVMMGSCFDWGMVEEAVGVFNEMRSRDTVCWTLMIDGLVR 237

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           NG+    +++F+ M   G++P  +TF  VL+ACSQ G ++LG
Sbjct: 238 NGEFNRGLEMFREMQVKGVRPNEVTFVCVLSACSQLGALELG 279


>ref|XP_004292328.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g59200, chloroplastic-like [Fragaria vesca subsp.
           vesca]
          Length = 625

 Score =  181 bits (459), Expect = 9e-44
 Identities = 82/147 (55%), Positives = 115/147 (78%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+ + VRPN+VT+VC+LSAC+QLG LEL +W+  Y+ K  IE N++VG ALINMYSRCG
Sbjct: 243 EMQRENVRPNEVTLVCVLSACSQLGALELRRWVRLYMEKNKIECNYIVGGALINMYSRCG 302

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +ID A ++F  +++RDV+TYNS+I G  ++GKS  A+++F+ M   G++P +ITF  VLN
Sbjct: 303 DIDGAVEVFSWMKERDVSTYNSMIEGLGMHGKSTMAIEMFREMRERGLRPNSITFVKVLN 362

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGLV+LGFEIF +M+  + IEPQ
Sbjct: 363 ACSHGGLVELGFEIFHSMTSSHRIEPQ 389



 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 36/161 (22%), Positives = 72/161 (44%), Gaps = 31/161 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M  + + P++  I  +L AC     +E  + +H+   K ++  N  +   L+ +Y +CG
Sbjct: 111 EMVNEYIFPDKYVITSVLKACGFGLAVEESRQVHAQALKLELSSNRSIRLKLMGVYGKCG 170

Query: 181 N-------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFSL 267
                                           ++EA  +F  L+++D   + ++I G   
Sbjct: 171 EFESARQVFDEMSENDAVAATVMITSYADHGLVEEASGVFDRLRKKDTVCWTAMIDGLVK 230

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDL 390
           NG+   A+++F+ M    ++P  +T   VL+ACSQ G ++L
Sbjct: 231 NGEMNRALEVFREMQRENVRPNEVTLVCVLSACSQLGALEL 271


>ref|XP_006401000.1| hypothetical protein EUTSA_v10015737mg [Eutrema salsugineum]
           gi|557102090|gb|ESQ42453.1| hypothetical protein
           EUTSA_v10015737mg [Eutrema salsugineum]
          Length = 533

 Score =  172 bits (435), Expect = 5e-41
 Identities = 74/147 (50%), Positives = 113/147 (76%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+ + V PN+ T VC+LSAC+ LG LELG+W+HSYV    +E++  VG+ALINMYSRCG
Sbjct: 244 RMQTEKVSPNEFTAVCVLSACSDLGALELGRWVHSYVENEKMELSSFVGNALINMYSRCG 303

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +I+EA+++F  +Q +DV +YN++I G +++G+S EA++ F+ MV+ G +P  +T  A+LN
Sbjct: 304 DINEAKRVFRGMQDKDVISYNTMISGLAMHGESFEAIKEFRDMVNRGFRPNQVTLVALLN 363

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGL+D+G E+F +MS  + +EPQ
Sbjct: 364 ACSHGGLLDIGLEVFNSMSRVFSVEPQ 390



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 45/162 (27%), Positives = 73/162 (45%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYD-----------IEI----- 132
           +M  D V P+   I  +L AC     LE  + IHS V K             +EI     
Sbjct: 116 RMIHDSVLPDNYAITSVLKAC----DLEQCREIHSQVLKLGFGSSRSVGLKLMEIYGKSG 171

Query: 133 ---------------NHLVGSALINMYSRCGNIDEAEKIFCSLQQRDVTTYNSLIVGFSL 267
                          +H+  + +IN YS  G I EA ++F  ++ +D   + ++I G   
Sbjct: 172 DLVDAKKMFDEMPKRDHVAATVMINCYSEYGYIKEALELFRDVKIKDTICWTAMIDGLVR 231

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           N +  +A+++F+RM +  + P   T   VL+ACS  G ++LG
Sbjct: 232 NREMNKALELFRRMQTEKVSPNEFTAVCVLSACSDLGALELG 273


>ref|NP_200728.2| protein ORGANELLE TRANSCRIPT PROCESSING 80 [Arabidopsis thaliana]
           gi|75170817|sp|Q9FIF7.1|PP435_ARATH RecName:
           Full=Putative pentatricopeptide repeat-containing
           protein At5g59200, chloroplastic; Flags: Precursor
           gi|9759241|dbj|BAB09765.1| unnamed protein product
           [Arabidopsis thaliana] gi|332009773|gb|AED97156.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 544

 Score =  168 bits (425), Expect = 8e-40
 Identities = 72/147 (48%), Positives = 111/147 (75%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+M+ V  N+ T VC+LSAC+ LG LELG+W+HS+V    +E+++ VG+ALINMYSRCG
Sbjct: 245 EMQMENVSANEFTAVCVLSACSDLGALELGRWVHSFVENQRMELSNFVGNALINMYSRCG 304

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +I+EA ++F  ++ +DV +YN++I G +++G SVEA+  F+ MV+ G +P  +T  A+LN
Sbjct: 305 DINEARRVFRVMRDKDVISYNTMISGLAMHGASVEAINEFRDMVNRGFRPNQVTLVALLN 364

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGL+D+G E+F +M   + +EPQ
Sbjct: 365 ACSHGGLLDIGLEVFNSMKRVFNVEPQ 391



 Score = 57.4 bits (137), Expect = 2e-06
 Identities = 27/87 (31%), Positives = 49/87 (56%)
 Frame = +1

Query: 133 NHLVGSALINMYSRCGNIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMV 312
           +H+  + +IN YS CG I EA ++F  ++ +D   + ++I G   N +  +A+++F+ M 
Sbjct: 188 DHVAATVMINCYSECGFIKEALELFQDVKIKDTVCWTAMIDGLVRNKEMNKALELFREMQ 247

Query: 313 SGGIKPTNITFTAVLNACSQGGLVDLG 393
              +     T   VL+ACS  G ++LG
Sbjct: 248 MENVSANEFTAVCVLSACSDLGALELG 274


>ref|XP_006282116.1| hypothetical protein CARUB_v10028364mg [Capsella rubella]
           gi|482550820|gb|EOA15014.1| hypothetical protein
           CARUB_v10028364mg [Capsella rubella]
          Length = 550

 Score =  166 bits (419), Expect = 4e-39
 Identities = 69/147 (46%), Positives = 112/147 (76%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+++GV PN+ T VC+LSAC+ LG LELG+W+HS+V    +++++ VG+ALINMY RCG
Sbjct: 245 EMQIEGVSPNEFTAVCVLSACSDLGALELGRWVHSFVENRKMDLSNYVGNALINMYCRCG 304

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           +I+EA+++F  ++ +DV +YN++I G +++G S EA+  F+ MV+ G +P  +T  A+LN
Sbjct: 305 DINEAKRVFRGMRDKDVISYNTMISGLAMHGASFEAINEFRDMVNTGFRPNQVTLVALLN 364

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS GGL+D+G E+F +M   + ++PQ
Sbjct: 365 ACSHGGLLDIGLEVFNSMWSVFNVDPQ 391



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 43/146 (29%), Positives = 72/146 (49%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M    V P+   I  +L AC     LE  K IH+ V K        VG  L+ +Y + G
Sbjct: 117 KMIHSSVLPDNYVITSVLKAC----DLEDCKEIHAQVLKLGFGSGRSVGLKLMEIYGKSG 172

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
            + +A+K+F  + +RD      +I  +S  G   EA+++FQ +    IK T + +TA+++
Sbjct: 173 ELADAKKVFDEMPERDQVAATVMINCYSERGCIEEALELFQDV---KIKDT-VCWTAMID 228

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEP 438
              +   ++   E+F  M IE G+ P
Sbjct: 229 GLVRNREMNKALELFREMQIE-GVSP 253



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 28/89 (31%), Positives = 51/89 (57%)
 Frame = +1

Query: 127 EINHLVGSALINMYSRCGNIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQR 306
           E + +  + +IN YS  G I+EA ++F  ++ +D   + ++I G   N +  +A+++F+ 
Sbjct: 186 ERDQVAATVMINCYSERGCIEEALELFQDVKIKDTVCWTAMIDGLVRNREMNKALELFRE 245

Query: 307 MVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           M   G+ P   T   VL+ACS  G ++LG
Sbjct: 246 MQIEGVSPNEFTAVCVLSACSDLGALELG 274


>ref|XP_002527276.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223533369|gb|EEF35120.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 507

 Score =  165 bits (417), Expect = 7e-39
 Identities = 73/115 (63%), Positives = 99/115 (86%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           +M+ + VRPN+VTIVC+LSAC+QLGTLELG+W+HSY+ KY I INH VG ALINMYSRCG
Sbjct: 357 EMQREDVRPNEVTIVCVLSACSQLGTLELGRWVHSYMGKYGIGINHFVGGALINMYSRCG 416

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITF 345
           +IDEA ++F  +++R+V TYNS+IVGFSL+GKS EA+++F+ M   G++PT++TF
Sbjct: 417 DIDEAWRVFEEMKERNVITYNSMIVGFSLHGKSSEAIELFRGMTKQGLEPTSVTF 471



 Score = 68.6 bits (166), Expect = 8e-10
 Identities = 45/175 (25%), Positives = 77/175 (44%), Gaps = 31/175 (17%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    + P+   I  +L AC     L+ G  +H  V K  +    L+   L+  Y +CG
Sbjct: 225 QMINLSIVPDNYVITSVLEACGFQLALKQGIQVHCQVLKLGLSSKRLMRLKLMKFYGKCG 284

Query: 181 NIDEAEKIFCSLQQRDVTT-------------------------------YNSLIVGFSL 267
           ++ +AE++F  + +RDV                                 + ++I G   
Sbjct: 285 SLKDAERLFDEMPERDVVASTIMINSYFEHGLIQEAIRVFNLTKSKDTVCWTAVIDGLVR 344

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLGFEIFETMSIEYGI 432
           NG+   A+++F+ M    ++P  +T   VL+ACSQ G ++LG  +   M  +YGI
Sbjct: 345 NGEMNRALEVFREMQREDVRPNEVTIVCVLSACSQLGTLELGRWVHSYMG-KYGI 398



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 31/114 (27%), Positives = 58/114 (50%)
 Frame = +1

Query: 25  PNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDEAEKI 204
           PN+  ++ +L +C     +     IH+ + + +   +  V   L+ + S   +I+ A KI
Sbjct: 135 PNRKQLISLLQSCRHSNQITP---IHAKIIRNNYHNDAFVVFELLRVCSNLSSINYASKI 191

Query: 205 FCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNAC 366
           F   +  +V  Y +LI GF L+G  +  + ++ +M++  I P N   T+VL AC
Sbjct: 192 FSFTENPNVYLYTALIDGFVLSGSFISGIHLYYQMINLSIVPDNYVITSVLEAC 245


>ref|XP_003550993.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37380,
           chloroplastic-like [Glycine max]
          Length = 628

 Score =  159 bits (403), Expect = 3e-37
 Identities = 68/141 (48%), Positives = 107/141 (75%)
 Frame = +1

Query: 19  VRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDEAE 198
           VRPN++T+V +LS+C Q+G LE GKW+HSYV    I++N  VG+AL++MY +CG++++A 
Sbjct: 252 VRPNEITVVAVLSSCGQVGALECGKWVHSYVENNGIKVNVRVGTALVDMYCKCGSLEDAR 311

Query: 199 KIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGG 378
           K+F  ++ +DV  +NS+I+G+ ++G S EA+Q+F  M   G+KP++ITF AVL AC+  G
Sbjct: 312 KVFDVMEGKDVVAWNSMIMGYGIHGFSDEALQLFHEMCCIGVKPSDITFVAVLTACAHAG 371

Query: 379 LVDLGFEIFETMSIEYGIEPQ 441
           LV  G+E+F++M   YG+EP+
Sbjct: 372 LVSKGWEVFDSMKDGYGMEPK 392



 Score = 65.1 bits (157), Expect = 9e-09
 Identities = 43/169 (25%), Positives = 80/169 (47%), Gaps = 38/169 (22%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    ++PN  T+  +L AC    TL   + +HS+  K+ +  +  V + L++ Y+R G
Sbjct: 111 QMLTHPIQPNAFTLSSLLKAC----TLHPARAVHSHAIKFGLSSHLYVSTGLVDAYARGG 166

Query: 181 NIDEAEKIFCSLQQR-------------------------------DVTTYNSLIVGFSL 267
           ++  A+K+F ++ +R                               DV  +N +I G++ 
Sbjct: 167 DVASAQKLFDAMPERSLVSYTAMLTCYAKHGMLPEARVLFEGMGMKDVVCWNVMIDGYAQ 226

Query: 268 NGKSVEAVQIFQRMV-------SGGIKPTNITFTAVLNACSQGGLVDLG 393
           +G   EA+  F++M+       +G ++P  IT  AVL++C Q G ++ G
Sbjct: 227 HGCPNEALVFFRKMMMMMGGNGNGKVRPNEITVVAVLSSCGQVGALECG 275


>ref|XP_006380676.1| hypothetical protein POPTR_0007s10370g [Populus trichocarpa]
           gi|550334566|gb|ERP58473.1| hypothetical protein
           POPTR_0007s10370g [Populus trichocarpa]
          Length = 631

 Score =  154 bits (390), Expect = 9e-36
 Identities = 70/147 (47%), Positives = 106/147 (72%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    VRPN+VT++ +LSAC Q G LE G+W+HSY+    I IN  VG++LI+MYS+CG
Sbjct: 249 QMLNAKVRPNEVTVLAVLSACGQTGALETGRWVHSYIENNGIGINVRVGTSLIDMYSKCG 308

Query: 181 NIDEAEKIFCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLN 360
           ++++A  +F  +  +DV  +NS++VG++++G S +A+++F+ M   G +PT+ITF  VLN
Sbjct: 309 SLEDARLVFERISNKDVVAWNSMVVGYAMHGFSQDALRLFKEMCMIGYQPTDITFIGVLN 368

Query: 361 ACSQGGLVDLGFEIFETMSIEYGIEPQ 441
           ACS  GLV  G++ F +M  EYGIEP+
Sbjct: 369 ACSHAGLVSEGWKFFYSMKDEYGIEPK 395



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 43/162 (26%), Positives = 74/162 (45%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM    V PN  T   IL +C     +E  K +H    K+  +    V + L+++Y+R G
Sbjct: 121 QMLSQNVFPNAFTFSSILKSCP----IEPAKLLHGQAIKFGFDAELYVRTCLVDVYARGG 176

Query: 181 N-------------------------------IDEAEKIFCSLQQRDVTTYNSLIVGFSL 267
           +                               IDEA  +F  L++RD   +N +I G++ 
Sbjct: 177 DVVSARTLFDAMPEKSLVSLTAMITCYAKYGMIDEARVLFDGLEERDAICWNVMIDGYAQ 236

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           +G   E + +F++M++  ++P  +T  AVL+AC Q G ++ G
Sbjct: 237 HGLPNEGLLLFRQMLNAKVRPNEVTVLAVLSACGQTGALETG 278


>ref|XP_004141894.1| PREDICTED: pentatricopeptide repeat-containing protein At4g37380,
           chloroplastic-like [Cucumis sativus]
           gi|449513125|ref|XP_004164238.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g37380,
           chloroplastic-like [Cucumis sativus]
          Length = 645

 Score =  154 bits (390), Expect = 9e-36
 Identities = 69/139 (49%), Positives = 102/139 (73%)
 Frame = +1

Query: 25  PNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCGNIDEAEKI 204
           PN+VT++ +LSAC QLG LE G+WIHSY+    I+IN  VG+ALI+MYS+CG++++A  +
Sbjct: 271 PNEVTVLAVLSACGQLGALESGRWIHSYIENKGIQINVHVGTALIDMYSKCGSLEDARLV 330

Query: 205 FCSLQQRDVTTYNSLIVGFSLNGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLV 384
           F  ++ +DV  +NS+IVG++++G S  A+Q+F+ M   G KPT+ITF  +L+AC  GGLV
Sbjct: 331 FDRIRDKDVVAWNSMIVGYAMHGFSQHALQLFEEMTETGHKPTDITFIGILSACGHGGLV 390

Query: 385 DLGFEIFETMSIEYGIEPQ 441
           + G   F  M  +YGIEP+
Sbjct: 391 EEGRSFFRLMRDKYGIEPK 409



 Score = 67.0 bits (162), Expect = 2e-09
 Identities = 44/162 (27%), Positives = 77/162 (47%), Gaps = 31/162 (19%)
 Frame = +1

Query: 1   QMRMDGVRPNQVTIVCILSACAQLGTLELGKWIHSYVNKYDIEINHLVGSALINMYSRCG 180
           QM   GV PN  T   +L +C+    LE GK +H    K  +  +  V + L+++Y+R G
Sbjct: 135 QMLSCGVEPNAFTFSSVLKSCS----LESGKVLHCQAIKLGLGSDLYVRTGLVDVYARGG 190

Query: 181 NI-------------------------------DEAEKIFCSLQQRDVTTYNSLIVGFSL 267
           ++                               D+A  +F  +++RDV  +N +I G++ 
Sbjct: 191 DVVCARQLFDKMPERSLVSLTTMLTCYSKMGELDKARSLFEGMKERDVVCWNVMIGGYAQ 250

Query: 268 NGKSVEAVQIFQRMVSGGIKPTNITFTAVLNACSQGGLVDLG 393
           +G   E++++F+RM+     P  +T  AVL+AC Q G ++ G
Sbjct: 251 SGVPNESLKLFRRMLVAKAIPNEVTVLAVLSACGQLGALESG 292

