BLASTX nr result

ID: Cocculus22_contig00018027 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00018027
         (560 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   215   7e-54
ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas...   208   9e-52
ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   207   1e-51
ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas...   207   2e-51
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   206   4e-51
ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   205   6e-51
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   199   5e-49
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   199   5e-49
ref|XP_002533822.1| pentatricopeptide repeat-containing protein,...   197   1e-48
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     196   3e-48
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   196   3e-48
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...   195   6e-48
ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein...   194   1e-47
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   184   1e-44
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...   176   5e-42
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   173   2e-41
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   172   4e-41
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   172   4e-41
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   162   4e-38
ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A...   152   7e-35

>ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550335936|gb|EEE92696.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 490

 Score =  215 bits (547), Expect = 7e-54
 Identities = 103/184 (55%), Positives = 136/184 (73%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG TNK H+A+++F+RIP FRC PS +SLN L+SVLC   +GL++V ++LLKS  MN
Sbjct: 120 IEVYGRTNKTHEAIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKSQVMN 179

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ STF++LI+ALC I KV FAI++ N M    G+  ++ +YS +LS LC+ KD +  
Sbjct: 180 IRVEESTFQVLITALCRIRKVGFAIEMLNCMVND-GFIVNAEIYSLLLSCLCEQKDATKF 238

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           +V+GFL Q+R  GF P  VDY+NV+ FLVK  RGLDAL +LN MKSD IKPD+ CYTMVL
Sbjct: 239 EVIGFLEQLRKLGFFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVL 298

Query: 542 DGFI 553
            G I
Sbjct: 299 HGVI 302


>ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris]
           gi|561011455|gb|ESW10362.1| hypothetical protein
           PHAVU_009G202600g [Phaseolus vulgaris]
          Length = 513

 Score =  208 bits (529), Expect = 9e-52
 Identities = 103/185 (55%), Positives = 136/185 (73%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           IR YGL++K+ DAVD+F RIP FRCTP+  SLN +LS+LC +RE L+MV ++LLKS  MN
Sbjct: 122 IRFYGLSDKVQDAVDLFLRIPRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQHMN 181

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ STF++LI ALC I +V +AI + N M    GY  D T+ S I+S+LC+ +D++S 
Sbjct: 182 IRVEESTFQVLIKALCRIKRVGYAIKMLNYM-IEGGYGLDETMCSLIISSLCEQEDMTSV 240

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           + L     MR  GF P  +DYTN++ FLVK  +G+DALD+LNQ K DGIKPDVVCYTMVL
Sbjct: 241 EALVIWRDMRKLGFCPGIMDYTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVL 300

Query: 542 DGFIS 556
            G I+
Sbjct: 301 SGIIA 305


>ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina]
           gi|557531581|gb|ESR42764.1| hypothetical protein
           CICLE_v10013613mg [Citrus clementina]
          Length = 506

 Score =  207 bits (528), Expect = 1e-51
 Identities = 103/184 (55%), Positives = 138/184 (75%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I+TY   ++  D+V++F++IP FRC PS +SLNALLSVLC  +E ++MV Q+LLKS  MN
Sbjct: 123 IKTYADAHRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMN 182

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ S+FRILIS LC IN+V FAI+I N M    G+  D    S+ILS++C+ +DLSS 
Sbjct: 183 IRIEESSFRILISTLCRINRVGFAIEILNCMIND-GFCVDGKTCSWILSSVCEQRDLSSD 241

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           ++LGF+ +M+  GF    VDYTNV+  LVK ++  DAL +LNQMKSDGIKPD+VCYTMVL
Sbjct: 242 ELLGFVQEMKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVL 301

Query: 542 DGFI 553
           +G I
Sbjct: 302 NGVI 305


>ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris]
           gi|561013301|gb|ESW12162.1| hypothetical protein
           PHAVU_008G089500g [Phaseolus vulgaris]
          Length = 514

 Score =  207 bits (526), Expect = 2e-51
 Identities = 101/185 (54%), Positives = 136/185 (73%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           IR YGL++++ DAVD+F RIP FRCTP+ +SLN +LS+LC +RE L+MV ++LLKS  MN
Sbjct: 122 IRFYGLSDRVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMN 181

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ STF++LI ALC I +V +AI + N M    GY  D T+ S I+S+LC+ +D++S 
Sbjct: 182 IRVEESTFQVLIEALCRIKRVGYAIKMLNYM-IEGGYGLDETICSLIISSLCEQEDMTSV 240

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           + L     MR  GF P  +DYTN++ FLVK  +G DALD+LNQ K DGIKPDVVCYTMVL
Sbjct: 241 EALVIWRDMRKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVL 300

Query: 542 DGFIS 556
            G ++
Sbjct: 301 SGIVA 305


>ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Glycine max]
          Length = 499

 Score =  206 bits (523), Expect = 4e-51
 Identities = 102/185 (55%), Positives = 135/185 (72%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           IR YGL++++ DAVD+FFRIP FRCTP+  SLN +LS+LC +R+ L+MV ++LLKS  MN
Sbjct: 125 IRFYGLSDRVQDAVDLFFRIPRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQHMN 184

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ STFR+LI ALC I +V +AI + N M    GY  D  + S ++SALC+ KDL+S+
Sbjct: 185 IRVEESTFRVLIRALCRIKRVGYAIKMLNFMVED-GYGLDEKICSLVISALCEQKDLTSA 243

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           + L     MR  GF P  +DYTN++ FLVK  RG+DALD+LNQ K DGIK DVV YTMVL
Sbjct: 244 EALVVWRDMRKLGFCPGVMDYTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVL 303

Query: 542 DGFIS 556
            G ++
Sbjct: 304 SGIVA 308


>ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  205 bits (522), Expect = 6e-51
 Identities = 107/182 (58%), Positives = 130/182 (71%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I+ YG  N   DAVD+FFRIPNFRC PS +SLNALL VLC RREGL MV Q+LLKS AMN
Sbjct: 120 IKVYGNANMFEDAVDLFFRIPNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKSQAMN 179

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRL+ S+FRIL++ALC I K  +AI I N M    GY  D+ + S ILS+LC+ K LS  
Sbjct: 180 IRLEESSFRILVAALCRIKKHNYAIRILNYMLND-GYAVDAKMCSIILSSLCEQKGLSGD 238

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           +VL F+ +MR  GF P  VD  NV+ FLVK    +DAL + +QMK+DGIKPD V YTM+L
Sbjct: 239 EVLRFMEEMRKLGFYPGRVDCNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMIL 298

Query: 542 DG 547
           +G
Sbjct: 299 NG 300


>ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 491

 Score =  199 bits (505), Expect = 5e-49
 Identities = 101/185 (54%), Positives = 134/185 (72%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           IR YG  N++ DA+D+F RIP FRC PSA SLN+LL VLC   EGL+MV QVL+ S AM 
Sbjct: 114 IRFYGSANRVEDAIDVFCRIPKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSRAMG 173

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRL+ S+FRILISALC I  V +AI+I   M ++ GYD D  + S +LS+LC+ K +   
Sbjct: 174 IRLEESSFRILISALCRIGSVGYAIEIMKCMISN-GYDLDVKICSLVLSSLCEQKGVGGL 232

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           +V+GF+ +M+  GF P  +DY+NV+  LVK  +GLDAL +L +MK +G+KPD+VCYTMVL
Sbjct: 233 EVVGFVEEMKKVGFCPGMLDYSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVL 292

Query: 542 DGFIS 556
            G I+
Sbjct: 293 YGVIA 297


>ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Cucumis sativus]
           gi|449483740|ref|XP_004156675.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Cucumis sativus]
          Length = 491

 Score =  199 bits (505), Expect = 5e-49
 Identities = 99/185 (53%), Positives = 131/185 (70%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I+ YG  N++ DAV +F RIP FRC PS  SLN+LLS L    +GL ++  ++L SH+M 
Sbjct: 115 IKLYGRMNRIQDAVTLFRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMG 174

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRL+HSTF+ILI+ALC +NKV  A+++FN M T  GY  +  + S IL++LC+ K  S  
Sbjct: 175 IRLEHSTFQILITALCKVNKVGHAMELFNYMITE-GYGLNPQICSLILASLCQQKKSSGD 233

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            VLGFL +MR  GF P  VDY+NV+ F V    G DA+DLLN+MK+DG KPD+VCYTMVL
Sbjct: 234 VVLGFLEEMRQKGFCPAVVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVL 293

Query: 542 DGFIS 556
           +G I+
Sbjct: 294 NGVIA 298


>ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223526239|gb|EEF28557.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 373

 Score =  197 bits (502), Expect = 1e-48
 Identities = 95/176 (53%), Positives = 131/176 (74%)
 Frame = +2

Query: 29  LHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMNIRLDHSTFR 208
           + +A+ +F+R PNFRC PS + LN LLSVLC   EGL  V +VLLKS  MNIR++ S+FR
Sbjct: 1   MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQDMNIRMEESSFR 60

Query: 209 ILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSSQVLGFLGQM 388
           +LI+ALC+INKV +A+++FN M    G+  DS + S +LS+LC   D+SSS+V+ FLG++
Sbjct: 61  LLINALCSINKVGYAVEMFNCMIND-GFSVDSKICSLLLSSLCYQADISSSEVMRFLGEL 119

Query: 389 RNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVLDGFIS 556
           R  GF P   DY+ V+ FLV+   G++AL++LNQMK DGIKPD+VCYT VL+G I+
Sbjct: 120 RKFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIA 175


>gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]
          Length = 494

 Score =  196 bits (499), Expect = 3e-48
 Identities = 101/185 (54%), Positives = 133/185 (71%), Gaps = 3/185 (1%)
 Frame = +2

Query: 11  YGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMNIRL 190
           YG  +++ DA+DIF+RIP FRC PS++SLN+LL VLC R EGL+ V +VL+KS  MNIRL
Sbjct: 115 YGFLDRIEDAIDIFWRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRL 174

Query: 191 DHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALC---KIKDLSSS 361
           + ++FRILI+ALC I KV +AI+I + M +  GYD D+ + S ILS LC   K  DL+  
Sbjct: 175 EEASFRILITALCKIGKVGYAIEILDCMISD-GYDIDARICSLILSFLCGKNKELDLAGF 233

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            VL  L +M   GF P   DY+ V+  LV+  RGL+ALD+L QMK+DG+KPDVVCYTMVL
Sbjct: 234 DVLELLQKMEKMGFCPRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVL 293

Query: 542 DGFIS 556
            G ++
Sbjct: 294 HGIVA 298


>ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355498545|gb|AES79748.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 653

 Score =  196 bits (498), Expect = 3e-48
 Identities = 98/184 (53%), Positives = 130/184 (70%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           IR YG  +++ DAVD+FFRIP FRCTP+  SLN LLS+LC +RE L+MV  +LLKS  M 
Sbjct: 119 IRFYGFNDRVQDAVDLFFRIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRDMK 178

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRL+ S+F +LI ALC I +V +AI + N M    GY  D  + S I+S+LC+  DL+S 
Sbjct: 179 IRLEESSFWVLIKALCRIKRVDYAIKMMNCMVED-GYCLDDKICSLIISSLCEQNDLTSV 237

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           + L   G MR  GF P  +D TN++ FLVK  +G+DAL++LNQ+K DGIKPD+VCYT+VL
Sbjct: 238 EALVVWGNMRKLGFCPGVMDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVL 297

Query: 542 DGFI 553
            G +
Sbjct: 298 SGIV 301


>gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 480

 Score =  195 bits (496), Expect = 6e-48
 Identities = 98/182 (53%), Positives = 129/182 (70%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           ++ YG  N++ DAVDIF+RIP FRC PSA+SLNALL++LC  + GL+++ QVLL S  MN
Sbjct: 112 VKFYGKANRIQDAVDIFYRIPQFRCFPSAYSLNALLALLCRSQRGLKLLPQVLLNSLHMN 171

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRL+ STFR+L+  LC +NKVA+AI+I   M    G   +  ++SF+LS++C   DL   
Sbjct: 172 IRLEESTFRLLVCTLCRMNKVAYAIEILQRMLDD-GLGVNDKVFSFVLSSVCAEGDLDGE 230

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+GF   +R  GFSP   DY  V+ FLVK  RGLDA D+LNQMKSDGI P ++ YTMVL
Sbjct: 231 DVIGFWRGLRKLGFSPAMGDYDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVL 290

Query: 542 DG 547
           +G
Sbjct: 291 NG 292


>ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
           cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide
           repeat superfamily protein, putative [Theobroma cacao]
          Length = 542

 Score =  194 bits (494), Expect = 1e-47
 Identities = 100/185 (54%), Positives = 129/185 (69%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I TYG+ N++ DAVDIF+RIP FRC PSA+SLN+LL++LC  +  L++V QVLLKS  MN
Sbjct: 161 ITTYGIANRIQDAVDIFYRIPKFRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSLLMN 220

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IR++ ST RIL+SALC +NKV++AIDI   M    G   +  + SFILS++C   DL   
Sbjct: 221 IRVEESTLRILVSALCRMNKVSYAIDILQRMIDE-GLGVNDKVCSFILSSICAKADLDGE 279

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G   ++   GF P   DY  ++ FLVK  RGLDALD LNQMKS GIKP +V YTM L
Sbjct: 280 DVMGLWRELGKLGFCPAMSDYNCLIRFLVKKGRGLDALDFLNQMKSVGIKPGIVSYTMAL 339

Query: 542 DGFIS 556
           +G I+
Sbjct: 340 NGVIA 344


>ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum]
           gi|557112223|gb|ESQ52507.1| hypothetical protein
           EUTSA_v10017948mg [Eutrema salsugineum]
          Length = 456

 Score =  184 bits (468), Expect = 1e-44
 Identities = 93/185 (50%), Positives = 129/185 (69%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG + ++ +A+D+FF+IPNFRC PSA++LNALLSVL  +R+GL+MV +VLLK+  + 
Sbjct: 118 IFAYGFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASKLG 177

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           +RL+ ST  ILI ALC I +V  A D+   M+    Y  D  LYS +LS++CK KD S  
Sbjct: 178 VRLEESTLGILIDALCRIGEVDCATDLVKDMSDDC-YIVDPRLYSLLLSSVCKHKDSSCF 236

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G+L  +R   FSPD  DYT V+ FLV+  RG + + +LNQMK D I+PD+VCYT++L
Sbjct: 237 DVIGYLEGLRKTRFSPDLRDYTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIIL 296

Query: 542 DGFIS 556
            G I+
Sbjct: 297 QGVIA 301


>ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella]
           gi|482562854|gb|EOA27044.1| hypothetical protein
           CARUB_v10023139mg [Capsella rubella]
          Length = 470

 Score =  176 bits (445), Expect = 5e-42
 Identities = 89/185 (48%), Positives = 126/185 (68%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG   ++ +A+D+FF+IPNFRC PSA++LNALL VL  +RE L++V ++L+K+  M 
Sbjct: 132 IAAYGFAGRIGEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASRMG 191

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           +RL+ STF ILI ALC I +V  A ++   M+       D  LYS +LS++CK KD S  
Sbjct: 192 VRLEESTFGILIDALCKIGEVDCATELVRYMSIDC-VIVDPRLYSQLLSSVCKHKDSSCF 250

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G+L  +R   FSP   DYT V++FLV+  RG + + +LNQMK D I+PD+VCYT+VL
Sbjct: 251 DVVGYLEDLRKTRFSPGLRDYTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVL 310

Query: 542 DGFIS 556
            G I+
Sbjct: 311 QGVIA 315


>ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297325583|gb|EFH56003.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 444

 Score =  173 bits (439), Expect = 2e-41
 Identities = 88/185 (47%), Positives = 125/185 (67%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG + ++ +A+D+FF+IPNFRC PSA++LNALL VL  +R+ L++V ++L+K+  M 
Sbjct: 106 IAAYGFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASRMG 165

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           +RL+ STF ILI+ALC I +V  A ++   M+       D  LYS +LS++CK KD S  
Sbjct: 166 VRLEESTFGILINALCRIGEVDCATELVRYMSED-SVIVDPRLYSLLLSSVCKHKDSSCF 224

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G+L  +R   F P   DYT V+ FLV+  RG + + +LNQMK D I PDVVCYT+VL
Sbjct: 225 DVIGYLEDLRKTRFLPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVL 284

Query: 542 DGFIS 556
            G I+
Sbjct: 285 LGVIA 289


>ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g38420, mitochondrial; Flags: Precursor
           gi|3395430|gb|AAC28762.1| hypothetical protein
           [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 453

 Score =  172 bits (437), Expect = 4e-41
 Identities = 86/185 (46%), Positives = 126/185 (68%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG + ++ +A+++FF+IPNFRC PSA++LNALL VL  +R+ L++V ++L+K+  M 
Sbjct: 115 IAAYGFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMG 174

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           +RL+ STF ILI ALC I +V  A ++   M+       D  LYS +LS++CK KD S  
Sbjct: 175 VRLEESTFGILIDALCRIGEVDCATELVRYMSQD-SVIVDPRLYSRLLSSVCKHKDSSCF 233

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G+L  +R   FSP   DYT V+ FLV+  RG + + +LNQMK D ++PD+VCYT+VL
Sbjct: 234 DVIGYLEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVL 293

Query: 542 DGFIS 556
            G I+
Sbjct: 294 QGVIA 298


>gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1|
           At2g38420 [Arabidopsis thaliana]
          Length = 444

 Score =  172 bits (437), Expect = 4e-41
 Identities = 86/185 (46%), Positives = 126/185 (68%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I  YG + ++ +A+++FF+IPNFRC PSA++LNALL VL  +R+ L++V ++L+K+  M 
Sbjct: 106 IAAYGFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMG 165

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           +RL+ STF ILI ALC I +V  A ++   M+       D  LYS +LS++CK KD S  
Sbjct: 166 VRLEESTFGILIDALCRIGEVDCATELVRYMSQD-SVIVDPRLYSRLLSSVCKHKDSSCF 224

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
            V+G+L  +R   FSP   DYT V+ FLV+  RG + + +LNQMK D ++PD+VCYT+VL
Sbjct: 225 DVIGYLEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVL 284

Query: 542 DGFIS 556
            G I+
Sbjct: 285 QGVIA 289


>ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
           mitochondrial-like [Solanum lycopersicum]
          Length = 496

 Score =  162 bits (411), Expect = 4e-38
 Identities = 84/184 (45%), Positives = 120/184 (65%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I+ YG +N  H A ++FF +P +RC PS  SLN L+ VLC     L++V QVL+KS  +N
Sbjct: 135 IKFYGDSNMTHLAYEMFFTMPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVKSQLLN 194

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           I ++ STF+ILI ALC I K   A+D+  LM    G++ D+ + S ILS +  +KD    
Sbjct: 195 IWVEESTFKILIRALCRIGKTNNAVDLLKLMVDS-GFNLDANICSLILSTMPDVKDCVGV 253

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           ++ G L +MR  G+SP  VD  NV+ F V + +G+DAL++LN+MK  G+ PDVVCY +VL
Sbjct: 254 EIWGVLEEMRKLGYSPKRVDLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCYNLVL 313

Query: 542 DGFI 553
           +G I
Sbjct: 314 NGLI 317


>ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda]
           gi|548857785|gb|ERN15583.1| hypothetical protein
           AMTR_s00048p00149840 [Amborella trichopoda]
          Length = 464

 Score =  152 bits (383), Expect = 7e-35
 Identities = 82/183 (44%), Positives = 119/183 (65%)
 Frame = +2

Query: 2   IRTYGLTNKLHDAVDIFFRIPNFRCTPSAFSLNALLSVLCHRREGLQMVRQVLLKSHAMN 181
           I++   +  + +A+D+FF +P+ RC PS  SLNALLSVLC   +   +V ++L+K+  MN
Sbjct: 124 IQSCASSKMVKEALDLFFAMPHLRCQPSTTSLNALLSVLCDT-DSFHLVPELLIKTLEMN 182

Query: 182 IRLDHSTFRILISALCNINKVAFAIDIFNLMTTHYGYDPDSTLYSFILSALCKIKDLSSS 361
           IRLD S+FRILI +LC I K+ FAI++  LM    G  PDS  Y+ IL  LC+  + S  
Sbjct: 183 IRLDASSFRILIGSLCRIGKLGFAIELLRLMPDQ-GCWPDSGFYAEILCKLCEFGEFS-- 239

Query: 362 QVLGFLGQMRNAGFSPDGVDYTNVLTFLVKSDRGLDALDLLNQMKSDGIKPDVVCYTMVL 541
           ++ GFL +M++AGF PD + Y  V+  L K  R  +A  +LN+MK +G KPD + YT ++
Sbjct: 240 EIYGFLDEMKDAGFFPDKIAYAIVIDSLAKGGRLNEARAILNRMKLEGAKPDTITYTSMM 299

Query: 542 DGF 550
           DGF
Sbjct: 300 DGF 302


Top