BLASTX nr result

ID: Sinomenium21_contig00019721 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00019721
         (2344 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267684.1| PREDICTED: pentatricopeptide repeat-containi...   455   e-125
emb|CAN60667.1| hypothetical protein VITISV_028261 [Vitis vinifera]   452   e-124
ref|XP_006442285.1| hypothetical protein CICLE_v10020422mg [Citr...   444   e-122
ref|XP_006849944.1| hypothetical protein AMTR_s00022p00130870 [A...   444   e-122
ref|XP_006477941.1| PREDICTED: pentatricopeptide repeat-containi...   443   e-121
ref|XP_004248491.1| PREDICTED: pentatricopeptide repeat-containi...   433   e-118
gb|EXC11739.1| hypothetical protein L484_020794 [Morus notabilis]     432   e-118
ref|XP_007022117.1| Pentatricopeptide repeat (PPR) superfamily p...   425   e-116
ref|XP_006360029.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116
gb|EMT16192.1| hypothetical protein F775_07734 [Aegilops tauschii]    421   e-115
ref|XP_004146072.1| PREDICTED: pentatricopeptide repeat-containi...   419   e-114
ref|XP_002521673.1| pentatricopeptide repeat-containing protein,...   416   e-113
ref|XP_006407066.1| hypothetical protein EUTSA_v10020856mg [Eutr...   414   e-113
ref|XP_002882884.1| predicted protein [Arabidopsis lyrata subsp....   414   e-112
ref|XP_004976824.1| PREDICTED: pentatricopeptide repeat-containi...   412   e-112
ref|XP_002447107.1| hypothetical protein SORBIDRAFT_06g028710 [S...   412   e-112
dbj|BAK08007.1| predicted protein [Hordeum vulgare subsp. vulgare]    410   e-111
ref|XP_004301959.1| PREDICTED: pentatricopeptide repeat-containi...   410   e-111
ref|XP_006299283.1| hypothetical protein CARUB_v10015437mg [Caps...   407   e-110
ref|NP_188076.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110

>ref|XP_002267684.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Vitis vinifera]
          Length = 393

 Score =  455 bits (1170), Expect = e-125
 Identities = 214/339 (63%), Positives = 273/339 (80%)
 Frame = -1

Query: 2149 DPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHA 1970
            D +KRL HKDWL+P EVLKIF  L++P SV+   + VS+RKD++P+E LYTLVI KL+ A
Sbjct: 39   DDLKRLDHKDWLSPREVLKIFDGLRNPESVMPVLDSVSKRKDFKPNEALYTLVINKLAQA 98

Query: 1969 KKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTK 1790
            + FDAI+D++  ++ ++ CR+SD FFY+VIK+YGNVAG  D+AVETLF MP+++CWP+ K
Sbjct: 99   RMFDAIEDVIKTLKIDKQCRLSDVFFYNVIKVYGNVAGQPDRAVETLFDMPKFHCWPSVK 158

Query: 1789 TFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDEL 1610
            TFN VLNMLVSAK+FDV+H+VY GAP LGVEID CCLNIL+KGLC    +D+A  LLDE 
Sbjct: 159  TFNLVLNMLVSAKRFDVVHKVYAGAPELGVEIDACCLNILVKGLCRSGNVDAACELLDEY 218

Query: 1609 PKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRV 1430
            PKQ  +PNVRT+ST+MH LCE GRVE A  L ERMEREG  PDT++FN LISGL K+GRV
Sbjct: 219  PKQRCRPNVRTFSTLMHGLCESGRVEGALGLLERMEREGVYPDTVVFNILISGLRKRGRV 278

Query: 1429 ADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLL 1250
             +GM+L  RM LKGC PN+GSYQ ++YG+LD+  F +AK FM +MI +GV PSF SYK++
Sbjct: 279  EEGMELLGRMKLKGCYPNAGSYQEVLYGVLDTGRFGKAKEFMCQMIDEGVSPSFVSYKMM 338

Query: 1249 IHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            I+GLC ENL+ DV  +LK+MV+QGFVP+  MWR+IL+TM
Sbjct: 339  IYGLCKENLVADVVWILKQMVEQGFVPERWMWRRILQTM 377


>emb|CAN60667.1| hypothetical protein VITISV_028261 [Vitis vinifera]
          Length = 393

 Score =  452 bits (1164), Expect = e-124
 Identities = 213/339 (62%), Positives = 272/339 (80%)
 Frame = -1

Query: 2149 DPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHA 1970
            D +KRL HKDWL+P EVLKIF  L++P SV+   + V +RKD++P+E LYTLVI KL+ A
Sbjct: 39   DDLKRLDHKDWLSPREVLKIFDGLRNPESVMPVLDSVCKRKDFKPNEALYTLVINKLAQA 98

Query: 1969 KKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTK 1790
            + FDAI+D++  ++ ++ CR+SD FFY+VIK+YGNVAG  D+AVETLF MP+++CWP+ K
Sbjct: 99   RMFDAIEDVIKTLKIDKQCRLSDVFFYNVIKVYGNVAGRPDRAVETLFDMPKFHCWPSVK 158

Query: 1789 TFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDEL 1610
            TFN VLNMLVSAK+FDV+H+VY GAP LGVEID CCLNIL+KGLC    +D+A  LLDE 
Sbjct: 159  TFNLVLNMLVSAKRFDVVHKVYAGAPELGVEIDACCLNILVKGLCRSGNVDAACELLDEY 218

Query: 1609 PKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRV 1430
            PKQ  +PNVRT+ST+MH LCE GRVE A  L ERMEREG  PDT++FN LISGL K+GRV
Sbjct: 219  PKQRCRPNVRTFSTLMHGLCESGRVEGALGLLERMEREGVYPDTVVFNILISGLRKRGRV 278

Query: 1429 ADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLL 1250
             +GM+L  RM LKGC PN+GSYQ ++YG+LD+  F +AK FM +MI +GV PSF SYK++
Sbjct: 279  EEGMELLGRMKLKGCYPNAGSYQEVLYGVLDTGRFGKAKEFMCQMIDEGVSPSFVSYKMV 338

Query: 1249 IHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            I+GLC ENL+ DV  +LK+MV+QGFVP+  MWR+IL+TM
Sbjct: 339  IYGLCKENLVADVVWILKQMVEQGFVPERWMWRRILQTM 377


>ref|XP_006442285.1| hypothetical protein CICLE_v10020422mg [Citrus clementina]
            gi|557544547|gb|ESR55525.1| hypothetical protein
            CICLE_v10020422mg [Citrus clementina]
          Length = 404

 Score =  444 bits (1143), Expect = e-122
 Identities = 216/337 (64%), Positives = 262/337 (77%)
 Frame = -1

Query: 2137 RLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFD 1958
            +L+HKDWL+P EVLKIF  L+DP SV++   + S+RKDY P+E LYTL+I KL+ AK+FD
Sbjct: 49   KLNHKDWLSPTEVLKIFSNLRDPISVISVLNQYSKRKDYNPNEALYTLIINKLAQAKRFD 108

Query: 1957 AIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFNY 1778
            AI+DI+ RI+ E+ CR SD FFY+VIKIYGN+AG I KA+ETLF MP Y CWP+ KTFN 
Sbjct: 109  AIEDIMQRIKVEKLCRFSDGFFYNVIKIYGNMAGRISKAIETLFDMPSYNCWPSVKTFNL 168

Query: 1777 VLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQG 1598
            VLN+LVSAK F  I  +Y  A +LGVEID CCLNIL+KGLCE   L++AF +LDE PKQ 
Sbjct: 169  VLNLLVSAKLFGEIQGIYTSAAKLGVEIDACCLNILLKGLCENGNLEAAFYVLDEFPKQN 228

Query: 1597 LKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGM 1418
             +PNVRTYST+MH LCE+G VEEAF L ERME EG D DT+ FN LISGL KQG+V +GM
Sbjct: 229  CEPNVRTYSTLMHGLCEKGNVEEAFGLLERMESEGIDADTVTFNILISGLRKQGKVEEGM 288

Query: 1417 DLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGL 1238
             L +RM  KGC PNS SYQ ++YGLLD K F EAK  + RMI + + PSF SYK LIHGL
Sbjct: 289  KLLERMKGKGCYPNSASYQEVLYGLLDKKRFPEAKELVGRMICERMSPSFVSYKKLIHGL 348

Query: 1237 CDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETMGC 1127
            C++ L++DVD VLK MVQQGFVP+MGMWR+I   +GC
Sbjct: 349  CNQKLVEDVDWVLKTMVQQGFVPRMGMWREI---VGC 382


>ref|XP_006849944.1| hypothetical protein AMTR_s00022p00130870 [Amborella trichopoda]
            gi|548853542|gb|ERN11525.1| hypothetical protein
            AMTR_s00022p00130870 [Amborella trichopoda]
          Length = 404

 Score =  444 bits (1142), Expect = e-122
 Identities = 205/343 (59%), Positives = 270/343 (78%)
 Frame = -1

Query: 2140 KRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKF 1961
            KRL HKDWLAPNEVLKIF+ ++DP    + F+K+ +RKDY+P+E LYT++I+ L+ AKKF
Sbjct: 56   KRLDHKDWLAPNEVLKIFKSVRDPQMAFDLFQKLVRRKDYKPNEALYTILIEMLASAKKF 115

Query: 1960 DAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFN 1781
            DAI+++L+R++ E+ C++SDEFF  +IK+Y N+  +  +AV  L+ MP+++CWP+ +TFN
Sbjct: 116  DAIEELLTRMKMEK-CKLSDEFFRHLIKLYANIGKNAVQAVNILYRMPDFHCWPSVRTFN 174

Query: 1780 YVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQ 1601
             VLNMLV AKQ++++HEVYL A +LGV +DTCC NILIK LC+C  LD+AFSLL E PKQ
Sbjct: 175  SVLNMLVCAKQYEMVHEVYLSASQLGVAVDTCCFNILIKALCQCGDLDAAFSLLQEAPKQ 234

Query: 1600 GLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADG 1421
            G +PN  TY+T+MH LC+ GRV EAF+LYERMERE C PDTI FN LISGLCKQG V   
Sbjct: 235  GCRPNATTYATLMHGLCKSGRVSEAFELYERMEREVCYPDTITFNILISGLCKQGSVKQA 294

Query: 1420 MDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHG 1241
            MDL   M LKGC PNSGSYQA++YGLLD+ +F EA   +  M+SKG+FPSF SYK+LI G
Sbjct: 295  MDLLHTMKLKGCYPNSGSYQALIYGLLDASDFVEANKLLSLMVSKGIFPSFLSYKMLIDG 354

Query: 1240 LCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETMGCQESCK 1112
            LCD +LL DVD+VL +M+ QGF+P+MG W +ILE++    +C+
Sbjct: 355  LCDMDLLRDVDAVLTQMINQGFIPRMGTWMRILESLFRGRTCE 397



 Score = 75.5 bits (184), Expect = 1e-10
 Identities = 58/234 (24%), Positives = 100/234 (42%)
 Frame = -1

Query: 2017 PSEPLYTLVIQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAV 1838
            PS   +  V+  L  AK+++ + ++     ++    +    F  +IK      G +D A 
Sbjct: 168  PSVRTFNSVLNMLVCAKQYEMVHEVYLSA-SQLGVAVDTCCFNILIKALCQ-CGDLDAAF 225

Query: 1837 ETLFSMPEYYCWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGL 1658
              L   P+  C P   T+  +++ L  + +     E+Y    R     DT   NILI GL
Sbjct: 226  SLLQEAPKQGCRPNATTYATLMHGLCKSGRVSEAFELYERMEREVCYPDTITFNILISGL 285

Query: 1657 CECDKLDSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDT 1478
            C+   +  A  LL  +  +G  PN  +Y  +++ L +     EA KL   M  +G  P  
Sbjct: 286  CKQGSVKQAMDLLHTMKLKGCYPNSGSYQALIYGLLDASDFVEANKLLSLMVSKGIFPSF 345

Query: 1477 IIFNTLISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEA 1316
            + +  LI GLC    + D   +  +M  +G  P  G++  I+  L   +   +A
Sbjct: 346  LSYKMLIDGLCDMDLLRDVDAVLTQMINQGFIPRMGTWMRILESLFRGRTCEDA 399


>ref|XP_006477941.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Citrus sinensis]
          Length = 404

 Score =  443 bits (1139), Expect = e-121
 Identities = 215/339 (63%), Positives = 263/339 (77%)
 Frame = -1

Query: 2143 VKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKK 1964
            V +L+HKDWL+P EVLKIF  L+DP SV++   + S+RKDY P+E LYTL+I KL+ AK 
Sbjct: 47   VYKLNHKDWLSPTEVLKIFSNLRDPISVISVLNQYSKRKDYNPNEALYTLIINKLAQAKS 106

Query: 1963 FDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTF 1784
            FDAI+DI+ RI+ E+ CR SD FFY+VIKIYGN+AG I KA+ETLF MP Y CWP+ KTF
Sbjct: 107  FDAIEDIMQRIKVEKLCRFSDAFFYNVIKIYGNMAGRIGKAIETLFDMPSYNCWPSVKTF 166

Query: 1783 NYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPK 1604
            N VLN+LVSAK +  I  +Y  A +LGVEID CCLNIL+KGLCE   L++AF +LDE PK
Sbjct: 167  NLVLNLLVSAKLYGEIQGIYTSAAKLGVEIDACCLNILLKGLCENGNLEAAFYVLDEFPK 226

Query: 1603 QGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVAD 1424
            Q  +PNVRT+ST+MH LCE+G VEEAF L ERME EG D DT+ FN LISGL KQG+V +
Sbjct: 227  QNCEPNVRTFSTLMHGLCEKGNVEEAFGLLERMESEGIDADTVTFNILISGLRKQGKVEE 286

Query: 1423 GMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIH 1244
            GM L +RM  KGC PNS SYQ ++YGLLD K F EAK  + RMI + + PSF SYK LIH
Sbjct: 287  GMKLLERMKGKGCYPNSASYQEVLYGLLDKKRFPEAKELVGRMICERMSPSFVSYKKLIH 346

Query: 1243 GLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETMGC 1127
            GLC++ L++DVD VLK+MVQQGFVP+MGMWR+I   +GC
Sbjct: 347  GLCNQKLVEDVDWVLKKMVQQGFVPRMGMWREI---VGC 382


>ref|XP_004248491.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Solanum lycopersicum]
          Length = 387

 Score =  433 bits (1113), Expect = e-118
 Identities = 200/359 (55%), Positives = 273/359 (76%), Gaps = 4/359 (1%)
 Frame = -1

Query: 2179 SRLNSTVSSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLY 2000
            S L +  S F    ++ + DWL+ NEV+KIF+ LK+P S L    ++S RKDY+P+E +Y
Sbjct: 22   SNLGTPSSPFSS-SQIQNSDWLSSNEVIKIFQNLKNPNSALTLLNQISNRKDYRPNEAIY 80

Query: 1999 TLVIQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSM 1820
            ++V++ L+ AK FDAI+ ++ +I+ ER CR+SDEFFY+VIKIYG++AG I+++++TLF M
Sbjct: 81   SVVVKNLAIAKNFDAIETLMEKIKIERKCRLSDEFFYNVIKIYGHLAGRINRSIDTLFDM 140

Query: 1819 PEYYCWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKL 1640
            P Y C+P+ KTFN+VLN+LV+ KQFDV+H+VY+    LGVEID CCLNI+IKGLC C ++
Sbjct: 141  PNYKCFPSVKTFNFVLNLLVNTKQFDVVHKVYVRGSELGVEIDACCLNIIIKGLCRCGEI 200

Query: 1639 DSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTL 1460
            D+A+ + DE PKQ   PNVRT+STIMH LC+ GRV+EA  L +RME E  +PD I+FNTL
Sbjct: 201  DAAYKVFDEFPKQNCSPNVRTFSTIMHALCDHGRVDEALSLLDRMENENVEPDAIVFNTL 260

Query: 1459 ISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGV 1280
            ISGL KQ RV +G+D+F ++ LKGC PN G+YQ ++Y LLD+K F EAK FM  MI K V
Sbjct: 261  ISGLRKQRRVDEGIDMFKKVMLKGCDPNPGTYQEVLYALLDAKRFLEAKNFMSVMIDKRV 320

Query: 1279 FPSFQSYKLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM----GCQESC 1115
             PSF+SYK+++HGLCD  L+ D+D VLK+MV+ GFVP+MGMW+KIL  +    GC  +C
Sbjct: 321  NPSFESYKVIVHGLCDGKLVGDLDWVLKQMVRHGFVPRMGMWKKILGCLFPDGGCCTTC 379


>gb|EXC11739.1| hypothetical protein L484_020794 [Morus notabilis]
          Length = 405

 Score =  432 bits (1111), Expect = e-118
 Identities = 200/357 (56%), Positives = 275/357 (77%), Gaps = 1/357 (0%)
 Frame = -1

Query: 2170 NSTVSSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLV 1991
            +S+ SS +   RL HKDWLAP EVL++F  L +P S++ A    S+RKDY+P+EPL TL+
Sbjct: 40   SSSSSSSEIPSRLHHKDWLAPKEVLQVFSSLTNPNSIVPALNHYSKRKDYKPNEPLLTLI 99

Query: 1990 IQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEY 1811
            I  L+ A+ FD ++ +++RI+ ER+C +SD+FF +VI++YGN+AG I +A+E LF MPE+
Sbjct: 100  INNLAEARLFDDVEVVVARIKAERNCNLSDDFFRNVIRVYGNLAGRIKRAIEILFEMPEF 159

Query: 1810 Y-CWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDS 1634
            Y CWPT KTFN VLN+LVSA+ FDV+HE+++ AP+LGV ID CCLNI+IKGLCEC KL+ 
Sbjct: 160  YGCWPTAKTFNSVLNLLVSARLFDVVHELFVAAPKLGVVIDACCLNIMIKGLCECRKLEV 219

Query: 1633 AFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLIS 1454
            A  +LDE P+QG +PN+ T++T+MH+LC  G+VEEA KL+ERME EG +PDTI FN LI+
Sbjct: 220  ALQMLDEFPRQGCEPNLLTFTTLMHYLCVHGKVEEAIKLFERMEEEGIEPDTITFNVLIA 279

Query: 1453 GLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFP 1274
            GL +QGRV +GM L  RM LKGC+PN GSYQ +  GLLD++ F+EA   M R+IS G  P
Sbjct: 280  GLRRQGRVDEGMVLLQRMKLKGCNPNVGSYQEVFNGLLDAERFSEANEVMSRIISMGSSP 339

Query: 1273 SFQSYKLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETMGCQESCKSCI 1103
            S +S+K LIHGLC+EN ++D+D  LK+M +QGFVPKM MW++IL+++   ++   C+
Sbjct: 340  SIKSFKCLIHGLCEENRMEDIDWALKQMGKQGFVPKMWMWKEILQSLFGGKTSDKCV 396


>ref|XP_007022117.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao] gi|508721745|gb|EOY13642.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative [Theobroma cacao]
          Length = 408

 Score =  425 bits (1093), Expect = e-116
 Identities = 200/342 (58%), Positives = 266/342 (77%)
 Frame = -1

Query: 2164 TVSSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQ 1985
            T  +  P+ +L+HKDWL+PNE+LKIF  LK+P SV++   + S RKDY+P+EPL+TLVI 
Sbjct: 47   TAPTSPPLFKLTHKDWLSPNEILKIFDNLKEPTSVISVLNQYSARKDYKPTEPLFTLVIN 106

Query: 1984 KLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYC 1805
            KL+ A+ FD I++I+ +++ E+ CR+SD+FF +VIK YG+  G I +A+ETLFSMP+Y  
Sbjct: 107  KLADAQDFDCIENIMEKLKHEKPCRLSDDFFQNVIKKYGHCGGRIKRAIETLFSMPDYGT 166

Query: 1804 WPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFS 1625
            WP+ KTFN +L++LV+ K FDV+HE+Y   P+LG+EI+ C LNILIKGLCE  KL+SAF 
Sbjct: 167  WPSVKTFNIILSLLVANKLFDVVHEIYGKGPKLGIEIEACTLNILIKGLCENGKLESAFQ 226

Query: 1624 LLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLC 1445
            +LDE PKQG KPNVRT+ST+MH LCE+G+V+EAF+L  RME EG + D + FN LISGL 
Sbjct: 227  VLDEFPKQGCKPNVRTFSTLMHGLCEKGKVDEAFELMGRMETEGIEADAVSFNILISGLR 286

Query: 1444 KQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQ 1265
            KQGRV +G+ L ++M  KGC PN+GSYQ ++YGLLD++ F EAK  M RMI + V PSF 
Sbjct: 287  KQGRVEEGVKLLEKMKRKGCYPNAGSYQEVLYGLLDAERFMEAKELMGRMILERVSPSFD 346

Query: 1264 SYKLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILE 1139
            SYK LIHG C E L+ +VD  LK+MVQQGFVPKMGMW ++++
Sbjct: 347  SYKKLIHGFCKEKLVREVDWALKQMVQQGFVPKMGMWTQMVK 388


>ref|XP_006360029.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Solanum tuberosum]
          Length = 387

 Score =  425 bits (1092), Expect = e-116
 Identities = 199/363 (54%), Positives = 270/363 (74%)
 Frame = -1

Query: 2215 FIHVMLMD*RMVSRLNSTVSSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVS 2036
            FIH +     +      T SS     ++ + DWL+ NEV+KIF+ LK+  S L    ++S
Sbjct: 9    FIHKLYFPFSLQRSNLGTPSSPFSSSQIQNSDWLSSNEVIKIFQNLKNANSALTLLNQIS 68

Query: 2035 QRKDYQPSEPLYTLVIQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAG 1856
             RKDY+P+E +Y++V++ L+ AK FDAI+ ++ +I+ ER CR+SDEFFY+VIKIYG++AG
Sbjct: 69   NRKDYRPNEAIYSVVVKNLAIAKNFDAIESLMEKIKIERKCRLSDEFFYNVIKIYGHLAG 128

Query: 1855 HIDKAVETLFSMPEYYCWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLN 1676
             I++A++T F MP Y C+P+ KTFN++LN+LV+ KQFDV+H+VYL    LGVEID CCLN
Sbjct: 129  RINRAIDTFFDMPNYKCFPSVKTFNFLLNLLVNTKQFDVVHKVYLRGSELGVEIDACCLN 188

Query: 1675 ILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMERE 1496
            I+IKGLC C ++ +A+ + DE PKQ   PNVRT+STIMH LC+ GRV+EA  L ERME E
Sbjct: 189  IIIKGLCRCGEIAAAYKVFDEFPKQNCSPNVRTFSTIMHALCDHGRVDEALSLLERMENE 248

Query: 1495 GCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEA 1316
              +PD I+FNTLISGL KQ RV +G+D+F ++ LKGC PN G+YQ ++Y LLD+K + EA
Sbjct: 249  DVEPDAIVFNTLISGLRKQRRVDEGIDMFKKVMLKGCDPNPGTYQEVLYALLDAKRYLEA 308

Query: 1315 KCFMDRMISKGVFPSFQSYKLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILET 1136
            K FM  MI K V PSF+SYK+++HGLCD  L+ D+D VLK+M++ GFVP+MGMWRKIL  
Sbjct: 309  KDFMAVMIDKRVNPSFESYKVIVHGLCDGKLVGDLDWVLKQMMRHGFVPRMGMWRKIL-- 366

Query: 1135 MGC 1127
             GC
Sbjct: 367  -GC 368


>gb|EMT16192.1| hypothetical protein F775_07734 [Aegilops tauschii]
          Length = 364

 Score =  421 bits (1083), Expect = e-115
 Identities = 196/339 (57%), Positives = 255/339 (75%)
 Frame = -1

Query: 2143 VKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKK 1964
            + RL HKDWLAPNEVLKIF  ++D   + + F+K   R+DY+PSE LY L+I +L  A++
Sbjct: 27   IGRLDHKDWLAPNEVLKIFASIRDAALITSVFKKACARRDYKPSEALYGLMIDRLPRARR 86

Query: 1963 FDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTF 1784
                 ++L+R R ER  R+SDEFFY +IK+YGNVA H +KA+ETL++MP+Y CWP+TKTF
Sbjct: 87   VGVAWELLARARAER-VRVSDEFFYRLIKMYGNVANHPEKAMETLYAMPDYGCWPSTKTF 145

Query: 1783 NYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPK 1604
            NYVL+MLV  +Q++V+HEVY  APRLGVE+DTCC NILIKGLC+  + + A SLLDE+PK
Sbjct: 146  NYVLHMLVCKRQYEVVHEVYASAPRLGVELDTCCFNILIKGLCQFGRFNEALSLLDEMPK 205

Query: 1603 QGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVAD 1424
            Q  +PNV TYST+MHFLC + RV+EAFKL+ERM +E  D DT+++N L+SGLC+QGRV  
Sbjct: 206  QECRPNVTTYSTLMHFLCRKSRVDEAFKLFERMRKEEIDADTVVYNILVSGLCRQGRVTS 265

Query: 1423 GMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIH 1244
              DLF  M  +GC PNSG+YQ ++ GL+ SK F EAK  +  M ++ + PSF SYKLLI 
Sbjct: 266  AYDLFKSMSSEGCHPNSGTYQVLLDGLVASKKFVEAKDLVTMMSAESLRPSFSSYKLLID 325

Query: 1243 GLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETMGC 1127
            GLC  N LDD   VLK+MV QGFVP+MG W K+L ++ C
Sbjct: 326  GLCSVNCLDDAHHVLKQMVDQGFVPRMGTWTKLLTSLLC 364


>ref|XP_004146072.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Cucumis sativus]
            gi|449503658|ref|XP_004162112.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Cucumis sativus]
          Length = 411

 Score =  419 bits (1077), Expect = e-114
 Identities = 194/335 (57%), Positives = 259/335 (77%)
 Frame = -1

Query: 2137 RLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFD 1958
            +LSH+DWL+PNEV+ I + ++ P SVL    + S RKDY+P++ +YTLV+ +L+  + FD
Sbjct: 49   KLSHRDWLSPNEVINIIQQIQHPSSVLAFLHQWSNRKDYKPNKEIYTLVVSRLAEGRLFD 108

Query: 1957 AIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFNY 1778
             I+ ++ RI+ ER+ R+SDEFFY VIKIYGNVAG ++KA++TLF MP Y CWP+ KTFN+
Sbjct: 109  DIEKVMLRIKAERNFRLSDEFFYHVIKIYGNVAGRLNKAIDTLFDMPNYNCWPSVKTFNF 168

Query: 1777 VLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQG 1598
            VLN+LVSAK FDV+HEVY+GAP+LG+EID CCLNIL+KGLC+   LD+A  +LDE P+Q 
Sbjct: 169  VLNLLVSAKMFDVVHEVYMGAPKLGIEIDACCLNILVKGLCQSGNLDAALKVLDEFPQQR 228

Query: 1597 LKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGM 1418
             +PNVRT+ST++H LCE G +  A +L+ +ME EG  PDTI FN LISGL K+ R+ + +
Sbjct: 229  CRPNVRTFSTLLHGLCENGELGRALELFCKMENEGVCPDTITFNILISGLRKKKRIEEAI 288

Query: 1417 DLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGL 1238
            +L  RM LKGC PN+G+YQ ++YGLLD+  F EA+  M RMIS+G+ PSF SYK L+ GL
Sbjct: 289  ELLGRMKLKGCYPNAGTYQEVLYGLLDTGKFIEARDCMHRMISEGMDPSFVSYKKLLSGL 348

Query: 1237 CDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            C + L +DVD VLK+MV QGFVPK+GMW+ IL  M
Sbjct: 349  CKKKLTEDVDWVLKQMVMQGFVPKVGMWKVILRCM 383


>ref|XP_002521673.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539064|gb|EEF40660.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 341

 Score =  416 bits (1070), Expect = e-113
 Identities = 199/323 (61%), Positives = 255/323 (78%), Gaps = 1/323 (0%)
 Frame = -1

Query: 2098 LKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFDAIQDILSRIRTER 1919
            ++IF  +KDP SV + +++ + RKDY+P+E LYTLVI KL+ AK FDAI+DI+ RI+ E+
Sbjct: 1    MRIFESIKDPNSVFSVWDQYTNRKDYKPNEALYTLVINKLAQAKNFDAIEDIMQRIKLEK 60

Query: 1918 SCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPE-YYCWPTTKTFNYVLNMLVSAKQFD 1742
            SCR+S+ FFY+VIKIYG++AG I  A++TLF MP  Y CWP  KTFN VLN+LVSA+ FD
Sbjct: 61   SCRLSNGFFYNVIKIYGHLAGRIKVAIDTLFDMPRGYNCWPDVKTFNLVLNLLVSARIFD 120

Query: 1741 VIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYSTIM 1562
            V+HE+Y  AP LGVEID CCLNILIKGLCE   L++AF +LDE PKQ  KPNVRT+ST+M
Sbjct: 121  VVHEIYEKAPILGVEIDACCLNILIKGLCENGDLEAAFYVLDEFPKQRCKPNVRTFSTLM 180

Query: 1561 HFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKGCS 1382
            H+LC +G V +AF L +RME EG + DTI FN LISGL K+GR+ +GM+L  +M LKGC 
Sbjct: 181  HYLCAKGEVNQAFGLLDRMENEGIEVDTITFNILISGLRKRGRIEEGMELLVKMKLKGCE 240

Query: 1381 PNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGLCDENLLDDVDSV 1202
            PN+GSYQ ++YGLLD+  F+EAK FMDRM+ +G  PSF SYK LI GLC E L+ +VD V
Sbjct: 241  PNAGSYQEVLYGLLDAGKFSEAKDFMDRMVCEGNGPSFVSYKKLIDGLCKEKLIGEVDCV 300

Query: 1201 LKRMVQQGFVPKMGMWRKILETM 1133
            LK+M++QGFVPKMGMW+ I+ +M
Sbjct: 301  LKQMLKQGFVPKMGMWKHIVGSM 323



 Score = 87.8 bits (216), Expect = 2e-14
 Identities = 60/243 (24%), Positives = 110/243 (45%), Gaps = 6/243 (2%)
 Frame = -1

Query: 2017 PSEPLYTLVIQKLSHAKKFDAIQDILSR-----IRTERSCRISDEFFYSVIKIYGNVA-G 1856
            P    + LV+  L  A+ FD + +I  +     +  +  C          I I G    G
Sbjct: 101  PDVKTFNLVLNLLVSARIFDVVHEIYEKAPILGVEIDACCLN--------ILIKGLCENG 152

Query: 1855 HIDKAVETLFSMPEYYCWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLN 1676
             ++ A   L   P+  C P  +TF+ +++ L +  + +    +       G+E+DT   N
Sbjct: 153  DLEAAFYVLDEFPKQRCKPNVRTFSTLMHYLCAKGEVNQAFGLLDRMENEGIEVDTITFN 212

Query: 1675 ILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMERE 1496
            ILI GL +  +++    LL ++  +G +PN  +Y  +++ L + G+  EA    +RM  E
Sbjct: 213  ILISGLRKRGRIEEGMELLVKMKLKGCEPNAGSYQEVLYGLLDAGKFSEAKDFMDRMVCE 272

Query: 1495 GCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEA 1316
            G  P  + +  LI GLCK+  + +   +  +M  +G  P  G ++ I+  +L       +
Sbjct: 273  GNGPSFVSYKKLIDGLCKEKLIGEVDCVLKQMLKQGFVPKMGMWKHIVGSMLSESGDCTS 332

Query: 1315 KCF 1307
             CF
Sbjct: 333  ICF 335


>ref|XP_006407066.1| hypothetical protein EUTSA_v10020856mg [Eutrema salsugineum]
            gi|557108212|gb|ESQ48519.1| hypothetical protein
            EUTSA_v10020856mg [Eutrema salsugineum]
          Length = 404

 Score =  414 bits (1064), Expect = e-113
 Identities = 190/332 (57%), Positives = 256/332 (77%)
 Frame = -1

Query: 2137 RLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFD 1958
            RL HKDWLAPNEVLKIF  +KDP  ++ A++  S+RKDYQP+EPLY L+I K   AK FD
Sbjct: 48   RLKHKDWLAPNEVLKIFENVKDPSFLMPAYQHYSKRKDYQPTEPLYALLINKFGQAKMFD 107

Query: 1957 AIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFNY 1778
             I++++  ++ E+ CR S+EFFY++++IYGN+AG I++A+E LF MP++ CWP+ K+FN+
Sbjct: 108  EIEELMRNVKLEKRCRFSEEFFYNLMRIYGNLAGRINRAIEILFGMPDFGCWPSVKSFNF 167

Query: 1777 VLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQG 1598
            VLN+LVSAK FD IH++++ APRLGVEID CCLNILIKGLCE   L++A  +LDE PKQ 
Sbjct: 168  VLNLLVSAKLFDEIHKIFVSAPRLGVEIDGCCLNILIKGLCESGNLEAALQVLDEFPKQK 227

Query: 1597 LKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGM 1418
             +PNV T+S ++   C +G+ EEAFKL ERME+E  +PDTI FN LISGL K+GRV +G+
Sbjct: 228  SRPNVMTFSPLIRGFCNKGKFEEAFKLLERMEKERIEPDTITFNILISGLRKKGRVEEGI 287

Query: 1417 DLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGL 1238
            +L +RM +KGC PN G+YQ ++YGLLD K   EAK  M +MIS G+ PSF SYK ++ GL
Sbjct: 288  ELLERMRVKGCQPNPGTYQEVLYGLLDKKRNLEAKEMMSQMISWGMRPSFVSYKKMVLGL 347

Query: 1237 CDENLLDDVDSVLKRMVQQGFVPKMGMWRKIL 1142
            C+   ++++D VL++MV  GFVPK GMW K+L
Sbjct: 348  CETKSVEEMDWVLRQMVNHGFVPKTGMWWKVL 379


>ref|XP_002882884.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297328724|gb|EFH59143.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 409

 Score =  414 bits (1063), Expect = e-112
 Identities = 192/337 (56%), Positives = 256/337 (75%)
 Frame = -1

Query: 2158 SSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKL 1979
            S  D + RL HKDWLAPNEVLKIF  +KDP  ++ A++  S+RKDYQP+E LY L+I K 
Sbjct: 46   SGDDKLARLKHKDWLAPNEVLKIFENVKDPSFLMPAYQHYSKRKDYQPTESLYALLINKF 105

Query: 1978 SHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWP 1799
              AK FD I++++S I+ E+ CR S++FFY++++IYGN+AG I++A+E LF MP++ CWP
Sbjct: 106  GQAKMFDEIEEVMSTIKLEKRCRFSEDFFYNLMRIYGNLAGRINRAIEILFGMPDFGCWP 165

Query: 1798 TTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLL 1619
            + K+FN++LN+LVSAK FD IH++++ AP+LGVEID CCLNILIKGLCE   L++A  LL
Sbjct: 166  SAKSFNFILNLLVSAKLFDEIHKIFVSAPKLGVEIDACCLNILIKGLCESGNLEAALQLL 225

Query: 1618 DELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQ 1439
            DE PKQ  +PNV T+S ++   C +G+ EEAFKL ERME+E  +PDTI FN LISGL K+
Sbjct: 226  DEFPKQKSRPNVMTFSPLIRGFCNKGKFEEAFKLLERMEKERIEPDTITFNILISGLRKK 285

Query: 1438 GRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSY 1259
            GRV +G+DL +RM LKGC PN G+YQ ++YGLLD K   EAK  M +MIS G+ PSF SY
Sbjct: 286  GRVEEGIDLLERMKLKGCEPNPGTYQEVLYGLLDKKRNLEAKEMMSQMISWGMRPSFLSY 345

Query: 1258 KLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRK 1148
            K ++ GLC+   + ++D VL++MV  GFVPK GMW K
Sbjct: 346  KKMVLGLCETKSVAEMDWVLRKMVNHGFVPKTGMWWK 382



 Score = 81.3 bits (199), Expect = 2e-12
 Identities = 61/262 (23%), Positives = 118/262 (45%), Gaps = 14/262 (5%)
 Frame = -1

Query: 1885 VIKIYGNVAGHIDKAVETLFSMPEYYCW-------PTTKTFNYVLNMLVSAKQFDVIHEV 1727
            V+KI+ NV        +  F MP Y  +       PT   +  ++N    AK FD I EV
Sbjct: 65   VLKIFENVK-------DPSFLMPAYQHYSKRKDYQPTESLYALLINKFGQAKMFDEIEEV 117

Query: 1726 YLGAPRLGVEIDTCC-------LNILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYST 1568
                    ++++  C        N++        +++ A  +L  +P  G  P+ ++++ 
Sbjct: 118  MST-----IKLEKRCRFSEDFFYNLMRIYGNLAGRINRAIEILFGMPDFGCWPSAKSFNF 172

Query: 1567 IMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKG 1388
            I++ L      +E  K++    + G + D    N LI GLC+ G +   + L D    + 
Sbjct: 173  ILNLLVSAKLFDEIHKIFVSAPKLGVEIDACCLNILIKGLCESGNLEAALQLLDEFPKQK 232

Query: 1387 CSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGLCDENLLDDVD 1208
              PN  ++  ++ G  +   F EA   ++RM  + + P   ++ +LI GL  +  +++  
Sbjct: 233  SRPNVMTFSPLIRGFCNKGKFEEAFKLLERMEKERIEPDTITFNILISGLRKKGRVEEGI 292

Query: 1207 SVLKRMVQQGFVPKMGMWRKIL 1142
             +L+RM  +G  P  G ++++L
Sbjct: 293  DLLERMKLKGCEPNPGTYQEVL 314



 Score = 71.2 bits (173), Expect = 2e-09
 Identities = 54/241 (22%), Positives = 112/241 (46%)
 Frame = -1

Query: 2017 PSEPLYTLVIQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAV 1838
            PS   +  ++  L  AK FD I  I   +   +     D    +++      +G+++ A+
Sbjct: 165  PSAKSFNFILNLLVSAKLFDEIHKIF--VSAPKLGVEIDACCLNILIKGLCESGNLEAAL 222

Query: 1837 ETLFSMPEYYCWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGL 1658
            + L   P+    P   TF+ ++    +  +F+   ++     +  +E DT   NILI GL
Sbjct: 223  QLLDEFPKQKSRPNVMTFSPLIRGFCNKGKFEEAFKLLERMEKERIEPDTITFNILISGL 282

Query: 1657 CECDKLDSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDT 1478
             +  +++    LL+ +  +G +PN  TY  +++ L ++ R  EA ++  +M   G  P  
Sbjct: 283  RKKGRVEEGIDLLERMKLKGCEPNPGTYQEVLYGLLDKKRNLEAKEMMSQMISWGMRPSF 342

Query: 1477 IIFNTLISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDR 1298
            + +  ++ GLC+   VA+   +  +M   G  P +G +   +  ++   N ++A   +DR
Sbjct: 343  LSYKKMVLGLCETKSVAEMDWVLRKMVNHGFVPKTGMWWKAVCCVVSKNNDSQAN--LDR 400

Query: 1297 M 1295
            +
Sbjct: 401  I 401


>ref|XP_004976824.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Setaria italica]
          Length = 363

 Score =  412 bits (1060), Expect = e-112
 Identities = 192/335 (57%), Positives = 257/335 (76%)
 Frame = -1

Query: 2137 RLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFD 1958
            RL HKDWLAPNEVLKIF  ++DPG + + F K   R+DY+PSE LY+L+I KL+ A++F 
Sbjct: 29   RLDHKDWLAPNEVLKIFANIRDPGLITSVFNKACNRRDYKPSEALYSLMIDKLACARRFS 88

Query: 1957 AIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFNY 1778
             ++++L++ R E+  R SDEFFY +IK+YGNVA H  KA++TLF+MP Y CWP+TKTFNY
Sbjct: 89   DVEELLAKARAEKF-RFSDEFFYRLIKMYGNVAEHPQKAIDTLFAMPGYNCWPSTKTFNY 147

Query: 1777 VLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQG 1598
            VL+MLV  +Q++V+HE+Y  APRLGV +DTC  NIL+KGLC+  K+D A SLL E+PKQG
Sbjct: 148  VLHMLVCKRQYEVVHEIYSSAPRLGVTLDTCSFNILVKGLCQFGKIDEAMSLLHEMPKQG 207

Query: 1597 LKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGM 1418
             +PNV TYST+MHFLC+R +V++AF+L+ERM+++    DT+++N LISGLCK+ RV +  
Sbjct: 208  CQPNVTTYSTLMHFLCQRCQVDKAFELFERMQKQDIAADTVVYNILISGLCKEERVTEAF 267

Query: 1417 DLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGL 1238
             LF  M  +GC PNSG+YQ ++ GL+ S  F EAK  +  M ++ V PSFQSYKLLI GL
Sbjct: 268  GLFKSMTSEGCYPNSGTYQVLLDGLISSGKFGEAKNLISVMSTESVRPSFQSYKLLIDGL 327

Query: 1237 CDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            C ++ LDD   VLK+MV QGFVP+MG WRK+L +M
Sbjct: 328  CSDDCLDDAHLVLKQMVGQGFVPRMGTWRKLLTSM 362


>ref|XP_002447107.1| hypothetical protein SORBIDRAFT_06g028710 [Sorghum bicolor]
            gi|241938290|gb|EES11435.1| hypothetical protein
            SORBIDRAFT_06g028710 [Sorghum bicolor]
          Length = 363

 Score =  412 bits (1059), Expect = e-112
 Identities = 191/335 (57%), Positives = 255/335 (76%)
 Frame = -1

Query: 2137 RLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKKFD 1958
            RL HKDWLAPNEVLKIF  ++DP  + + F+K   R DY+PSE LY+L+I KL+ A++F 
Sbjct: 29   RLDHKDWLAPNEVLKIFANIRDPSLINSVFKKACSRIDYKPSEALYSLMIDKLAFARRFS 88

Query: 1957 AIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTFNY 1778
             ++++LS+ +TE+  R SDEFFY +IK+YGNVA H  KA++TLF+MP Y CWP+TKTFNY
Sbjct: 89   DVEELLSKAKTEK-LRFSDEFFYRLIKMYGNVAEHPQKAIDTLFAMPGYNCWPSTKTFNY 147

Query: 1777 VLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPKQG 1598
            VL+MLV  +Q++V+HE+Y  APRLGV +DTC  NIL+KGLC+C K D A SLL E+PKQG
Sbjct: 148  VLHMLVCKRQYEVVHEIYSSAPRLGVTLDTCSFNILVKGLCQCSKFDEAISLLHEMPKQG 207

Query: 1597 LKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGM 1418
             +PNV TYST MHFLC+R  V++AF+L+ERM ++    DT+++N LISGLC++ RV++  
Sbjct: 208  CQPNVATYSTFMHFLCQRSLVDKAFELFERMRKQDIAADTVVYNILISGLCREERVSEAF 267

Query: 1417 DLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGL 1238
            DLF  M  +GC PNSG+YQ ++ GL+    F EAK  +  M ++GV PSFQSYKLLI GL
Sbjct: 268  DLFKSMTSEGCYPNSGTYQVLLDGLISLGKFFEAKSLVSTMSTEGVRPSFQSYKLLIDGL 327

Query: 1237 CDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            C E+ +DD   VLK+MV QGFVP+MG W K+L ++
Sbjct: 328  CSEDCVDDAHLVLKQMVGQGFVPRMGTWTKLLTSI 362


>dbj|BAK08007.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 363

 Score =  410 bits (1055), Expect = e-111
 Identities = 190/337 (56%), Positives = 252/337 (74%)
 Frame = -1

Query: 2143 VKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKLSHAKK 1964
            + RL H+DWLAPNEVLKIF  ++D   + + F K   R+DY+PSE LY L+I +L+ A++
Sbjct: 27   IGRLDHRDWLAPNEVLKIFASIRDAALITSVFRKACARRDYKPSEALYGLMIDRLAGARR 86

Query: 1963 FDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWPTTKTF 1784
            F  ++++L+R R ER  R SD+FFY +IK+YGNVA H +KA+ETL++M EY CWP+TKTF
Sbjct: 87   FGDVEELLARARAERF-RFSDDFFYRLIKMYGNVANHPEKAMETLYAMSEYGCWPSTKTF 145

Query: 1783 NYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLLDELPK 1604
            NYVL+MLV  +Q++V+HEVY  APRLGVE+DTCC NILIKGLC+  + + A SLLDE+PK
Sbjct: 146  NYVLHMLVCRRQYEVVHEVYSSAPRLGVELDTCCFNILIKGLCQFGRFNEALSLLDEMPK 205

Query: 1603 QGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVAD 1424
            Q  +PN  TYST+MHFLC   RV+EAF+L+ERM +E  D DT+++N L+SGLC++GRV  
Sbjct: 206  QDCRPNAMTYSTLMHFLCRNCRVDEAFELFERMRKEEIDADTVVYNILVSGLCREGRVTS 265

Query: 1423 GMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIH 1244
              DLF  M  +GC PNSG+YQ ++ GL+ SKNF EAK  +  M ++ + PSF SYKLLI 
Sbjct: 266  AYDLFKSMSSQGCHPNSGTYQVLLDGLVASKNFVEAKDLVGMMSAESLRPSFSSYKLLID 325

Query: 1243 GLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILETM 1133
            G C  N LDD   VLK+MV QGFVP+M  W K+L ++
Sbjct: 326  GFCSVNCLDDAHHVLKQMVDQGFVPRMSTWTKLLTSL 362


>ref|XP_004301959.1| PREDICTED: pentatricopeptide repeat-containing protein At3g14580,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 399

 Score =  410 bits (1053), Expect = e-111
 Identities = 205/378 (54%), Positives = 274/378 (72%), Gaps = 10/378 (2%)
 Frame = -1

Query: 2209 HVMLMD*RMVSRLN-----STVSSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFE 2045
            H +L      S LN     S+ SS    K+L HKDWL+P EVL++F  L+DP S+L A  
Sbjct: 13   HPVLQSLTQYSTLNQNPPSSSSSSPQIPKKLHHKDWLSPTEVLQVFTSLQDPTSLLPALH 72

Query: 2044 KVSQRKDYQPSEPLYTLVIQKLSHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGN 1865
              S RKDY+P+E LYTL+I KLS A  F+AI ++++RI++ER CR+SD+FF  VIK YGN
Sbjct: 73   HYSTRKDYKPTEALYTLIINKLSQAHLFEAIDNVMNRIKSERKCRLSDDFFRGVIKNYGN 132

Query: 1864 VAGHIDKAVETLFSMPEYY-CWPTTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDT 1688
            V G+I+KA++TLF MP  + CWP+ KTFN VL++LVS K FDV+HEVYL + +LGVE+D 
Sbjct: 133  VGGYINKAMQTLFDMPGGFGCWPSVKTFNLVLHILVSTKMFDVVHEVYLMSAKLGVEVDA 192

Query: 1687 CCLNILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYER 1508
            C LNI++KGLCE  K+D A  +LDE P Q  +PN  T+ST+MH LC  G+V+EAF L  R
Sbjct: 193  CSLNIIVKGLCESGKVDGALQVLDEFPHQKCEPNALTFSTLMHGLCVIGKVDEAFGLLRR 252

Query: 1507 MEREGCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKN 1328
            ME EG DPD++ FN LI+GL +Q R  +G++L ++M LKGC+PN  SYQ ++Y LLD++ 
Sbjct: 253  MENEGIDPDSVTFNILIAGLRRQKRYDEGIELLEQMKLKGCAPNPASYQEVLYCLLDAQR 312

Query: 1327 FAEAKCFMDRMISKGVFPSFQSYKLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRK 1148
            F EAK FM RM+SK V PSF SYK LI GLC EN ++++D VL++M +QGFVPKMGMWR+
Sbjct: 313  FVEAKEFMIRMVSKRVGPSFVSYKQLIQGLCKENKVEELDWVLRQMTRQGFVPKMGMWRQ 372

Query: 1147 ILETMGCQES----CKSC 1106
            I+ ++  ++S    C SC
Sbjct: 373  IIRSVFPEKSNNHHCVSC 390


>ref|XP_006299283.1| hypothetical protein CARUB_v10015437mg [Capsella rubella]
            gi|482567992|gb|EOA32181.1| hypothetical protein
            CARUB_v10015437mg [Capsella rubella]
          Length = 408

 Score =  407 bits (1045), Expect = e-110
 Identities = 187/338 (55%), Positives = 255/338 (75%)
 Frame = -1

Query: 2158 SSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKL 1979
            S  D + RL HKDWLAPNEVLKIF  +K+P  ++ A++  S+RKDYQP+EPLY L+I K 
Sbjct: 46   SGEDKLARLKHKDWLAPNEVLKIFENVKEPSFLIPAYQHYSKRKDYQPTEPLYALLINKF 105

Query: 1978 SHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWP 1799
              AK FD I++++  ++ E+ CR S+EFFY++++IYGN+ G I++A+E LFSMP++ CWP
Sbjct: 106  GQAKMFDEIEEVMRIVKLEKRCRFSEEFFYNLMRIYGNLGGRINRAIEILFSMPDFGCWP 165

Query: 1798 TTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLL 1619
            + K+FN++LN+LVSAK FD IH++++ AP+LGVEID CC+NILIKGLCE   L++A  LL
Sbjct: 166  SPKSFNFILNLLVSAKLFDEIHKIFVSAPKLGVEIDACCMNILIKGLCESGNLEAALQLL 225

Query: 1618 DELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQ 1439
            DE PKQ  +PNV T+S ++   C +G+  EAF+L ERME+E  +PDTI FN LISGL K+
Sbjct: 226  DEFPKQKSRPNVMTFSPLIRGFCNKGKFAEAFELLERMEKERIEPDTITFNILISGLRKK 285

Query: 1438 GRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSY 1259
            GRV +G+ L +RM LKGC PN G+YQ ++YGLLD K   EAK  M +MIS G+ PSF SY
Sbjct: 286  GRVEEGIQLLERMKLKGCQPNPGTYQEVLYGLLDKKRNLEAKEMMSQMISWGMRPSFLSY 345

Query: 1258 KLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKI 1145
            K ++ GLC+   + ++D VL++MV  GFVPK GMW K+
Sbjct: 346  KKMVLGLCETKSVAELDWVLRQMVNHGFVPKTGMWWKV 383



 Score = 81.3 bits (199), Expect = 2e-12
 Identities = 64/262 (24%), Positives = 121/262 (46%), Gaps = 14/262 (5%)
 Frame = -1

Query: 1885 VIKIYGNVAGHIDKAVETLFSMPEYYCW-------PTTKTFNYVLNMLVSAKQFDVIHEV 1727
            V+KI+ NV        E  F +P Y  +       PT   +  ++N    AK FD I EV
Sbjct: 65   VLKIFENVK-------EPSFLIPAYQHYSKRKDYQPTEPLYALLINKFGQAKMFDEIEEV 117

Query: 1726 YLGAPRLGVEIDTCC-------LNILIKGLCECDKLDSAFSLLDELPKQGLKPNVRTYST 1568
                 R+ V+++  C        N++        +++ A  +L  +P  G  P+ ++++ 
Sbjct: 118  M----RI-VKLEKRCRFSEEFFYNLMRIYGNLGGRINRAIEILFSMPDFGCWPSPKSFNF 172

Query: 1567 IMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQGRVADGMDLFDRMGLKG 1388
            I++ L      +E  K++    + G + D    N LI GLC+ G +   + L D    + 
Sbjct: 173  ILNLLVSAKLFDEIHKIFVSAPKLGVEIDACCMNILIKGLCESGNLEAALQLLDEFPKQK 232

Query: 1387 CSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSYKLLIHGLCDENLLDDVD 1208
              PN  ++  ++ G  +   FAEA   ++RM  + + P   ++ +LI GL  +  +++  
Sbjct: 233  SRPNVMTFSPLIRGFCNKGKFAEAFELLERMEKERIEPDTITFNILISGLRKKGRVEEGI 292

Query: 1207 SVLKRMVQQGFVPKMGMWRKIL 1142
             +L+RM  +G  P  G ++++L
Sbjct: 293  QLLERMKLKGCQPNPGTYQEVL 314


>ref|NP_188076.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75274210|sp|Q9LUD6.1|PP230_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g14580, mitochondrial; Flags: Precursor
            gi|9294380|dbj|BAB02390.1| unnamed protein product
            [Arabidopsis thaliana] gi|119935972|gb|ABM06049.1|
            At3g14580 [Arabidopsis thaliana]
            gi|332642020|gb|AEE75541.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 405

 Score =  406 bits (1044), Expect = e-110
 Identities = 189/340 (55%), Positives = 258/340 (75%)
 Frame = -1

Query: 2158 SSFDPVKRLSHKDWLAPNEVLKIFRCLKDPGSVLNAFEKVSQRKDYQPSEPLYTLVIQKL 1979
            S  D + RL HKDWLAPNEVLKIF  +KDP  +L A++  S+RKDYQP+E LY L+I K 
Sbjct: 46   SGDDRLARLRHKDWLAPNEVLKIFDNVKDPSFLLPAYQHYSKRKDYQPTESLYALMINKF 105

Query: 1978 SHAKKFDAIQDILSRIRTERSCRISDEFFYSVIKIYGNVAGHIDKAVETLFSMPEYYCWP 1799
              AK +D I++++  I+ E+ CR S+EFFY++++IYGN+AG I++A+E LF MP++ CWP
Sbjct: 106  GQAKMYDEIEEVMRTIKLEKRCRFSEEFFYNLMRIYGNLAGRINRAIEILFGMPDFGCWP 165

Query: 1798 TTKTFNYVLNMLVSAKQFDVIHEVYLGAPRLGVEIDTCCLNILIKGLCECDKLDSAFSLL 1619
            ++K+FN++LN+LVSAK FD IH++++ AP+LGVEID CCLNILIKGLCE   L++A  LL
Sbjct: 166  SSKSFNFILNLLVSAKLFDEIHKIFVSAPKLGVEIDACCLNILIKGLCESGNLEAALQLL 225

Query: 1618 DELPKQGLKPNVRTYSTIMHFLCERGRVEEAFKLYERMEREGCDPDTIIFNTLISGLCKQ 1439
            DE P+Q  +PNV T+S ++   C +G+ EEAFKL ERME+E  +PDTI FN LISGL K+
Sbjct: 226  DEFPQQKSRPNVMTFSPLIRGFCNKGKFEEAFKLLERMEKERIEPDTITFNILISGLRKK 285

Query: 1438 GRVADGMDLFDRMGLKGCSPNSGSYQAIMYGLLDSKNFAEAKCFMDRMISKGVFPSFQSY 1259
            GRV +G+DL +RM +KGC PN G+YQ ++YGLLD K   EAK  M +MIS G+ PSF SY
Sbjct: 286  GRVEEGIDLLERMKVKGCEPNPGTYQEVLYGLLDKKRNLEAKEMMSQMISWGMRPSFLSY 345

Query: 1258 KLLIHGLCDENLLDDVDSVLKRMVQQGFVPKMGMWRKILE 1139
            K ++ GLC+   + ++D VL++MV  GFVPK  MW K+++
Sbjct: 346  KKMVLGLCETKSVVEMDWVLRQMVNHGFVPKTLMWWKVVQ 385

