BLASTX nr result

ID: Mentha26_contig00020670 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00020670
         (1007 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU31068.1| hypothetical protein MIMGU_mgv1a024034mg [Mimulus...   451   e-124
gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlise...   410   e-112
ref|XP_004169853.1| PREDICTED: pentatricopeptide repeat-containi...   404   e-110
ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containi...   404   e-110
ref|XP_006354698.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_002306340.2| pentatricopeptide repeat-containing family p...   401   e-109
ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containi...   400   e-109
ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containi...   400   e-109
ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containi...   400   e-109
ref|XP_007159746.1| hypothetical protein PHAVU_002G263600g [Phas...   399   e-108
ref|XP_002532046.1| pentatricopeptide repeat-containing protein,...   395   e-107
ref|XP_007212021.1| hypothetical protein PRUPE_ppa004899mg [Prun...   395   e-107
gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis]     394   e-107
ref|XP_007048085.1| Tetratricopeptide repeat (TPR)-like superfam...   392   e-106
ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containi...   391   e-106
ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citr...   390   e-106
ref|XP_003630096.1| Pentatricopeptide repeat-containing protein ...   386   e-105
ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containi...   385   e-104
ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutr...   383   e-104

>gb|EYU31068.1| hypothetical protein MIMGU_mgv1a024034mg [Mimulus guttatus]
          Length = 496

 Score =  451 bits (1159), Expect = e-124
 Identities = 222/303 (73%), Positives = 254/303 (83%), Gaps = 1/303 (0%)
 Frame = +3

Query: 102  FASTSLQLTSAPKLSYIYTPIHSSIYRSYPISTIRMCG-ISALXXXXXXXXXXXXXXXPS 278
            F S+SL L   PK +YI TP  +      P STIRMC  +SA                 S
Sbjct: 8    FTSSSLHLPHTPKFTYISTPFRALTRHPCPPSTIRMCSWVSARPERNPGPRKTHKRSTSS 67

Query: 279  PSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRW 458
            PSSEAE+LV+ I+RN SDKQPL+NTL+K+V+F+RT+HCFLLFEELG++E+WLQCLEVFRW
Sbjct: 68   PSSEAEDLVKLIMRNFSDKQPLVNTLDKYVKFVRTDHCFLLFEELGKTEKWLQCLEVFRW 127

Query: 459  MQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSR 638
            MQKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYN+LITAHLHSR
Sbjct: 128  MQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNSLITAHLHSR 187

Query: 639  DKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPD 818
            DK+KAL+K+L YFEKMK MERCKPS+VTYNILLRAFAQ+KNV+QVN LF DL+ESI++PD
Sbjct: 188  DKSKALAKSLGYFEKMKSMERCKPSIVTYNILLRAFAQAKNVDQVNTLFKDLEESIVTPD 247

Query: 819  AFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLA 998
             FTFNGVMDAYGK G+IREME VLAKMKS +IKPD ITFN LIDAYG+KQEFEKMEQV  
Sbjct: 248  IFTFNGVMDAYGKNGMIREMEFVLAKMKSNQIKPDTITFNSLIDAYGKKQEFEKMEQVFT 307

Query: 999  SLL 1007
            SLL
Sbjct: 308  SLL 310



 Score = 61.2 bits (147), Expect = 6e-07
 Identities = 47/175 (26%), Positives = 82/175 (46%), Gaps = 1/175 (0%)
 Frame = +3

Query: 486  DNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKA 665
            D   ++ LI   GKK +      +F+ +  S  KP    +N++IT +     KA+   K 
Sbjct: 282  DTITFNSLIDAYGKKQEFEKMEQVFTSLLRSKEKPTLPTFNSMITNY----GKARLREKG 337

Query: 666  LSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMD 845
               F++M  M   KPS VTY  L+  +     V +   +F+++ +S       T N ++D
Sbjct: 338  DLVFKRMTEMGY-KPSFVTYECLITMYGYCDRVSRAREVFDEMVDSENEKKISTLNAMLD 396

Query: 846  AYGKVGLIREMELVLAKMKSKKI-KPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
            AY   GL  E + +    +S ++ + D  T+ LL  AY +      M+++L  L+
Sbjct: 397  AYCMNGLPMEADALFDNARSSRMFRVDSSTYKLLYKAYTKAD----MKELLGKLV 447


>gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlisea aurea]
          Length = 415

 Score =  410 bits (1053), Expect = e-112
 Identities = 195/244 (79%), Positives = 225/244 (92%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            S SSEAE+LVRS++RN +D QPL +TLNK+V+ LRT HCFL+FEELG+S+RWLQCLEVFR
Sbjct: 4    SSSSEAEDLVRSVMRNFTDSQPLTSTLNKYVKLLRTAHCFLIFEELGKSDRWLQCLEVFR 63

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWYVADNGVYSKLISVMGK+G++RMAMWLFSEMRNSGC+PDTSVYN+LI+AHLHS
Sbjct: 64   WMQKQRWYVADNGVYSKLISVMGKQGKTRMAMWLFSEMRNSGCRPDTSVYNSLISAHLHS 123

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDK KAL K L YFEKMKG+ERC+P+VVTYNILLRAFAQ+KN+EQVN LF +LD SIISP
Sbjct: 124  RDKTKALDKVLWYFEKMKGIERCQPNVVTYNILLRAFAQAKNIEQVNALFKELDGSIISP 183

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D  T+NGVMDAYGK G+IREMELVL+KMKS +IKPD+ITFNLLIDAYGR+QEF+KMEQV 
Sbjct: 184  DVLTYNGVMDAYGKNGMIREMELVLSKMKSAQIKPDVITFNLLIDAYGRRQEFDKMEQVF 243

Query: 996  ASLL 1007
             SL+
Sbjct: 244  KSLM 247


>ref|XP_004169853.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Cucumis sativus]
          Length = 494

 Score =  404 bits (1038), Expect = e-110
 Identities = 202/302 (66%), Positives = 243/302 (80%)
 Frame = +3

Query: 102  FASTSLQLTSAPKLSYIYTPIHSSIYRSYPISTIRMCGISALXXXXXXXXXXXXXXXPSP 281
            F+ T     S+ +LS ++ P+ SSI ++  +ST  +C                     + 
Sbjct: 9    FSPTFTPSYSSSQLSRLWLPLPSSIKKA--VSTRVVC---------ISTRPSRKFGVKTD 57

Query: 282  SSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWM 461
             SEAEELVR I+RN SDK+PLL TL+K+V+ +RTEHCFLLFEELG+ ++WL+CLEVFRWM
Sbjct: 58   RSEAEELVRGIIRNFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLECLEVFRWM 117

Query: 462  QKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRD 641
            QKQRWY+ADNGVYSKLIS+MGKKGQ RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+D
Sbjct: 118  QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 177

Query: 642  KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDA 821
            KAKAL K LSYFEKMKGMERCKP++VTYNIL RAFAQ+  V+QVN LF DLDES++S D 
Sbjct: 178  KAKALVKVLSYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADI 237

Query: 822  FTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLAS 1001
            +T+NGVMDAYGK G I+EMEL+LA+MKS +IKPDII+FNLLID+YG+KQ F+KMEQV  S
Sbjct: 238  YTYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 297

Query: 1002 LL 1007
            LL
Sbjct: 298  LL 299


>ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Cucumis sativus]
          Length = 528

 Score =  404 bits (1038), Expect = e-110
 Identities = 202/302 (66%), Positives = 243/302 (80%)
 Frame = +3

Query: 102  FASTSLQLTSAPKLSYIYTPIHSSIYRSYPISTIRMCGISALXXXXXXXXXXXXXXXPSP 281
            F+ T     S+ +LS ++ P+ SSI ++  +ST  +C                     + 
Sbjct: 9    FSPTFTPSYSSSQLSRLWLPLPSSIKKA--VSTRVVC---------ISTRPSRKFGVKTD 57

Query: 282  SSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWM 461
             SEAEELVR I+RN SDK+PLL TL+K+V+ +RTEHCFLLFEELG+ ++WL+CLEVFRWM
Sbjct: 58   RSEAEELVRGIIRNFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLECLEVFRWM 117

Query: 462  QKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRD 641
            QKQRWY+ADNGVYSKLIS+MGKKGQ RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+D
Sbjct: 118  QKQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKD 177

Query: 642  KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDA 821
            KAKAL K LSYFEKMKGMERCKP++VTYNIL RAFAQ+  V+QVN LF DLDES++S D 
Sbjct: 178  KAKALVKVLSYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADI 237

Query: 822  FTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLAS 1001
            +T+NGVMDAYGK G I+EMEL+LA+MKS +IKPDII+FNLLID+YG+KQ F+KMEQV  S
Sbjct: 238  YTYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNLLIDSYGKKQLFDKMEQVFKS 297

Query: 1002 LL 1007
            LL
Sbjct: 298  LL 299


>ref|XP_006354698.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like isoform X1 [Solanum tuberosum]
            gi|565376411|ref|XP_006354699.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like isoform X2 [Solanum tuberosum]
            gi|565376413|ref|XP_006354700.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 457

 Score =  402 bits (1032), Expect = e-109
 Identities = 187/244 (76%), Positives = 225/244 (92%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            S SSEA+ELV  ++RN SDK+PL++TL+K+V+ +RTEHCFLLFE+LG+++ WLQCLEVFR
Sbjct: 32   SSSSEAQELVTLVMRNFSDKKPLVSTLDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFR 91

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWY+ADNGVYSKLISVMGKKGQ RMAMWLFSEMRNSGC+PDTSVYNA+I+AHLHS
Sbjct: 92   WMQKQRWYIADNGVYSKLISVMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHS 151

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDK+KAL+KA+ YFEKMKGMERC PS+VTYNILLRAFAQ+KNVEQV+ L  DLDESI++P
Sbjct: 152  RDKSKALTKAMGYFEKMKGMERCSPSIVTYNILLRAFAQAKNVEQVDALLKDLDESIVTP 211

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D FTFNG+MDAYGK G+I EME +L++MKS ++KPDIITFN+LID+YG+KQ+F+KMEQV 
Sbjct: 212  DIFTFNGLMDAYGKNGMINEMEHILSRMKSNQLKPDIITFNILIDSYGKKQDFQKMEQVF 271

Query: 996  ASLL 1007
             SLL
Sbjct: 272  KSLL 275



 Score = 64.3 bits (155), Expect = 7e-08
 Identities = 50/170 (29%), Positives = 80/170 (47%), Gaps = 1/170 (0%)
 Frame = +3

Query: 498  YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
            ++ LI   GKK   +    +F  +  S  KP    +N++IT +     KA+   K+    
Sbjct: 251  FNILIDSYGKKQDFQKMEQVFKSLLQSKEKPTIPTFNSMITNY----GKARLREKSELVL 306

Query: 678  EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGK 857
            EKM  +   KPS +TY  L+  +     V +   LF+ + ES     A T N ++DAY  
Sbjct: 307  EKMIDLGY-KPSYITYECLIVMYGHCDCVAKARELFDRVIESETEKKASTLNSMLDAYCM 365

Query: 858  VGLIREMELVLAKMKSKKIKP-DIITFNLLIDAYGRKQEFEKMEQVLASL 1004
             GL  E  L+   + S K+ P D  T+ LL  AY +    E ++++L  +
Sbjct: 366  NGLPMEAHLLFESIHSAKVFPIDSSTYKLLYKAYTKADMKELVQKLLTCM 415


>ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like isoform X1 [Cicer arietinum]
            gi|502140047|ref|XP_004504033.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like isoform X2 [Cicer arietinum]
          Length = 477

 Score =  402 bits (1032), Expect = e-109
 Identities = 191/284 (67%), Positives = 237/284 (83%)
 Frame = +3

Query: 156  TPIHSSIYRSYPISTIRMCGISALXXXXXXXXXXXXXXXPSPSSEAEELVRSIVRNISDK 335
            TP   S Y S+P S++ +  +  +                S  SE +ELVR + R IS+K
Sbjct: 8    TPPSPSYYYSFPTSSVNLPRVIRISCGSNPTRLNRKKIT-SERSETQELVRLLTRKISEK 66

Query: 336  QPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQKQRWYVADNGVYSKLIS 515
            +PL+ TLNK+V+ +RTEHCFLLFEELG+ ++WLQCLEVFRWMQ+QRWY+ADNGVYSKLIS
Sbjct: 67   EPLVTTLNKYVKLVRTEHCFLLFEELGKHDKWLQCLEVFRWMQRQRWYIADNGVYSKLIS 126

Query: 516  VMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYFEKMKGM 695
            VMGKKGQ R+AMWLFSEMRN+GC+PDTSVYNALI+AHLH+R+K+ AL+KAL YFEKMKG+
Sbjct: 127  VMGKKGQIRLAMWLFSEMRNTGCRPDTSVYNALISAHLHTRNKSNALAKALGYFEKMKGI 186

Query: 696  ERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIRE 875
            ERCKP++VTYNILLRAFAQS+NV+QVN LF DLD+S++SPD +TFNGVMDAYGK G+IRE
Sbjct: 187  ERCKPNIVTYNILLRAFAQSRNVDQVNSLFKDLDDSVVSPDIYTFNGVMDAYGKNGMIRE 246

Query: 876  MELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
            ME VLA+MKS ++KPD+IT+NLLID+YG+KQ+F+KMEQV  SLL
Sbjct: 247  METVLARMKSNQVKPDLITYNLLIDSYGKKQQFDKMEQVFKSLL 290



 Score = 65.5 bits (158), Expect = 3e-08
 Identities = 49/166 (29%), Positives = 78/166 (46%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
           Y+ LI   GKK Q      +F  +  S  KP    +N++I  +     KA+   KA + F
Sbjct: 266 YNLLIDSYGKKQQFDKMEQVFKSLLRSKEKPSLPTFNSMILNY----GKARLKDKAENVF 321

Query: 678 EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGK 857
           + M  M    PS VT+  L+  +     V +   LF+ L ES +     T N ++D Y  
Sbjct: 322 QNMTDMGYT-PSFVTHESLIYMYGFCDCVSKAVELFDGLIESKVPMKVSTLNAMLDVYCI 380

Query: 858 VGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            GL +E + + A+ +   I PD  T+ LL  AY +    E ++++L
Sbjct: 381 NGLPQEADSLFARARRVNIFPDASTYKLLYKAYTKANSKELLDKLL 426


>ref|XP_002306340.2| pentatricopeptide repeat-containing family protein, partial [Populus
            trichocarpa] gi|550338395|gb|EEE93336.2|
            pentatricopeptide repeat-containing family protein,
            partial [Populus trichocarpa]
          Length = 414

 Score =  401 bits (1030), Expect = e-109
 Identities = 188/241 (78%), Positives = 224/241 (92%)
 Frame = +3

Query: 285  SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 464
            SEA+ELVR +VR+ SDKQPL+ TLNK+V+ +RTEHCF+LFEELG++++WLQCLEVFRWMQ
Sbjct: 26   SEAQELVRVLVRSFSDKQPLVKTLNKYVKVMRTEHCFMLFEELGKTDKWLQCLEVFRWMQ 85

Query: 465  KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 644
            KQRWYVADNG YSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+DK
Sbjct: 86   KQRWYVADNGCYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKDK 145

Query: 645  AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 824
            AK+L+KAL+YFEKMK +ERC+P+VVTYNI+LRAFAQ++NV QVN LF DL+ESI+SPD +
Sbjct: 146  AKSLTKALAYFEKMKSIERCQPNVVTYNIILRAFAQARNVNQVNALFKDLEESIVSPDIY 205

Query: 825  TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASL 1004
            T+NGV+DAYGK G+IREME VL++MK  + KPDIITFNLLID+YG+KQ+FEKMEQV  SL
Sbjct: 206  TYNGVLDAYGKNGMIREMESVLSRMKIDQCKPDIITFNLLIDSYGKKQDFEKMEQVFKSL 265

Query: 1005 L 1007
            L
Sbjct: 266  L 266


>ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Solanum lycopersicum]
          Length = 478

 Score =  400 bits (1029), Expect = e-109
 Identities = 188/244 (77%), Positives = 224/244 (91%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            S SSEA+ELV  ++RN SDK+PL++TL+K+V+ +RTEHCFLLFE+LG+++ WLQCLEVFR
Sbjct: 53   SSSSEAQELVTLVMRNFSDKKPLVSTLDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFR 112

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWY+ADNGVYSKLISVMGKKGQ RMAMWLFSEMRNSGC+PDTSVYNA+I+AHLHS
Sbjct: 113  WMQKQRWYIADNGVYSKLISVMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHS 172

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDK+KAL+KA+ YFEKMK MERC PS+VTYNILLRAFAQ+KNVEQV+ L  DLDESI++P
Sbjct: 173  RDKSKALTKAMGYFEKMKEMERCSPSIVTYNILLRAFAQAKNVEQVDALLKDLDESIVTP 232

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D FTFNG+MDAYGK G+I EME VL++MKS K+KPDIITFN+LID+YG+KQ+F+KMEQV 
Sbjct: 233  DIFTFNGLMDAYGKNGMINEMEHVLSRMKSNKLKPDIITFNILIDSYGKKQDFQKMEQVF 292

Query: 996  ASLL 1007
             SLL
Sbjct: 293  KSLL 296



 Score = 62.0 bits (149), Expect = 4e-07
 Identities = 50/167 (29%), Positives = 78/167 (46%), Gaps = 1/167 (0%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
           ++ LI   GKK   +    +F  +  S  KP    +N++IT +     KA+   K+    
Sbjct: 272 FNILIDSYGKKQDFQKMEQVFKSLLQSKEKPTIPTFNSMITNY----GKARLREKSELVL 327

Query: 678 EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGK 857
           EKM  +   KPS +TY  L+  +     V +   LF+ + ES     A T N ++DAY  
Sbjct: 328 EKMIDLGY-KPSYITYECLIVMYGHCDCVSKARELFDRVMESEKEKKASTLNSMLDAYCM 386

Query: 858 VGLIREMELVLAKMKSKKIKP-DIITFNLLIDAYGRKQEFEKMEQVL 995
            GL  E  L+   + S K  P D  T+ LL  AY +    E ++++L
Sbjct: 387 NGLPMEAHLLFESIHSAKAFPIDSSTYKLLYKAYTKADMKELVQKLL 433


>ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Glycine max]
          Length = 503

 Score =  400 bits (1027), Expect = e-109
 Identities = 191/243 (78%), Positives = 221/243 (90%), Gaps = 1/243 (0%)
 Frame = +3

Query: 282  SSEAEELVRSIVRNIS-DKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRW 458
            +SEA+ELVR +   IS DK+PLL TLNK+V+ +RT+HCFLLFEEL + + WLQCLEVFRW
Sbjct: 39   NSEAQELVRLLTSKISNDKEPLLKTLNKYVKQVRTQHCFLLFEELAKHDNWLQCLEVFRW 98

Query: 459  MQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSR 638
            MQKQRWY+ADNG+YSKLISVMGKKGQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHLHSR
Sbjct: 99   MQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSVYNALITAHLHSR 158

Query: 639  DKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPD 818
            DK KAL+KA+ YF+KMKGMERCKP++VTYNILLRAFAQ++NVEQVN LF DLDESI+SPD
Sbjct: 159  DKTKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSLFKDLDESIVSPD 218

Query: 819  AFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLA 998
             +TFNGVMDAYGK G+IREME VLA+MKS + KPD+ITFNLLID+YG+KQEF KMEQV  
Sbjct: 219  IYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNLLIDSYGKKQEFGKMEQVFK 278

Query: 999  SLL 1007
            SLL
Sbjct: 279  SLL 281


>ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic [Vitis vinifera]
            gi|296082481|emb|CBI21486.3| unnamed protein product
            [Vitis vinifera]
          Length = 489

 Score =  400 bits (1027), Expect = e-109
 Identities = 201/307 (65%), Positives = 242/307 (78%), Gaps = 8/307 (2%)
 Frame = +3

Query: 111  TSLQLTSAPKLSYIYTPIHSSIYRSY-------PISTIRMCGISALXXXXXXXXXXXXXX 269
            +SLQL S+  LS  +T   S  Y+         P +T+  C                   
Sbjct: 10   SSLQLYSSSTLSPTFTLPSSRFYKPTRLHLPPRPSTTVVSC----------VSTRPRRKP 59

Query: 270  XPSPS-SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLE 446
             P P  SE EELVR +++N   ++PL++TLNK+V+ +RTEHCF LFEELG++++WLQCLE
Sbjct: 60   GPKPDKSEVEELVRVLMKNFGGERPLISTLNKYVKVIRTEHCFRLFEELGKTDKWLQCLE 119

Query: 447  VFRWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH 626
            VFRWMQKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALITAH
Sbjct: 120  VFRWMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAH 179

Query: 627  LHSRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESI 806
            LHSRDK+KAL KAL YF+KMKGMERCKP++VTYNILLRAFAQ++NV Q N LF +L+ESI
Sbjct: 180  LHSRDKSKALIKALGYFDKMKGMERCKPNIVTYNILLRAFAQAQNVNQANALFKELNESI 239

Query: 807  ISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKME 986
            +SPD FTFNGVMDAYGK G+I+EME VL++MKS + KPDIITFN+LID+YGR+QEF+KME
Sbjct: 240  VSPDIFTFNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNVLIDSYGRRQEFDKME 299

Query: 987  QVLASLL 1007
            QV  SLL
Sbjct: 300  QVFKSLL 306


>ref|XP_007159746.1| hypothetical protein PHAVU_002G263600g [Phaseolus vulgaris]
            gi|561033161|gb|ESW31740.1| hypothetical protein
            PHAVU_002G263600g [Phaseolus vulgaris]
          Length = 534

 Score =  399 bits (1025), Expect = e-108
 Identities = 188/245 (76%), Positives = 221/245 (90%)
 Frame = +3

Query: 273  PSPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVF 452
            P+ ++EA ELVR + R ISDK+PLL TLNK V+ +RTEHCFLLFEELG+   WLQC+EVF
Sbjct: 40   PNHNAEARELVRLLTRKISDKEPLLKTLNKFVKQVRTEHCFLLFEELGKEGNWLQCIEVF 99

Query: 453  RWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLH 632
            RWMQKQRWY+ADNG+YSKLISVMGK+GQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHLH
Sbjct: 100  RWMQKQRWYIADNGIYSKLISVMGKRGQTRMAMWLFSEMRNAGCRPDTSVYNALITAHLH 159

Query: 633  SRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIIS 812
            SRDK KALSKA+ YF+KMKG+ERCKP++VTYNILLRAFAQ++N+EQV+ LF DLDES IS
Sbjct: 160  SRDKTKALSKAIGYFQKMKGIERCKPNIVTYNILLRAFAQARNLEQVSSLFKDLDESSIS 219

Query: 813  PDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQV 992
            PD +TFNGVMDAYGK G+IREME +LA+M+S + KPD+ITFNLLID+YG+KQEF KMEQV
Sbjct: 220  PDIYTFNGVMDAYGKNGMIREMEAILAQMRSSQYKPDLITFNLLIDSYGKKQEFGKMEQV 279

Query: 993  LASLL 1007
              SLL
Sbjct: 280  FKSLL 284



 Score = 70.5 bits (171), Expect = 1e-09
 Identities = 53/185 (28%), Positives = 88/185 (47%)
 Frame = +3

Query: 441 LEVFRWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALIT 620
           +E      +   Y  D   ++ LI   GKK +      +F  + +S  +P  S +N++I 
Sbjct: 241 MEAILAQMRSSQYKPDLITFNLLIDSYGKKQEFGKMEQVFKSLLSSKERPTLSTFNSMIL 300

Query: 621 AHLHSRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDE 800
            +     KA+  +KA   F+KM  M    PS VT+  L+  +     V     LF++L E
Sbjct: 301 NY----GKARLKNKAEDVFKKMIDMGYT-PSFVTHESLIFMYGLCDCVSSAVQLFDELVE 355

Query: 801 SIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEK 980
           S +     T N ++D Y   GL +E   +  + KS KI PD  T+ LL  AY + ++ E 
Sbjct: 356 SKVPIKVSTLNAILDVYCLNGLQQEAHSLFERAKSIKIHPDSSTYKLLYRAYTKAKQKEL 415

Query: 981 MEQVL 995
           ++++L
Sbjct: 416 LDKLL 420


>ref|XP_002532046.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223528289|gb|EEF30336.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 478

 Score =  395 bits (1016), Expect = e-107
 Identities = 189/245 (77%), Positives = 223/245 (91%), Gaps = 1/245 (0%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNIS-DKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVF 452
            S  SE E+LVR ++R+ S DK PL+ TL+K+V+ +RTEHCFLLFEELGR ++WLQCLEVF
Sbjct: 60   SEESETEDLVRYVLRSFSSDKVPLVRTLDKYVRVVRTEHCFLLFEELGRRDKWLQCLEVF 119

Query: 453  RWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLH 632
            RWMQKQRWY+AD+GVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PD+SVYNALITAHLH
Sbjct: 120  RWMQKQRWYIADSGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDSSVYNALITAHLH 179

Query: 633  SRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIIS 812
            S+DKAKAL KAL YFEKMKGM+RC+P+VVTYNILLRAFAQ++NV QVN LF DLD+SI+S
Sbjct: 180  SKDKAKALIKALGYFEKMKGMQRCQPNVVTYNILLRAFAQARNVNQVNALFKDLDQSIVS 239

Query: 813  PDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQV 992
            PD +T+NGVMDAYGK G+IREME VL++MKS + KPDIITFNLLID+YG+KQ+F+KMEQV
Sbjct: 240  PDIYTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIITFNLLIDSYGKKQDFDKMEQV 299

Query: 993  LASLL 1007
              SLL
Sbjct: 300  FKSLL 304



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 52/197 (26%), Positives = 86/197 (43%), Gaps = 31/197 (15%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 629
           Y+ ++   GK G  R    + S M+++ CKPD   +N LI ++                L
Sbjct: 245 YNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIITFNLLIDSYGKKQDFDKMEQVFKSLL 304

Query: 630 HSRD---------------KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 764
           HS++               KA+    A S  +KM  M +  P+ +TY  L+  +    +V
Sbjct: 305 HSKERPTLPTFNSMITNYGKARQKENAESVLQKMTKM-KYTPNFITYESLIMMYGFCDSV 363

Query: 765 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLL 944
            +   +F+D+ ES       T N ++D Y   GL  E +L+    ++  + PD  T+ LL
Sbjct: 364 SKAREIFDDMIESGKEVKVSTLNAMLDVYCLNGLPMEADLLFDNARNVGLLPDSTTYKLL 423

Query: 945 IDAYGRKQEFEKMEQVL 995
             AY  K   +K+ Q L
Sbjct: 424 YKAY-TKANMKKLVQKL 439


>ref|XP_007212021.1| hypothetical protein PRUPE_ppa004899mg [Prunus persica]
            gi|462407886|gb|EMJ13220.1| hypothetical protein
            PRUPE_ppa004899mg [Prunus persica]
          Length = 486

 Score =  395 bits (1015), Expect = e-107
 Identities = 188/240 (78%), Positives = 219/240 (91%)
 Frame = +3

Query: 288  EAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQK 467
            +  E+VR ++R+ SDK+PLL TLNK+V+ +RTEHCFLLFEELG+S+ WLQCLEVFRWMQK
Sbjct: 64   DVREVVRMLMRSFSDKEPLLKTLNKYVRIVRTEHCFLLFEELGKSDEWLQCLEVFRWMQK 123

Query: 468  QRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKA 647
            QRWYVADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALI+AHL+S+DKA
Sbjct: 124  QRWYVADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKA 183

Query: 648  KALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFT 827
            KAL KAL YF+KMKGMERC+P++VTYNILLRAFAQS+NVE+VN LF DLDESI SPD +T
Sbjct: 184  KALDKALRYFDKMKGMERCQPNIVTYNILLRAFAQSRNVEKVNSLFKDLDESIASPDIYT 243

Query: 828  FNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
            +NGVMDAYGK G IREME VL+ MKS + KPDIITFNLLID+YG+KQ+F+KMEQV  SL+
Sbjct: 244  YNGVMDAYGKNGNIREMESVLSHMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLV 303



 Score = 62.8 bits (151), Expect = 2e-07
 Identities = 49/197 (24%), Positives = 86/197 (43%), Gaps = 31/197 (15%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 629
           Y+ ++   GK G  R    + S M+++ CKPD   +N LI ++                +
Sbjct: 244 YNGVMDAYGKNGNIREMESVLSHMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLV 303

Query: 630 HSRDK---------------AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 764
            S++K               A+   KA   F+KM  M +  PS +TY  L+  +    +V
Sbjct: 304 RSKEKPTLPTFNSMIINYGKARLKEKAEDVFKKMIDM-KYTPSFITYESLIMMYGFCDSV 362

Query: 765 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLL 944
            +   +F+ L +S       T N ++D Y   GL  E + +     S  ++P++ T+ LL
Sbjct: 363 SKAREVFDRLADSGKELKVSTLNAMLDVYCMNGLPVEADKLFVNGNSIGVRPNVSTYKLL 422

Query: 945 IDAYGRKQEFEKMEQVL 995
             AY +    E +E++L
Sbjct: 423 YKAYTKANMKELLEKLL 439


>gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis]
          Length = 485

 Score =  394 bits (1013), Expect = e-107
 Identities = 190/242 (78%), Positives = 220/242 (90%), Gaps = 1/242 (0%)
 Frame = +3

Query: 285  SEAEELVRSIVRNI-SDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWM 461
            SEA +LVR ++R+  SDK+PL+ TLNK+V+ +RTEHCFLLFEELGRS++WLQCLEVFRWM
Sbjct: 61   SEALDLVRLLMRSFNSDKEPLVKTLNKYVKTVRTEHCFLLFEELGRSDKWLQCLEVFRWM 120

Query: 462  QKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRD 641
            QKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNS C+PDTSVYNALITAHLHS D
Sbjct: 121  QKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSSCRPDTSVYNALITAHLHSSD 180

Query: 642  KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDA 821
            K KAL KA+ YFEKMKG+ERCKP++VTYNILLRAFAQ++NV++VN LF DLD SI+SPD 
Sbjct: 181  KVKALDKAIGYFEKMKGIERCKPNIVTYNILLRAFAQARNVQRVNSLFKDLDGSIVSPDI 240

Query: 822  FTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLAS 1001
            +T+NGVMDAYGK G+IREME VL+ MKS  IKPDIITFNLLID+YG+KQEF+KMEQV  S
Sbjct: 241  YTYNGVMDAYGKNGMIREMESVLSLMKSNHIKPDIITFNLLIDSYGKKQEFDKMEQVFKS 300

Query: 1002 LL 1007
            LL
Sbjct: 301  LL 302



 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 49/170 (28%), Positives = 78/170 (45%)
 Frame = +3

Query: 498  YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
            ++ LI   GKK +      +F  +  S  +P    +N++I  +     KA+ L KA + F
Sbjct: 278  FNLLIDSYGKKQEFDKMEQVFKSLLRSKERPTLPTFNSMIINY----GKARRLDKAENVF 333

Query: 678  EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGK 857
            EKM  M    PS +TY  L+  +     V +   +FN L ES       T N ++D Y  
Sbjct: 334  EKMTDMGYT-PSFITYESLIMMYGYCDCVSRAQDIFNRLVESGKDIKVSTLNAMLDVYCM 392

Query: 858  VGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
             GL  E   +    K+  + P+  T+ LL  AY +      M+++L +LL
Sbjct: 393  NGLPMEAHKLFEDSKNIGVVPNSSTYKLLYKAYTK----ANMKELLGNLL 438


>ref|XP_007048085.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao] gi|590707730|ref|XP_007048086.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao] gi|508700346|gb|EOX92242.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao] gi|508700347|gb|EOX92243.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 488

 Score =  392 bits (1006), Expect = e-106
 Identities = 182/240 (75%), Positives = 219/240 (91%)
 Frame = +3

Query: 288  EAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQK 467
            EA ELVR ++R+ SDK+PL+ TLN++V+ +R EHCFLLFEELG++++WLQCLEVFRWMQK
Sbjct: 61   EALELVRVLMRSFSDKEPLVKTLNRYVRVVRCEHCFLLFEELGKTDKWLQCLEVFRWMQK 120

Query: 468  QRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKA 647
            QRWY+ADNG+YSKLI+VMGKKGQ+RMAMWLFSEMRNSGC+PD SVYNALITAHLHSRDK+
Sbjct: 121  QRWYIADNGIYSKLITVMGKKGQTRMAMWLFSEMRNSGCRPDVSVYNALITAHLHSRDKS 180

Query: 648  KALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFT 827
            KAL KA+ YF KMKGMERCKP++VTYNILLRAF+Q++NV+QVN LF DL ESII+PD +T
Sbjct: 181  KALDKAMGYFNKMKGMERCKPNIVTYNILLRAFSQARNVDQVNALFKDLAESIIAPDIYT 240

Query: 828  FNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
            +NGVMDAYGK G+IREME VL++MKS + KPD ITFN+LID+YG+KQEF+KMEQV  SLL
Sbjct: 241  YNGVMDAYGKNGMIREMESVLSRMKSNQCKPDTITFNVLIDSYGKKQEFDKMEQVFKSLL 300


>ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Glycine max]
          Length = 526

 Score =  391 bits (1005), Expect = e-106
 Identities = 188/244 (77%), Positives = 219/244 (89%), Gaps = 2/244 (0%)
 Frame = +3

Query: 282  SSEAEELVRSIVRNI--SDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            +SEA+ELVR +   I  +DK+ LL TLNK+V+ +RT+HCFLLFEELG+ + WLQCLEVFR
Sbjct: 47   NSEAQELVRLLTSKIRSNDKEVLLKTLNKYVKQVRTQHCFLLFEELGKHDNWLQCLEVFR 106

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWY+ADNG+YSKLISVMGKKGQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHL S
Sbjct: 107  WMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSVYNALITAHLRS 166

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDK KAL+KA+ YF+KMKGMERCKP++VTYNILLRAFAQ++NVEQVN LF DLDESI+SP
Sbjct: 167  RDKIKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSLFKDLDESIVSP 226

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D +TFNGVMDAYGK G+IREME VLA+MKS + KPD+ITFNLLID+YG+KQ F KMEQV 
Sbjct: 227  DIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNLLIDSYGKKQAFGKMEQVF 286

Query: 996  ASLL 1007
             SLL
Sbjct: 287  KSLL 290



 Score = 65.5 bits (158), Expect = 3e-08
 Identities = 52/197 (26%), Positives = 88/197 (44%), Gaps = 31/197 (15%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 629
           ++ ++   GK G  R    + + M+++ CKPD   +N LI ++                L
Sbjct: 231 FNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNLLIDSYGKKQAFGKMEQVFKSLL 290

Query: 630 HSRD---------------KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 764
           HS++               KA+   KA   F+KM  M     S VT+  ++  +     V
Sbjct: 291 HSKERPSLPTFNSMILNYGKARLKDKAEDVFKKMTDMGYTL-SFVTHESMIYMYGFCDCV 349

Query: 765 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLL 944
            +   LF++L ES +     T N ++D Y   GL +E + +  +  S KI PD  TF LL
Sbjct: 350 SRAAQLFDELVESKVHIKVSTLNAMLDVYCLNGLPQEADSLFERAISIKIHPDSSTFKLL 409

Query: 945 IDAYGRKQEFEKMEQVL 995
             AY +  + E ++++L
Sbjct: 410 YKAYTKANQKELLDKLL 426


>ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citrus clementina]
            gi|568819570|ref|XP_006464322.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Citrus sinensis]
            gi|557530062|gb|ESR41312.1| hypothetical protein
            CICLE_v10025440mg [Citrus clementina]
          Length = 500

 Score =  390 bits (1002), Expect = e-106
 Identities = 180/244 (73%), Positives = 223/244 (91%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            S   E++ELVR ++R+ SDK+PL+ TLNK+V+ +R+EHCFLLFEELG+S++WLQCLEVFR
Sbjct: 74   SEELESKELVRVLMRSFSDKEPLVRTLNKYVKVVRSEHCFLLFEELGKSDKWLQCLEVFR 133

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWY+AD G+YSKLI+VMGKKGQ+R+AMWLFSEMRNSGC+PD SVYNALITAHLH+
Sbjct: 134  WMQKQRWYIADTGIYSKLIAVMGKKGQTRLAMWLFSEMRNSGCRPDPSVYNALITAHLHT 193

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDKAKAL+KAL YF+KMKGMERCKP++VTYNILLRA AQ++NV+QVN LF +LDESI++P
Sbjct: 194  RDKAKALAKALGYFQKMKGMERCKPNIVTYNILLRACAQARNVDQVNALFKELDESILAP 253

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D +T+NGVMDAYGK G+I+EME VL++MKS + KPDIITFNLLID+YG++Q F+KMEQV 
Sbjct: 254  DIYTYNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNLLIDSYGKRQAFDKMEQVF 313

Query: 996  ASLL 1007
             SL+
Sbjct: 314  KSLM 317



 Score = 68.6 bits (166), Expect = 4e-09
 Identities = 48/197 (24%), Positives = 87/197 (44%), Gaps = 31/197 (15%)
 Frame = +3

Query: 498 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 629
           Y+ ++   GK G  +    + S M+++ CKPD   +N LI ++                +
Sbjct: 258 YNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNLLIDSYGKRQAFDKMEQVFKSLM 317

Query: 630 HSRDK---------------AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 764
           HS++K               A+   KA   F+KM  M +  PS +TY  ++  +    NV
Sbjct: 318 HSKEKPTLPTFNSMIINYGKARLQGKAEYVFQKMTAM-KYTPSFITYECIITMYGYCDNV 376

Query: 765 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLL 944
            +   +F++L +        T N +++AY   GL  E +L+     +  + PD  T+ LL
Sbjct: 377 SRAREIFDELSKLGKDMKVSTLNAMLEAYCMNGLPTEADLLFENSHNMGVTPDSSTYKLL 436

Query: 945 IDAYGRKQEFEKMEQVL 995
             AY +    E ++++L
Sbjct: 437 YKAYTKANMKELVQKLL 453


>ref|XP_003630096.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355524118|gb|AET04572.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 635

 Score =  386 bits (992), Expect = e-105
 Identities = 181/240 (75%), Positives = 215/240 (89%)
 Frame = +3

Query: 285  SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 464
            SE +ELVR + R ISDK+PLL TLNK+V+ +RTEHCFLLFEELG+ ++WLQCLEVFRWMQ
Sbjct: 49   SETQELVRLLTRKISDKEPLLKTLNKYVKLVRTEHCFLLFEELGKHDKWLQCLEVFRWMQ 108

Query: 465  KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 644
            +QRWY+ADNGVYSKLISVMGKKGQ R+AMWLFSEMRN+GC+PDTSVYN+LI+AHLHSRDK
Sbjct: 109  RQRWYIADNGVYSKLISVMGKKGQIRLAMWLFSEMRNTGCRPDTSVYNSLISAHLHSRDK 168

Query: 645  AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 824
            +KAL KAL YFEKMK  ERCKP++VTYNILLRAFAQ+++V QVN LF DLDES +SPD +
Sbjct: 169  SKALVKALGYFEKMKTTERCKPNIVTYNILLRAFAQARDVNQVNYLFKDLDESSVSPDIY 228

Query: 825  TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASL 1004
            TFNGVMD YGK G+IREME VL +MKS ++K D+IT+NLLID+YG+KQ+F+KMEQV  SL
Sbjct: 229  TFNGVMDGYGKNGMIREMESVLVRMKSNQVKLDLITYNLLIDSYGKKQQFDKMEQVFKSL 288



 Score = 69.7 bits (169), Expect = 2e-09
 Identities = 51/169 (30%), Positives = 79/169 (46%)
 Frame = +3

Query: 498  YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
            Y+ LI   GKK Q      +F  +  S  KP    +N++I  +     KA+   KA + F
Sbjct: 265  YNLLIDSYGKKQQFDKMEQVFKSLSRSKEKPTLPTFNSMILNY----GKARLKDKAENVF 320

Query: 678  EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGK 857
            + M  M    PS VT+  L+  +     V     LF+ L ES +     T N ++D Y  
Sbjct: 321  QNMTDMGYT-PSFVTHESLIHMYGLCGCVSNAVELFDQLIESKVPIKVSTLNAMLDVYCI 379

Query: 858  VGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASL 1004
             GL +E + +  + KS KI PD  T+ LL  AY +    E ++++L  +
Sbjct: 380  NGLQQEADSLFTRAKSIKIFPDATTYKLLYKAYTKANSKELLDKLLKQM 428


>ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 420

 Score =  385 bits (989), Expect = e-104
 Identities = 180/236 (76%), Positives = 215/236 (91%)
 Frame = +3

Query: 300  LVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQKQRWY 479
            +VR ++R+ SDK+PL+ TLNK+V+ +RTEHCFLLFEELG+S +WLQCLEVFRWMQKQRWY
Sbjct: 1    MVRMLIRSFSDKEPLVKTLNKYVKIVRTEHCFLLFEELGKSGKWLQCLEVFRWMQKQRWY 60

Query: 480  VADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALS 659
            VADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALI+AHL+S+DK KAL 
Sbjct: 61   VADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKGKALE 120

Query: 660  KALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGV 839
            K L YF KMKGMERC+P++VTYNILLRA+AQ++NV++VN LF DLDESI  PD +T+NGV
Sbjct: 121  KGLVYFNKMKGMERCQPNIVTYNILLRAYAQARNVDKVNSLFKDLDESIACPDIYTYNGV 180

Query: 840  MDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASLL 1007
            MDAYGK G+IR+ME VL++MKS + KPDIITFNLLID+YG+KQ+F+KMEQV  SLL
Sbjct: 181  MDAYGKNGMIRDMESVLSRMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLL 236



 Score = 70.9 bits (172), Expect = 8e-10
 Identities = 51/200 (25%), Positives = 92/200 (46%), Gaps = 31/200 (15%)
 Frame = +3

Query: 498  YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 629
            Y+ ++   GK G  R    + S M+++ CKPD   +N LI ++                L
Sbjct: 177  YNGVMDAYGKNGMIRDMESVLSRMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLL 236

Query: 630  HSRD---------------KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 764
            HS++               KA+   +A S F++M  M +  PS +TY  L+  +    +V
Sbjct: 237  HSKERPTLPTFNSMIINYGKARLKEQAESVFKRMIDM-KYSPSFITYESLMMMYGYCDSV 295

Query: 765  EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLL 944
             +   +F+ + ES       T N ++D Y + GL  E + +L    S  I+P++ T+ LL
Sbjct: 296  SKAREIFDGVAESGQEMKVSTLNVMLDVYCRNGLPMEADKLLLSANSIGIRPNVCTYKLL 355

Query: 945  IDAYGRKQEFEKMEQVLASL 1004
              AY +    + ++++L S+
Sbjct: 356  YKAYTKANMKDLLDKLLKSM 375


>ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutrema salsugineum]
            gi|557112925|gb|ESQ53208.1| hypothetical protein
            EUTSA_v10024997mg [Eutrema salsugineum]
          Length = 496

 Score =  383 bits (984), Expect = e-104
 Identities = 182/244 (74%), Positives = 213/244 (87%)
 Frame = +3

Query: 276  SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 455
            S   E   LVRS++  ISD++PL+ TL+K+V+ +R EHCFLLFEELG+S++WLQCLEVFR
Sbjct: 64   SAERENRVLVRSLMSRISDREPLVKTLDKYVKVVRCEHCFLLFEELGKSDKWLQCLEVFR 123

Query: 456  WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 635
            WMQKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEM+NSGC+PD SVYNALITAHLH+
Sbjct: 124  WMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHT 183

Query: 636  RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 815
            RDKAKAL K   YF+KMKGMERC+P+VVTYNILLRAFAQS  V+QVN LF +LD S +SP
Sbjct: 184  RDKAKALEKVRGYFDKMKGMERCQPNVVTYNILLRAFAQSGKVDQVNALFKELDISAVSP 243

Query: 816  DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVL 995
            D +TFNGVMDAYGK G+I+EME VL +M+S + KPDIITFNLLID+YG+KQEFEKMEQ  
Sbjct: 244  DVYTFNGVMDAYGKNGMIKEMESVLTRMRSNECKPDIITFNLLIDSYGKKQEFEKMEQTF 303

Query: 996  ASLL 1007
             SLL
Sbjct: 304  KSLL 307



 Score = 62.4 bits (150), Expect = 3e-07
 Identities = 44/171 (25%), Positives = 79/171 (46%), Gaps = 2/171 (1%)
 Frame = +3

Query: 498  YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYF 677
            ++ LI   GKK +       F  +  S  KP    +N++I  +     KA+   KA   F
Sbjct: 283  FNLLIDSYGKKQEFEKMEQTFKSLLRSKEKPTLPTFNSMIINY----GKARRRDKAEWVF 338

Query: 678  EKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDES--IISPDAFTFNGVMDAY 851
            EKM  M    PS +TY  ++  +    +V +   +F ++ ES  ++ P   T N ++D Y
Sbjct: 339  EKMNDMNYM-PSFITYECMIMMYGYCGSVSRAREMFEEVVESERVLKPS--TLNAMLDVY 395

Query: 852  GKVGLIREMELVLAKMKSKKIKPDIITFNLLIDAYGRKQEFEKMEQVLASL 1004
               GL  E + +     + ++ PD  T+ LL  AY +    E+++ ++  +
Sbjct: 396  CLNGLHMEADKLFHSASAFRVHPDASTYKLLYKAYTKADMKERVQMLMKKM 446


Top