BLASTX nr result

ID: Catharanthus23_contig00020923 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00020923
         (2706 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004236239.1| PREDICTED: pentatricopeptide repeat-containi...   632   e-178
ref|XP_006344442.1| PREDICTED: pentatricopeptide repeat-containi...   629   e-177
ref|XP_006465271.1| PREDICTED: pentatricopeptide repeat-containi...   605   e-170
gb|EMJ27555.1| hypothetical protein PRUPE_ppa022331mg [Prunus pe...   598   e-168
gb|EOY25969.1| Pentatricopeptide repeat superfamily protein isof...   598   e-168
ref|XP_002306508.1| pentatricopeptide repeat-containing family p...   590   e-165
ref|XP_002274114.2| PREDICTED: pentatricopeptide repeat-containi...   590   e-165
ref|XP_004158824.1| PREDICTED: pentatricopeptide repeat-containi...   575   e-161
gb|EXB67206.1| hypothetical protein L484_025684 [Morus notabilis]     572   e-160
ref|XP_004297847.1| PREDICTED: pentatricopeptide repeat-containi...   566   e-158
ref|XP_004136096.1| PREDICTED: uncharacterized protein LOC101205...   552   e-154
ref|XP_006302258.1| hypothetical protein CARUB_v10020295mg [Caps...   537   e-149
gb|EOY25971.1| Pentatricopeptide repeat (PPR) superfamily protei...   535   e-149
ref|NP_178170.1| pentatricopeptide repeat-containing protein [Ar...   533   e-148
ref|XP_002887822.1| pentatricopeptide repeat-containing protein ...   533   e-148
gb|EPS66144.1| hypothetical protein M569_08630, partial [Genlise...   532   e-148
ref|XP_006590730.1| PREDICTED: pentatricopeptide repeat-containi...   529   e-147
ref|XP_006389809.1| hypothetical protein EUTSA_v10018541mg [Eutr...   528   e-147
ref|XP_003542124.1| PREDICTED: pentatricopeptide repeat-containi...   528   e-147
ref|XP_003597983.1| Pentatricopeptide repeat-containing protein ...   526   e-146

>ref|XP_004236239.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Solanum lycopersicum]
          Length = 467

 Score =  632 bits (1630), Expect = e-178
 Identities = 316/487 (64%), Positives = 372/487 (76%), Gaps = 2/487 (0%)
 Frame = +1

Query: 97   MHSLVTCRRYSLDFSRFILFHFQDFLQQ--YHTSPSKNPSLVSNSKNPITVFPQNCVXXX 270
            M S V  R  +   S  +L  F+  L    YH++PSK P   S        FP       
Sbjct: 1    MLSSVISRSSTFLPSSLLLVQFETLLHHHGYHSTPSKAPKPTS--------FPN------ 46

Query: 271  XXXXXXXXXARRIPSSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTT 450
                         PSS        N   VLETLS Y+NDW++ALEFFNW E  CGF HT+
Sbjct: 47   ------YDDPNSSPSSTSASSDPLNPTIVLETLSCYNNDWRRALEFFNWAETQCGFHHTS 100

Query: 451  ETYNRVIDILGKYFEFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTF 630
            +T N++IDILGK+FEFDAAW LI KMR  SS   PDHTTFR++FKRYVSAH+VKEA D F
Sbjct: 101  QTSNQLIDILGKFFEFDAAWSLIEKMRSVSSM--PDHTTFRVLFKRYVSAHMVKEAIDMF 158

Query: 631  DQMEEFNLRDDVSFSNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRG 810
            D+MEEFNL+D VSFSNLIDALCE+KHVIEAE+LCF  N N  K+    ++ KI NM+LRG
Sbjct: 159  DKMEEFNLKDQVSFSNLIDALCEYKHVIEAEDLCFPKNKNDVKYSCFKVDTKICNMLLRG 218

Query: 811  WFKMGWWRKCREFWEEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLD 990
            WFKM WW KCR+FWEEMD RG++KDLYSYSIYMD+QCKSGK ++AV LYKEMKKKGI LD
Sbjct: 219  WFKMSWWGKCRQFWEEMDTRGVQKDLYSYSIYMDVQCKSGKPWKAVKLYKEMKKKGIDLD 278

Query: 991  VVAYNTVIRAIGISDGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLN 1170
            V+AYNTVIRAIGI+DGVDVA KL +EM+ELGC+PNV+TYN+++KLMCENGR ++A ++LN
Sbjct: 279  VIAYNTVIRAIGIADGVDVAAKLCQEMIELGCKPNVSTYNTLIKLMCENGRYRDAYKVLN 338

Query: 1171 EMFKKGYKPNVLTYHCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLR 1350
            +M +KG +PNV+TY+ FFG L+KP+ IL LFDRMIE+GVRPRMDTYVMLMRKFGRW FLR
Sbjct: 339  QMPQKGCEPNVITYNSFFGCLEKPREILTLFDRMIESGVRPRMDTYVMLMRKFGRWEFLR 398

Query: 1351 PVFLIWEKIEEHGLSPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGK 1530
            PVF++WEK+E+ GLSPD  AYNALIDALVQKGMVDMARKYDEEM+AKGLSAKPR ELG K
Sbjct: 399  PVFILWEKMEKQGLSPDASAYNALIDALVQKGMVDMARKYDEEMLAKGLSAKPRVELGTK 458

Query: 1531 LVSVASE 1551
            L S   E
Sbjct: 459  LTSADCE 465


>ref|XP_006344442.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Solanum tuberosum]
          Length = 467

 Score =  629 bits (1623), Expect = e-177
 Identities = 309/469 (65%), Positives = 366/469 (78%), Gaps = 2/469 (0%)
 Frame = +1

Query: 139  SRFILFHFQDFLQQ--YHTSPSKNPSLVSNSKNPITVFPQNCVXXXXXXXXXXXXARRIP 312
            S  +L  F+  L    YH++PS  P+  S        FP                    P
Sbjct: 15   SSLLLAQFETLLHHHGYHSTPSNAPNTTS--------FPN------------YDDPNSSP 54

Query: 313  SSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYF 492
            SS        N   +LETLS Y+NDW++ALEFFNW E  CGF HT++T N++IDILGK+F
Sbjct: 55   SSTSASSDPLNPTIMLETLSCYNNDWRRALEFFNWAETQCGFHHTSQTCNQLIDILGKFF 114

Query: 493  EFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSF 672
            EFDAAW LI KMR  SS   PDHTTFR++FKRYVSAH+VKEA D FD+MEEFNL+D VSF
Sbjct: 115  EFDAAWSLIEKMRSVSSM--PDHTTFRVLFKRYVSAHMVKEAIDMFDKMEEFNLKDQVSF 172

Query: 673  SNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFW 852
            SNLIDALCE+KHVIEAE+LCF  N N  K+    ++ KI NM+LRGWFKM WW KCR+FW
Sbjct: 173  SNLIDALCEYKHVIEAEDLCFPKNKNDVKYSCFKVDTKICNMLLRGWFKMSWWGKCRQFW 232

Query: 853  EEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGIS 1032
            EEMD RG++KDLYSYSIYMD+QCKSGK ++AV LYKEMKKKGI LDV+AYNTVIRAIGIS
Sbjct: 233  EEMDTRGVQKDLYSYSIYMDVQCKSGKPWKAVKLYKEMKKKGINLDVIAYNTVIRAIGIS 292

Query: 1033 DGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTY 1212
            DGVDVA KL +EM+ELGC+PNV+TYN+++KLMCENGR ++A ++L++M  KG +PNV+TY
Sbjct: 293  DGVDVAAKLCQEMIELGCKPNVSTYNTLIKLMCENGRYRDAYKVLSQMPHKGCEPNVITY 352

Query: 1213 HCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGL 1392
            + FFG L+KP+ IL LFDRMIE+GVRPRMDTYVMLMRKFGRWGFLRPVF++WEK+E+ GL
Sbjct: 353  NSFFGCLEKPREILKLFDRMIESGVRPRMDTYVMLMRKFGRWGFLRPVFILWEKMEKQGL 412

Query: 1393 SPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVS 1539
            SPD  AYNALIDALVQKGMVDMARKYDEEM+AKGLSAKPR ELG KL +
Sbjct: 413  SPDASAYNALIDALVQKGMVDMARKYDEEMLAKGLSAKPRVELGTKLTT 461


>ref|XP_006465271.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Citrus sinensis]
          Length = 455

 Score =  605 bits (1559), Expect = e-170
 Identities = 287/410 (70%), Positives = 346/410 (84%), Gaps = 1/410 (0%)
 Frame = +1

Query: 310  PSSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKY 489
            P SN ++   F+ +TV ETLS Y+NDWK+ALEFFNWVE  C F HTT+TYN VIDILGK+
Sbjct: 40   PQSNPHN---FHQSTVRETLSCYANDWKRALEFFNWVETDCHFTHTTDTYNSVIDILGKF 96

Query: 490  FEFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVS 669
            FEFD +W+LI +M+ + S + P+H TFRIMFKRYV+AHLV EA  TF++++EF L+D+VS
Sbjct: 97   FEFDLSWNLIHRMKDNPS-SIPNHATFRIMFKRYVTAHLVNEAMGTFNKLDEFGLKDEVS 155

Query: 670  FSNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLN-VKIYNMILRGWFKMGWWRKCRE 846
            + NL+DALCE+KHVIEA+ELCFG N N G      +N  KIYNMILRGWFKM WW KCRE
Sbjct: 156  YCNLVDALCEYKHVIEAQELCFGENKNVGFSGLVEMNKTKIYNMILRGWFKMSWWGKCRE 215

Query: 847  FWEEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIG 1026
            FWEEMDKRG+ KDL+SYSIYMDI CKSGK ++AV LYKEMKKK IK+DVVAYNTVIRA+G
Sbjct: 216  FWEEMDKRGVVKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKRIKMDVVAYNTVIRAVG 275

Query: 1027 ISDGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVL 1206
            IS+GVD A+++Y EM E+GCQP+V T N+++KL+CENGR KEA  +L EM KKG  P+V+
Sbjct: 276  ISEGVDFAMRVYCEMREMGCQPSVVTCNTVIKLLCENGRVKEAYAVLAEMPKKGCVPDVI 335

Query: 1207 TYHCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEH 1386
            TYHCFF  L+KP+ IL LFDRMIE+G+RP+MDTYVML+RKFGRWGFLRPVF++W+K+EE 
Sbjct: 336  TYHCFFRCLEKPREILGLFDRMIESGIRPKMDTYVMLLRKFGRWGFLRPVFVVWKKMEEL 395

Query: 1387 GLSPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLV 1536
            G SPDEFAYNAL+DAL+ KGM+DMARKYDEEM AKGLSAKPREELG KLV
Sbjct: 396  GCSPDEFAYNALVDALIDKGMLDMARKYDEEMFAKGLSAKPREELGTKLV 445


>gb|EMJ27555.1| hypothetical protein PRUPE_ppa022331mg [Prunus persica]
          Length = 455

 Score =  598 bits (1542), Expect = e-168
 Identities = 281/406 (69%), Positives = 343/406 (84%)
 Frame = +1

Query: 340  FNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDLI 519
            ++  TV ETLSSY NDWK+AL+FFNW+E  C F HTT TYNR++DILGK+FEF+  W+LI
Sbjct: 55   YDHTTVRETLSSYCNDWKKALDFFNWLETECHFLHTTVTYNRMLDILGKFFEFELCWNLI 114

Query: 520  RKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCE 699
            +KM K +  + PDHTTFRI+FKRYVSAHLVKEA DT++++EEF L+D+ S+ NLIDALCE
Sbjct: 115  QKM-KQNPVSVPDHTTFRILFKRYVSAHLVKEAIDTYNRLEEFGLKDETSYCNLIDALCE 173

Query: 700  HKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIK 879
            +KHVIEA+ELCF  N    K L    + K+YN++LRGW KMGWW KCR+FWEEMD+RG++
Sbjct: 174  YKHVIEAQELCFWKN----KDLGFDKSTKLYNLLLRGWLKMGWWGKCRDFWEEMDRRGVR 229

Query: 880  KDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKL 1059
            KDL+SYSIYMDI CKSGK ++AV LYKEMK KGIKLDVVAYNTVIRAIG+SDGVD +++L
Sbjct: 230  KDLHSYSIYMDILCKSGKPWKAVKLYKEMKNKGIKLDVVAYNTVIRAIGLSDGVDFSMRL 289

Query: 1060 YREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSLQK 1239
             REM ELGCQPNV TYN+I+KL+CENGR KEA  +L++M + G  P+V+TYHC F  L+K
Sbjct: 290  LREMKELGCQPNVGTYNTIIKLLCENGRCKEAFSLLHQMPRMGLLPDVITYHCIFKHLEK 349

Query: 1240 PKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYNA 1419
            P  IL LFDRM E+GV+P+MDT+VMLMRKFGRWGFLRP+FL+W ++E+ G SPDE AYNA
Sbjct: 350  PNEILRLFDRMTESGVQPKMDTFVMLMRKFGRWGFLRPMFLVWNRMEKLGCSPDESAYNA 409

Query: 1420 LIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVSVASEDG 1557
            LIDALV+KGM+DMAR+YDEEM+AKGLSAKPREELG KLVS  S+DG
Sbjct: 410  LIDALVEKGMLDMARQYDEEMLAKGLSAKPREELGTKLVSSESDDG 455


>gb|EOY25969.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 481

 Score =  598 bits (1541), Expect = e-168
 Identities = 279/399 (69%), Positives = 341/399 (85%), Gaps = 1/399 (0%)
 Frame = +1

Query: 337  DFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDL 516
            +F+  TV ETLS YSNDWK+ALEFFNWVE  C F HTTET+N+++DILGK FEFD +WDL
Sbjct: 51   NFDHQTVRETLSCYSNDWKRALEFFNWVETQCQFPHTTETFNKMLDILGKSFEFDLSWDL 110

Query: 517  IRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALC 696
            I +M K+   + PDH TFRI+FKRY++AHLVKEA  TFD++EEFNL+D++SF NL+DALC
Sbjct: 111  IDRM-KNKPCSIPDHATFRILFKRYITAHLVKEAISTFDRLEEFNLKDEISFCNLVDALC 169

Query: 697  EHKHVIEAEELCFGSNSNGGKHLFCSLN-VKIYNMILRGWFKMGWWRKCREFWEEMDKRG 873
            E+KHVIEA+ELCF       K +  S+N  KI+NMILRGWFKMGWW KCREFW+EMDK+G
Sbjct: 170  EYKHVIEAQELCFFGKI---KEIGLSVNDTKIHNMILRGWFKMGWWSKCREFWQEMDKKG 226

Query: 874  IKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVAL 1053
            +KKDL+SYSIYMDI CKSGK ++AV LYKEMKKKG+KLDVVAYNTVIRAIGIS+G D  +
Sbjct: 227  VKKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIRAIGISEGADFGV 286

Query: 1054 KLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSL 1233
             ++REM +LGC+PNV TYN+++KL+CENGR ++A  +L++M KK   P+V+TYHCFFG L
Sbjct: 287  GVFREMRDLGCEPNVVTYNTVIKLLCENGRVRQAYAVLDQMLKKDCAPDVITYHCFFGCL 346

Query: 1234 QKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAY 1413
            +KP+ IL LFD MI NG++PRMDTYVMLMRKFGRWGFLRPVF++W+K+EE G SP+EFAY
Sbjct: 347  EKPREILKLFDLMITNGIQPRMDTYVMLMRKFGRWGFLRPVFMVWKKMEELGSSPNEFAY 406

Query: 1414 NALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGK 1530
            NALIDAL+QKGM+DMARKYDEEM+ KGLS+KPREELG K
Sbjct: 407  NALIDALIQKGMLDMARKYDEEMLEKGLSSKPREELGTK 445


>ref|XP_002306508.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222855957|gb|EEE93504.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 439

 Score =  590 bits (1521), Expect = e-165
 Identities = 285/411 (69%), Positives = 344/411 (83%), Gaps = 5/411 (1%)
 Frame = +1

Query: 304  RIPSSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILG 483
            R P+   +D L+ +S+TV +TLS Y+NDWK+AL+FFNWVE    FQHTTETYNR+IDILG
Sbjct: 32   RTPNPPQSDPLNLDSSTVFQTLSCYNNDWKRALDFFNWVETESQFQHTTETYNRMIDILG 91

Query: 484  KYFEFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTF-DQMEEFNLRD 660
            K+FEFD +WDLI++MR ++ F+ P+HTTFR++F RY+SAHLV EA   + D+++EF L+D
Sbjct: 92   KFFEFDLSWDLIQRMR-NNPFSTPNHTTFRVLFHRYISAHLVNEAVSVYEDRLKEFGLKD 150

Query: 661  DVSFSNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLNV----KIYNMILRGWFKMGW 828
            + S+  L+DALCE+KHVIEA ELCFG+N+N       S+NV    KIYNMILRGWFKMGW
Sbjct: 151  ETSYCILVDALCEYKHVIEAHELCFGNNNN-------SINVRNITKIYNMILRGWFKMGW 203

Query: 829  WRKCREFWEEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNT 1008
            W KCREFWEEMD++ + KDL+SYSIYMDI CKSGK ++AV LYKEMK KGIKLDVVAYNT
Sbjct: 204  WGKCREFWEEMDRKEVCKDLHSYSIYMDILCKSGKPWKAVKLYKEMKSKGIKLDVVAYNT 263

Query: 1009 VIRAIGISDGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKG 1188
            VI AIG+S+GVD  L++YREM ELGCQPNV T N+++KL+CENGR KEA +ML+EM +  
Sbjct: 264  VINAIGLSEGVDFVLRVYREMRELGCQPNVVTCNTVIKLLCENGRIKEAYKMLDEMPQSY 323

Query: 1189 YKPNVLTYHCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIW 1368
              P+V TYHCFF  L+KPK IL LFD+MIENGV PRMDTYVMLMRKFGRWGFLRPVFL+W
Sbjct: 324  IAPDVFTYHCFFRCLEKPKEILCLFDQMIENGVCPRMDTYVMLMRKFGRWGFLRPVFLVW 383

Query: 1369 EKIEEHGLSPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREEL 1521
            +K+E+ G SPDEFAYNALIDAL+QKGMVDMARKYDEEM+AKGLSAKPR EL
Sbjct: 384  KKMEKLGCSPDEFAYNALIDALIQKGMVDMARKYDEEMMAKGLSAKPRVEL 434


>ref|XP_002274114.2| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Vitis vinifera]
          Length = 571

 Score =  590 bits (1520), Expect = e-165
 Identities = 276/397 (69%), Positives = 338/397 (85%)
 Frame = +1

Query: 340  FNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDLI 519
            F+ +TV +TLS Y+NDWK+ALEFF+WV+  CGF HTT+TYN +IDILGK+FEFD  W LI
Sbjct: 176  FDHSTVRQTLSCYANDWKRALEFFDWVQTQCGFNHTTDTYNGMIDILGKFFEFDLIWVLI 235

Query: 520  RKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCE 699
            ++M K+   A P+H TFR +FKRY +AHLV+EA + + + EEFNLRD+ S+SNLIDALCE
Sbjct: 236  QRM-KADPVAYPNHVTFRFVFKRYAAAHLVEEAMNAYYRTEEFNLRDETSYSNLIDALCE 294

Query: 700  HKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIK 879
            +KHVIEAEEL    +    K L  + +VKIYN+ILRGWFKMGWW+KCREFWEEMD+RG+ 
Sbjct: 295  YKHVIEAEELFLKES----KDLVFNDDVKIYNIILRGWFKMGWWKKCREFWEEMDRRGVC 350

Query: 880  KDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKL 1059
            K LYSYSIYMDIQCKSGK +RAV LYKEMKKKGI+LDVVAYNTVIRAIG+S+GVD ++++
Sbjct: 351  KSLYSYSIYMDIQCKSGKPWRAVKLYKEMKKKGIRLDVVAYNTVIRAIGLSEGVDFSIRV 410

Query: 1060 YREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSLQK 1239
            +REM E+GC+PNV TYN+I+KL+CENGR +EA  + ++M +KGY PNV+TYHCFFG ++K
Sbjct: 411  FREMKEVGCEPNVVTYNTIIKLLCENGRIREAYGVFDQMREKGYAPNVITYHCFFGCIEK 470

Query: 1240 PKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYNA 1419
            PK IL  FDRMI +GVRPRMDTYVMLM+KFGRWGFLRPVF++W+K+EE G SPD  AYNA
Sbjct: 471  PKQILRTFDRMINSGVRPRMDTYVMLMKKFGRWGFLRPVFIVWKKMEEQGCSPDACAYNA 530

Query: 1420 LIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGK 1530
            LIDALVQKGMVD+ARKY+EEM+AKGLSAKPR +LG K
Sbjct: 531  LIDALVQKGMVDLARKYEEEMLAKGLSAKPRVDLGTK 567


>ref|XP_004158824.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Cucumis sativus]
          Length = 450

 Score =  575 bits (1482), Expect = e-161
 Identities = 270/401 (67%), Positives = 330/401 (82%)
 Frame = +1

Query: 337  DFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDL 516
            +F+  TV E L SY NDWK++ EFFNWVE  C F HTTETYNR++DILGK+FEFD +W L
Sbjct: 44   NFDPFTVREALDSYCNDWKRSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVL 103

Query: 517  IRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALC 696
            I +MR+S S A PDH TFRI+FKRY  AHLV EA   ++++ EF LRD+ SF NLIDALC
Sbjct: 104  INRMRQSPS-ASPDHATFRILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALC 162

Query: 697  EHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGI 876
            E +HV EA+ELCFG N    + L C  + KI+N+ILRGW KMGWW KCR+FWEEMDK+G+
Sbjct: 163  ESRHVDEAQELCFGKN----RKLDCDSSTKIHNLILRGWLKMGWWSKCRDFWEEMDKKGV 218

Query: 877  KKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALK 1056
            +KDL+SYSIYMDIQCKSGK ++AV LYKEMKKKG+KLDVVAYNTVI A+GIS+GVD A +
Sbjct: 219  RKDLHSYSIYMDIQCKSGKPWKAVKLYKEMKKKGMKLDVVAYNTVIHAVGISEGVDFASR 278

Query: 1057 LYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSLQ 1236
            ++ EM E+GC+PNV T N+++KL CENGR K+A  ML++M K+  +PNV+TYHCFF SL+
Sbjct: 279  VFHEMKEMGCKPNVVTCNTVIKLFCENGRFKDAHMMLDQMLKRDCQPNVITYHCFFRSLE 338

Query: 1237 KPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYN 1416
            KPK IL LFDRMI+ GV P+MDTYVML+RKFGRWGFLRPVFL+W K+EE G SP+E AYN
Sbjct: 339  KPKEILVLFDRMIKYGVHPKMDTYVMLLRKFGRWGFLRPVFLVWNKMEELGCSPNECAYN 398

Query: 1417 ALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVS 1539
            ALIDALV+KGM+DMARKYDEEM+AKGLS K R ELG ++++
Sbjct: 399  ALIDALVEKGMIDMARKYDEEMVAKGLSPKLRVELGTQMMN 439


>gb|EXB67206.1| hypothetical protein L484_025684 [Morus notabilis]
          Length = 442

 Score =  572 bits (1475), Expect = e-160
 Identities = 263/405 (64%), Positives = 339/405 (83%)
 Frame = +1

Query: 325  NDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDA 504
            +D L FNS TV ETL+SY NDW++A EFF WVE +C F HTT+TYNR++DILGK+FEFD 
Sbjct: 41   SDSLQFNSDTVTETLTSYCNDWQRAFEFFTWVETNCRFLHTTDTYNRMLDILGKFFEFDL 100

Query: 505  AWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLI 684
            +WDLI +M ++   + P H TFR+MF RY +AHLVKEA + +++ EEF L+D+ ++SNLI
Sbjct: 101  SWDLIHRMNQNP-VSVPSHATFRVMFHRYAAAHLVKEAVEAYNRSEEFGLKDETTYSNLI 159

Query: 685  DALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMD 864
            DALC+ KHVIEA++LCF +    GK L    + KIYNMILRGW ++GWW KC +FWEEMD
Sbjct: 160  DALCDQKHVIEAQDLCFWN----GKELGFEKSTKIYNMILRGWSRVGWWSKCGDFWEEMD 215

Query: 865  KRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVD 1044
            +RG++KDL++YSIYMDI CKSGK ++AV LYKEMKKK IKLDVVAYNT++RA+G+S+GVD
Sbjct: 216  RRGLEKDLHTYSIYMDILCKSGKPWKAVKLYKEMKKKRIKLDVVAYNTIVRAVGLSEGVD 275

Query: 1045 VALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFF 1224
             ++++ REM ELGCQPNV TYN+++KL+CENGR +EA ++L++M + G  P+V+TYHCFF
Sbjct: 276  FSMRVLREMRELGCQPNVVTYNTLIKLLCENGRYREASKVLDKMPEWGCSPDVITYHCFF 335

Query: 1225 GSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDE 1404
            GS++KPK IL LFDRMI++G+RPR DTYVMLMRKFGRWGFLRPV ++W+K+EE G SP++
Sbjct: 336  GSMEKPKEILRLFDRMIDSGIRPRTDTYVMLMRKFGRWGFLRPVLVVWKKMEELGCSPND 395

Query: 1405 FAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVS 1539
             AYNALIDAL+ KGM+DMARKYDEEM+AKGLS KPR ELG +LV+
Sbjct: 396  AAYNALIDALIDKGMLDMARKYDEEMLAKGLSPKPRAELGTRLVT 440


>ref|XP_004297847.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 465

 Score =  566 bits (1459), Expect = e-158
 Identities = 270/406 (66%), Positives = 329/406 (81%)
 Frame = +1

Query: 340  FNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDLI 519
            ++  TV ETLSSY NDWK+AL+FF WVE    FQHTTETYNR++DILGKYFEF+  WDL+
Sbjct: 65   YDHTTVRETLSSYCNDWKKALDFFIWVESQPHFQHTTETYNRLLDILGKYFEFELCWDLV 124

Query: 520  RKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCE 699
             KM K +    PDHTTFRIMFKRYVSAHLVKEA DT+++++EF L+D+ S+ NL+DALCE
Sbjct: 125  HKM-KQNPLCVPDHTTFRIMFKRYVSAHLVKEAIDTYNKLDEFGLKDETSYCNLVDALCE 183

Query: 700  HKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIK 879
            HKHVIEA+ELC   N    K L    + K++N+ILRGW KMGWW KCR+FWEEMD+RG+ 
Sbjct: 184  HKHVIEAQELCSWKN----KELGFDRSTKLHNIILRGWSKMGWWGKCRDFWEEMDRRGVC 239

Query: 880  KDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKL 1059
            KDL+SYSIYMDI CKSGKA++AV LYKE+K+K IKLDVVAYNTVI A+G S+GVD A+++
Sbjct: 240  KDLHSYSIYMDIMCKSGKAWKAVKLYKEVKRKRIKLDVVAYNTVIGAVGASEGVDFAIRI 299

Query: 1060 YREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSLQK 1239
             REM ELGC PN+ TYN+I+KL+CEN R +EA  ML  M K    P+V+TY   F  L+K
Sbjct: 300  LREMKELGCDPNIVTYNTIIKLLCENMRVREAFSMLRVMSKNSCGPDVITYQIIFKYLEK 359

Query: 1240 PKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYNA 1419
            P  IL LFDRMIE+GV+PRMDTYVM+MRKFGRWGFLRP+F++W+K+E+ G SP+E AYNA
Sbjct: 360  PNEILRLFDRMIESGVQPRMDTYVMIMRKFGRWGFLRPMFIVWQKMEKLGCSPNESAYNA 419

Query: 1420 LIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVSVASEDG 1557
            LIDALV+KGM+DMARKYDEEMIAKGL  +PR ELG KLVS   ++G
Sbjct: 420  LIDALVEKGMLDMARKYDEEMIAKGLPTRPRVELGTKLVSNEFDEG 465


>ref|XP_004136096.1| PREDICTED: uncharacterized protein LOC101205322 [Cucumis sativus]
          Length = 1559

 Score =  552 bits (1423), Expect = e-154
 Identities = 259/382 (67%), Positives = 317/382 (82%)
 Frame = +1

Query: 394  QALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDLIRKMRKSSSFARPDHTTFR 573
            ++ EFFNWVE  C F HTTETYNR++DILGK+FEFD +W LI +MR+S S A PDH TFR
Sbjct: 614  RSYEFFNWVESECKFDHTTETYNRMLDILGKFFEFDLSWVLINRMRQSPS-ASPDHATFR 672

Query: 574  IMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCEHKHVIEAEELCFGSNSNG 753
            I+FKRY  AHLV EA   ++++ EF LRD+ SF NLIDALCE +HV EA+ELCFG N   
Sbjct: 673  ILFKRYALAHLVSEAIAAYERLREFKLRDETSFCNLIDALCESRHVDEAQELCFGKN--- 729

Query: 754  GKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIKKDLYSYSIYMDIQCKSGK 933
             + L C  + KI+N+ILRGW KMGWW KCR+FWEEMDK+G++KDL+SYSIYMDIQCKSGK
Sbjct: 730  -RKLDCDSSTKIHNLILRGWLKMGWWSKCRDFWEEMDKKGVRKDLHSYSIYMDIQCKSGK 788

Query: 934  AYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKLYREMVELGCQPNVATYNS 1113
             ++AV LYKEMKKKG+KLDVVAYNTVI A+GIS+GVD A +++ EM E+GC+PNV T N+
Sbjct: 789  PWKAVKLYKEMKKKGMKLDVVAYNTVIHAVGISEGVDFASRVFHEMKEMGCKPNVVTCNT 848

Query: 1114 ILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFGSLQKPKMILNLFDRMIENGVRP 1293
            ++KL CENGR K+A  ML++M K+  +PNV+TYHCFF SL+KPK IL LFDRMI+ GV P
Sbjct: 849  VIKLFCENGRFKDAHMMLDQMLKRDCQPNVITYHCFFRSLEKPKEILVLFDRMIKYGVHP 908

Query: 1294 RMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYNALIDALVQKGMVDMARKYD 1473
            +MDTYVML+RKFGRWGFLRPVFL+W K+EE G SP+E AYNALIDALV+KGM+DMARKYD
Sbjct: 909  KMDTYVMLLRKFGRWGFLRPVFLVWNKMEELGCSPNECAYNALIDALVEKGMIDMARKYD 968

Query: 1474 EEMIAKGLSAKPREELGGKLVS 1539
            EEM+AKGLS K R ELG ++++
Sbjct: 969  EEMVAKGLSPKLRVELGTQMMN 990



 Score = 97.8 bits (242), Expect = 2e-17
 Identities = 52/110 (47%), Positives = 65/110 (59%), Gaps = 16/110 (14%)
 Frame = +3

Query: 2424 ELSDMKFNRRFGASCQNMASNSKL---------------NFDENRWIIQIRQTXXXXXXX 2558
            EL     N  + A+C  M  +SK                +FDE RW+IQIRQ+       
Sbjct: 983  ELGTQMMNGGYHANCSTMRFSSKSRLHSLPAGNSWGLNSDFDEERWVIQIRQSLDEEELE 1042

Query: 2559 XXX-IPVSIFCVPKTLIVSNPDFYIPQQVALGPYHHWRSELYDMERYKLA 2705
                IPV IF VPK+L+V +PD YIPQ+VA+GPYHHWR ELY+MERYK+A
Sbjct: 1043 EDTGIPVCIFNVPKSLMVIDPDSYIPQEVAIGPYHHWRQELYEMERYKIA 1092


>ref|XP_006302258.1| hypothetical protein CARUB_v10020295mg [Capsella rubella]
            gi|482570968|gb|EOA35156.1| hypothetical protein
            CARUB_v10020295mg [Capsella rubella]
          Length = 447

 Score =  537 bits (1383), Expect = e-149
 Identities = 251/398 (63%), Positives = 320/398 (80%)
 Frame = +1

Query: 328  DILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAA 507
            D   ++   V E LS YSNDW++ALEFFNWVE   GF+HTTET+NR+IDILGKYFEFDA+
Sbjct: 40   DQSSYDQKAVCEALSCYSNDWQKALEFFNWVEKESGFRHTTETFNRMIDILGKYFEFDAS 99

Query: 508  WDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLID 687
            W LI +M      + P+H TFRI+FKRYV+AHLV+EA D +D++++FNLRD+ SF NL+D
Sbjct: 100  WGLINRMIGMPQ-SLPNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDETSFYNLVD 158

Query: 688  ALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDK 867
            ALCEHKHV+EAEELCFG N     + F   N KI+N+ILRGW K+GWW KC+EFWE+MD 
Sbjct: 159  ALCEHKHVVEAEELCFGKNVIA--NAFSLSNTKIHNLILRGWSKLGWWGKCKEFWEKMDT 216

Query: 868  RGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDV 1047
             G+ KDL+SYSIYMDI CKSGK ++AV LYKEM+ +GIKLDVVAYNTVIRAIG S GV+ 
Sbjct: 217  EGVAKDLFSYSIYMDIMCKSGKPWKAVRLYKEMRSRGIKLDVVAYNTVIRAIGASQGVEF 276

Query: 1048 ALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFG 1227
             ++++REM + GC+PNVAT+N+I+KL+CENGR ++A +ML+EM KKG + + +TY C F 
Sbjct: 277  GIRVFREMRDRGCEPNVATHNTIIKLLCENGRMRDAYQMLDEMPKKGCQADSITYMCLFA 336

Query: 1228 SLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEF 1407
             L+KP  IL+LF RMI +GVRP+MDTYVMLMRKF RWGFL+PV  +W+ ++E G +PD  
Sbjct: 337  RLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGDTPDAA 396

Query: 1408 AYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREEL 1521
            AYNA+IDAL+QKGM+DMAR+Y+EEMI +GLS + R EL
Sbjct: 397  AYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRRPEL 434


>gb|EOY25971.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 3
            [Theobroma cacao]
          Length = 360

 Score =  535 bits (1379), Expect = e-149
 Identities = 251/358 (70%), Positives = 308/358 (86%), Gaps = 1/358 (0%)
 Frame = +1

Query: 466  VIDILGKYFEFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEE 645
            ++DILGK FEFD +WDLI +M K+   + PDH TFRI+FKRY++AHLVKEA  TFD++EE
Sbjct: 1    MLDILGKSFEFDLSWDLIDRM-KNKPCSIPDHATFRILFKRYITAHLVKEAISTFDRLEE 59

Query: 646  FNLRDDVSFSNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLN-VKIYNMILRGWFKM 822
            FNL+D++SF NL+DALCE+KHVIEA+ELCF       K +  S+N  KI+NMILRGWFKM
Sbjct: 60   FNLKDEISFCNLVDALCEYKHVIEAQELCFFGKI---KEIGLSVNDTKIHNMILRGWFKM 116

Query: 823  GWWRKCREFWEEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAY 1002
            GWW KCREFW+EMDK+G+KKDL+SYSIYMDI CKSGK ++AV LYKEMKKKG+KLDVVAY
Sbjct: 117  GWWSKCREFWQEMDKKGVKKDLHSYSIYMDIMCKSGKPWKAVKLYKEMKKKGMKLDVVAY 176

Query: 1003 NTVIRAIGISDGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFK 1182
            NTVIRAIGIS+G D  + ++REM +LGC+PNV TYN+++KL+CENGR ++A  +L++M K
Sbjct: 177  NTVIRAIGISEGADFGVGVFREMRDLGCEPNVVTYNTVIKLLCENGRVRQAYAVLDQMLK 236

Query: 1183 KGYKPNVLTYHCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFL 1362
            K   P+V+TYHCFFG L+KP+ IL LFD MI NG++PRMDTYVMLMRKFGRWGFLRPVF+
Sbjct: 237  KDCAPDVITYHCFFGCLEKPREILKLFDLMITNGIQPRMDTYVMLMRKFGRWGFLRPVFM 296

Query: 1363 IWEKIEEHGLSPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLV 1536
            +W+K+EE G SP+EFAYNALIDAL+QKGM+DMARKYDEEM+ KGLS+KPREELG KLV
Sbjct: 297  VWKKMEELGSSPNEFAYNALIDALIQKGMLDMARKYDEEMLEKGLSSKPREELGTKLV 354


>ref|NP_178170.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75264854|sp|Q9M8M3.1|PP136_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g80550, mitochondrial; Flags: Precursor
            gi|6730729|gb|AAF27119.1|AC018849_7 unknown protein;
            31926-33272 [Arabidopsis thaliana]
            gi|332198297|gb|AEE36418.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 448

 Score =  533 bits (1374), Expect = e-148
 Identities = 249/403 (61%), Positives = 321/403 (79%)
 Frame = +1

Query: 313  SSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYF 492
            S    D   ++  TV E L+ YSNDW++ALEFFNWVE   GF+HTTET+NRVIDILGKYF
Sbjct: 36   SQEEEDQSSYDQKTVCEALTCYSNDWQKALEFFNWVERESGFRHTTETFNRVIDILGKYF 95

Query: 493  EFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSF 672
            EF+ +W LI +M  ++    P+H TFRI+FKRYV+AHLV+EA D +D++++FNLRD+ SF
Sbjct: 96   EFEISWALINRMIGNTESV-PNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDETSF 154

Query: 673  SNLIDALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFW 852
             NL+DALCEHKHV+EAEELCFG N  G    F   N KI+N+ILRGW K+GWW KC+E+W
Sbjct: 155  YNLVDALCEHKHVVEAEELCFGKNVIGNG--FSVSNTKIHNLILRGWSKLGWWGKCKEYW 212

Query: 853  EEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGIS 1032
            ++MD  G+ KDL+SYSIYMDI CKSGK ++AV LYKEMK + +KLDVVAYNTVIRAIG S
Sbjct: 213  KKMDTEGVTKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRRMKLDVVAYNTVIRAIGAS 272

Query: 1033 DGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTY 1212
             GV+  ++++REM E GC+PNVAT+N+I+KL+CE+GR ++A  ML+EM K+G +P+ +TY
Sbjct: 273  QGVEFGIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAYRMLDEMPKRGCQPDSITY 332

Query: 1213 HCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGL 1392
             C F  L+KP  IL+LF RMI +GVRP+MDTYVMLMRKF RWGFL+PV  +W+ ++E G 
Sbjct: 333  MCLFSRLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGD 392

Query: 1393 SPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREEL 1521
            +PD  AYNA+IDAL+QKGM+DMAR+Y+EEMI +GLS + R EL
Sbjct: 393  TPDSAAYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRRPEL 435


>ref|XP_002887822.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297333663|gb|EFH64081.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 407

 Score =  533 bits (1372), Expect = e-148
 Identities = 249/398 (62%), Positives = 318/398 (79%)
 Frame = +1

Query: 328  DILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAA 507
            D   ++  TV E LS Y NDW++ALEFFNWVE   GF+HTTET+NR+IDILGKYFEF+  
Sbjct: 1    DQSSYDQKTVCEALSCYINDWQKALEFFNWVEKESGFRHTTETFNRMIDILGKYFEFETC 60

Query: 508  WDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLID 687
            W LI +M  +   + P+H TFRI+FKRYV+AHLV+EA D +D++++FNLRDD SF NL+D
Sbjct: 61   WALINRMIGNPE-SLPNHVTFRIVFKRYVTAHLVQEAIDAYDKLDDFNLRDDTSFYNLVD 119

Query: 688  ALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDK 867
            ALCEHKHV+EAEELCFG N     H F   N KI+N+ILRGW K+GWW KC+E+W++MD 
Sbjct: 120  ALCEHKHVVEAEELCFGKNVIA--HGFSVSNTKIHNLILRGWSKLGWWGKCKEYWDKMDT 177

Query: 868  RGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDV 1047
             G+ KDL+SYSIYMDI CKSGK ++AV LYKEMK + IKLDVVAYNTVIRAIG S GV+ 
Sbjct: 178  EGVPKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSRRIKLDVVAYNTVIRAIGASQGVEF 237

Query: 1048 ALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFG 1227
             ++++REM E GC+PNVAT+N+I+KL+CE+GR ++A  ML+EM KKG +P+ ++Y C F 
Sbjct: 238  GIRVFREMRERGCEPNVATHNTIIKLLCEDGRMRDAYRMLDEMPKKGCQPDSISYMCLFS 297

Query: 1228 SLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEF 1407
             L+KP  IL+LF RMI +GVRP+MDTYVMLMRKF RWGFL+PV  +W+ ++E G +PD  
Sbjct: 298  RLEKPSEILSLFGRMIRSGVRPKMDTYVMLMRKFERWGFLQPVLYVWKTMKESGDTPDSA 357

Query: 1408 AYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREEL 1521
            AYNA+IDAL+QKGM+DMAR+Y+EEMI +GLS + R EL
Sbjct: 358  AYNAVIDALIQKGMLDMAREYEEEMIERGLSPRRRPEL 395


>gb|EPS66144.1| hypothetical protein M569_08630, partial [Genlisea aurea]
          Length = 403

 Score =  532 bits (1370), Expect = e-148
 Identities = 256/394 (64%), Positives = 317/394 (80%), Gaps = 4/394 (1%)
 Frame = +1

Query: 337  DFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAAWDL 516
            DFN ATVLETL+ Y+NDWK ALEFFNW E   GF HT ET+NR++D LGK+FEF+ AW L
Sbjct: 10   DFNPATVLETLNCYANDWKLALEFFNWSETQSGFVHTAETFNRMVDTLGKFFEFELAWSL 69

Query: 517  IRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALC 696
            I++M +S S + P+HTTFR++ KRYVSA LVKEA D F +++E+NLRD+ SFS LID+LC
Sbjct: 70   IQRMNESPS-SPPNHTTFRVLCKRYVSARLVKEAIDAFRRLDEYNLRDETSFSILIDSLC 128

Query: 697  EHKHVIEAEELCFGSNSNGGKH-LFCSLNV---KIYNMILRGWFKMGWWRKCREFWEEMD 864
            E++HVI+AE+LCF  N +     +F   +V   KIYNMILRG+FK+ WW KCR FWE MD
Sbjct: 129  EYRHVIDAEDLCFKRNRDTEYDGVFAGFDVETTKIYNMILRGFFKIQWWGKCRAFWEAMD 188

Query: 865  KRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVD 1044
            ++GI+KDL+SYSIYMDIQCKSGK  +A+ L+KEMK+KGIK D VAYNTVIRA G   GVD
Sbjct: 189  RKGIQKDLFSYSIYMDIQCKSGKPCKAMKLFKEMKRKGIKPDAVAYNTVIRAAGEQRGVD 248

Query: 1045 VALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFF 1224
             +L++Y++MVE GC P++ T+N+ILKL+CENGR  EA EML+ M +KG  PNV+TY+CFF
Sbjct: 249  DSLRIYKQMVEAGCSPSLVTFNTILKLLCENGRYGEAREMLSWMRRKGCPPNVVTYNCFF 308

Query: 1225 GSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDE 1404
            GSL+KP  IL LFD M+  G+RPRMDTYVMLM KFGRWGFLRPV  +WEK+EE G SPDE
Sbjct: 309  GSLEKPGEILKLFDEMVGRGIRPRMDTYVMLMSKFGRWGFLRPVVYVWEKMEELGDSPDE 368

Query: 1405 FAYNALIDALVQKGMVDMARKYDEEMIAKGLSAK 1506
            FAYNALIDA + KG+V+ ARKYD+EM  KG+SAK
Sbjct: 369  FAYNALIDAFMNKGLVEEARKYDDEMYRKGISAK 402


>ref|XP_006590730.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like isoform X1 [Glycine max]
            gi|571487719|ref|XP_006590731.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like isoform X2 [Glycine max]
          Length = 449

 Score =  529 bits (1363), Expect = e-147
 Identities = 249/401 (62%), Positives = 321/401 (80%), Gaps = 2/401 (0%)
 Frame = +1

Query: 343  NSATVLETLSSYSNDWKQALEFFNWVEFSCG-FQHTTETYNRVIDILGKYFEFDAAWDLI 519
            + ATV +TL S++NDWK+ALEFFNWVE S   F H+T+T+N ++DILGK+FEF   WDLI
Sbjct: 37   DDATVRQTLLSFNNDWKRALEFFNWVEDSHSQFHHSTDTFNLMLDILGKFFEFKLCWDLI 96

Query: 520  RKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCE 699
            R+M    S + P+H TFR+MFKRYVSAH V +A DTF+++ EFNL+D  SFSNL+DALCE
Sbjct: 97   RRMNAHPS-SPPNHATFRLMFKRYVSAHSVNDAIDTFNRLGEFNLKDHTSFSNLLDALCE 155

Query: 700  HKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIK 879
            +KHVIEA++L FG+++     +    N KI+NM+LRGWFK+GWW KC EFWEEMDK+G+ 
Sbjct: 156  YKHVIEAQDLLFGNDNRVTLSVDPIGNTKIHNMVLRGWFKLGWWSKCNEFWEEMDKKGVH 215

Query: 880  KDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKL 1059
            KDL+SYSIYMDI CK GK ++AV L+KE+KKKG KLDVV YN VIRAIG+S GVD ++++
Sbjct: 216  KDLHSYSIYMDILCKGGKPWKAVKLFKEIKKKGFKLDVVVYNIVIRAIGLSHGVDFSIRV 275

Query: 1060 YREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNE-MFKKGYKPNVLTYHCFFGSLQ 1236
            +REM ELG  P V TYN++++L+C+  R+KEAL +L   M + G  P  ++YHCFF S++
Sbjct: 276  FREMKELGINPTVVTYNTLIRLLCDCYRHKEALALLRTIMPRDGCHPTAVSYHCFFASME 335

Query: 1237 KPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYN 1416
            KPK IL +FD M+E+GVRP MDTYVML+ KFGRWGFLRPVF++W K+++ G SPD  AYN
Sbjct: 336  KPKQILAMFDEMVESGVRPTMDTYVMLLNKFGRWGFLRPVFMVWNKMKQLGCSPDAAAYN 395

Query: 1417 ALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVS 1539
            ALIDALV K ++DMARKYDEEM+AKGLS KPR+ELG KL++
Sbjct: 396  ALIDALVDKALIDMARKYDEEMLAKGLSPKPRKELGTKLLA 436


>ref|XP_006389809.1| hypothetical protein EUTSA_v10018541mg [Eutrema salsugineum]
            gi|557086243|gb|ESQ27095.1| hypothetical protein
            EUTSA_v10018541mg [Eutrema salsugineum]
          Length = 448

 Score =  528 bits (1361), Expect = e-147
 Identities = 246/398 (61%), Positives = 316/398 (79%)
 Frame = +1

Query: 328  DILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKYFEFDAA 507
            D   ++  TV E L+ Y NDW++ALEFFNWV+   GF HTT+T+NR+IDILGKYFEF   
Sbjct: 42   DQSSYDQKTVCEALTCYGNDWQKALEFFNWVDKESGFSHTTDTFNRMIDILGKYFEFQTC 101

Query: 508  WDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLID 687
            W LI +M ++   + P+H TFRI+FKRY  AHLV+EA DT+D++++FNLRD+ SF NL+D
Sbjct: 102  WVLINRMAENP-LSVPNHVTFRIIFKRYAMAHLVQEALDTYDKLDDFNLRDETSFYNLVD 160

Query: 688  ALCEHKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDK 867
            +LCEHKHV+EAEELCFG N  G    F   N KI+N+ILRGW K+GWW KC+E+WE+MD 
Sbjct: 161  SLCEHKHVVEAEELCFGKNVIGNG--FSVSNTKIHNLILRGWSKLGWWGKCKEYWEKMDT 218

Query: 868  RGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDV 1047
             G+ KDL+SYSIYMDI CKSGK ++AV LYKEMK K +KLDVVAYNTVIRAIG S GV+ 
Sbjct: 219  EGVAKDLFSYSIYMDIMCKSGKPWKAVKLYKEMKSKRMKLDVVAYNTVIRAIGASQGVEF 278

Query: 1048 ALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFKKGYKPNVLTYHCFFG 1227
             ++++REM E GC+PNVAT+N+I+KL+CE+GR K+A  MLNEM KKG +P+ +TY C F 
Sbjct: 279  GMRMFREMRERGCEPNVATHNTIIKLLCEDGRMKDAYGMLNEMPKKGCQPDSVTYMCLFA 338

Query: 1228 SLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEF 1407
             L+KP  IL+LF +MI +GVRPRMDTYVML+RKF RWGFL+PV  +W+ ++E G +PD  
Sbjct: 339  RLEKPSEILSLFGKMIRSGVRPRMDTYVMLIRKFERWGFLQPVLHVWKTMKESGDTPDSA 398

Query: 1408 AYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREEL 1521
            AYNA+IDALVQKGM+DMAR+Y++EM+ +GLS + R EL
Sbjct: 399  AYNAVIDALVQKGMLDMAREYEDEMVERGLSPRRRPEL 436


>ref|XP_003542124.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80550,
            mitochondrial-like [Glycine max]
          Length = 445

 Score =  528 bits (1360), Expect = e-147
 Identities = 248/401 (61%), Positives = 321/401 (80%), Gaps = 2/401 (0%)
 Frame = +1

Query: 343  NSATVLETLSSYSNDWKQALEFFNWVEFSCG-FQHTTETYNRVIDILGKYFEFDAAWDLI 519
            + ATV +TL S++NDWK+ALEFFNWVE S   F H+T+T+N ++DILGK+FEF   WDLI
Sbjct: 33   DDATVRQTLLSFNNDWKRALEFFNWVEESHSQFHHSTDTFNLMLDILGKFFEFKLCWDLI 92

Query: 520  RKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVSFSNLIDALCE 699
            R+M    S + P+H TFR+MFKRYVSAH V +A DTF+++ EFNL+D  SFSNL+DALCE
Sbjct: 93   RRMNAHPS-SPPNHATFRLMFKRYVSAHSVNDAIDTFNRLGEFNLKDHTSFSNLLDALCE 151

Query: 700  HKHVIEAEELCFGSNSNGGKHLFCSLNVKIYNMILRGWFKMGWWRKCREFWEEMDKRGIK 879
            +KHV+EA++L FG+++     +    N KI+NM+LRGWFK+GWW KC EFWEEMDK+G+ 
Sbjct: 152  YKHVLEAQDLLFGNDNRVTLSVDPIGNTKIHNMVLRGWFKLGWWSKCNEFWEEMDKKGVH 211

Query: 880  KDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAYNTVIRAIGISDGVDVALKL 1059
            KDL+SYSIYMDI CK GK ++AV L+KE+KKKG KLDVV YN VIRAIG+S GVD ++++
Sbjct: 212  KDLHSYSIYMDILCKGGKPWKAVKLFKEIKKKGFKLDVVVYNIVIRAIGLSHGVDFSIRV 271

Query: 1060 YREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNE-MFKKGYKPNVLTYHCFFGSLQ 1236
            +REM ELG +P V TYN++++L+C+  R+KEAL +L   M   G  P  ++YHCFF S++
Sbjct: 272  FREMKELGIKPTVVTYNTLIRLLCDCYRHKEALALLRTIMPSDGCHPTAVSYHCFFASME 331

Query: 1237 KPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFLIWEKIEEHGLSPDEFAYN 1416
            KPK IL +FD M+E+GVRP MDTYVML+ KFGRWGFLRPVF++W K+++ G SPD  AYN
Sbjct: 332  KPKQILAMFDEMVESGVRPTMDTYVMLLNKFGRWGFLRPVFMVWNKMKQLGCSPDAAAYN 391

Query: 1417 ALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGKLVS 1539
            ALIDALV K ++DMARKYDEEM+AKGLS KPR+ELG KL++
Sbjct: 392  ALIDALVDKALIDMARKYDEEMLAKGLSPKPRKELGTKLLA 432


>ref|XP_003597983.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487031|gb|AES68234.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 520

 Score =  526 bits (1355), Expect = e-146
 Identities = 248/416 (59%), Positives = 324/416 (77%), Gaps = 9/416 (2%)
 Frame = +1

Query: 310  PSSNGNDILDFNSATVLETLSSYSNDWKQALEFFNWVEFSCGFQHTTETYNRVIDILGKY 489
            P+ N + I   +  TV  TL+S++ND+K+ALEFFNWVE    FQH+TETYN V+DILGK+
Sbjct: 64   PNPNPSPIPFVDHTTVRATLTSFNNDYKRALEFFNWVETKFKFQHSTETYNLVLDILGKF 123

Query: 490  FEFDAAWDLIRKMRKSSSFARPDHTTFRIMFKRYVSAHLVKEAFDTFDQMEEFNLRDDVS 669
            FEF   W+LI +MR++   + P+HTTFR+MFKRYVSAH V++A +TF ++ EFNL+D+ S
Sbjct: 124  FEFQQCWNLIHRMRQNPH-SLPNHTTFRVMFKRYVSAHCVQDAVNTFQRLNEFNLKDETS 182

Query: 670  FSNLIDALCEHKHVIEAEELCFGSNSNG---------GKHLFCSLNVKIYNMILRGWFKM 822
            FSNLIDALCE+KHV+EA++L FG   N             +  S N KI+N++LRGW+K+
Sbjct: 183  FSNLIDALCEYKHVLEAQDLVFGDKKNQTLTWIVDGVDGFVASSKNTKIFNIVLRGWYKL 242

Query: 823  GWWRKCREFWEEMDKRGIKKDLYSYSIYMDIQCKSGKAYRAVNLYKEMKKKGIKLDVVAY 1002
            GWW KC EFW+EMD+RG++KDL+SYSIYMDI  K GK ++AV L+KEMK+KGI+LDVV Y
Sbjct: 243  GWWSKCWEFWDEMDRRGVEKDLHSYSIYMDILSKGGKPWKAVKLFKEMKRKGIQLDVVVY 302

Query: 1003 NTVIRAIGISDGVDVALKLYREMVELGCQPNVATYNSILKLMCENGRNKEALEMLNEMFK 1182
            N VIRAIG+S GVD +++++ EM +LG  P V TYN+I++L+C++ R KEAL ++  M +
Sbjct: 303  NIVIRAIGVSQGVDFSIRMFCEMKDLGLNPTVVTYNTIIRLLCDSYRYKEALTLIRTMRR 362

Query: 1183 KGYKPNVLTYHCFFGSLQKPKMILNLFDRMIENGVRPRMDTYVMLMRKFGRWGFLRPVFL 1362
             G  PN ++Y CFF  L+KPK I+ LFD MIE+GVRP MDTYVML++KF RWGFLR VFL
Sbjct: 363  DGCSPNAVSYQCFFACLEKPKFIIELFDGMIESGVRPTMDTYVMLLKKFARWGFLRLVFL 422

Query: 1363 IWEKIEEHGLSPDEFAYNALIDALVQKGMVDMARKYDEEMIAKGLSAKPREELGGK 1530
            +W ++EE G SPD  AYNALIDALV+KG++DMARKYDEEM+AKGLS KPR+ELG K
Sbjct: 423  VWNRMEELGCSPDASAYNALIDALVEKGLIDMARKYDEEMLAKGLSPKPRKELGTK 478


Top