BLASTX nr result

ID: Forsythia22_contig00008391 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00008391
         (1064 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containi...   474   e-131
ref|XP_011088649.1| PREDICTED: pentatricopeptide repeat-containi...   439   e-120
emb|CDP14534.1| unnamed protein product [Coffea canephora]            412   e-112
ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   404   e-110
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   401   e-109
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   395   e-107
ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containi...   393   e-106
ref|XP_009761148.1| PREDICTED: pentatricopeptide repeat-containi...   389   e-105
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   389   e-105
gb|KDO68195.1| hypothetical protein CISIN_1g042756mg, partial [C...   371   e-100
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   371   e-100
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   368   4e-99
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   368   4e-99
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   367   6e-99
ref|XP_010086846.1| hypothetical protein L484_006076 [Morus nota...   367   8e-99
ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containi...   367   1e-98
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   360   1e-96
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           360   1e-96
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   360   1e-96
ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containi...   359   2e-96

>ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Erythranthe guttatus] gi|604333640|gb|EYU37991.1|
            hypothetical protein MIMGU_mgv1a006093mg [Erythranthe
            guttata]
          Length = 458

 Score =  474 bits (1220), Expect = e-131
 Identities = 230/319 (72%), Positives = 268/319 (84%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L   LY II QESWF+WN+KLVADLI+LL K +RFDEA+NLF E VSKLGFKER+LC FY
Sbjct: 86   LAFPLYGIIEQESWFTWNSKLVADLISLLYKAERFDEADNLFGETVSKLGFKERDLCTFY 145

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNLVDSHAKH SE+ V DSC  LKQLIL SSSVYVK+KGYES++ GFC IG P++AEN +
Sbjct: 146  CNLVDSHAKHMSERGVSDSCTRLKQLILASSSVYVKQKGYESMIAGFCEIGSPDKAENLM 205

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GLKPS FE R+L+YGYG++G  EDMKRS+ QME EGFELDTVC NMV+SSFGA 
Sbjct: 206  EEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKRSVGQMEKEGFELDTVCYNMVLSSFGAR 265

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N+  DM+ WLKK+RNSGI  SIRTYNSVLNSCP +  +++D+K+LPLS+NELVDNL   E
Sbjct: 266  NEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCPTVILLLEDMKSLPLSVNELVDNLKTGE 325

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            ADLV+EL+KS VLDQ MEWKS+ELKLD+HGMHLS+AYLILLQWF E+++RF  GN   P 
Sbjct: 326  ADLVLELMKSDVLDQVMEWKSTELKLDMHGMHLSTAYLILLQWFKELKVRFGDGNHETPT 385

Query: 57   EILVVCGSGKHSAVRGESP 1
            EILVVCGSGKHS+ RGESP
Sbjct: 386  EILVVCGSGKHSSKRGESP 404


>ref|XP_011088649.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Sesamum indicum]
          Length = 454

 Score =  439 bits (1129), Expect = e-120
 Identities = 208/319 (65%), Positives = 263/319 (82%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L   LY++I QESWFSWNTKL+ADLIA L K++ FD+AE+L +E V +L FK+R+LC FY
Sbjct: 86   LAFPLYSMIKQESWFSWNTKLLADLIAFLYKEEHFDDAEDLLTETVMRLRFKKRDLCMFY 145

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNLV+SHAKHKSE  V+DSC +L+ LI  +SSVYV+ + Y S+V GFC +GLP++AEN +
Sbjct: 146  CNLVESHAKHKSEGGVLDSCTQLRHLIFLTSSVYVRHRAYGSMVAGFCEVGLPDKAENLM 205

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            +EMR  GLKPS+FE RSL+YGYG++G  EDMKRSI+Q+E +GFELDTV CNMV+SSFGAH
Sbjct: 206  QEMRENGLKPSVFELRSLVYGYGQMGFLEDMKRSIVQVEKDGFELDTVGCNMVLSSFGAH 265

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N+L +M+SWLKK+   GI  S RTYNSVLNSCP I  M++D+KNLPLS +EL+ NL  +E
Sbjct: 266  NELLEMLSWLKKMTTLGIPFSTRTYNSVLNSCPTIILMLEDMKNLPLSTDELLGNLKVEE 325

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A+LV+EL+KS+VLDQ MEW SSELKLD+HGMHL++AYL+LLQ F E++LRF+AGN   P 
Sbjct: 326  ANLVLELLKSTVLDQVMEWGSSELKLDMHGMHLTTAYLVLLQCFKELKLRFLAGNHTTPT 385

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI V+CG GKHS+ RGESP
Sbjct: 386  EISVICGCGKHSSTRGESP 404


>emb|CDP14534.1| unnamed protein product [Coffea canephora]
          Length = 449

 Score =  412 bits (1058), Expect = e-112
 Identities = 200/323 (61%), Positives = 257/323 (79%)
 Frame = -3

Query: 969  ITFYLCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKEREL 790
            ++++L L LY IISQ SWFSWN KL+AD+ AL+ KQ+RF EAE L  +A+ KL   +R+L
Sbjct: 89   LSYHLALPLYLIISQASWFSWNAKLLADVTALMYKQERFIEAEALILQALKKLPAHDRDL 148

Query: 789  CNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEA 610
            CNFYC+L+ S+AKH+S K V DS   LKQL+ +SSSVYV+++ YES++ G C IGLP EA
Sbjct: 149  CNFYCHLLHSNAKHRSRKGVFDSLTSLKQLLARSSSVYVQKRAYESMISGLCEIGLPGEA 208

Query: 609  ENSIEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSS 430
            EN +EEMR VGLKPS FE +SL++ YGRLG FEDMKRS+ QME+ G ELDTVC NMV+SS
Sbjct: 209  ENLMEEMRGVGLKPSGFEFKSLVHAYGRLGLFEDMKRSVTQMEDAGVELDTVCSNMVLSS 268

Query: 429  FGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNL 250
             G+H   S+MVSWL+++++S +S SIRTYNSVLNSCP +  ++QD K +PLS+ +L+ NL
Sbjct: 269  LGSHKVFSEMVSWLRRMKDSEVSFSIRTYNSVLNSCPTLILLLQDPKTIPLSMEDLMGNL 328

Query: 249  NKDEADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQ 70
            +++EADLV ELV SSVLD+AME  S+ELKLDLHGMHLS++ LI LQW   ++LRF AG+ 
Sbjct: 329  SQEEADLVRELVASSVLDEAMECNSAELKLDLHGMHLSTSCLIFLQWIDRLRLRFSAGDN 388

Query: 69   VAPAEILVVCGSGKHSAVRGESP 1
            + P +I VVCGSGKHSA RGESP
Sbjct: 389  MVPTQITVVCGSGKHSASRGESP 411


>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  404 bits (1039), Expect = e-110
 Identities = 195/319 (61%), Positives = 258/319 (80%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L LY  IS+ SWF WN+KLVADL+ALL K +RFDEAE L +E VSKLG +ER+LC+FY
Sbjct: 92   LALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSRERDLCSFY 151

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
              L+ S +KH SE+ V+D C +LK ++L+SSSVY+K++GY S+V GFC IGLP +AE  +
Sbjct: 152  SQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGYASMVEGFCLIGLPRKAEELM 211

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEM+ +GLK S FE RSL+Y YG+ G   DMKR +++ME+ GF+LDTV  NMV++SFG+H
Sbjct: 212  EEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMESMGFQLDTVSSNMVLNSFGSH 271

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N+LS++VS L+K+  SG+  SIRTYNSVLNSCP I+ ++QDLK++PLS+ EL+ NL+++E
Sbjct: 272  NELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEELMGNLDENE 331

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A LV  LV SSVL++ M+WK SELKLDLHGMHL+SAY+I+LQWF ++Q +F+A N+V P 
Sbjct: 332  AVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVIILQWFHQLQCKFLAENRVLPG 391

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI+VVCG+GKHS VRGESP
Sbjct: 392  EIIVVCGAGKHSVVRGESP 410


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Solanum lycopersicum]
          Length = 459

 Score =  401 bits (1031), Expect = e-109
 Identities = 199/347 (57%), Positives = 268/347 (77%), Gaps = 7/347 (2%)
 Frame = -3

Query: 1020 GLSWTHAFFTLMVYIIAIT----FYLC---LQLYTIISQESWFSWNTKLVADLIALLNKQ 862
            G S  H   + + ++++ T    + LC   L LY  IS+ SWF WN+KLVA+L+ALL K 
Sbjct: 64   GSSSKHVALSTLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKL 123

Query: 861  KRFDEAENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSS 682
            +RFDEAE L +E+VSKLG +ER+LC+FY  L+ S +KH SE+ V+D C +LK ++L SSS
Sbjct: 124  ERFDEAETLVTESVSKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSS 183

Query: 681  VYVKRKGYESIVGGFCAIGLPNEAENSIEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMK 502
            VY+K++GY S+V GFC IGLP +AE  +EEM+ +GLK S FE RSL+Y YG+ G   DMK
Sbjct: 184  VYLKQRGYASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMK 243

Query: 501  RSIIQMENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSC 322
            R +++ME  GF+LDTV  NMV++SFG+HN+LS++VS L+K+  SG+  SIRTYNSVLNSC
Sbjct: 244  RIVVEMERMGFQLDTVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSC 303

Query: 321  PEITSMVQDLKNLPLSINELVDNLNKDEADLVMELVKSSVLDQAMEWKSSELKLDLHGMH 142
            P I+ ++QDLK++PLS+ EL+ NL+++EA LV  LV SSVL++ M+WK  ELKLDLHGMH
Sbjct: 304  PTISLLLQDLKSVPLSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMH 363

Query: 141  LSSAYLILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESP 1
            L+SAYLI+LQWF ++Q +F+A N+V P EI+VVCG+GKHS VRGESP
Sbjct: 364  LTSAYLIILQWFHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESP 410


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  395 bits (1016), Expect = e-107
 Identities = 197/319 (61%), Positives = 246/319 (77%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L LY+ IS+ SWFSWN KL+AD+IALL KQ +  EAE L SE + KLG +ER+L +FY
Sbjct: 79   LALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLVSFY 138

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNL+DSH+KH S + V D  + L +++ +SSSVYVK + Y+S++   CA+GLP EAEN I
Sbjct: 139  CNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAENLI 198

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GLKPS+FE RS++YGYGR+G  EDM+R ++QM NEGFELDTV  NMV+SS+GA+
Sbjct: 199  EEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSYGAY 258

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            NK S+MVSWL++++NS I  SIRTYNSVLNSCP I S++QDLK  P +I+EL++ L  DE
Sbjct: 259  NKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLKGDE 318

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A LV EL+ S VL + MEW  SE KLDLHGMHL SAYLI+LQW  E++ R  A   V P 
Sbjct: 319  ALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAEYVMPV 378

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI VVCGSGKHS+VRGESP
Sbjct: 379  EITVVCGSGKHSSVRGESP 397


>ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nicotiana tomentosiformis]
          Length = 459

 Score =  393 bits (1009), Expect = e-106
 Identities = 195/319 (61%), Positives = 249/319 (78%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L LY  IS+ SWF WN+KLVADL+ALL K +RFDEAE L +E VSKLG +ER+LC+FY
Sbjct: 92   LALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGGRERDLCSFY 151

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
              L+ S +KHKSEK V+D C +LK  +  SSSVY+K++GY S+V  FC+IGLP +AE  I
Sbjct: 152  SQLIHSQSKHKSEKGVLDFCTKLKLFLSCSSSVYLKQQGYASMVDAFCSIGLPRDAEELI 211

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEM+ +GLK S FE R+L+Y YG+ G F DMKR + QME+ G +LDTV  NMV++SFG+ 
Sbjct: 212  EEMKELGLKLSKFEFRALVYSYGKSGFFSDMKRIVGQMESMGLQLDTVGANMVLNSFGSQ 271

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
             +LS+MVSWL+K+  SG+  SIRTYNSVLNSCP I+ ++QD K++PLS+ EL+ NLN++E
Sbjct: 272  YELSEMVSWLQKMDVSGVPFSIRTYNSVLNSCPTISLLLQDPKSVPLSLEELLANLNENE 331

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A LV  LV SSVL++ M+W  SELKLDLHGMH SSAY+I+LQWF ++Q +  A N+V PA
Sbjct: 332  ASLVKILVGSSVLEETMQWNPSELKLDLHGMHFSSAYVIILQWFHQLQCKLDAENRVLPA 391

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI VVCG+GKHS VRGESP
Sbjct: 392  EITVVCGAGKHSVVRGESP 410


>ref|XP_009761148.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nicotiana sylvestris]
          Length = 455

 Score =  389 bits (1000), Expect = e-105
 Identities = 192/319 (60%), Positives = 251/319 (78%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L LY  IS+ SWF WN+KLVADL+ALL K +RFDEAE L +E VSKLG +ER+LC+FY
Sbjct: 92   LALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSRERDLCSFY 151

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
              L+ S +K KSE+ V++   +LKQ+IL SSSVY+K++GY S+V  FC+IGLP EAE  +
Sbjct: 152  SQLIHSLSKQKSERGVLNFVTKLKQVILCSSSVYLKQQGYASMVDAFCSIGLPREAEEFM 211

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEM+ +GLK S FE R+L+Y YG+ G F +MKR + QM+  G +LDTV  NMV++SFG+ 
Sbjct: 212  EEMKELGLKLSKFEFRALVYSYGKSGCFSEMKRIVGQMDGLGLKLDTVGANMVLNSFGSQ 271

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
             +LS+MVSWL+K++ S +  SIRTYNSVLNSCP I+ ++QD K+LPLS+ EL+ NLN++E
Sbjct: 272  YELSEMVSWLRKMKASDVPFSIRTYNSVLNSCPTISHLLQDPKSLPLSLEELMGNLNENE 331

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A LV  LV SSVL++ M+W  SELKLDLHGMHLSSAY+++LQWF ++Q + +A N+V PA
Sbjct: 332  AGLVKILVGSSVLEETMQWNPSELKLDLHGMHLSSAYVVILQWFHQLQCKLVAENRVLPA 391

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI VVCG+GKHS VRGESP
Sbjct: 392  EITVVCGTGKHSVVRGESP 410


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  389 bits (999), Expect = e-105
 Identities = 198/320 (61%), Positives = 244/320 (76%), Gaps = 1/320 (0%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L   LYT IS+ SW++WN KLVA+LIALL KQ R+DE+E L S+AVSKL F+ER+L  FY
Sbjct: 93   LAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQFY 152

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CN ++S +KH S++   D+   L +LI  SSSVYVKR+GY+S+V   C +  PNEAEN +
Sbjct: 153  CNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLV 212

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GL P+LFE R + YGYG+LG FEDM+R + +ME EGFE+DT+C NMV+SS+GA+
Sbjct: 213  EEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAY 272

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N  S MV WL+K++   I  SIRTYNSVLNSCPEI S+VQ L ++PLS+ EL   LN+DE
Sbjct: 273  NAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNEDE 332

Query: 237  ADLVMELVK-SSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            A LV ELVK SSVLD+AMEW  SE KLDLHGMHL SAYLI+LQW  EM+ RF     V P
Sbjct: 333  ALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKVEECVIP 392

Query: 60   AEILVVCGSGKHSAVRGESP 1
            A+I +VCGSGKHS+VRGESP
Sbjct: 393  AQITIVCGSGKHSSVRGESP 412


>gb|KDO68195.1| hypothetical protein CISIN_1g042756mg, partial [Citrus sinensis]
          Length = 425

 Score =  371 bits (953), Expect = e-100
 Identities = 190/320 (59%), Positives = 236/320 (73%), Gaps = 2/320 (0%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L   LY  I++ESWF WN KLVA++IA L+KQ + +EAE L  E +SKLG +EREL  FY
Sbjct: 93   LAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAETLILETLSKLGSRERELVLFY 152

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNL+DS  KH S++   D+   L QL+  SSSVYVKR+  +S++ G C +G P+EAEN I
Sbjct: 153  CNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQALKSMISGLCEMGQPHEAENLI 212

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GL+PS FE + +IYGYGRLG  EDM+R + QME++G  +DTVC NMV+SS+G H
Sbjct: 213  EEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQMESDGTRVDTVCSNMVLSSYGDH 272

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDL--KNLPLSINELVDNLNK 244
            N+LS MV WL+K+++SGI  S+RTYNSVLNSC  I SM+QDL   + PLSI EL + LN+
Sbjct: 273  NELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSMLQDLNSNDFPLSILELTEVLNE 332

Query: 243  DEADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVA 64
            +E  +V EL  SSVLD+AM+W S E KLDLHGMHL SAY I+LQW  EM+ RF     V 
Sbjct: 333  EEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAYFIILQWMDEMRNRFNNEKHVI 392

Query: 63   PAEILVVCGSGKHSAVRGES 4
            PAEI VVCGSGKHS VRGES
Sbjct: 393  PAEITVVCGSGKHSTVRGES 412


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  371 bits (953), Expect = e-100
 Identities = 190/320 (59%), Positives = 236/320 (73%), Gaps = 2/320 (0%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L   LY  I++ESWF WN KLVA++IA L+KQ + +EAE L  E +SKLG +EREL  FY
Sbjct: 93   LAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAETLILETLSKLGSRERELVLFY 152

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNL+DS  KH S++   D+   L QL+  SSSVYVKR+  +S++ G C +G P+EAEN I
Sbjct: 153  CNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQALKSMISGLCEMGQPHEAENLI 212

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GL+PS FE + +IYGYGRLG  EDM+R + QME++G  +DTVC NMV+SS+G H
Sbjct: 213  EEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQMESDGTRVDTVCSNMVLSSYGDH 272

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDL--KNLPLSINELVDNLNK 244
            N+LS MV WL+K+++SGI  S+RTYNSVLNSC  I SM+QDL   + PLSI EL + LN+
Sbjct: 273  NELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSMLQDLNSNDFPLSILELTEVLNE 332

Query: 243  DEADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVA 64
            +E  +V EL  SSVLD+AM+W S E KLDLHGMHL SAY I+LQW  EM+ RF     V 
Sbjct: 333  EEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAYFIILQWMDEMRNRFNNEKHVI 392

Query: 63   PAEILVVCGSGKHSAVRGES 4
            PAEI VVCGSGKHS VRGES
Sbjct: 393  PAEITVVCGSGKHSTVRGES 412


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  368 bits (945), Expect = 4e-99
 Identities = 180/320 (56%), Positives = 243/320 (75%)
 Frame = -3

Query: 960  YLCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNF 781
            +  L LY+ I++ SWF WN KL+A+L+ALLNKQ+R  E+E L S AVS+L   ER++  F
Sbjct: 111  FFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSNAVSRLKSNERDIALF 170

Query: 780  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENS 601
            YCNLV+S++K  S +   ++C  L+++  +S+SVYVK + Y+S+V G C +  P++AE+ 
Sbjct: 171  YCNLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESV 230

Query: 600  IEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 421
            IEEMR   +KP LFE +S++YGYGRLG FEDM R + +ME EG ++DTVC NMV+SS+GA
Sbjct: 231  IEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGA 290

Query: 420  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKD 241
            HN L  M SWL+K+++S + LS RTYNSVLNSCP I S+++DL + P+S++EL+  LNKD
Sbjct: 291  HNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKD 350

Query: 240  EADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            E  LV  L +SSVLD+A+EW S E KLDLHGMHLSS+YLI++QW  EM++RF  G  V P
Sbjct: 351  EEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWMDEMRIRFSEGKCVVP 410

Query: 60   AEILVVCGSGKHSAVRGESP 1
            AEI++V GSGKHS VRGESP
Sbjct: 411  AEIVLVSGSGKHSNVRGESP 430


>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  368 bits (945), Expect = 4e-99
 Identities = 185/319 (57%), Positives = 239/319 (74%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L  Y+ I++ SWF WN KLVA L+ALL+KQ + +EAE L SE +SKLG +EREL  F+
Sbjct: 91   LALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELALFH 150

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            C LV+SH+K  S+     S + L QL+  SSSVYVK + +ES+V G C +  P EA+N I
Sbjct: 151  CQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREADNLI 210

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GLKPS+FE RS++YGYGRLG FEDM + + QMEN+G  +DT+C NMV+SS+GAH
Sbjct: 211  EEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSYGAH 270

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            ++L+ M+ WL+K+++  +  SIRTYNSVLNSC  I +M+Q+ K+ P SI EL   LN DE
Sbjct: 271  SELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLNGDE 330

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A LV ELV+S+VLD+ M W+  E KLDLHGMHL SAYLILL+WF  M+ RF +G  V PA
Sbjct: 331  ALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKDVIPA 390

Query: 57   EILVVCGSGKHSAVRGESP 1
            E++V+CGSGKHS+VRGESP
Sbjct: 391  EVVVICGSGKHSSVRGESP 409


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  367 bits (943), Expect = 6e-99
 Identities = 181/319 (56%), Positives = 238/319 (74%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            L L +Y+ I++ SWF WN KLVA L+ALL KQ +  ++E L SE +SKLG KEREL  F+
Sbjct: 92   LALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEALISETISKLGNKERELVQFH 151

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            C LV+SH+K  S+     +C  L QL+  SSSVYVKR+ +ES+VGG CA+  P EA+  I
Sbjct: 152  CQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFESMVGGLCAMDRPGEADELI 211

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            EEMR  GLK S+FE RS++YGYGRLG FE+M + + QME +GF  DT+CCNMV+SS+GAH
Sbjct: 212  EEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQGFGDDTICCNMVLSSYGAH 271

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N+L+ M +WL+K++ S +  S+RTYNSVLNSCP I +M+Q+ K +P S+ EL   L+ DE
Sbjct: 272  NELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQEPKAVPCSVGELSGVLDGDE 331

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
            A +V ELV S+V+D+AM W S+E KLDLHGMHL SAYL++L+WF  M  RF +   V PA
Sbjct: 332  ALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVMLEWFEAMGNRFKSAECVVPA 391

Query: 57   EILVVCGSGKHSAVRGESP 1
            E+++VCG GKHS+VRGESP
Sbjct: 392  EVVIVCGLGKHSSVRGESP 410


>ref|XP_010086846.1| hypothetical protein L484_006076 [Morus notabilis]
            gi|587833217|gb|EXB24044.1| hypothetical protein
            L484_006076 [Morus notabilis]
          Length = 517

 Score =  367 bits (942), Expect = 8e-99
 Identities = 189/317 (59%), Positives = 236/317 (74%)
 Frame = -3

Query: 951  LQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFYCN 772
            L LY+ I + SWF ++ KLVA L ALL+KQ R+ EAE L +EAVSKLG ++REL  FYC+
Sbjct: 163  LHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELAVFYCS 222

Query: 771  LVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSIEE 592
            LV+SH+K  S+     S   L QL+  SSS YVK + +E++VG  C +  P EAE+ +EE
Sbjct: 223  LVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCEAESLMEE 282

Query: 591  MRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAHNK 412
            MR+ GLKPS+FE RSL+YGYGRLG +EDM R++ QME EG  +DT+C NMV+SS+GAHN+
Sbjct: 283  MRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSYGAHNE 342

Query: 411  LSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDEAD 232
            L  MV WL+K+R S I  SIRTYNSVLN CP IT+M+QDLK++PLS+ EL   L  DE  
Sbjct: 343  LQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNATLRGDEGL 402

Query: 231  LVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAEI 52
            LVMELV SSVL++ + W S E+KLDLHGMHL SAYLI+L+W  EM  RF  GN   PAE+
Sbjct: 403  LVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDGNHGIPAEV 462

Query: 51   LVVCGSGKHSAVRGESP 1
            +VVCGSGKHS VRG SP
Sbjct: 463  VVVCGSGKHSNVRGVSP 479


>ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Tarenaya hassleriana] gi|729371006|ref|XP_010548125.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g17033 [Tarenaya hassleriana]
          Length = 462

 Score =  367 bits (941), Expect = 1e-98
 Identities = 181/319 (56%), Positives = 244/319 (76%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            + L LY+ I++  WF WN KLVADL+ALLNKQ++F E+E+L S AVS+L   ER L  F+
Sbjct: 105  IALNLYSEIAEAPWFDWNPKLVADLVALLNKQEQFPESESLLSAAVSRLKPNERGLALFH 164

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNLV+S++K  S +   DS + L+++I +SSSVYVK +GY+SIV G C +  P +AE  +
Sbjct: 165  CNLVESNSKQGSTRGFNDSYSCLREIIQRSSSVYVKSQGYKSIVSGLCNMDRPYDAERVL 224

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
             EM+  G+KP LFE RS++YGYGRLG F DM R++ +ME++G ++DTVC NMV+SS+GA 
Sbjct: 225  AEMKTEGIKPELFEYRSVLYGYGRLGLFFDMNRTVHEMESDGHKIDTVCSNMVLSSYGAR 284

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            + L +M SWL+K++  GI LSIRTYNSVLNSCP ITS+++DL + P+S++EL   LN+DE
Sbjct: 285  DALPEMGSWLQKLKGFGIPLSIRTYNSVLNSCPTITSLLKDLDSCPVSLSELTGLLNEDE 344

Query: 237  ADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 58
              L  ELV+SSVLD+AMEW + E KLDLHGMHLSS+YLI++QW  ++++RF  G  V P 
Sbjct: 345  MLLTRELVQSSVLDEAMEWNALEGKLDLHGMHLSSSYLIMMQWMDKVRIRFEEGKHVIPV 404

Query: 57   EILVVCGSGKHSAVRGESP 1
            EI++V GSGKHS VRGESP
Sbjct: 405  EIVIVSGSGKHSNVRGESP 423


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  360 bits (923), Expect = 1e-96
 Identities = 182/320 (56%), Positives = 238/320 (74%)
 Frame = -3

Query: 960  YLCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNF 781
            +  L LY+ I++ SWF WN KL+A+LIALLNKQ+RFDE+E L S AVS+L   ER+   F
Sbjct: 147  FFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLF 206

Query: 780  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENS 601
             CNLV+S++K  S +   ++   L+++I +SSSVYVK + Y+S+V G C +  P++AE  
Sbjct: 207  LCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERV 266

Query: 600  IEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 421
            IEEMR   +KP LFE +S++YGYGRLG F+DM R + +M  EG ++DTVC NMV+SS+GA
Sbjct: 267  IEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 326

Query: 420  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKD 241
            H+ L  M SWL+K++   +  SIRTYNSVLNSCP I SM++DL + P+S++EL   LN+D
Sbjct: 327  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 386

Query: 240  EADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            EA LV EL +SSVLD+A+EW + E KLDLHGMHLSS+YLILLQW  E +LRF     V P
Sbjct: 387  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 446

Query: 60   AEILVVCGSGKHSAVRGESP 1
            AEI+VV GSGKHS VRGESP
Sbjct: 447  AEIVVVSGSGKHSNVRGESP 466


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  360 bits (923), Expect = 1e-96
 Identities = 182/320 (56%), Positives = 238/320 (74%)
 Frame = -3

Query: 960  YLCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNF 781
            +  L LY+ I++ SWF WN KL+A+LIALLNKQ+RFDE+E L S AVS+L   ER+   F
Sbjct: 143  FFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLF 202

Query: 780  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENS 601
             CNLV+S++K  S +   ++   L+++I +SSSVYVK + Y+S+V G C +  P++AE  
Sbjct: 203  LCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERV 262

Query: 600  IEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 421
            IEEMR   +KP LFE +S++YGYGRLG F+DM R + +M  EG ++DTVC NMV+SS+GA
Sbjct: 263  IEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 322

Query: 420  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKD 241
            H+ L  M SWL+K++   +  SIRTYNSVLNSCP I SM++DL + P+S++EL   LN+D
Sbjct: 323  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 382

Query: 240  EADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            EA LV EL +SSVLD+A+EW + E KLDLHGMHLSS+YLILLQW  E +LRF     V P
Sbjct: 383  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442

Query: 60   AEILVVCGSGKHSAVRGESP 1
            AEI+VV GSGKHS VRGESP
Sbjct: 443  AEIVVVSGSGKHSNVRGESP 462


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  360 bits (923), Expect = 1e-96
 Identities = 182/320 (56%), Positives = 238/320 (74%)
 Frame = -3

Query: 960  YLCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNF 781
            +  L LY+ I++ SWF WN KL+A+LIALLNKQ+RFDE+E L S AVS+L   ER+   F
Sbjct: 146  FFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLF 205

Query: 780  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENS 601
             CNLV+S++K  S +   ++   L+++I +SSSVYVK + Y+S+V G C +  P++AE  
Sbjct: 206  LCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERV 265

Query: 600  IEEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 421
            IEEMR   +KP LFE +S++YGYGRLG F+DM R + +M  EG ++DTVC NMV+SS+GA
Sbjct: 266  IEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 325

Query: 420  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKD 241
            H+ L  M SWL+K++   +  SIRTYNSVLNSCP I SM++DL + P+S++EL   LN+D
Sbjct: 326  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 385

Query: 240  EADLVMELVKSSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            EA LV EL +SSVLD+A+EW + E KLDLHGMHLSS+YLILLQW  E +LRF     V P
Sbjct: 386  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445

Query: 60   AEILVVCGSGKHSAVRGESP 1
            AEI+VV GSGKHS VRGESP
Sbjct: 446  AEIVVVSGSGKHSNVRGESP 465


>ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Gossypium raimondii] gi|763765430|gb|KJB32684.1|
            hypothetical protein B456_005G255600 [Gossypium
            raimondii]
          Length = 458

 Score =  359 bits (922), Expect = 2e-96
 Identities = 180/320 (56%), Positives = 239/320 (74%), Gaps = 1/320 (0%)
 Frame = -3

Query: 957  LCLQLYTIISQESWFSWNTKLVADLIALLNKQKRFDEAENLFSEAVSKLGFKERELCNFY 778
            +   LYT IS+ SW++WN KLVADL+ LL+ Q + DE++ L S+ VSKL FKER+L  FY
Sbjct: 97   IAFPLYTKISEASWYNWNPKLVADLVPLLDIQGKHDESQALISQVVSKLKFKERDLVQFY 156

Query: 777  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVGGFCAIGLPNEAENSI 598
            CNL++S +KH+S++   D+   L +L+  SSS+YVK++GY+S+V   C +G PNEAEN +
Sbjct: 157  CNLIESCSKHESKQGFNDAYGYLSELVNNSSSMYVKKQGYKSMVSSLCEMGQPNEAENVV 216

Query: 597  EEMRNVGLKPSLFENRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 418
            E+M   G+KPSLFE R ++YGYG++G FEDM+R + +ME EGF +DT+  NM++SS+GA+
Sbjct: 217  EDMIKNGVKPSLFELRFVLYGYGKMGFFEDMERMVKKMEIEGFGVDTISSNMILSSYGAY 276

Query: 417  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITSMVQDLKNLPLSINELVDNLNKDE 238
            N L  MV WL+K++   I  SIRTYN VLNSCP I S V+     P+S++ELV+ L++DE
Sbjct: 277  NALPKMVPWLQKMKALEIPFSIRTYNCVLNSCPMIMSFVRGSGGFPVSVSELVNVLDEDE 336

Query: 237  ADLVMELVK-SSVLDQAMEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 61
            A LV ELV+ SSVLD+AMEW   ELKLDLHGMH  SAYLI+LQW  EM+ RF     V P
Sbjct: 337  ALLVKELVESSSVLDEAMEWDDLELKLDLHGMHSGSAYLIMLQWIKEMKSRFRVKECVVP 396

Query: 60   AEILVVCGSGKHSAVRGESP 1
            A+I VVCG+GKHS+VRGESP
Sbjct: 397  AQITVVCGTGKHSSVRGESP 416

