BLASTX nr result

ID: Rehmannia23_contig00009197 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00009197
         (1243 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004236182.1| PREDICTED: pentatricopeptide repeat-containi...   372   e-100
ref|XP_006367706.1| PREDICTED: pentatricopeptide repeat-containi...   370   e-100
emb|CBI15289.3| unnamed protein product [Vitis vinifera]              344   4e-92
ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containi...   341   4e-91
ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containi...   331   4e-88
gb|EMJ11573.1| hypothetical protein PRUPE_ppa001256mg [Prunus pe...   328   2e-87
ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citr...   327   6e-87
ref|XP_002511505.1| pentatricopeptide repeat-containing protein,...   326   1e-86
ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containi...   325   3e-86
gb|EOY21688.1| Pentatricopeptide repeat-containing protein, puta...   312   2e-82
ref|XP_002321537.1| pentatricopeptide repeat-containing family p...   308   3e-81
gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis]     301   3e-79
ref|XP_004138146.1| PREDICTED: pentatricopeptide repeat-containi...   298   4e-78
gb|ESW29652.1| hypothetical protein PHAVU_002G087700g [Phaseolus...   296   9e-78
ref|XP_004154991.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   296   2e-77
ref|XP_004513211.1| PREDICTED: pentatricopeptide repeat-containi...   291   4e-76
ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containi...   289   1e-75
gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana]              286   1e-74
ref|NP_173324.1| pentatricopeptide repeat-containing protein [Ar...   286   1e-74
ref|NP_001185030.1| pentatricopeptide repeat-containing protein ...   286   1e-74

>ref|XP_004236182.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Solanum lycopersicum]
          Length = 864

 Score =  372 bits (955), Expect = e-100
 Identities = 221/419 (52%), Positives = 267/419 (63%), Gaps = 29/419 (6%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK LGTLSQSARSFFL GSR                    R   + +V+H   SS+L
Sbjct: 1    MLRAKQLGTLSQSARSFFLGGSRCSAGDGSSCTCSEDETCISKRLQTKNDVRHPQISSNL 60

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEH-NSTLSQGVAIACS-LDRAEDSVSYAD---DFD 416
            +SK+SVGV  L S D+VK V +K  E  +S +S+  A+  S L    DSV Y +   +  
Sbjct: 61   VSKSSVGVGALLSGDAVKVVTSKKNESLDSPVSRPQAVPVSTLSGRVDSVKYGNIDAEIT 120

Query: 417  VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSAN 596
            V  SPPI+DQFV+AG+AAV  LSD+VNY+IPM DGS +L+SS N MVD TKP+SNVR +N
Sbjct: 121  VQSSPPISDQFVRAGIAAVSFLSDVVNYKIPMLDGSKVLSSSNNCMVDHTKPVSNVRPSN 180

Query: 597  TKNPRKDKVYVKPSAAPNFTSGAT-------GSGDRSVSEKG---VTNESNNFVETHGIS 746
                R DK+  K S     TS            GD+S S +G   V+N S   VE+HG+ 
Sbjct: 181  INTSRNDKLQGKASTDTPVTSTLAHNSNYTKNKGDKSNSVRGRNPVSNSSVGKVESHGVI 240

Query: 747  SEPRDRVKSIPQRARANSNRFMPNTPQSDG---------FTRAIRQAK-----VNTRQFI 884
             + RDR K++P + R   NR M N   S+G         F+R  R  K     V  RQF 
Sbjct: 241  PDSRDRRKTMPPKPRTYPNRNMTNVRGSEGKLKHEIPEGFSRPQRATKLPSAGVMVRQFS 300

Query: 885  SSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKR 1064
            +S+ VV +V  I+ Q  W P TE AL +LN  LD YQANQVLKQ+ D+ VALGFFYWLK+
Sbjct: 301  NSSHVVGTVSRIIQQLNWSPETENALRELNYLLDPYQANQVLKQIHDHAVALGFFYWLKQ 360

Query: 1065 KPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            +PGFKHDGHTYTTMVGILGRARQFGAINKLL+QMV DGC+PNVVTYNRLIHSYGRANYL
Sbjct: 361  QPGFKHDGHTYTTMVGILGRARQFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 419


>ref|XP_006367706.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Solanum tuberosum]
          Length = 864

 Score =  370 bits (950), Expect = e-100
 Identities = 222/419 (52%), Positives = 267/419 (63%), Gaps = 29/419 (6%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK LGTLSQSARSFFL GSR                    R   R +V+H   SS+L
Sbjct: 1    MLRAKQLGTLSQSARSFFLGGSRCSAGDGSSCTCSEDDTCISKRPQTRNDVRHPQISSNL 60

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEH-NSTLSQGVAIACS-LDRAEDSVSYAD---DFD 416
            +SK+SVGV  L S D+VK V++K  E  +S +S+  AI  S L    DSV Y +   +  
Sbjct: 61   VSKSSVGVGALLSGDAVKVVSSKKNETLDSPVSRPQAIPVSTLSGRVDSVKYGNIDAEIT 120

Query: 417  VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSAN 596
            V  SPPI+DQFV+AG+AAV  LSD+VNY+IPM DGS +L+SS N +VDRTKP+SNVR +N
Sbjct: 121  VQSSPPISDQFVRAGIAAVSFLSDVVNYKIPMLDGSEVLSSSNNCVVDRTKPVSNVRPSN 180

Query: 597  TKNPRKDKVYVKPSAAPNFTSGAT-------GSGDRSVSEKGVTNESNNFV---ETHGIS 746
             K  R DK+  K S     TS            GD+  S +G    SNN V   ETHG+ 
Sbjct: 181  IKTSRNDKLQGKASTDTPVTSTLAHNSNYTKNKGDKCNSVRGRNPVSNNGVGEVETHGVI 240

Query: 747  SEPRDRVKSIPQRARANSNRFMPNTPQSDG---------FTRAIRQAK-----VNTRQFI 884
             +  DR +++P + R   NR M N   S+G         F+R  R  K     V  RQF 
Sbjct: 241  PDSCDRRRTMPPKPRTYPNRNMTNVRGSEGKLKHEIPEGFSRPQRATKLQNAGVMVRQFS 300

Query: 885  SSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKR 1064
            +S+ VV +V  I+ Q  W P TE AL +LN  LD YQANQVLKQ+ D+ VALGFFYWLK+
Sbjct: 301  NSSHVVGTVSRIIQQLNWSPETENALRELNYLLDPYQANQVLKQIHDHAVALGFFYWLKQ 360

Query: 1065 KPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            +PGFKHDGHTYTTMVGILGRARQFGAINKLL+QMV DGC+PNVVTYNRLIHSYGRANYL
Sbjct: 361  QPGFKHDGHTYTTMVGILGRARQFGAINKLLEQMVKDGCQPNVVTYNRLIHSYGRANYL 419


>emb|CBI15289.3| unnamed protein product [Vitis vinifera]
          Length = 793

 Score =  344 bits (883), Expect = 4e-92
 Identities = 215/433 (49%), Positives = 270/433 (62%), Gaps = 33/433 (7%)
 Frame = +3

Query: 42   WVPKER------HFQTMLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXR 203
            W+ +ER      + Q MLR K +G LS SARS  +SG+R                    +
Sbjct: 12   WILQERNLKVLLYLQIMLRTKQIGPLSNSARSILISGTRSSTPDGNSCPCSEDETCVSTK 71

Query: 204  RHLRANVQHIGASSSLLSKASVGVSPLASLDSVKEVNAK---TTEHNSTLSQGVAIACSL 374
            +H R  V  +   ++L SK +  V PL   D+VK V ++   + EH ++L+Q VA   S+
Sbjct: 72   QHARNEVLIMQKQTTLASKTAARVGPLFLGDAVKVVGSQKVESVEHATSLAQVVAAPRSV 131

Query: 375  DRAEDSVSYADDF-----DVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTS 539
                D VSY  D      D   +PPI+DQF++AG+ AV  LSDLVNY+IPM+DGS ML  
Sbjct: 132  V-GSDCVSYVSDNVGVNNDAVHAPPISDQFIRAGIVAVNFLSDLVNYKIPMSDGSGMLKL 190

Query: 540  SPNSMVDRTKPISNVRSANTKNPRK---DKVYVKPSA----APNFTSG---ATGSGDRSV 689
              N MVD TKP+S ++S N K  RK    KV  + SA    A N TS      G GD+S 
Sbjct: 191  PQNCMVDPTKPLSKIKSTNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGKGDKSG 250

Query: 690  SEKGVTNESN----NFVETHGISSEPRDRVKSIPQRARANSNRFMPNT-----PQSDGFT 842
            S KG ++  +    N V+T  +SS+  ++ +S+PQ+++A SN    N+     P  D  T
Sbjct: 251  SVKGCSHVGDTWTRNTVDTRSLSSDTHNK-RSMPQKSKAYSNYSTSNSNFNSNPLRD--T 307

Query: 843  RAIRQAKVNTRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQ 1022
            + I  A V+ RQF SS  VVE+V  IL Q  WGPA EEAL  LNC +DAYQANQVLKQ+Q
Sbjct: 308  KMIGIAPVS-RQFGSSGHVVENVSRILRQLSWGPAAEEALRNLNCLMDAYQANQVLKQIQ 366

Query: 1023 DYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTY 1202
            D+ VALGFFYWLKR+ GFKHDGHTYTTMVGILGRARQFGAINKLL +MV DGC+PNVVTY
Sbjct: 367  DHPVALGFFYWLKRQTGFKHDGHTYTTMVGILGRARQFGAINKLLAEMVRDGCQPNVVTY 426

Query: 1203 NRLIHSYGRANYL 1241
            NRLIHSYGRANYL
Sbjct: 427  NRLIHSYGRANYL 439


>ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750
            [Vitis vinifera]
          Length = 875

 Score =  341 bits (874), Expect = 4e-91
 Identities = 211/432 (48%), Positives = 266/432 (61%), Gaps = 42/432 (9%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLR K +G LS SARS  +SG+R                    ++H R  V  +   ++L
Sbjct: 1    MLRTKQIGPLSNSARSILISGTRSSTPDGNSCPCSEDETCVSTKQHARNEVLIMQKQTTL 60

Query: 252  LSKASVGVSPLASLDSVKEVNAK---TTEHNSTLSQGVAIACSLDRAEDSVSYADDF--- 413
             SK +  V PL   D+VK V ++   + EH ++L+Q VA   S+    D VSY  D    
Sbjct: 61   ASKTAARVGPLFLGDAVKVVGSQKVESVEHATSLAQVVAAPRSVV-GSDCVSYVSDNVGV 119

Query: 414  --DVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVR 587
              D   +PPI+DQF++AG+ AV  LSDLVNY+IPM+DGS ML    N MVD TKP+S ++
Sbjct: 120  NNDAVHAPPISDQFIRAGIVAVNFLSDLVNYKIPMSDGSGMLKLPQNCMVDPTKPLSKIK 179

Query: 588  SANTKNPRK---DKVYVKPSA----APNFTSG---ATGSGDRSVSEKGVTNESN----NF 725
            S N K  RK    KV  + SA    A N TS      G GD+S S KG ++  +    N 
Sbjct: 180  STNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGKGDKSGSVKGCSHVGDTWTRNT 239

Query: 726  VETHGISSEPRDRVKSIPQRARANSN------RFMPNTPQSD---------GFTRAIRQA 860
            V+T  +SS+  ++ +S+PQ+++A SN       F  N   S+         GF++ +R  
Sbjct: 240  VDTRSLSSDTHNK-RSMPQKSKAYSNYSTSNSNFNSNVRNSEPRFVGGIAGGFSKPLRDT 298

Query: 861  KVN-----TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQD 1025
            K+      +RQF SS  VVE+V  IL Q  WGPA EEAL  LNC +DAYQANQVLKQ+QD
Sbjct: 299  KMIGIAPVSRQFGSSGHVVENVSRILRQLSWGPAAEEALRNLNCLMDAYQANQVLKQIQD 358

Query: 1026 YTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYN 1205
            + VALGFFYWLKR+ GFKHDGHTYTTMVGILGRARQFGAINKLL +MV DGC+PNVVTYN
Sbjct: 359  HPVALGFFYWLKRQTGFKHDGHTYTTMVGILGRARQFGAINKLLAEMVRDGCQPNVVTYN 418

Query: 1206 RLIHSYGRANYL 1241
            RLIHSYGRANYL
Sbjct: 419  RLIHSYGRANYL 430


>ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Fragaria vesca subsp. vesca]
          Length = 879

 Score =  331 bits (848), Expect = 4e-88
 Identities = 200/434 (46%), Positives = 268/434 (61%), Gaps = 42/434 (9%)
 Frame = +3

Query: 66   QTMLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASS 245
            Q +LRAKHL  LS SARSFF+SGSR                    R+            S
Sbjct: 3    QNLLRAKHLSNLSSSARSFFISGSRCSATEGNSCTCSEDENCGPQRQQAINRGLLAQNPS 62

Query: 246  SLLSKASVGVSPLASLDSVKEVNA---KTTEHNSTLSQGVAIACSLDRAEDSVSYADDFD 416
            S +SK + G   L S D+VK  ++   K+ +  +++ Q         RA D+VSYA + D
Sbjct: 63   SSVSKPAAGAGILISQDAVKVADSGKSKSVDQRTSIKQVATAPTPFVRA-DTVSYATNVD 121

Query: 417  -----VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISN 581
                 +  SPP  +QFVKAG+AAV  LSD+V+Y+IP++DG  MLT   N+MV  T  +S+
Sbjct: 122  AIQKDISSSPPTTEQFVKAGVAAVNFLSDIVSYKIPLSDGMGMLTLPKNTMVRPTVGVSS 181

Query: 582  VRSANTKNPRKDK---VYVKPS----AAPNFTS---GATGSGDRSVSEKGVTN----ESN 719
            V+S+N K   ++    V+ KPS    AA   TS   G+ G+ D+S S KG+ +     + 
Sbjct: 182  VKSSNVKQINRENFISVHPKPSTETAAASERTSNHQGSKGNYDKSNSVKGLNHVPYTRTE 241

Query: 720  NFVETHGISS-EPRDRVKSIPQRARANSNRFMPNTPQ-------------SDGFTRAIRQ 857
            N    H + + E RDR +++P++++A  N F+P+                S GF++ +R+
Sbjct: 242  NSAVAHSVQTLETRDR-RALPRKSKAQPNHFVPDFKSNVQISDAETTRCGSKGFSKPVRE 300

Query: 858  AKVNT------RQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQL 1019
             K+ T      RQF+ +  VV++V  IL Q KWGP+ E +L  LNCS+DAYQANQ+LKQL
Sbjct: 301  MKMPTAIAPFNRQFVHNGNVVQNVSHILQQLKWGPSAEASLRNLNCSMDAYQANQILKQL 360

Query: 1020 QDYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVT 1199
            QD+TVALGFF WLKR+ GF+HDGHTYTTMVGILGRARQFGAINKLL+QMV++GC+PNVVT
Sbjct: 361  QDHTVALGFFNWLKRQAGFRHDGHTYTTMVGILGRARQFGAINKLLNQMVNEGCQPNVVT 420

Query: 1200 YNRLIHSYGRANYL 1241
            YNRLIHSYGRANYL
Sbjct: 421  YNRLIHSYGRANYL 434


>gb|EMJ11573.1| hypothetical protein PRUPE_ppa001256mg [Prunus persica]
          Length = 870

 Score =  328 bits (842), Expect = 2e-87
 Identities = 192/427 (44%), Positives = 261/427 (61%), Gaps = 37/427 (8%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAKH+  LS SARSFFL+G R                    R+  R         S++
Sbjct: 1    MLRAKHISNLSSSARSFFLNGPRCSATEGSSCTCSEDETCVSQRQQTRNGGPLAQTPSTM 60

Query: 252  LSKASVGVSPLASLDSVKEVN---AKTTEHNSTLSQGVAIACSLDRAEDSVSYADDFD-V 419
            +SK S G   + + D+VK  +   A++ EH + + Q      S  R+  +V+Y+   D V
Sbjct: 61   VSKPSAGAGTIITGDAVKVASSHKAESVEHTTNIKQVTTAPRSFGRSA-TVTYSSSTDAV 119

Query: 420  HLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSANT 599
            H SP + DQF +AG+AAV  LSD+VN ++P++DG  +L    N MVD T+P+S+++ ++ 
Sbjct: 120  HSSPLVVDQFARAGVAAVNFLSDIVNGKLPLSDGLGLLNLPQNCMVDPTRPLSSIKPSHV 179

Query: 600  KNPRKD---KVYVKPS----AAPNFTS---GATGSGDRSVSEKGVTN----ESNNFVETH 737
            K  +++    V+ KPS    AA   TS   G+ G G++    KG+ +       N V  H
Sbjct: 180  KQIKREHFISVHPKPSTETAAASKHTSNNHGSKGKGEKPSFVKGLNHVPYTRKENSVVAH 239

Query: 738  GISSEPRDRVKSIPQRARANSNRFMPNTPQS-------------DGFTRAIRQAKVNT-- 872
              SS+  D+ +S+P++++ +SN F+PN   +              GF R  R  K+ T  
Sbjct: 240  TASSDTFDK-RSMPRKSKGHSNNFIPNYSSNVQTSDAESMGRVTKGFNRPTRDMKMPTGI 298

Query: 873  ----RQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVAL 1040
                RQF+ +  VV++V  IL Q +WGPA E AL  LNCS+DAYQANQ+LKQLQD++VAL
Sbjct: 299  TPINRQFVHTGNVVQNVSHILQQMRWGPAAEAALLNLNCSMDAYQANQILKQLQDHSVAL 358

Query: 1041 GFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHS 1220
             FFYWLKR+ GFKHDGHTYTTMVGILGR+RQFGAINKLL+QMV +GC+PNVVTYNRLIHS
Sbjct: 359  SFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGCQPNVVTYNRLIHS 418

Query: 1221 YGRANYL 1241
            YGRANYL
Sbjct: 419  YGRANYL 425


>ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citrus clementina]
            gi|557541930|gb|ESR52908.1| hypothetical protein
            CICLE_v10018829mg [Citrus clementina]
          Length = 856

 Score =  327 bits (838), Expect = 6e-87
 Identities = 209/428 (48%), Positives = 258/428 (60%), Gaps = 38/428 (8%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAKH+  LS +ARSFFL+GSR                    R+H   N Q +      
Sbjct: 1    MLRAKHITNLSSTARSFFLNGSRCSASDGSSCTCSEDETCVSRRQH---NAQMV------ 51

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSYADDFD----- 416
             + A  GV  + S ++VK    + +E  S  S       SL R+ D VSYA   D     
Sbjct: 52   YTPARAGV--VVSGEAVKAAGLQKSERVSVPSPS-----SLGRS-DHVSYASTVDAVPKD 103

Query: 417  VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSAN 596
            V  S PI+DQFVKAG+AAV  LSDLVNY++P  DGS    S  N MVD T+P+SN++ AN
Sbjct: 104  VLTSSPISDQFVKAGVAAVSFLSDLVNYKLPALDGSGTANSPTNFMVDPTRPLSNIKPAN 163

Query: 597  TKNPRKDKVY-VKPSAAPNFTSGATGS-GDRSVSEKG-----------VTNESNNF-VET 734
             K  R++ V  V P+++   T G+  S G  +  +KG           V+N SN   +ET
Sbjct: 164  VKTIRRENVSKVYPNSSAESTVGSNPSTGCHNAKDKGDNSNIARRFKRVSNASNGTSLET 223

Query: 735  HGISSEPRDRVKSIPQRARANSNR----FMPNTPQSDG---------FTRAIRQAKVN-- 869
            H +SS+  DR + +  R++A+SNR    F  N   SD          F++  R+ K+   
Sbjct: 224  HNVSSDNSDRRRIVQPRSKAHSNRLNSNFKSNLQPSDAKVVECVSERFSKPSREMKIPAG 283

Query: 870  ----TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVA 1037
                +R F S+  VVESV  IL Q+KWGP  EEALG  N S+DAYQANQVLKQLQD+TVA
Sbjct: 284  LAPFSRHFASTGNVVESVSRILQQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVA 343

Query: 1038 LGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIH 1217
            LGFF WL+R+ GFKHD HTYTTMVGILGRARQFGAINKLLDQMV DGC+PNVVTYNRLIH
Sbjct: 344  LGFFNWLRRQAGFKHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGCQPNVVTYNRLIH 403

Query: 1218 SYGRANYL 1241
            SYGRANYL
Sbjct: 404  SYGRANYL 411


>ref|XP_002511505.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223550620|gb|EEF52107.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 876

 Score =  326 bits (835), Expect = 1e-86
 Identities = 210/434 (48%), Positives = 259/434 (59%), Gaps = 44/434 (10%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK L  LS +ARSFFLSGSR                    R+  R N        +L
Sbjct: 1    MLRAKQLSNLSSNARSFFLSGSRCSTSDGSSCTCSEDESCLPRRQQTRNNAVLAQRGPAL 60

Query: 252  LSKASVGVSPLASLDSVKEV---NAKTTEHNSTLSQGVAIACSLDRAEDSVSYADDFDV- 419
            + KAS  VS  + L    ++   +   +    TL Q V+   S+ R  D VSYA   D  
Sbjct: 61   VPKASARVSQTSLLGDAGKLLVPHKVESVECPTLPQVVSAPISI-RKSDCVSYASGIDAV 119

Query: 420  -----HLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNV 584
                 + SPPI+DQF KAG+AAV  LSDLVNY++P+TDGS  + S  N MVD T+P S V
Sbjct: 120  ENDIPYSSPPISDQFFKAGIAAVSFLSDLVNYKLPITDGSG-INSPKNCMVDPTRPQSTV 178

Query: 585  RSANTKNPRKD---KVYVKPS---AAPNFTSGATGSGDRSVSE---KGVTNESN----NF 725
            RS+N K  R++   KVY K S   A  + TS    + D+S      KG    SN    N 
Sbjct: 179  RSSNVKPIRRENCSKVYPKASPEAAVSSSTSNYDSTRDKSEKSSFIKGSKRVSNTPAGNS 238

Query: 726  VETHGISSEPRDRVKSIPQRARANSNRFMPN------TPQS----------DGFTRAIRQ 857
            V+T  I+S+  DR + IPQ+++  SNR   N      T Q+          + + +  R+
Sbjct: 239  VKTCSIASDTCDR-RIIPQKSKGQSNRSTANFNANVQTVQTSDTKYGEYVAEDYRKPPRE 297

Query: 858  AKV------NTRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQL 1019
             K+      +TR+F S+  +VE+V  IL Q +WGPA EEAL  LN S+D YQANQVLKQL
Sbjct: 298  TKMPVVRVPSTRRFASNGHIVENVAHILRQIRWGPAAEEALANLNYSMDPYQANQVLKQL 357

Query: 1020 QDYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVT 1199
            QD+TVAL FFYWLKR+PGF HDGHTYTTMVGILGRA+QFGAINKLLDQMV DGC+PNVVT
Sbjct: 358  QDHTVALNFFYWLKRQPGFNHDGHTYTTMVGILGRAKQFGAINKLLDQMVKDGCQPNVVT 417

Query: 1200 YNRLIHSYGRANYL 1241
            YNRLIHSYGRANYL
Sbjct: 418  YNRLIHSYGRANYL 431


>ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Citrus sinensis]
          Length = 856

 Score =  325 bits (832), Expect = 3e-86
 Identities = 208/428 (48%), Positives = 257/428 (60%), Gaps = 38/428 (8%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAKH+  LS +ARSFFL+GSR                    R+H   N Q +      
Sbjct: 1    MLRAKHITNLSSTARSFFLNGSRCSASDGSSCTCSEDETCVSRRQH---NAQMV------ 51

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSYADDFD----- 416
             + A  GV  + S ++ K    + +E  S  S       SL R+ D VSYA   D     
Sbjct: 52   YTPARAGV--VVSGEAAKAAGLQKSERVSVPSPS-----SLGRS-DHVSYASSVDAVPKD 103

Query: 417  VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSAN 596
            V  S PI+DQFVKAG+AAV  LSDLVNY++P  DGS    S  N MVD T+P+SN++ AN
Sbjct: 104  VLTSSPISDQFVKAGVAAVCFLSDLVNYKLPALDGSGTANSPTNFMVDPTRPLSNIKPAN 163

Query: 597  TKNPRKDKVY-VKPSAAPNFTSGATGS-GDRSVSEKG-----------VTNESNNF-VET 734
             K  R++ V  V P+++   T G+  S G  +  +KG           V+N SN   +ET
Sbjct: 164  VKTIRRENVSKVYPNSSAESTVGSNPSTGYHNAKDKGDNSNIARRFKRVSNASNGTSLET 223

Query: 735  HGISSEPRDRVKSIPQRARANSNR----FMPNTPQSDG---------FTRAIRQAKVN-- 869
            H +SS+  DR + +  R++A+SNR    F  N   SD          F++  R+ K+   
Sbjct: 224  HNVSSDNSDRRRIVQPRSKAHSNRLNSNFKSNLQPSDAKVVECVSERFSKPSREMKIPAG 283

Query: 870  ----TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVA 1037
                +R F S+  VVESV  IL Q+KWGP  EEALG  N S+DAYQANQVLKQLQD+TVA
Sbjct: 284  LAPFSRHFASTGNVVESVSRILRQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVA 343

Query: 1038 LGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIH 1217
            LGFF WL+R+ GFKHD HTYTTMVGILGRARQFGAINKLLDQMV DGC+PNVVTYNRLIH
Sbjct: 344  LGFFNWLRRQAGFKHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGCQPNVVTYNRLIH 403

Query: 1218 SYGRANYL 1241
            SYGRANYL
Sbjct: 404  SYGRANYL 411


>gb|EOY21688.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao]
          Length = 859

 Score =  312 bits (799), Expect = 2e-82
 Identities = 199/428 (46%), Positives = 251/428 (58%), Gaps = 38/428 (8%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK +G LS SARSF  SGSR                    +R +R  V         
Sbjct: 1    MLRAKQIGNLSSSARSFLFSGSRCSASDGNSCTCPEDESCVSRKRSIRNEV--------- 51

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSY-----ADDFD 416
            LSK+S G   LA   + K V +   E    L   V+    L R+  +V+Y     A   D
Sbjct: 52   LSKSS-GRGTLALGTASKAVGSHEAERAPQL---VSSPIPLHRS-GNVNYDVNIDAAQLD 106

Query: 417  VHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSAN 596
               S PI+DQFVKAG+AAV  LSD++NY++P++DG  ML+S  N +V+ ++ + N++S  
Sbjct: 107  GQASAPISDQFVKAGIAAVSFLSDMMNYKLPLSDGGVMLSSPKNCVVESSRQLPNIKSPA 166

Query: 597  TKNPRKD---KVYVKPS----AAPNFTSGATGSGDRSVSE------KGVTNESN-NFVET 734
             K  +K+   KVY KPS    A P  T    G+ DR          K V+N ++    ET
Sbjct: 167  VKPIKKENFAKVYPKPSSEIAAGPKSTVSYHGTKDRGNKPNFVRGYKQVSNAASVGSSET 226

Query: 735  HGISSEPRDRVKSIPQRARANSNRFMPNTPQS-------------DGFTRAIRQAKVNT- 872
            H  S+   D+ K +PQR +A+S+RFM N   +             +GF ++ R  K+ T 
Sbjct: 227  HRTSANTCDKGKPMPQRVKAHSHRFMSNFNSNVLPSDAKFSDSGTEGFKKSFRDMKMPTG 286

Query: 873  -----RQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVA 1037
                 R    +  V ESV  IL Q  WGPA E+AL  LN S+DAYQANQVLKQ+QD+TVA
Sbjct: 287  VVPMTRPLAGTRHVTESVSHILQQLNWGPAAEQALENLNFSMDAYQANQVLKQIQDHTVA 346

Query: 1038 LGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIH 1217
            LGFFYWLK++ GFKHDGHTYTTMVGILGRARQFGAIN+LLDQMV DGC+PNVVTYNRLIH
Sbjct: 347  LGFFYWLKQRAGFKHDGHTYTTMVGILGRARQFGAINRLLDQMVKDGCQPNVVTYNRLIH 406

Query: 1218 SYGRANYL 1241
            SYGRANYL
Sbjct: 407  SYGRANYL 414


>ref|XP_002321537.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222868533|gb|EEF05664.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 834

 Score =  308 bits (789), Expect = 3e-81
 Identities = 189/406 (46%), Positives = 250/406 (61%), Gaps = 16/406 (3%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXX-RRHLRANVQHIGASSS 248
            MLRAK LG LS SARSFFLSGSR                     R+  R ++      S+
Sbjct: 1    MLRAKQLGNLSSSARSFFLSGSRCSATDGSSSCTCSEDETCVSTRQQPRNSILLAQKPSN 60

Query: 249  LLSKASVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSYADDFDVHLS 428
              SK S  V    S D    +  + +  +  +S  V+ A  +D AE  V        H S
Sbjct: 61   FGSKTSARVEASVSGDGSSFLLPQKS--SCGMSGCVSYAIGIDIAEKDVG-------HSS 111

Query: 429  PPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVRSANTKNP 608
            PPI+DQFV+ G+AAV  LSDLVNY++P +DG+ ++ S+ N M+D T+ +SN++S+N K  
Sbjct: 112  PPISDQFVRVGIAAVSFLSDLVNYKLPTSDGT-VINSTINCMIDPTRQLSNIKSSNVKPI 170

Query: 609  RKD-----------KVYVKPSAAPNFTSGATGSGDRSVSEKGVTNESN----NFVETHGI 743
            R++           ++ V  +AA N+ S     G++S   +G    S+    + +++H +
Sbjct: 171  RRENFTKAYPNSSAEIPVGSNAAVNYNS-MKDRGNKSSFVRGFKQVSSIAADSSLDSHSL 229

Query: 744  SSEPRDRVKSIPQRARANSNRFMPNTPQSDGFTRAIRQAKVNTRQFISSNPVVESVYCIL 923
             S+  D+ ++IPQR +A  NR     P  D  T+       + RQF+S+  VVE+V  IL
Sbjct: 230  PSDAFDKRRTIPQRLKAQPNR----RPSRD--TKMPAVVARSARQFVSTGHVVENVSQIL 283

Query: 924  HQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKRKPGFKHDGHTYTT 1103
             Q +WGP+ EEAL  LNC +DAYQANQVLKQLQD+TVALGFF+WLK+ PGFKHDG+TYTT
Sbjct: 284  RQLRWGPSAEEALVNLNCHMDAYQANQVLKQLQDHTVALGFFHWLKQLPGFKHDGYTYTT 343

Query: 1104 MVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            MVGILGRA+QF AINKLLDQMV DGC+P VVTYNRLIHSYGRANYL
Sbjct: 344  MVGILGRAKQFVAINKLLDQMVRDGCQPTVVTYNRLIHSYGRANYL 389


>gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis]
          Length = 872

 Score =  301 bits (772), Expect = 3e-79
 Identities = 196/432 (45%), Positives = 251/432 (58%), Gaps = 42/432 (9%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGA---- 239
            MLRAK +G LS SARSFFLSGSR                    RR    N++H G     
Sbjct: 1    MLRAKQIGNLSNSARSFFLSGSRCNAADGSSSCTCSEDETCVSRRQ---NLRHGGILAQN 57

Query: 240  SSSLLSKASVGVSPLASLDSVKEVNA-KTTEHNST-LSQGVAIACSLDRAEDSVSYADDF 413
             S+L S+ S  V  L S D+VK V+  K + HN T L Q +    SL R+E  VSYA   
Sbjct: 58   PSTLASRTSARVGTLISGDAVKAVSTEKASMHNPTSLKQVIISPKSLGRSE-CVSYASTV 116

Query: 414  DV---HLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSS--PNSMVDRTKPIS 578
            +    H SP  +DQFVKAG+AAV  LSD++NY+ P++DG  +  ++   N MVD  +  +
Sbjct: 117  EKNVEHSSPVFSDQFVKAGVAAVNFLSDVMNYKFPLSDGIGIFNNNLPQNCMVDPARLST 176

Query: 579  NVRSANTKNPRKDK---VYVKPSAAP----NFTSGATGSGDRSVSEKGVTNESN----NF 725
            ++RS++  + ++     V+ +PS       N TS       +S S KGV N  N    N 
Sbjct: 177  SIRSSHVNHVKRKNFSGVHPRPSVEAAVQYNSTSSTKSKDSKSSSVKGVNNVPNTRNGNS 236

Query: 726  VETHGISSEPRDRVKSIPQRARANSNRFMPNTPQSD--------------GFTRAIRQAK 863
              T  + +E RDR ++IP R +A  N F  +                   GF R  R+  
Sbjct: 237  WATRSVPAEARDR-RAIPNRTKACLNSFKADFSSDSNQSTDGGNVGFGNKGFNRPPREMN 295

Query: 864  VNT------RQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQD 1025
              T      R + ++  VVE V  +LH  +WG A EEAL  LN ++DA+QANQVLKQLQD
Sbjct: 296  FPTGYAPIKRPYANTANVVERVSHMLHGLRWGRAAEEALENLNYAMDAFQANQVLKQLQD 355

Query: 1026 YTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYN 1205
            + VALGFFYWLKR+ GFKHDGHTYTTMVGILGR+R+FGAINKLL +MV +GC+PNVVTYN
Sbjct: 356  HNVALGFFYWLKRQAGFKHDGHTYTTMVGILGRSREFGAINKLLHEMVKEGCQPNVVTYN 415

Query: 1206 RLIHSYGRANYL 1241
            RLIHSYGRANYL
Sbjct: 416  RLIHSYGRANYL 427


>ref|XP_004138146.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Cucumis sativus]
          Length = 874

 Score =  298 bits (762), Expect = 4e-78
 Identities = 187/433 (43%), Positives = 254/433 (58%), Gaps = 43/433 (9%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK +G+LS SARSFFLSGSR                    R++ R         S+L
Sbjct: 1    MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETCVSE-RQNARNETLPSQKPSTL 59

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEH---NSTLSQGVAIACSLDRAEDSVSYADDFDVH 422
            ++ +S  V PL + ++ K + +  T++   + ++ Q      +  R  + V YA   +  
Sbjct: 60   VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 119

Query: 423  L-----SPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVR 587
            L     SP IADQ VKAG+ AV L SD VN++IP +D     +SS N MVD  + I++V+
Sbjct: 120  LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 179

Query: 588  SANTKNPRKD---KVYVKPS------AAPNFTS--GATGSGDRSVSEKGVTNE-----SN 719
             +  K+ R++   +V+ +PS      + P  +S  G+     +S   KG   E     + 
Sbjct: 180  PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 239

Query: 720  NFVETHGISSEPRDRVKSIPQRARANSNRF------MPNTPQSDGFTRAIRQAKVN---- 869
              V    ISS+  D+ +++PQR R +SN F      +  T  SD FT + +  K      
Sbjct: 240  KLVVFQNISSDKCDK-RNLPQRTRVHSNSFTSHFHSIAQTTGSD-FTNSSKNFKKFPDNL 297

Query: 870  ---------TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQ 1022
                     T  F+++  VVESV CIL Q KWGPA EEA+GKLNCS+DAYQANQ+LK++ 
Sbjct: 298  KSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVD 357

Query: 1023 DYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTY 1202
            D+ VALGFFYWLKR P F+HDGHTYTTM+G+LGRA+QF AINKLLDQM+ DGC+PNVVTY
Sbjct: 358  DHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTY 417

Query: 1203 NRLIHSYGRANYL 1241
            NR+IHSYGRANYL
Sbjct: 418  NRIIHSYGRANYL 430


>gb|ESW29652.1| hypothetical protein PHAVU_002G087700g [Phaseolus vulgaris]
          Length = 881

 Score =  296 bits (759), Expect = 9e-78
 Identities = 193/437 (44%), Positives = 241/437 (55%), Gaps = 45/437 (10%)
 Frame = +3

Query: 66   QTMLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGAS- 242
            Q MLRAK + TLS +ARSF L GSR                    R   R N + +    
Sbjct: 2    QNMLRAKQISTLSSNARSFLLGGSRCNGADGNSCTCPEDETCIS-RGQQRKNSEDLVVQK 60

Query: 243  -SSLLSKAS---VGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSYA-- 404
             SSL+SK +   VG     SL +    +       S   Q +          DSV+YA  
Sbjct: 61   PSSLVSKTTSQVVGTLVSGSLANGPASHKAGDVGQSGCVQQIRSTSFAPSRPDSVTYACV 120

Query: 405  ----DDFDVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKP 572
                 D   H S   ADQF +AG+AAV  +SD+VNY+ P++DG  +L  S N MVD  + 
Sbjct: 121  VDGVQDHVAHSSSVNADQFYRAGIAAVNFISDVVNYKFPLSDGMGILNYSKNYMVDPGRA 180

Query: 573  ISNVRSANTKNPRKDK-----------VYVKPSAAPNFTSGATGSGDRSVSEKGVTNESN 719
            + ++RS+N K  RK+             +  PS   N   GA G GD+S   KG    ++
Sbjct: 181  LPSIRSSNVKQIRKESFTAVHPKPPVSTHPGPSKPTNNHHGAKGKGDKSNLAKGFKPVAS 240

Query: 720  NFVETHG----ISSEPRDRVKSIPQRARANSNRFMPN--------TPQSDG--------F 839
              +E  G    I     DR +++PQR R   NRF+ N         PQ  G        +
Sbjct: 241  PGIEKSGEAPNIPVNSHDR-RALPQRTRTRPNRFVTNFGSNMPSSNPQMAGSFKESFCKY 299

Query: 840  TRAIRQAK---VNTRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVL 1010
            TR +  A     + R F +S  VV+ V  +L Q KWGPATE+AL  LN S+DAYQANQ+L
Sbjct: 300  TRNVNMAAGIAPSNRHFTNSGHVVDMVKDMLRQLKWGPATEKALCNLNFSIDAYQANQIL 359

Query: 1011 KQLQDYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPN 1190
            KQLQD++VAL FFYWLK +PGF HDGHTYTTMVGILGRAR+FGAINKLL+QMV DGC+PN
Sbjct: 360  KQLQDHSVALSFFYWLKLQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCQPN 419

Query: 1191 VVTYNRLIHSYGRANYL 1241
            VVTYNRLIHSYGRANYL
Sbjct: 420  VVTYNRLIHSYGRANYL 436


>ref|XP_004154991.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At1g18900-like [Cucumis sativus]
          Length = 874

 Score =  296 bits (757), Expect = 2e-77
 Identities = 186/433 (42%), Positives = 253/433 (58%), Gaps = 43/433 (9%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASSSL 251
            MLRAK +G+LS SARSFF SGSR                    R++ R         S+L
Sbjct: 1    MLRAKQIGSLSNSARSFFXSGSRCNADGASCTCPEDETCVSE-RQNARNETLPSQKPSTL 59

Query: 252  LSKASVGVSPLASLDSVKEVNAKTTEH---NSTLSQGVAIACSLDRAEDSVSYADDFDVH 422
            ++ +S  V PL + ++ K + +  T++   + ++ Q      +  R  + V YA   +  
Sbjct: 60   VANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNTV 119

Query: 423  L-----SPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVR 587
            L     SP IADQ VKAG+ AV L SD VN++IP +D     +SS N MVD  + I++V+
Sbjct: 120  LDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSVK 179

Query: 588  SANTKNPRKD---KVYVKPS------AAPNFTS--GATGSGDRSVSEKGVTNE-----SN 719
             +  K+ R++   +V+ +PS      + P  +S  G+     +S   KG   E     + 
Sbjct: 180  PSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEARTQ 239

Query: 720  NFVETHGISSEPRDRVKSIPQRARANSNRF------MPNTPQSDGFTRAIRQAKVN---- 869
              V    ISS+  D+ +++PQR R +SN F      +  T  SD FT + +  K      
Sbjct: 240  KLVVFQNISSDKCDK-RNLPQRTRVHSNSFTSHFHSIAQTTGSD-FTNSSKNFKKFPDNL 297

Query: 870  ---------TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQ 1022
                     T  F+++  VVESV CIL Q KWGPA EEA+GKLNCS+DAYQANQ+LK++ 
Sbjct: 298  KSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQILKRVD 357

Query: 1023 DYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTY 1202
            D+ VALGFFYWLKR P F+HDGHTYTTM+G+LGRA+QF AINKLLDQM+ DGC+PNVVTY
Sbjct: 358  DHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGCQPNVVTY 417

Query: 1203 NRLIHSYGRANYL 1241
            NR+IHSYGRANYL
Sbjct: 418  NRIIHSYGRANYL 430


>ref|XP_004513211.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Cicer arietinum]
          Length = 879

 Score =  291 bits (745), Expect = 4e-76
 Identities = 185/434 (42%), Positives = 245/434 (56%), Gaps = 42/434 (9%)
 Frame = +3

Query: 66   QTMLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGASS 245
            Q MLRAK + +LS SARSFFLSGSR                    R+  +     +   S
Sbjct: 2    QNMLRAKPISSLSSSARSFFLSGSRCNAPDANSCTCNEDETCVSRRQETKNENLLMQKPS 61

Query: 246  SLLSKASVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIACSLDRAEDSVSYA---DDFD 416
            S+    S+    L S +S      K  + + ++ Q V    S     DSV+YA   DD  
Sbjct: 62   SVSKTTSLVERTLVSGNSASSHKVKGIDQSGSVQQ-VRSNSSPSSKSDSVTYACVADDIP 120

Query: 417  VHL---SPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNSMVDRTKPISNVR 587
             H+   SP   DQF +AG+AAV  +SD V+ ++P++DG  +L+ S N MV+    I+++R
Sbjct: 121  NHVTHSSPLDTDQFYRAGIAAVNFISDFVHCKLPVSDGMGILSYSKNCMVEPASTITSIR 180

Query: 588  SANTKNPRKD---KVYVKPSA---------APNFTSGATGSGDRSVSEKGVTNESNNFVE 731
            S+N K  RK+    V+ KP           A +  +G+ G GD+S   KG  + +++  E
Sbjct: 181  SSNVKQIRKEDFISVHPKPPVSNHPGSSNHASSSYNGSKGKGDKSKFGKGFKHIASSATE 240

Query: 732  TH----GISSEPRDRVKSIPQRARANSNRFMPNTPQS-------------DGFTRAIRQA 860
                   I+    D  + +PQR R ++N+F+ N+  +             + F R  R  
Sbjct: 241  KSEVAPNIAFNNHDGRRPLPQRTRTHTNQFVTNSGSNVQTSNSHMLGSFKESFHRHPRYL 300

Query: 861  KVN-------TRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQL 1019
            K +       T    + + VVE V  IL Q KWGPATEEAL  L  S+DAYQ NQ+LKQL
Sbjct: 301  KTSAGSSSTKTHFTKTGHRVVEVVIDILQQLKWGPATEEALYNLKSSIDAYQGNQILKQL 360

Query: 1020 QDYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVT 1199
            +D++VAL FFYWLKR+P F HDGHTYTTMVGILGRAR+FGAINKLL+QMV DGCKPNVVT
Sbjct: 361  EDHSVALSFFYWLKRQPNFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGCKPNVVT 420

Query: 1200 YNRLIHSYGRANYL 1241
            YNRLIHSYGRANYL
Sbjct: 421  YNRLIHSYGRANYL 434


>ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Glycine max]
          Length = 882

 Score =  289 bits (740), Expect = 1e-75
 Identities = 185/446 (41%), Positives = 243/446 (54%), Gaps = 54/446 (12%)
 Frame = +3

Query: 66   QTMLRAKHLG-TLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANVQHIGAS 242
            Q MLRAK +  TLS +ARS  LSGSR                    R+  + N   +   
Sbjct: 2    QNMLRAKAISSTLSSNARSILLSGSRCNAADGNSCNCPEDETCVSKRQQRKNNEDLLAPK 61

Query: 243  S-SLLSKA------------------SVGVSPLASLDSVKEVNAKTTEHNSTLSQGVAIA 365
              SL+SKA                  S  V  +     V++V  + T +  + S+ V  A
Sbjct: 62   PPSLVSKATSQVVGTLVSGNLANGPASHNVGSVGQSGCVQKV--RPTSYAPSKSESVTSA 119

Query: 366  CSLDRAEDSVSYADDFDVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSP 545
            C +D  ++ V+++   +       ADQF +AG+AAV  +SD+VNY++P++DG  +L  S 
Sbjct: 120  CVVDGVQEHVAHSSSLN-------ADQFYRAGIAAVNFISDVVNYKLPLSDGMGILNYSK 172

Query: 546  NSMVDRTKPISNVRSANTKNPRKDK-----------VYVKPSAAPNFTSGATGSGDRSVS 692
            N MVD  + +  +RS+N +  R +             +  PS   N   GA G  ++S  
Sbjct: 173  NCMVDPARALPKIRSSNVQQIRTENFTSVHPKPPVPAHPGPSKHTNNNHGAKGKANKSNL 232

Query: 693  EKGVTNESNNFVETHG----ISSEPRDRVKSIPQRARANSNRFMPN-------------T 821
             KG    + +  E  G    I     DR +++PQR R NSN F+ N              
Sbjct: 233  AKGFKYVAASGTEKSGAAPNIPVNNHDR-RALPQRTRTNSNHFVTNFGSNMQSSNPQMAR 291

Query: 822  PQSDGFTRAIRQAKV------NTRQFISSNPVVESVYCILHQYKWGPATEEALGKLNCSL 983
            P  + F +  R   +        R F +S  VVE V  IL Q +WGPATE+AL  LN S+
Sbjct: 292  PFKESFNKHTRDLNMPAGIAPTRRHFTNSGHVVEGVKDILKQLRWGPATEKALYNLNFSI 351

Query: 984  DAYQANQVLKQLQDYTVALGFFYWLKRKPGFKHDGHTYTTMVGILGRARQFGAINKLLDQ 1163
            DAYQANQ+LKQLQD++VAL FFYWLKR+PGF HDGHTYTTMVGILGRAR+FGAINKLL+Q
Sbjct: 352  DAYQANQILKQLQDHSVALSFFYWLKRQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQ 411

Query: 1164 MVSDGCKPNVVTYNRLIHSYGRANYL 1241
            MV DGC+PNVVTYNRLIHSYGRANYL
Sbjct: 412  MVKDGCQPNVVTYNRLIHSYGRANYL 437


>gb|AAF79278.1|AC068602_1 F14D16.2 [Arabidopsis thaliana]
          Length = 977

 Score =  286 bits (732), Expect = 1e-74
 Identities = 186/417 (44%), Positives = 241/417 (57%), Gaps = 27/417 (6%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANV-QHIGASSS 248
            M+RAKH+  LS +ARSFFL+GSR                    R+ LR    Q     SS
Sbjct: 118  MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 177

Query: 249  LLSKASVGVSPLASLDSVKEVNAKTTE---HNSTLSQGVAIACSLDRAEDSVSYADDF-- 413
            +L K SV V  +   +  K V  K  +     S L Q V+ + +L     SV+YA     
Sbjct: 178  ILPKPSV-VGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVR 236

Query: 414  ----DVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNS-MVDRTKPIS 578
                    S PI DQ  KAG+ AV  LSDL N +IP  DG S     P S MVD T+PIS
Sbjct: 237  EEVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPIS 296

Query: 579  NVRSANTKNPRKD---KVYVKPSAAPNFTSGATGS----------GDRSVSEKGVTNESN 719
            +V+S+N K  R++   K+Y + SAA   + G T +           +R+   KG    SN
Sbjct: 297  SVKSSNVKAIRREHFAKIYPR-SAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQVSN 355

Query: 720  NFV-ETHGISSEPRDRVKSIPQRARANSNRFMPN--TPQSDGFTRAIRQAKVNTRQFISS 890
            + V ++   ++    +  S+ QR   +SNRF+P+  +  S    +      + +RQ+ +S
Sbjct: 356  SVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSGFSNSSVEMMKGPSGTALTSRQYCNS 415

Query: 891  NPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKRKP 1070
              +VE+V  +L +++WGPA EEAL  L   +DAYQANQVLKQ+ DY  ALGFFYWLKR+P
Sbjct: 416  GHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQP 475

Query: 1071 GFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            GFKHDGHTYTTMVG LGRA+QFGAINKLLD+MV DGC+PN VTYNRLIHSYGRANYL
Sbjct: 476  GFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYL 532


>ref|NP_173324.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|42571539|ref|NP_973860.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75151479|sp|Q8GYP6.1|PPR49_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g18900 gi|26450017|dbj|BAC42129.1| unknown protein
            [Arabidopsis thaliana] gi|28827402|gb|AAO50545.1| unknown
            protein [Arabidopsis thaliana]
            gi|332191657|gb|AEE29778.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332191658|gb|AEE29779.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 860

 Score =  286 bits (732), Expect = 1e-74
 Identities = 186/417 (44%), Positives = 241/417 (57%), Gaps = 27/417 (6%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANV-QHIGASSS 248
            M+RAKH+  LS +ARSFFL+GSR                    R+ LR    Q     SS
Sbjct: 1    MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 249  LLSKASVGVSPLASLDSVKEVNAKTTE---HNSTLSQGVAIACSLDRAEDSVSYADDF-- 413
            +L K SV V  +   +  K V  K  +     S L Q V+ + +L     SV+YA     
Sbjct: 61   ILPKPSV-VGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVR 119

Query: 414  ----DVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNS-MVDRTKPIS 578
                    S PI DQ  KAG+ AV  LSDL N +IP  DG S     P S MVD T+PIS
Sbjct: 120  EEVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPIS 179

Query: 579  NVRSANTKNPRKD---KVYVKPSAAPNFTSGATGS----------GDRSVSEKGVTNESN 719
            +V+S+N K  R++   K+Y + SAA   + G T +           +R+   KG    SN
Sbjct: 180  SVKSSNVKAIRREHFAKIYPR-SAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQVSN 238

Query: 720  NFV-ETHGISSEPRDRVKSIPQRARANSNRFMPN--TPQSDGFTRAIRQAKVNTRQFISS 890
            + V ++   ++    +  S+ QR   +SNRF+P+  +  S    +      + +RQ+ +S
Sbjct: 239  SVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSGFSNSSVEMMKGPSGTALTSRQYCNS 298

Query: 891  NPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKRKP 1070
              +VE+V  +L +++WGPA EEAL  L   +DAYQANQVLKQ+ DY  ALGFFYWLKR+P
Sbjct: 299  GHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQP 358

Query: 1071 GFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            GFKHDGHTYTTMVG LGRA+QFGAINKLLD+MV DGC+PN VTYNRLIHSYGRANYL
Sbjct: 359  GFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYL 415


>ref|NP_001185030.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332191659|gb|AEE29780.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 886

 Score =  286 bits (732), Expect = 1e-74
 Identities = 186/417 (44%), Positives = 241/417 (57%), Gaps = 27/417 (6%)
 Frame = +3

Query: 72   MLRAKHLGTLSQSARSFFLSGSRXXXXXXXXXXXXXXXXXXXXRRHLRANV-QHIGASSS 248
            M+RAKH+  LS +ARSFFL+GSR                    R+ LR    Q     SS
Sbjct: 1    MIRAKHISNLSSTARSFFLNGSRTSVTDGNSCVYSDDENCVSKRQQLRKEAGQTEKRPSS 60

Query: 249  LLSKASVGVSPLASLDSVKEVNAKTTE---HNSTLSQGVAIACSLDRAEDSVSYADDF-- 413
            +L K SV V  +   +  K V  K  +     S L Q V+ + +L     SV+YA     
Sbjct: 61   ILPKPSV-VGCILPGEVTKPVVPKKVDDFGRPSLLPQHVSSSPALPLKSHSVNYASTVVR 119

Query: 414  ----DVHLSPPIADQFVKAGMAAVGLLSDLVNYRIPMTDGSSMLTSSPNS-MVDRTKPIS 578
                    S PI DQ  KAG+ AV  LSDL N +IP  DG S     P S MVD T+PIS
Sbjct: 120  EEVEGKASSEPIGDQIFKAGIVAVNFLSDLSNCKIPSYDGGSDAFGLPKSCMVDPTRPIS 179

Query: 579  NVRSANTKNPRKD---KVYVKPSAAPNFTSGATGS----------GDRSVSEKGVTNESN 719
            +V+S+N K  R++   K+Y + SAA   + G T +           +R+   KG    SN
Sbjct: 180  SVKSSNVKAIRREHFAKIYPR-SAAKESSVGTTRNPSSNFRGAKEAERTGFVKGFRQVSN 238

Query: 720  NFV-ETHGISSEPRDRVKSIPQRARANSNRFMPN--TPQSDGFTRAIRQAKVNTRQFISS 890
            + V ++   ++    +  S+ QR   +SNRF+P+  +  S    +      + +RQ+ +S
Sbjct: 239  SVVGKSLPTTNNTYGKRTSVLQRPHIDSNRFVPSGFSNSSVEMMKGPSGTALTSRQYCNS 298

Query: 891  NPVVESVYCILHQYKWGPATEEALGKLNCSLDAYQANQVLKQLQDYTVALGFFYWLKRKP 1070
              +VE+V  +L +++WGPA EEAL  L   +DAYQANQVLKQ+ DY  ALGFFYWLKR+P
Sbjct: 299  GHIVENVSSVLRRFRWGPAAEEALQNLGLRIDAYQANQVLKQMNDYGNALGFFYWLKRQP 358

Query: 1071 GFKHDGHTYTTMVGILGRARQFGAINKLLDQMVSDGCKPNVVTYNRLIHSYGRANYL 1241
            GFKHDGHTYTTMVG LGRA+QFGAINKLLD+MV DGC+PN VTYNRLIHSYGRANYL
Sbjct: 359  GFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQPNTVTYNRLIHSYGRANYL 415

