BLASTX nr result

ID: Forsythia23_contig00014990 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00014990
         (1872 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011093449.1| PREDICTED: pentatricopeptide repeat-containi...   535   e-149
ref|XP_012847188.1| PREDICTED: pentatricopeptide repeat-containi...   483   e-133
gb|EYU29299.1| hypothetical protein MIMGU_mgv1a002580mg [Erythra...   466   e-128
ref|XP_010662700.1| PREDICTED: pentatricopeptide repeat-containi...   463   e-127
emb|CDP08317.1| unnamed protein product [Coffea canephora]            442   e-121
ref|XP_009785923.1| PREDICTED: pentatricopeptide repeat-containi...   440   e-120
ref|XP_010110548.1| hypothetical protein L484_023382 [Morus nota...   437   e-119
ref|XP_006359014.1| PREDICTED: pentatricopeptide repeat-containi...   437   e-119
ref|XP_009604974.1| PREDICTED: pentatricopeptide repeat-containi...   436   e-119
ref|XP_004237845.1| PREDICTED: pentatricopeptide repeat-containi...   434   e-119
ref|XP_007204496.1| hypothetical protein PRUPE_ppa019323mg [Prun...   428   e-117
ref|XP_008240720.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-116
ref|XP_012075523.1| PREDICTED: pentatricopeptide repeat-containi...   426   e-116
gb|KDP34852.1| hypothetical protein JCGZ_09140 [Jatropha curcas]      426   e-116
ref|XP_004301723.1| PREDICTED: pentatricopeptide repeat-containi...   420   e-114
ref|XP_009345148.1| PREDICTED: pentatricopeptide repeat-containi...   419   e-114
ref|XP_008392809.1| PREDICTED: pentatricopeptide repeat-containi...   417   e-113
gb|KDO39066.1| hypothetical protein CISIN_1g048743mg, partial [C...   414   e-112
ref|XP_007048805.1| Pentatricopeptide repeat superfamily protein...   414   e-112
ref|XP_010274524.1| PREDICTED: pentatricopeptide repeat-containi...   413   e-112

>ref|XP_011093449.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Sesamum indicum] gi|747040687|ref|XP_011093529.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Sesamum indicum]
            gi|747040689|ref|XP_011093613.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Sesamum indicum] gi|747040691|ref|XP_011093687.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Sesamum indicum]
            gi|747040693|ref|XP_011093767.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Sesamum indicum] gi|747040695|ref|XP_011093841.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Sesamum indicum]
          Length = 690

 Score =  535 bits (1378), Expect = e-149
 Identities = 273/406 (67%), Positives = 320/406 (78%)
 Frame = -2

Query: 1220 YGNLFTRGLSSIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPS 1041
            Y   F +G S+IG K E LY +  SR ILLGKLES+LKEHQVD+AWETYK+FKRLYGFP 
Sbjct: 34   YRTCFLQGFSAIGGKRENLYWKGASRSILLGKLESALKEHQVDEAWETYKDFKRLYGFPD 93

Query: 1040 QFLVAGLITELSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPAS 861
            QFLVA L+TE SYS+D KCLR+ACDLVLSISKEK V+L P++MTKLALS ARAQ+PV AS
Sbjct: 94   QFLVANLVTESSYSADPKCLRRACDLVLSISKEKSVLLRPEMMTKLALSLARAQIPVAAS 153

Query: 860  TILRLMLEKRSLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELK 681
            +ILRLM+EKRSLP LDVLRM+FLH+VKTE GTYLASNIL  IC C       KSA  EL 
Sbjct: 154  SILRLMVEKRSLPPLDVLRMIFLHLVKTENGTYLASNILDHICYCFQKLIVNKSAQAELT 213

Query: 680  KPDTVIFNLVLDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKK 501
            KPD  IFNLVLDACVRF + LKGQQ++ELM Q+G +AD HTA+IIARIHE+N MRDELKK
Sbjct: 214  KPDVTIFNLVLDACVRFVSPLKGQQIIELMPQLGAIADGHTAVIIARIHEMNGMRDELKK 273

Query: 500  FKDYIDRVPVNLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQ 321
            FKDY+D VPV L+ HYQ FYD +LSLHFKFNDI+ AS L++DL +  ES+ FQ G++E  
Sbjct: 274  FKDYVDMVPVTLIPHYQQFYDSLLSLHFKFNDIDGASALLMDLCRCSESSTFQRGQKEEL 333

Query: 320  TSCTVSIGSHNMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVG 141
             SC+VSIGS N+  GL L FLPQ LQ+D +YK  +KQ L+L+K+GKF LSNK LAKLI+ 
Sbjct: 334  KSCSVSIGSDNIRMGLRLQFLPQQLQRDFVYKGHNKQELILYKSGKFVLSNKCLAKLIIS 393

Query: 140  YKRCGNIGKLSKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            YKR G I  LS+LL  I N  SS      CS+VIDACI+LGWL+TA
Sbjct: 394  YKRSGRINGLSRLLIGIHNMLSSEGNSRSCSDVIDACIYLGWLETA 439


>ref|XP_012847188.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Erythranthe guttatus]
          Length = 683

 Score =  483 bits (1243), Expect = e-133
 Identities = 261/450 (58%), Positives = 324/450 (72%), Gaps = 6/450 (1%)
 Frame = -2

Query: 1334 ISMTQSLIAVIHKAALRLTFDRKFDFIHNLKRFEVLDT------YGNLFTRGLSSIGIKG 1173
            +S  QSLI       LRL F  +    HN+  F+ L++      + + F+   S+I  K 
Sbjct: 1    MSKLQSLIRCNQNGFLRLIFGGRNVCSHNVS-FQQLNSCNDNRLHRSRFSEEFSTIAEKL 59

Query: 1172 EKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSD 993
            EKL  ++P + ILL KLE +LKEHQ+D+AW+TY++FK +YG+P Q  ++ LITE SY++D
Sbjct: 60   EKLSSKKPPQWILLEKLEKALKEHQLDEAWKTYQDFKLVYGYPEQLFISNLITEFSYTTD 119

Query: 992  SKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLD 813
            SK LR+A DL LSIS+EK V+L  D+MTKL LS +RAQ+PVPAS ILR+ML+K SLPSL+
Sbjct: 120  SKYLRRASDLALSISREKSVLLRHDVMTKLVLSLSRAQIPVPASNILRIMLDKNSLPSLE 179

Query: 812  VLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVR 633
            VLRMVFLH+VKTE G+YLASNIL EIC C    + +KS   +L KPD  IFNLVLD+C R
Sbjct: 180  VLRMVFLHLVKTETGSYLASNILEEICYCFQKLSVKKSC--QLTKPDVTIFNLVLDSCAR 237

Query: 632  FGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHY 453
            FG  LKGQQ+MELM   GVVADA +A+IIAR+HE+N  RDELKKFKDYID VPV L  HY
Sbjct: 238  FGNCLKGQQIMELMPITGVVADADSAVIIARVHEMNGTRDELKKFKDYIDAVPVTLSRHY 297

Query: 452  QHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGL 273
            Q FYD ++SLHFKFNDI++ S L+L+L    E N      RE +  CTVSIGS  +  GL
Sbjct: 298  QQFYDRLISLHFKFNDIDSVSALLLELSGNREPN---PSPREQKGYCTVSIGSDKIKMGL 354

Query: 272  ALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFS 93
             L FLPQ +QKD +YKVD K  LVL+KNGKF LSN  LAKL++ YKRCG I  LSKLL S
Sbjct: 355  KLQFLPQQIQKDFVYKVDGKNELVLYKNGKFVLSNNGLAKLVIEYKRCGRISDLSKLLIS 414

Query: 92   IQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            IQ+  +S    S CS+VIDACI+LGWL+TA
Sbjct: 415  IQSMLNSPPNNSSCSDVIDACIYLGWLETA 444


>gb|EYU29299.1| hypothetical protein MIMGU_mgv1a002580mg [Erythranthe guttata]
          Length = 657

 Score =  466 bits (1198), Expect = e-128
 Identities = 241/381 (63%), Positives = 291/381 (76%)
 Frame = -2

Query: 1145 RLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSDSKCLRKACD 966
            R ILL KLE +LKEHQ+D+AW+TY++FK +YG+P Q  ++ LITE SY++DSK LR+A D
Sbjct: 43   RWILLEKLEKALKEHQLDEAWKTYQDFKLVYGYPEQLFISNLITEFSYTTDSKYLRRASD 102

Query: 965  LVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLDVLRMVFLHM 786
            L LSIS+EK V+L  D+MTKL LS +RAQ+PVPAS ILR+ML+K SLPSL+VLRMVFLH+
Sbjct: 103  LALSISREKSVLLRHDVMTKLVLSLSRAQIPVPASNILRIMLDKNSLPSLEVLRMVFLHL 162

Query: 785  VKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVRFGASLKGQQ 606
            VKTE G+YLASNIL EIC C    + +KS   +L KPD  IFNLVLD+C RFG  LKGQQ
Sbjct: 163  VKTETGSYLASNILEEICYCFQKLSVKKSC--QLTKPDVTIFNLVLDSCARFGNCLKGQQ 220

Query: 605  LMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHYQHFYDCMLS 426
            +MELM   GVVADA +A+IIAR+HE+N  RDELKKFKDYID VPV L  HYQ FYD ++S
Sbjct: 221  IMELMPITGVVADADSAVIIARVHEMNGTRDELKKFKDYIDAVPVTLSRHYQQFYDRLIS 280

Query: 425  LHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGLALHFLPQHL 246
            LHFKFNDI++ S L+L+L    E N      RE +  CTVSIGS  +  GL L FLPQ +
Sbjct: 281  LHFKFNDIDSVSALLLELSGNREPN---PSPREQKGYCTVSIGSDKIKMGLKLQFLPQQI 337

Query: 245  QKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFSIQNSWSSLE 66
            QKD +YKVD K  LVL+KNGKF LSN  LAKL++ YKRCG I  LSKLL SIQ+  +S  
Sbjct: 338  QKDFVYKVDGKNELVLYKNGKFVLSNNGLAKLVIEYKRCGRISDLSKLLISIQSMLNSPP 397

Query: 65   YYSLCSNVIDACIHLGWLQTA 3
              S CS+VIDACI+LGWL+TA
Sbjct: 398  NNSSCSDVIDACIYLGWLETA 418


>ref|XP_010662700.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Vitis vinifera]
          Length = 716

 Score =  463 bits (1192), Expect = e-127
 Identities = 238/396 (60%), Positives = 304/396 (76%)
 Frame = -2

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            SI  + E +  E     +LL KLE +LK+HQVD+AWET+K+ KRLYGFPS  LV+ LITE
Sbjct: 66   SISSQPELICWEGSCHAVLLRKLEIALKDHQVDEAWETFKDIKRLYGFPSHSLVSRLITE 125

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            LSYSS+   L+KACDLV  I KEK  +LH D +TKL+LS +RAQMP+PAS ILRLMLEK 
Sbjct: 126  LSYSSNPHWLQKACDLVYLILKEKSDLLHSDSLTKLSLSLSRAQMPIPASMILRLMLEKG 185

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLV 651
            S+P  +VL ++ LHMVKTEIGTYLASN L++IC   L  +A KS H +L KPDT+IFNLV
Sbjct: 186  SVPQKNVLWLIILHMVKTEIGTYLASNYLVQICDHFLLLSASKSNHAKLIKPDTMIFNLV 245

Query: 650  LDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPV 471
            LDACVRFG+S KGQQ++ELM QVGV ADAH+ IIIA+IHE+N  RD+LKKFK +ID+V +
Sbjct: 246  LDACVRFGSSFKGQQIIELMPQVGVGADAHSIIIIAQIHEMNGQRDDLKKFKCHIDQVSI 305

Query: 470  NLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSH 291
             L CHY+ FYD +LSLHFKFNDI+ A+ L+LD+ + ++S   Q  + +P  +C V IGS+
Sbjct: 306  QLACHYRQFYDSLLSLHFKFNDIDGAAGLVLDMCRCWDSLSIQKDRNDPHKTCLVPIGSY 365

Query: 290  NMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKL 111
             +  GL L  +P+ LQKD+++K+DSKQ L+LF+NGK+ LSNKALAKLI+ YKR G IG+L
Sbjct: 366  YLKEGLKLQIVPELLQKDSVFKMDSKQELLLFRNGKYVLSNKALAKLIIAYKRDGRIGEL 425

Query: 110  SKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            S+L+ S+Q    +LE   L S+VIDACI LGWL+TA
Sbjct: 426  SRLMLSLQKELGTLE-GGLISDVIDACIQLGWLETA 460


>emb|CDP08317.1| unnamed protein product [Coffea canephora]
          Length = 717

 Score =  442 bits (1137), Expect = e-121
 Identities = 244/450 (54%), Positives = 313/450 (69%), Gaps = 4/450 (0%)
 Frame = -2

Query: 1340 FLISMTQSLIAVIHKAALRLTFDRKFDFIHNLKRFEVLDTYGNLFTRGL----SSIGIKG 1173
            FLI    S+I +   ++L+  F R +   +      +L+   N  T+ L    SS  IK 
Sbjct: 14   FLIKSYWSVIGIASNSSLKSVFHRYYASSNKADEDGLLNACDNP-TKSLAFKDSSTCIKP 72

Query: 1172 EKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSD 993
             KL  E  S  ILL KLE+ LK+HQVD+AWETYK+FKRLYGFP   ++  LITE SYS D
Sbjct: 73   AKLGWEGSSHAILLEKLENVLKDHQVDEAWETYKDFKRLYGFPEDSIMRQLITEFSYSLD 132

Query: 992  SKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLD 813
            S  L +A D+VLS+SKEK  +   D++TKL LS ARAQMP P S ILRLM++K   P LD
Sbjct: 133  STWLCRAFDIVLSMSKEKSALPRLDVLTKLCLSLARAQMPSPTSVILRLMIQKNCFPPLD 192

Query: 812  VLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVR 633
            +L  VFLHMVKTE+G  LA+NIL EI       N  KS   ++ KPDT++FNL+LDAC+R
Sbjct: 193  ILGSVFLHMVKTEMGAILAANILTEIRDLYEQLNESKSNFAKMIKPDTMLFNLILDACIR 252

Query: 632  FGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHY 453
            + +SLKGQQ++ELMA+VGVVADAHT +IIA+I+E+N MRDELKK+K +ID V  +L+ HY
Sbjct: 253  YQSSLKGQQIIELMAEVGVVADAHTIVIIAQIYEMNCMRDELKKYKRHIDVVSASLVSHY 312

Query: 452  QHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGL 273
            + FYD +LSLHF FNDI+AAS LI D+Y++ ESN  + G++E   SCT+ IGS N+  GL
Sbjct: 313  RQFYDSLLSLHFIFNDIDAASALIKDMYQHGESNPAREGRKE---SCTIPIGSPNLKMGL 369

Query: 272  ALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFS 93
             LH LP+ LQKD + KV+ K  LVL KNGK  LS+ A+ KL+  YKRC  I +LS LL  
Sbjct: 370  KLHILPELLQKDTVIKVEGKPKLVLSKNGKLVLSSNAVTKLMREYKRCERINELSTLLNY 429

Query: 92   IQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            IQ+   S + ++LC +VIDACIHLGWLQTA
Sbjct: 430  IQSKLGSSDSHNLCHDVIDACIHLGWLQTA 459


>ref|XP_009785923.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Nicotiana sylvestris] gi|698477412|ref|XP_009785924.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Nicotiana sylvestris]
          Length = 715

 Score =  440 bits (1132), Expect = e-120
 Identities = 242/455 (53%), Positives = 312/455 (68%), Gaps = 3/455 (0%)
 Frame = -2

Query: 1358 ILTCLIFLISMTQSLIAVIHKAALRLTFDRKFDFIHNLKR---FEVLDTYGNLFTRGLSS 1188
            I  C IF  S + S + V   A  RLT +R+       K    +E     G LF R   S
Sbjct: 9    IAVCSIFRKSYS-SFVDVASNAC-RLTCNRRCTVFSRTKSSISYENSKPRGELFPRQFCS 66

Query: 1187 IGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITEL 1008
               + E L     S ++LLGKLES+LK H +++AWETYK+FKRLYGFP   LV  L+TE 
Sbjct: 67   -SREPETLSWGVSSDIVLLGKLESALKNHNLEEAWETYKDFKRLYGFPDPSLVGRLLTES 125

Query: 1007 SYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRS 828
            SYSSDS+ LRKA ++V SI KEK  +L  +LMTKL LS ARAQMP+ AS+ILRLML+KR 
Sbjct: 126  SYSSDSRWLRKAYNMVDSILKEKRELLRTELMTKLCLSLARAQMPIRASSILRLMLDKRI 185

Query: 827  LPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVL 648
            LP +D+L M+  HMVKTE G  L+SNILIEI G S     +KSA++E+ K DT++FNLVL
Sbjct: 186  LPPIDMLGMIIFHMVKTEGGMILSSNILIEIYGSSQQLTTKKSAYLEINKNDTLVFNLVL 245

Query: 647  DACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVN 468
            DAC RFG+S KG Q++ELMAQVGV ADAHT  +I+ IHE+N MRDELKKFK++ID+V   
Sbjct: 246  DACARFGSSSKGHQIIELMAQVGVAADAHTISVISLIHEMNGMRDELKKFKEHIDQVSAT 305

Query: 467  LLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHN 288
            L+ HY+ FY+ +LSLHFKFNDI+AAS+L+LD+Y++ ES+   G + +P   C VSIGS N
Sbjct: 306  LVPHYRQFYESLLSLHFKFNDIDAASDLVLDIYRFQESHHMHGDEAQPPKPCLVSIGSDN 365

Query: 287  MNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLS 108
            +  G  L   P  L +++++ V   Q  V++KNGK  LSNKALA+LIV YKR G I +LS
Sbjct: 366  LRMGFKLRIFPHSLSRESVFNVGRNQAFVMYKNGKLILSNKALARLIVRYKRDGRINELS 425

Query: 107  KLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            KLL +IQ      E   +CS+V+ ACI + WL+TA
Sbjct: 426  KLLCNIQRK-GLFESSRMCSDVVAACICMEWLETA 459


>ref|XP_010110548.1| hypothetical protein L484_023382 [Morus notabilis]
            gi|587940145|gb|EXC26766.1| hypothetical protein
            L484_023382 [Morus notabilis]
          Length = 718

 Score =  437 bits (1124), Expect = e-119
 Identities = 223/396 (56%), Positives = 289/396 (72%)
 Frame = -2

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            S  +  E+L     S+ +LL KLE +LK HQVD+AWE++ ++K+LYGFP   LV  LITE
Sbjct: 68   STDVGPERLCWGVSSQDVLLKKLERALKCHQVDEAWESFFDYKKLYGFPEDSLVQRLITE 127

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            LSYSS+ +CL+KACD VL +S EK  +L  D++TKL+LS AR+Q+P PA+ ILRLMLEK 
Sbjct: 128  LSYSSEPRCLQKACDFVLIVSNEKSGLLRRDILTKLSLSLARSQLPNPATKILRLMLEKD 187

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLV 651
             LPS+++L +V LHMVKTE+GT+LASN L +IC       A+     EL KPDT+IFNLV
Sbjct: 188  MLPSMNILWLVVLHMVKTEVGTHLASNFLAQICESFQQVGAKDRKRAELMKPDTMIFNLV 247

Query: 650  LDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPV 471
            LDACVRF  + KGQQ+MELM Q GVVADAH+ +++A+IHE+N  RDELKK+K +ID+V  
Sbjct: 248  LDACVRFKLAFKGQQIMELMPQTGVVADAHSIVVVAQIHEMNGQRDELKKYKVHIDQVSP 307

Query: 470  NLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSH 291
              +CHY+ FYD +LSLHFKFNDI+AA+ L+ ++ +Y ES   +  K+ PQ    + IGSH
Sbjct: 308  QFVCHYRQFYDSLLSLHFKFNDIDAAAGLVWNMCRYRESLPIKSEKKNPQKIFHIPIGSH 367

Query: 290  NMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKL 111
            N+  GL L   P+ LQKD + KV+SKQ LV+F+NGK  LSN+ALAK I G+KR GNI +L
Sbjct: 368  NLKAGLKLQIQPELLQKDTVLKVESKQELVIFRNGKLVLSNRALAKFIKGFKRDGNISQL 427

Query: 110  SKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            SKLL  IQ    SL    LCS+VI+ACI LGWL+ A
Sbjct: 428  SKLLLGIQKESCSLRGSDLCSDVIEACIRLGWLEYA 463


>ref|XP_006359014.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616-like
            isoform X1 [Solanum tuberosum]
          Length = 715

 Score =  437 bits (1124), Expect = e-119
 Identities = 240/456 (52%), Positives = 315/456 (69%), Gaps = 4/456 (0%)
 Frame = -2

Query: 1358 ILTCLIFLISMTQSLIAVIHKAALRLTFDRKF--DFI--HNLKRFEVLDTYGNLFTRGLS 1191
            I  C +F  S   S I  +   A+RLT++  +   ++   +   +E     G +F+R   
Sbjct: 9    ITVCSVFRKSY--SSIVAVSSNAIRLTYNSTYVPQYLGTESSISYENYKPGGVMFSRQFG 66

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            S   + E L     S ++LLGKLES+L+ H +++AWETYK+FKRLYGFP  FLV  L+T+
Sbjct: 67   S-SRESETLSWGVSSDVVLLGKLESALRNHNLEEAWETYKDFKRLYGFPDPFLVDKLLTK 125

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            LSYSSDS+ L+KAC++V SI KEK  +L  +LMTKL LS ARAQMPV AS+ILRLML+K 
Sbjct: 126  LSYSSDSRWLKKACNMVGSILKEKREMLRTELMTKLCLSLARAQMPVQASSILRLMLDKG 185

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLV 651
            +LP +D+L M+  HMVKT+ G  ++SNILIEICG S     +KS   EL K +T++FNLV
Sbjct: 186  NLPPIDMLGMIIFHMVKTDTGMIVSSNILIEICGSSQQLTTKKSTCTELNKHNTLLFNLV 245

Query: 650  LDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPV 471
            LDAC RFG+S KG Q++ELMAQVGV ADAHT  II+ IHE+N MRDELKKFK +ID+V V
Sbjct: 246  LDACARFGSSSKGHQIIELMAQVGVTADAHTISIISLIHEMNGMRDELKKFKKHIDQVSV 305

Query: 470  NLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSH 291
             L+  YQ FY+ +L LHFKFNDI+AAS+L+ D+Y +  S+  QG + +P   C V+IGS 
Sbjct: 306  PLVSCYQQFYESLLCLHFKFNDIDAASDLVQDIYGFQVSHHEQGNETQPPKPCIVAIGSD 365

Query: 290  NMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKL 111
            N+ TGL L   P  L +D+++ V   Q LV +KNGK  LSN+ALAKLI+ YKR G I  L
Sbjct: 366  NLRTGLKLRIFPHSLSRDSVFNVGRNQVLVKYKNGKLVLSNRALAKLIIQYKRGGRINDL 425

Query: 110  SKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            SKLL SIQ    S+E   +CS+V+ ACI +GWL+ A
Sbjct: 426  SKLLCSIQKK-GSVESSRMCSDVVAACICMGWLEIA 460


>ref|XP_009604974.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Nicotiana tomentosiformis]
          Length = 714

 Score =  436 bits (1121), Expect = e-119
 Identities = 245/455 (53%), Positives = 313/455 (68%), Gaps = 3/455 (0%)
 Frame = -2

Query: 1358 ILTCLIFLISMTQSLIAVIHKAALRLTFDRK---FDFIHNLKRFEVLDTYGNLFTRGLSS 1188
            I  C IF  S + S++ V  KA+ R T +R+   F    +   +E       LF R   S
Sbjct: 9    IAVCSIFRKSYS-SIVDVASKAS-RGTCNRRCTVFPRTESSISYENAKPGSELFPRQFCS 66

Query: 1187 IGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITEL 1008
               + E L     S ++LLGKLES+LK H +++AWETYK+FKRLYGFP   LV  L+TEL
Sbjct: 67   -SREPETLSWGVSSDIVLLGKLESALKNHNLEEAWETYKDFKRLYGFPDPSLVGRLLTEL 125

Query: 1007 SYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRS 828
            SYSSDS+ LRKA ++V SI KEK  +L  +LMTKL LS ARAQMPV AS+ILRLML+KR 
Sbjct: 126  SYSSDSRWLRKAYNMVDSILKEKRELLRSELMTKLCLSLARAQMPVQASSILRLMLDKRI 185

Query: 827  LPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVL 648
            LP +D+L M+  HMVKTE G  L+SNILIEI G S     +KSA+ ++ K DT++FNLVL
Sbjct: 186  LPPIDMLGMIIFHMVKTESGMILSSNILIEIYGSSQQLTTKKSAYAKINKHDTLVFNLVL 245

Query: 647  DACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVN 468
            DAC RF +S+KG Q++ELMAQVGV ADAHT  II  I E+N MRDELKKFK++ID+V   
Sbjct: 246  DACARFRSSIKGHQIIELMAQVGVAADAHTISIICLIQEMNGMRDELKKFKEHIDQVSTT 305

Query: 467  LLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHN 288
            L+ HY+ FY+ +LSLHFKFNDINAAS+L+LD+YK  ES+   G +  P   C VSIGS +
Sbjct: 306  LVPHYRQFYESLLSLHFKFNDINAASDLVLDIYKLQESHNMHGDETPPPKPCLVSIGSDH 365

Query: 287  MNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLS 108
            +  GL L   P  L +++++ V   Q LV++KNGK  LSNKALA+LIV YKR G I +LS
Sbjct: 366  LRMGLKLRIFPHSLSRESVFNVGHNQVLVMYKNGKLVLSNKALARLIVWYKRGGRINELS 425

Query: 107  KLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            KLL +IQ    S E   +CS+V+ ACI + WL+TA
Sbjct: 426  KLLCNIQRK-GSFESSRMCSDVVAACICMEWLETA 459


>ref|XP_004237845.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Solanum lycopersicum] gi|723694042|ref|XP_010320122.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Solanum lycopersicum]
            gi|723694047|ref|XP_010320123.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Solanum lycopersicum] gi|723694050|ref|XP_010320124.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Solanum lycopersicum]
            gi|723694054|ref|XP_010320125.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Solanum lycopersicum]
          Length = 711

 Score =  434 bits (1117), Expect = e-119
 Identities = 241/456 (52%), Positives = 317/456 (69%), Gaps = 4/456 (0%)
 Frame = -2

Query: 1358 ILTCLIFLISMTQSLIAVIHKAALRLTFDRKFDFIH----NLKRFEVLDTYGNLFTRGLS 1191
            I  C +F  S + S++AV   A +RLT++  +  ++    +   +E     G +F+R  S
Sbjct: 9    ITVCSVFRKSYS-SILAVASNA-IRLTYNSTYVPLYLGMESSISYENYKPGGVMFSRQFS 66

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            S   + E L     S ++LLGKLES+L+ H +++AWETYK+FKRLYGFP  FLV  L+T+
Sbjct: 67   SRR-ESETLSWGVSSDVVLLGKLESALRNHNLEEAWETYKDFKRLYGFPDPFLVDKLLTK 125

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            LSYSSDS+ L+KAC++V SI KEK  +L  +LMTKL LS AR QMP+ AS+ILRLMLEK 
Sbjct: 126  LSYSSDSRWLKKACNIVGSILKEKREMLRTELMTKLCLSLARTQMPIQASSILRLMLEKG 185

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLV 651
            +LP +D+L M+  HMVK++ G  ++SNILIEI G S     +KS   EL K +T++FNLV
Sbjct: 186  NLPPIDMLGMIIFHMVKSDTGMIVSSNILIEIYGSSHQLTTKKST--ELNKHNTLLFNLV 243

Query: 650  LDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPV 471
            LDAC RFG+S KG Q++ELMAQVGV ADAHT  II+ IHE+N MRDELKKFK +ID+V V
Sbjct: 244  LDACARFGSSSKGHQIIELMAQVGVTADAHTISIISLIHEMNGMRDELKKFKKHIDQVSV 303

Query: 470  NLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSH 291
             L   YQ FY+ +L LHFKFNDI+AAS L+ D+Y +  S+  QG + +P   C VSIGS 
Sbjct: 304  PLFSCYQQFYESLLCLHFKFNDIDAASNLVQDIYGFQVSHHQQGNETQPPKPCLVSIGSD 363

Query: 290  NMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKL 111
            N+ TGL L   P  L +D+++ V   Q LV++KNGK  LSN+ALAKLI+ YKRCG I  L
Sbjct: 364  NLRTGLKLRIFPHSLSRDSVFNVGRNQVLVMYKNGKLALSNRALAKLIIQYKRCGRINDL 423

Query: 110  SKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            SKLL SIQ    S+E   +CS+V+ ACI +GWL+ A
Sbjct: 424  SKLLCSIQKK-GSVESSRMCSDVVSACICMGWLEIA 458


>ref|XP_007204496.1| hypothetical protein PRUPE_ppa019323mg [Prunus persica]
            gi|462400027|gb|EMJ05695.1| hypothetical protein
            PRUPE_ppa019323mg [Prunus persica]
          Length = 659

 Score =  428 bits (1101), Expect = e-117
 Identities = 217/393 (55%), Positives = 290/393 (73%)
 Frame = -2

Query: 1181 IKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSY 1002
            ++ E+L  E  S  I+L +L+ +LKEHQV++AWE++ +FKRL+GFP  F++  LITEL Y
Sbjct: 14   VQPERLCWEGSSHAIMLKRLKKALKEHQVNEAWESFIDFKRLHGFPEDFVIRELITELCY 73

Query: 1001 SSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLP 822
            SSD   L KACD+VL I KE+  +L  D++ KL+LS AR+QMP PA+ ILR++LEK++LP
Sbjct: 74   SSDPHWLLKACDIVLLILKERSDLLQSDILAKLSLSLARSQMPKPATMILRILLEKQNLP 133

Query: 821  SLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDA 642
             ++VL +V LHMVKT +GT LASN L++IC C    +  KS H +L KP+T+IFNLVLDA
Sbjct: 134  PMNVLCLVVLHMVKTRVGTDLASNFLVQICHCFQRSSVNKSIHAKLVKPNTMIFNLVLDA 193

Query: 641  CVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLL 462
            CVRF  S KGQQ+MELM Q GVVADAH+ IIIA+IHE++  RDE++K+K ++D+V    +
Sbjct: 194  CVRFKLSFKGQQIMELMPQTGVVADAHSIIIIAQIHELSGQRDEIQKYKSHVDQVSAPFM 253

Query: 461  CHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMN 282
             HY+HFYD +LSLHFKFNDI AA+EL+L +  Y+ES   Q  ++  Q S  V IGSHN+ 
Sbjct: 254  QHYRHFYDSLLSLHFKFNDIEAATELVLQMCDYHESLPIQRDRKISQRSYLVPIGSHNLK 313

Query: 281  TGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKL 102
            +GL +  LP+ L  D++ K++ KQ LVL  NGK  LSN+ALAKLI GYK+ G+  KLS++
Sbjct: 314  SGLNMQILPELLLCDSVLKIEGKQELVLCWNGKLVLSNRALAKLINGYKKGGDTCKLSEI 373

Query: 101  LFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            L  IQ    SL    LCS+VIDACI+LGWL+TA
Sbjct: 374  LLKIQKELCSLRGSRLCSDVIDACINLGWLETA 406


>ref|XP_008240720.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Prunus mume]
          Length = 718

 Score =  427 bits (1098), Expect = e-116
 Identities = 217/393 (55%), Positives = 288/393 (73%)
 Frame = -2

Query: 1181 IKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSY 1002
            ++ E L  E  S  I+L  L+ +LKEHQV++AWE++ +FKRL+GFP  F++  LITEL Y
Sbjct: 73   VQPEGLCWEGSSHAIMLKSLKKALKEHQVNEAWESFIDFKRLHGFPEDFVIRKLITELCY 132

Query: 1001 SSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLP 822
            SSD   L KACD+VL I KE+  +L  D++ KL+LS AR++MP PA+ ILR++LEK +LP
Sbjct: 133  SSDPHWLLKACDIVLVILKERSDLLQSDILAKLSLSLARSEMPKPATMILRILLEKENLP 192

Query: 821  SLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDA 642
             ++VL +V LHMVKTE+GT+LASN L++IC C    +  KS H +L KP+T+IFNLVLDA
Sbjct: 193  PMNVLCLVVLHMVKTEVGTHLASNFLVQICHCFQRSSVNKSIHAKLVKPNTMIFNLVLDA 252

Query: 641  CVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLL 462
            CVRF  S KGQQ+MELM Q GVVADAH+ IIIA+IHE+N  RDE++K+K +ID+V    +
Sbjct: 253  CVRFKLSFKGQQIMELMPQTGVVADAHSIIIIAQIHELNGQRDEIQKYKSHIDQVSAPFM 312

Query: 461  CHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMN 282
             HY+HFYD +LSLHFKFNDI AA EL+L +  Y+ES   Q  ++  Q S  V IGSHN+ 
Sbjct: 313  QHYRHFYDSLLSLHFKFNDIEAAIELVLQMCNYHESLPIQRDRKISQRSYLVPIGSHNLK 372

Query: 281  TGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKL 102
            +GL +  LP+ L  D++ K++ KQ LVL+ NGK  LSN+ALAKLI GY+R  +  KLS++
Sbjct: 373  SGLNMQILPELLLCDSVLKIEGKQELVLYWNGKLALSNRALAKLINGYRRGRDTCKLSEI 432

Query: 101  LFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            L  +Q    SL    LCS+VIDACI+LGWL+TA
Sbjct: 433  LLKMQKELCSLRGSRLCSDVIDACINLGWLETA 465


>ref|XP_012075523.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Jatropha curcas] gi|802619714|ref|XP_012075524.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Jatropha curcas]
          Length = 715

 Score =  426 bits (1096), Expect = e-116
 Identities = 223/437 (51%), Positives = 305/437 (69%), Gaps = 3/437 (0%)
 Frame = -2

Query: 1304 IHKAALRLTFDRKFDFIHNLKRFEVLDTYGN---LFTRGLSSIGIKGEKLYQEEPSRLIL 1134
            I K AL      KF    +L+ F V+D + +          S G + E++     SR +L
Sbjct: 29   IQKTALISYCASKFLVEESLRMFPVVDVFCSQRQFVNFHPFSTGTQSERISWGVSSRALL 88

Query: 1133 LGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSDSKCLRKACDLVLS 954
            L KLE SL+ HQVD+AW T+ +FK LYGFP+  LV  LITEL YSSD   L+KA +LV  
Sbjct: 89   LRKLEVSLEHHQVDEAWLTFNDFKSLYGFPTSSLVNRLITELCYSSDPHWLQKAYNLVFG 148

Query: 953  ISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLDVLRMVFLHMVKTE 774
            I KEK  +   +++T L+L  ARAQMP+PAS ILRLMLEK ++PSL V +++ LHMVK++
Sbjct: 149  ILKEKSELFQTEILTTLSLCLARAQMPIPASMILRLMLEKENMPSLSVFQIILLHMVKSK 208

Query: 773  IGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVRFGASLKGQQLMEL 594
            IGTYLASNILI++C C L     K  H ++ +P+T+IFNLVLDAC RF +SLKGQ+++E 
Sbjct: 209  IGTYLASNILIQVCDCLLCLRKNKIDHAKVIRPNTMIFNLVLDACFRFRSSLKGQEILEW 268

Query: 593  MAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHYQHFYDCMLSLHFK 414
            MAQ GVVADA + IIIA+I+E N +RDE+KKFKD+IDRV     C+Y+ FYDC+L+LHFK
Sbjct: 269  MAQTGVVADAQSIIIIAQIYETNGLRDEIKKFKDHIDRVSSPFACYYRQFYDCLLNLHFK 328

Query: 413  FNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGLALHFLPQHLQKDA 234
            F+D+++A+EL+LD+ K+  S   +   ++ Q    VSIGS N+  GL +  +P+ LQKD+
Sbjct: 329  FDDLDSAAELLLDMNKFRVSTPNKNSTKDIQKPYLVSIGSQNLRAGLKIQIMPELLQKDS 388

Query: 233  IYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFSIQNSWSSLEYYSL 54
            + K++ K+ LV+F+NGK  LSN+AL KLI+GYKR G + +L K+L S+Q  +  L   +L
Sbjct: 389  VIKLEDKKELVIFENGKLLLSNRALTKLILGYKRHGRMAELPKVLVSMQKDFQKLGGSNL 448

Query: 53   CSNVIDACIHLGWLQTA 3
            C +VIDACI LGWL+TA
Sbjct: 449  CFDVIDACIRLGWLETA 465


>gb|KDP34852.1| hypothetical protein JCGZ_09140 [Jatropha curcas]
          Length = 691

 Score =  426 bits (1096), Expect = e-116
 Identities = 223/437 (51%), Positives = 305/437 (69%), Gaps = 3/437 (0%)
 Frame = -2

Query: 1304 IHKAALRLTFDRKFDFIHNLKRFEVLDTYGN---LFTRGLSSIGIKGEKLYQEEPSRLIL 1134
            I K AL      KF    +L+ F V+D + +          S G + E++     SR +L
Sbjct: 5    IQKTALISYCASKFLVEESLRMFPVVDVFCSQRQFVNFHPFSTGTQSERISWGVSSRALL 64

Query: 1133 LGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSDSKCLRKACDLVLS 954
            L KLE SL+ HQVD+AW T+ +FK LYGFP+  LV  LITEL YSSD   L+KA +LV  
Sbjct: 65   LRKLEVSLEHHQVDEAWLTFNDFKSLYGFPTSSLVNRLITELCYSSDPHWLQKAYNLVFG 124

Query: 953  ISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLDVLRMVFLHMVKTE 774
            I KEK  +   +++T L+L  ARAQMP+PAS ILRLMLEK ++PSL V +++ LHMVK++
Sbjct: 125  ILKEKSELFQTEILTTLSLCLARAQMPIPASMILRLMLEKENMPSLSVFQIILLHMVKSK 184

Query: 773  IGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVRFGASLKGQQLMEL 594
            IGTYLASNILI++C C L     K  H ++ +P+T+IFNLVLDAC RF +SLKGQ+++E 
Sbjct: 185  IGTYLASNILIQVCDCLLCLRKNKIDHAKVIRPNTMIFNLVLDACFRFRSSLKGQEILEW 244

Query: 593  MAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHYQHFYDCMLSLHFK 414
            MAQ GVVADA + IIIA+I+E N +RDE+KKFKD+IDRV     C+Y+ FYDC+L+LHFK
Sbjct: 245  MAQTGVVADAQSIIIIAQIYETNGLRDEIKKFKDHIDRVSSPFACYYRQFYDCLLNLHFK 304

Query: 413  FNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGLALHFLPQHLQKDA 234
            F+D+++A+EL+LD+ K+  S   +   ++ Q    VSIGS N+  GL +  +P+ LQKD+
Sbjct: 305  FDDLDSAAELLLDMNKFRVSTPNKNSTKDIQKPYLVSIGSQNLRAGLKIQIMPELLQKDS 364

Query: 233  IYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFSIQNSWSSLEYYSL 54
            + K++ K+ LV+F+NGK  LSN+AL KLI+GYKR G + +L K+L S+Q  +  L   +L
Sbjct: 365  VIKLEDKKELVIFENGKLLLSNRALTKLILGYKRHGRMAELPKVLVSMQKDFQKLGGSNL 424

Query: 53   CSNVIDACIHLGWLQTA 3
            C +VIDACI LGWL+TA
Sbjct: 425  CFDVIDACIRLGWLETA 441


>ref|XP_004301723.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Fragaria vesca subsp. vesca]
            gi|764591024|ref|XP_011465204.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g17616
            [Fragaria vesca subsp. vesca]
          Length = 741

 Score =  420 bits (1080), Expect = e-114
 Identities = 221/390 (56%), Positives = 283/390 (72%)
 Frame = -2

Query: 1172 EKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSD 993
            EKL  E  SR  +L +LE +LKEHQV++ WE++ +FKRL+GFP  FL+  LITEL YSSD
Sbjct: 76   EKLCWEGSSRAAMLKRLEVALKEHQVNEVWESFIDFKRLHGFPEGFLIHKLITELCYSSD 135

Query: 992  SKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLD 813
               L+KACDLVL   +E+  +L  D++TKL+LS AR+QMP PA  ILRLMLEKR+LP ++
Sbjct: 136  PYWLQKACDLVLVNLRERSDVLQSDILTKLSLSLARSQMPKPAMMILRLMLEKRNLPPMN 195

Query: 812  VLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVR 633
            VL +V LH+VKTEIGT+LASN LI+IC    +  A+KS H +L +PDT+IFNLVLDACVR
Sbjct: 196  VLCLVVLHLVKTEIGTHLASNFLIQICDHFQSLRAKKSDHTKLLQPDTMIFNLVLDACVR 255

Query: 632  FGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHY 453
            F  +LKGQQ+MELM+  GV ADAH+ +IIARIHE+N  R+E+K +K YID+V    + HY
Sbjct: 256  FKLALKGQQIMELMSATGVAADAHSIVIIARIHELNGQREEIKNYKCYIDQVSAPFVQHY 315

Query: 452  QHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGL 273
              FYD +LSLHFKFND+ AASELIL +    +S L Q  K+  Q S  V IGSHN  +GL
Sbjct: 316  HQFYDSLLSLHFKFNDVVAASELILQMCDDRKSLLIQRDKKNSQRSYLVPIGSHNQKSGL 375

Query: 272  ALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFS 93
             +  +P+ LQKD++ K++ KQ LV++ NGK  LSN+ALAKLI  YK  G+  +LSKLL  
Sbjct: 376  NMQIVPELLQKDSVLKLEGKQELVMYLNGKLVLSNRALAKLITRYKIDGDTSELSKLLHK 435

Query: 92   IQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            IQ    S     L ++VIDACI LGWL+TA
Sbjct: 436  IQKELCSFRGSRLGNDVIDACIQLGWLETA 465


>ref|XP_009345148.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Pyrus x bretschneideri]
          Length = 740

 Score =  419 bits (1077), Expect = e-114
 Identities = 213/389 (54%), Positives = 284/389 (73%)
 Frame = -2

Query: 1169 KLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSDS 990
            +L  E  S  +LL +LE +LKEHQ+++AWE++ +FKRL+GFP  F+V  LITEL YSSD 
Sbjct: 97   RLCWEGSSPTVLLKRLEIALKEHQLNEAWESFIDFKRLHGFPEVFIVRKLITELCYSSDP 156

Query: 989  KCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLDV 810
              L KACD+VL + K++  +L  D++ KL+LS AR+QMP PA+ ILR++LEK +LP L+ 
Sbjct: 157  HWLLKACDVVLEVLKDQSDLLQSDILPKLSLSLARSQMPKPATMILRILLEKDNLPPLNA 216

Query: 809  LRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVRF 630
            L +V LHMVKTE+GT LASN LI+IC      +  KS H +  +PDT+IFNLVLDACVRF
Sbjct: 217  LCLVVLHMVKTEVGTNLASNFLIQICHRFQRLSVNKSGHAKKIQPDTMIFNLVLDACVRF 276

Query: 629  GASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHYQ 450
              S KGQQ++ELM Q GVVADAH+ III++IHE+N  RDE+KK+K +ID+V V LL HY+
Sbjct: 277  KLSFKGQQILELMPQTGVVADAHSVIIISQIHELNGQRDEIKKYKSHIDQVSVALLQHYR 336

Query: 449  HFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGLA 270
             FYD +L+LHFKFNDI AA+EL+L +  Y+ S   Q  ++    S  V IGSHN+ +GL 
Sbjct: 337  QFYDSLLTLHFKFNDIEAATELVLQMCDYHVSLPVQRDRKNSHKSYNVPIGSHNLKSGLQ 396

Query: 269  LHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFSI 90
            +  LP+ LQKD++ KV+ K  LV++ NGK  LSN+ALAKL+ GY++ G+  KLSK+L  +
Sbjct: 397  MQILPELLQKDSVLKVEGKHELVIYWNGKLVLSNRALAKLVNGYRKGGDTCKLSKILLKM 456

Query: 89   QNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            Q    S     LC++VIDACIHLGWL+TA
Sbjct: 457  QKELCSSRGSGLCTDVIDACIHLGWLETA 485


>ref|XP_008392809.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            [Malus domestica] gi|658000706|ref|XP_008392811.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At4g17616 [Malus domestica]
          Length = 714

 Score =  417 bits (1071), Expect = e-113
 Identities = 212/389 (54%), Positives = 282/389 (72%)
 Frame = -2

Query: 1169 KLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSDS 990
            +L  E  S  +LL +L+ +LKEHQV++AWE++ +FKRL+GFP  F+V  LITEL YSSD 
Sbjct: 71   RLCWEGSSPTVLLKRLQIALKEHQVNEAWESFIDFKRLHGFPEVFIVRKLITELCYSSDP 130

Query: 989  KCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLDV 810
              L KACD+ L + K++  +L  D++ KL+LS AR+QMP PA+ ILR++LEK +LP L+ 
Sbjct: 131  HWLLKACDVALEVLKDQSDLLQSDILQKLSLSLARSQMPKPATMILRILLEKDNLPPLNA 190

Query: 809  LRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVRF 630
            L +V LHMVKTE+GT LASN LI+IC      +  KS H +  +PDT+IFNLVLDACVRF
Sbjct: 191  LCLVVLHMVKTEVGTNLASNFLIQICHRFQRLSVNKSGHAKQIQPDTMIFNLVLDACVRF 250

Query: 629  GASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHYQ 450
              S KGQQ++ELM Q GVVADAH+ III++IHE+N  RDE+KK+K +ID+V V LL HY+
Sbjct: 251  KLSFKGQQILELMPQTGVVADAHSVIIISQIHELNGQRDEIKKYKSHIDQVSVALLQHYR 310

Query: 449  HFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGLA 270
             FYD +L+LHFKFNDI AA+EL+L +  Y+ES   Q  ++    S  V IGSHN+ +GL 
Sbjct: 311  QFYDSLLTLHFKFNDIEAATELVLQMCDYHESLPVQRDRKNSHKSYNVPIGSHNLKSGLQ 370

Query: 269  LHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFSI 90
            +  LP+ LQKD++ KV+ K  LV++ NGK  LSN+ALAKL+ GY++ G+   LSK+L  +
Sbjct: 371  MQILPELLQKDSVLKVEGKHELVIYWNGKLVLSNRALAKLVNGYRKGGDTCNLSKILLKM 430

Query: 89   QNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            Q    S     LCS+VIDACIHL WL+TA
Sbjct: 431  QKELCSSRGSGLCSDVIDACIHLXWLETA 459


>gb|KDO39066.1| hypothetical protein CISIN_1g048743mg, partial [Citrus sinensis]
          Length = 653

 Score =  414 bits (1064), Expect = e-112
 Identities = 222/396 (56%), Positives = 286/396 (72%)
 Frame = -2

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            S  ++ EKL  E  SR +LL KLES+ K HQV +AWET+ +F+RL+G P + +V   IT+
Sbjct: 4    SSSVQQEKLSWEGSSREVLLRKLESASKNHQVGEAWETFNDFQRLHGIPERHVVNRFITD 63

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            L YS++   L+KACDLVL I K K  +L  DL+ KL+LS ARAQMPVPAS ILRLML + 
Sbjct: 64   LCYSAEPHWLQKACDLVLKIQKGKADLLQLDLLAKLSLSLARAQMPVPASMILRLMLGRE 123

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLV 651
            +LP  D+L +VF+HMVKTEIGT LASN LI++C   L+ +A KS   EL KPDT+IFNLV
Sbjct: 124  NLPCSDLLLLVFVHMVKTEIGTCLASNFLIQLCDVFLHLSAEKSNGAELIKPDTMIFNLV 183

Query: 650  LDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPV 471
            L ACVRFG+SLKGQ +MELM+Q GVVADAH+ II+A+IHE+N  RDELKKFK YID++  
Sbjct: 184  LHACVRFGSSLKGQHIMELMSQTGVVADAHSIIILAQIHEMNCQRDELKKFKCYIDQLST 243

Query: 470  NLLCHYQHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSH 291
                HYQ FY+ +LSLHFKF+DI+AA ELILD+ +Y E       +++ Q    +SIGS 
Sbjct: 244  PFAHHYQQFYESLLSLHFKFDDIDAAGELILDMNRYREPLPNPKLRQDAQKPYLISIGSP 303

Query: 290  NMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKL 111
            N+  GL L  +P+ L+KD+I K++ KQ LVLF+NGK   SN+A+AKLI GYK+ G   +L
Sbjct: 304  NLRCGLKLQIMPELLEKDSILKMEGKQELVLFRNGKLLHSNRAMAKLINGYKKHGKNSEL 363

Query: 110  SKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            S LL SI+    S    +LCS+VIDA I LG+L+ A
Sbjct: 364  SWLLLSIKKEHHSFGESTLCSDVIDALIQLGFLEAA 399


>ref|XP_007048805.1| Pentatricopeptide repeat superfamily protein, putative isoform 1
            [Theobroma cacao] gi|590710359|ref|XP_007048806.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao] gi|508701066|gb|EOX92962.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao] gi|508701067|gb|EOX92963.1|
            Pentatricopeptide repeat superfamily protein, putative
            isoform 1 [Theobroma cacao]
          Length = 708

 Score =  414 bits (1063), Expect = e-112
 Identities = 213/390 (54%), Positives = 287/390 (73%)
 Frame = -2

Query: 1172 EKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITELSYSSD 993
            E+L  E  +  +LL K+E+SLKE ++D+AWET+ +FKRLYGFP+  LV+  IT+LSYSS 
Sbjct: 63   ERLSWEGSTHAVLLTKIENSLKELKLDEAWETFNDFKRLYGFPNHLLVSRFITQLSYSSS 122

Query: 992  SKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKRSLPSLD 813
               L+KACDLV+ +SKEK   L PD++ KL LS ARAQMP+P+STILRLMLEK  LP ++
Sbjct: 123  PHWLQKACDLVMIVSKEKSYHLQPDILAKLILSLARAQMPIPSSTILRLMLEKEILPPIN 182

Query: 812  VLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNARKSAHIELKKPDTVIFNLVLDACVR 633
            VL +VF HMVKTE+GT +ASN+L++IC   +   + KS +    KPDT+IFNLVLDACVR
Sbjct: 183  VLWLVFQHMVKTEVGTCVASNLLVQICDYYIRFCSEKSHYANFLKPDTMIFNLVLDACVR 242

Query: 632  FGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDRVPVNLLCHY 453
            F +SLKGQQ++ELM++ GVVADAH+  IIA+IHE+N  RDELKKFKD+I  +PV L+ HY
Sbjct: 243  FASSLKGQQIIELMSKTGVVADAHSIDIIAQIHEMNGHRDELKKFKDHIAPLPVPLVSHY 302

Query: 452  QHFYDCMLSLHFKFNDINAASELILDLYKYYESNLFQGGKREPQTSCTVSIGSHNMNTGL 273
            Q FY+C+LSLHFKF+DI+AA+EL+L++ +  ES+     +++ Q    V IGS N+  GL
Sbjct: 303  QQFYECLLSLHFKFDDIDAAAELVLEMNRSRESHPIGELRKDYQKPRFVPIGSQNLRNGL 362

Query: 272  ALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKRCGNIGKLSKLLFS 93
             +  +P+ LQKD+    + K  L+++++ K   SN+ALAKLI GYK+ G I +LSK L S
Sbjct: 363  KIQIVPELLQKDSALIAEGKSDLIMYRDKKLCPSNRALAKLINGYKKHGKINELSKFLLS 422

Query: 92   IQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
            ++    S    SL S+VIDACI LGWL+ A
Sbjct: 423  LKRELCSSGGSSLFSDVIDACITLGWLEIA 452


>ref|XP_010274524.1| PREDICTED: pentatricopeptide repeat-containing protein At4g17616
            isoform X2 [Nelumbo nucifera]
          Length = 687

 Score =  413 bits (1061), Expect = e-112
 Identities = 228/403 (56%), Positives = 288/403 (71%), Gaps = 7/403 (1%)
 Frame = -2

Query: 1190 SIGIKGEKLYQEEPSRLILLGKLESSLKEHQVDKAWETYKEFKRLYGFPSQFLVAGLITE 1011
            S  I+  K+  E  S  ILL KLE++LK+ Q+ +A + + +F+ LYGFP   LV  LITE
Sbjct: 30   SSAIQPGKICWEASSHEILLQKLENALKDQQMGEALDAFNDFRNLYGFPKHSLVRRLITE 89

Query: 1010 LSYSSDSKCLRKACDLVLSISKEKPVILHPDLMTKLALSSARAQMPVPASTILRLMLEKR 831
            LSYSSDS  LRKA DLVL ISKEK   L+ D +T LALS ARAQMP+PAST+LRLM+EK 
Sbjct: 90   LSYSSDSHWLRKAYDLVLLISKEKSTFLNHDCLTLLALSLARAQMPIPASTVLRLMMEKH 149

Query: 830  SLPSLDVLRMVFLHMVKTEIGTYLASNILIEICGCSLNRNA---RKSAHIELKKPDTVIF 660
                 D+LRMVF+HMVKTEIGTYLAS+IL+EIC   LN  A    KS   +L  PDT+IF
Sbjct: 150  KFLQKDILRMVFIHMVKTEIGTYLASDILVEICDFLLNHMAYRREKSFKGKLINPDTMIF 209

Query: 659  NLVLDACVRFGASLKGQQLMELMAQVGVVADAHTAIIIARIHEVNSMRDELKKFKDYIDR 480
            NLVLDACVRF ++LK QQ++EL+AQVGVVADA++ +II+RIHE+N  RDELKKFK++ID 
Sbjct: 210  NLVLDACVRFKSTLKAQQIVELLAQVGVVADANSIVIISRIHEINGQRDELKKFKEHIDV 269

Query: 479  VPVNLLCHYQHFYDCMLSLHFKFNDINAASELILDLYK----YYESNLFQGGKREPQTSC 312
            V    L HY+ FYD +L+LHFKFNDI++AS L+LD+Y          LF   +++ Q   
Sbjct: 270  VSAPFLRHYRQFYDSLLNLHFKFNDIDSASRLVLDMYHERSCCCSDGLFPRDRKDSQNPR 329

Query: 311  TVSIGSHNMNTGLALHFLPQHLQKDAIYKVDSKQGLVLFKNGKFFLSNKALAKLIVGYKR 132
             V +GS N+  GL +   P+ LQKD + +++++  LVLF NGKF LSNKALAKLIVG KR
Sbjct: 330  LVPVGSGNLRAGLRMCIEPELLQKDFVLEMENRPELVLFMNGKFVLSNKALAKLIVGNKR 389

Query: 131  CGNIGKLSKLLFSIQNSWSSLEYYSLCSNVIDACIHLGWLQTA 3
             G +G++SKLL SIQ    SLE   L S+VI+ACI L WL+ A
Sbjct: 390  DGKVGEISKLLISIQKMSGSLE-VDLISDVINACIQLCWLEIA 431


Top