BLASTX nr result
ID: Cheilocostus21_contig00013508
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00013508 (1400 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containi...  564   0.0
ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containi...  472   e-161
ref|XP_020582022.1| pentatricopeptide repeat-containing protein ...  407   e-136
ref|XP_020276341.1| pentatricopeptide repeat-containing protein ...  397   e-132
ref|XP_020685912.1| pentatricopeptide repeat-containing protein ...  395   e-131
gb|PKA66355.1| Pentatricopeptide repeat-containing protein [Apos...  389   e-129
gb|ONK64277.1| uncharacterized protein A4U43_C07F24000 [Asparagu...  389   e-128
ref|XP_008781345.2| PREDICTED: pentatricopeptide repeat-containi...  379   e-126
ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...  365   e-119
ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containi...  348   e-112
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...  347   e-112
dbj|GAV86601.1| PPR domain-containing protein [Cephalotus follic...  344   e-111
ref|XP_020092848.1| uncharacterized protein LOC109713258, partia...  350   e-110
gb|OMO91929.1| hypothetical protein COLO4_18015 [Corchorus olito...  337   e-108
ref|NP_001324067.1| pentatricopeptide (PPR) repeat-containing pr...  335   e-107
ref|XP_015893244.1| PREDICTED: pentatricopeptide repeat-containi...  334   e-107
ref|XP_012077696.1| pentatricopeptide repeat-containing protein ...
335 e-107 gb|OAP09950.1| hypothetical protein AXX17_AT2G12140 [Arabidopsis... 335 e-107 ref|NP_001324066.1| pentatricopeptide (PPR) repeat-containing pr... 335 e-107 ref|NP_565402.1| pentatricopeptide (PPR) repeat-containing prote... 335 e-107 >ref|XP_009408172.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Musa acuminata subsp. malaccensis] Length = 430 Score = 564 bits (1453), Expect = 0.0 Identities = 297/437 (67%), Positives = 337/437 (77%), Gaps = 8/437 (1%) Frame = +3 Query: 63 MALLCTTAGSFSPSAATTRCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSK 242 MALL TA SFSPS+A RCALA KHADRL S L S DDP AADRLIRKFLAASSK Sbjct: 1 MALLWATAASFSPSSAGLRCALAGRRKHADRLVSDLRGASADDPSAADRLIRKFLAASSK 60 Query: 243 PAALHSLSRFISLSSPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMD 422 PAALH+LS F+SLSSPFA P+YERISEASWF+W P LAA+V+ALLEKQGR AEA TLT+D Sbjct: 61 PAALHALSSFLSLSSPFAPPLYERISEASWFSWKPKLAATVVALLEKQGRCAEAETLTLD 120 Query: 423 AVARLKSPRDLALFYCDLLESISEHGLKQSALETYARLREMPF--------RXXXXXXXX 578 AV+R K+ RDLALFYCDL+E SE GL+Q LETYARLRE+PF Sbjct: 121 AVSRSKTHRDLALFYCDLIECFSEQGLEQPVLETYARLREVPFAGRRPYESMIKALCLMG 180 Query: 579 XXXXAEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSN 758 AE KL EMAS GCKPSPF+FR VIQ YG+ GL SEM+ + SME+ G+PIDTVC N Sbjct: 181 MPGEAEAKLKEMASSGCKPSPFEFRSVIQSYGRSGLLSEMRRVVGSMEDAGLPIDTVCVN 240 Query: 759 IVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSID 938 +VLSCYG HG+L EMASWM KMR GI SIRT N VLNSCP VVSI +S+ SLPLS++ Sbjct: 241 VVLSCYGHHGELPEMASWMTKMREKGIVFSIRTFNCVLNSCPRVVSI-ASDAGSLPLSME 299 Query: 939 DLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLI 1118 +L++KLE E++ EALLV EL S VLADI EWSPSGSKLDLHG HV AAY+I Sbjct: 300 ELLQKLENESS------SRTEALLVQELTSSSVLADISEWSPSGSKLDLHGLHVAAAYII 353 Query: 1119 LLNWMQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRK 1298 LL W+QELR RF EE ++PLEISVICGSG+HSERRG+SPIK+LVSEMMF+ +SPMRID K Sbjct: 354 LLKWIQELRRRFQEEDVIPLEISVICGSGKHSERRGRSPIKDLVSEMMFRKSSPMRIDSK 413 Query: 1299 NPGRFAASGKAVREWLC 1349 NPGRF A GKAV 
EW+C Sbjct: 414 NPGRFVARGKAVWEWMC 430 >ref|XP_010935201.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Elaeis guineensis] Length = 434 Score = 472 bits (1214), Expect = e-161 Identities = 244/433 (56%), Positives = 311/433 (71%), Gaps = 16/433 (3%) Frame = +3 Query: 99 PSAATTRCAL--------ACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAAL 254 P+A RCAL A GKH RL S+L T + DP AADRL+RKF+AASSK AAL Sbjct: 10 PAATGPRCALRNSHSSRTAGGGKHIHRLLSSLDTAA--DPSAADRLVRKFVAASSKSAAL 67 Query: 255 HSLSRFISLSSPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVAR 434 H+LS +SLSS FALPIY R+SEA+WF WNP LAA++ A+L QGR EA +L ++V+R Sbjct: 68 HTLSHLLSLSSRFALPIYRRVSEANWFKWNPKLAAAMAAVLVNQGRATEAESLISESVSR 127 Query: 435 LKSPRDLALFYCDLLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX-------- 590 L S +++LFYCDL+E+ SE GLK AL+ Y+RL E+P Sbjct: 128 LNSDLEISLFYCDLIEAFSERGLKDFALDFYSRLHEIPCSVRKPYESMIKALCLMGLPVD 187 Query: 591 AEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLS 770 AE+KL EMA LG +PSPF+FRLV+Q YG+ G F+EM L ME+ G+ IDTVC+N+VLS Sbjct: 188 AEEKLKEMAFLGFRPSPFEFRLVMQSYGKSGSFAEMSRVLGIMEDAGLAIDTVCTNVVLS 247 Query: 771 CYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMK 950 CYG HG+L++M SW+ KM+ GIG S+RT N VLNSCP ++S+V + + +PLSI L+K Sbjct: 248 CYGDHGELAKMVSWIRKMKKLGIGFSVRTFNVVLNSCPTIISMVQ-DVKHIPLSIAALVK 306 Query: 951 KLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNW 1130 K+EE++ +EALLV EL+GS VL DILEWSP KLDLHGFHV +A++ILL W Sbjct: 307 KVEEDSLSL------DEALLVRELVGSSVLVDILEWSPDEGKLDLHGFHVASAFVILLQW 360 Query: 1131 MQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310 ++ELR+RF ++ VPLEISV+CGSG+HS++ G+SP+K LVSEMMFQ NS MRIDRKN GR Sbjct: 361 VEELRIRFRVDEAVPLEISVVCGSGKHSDKIGESPVKMLVSEMMFQLNSSMRIDRKNAGR 420 Query: 1311 FAASGKAVREWLC 1349 F A GKAVR+WLC Sbjct: 421 FVARGKAVRDWLC 433 >ref|XP_020582022.1| pentatricopeptide repeat-containing protein At2g17033 [Phalaenopsis equestris] Length = 426 Score = 407 bits (1047), Expect = e-136 Identities = 213/420 (50%), Positives = 291/420 (69%), 
Gaps = 9/420 (2%) Frame = +3 Query: 117 RCAL-ACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPF 293 RC++ A +GK + RL ++LS +T DP A RL+RKF+A+SSK ++L +LS IS SSPF Sbjct: 16 RCSIPAGAGKPSRRLLNSLS--ATSDPSTAVRLVRKFVASSSKSSSLQALSFLISHSSPF 73 Query: 294 ALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCD 473 +L +Y+ +SE SWF +N L AS+I+LLE + +A TL + + L+SPRDL+LFYCD Sbjct: 74 SLHLYQFLSETSWFQFNSKLIASLISLLEDHHCSLDALTLISQSTSVLRSPRDLSLFYCD 133 Query: 474 LLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGC 629 L+++ S GLK L++YARL+E+PF AE+ L EM S G Sbjct: 134 LIDAFSGRGLKTQVLQSYARLKEIPFSGKRPYQSIIKGMCLMEMPEEAEEFLREMGSSGF 193 Query: 630 KPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMAS 809 KPSPF+FRLV + YG+ G F+EM L+SMEE G+ +DT+ +N VLSCYG HG+LSEM S Sbjct: 194 KPSPFEFRLVFRAYGRAGAFAEMTRVLQSMEENGMALDTLSANTVLSCYGDHGKLSEMVS 253 Query: 810 WMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXX 989 W+ K+R +GIG S RT+NSVLNSCP +++++ + SLPLSID L +K+EE + Sbjct: 254 WIQKIRESGIGFSFRTVNSVLNSCP-TIAMLTDDVLSLPLSIDALFRKVEESSPCS---- 308 Query: 990 XXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKL 1169 NE+LL+ EL+ P+L +L+WS S KLDLHG H+ +AY+I+L WM+E+R K Sbjct: 309 --NESLLLRELVDFPLLCSLLDWSDSEVKLDLHGLHLVSAYVIILQWMREIRSCLVAGKA 366 Query: 1170 VPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349 VPLE S+ICGSG+HS RG+SP+K LVS MMFQ SP++IDRKN G+F A GK V++WLC Sbjct: 367 VPLEFSIICGSGKHSRSRGESPVKKLVSVMMFQLKSPLKIDRKNVGKFVAKGKKVKDWLC 426 >ref|XP_020276341.1| pentatricopeptide repeat-containing protein At2g17033 [Asparagus officinalis] Length = 430 Score = 397 bits (1019), Expect = e-132 Identities = 214/413 (51%), Positives = 275/413 (66%), Gaps = 10/413 (2%) Frame = +3 Query: 141 KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320 KH+DRL S+LS S+ DP AA +IR+F+++SSK AL +LS +SLSSPF+LP Y RIS Sbjct: 32 KHSDRLLSSLS--SSCDPSAAAHVIRRFVSSSSKSTALRTLSLLLSLSSPFSLPFYRRIS 89 Query: 321 EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500 + WF W P L A 
VIALLE G EAR L ++V RL S R++A FYCDL+ + S G Sbjct: 90 ASHWFKWTPKLVAEVIALLESDGHPLEARELVSESVLRLSSQREIAHFYCDLVVAASGRG 149 Query: 501 LKQSALETYARLREMPF--------RXXXXXXXXXXXXAEDKLNEMASLGCKPSPFQFRL 656 LK+ LE +++ F AE L+EM LG KPS F++R+ Sbjct: 150 LKEFVLECCGWIKDTGFVGKRVFECMVRGLSLVGMVEDAEKVLDEMGHLGFKPSGFEYRV 209 Query: 657 VIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTG 836 VIQGYG+LG F EM+ + ME I IDTVC+N+VLSCYG +G L+EM +W+ MR G Sbjct: 210 VIQGYGRLGSFKEMRRVIGRMENAEIGIDTVCANLVLSCYGDYGNLAEMVTWIRNMRLLG 269 Query: 837 IGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVG 1016 I S+RT NSVLNSCP VVS+V + +LPLS+DDL++ L +E EALLV Sbjct: 270 ISYSVRTCNSVLNSCPSVVSMV-KDLENLPLSMDDLLEMLNDE-----------EALLVK 317 Query: 1017 ELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQEL--RLRFCEEKLVPLEISV 1190 E+ G+ VL + L WS S KLDLHGFH+ +AY+ILL WM+E RLR EE +PLEISV Sbjct: 318 EMAGTSVLLEKLTWSDSEGKLDLHGFHLASAYIILLQWMEEFRRRLRVKEEVPIPLEISV 377 Query: 1191 ICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349 +CG G+HS RG+SP+K LVS++M + SPM+IDRKN GRF A GKAV++WLC Sbjct: 378 VCGLGKHSIMRGESPVKKLVSKLMSRLKSPMKIDRKNVGRFVAKGKAVKDWLC 430 >ref|XP_020685912.1| pentatricopeptide repeat-containing protein At2g17033 [Dendrobium catenatum] gb|PKU84129.1| Pentatricopeptide repeat-containing protein [Dendrobium catenatum] Length = 426 Score = 395 bits (1014), Expect = e-131 Identities = 207/413 (50%), Positives = 280/413 (67%), Gaps = 8/413 (1%) Frame = +3 Query: 135 SGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYER 314 + K + RL ++LST S D AA RL+RKF+A+SSK +L +LS IS SSPF+L +Y+ Sbjct: 23 ASKRSHRLLTSLSTAS--DSSAAIRLVRKFVASSSKSTSLQALSFLISHSSPFSLHLYQT 80 Query: 315 ISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISE 494 +SEA WF +NP LAAS+I+LLE Q T +A TL + + L+ PR LALFYCDL+++ S+ Sbjct: 81 LSEAPWFQFNPKLAASLISLLEDQHCTVDALTLLSQSASGLRLPRHLALFYCDLIDAFSD 140 Query: 495 HGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQF 650 GLK +YARL+E+PF AE L EM G KPSPF+F Sbjct: 141 
RGLKVQVHRSYARLKEIPFSGRRPYESMIKGMCLMKMPEEAEVFLREMGLAGFKPSPFEF 200 Query: 651 RLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRS 830 R V+Q YG++G F+EM L SM+E G+ +DT+ +N VLSCYG HG+LSEM SW+ K R Sbjct: 201 RQVLQAYGRVGAFAEMTRVLESMQENGMALDTLSANTVLSCYGDHGKLSEMVSWIQKTRE 260 Query: 831 TGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALL 1010 +G+G SIRT NSVLNSCP +V +++ N SLP SI+ L +K++E + NE+LL Sbjct: 261 SGVGFSIRTFNSVLNSCPTIV-MITKNVSSLPPSIEALFRKVDESSPCL------NESLL 313 Query: 1011 VGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISV 1190 EL+ P+L+ +L+WS S KLDLHG H+ +AY+I+L WM+++R KLVPLE S+ Sbjct: 314 FRELVNFPLLSAMLDWSDSEVKLDLHGLHLVSAYVIILLWMEKIRSCLVAGKLVPLEFSI 373 Query: 1191 ICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349 +CGSG+HS+ G+SP+K LVS MMFQ SP++IDR N G+F A GK VR+WLC Sbjct: 374 VCGSGKHSKIIGESPVKKLVSVMMFQLKSPLKIDRNNAGKFVAKGKKVRDWLC 426 >gb|PKA66355.1| Pentatricopeptide repeat-containing protein [Apostasia shenzhenica] Length = 426 Score = 389 bits (999), Expect = e-129 Identities = 203/410 (49%), Positives = 284/410 (69%), Gaps = 8/410 (1%) Frame = +3 Query: 141 KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320 + + RL S+LS + D +ADRL+RKF+A+SSKP AL SLS FISLSSPF+L +Y+ I+ Sbjct: 25 RRSHRLLSSLSAAA--DFSSADRLLRKFVASSSKPDALQSLSLFISLSSPFSLLLYQAIA 82 Query: 321 EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500 + WF WNP LAASV++LLE+Q R+ +A L + +RL++PRDL FYC+L+++ S Sbjct: 83 DTPWFRWNPKLAASVVSLLEEQQRSTDAEALISRSTSRLRAPRDLPAFYCELIDAFSCRW 142 Query: 501 LKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQFRL 656 L+ AL ++ARLRE+P+ AE+ L EMA G +PS F+FR Sbjct: 143 LQLPALRSFARLREIPYSGRKPYESIIKGLCSMGMATDAEELLREMAIAGFRPSAFEFRS 202 Query: 657 VIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTG 836 V Q YG+ G F++M SM+E GI +DTV +NI LSCYG H + S M S++ KMR +G Sbjct: 203 VAQAYGRSGAFADMTRVFESMQEAGIVLDTVSANIALSCYGDHFKFSVMVSFLRKMRESG 262 Query: 837 IGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVG 1016 I S+RT 
NSVLNSCP VV+I + + RSLPLS+ L++K+EE + +EALL+ Sbjct: 263 IIFSLRTFNSVLNSCPSVVTI-TKDLRSLPLSMAALLRKVEEASLCL------DEALLIR 315 Query: 1017 ELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISVIC 1196 E+ S ++ D+L+WS S KLDLHGFH+ +AY+++L W++ +R R E+ ++PLEIS+IC Sbjct: 316 EVTDSSLMGDMLQWSDSEGKLDLHGFHLASAYVMILMWIEVVRDRLSEDGIIPLEISIIC 375 Query: 1197 GSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWL 1346 GSG++S +G SP+K LVSEMMFQ SPM++DRKN G+F A GKAV++WL Sbjct: 376 GSGKNSRMKGDSPLKKLVSEMMFQLYSPMKMDRKNVGKFVAKGKAVKDWL 425 >gb|ONK64277.1| uncharacterized protein A4U43_C07F24000 [Asparagus officinalis] Length = 448 Score = 389 bits (1000), Expect = e-128 Identities = 214/431 (49%), Positives = 275/431 (63%), Gaps = 28/431 (6%) Frame = +3 Query: 141 KHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSPFALPIYERIS 320 KH+DRL S+LS S+ DP AA +IR+F+++SSK AL +LS +SLSSPF+LP Y RIS Sbjct: 32 KHSDRLLSSLS--SSCDPSAAAHVIRRFVSSSSKSTALRTLSLLLSLSSPFSLPFYRRIS 89 Query: 321 EASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHG 500 + WF W P L A VIALLE G EAR L ++V RL S R++A FYCDL+ + S G Sbjct: 90 ASHWFKWTPKLVAEVIALLESDGHPLEARELVSESVLRLSSQREIAHFYCDLVVAASGRG 149 Query: 501 LKQSALETYARLREMPF--------------------------RXXXXXXXXXXXXAEDK 602 LK+ LE +++ F AE Sbjct: 150 LKEFVLECCGWIKDTGFVGKRVFECMVRGLSLVGMVEDAEKVLDEMGHLGVGMVEDAEKV 209 Query: 603 LNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQ 782 L+EM LG KPS F++R+VIQGYG+LG F EM+ + ME I IDTVC+N+VLSCYG Sbjct: 210 LDEMGHLGFKPSGFEYRVVIQGYGRLGSFKEMRRVIGRMENAEIGIDTVCANLVLSCYGD 269 Query: 783 HGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEE 962 +G L+EM +W+ MR GI S+RT NSVLNSCP VVS+V + +LPLS+DDL++ L + Sbjct: 270 YGNLAEMVTWIRNMRLLGISYSVRTCNSVLNSCPSVVSMV-KDLENLPLSMDDLLEMLND 328 Query: 963 ETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQEL 1142 E EALLV E+ G+ VL + L WS S KLDLHGFH+ +AY+ILL WM+E Sbjct: 329 E-----------EALLVKEMAGTSVLLEKLTWSDSEGKLDLHGFHLASAYIILLQWMEEF 377 Query: 1143 
--RLRFCEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFA 1316 RLR EE +PLEISV+CG G+HS RG+SP+K LVS++M + SPM+IDRKN GRF Sbjct: 378 RRRLRVKEEVPIPLEISVVCGLGKHSIMRGESPVKKLVSKLMSRLKSPMKIDRKNVGRFV 437 Query: 1317 ASGKAVREWLC 1349 A GKAV++WLC Sbjct: 438 AKGKAVKDWLC 448 >ref|XP_008781345.2| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Phoenix dactylifera] Length = 330 Score = 379 bits (973), Expect = e-126 Identities = 191/335 (57%), Positives = 246/335 (73%), Gaps = 8/335 (2%) Frame = +3 Query: 369 ALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLESISEHGLKQSALETYARLREMP 548 A+L QGR AEA +L ++V+RL S +++LFYCDL+E+ SE GLK AL+ Y RLREMP Sbjct: 3 AVLVNQGRAAEAESLISESVSRLNSDLEISLFYCDLIEAFSERGLKDLALDFYFRLREMP 62 Query: 549 FRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKS 704 AE+KL EMA LG +PSPF+FRLV+Q YG+LG F+EM+ Sbjct: 63 CSRRKPYESMIKALCLMGLPVDAEEKLKEMALLGFRPSPFEFRLVLQSYGKLGSFAEMRR 122 Query: 705 ALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCP 884 L ME+ G+ +DT+C+N+VLSCYG HG+L+EM SW+ KM+ G+G SIRT N VLNSCP Sbjct: 123 VLGIMEDAGLAVDTICTNVVLSCYGDHGELAEMVSWIRKMKKLGVGFSIRTFNVVLNSCP 182 Query: 885 MVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSP 1064 ++SIV + + PLSI L+KK+EE++ +EALLV EL+GS VL DILEWSP Sbjct: 183 TIISIVQ-DAKHFPLSIAALVKKVEEDSPSP------DEALLVRELVGSSVLVDILEWSP 235 Query: 1065 SGSKLDLHGFHVTAAYLILLNWMQELRLRFCEEKLVPLEISVICGSGRHSERRGQSPIKN 1244 + KLDLHGFHV++AY+ILL WM+ELR+RF +++VPLEISV+CGSG+ S++ G+SP+K Sbjct: 236 NEGKLDLHGFHVSSAYVILLQWMEELRMRFRVDEVVPLEISVVCGSGKKSDKIGESPVKM 295 Query: 1245 LVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349 LVSEMMFQ NS MRIDRKN GRF A GKAVR+WLC Sbjct: 296 LVSEMMFQLNSSMRIDRKNAGRFVAQGKAVRDWLC 330 >ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Nelumbo nucifera] Length = 451 Score = 365 bits (937), Expect = e-119 Identities = 201/431 (46%), Positives = 273/431 (63%), Gaps = 21/431 (4%) Frame = +3 Query: 120 CALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS----- 284 CAL+ K R F++L+ 
+ D AA+RLIRKF+A+SSK AL++LS IS + Sbjct: 31 CALS---KKGHRFFTSLAAAAGDSA-AANRLIRKFVASSSKSDALNALSHLISSNTTHFH 86 Query: 285 -SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLA 458 S LP+Y RI+E WFNWNP L ASVIA L+KQG+ EA L ++V +L RD+A Sbjct: 87 LSSLVLPMYRRIAETPWFNWNPKLVASVIAYLDKQGQPEEAEALISESVQKLGFQERDVA 146 Query: 459 LFYCDLLESISEHGLKQSALETYARLREM-------------PFRXXXXXXXXXXXXAED 599 LFYCDL++S S+ + E+YARL+++ AE+ Sbjct: 147 LFYCDLIDSYSKQRSRIGVFESYARLKQLFSDSSSSLSRRAYETIICSLCSVDLPRDAEN 206 Query: 600 KLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYG 779 + EM G KPS F+FR ++ GYG+LGLF++M+ LR ME+ G +DT+CSN+VLS +G Sbjct: 207 MVEEMTISGFKPSAFEFRSLVSGYGRLGLFTDMRRVLRKMEDAGYCLDTICSNMVLSSFG 266 Query: 780 QHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLE 959 H +LSEMASW+ KM+ + I SIRT NSV+NSCP + S++ + + +PLS++DL +L+ Sbjct: 267 AHSELSEMASWLRKMKDSNISFSIRTYNSVMNSCPTITSLL-KDLKFVPLSMEDLKGRLQ 325 Query: 960 EETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQE 1139 ++ E LLV +LIGS VL D L+W PS KLDLHG H+ AYLI+L W+Q Sbjct: 326 KD-----------ETLLVEQLIGSSVLMDALKWCPSEGKLDLHGMHLATAYLIMLQWVQV 374 Query: 1140 LRLRFCEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFA 1316 LR RF ++P E VICGSG+HS RG+SP+K LV +MM + SPM+IDR N G F Sbjct: 375 LRSRFSAGNWVIPTEFRVICGSGKHSSVRGESPVKALVKQMMVRMKSPMKIDRNNVGCFV 434 Query: 1317 ASGKAVREWLC 1349 GKAVR+WLC Sbjct: 435 GRGKAVRDWLC 445 >ref|XP_011045590.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Populus euphratica] Length = 473 Score = 348 bits (893), Expect = e-112 Identities = 193/432 (44%), Positives = 265/432 (61%), Gaps = 24/432 (5%) Frame = +3 Query: 126 LACSGKHADRLFSA-LSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLSSP---- 290 LA K A R FSA L TV+ D A +RLI+KF+A+S K AL +LS +S S Sbjct: 53 LAAISKQAQRFFSAVLPTVAARDTSATNRLIKKFVASSPKSIALDALSHLLSPDSTHHPL 112 Query: 291 ---FALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLA 458 LP+Y +ISEASWF+WNP L A V+ LL+KQG E + L + V+RL+ R+L Sbjct: 113 
LYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKERELV 172 Query: 459 LFYCDLLESISEHGLKQSALETYARLREM--------------PFRXXXXXXXXXXXXAE 596 LFYC+L+ S+H + ++Y+RL + AE Sbjct: 173 LFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSESKSVYVKKQGYKAMISGLCEMGRAREAE 232 Query: 597 DKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCY 776 D + EM G KP+ F+FR V+ GYG+LGLF +M+ L ME I +DTVC+N+VL+ Y Sbjct: 233 DLIGEMRERGLKPTLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVLASY 292 Query: 777 GQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKL 956 G H L EM W+ KM++ GI +SIRT NSVLNSCP +++++ + S P+SI +L+K L Sbjct: 293 GAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELLKIL 352 Query: 957 EEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQ 1136 E+ EA+LV ELI S VL + +EW S KLDLHG H+ +AY+I+L WM+ Sbjct: 353 SED-----------EAMLVKELIESSVLKEAVEWDTSEGKLDLHGMHLGSAYVIMLQWME 401 Query: 1137 ELRLRFCE-EKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRF 1313 E R R + E ++P EI+V+CGSG HS RG+SP+K++++E+M QT SPMRIDRKN G F Sbjct: 402 ETRNRLSDGEHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCF 461 Query: 1314 AASGKAVREWLC 1349 A G V++WLC Sbjct: 462 VAKGNVVKKWLC 473 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] emb|CBI37819.3| unnamed protein product, partial [Vitis vinifera] Length = 435 Score = 347 bits (889), Expect = e-112 Identities = 201/433 (46%), Positives = 273/433 (63%), Gaps = 22/433 (5%) Frame = +3 Query: 117 RCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS---- 284 +CAL+ G+ LF LS+V+ D P A++RLI KF+A+SSK AL++LS +S + Sbjct: 22 QCALSKQGQ----LF--LSSVARD-PSASNRLICKFIASSSKSIALNALSHLLSPTTTHP 74 Query: 285 --SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDL 455 S ALP+Y RISEASWF+WNP L A VIALL KQG+ EA TL + + +L S RDL Sbjct: 75 YLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDL 134 Query: 456 ALFYCDLLESISEHGLKQSALETYARL--------------REMPFRXXXXXXXXXXXXA 593 FYC+L++S S+H Q + +RL R A Sbjct: 135 
VSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEA 194 Query: 594 EDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSC 773 E+ + EM G KPS F+FR V+ GYG++GL +M+ L M G +DTV SN+VLS Sbjct: 195 ENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSS 254 Query: 774 YGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKK 953 YG + + SEM SW+ +M+++ I SIRT NSVLNSCPM++SI+ + ++ P +ID+LM+ Sbjct: 255 YGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSIL-QDLKTFPPTIDELMET 313 Query: 954 LEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133 L+ +EALLV ELIGS VLA+++EW S KLDLHG H+ +AYLI+L W Sbjct: 314 LK-----------GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWR 362 Query: 1134 QELRLRF-CEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310 +ELR R E ++P+EI+V+CGSG+HS RG+SP+K +V EMM +T SPM+IDRKN G Sbjct: 363 EELRYRLNAAEYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGC 422 Query: 1311 FAASGKAVREWLC 1349 F A K V+ WLC Sbjct: 423 FVAKAKVVKNWLC 435 >dbj|GAV86601.1| PPR domain-containing protein [Cephalotus follicularis] Length = 451 Score = 344 bits (883), Expect = e-111 Identities = 193/433 (44%), Positives = 266/433 (61%), Gaps = 22/433 (5%) Frame = +3 Query: 117 RCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS---- 284 +CA + + + R S L+ + + + A RLI KF+A+S K AL++LS +SL Sbjct: 32 QCAASLTTR-GHRFISTLAAATNEPQVVAHRLISKFVASSPKSVALNALSHLLSLDTSQP 90 Query: 285 --SPFALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDL 455 S ALP+Y RISEA WFNWNP L A ++ALL+KQG+ ++++ L + +++L+ RDL Sbjct: 91 HLSSLALPLYSRISEAPWFNWNPKLVADLVALLDKQGQYSQSQALIFETISKLQFKERDL 150 Query: 456 ALFYCDLLESI----SEHGLKQSAL----------ETYARLREMPFRXXXXXXXXXXXXA 593 ALFYC+L+ES SEHG S + Y + + A Sbjct: 151 ALFYCNLIESHAKNKSEHGFNDSYICLNEVIRKSCSVYVKSQGYKSMVSALCEMGEPHEA 210 Query: 594 EDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSC 773 E+ + EM G K S F+FR V+ GYG+LGLF +M + ME G +DTV SN+VLS Sbjct: 211 ENVVEEMRVNGLKLSLFEFRCVLYGYGRLGLFEDMLRIVEQMESEGFQVDTVSSNMVLSS 270 Query: 774 
YGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKK 953 YG H LS+M SW+ +++S GI S+RT NSVLNSCPM++S++ + +SLPLS+ +L Sbjct: 271 YGAHNALSDMLSWLQQLKSLGIPFSVRTYNSVLNSCPMMISML-QDLKSLPLSLKELTVT 329 Query: 954 LEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133 L + EALLV EL SPVL +EW +KLDLHG H+ +AYLI+L WM Sbjct: 330 LNND-----------EALLVKELTQSPVLDGAIEWGALEAKLDLHGMHLGSAYLIMLQWM 378 Query: 1134 QELRLRFCEEKL-VPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310 E+R RF + K +P EI ++CGSG+HS RG+SP+K +V E+M +T SPMRIDRKN G Sbjct: 379 DEMRNRFKDGKFALPAEIILVCGSGKHSSVRGESPVKGMVREIMVRTRSPMRIDRKNIGC 438 Query: 1311 FAASGKAVREWLC 1349 F A GK VR+WLC Sbjct: 439 FIAKGKVVRDWLC 451 >ref|XP_020092848.1| uncharacterized protein LOC109713258, partial [Ananas comosus] Length = 720 Score = 350 bits (899), Expect = e-110 Identities = 180/357 (50%), Positives = 239/357 (66%), Gaps = 9/357 (2%) Frame = +3 Query: 306 YERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKSPRDLALFYCDLLES 485 Y RI E SWF+WN L A V ALLE+ G+ +A L +++ ++SPRDLALFYC+L+ES Sbjct: 371 YARIRETSWFSWNSKLTADVAALLEQLGQCFDAEHLVSSSISTIRSPRDLALFYCNLIES 430 Query: 486 ISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------AEDKLNEMASLGCKPSP 641 S GL+Q ++ + LR +PF AE L EMA LG KPS Sbjct: 431 YSGRGLRQKVVDICSSLRNLPFSGRKPYKSMIKGFCLLDMPEEAEANLQEMALLGLKPSA 490 Query: 642 FQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQLSEMASWMAK 821 F+FRL+ Q YG+ G FSEMK + ME+ G +DTVC+N+VLSCYG G+L EM W+ Sbjct: 491 FEFRLIAQSYGKPGSFSEMKRVIGLMEDAGFSVDTVCANVVLSCYGDRGELPEMVEWLKW 550 Query: 822 MRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAXXXXXXXXNE 1001 MR GI S+RT N+VLNSC ++V +V + +LPLSI+ L+ KLE E+A E Sbjct: 551 MRELGIDFSVRTFNTVLNSCSVIVGMV-RDLDTLPLSIEQLLDKLESESASVV------E 603 Query: 1002 ALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF-CEEKLVPL 1178 A L+ +LI S +L ++LEWSP+ KLDLHGFH T+A++I+L ++ ELR R E +VP Sbjct: 604 AALIRKLIDSALLVEMLEWSPAEGKLDLHGFHATSAFVIMLQFVDELRSRLSAENAVVPA 663 Query: 1179 EISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKAVREWLC 1349 EISV+CGSG+HS+ 
RG+SP+K +VSEM+F+TNS MR+DRKN GRF GKAV+EWLC Sbjct: 664 EISVVCGSGKHSDVRGRSPVKMVVSEMLFRTNSVMRLDRKNSGRFVGRGKAVKEWLC 720 >gb|OMO91929.1| hypothetical protein COLO4_18015 [Corchorus olitorius] Length = 467 Score = 337 bits (864), Expect = e-108 Identities = 188/427 (44%), Positives = 265/427 (62%), Gaps = 25/427 (5%) Frame = +3 Query: 141 KHADRLFSALS-TVSTDDPLAADRLIRKFLAASSKPAALHSLSRFISLS------SPFAL 299 K R FS+L+ T +DP AA+R+I+KF+A+S K AL++LS +S S A Sbjct: 50 KQGQRFFSSLAATAGVNDPAAANRIIKKFVASSPKAIALNALSHLLSTRNSHPHLSAIAF 109 Query: 300 PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLALFYCDL 476 P+Y +I+E SWF+WNP L A +IALL+KQGR E L AV++LK RDL FYC+L Sbjct: 110 PLYTKITETSWFDWNPKLVADLIALLDKQGRYDETEALISQAVSKLKFRERDLVQFYCNL 169 Query: 477 LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614 +ES S+H KQ + Y L E+ AE+ EM Sbjct: 170 IESCSKHDSKQGFNDAYGYLSELVRNSSSLYVKRQGYKSLVSSFCEMGQPNEAENVFEEM 229 Query: 615 ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794 G KPS F+FR +I GYG++G F +M+ + ME G +DT+CSN+VLS YG + L Sbjct: 230 RKNGVKPSSFEFRFIIYGYGKMGFFEDMERMVSEMEIAGFEVDTICSNMVLSSYGDYNAL 289 Query: 795 SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974 ++M SW+ KM++ I S+RT NSVLNSCP ++S+V + +LPLS+ +L+K L+E+ Sbjct: 290 AKMVSWLQKMKTLQIPFSVRTYNSVLNSCPGIMSLV-QDINNLPLSLGELVKVLKED--- 345 Query: 975 XXXXXXXNEALLVGELI-GSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLR 1151 EALLV EL+ S VL + +E S +KLDLHG H+ +AYLI+L W++E++ R Sbjct: 346 --------EALLVKELVESSAVLDNAVECDVSEAKLDLHGMHLGSAYLIMLQWIEEMKCR 397 Query: 1152 F-CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASG 1325 F EEK ++P +I+++CGSG+HS RG+SP+K+L+ +MM Q SPM+IDRKN G F A G Sbjct: 398 FKAEEKCVIPAQITIVCGSGKHSSVRGESPVKSLLKKMMVQMKSPMKIDRKNIGCFTAKG 457 Query: 1326 KAVREWL 1346 V+ WL Sbjct: 458 HVVKNWL 464 >ref|NP_001324067.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] gb|ANM61875.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] Length = 470 Score = 335 bits 
(860), Expect = e-107 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%) Frame = +3 Query: 141 KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299 KH DR S+LS+ + DP A +R I+KF+AAS K AL+ LS +S S P FAL Sbjct: 56 KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 115 Query: 300 PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476 +Y I+EASWF+WNP L A +IALL KQ R E+ TL AV+RLKS RD LF C+L Sbjct: 116 SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 175 Query: 477 LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614 +ES S+ G Q E RLRE+ R AE + EM Sbjct: 176 VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 235 Query: 615 ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794 KP F+++ V+ GYG+LGLF +M + M G IDTVCSN+VLS YG H L Sbjct: 236 RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 295 Query: 795 SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974 +M SW+ K++ + SIRT NSVLNSCP ++S++ + S P+S+ +L L E+ Sbjct: 296 PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 351 Query: 975 XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154 EALLV EL S VL + +EW+ KLDLHG H++++YLILL WM E RLRF Sbjct: 352 --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 403 Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331 EEK ++P EI V+ GSG+HS RG+SP+K LV ++M +T SPMRIDRKN G F A GK Sbjct: 404 SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 463 Query: 1332 VREWLC 1349 V+EWLC Sbjct: 464 VKEWLC 469 >ref|XP_015893244.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Ziziphus jujuba] Length = 450 Score = 334 bits (857), Expect = e-107 Identities = 189/439 (43%), Positives = 270/439 (61%), Gaps = 22/439 (5%) Frame = +3 Query: 99 PSAATTRCALACSGKHADRLFSALSTVSTDDPLAADRLIRKFLAASSKPAALHSLSRFIS 278 PS+++ +CAL+ G R S LS V+ DP A+ +LI KF+ +SSK AL++LS +S Sbjct: 28 
PSSSSIKCALSKQGL---RFISTLS-VNAGDPSASAKLIGKFVGSSSKSIALNALSHLLS 83 Query: 279 LSSP------FALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK 440 + ALP+Y +I EASWF NP L A++ ALL+KQGR +E+ TL + +A L Sbjct: 84 PDTTHPHLTSLALPLYSKIKEASWFERNPKLVAAMAALLDKQGRHSESETLISETIAELG 143 Query: 441 S-PRDLALFYCDLLESISE--------------HGLKQSALETYARLREMPFRXXXXXXX 575 + R+LALFYC L+ES S+ H L ++ Y + R + Sbjct: 144 NRERELALFYCQLVESHSKQNSGHGFERSYTYLHHLLHNSSSVYVKRRALESMVGGLCTM 203 Query: 576 XXXXXAEDKLNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCS 755 AE + EM +G KPS F+ R V+ GYG+LGL EM ++ M+ G+ IDT+ S Sbjct: 204 DRPIEAESLIEEMRVVGLKPSVFELRSVMYGYGRLGLLKEMLRIVQQMDNGGLAIDTISS 263 Query: 756 NIVLSCYGQHGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSI 935 N+VLS G H +LSEM W+ KM++ I S RT N+VLNSCP ++ I+ N+ +P SI Sbjct: 264 NMVLSSLGIHNELSEMVLWLRKMKTFNIPFSTRTYNTVLNSCPTIMEIL-QNSDHIPFSI 322 Query: 936 DDLMKKLEEETAXXXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYL 1115 ++L L+ +EALLV EL+GS VL ++++W +KLDLHG H+ +AYL Sbjct: 323 EELKGVLK-----------GDEALLVDELVGSGVLKEVMKWDSLEAKLDLHGLHLGSAYL 371 Query: 1116 ILLNWMQELRLRFCEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRID 1292 I+L WM+E++ RF EK ++P E++V+CG G+HS RG SP+K ++ EMM +T SPMRID Sbjct: 372 IMLEWMEEMKCRFNNEKHVLPAEVTVVCGVGKHSNFRGVSPVKVMIKEMMARTRSPMRID 431 Query: 1293 RKNPGRFAASGKAVREWLC 1349 RKN G F A G+AV++WLC Sbjct: 432 RKNAGCFIAKGRAVKDWLC 450 >ref|XP_012077696.1| pentatricopeptide repeat-containing protein At2g17033 [Jatropha curcas] Length = 473 Score = 335 bits (859), Expect = e-107 Identities = 191/432 (44%), Positives = 262/432 (60%), Gaps = 26/432 (6%) Frame = +3 Query: 129 ACSGKHADRLFSALSTVSTD-DPLAADRLIRKFLAASSKPAALHSLSRFISLSSPF---- 293 A K R S+L+T + D A + LI+KF+AAS K AL +LS +S +S + Sbjct: 52 AALSKQGQRFLSSLATATAARDNSATNSLIKKFVAASPKSIALDALSHLLSPNSSYSHLS 111 Query: 294 --ALPIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLK-SPRDLALF 464 A P+Y +I EA WF+WNP L A V+ALL+KQG+ E+ TL D++++LK RDLALF Sbjct: 112 SLAFPLYLKIQEAHWFDWNPKLVAEVVALLDKQGQYNESGTLISDSISKLKLRERDLALF 171 
Query: 465 YCDLLESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDK 602 YC+L+ES S+ Q +++ARL ++ F A+D Sbjct: 172 YCNLVESHSKQNCVQGFEDSFARLNQLVFSSNSVYIKKQAYKSMISGLCEMGRPKEAQDL 231 Query: 603 LNEMASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQ 782 + EM G KPS ++FR V+ YG+LGLF EM+ L ME G +DTVCSN+VLS YG Sbjct: 232 IEEMRGKGVKPSVYEFRCVLHAYGKLGLFQEMQMILDQMESGGFKVDTVCSNMVLSSYGV 291 Query: 783 HGQLSEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIV-SSNTRSLPLSIDDLMKKLE 959 + L E+ SW+ KM+ GI S RT NSVLNSCP ++S V +SN + P+SI +LMK L Sbjct: 292 YNALPEIVSWLKKMKDLGIPFSSRTCNSVLNSCPTMMSTVQNSNANTYPISIQELMKILR 351 Query: 960 EETAXXXXXXXXNEALLVGELI--GSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWM 1133 +EA++V ELI S VL + ++W SKLDLHG H+ +AYLI+L W Sbjct: 352 -----------GDEAMVVNELIIGSSSVLEEAMQWDALESKLDLHGMHLCSAYLIMLLWF 400 Query: 1134 QELRLRF-CEEKLVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGR 1310 +E++ RF ++P EI+V+CGSG HS RG+SP+K ++ +M QT SPMR+DRKN G Sbjct: 401 EEMKNRFNGGNYVIPAEITVVCGSGNHSIVRGESPVKRMIKSIMVQTRSPMRVDRKNLGC 460 Query: 1311 FAASGKAVREWL 1346 F A GK V+EWL Sbjct: 461 FIAKGKVVKEWL 472 >gb|OAP09950.1| hypothetical protein AXX17_AT2G12140 [Arabidopsis thaliana] Length = 470 Score = 335 bits (858), Expect = e-107 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%) Frame = +3 Query: 141 KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299 KH DR S+LS+ + DP A +R I+KF+AAS K AL+ LS +S S P FAL Sbjct: 56 KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 115 Query: 300 PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476 +Y I+EASWF+WNP L A +IALL KQ R E+ TL AV+RLKS RD LF C+L Sbjct: 116 SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 175 Query: 477 LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614 +ES S+ G Q E RLRE+ R AE + EM Sbjct: 176 VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 235 Query: 615 ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794 KP F+++ V+ GYG+LGLF +M + M G IDTVCSN+VLS YG H L 
Sbjct: 236 RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 295 Query: 795 SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974 +M SW+ K++ + SIRT NSVLNSCP ++S++ + S P+S+ +L L E+ Sbjct: 296 PQMGSWLQKLKGFNVLFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 351 Query: 975 XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154 EALLV EL S VL + +EW+ KLDLHG H++++YLILL WM E RLRF Sbjct: 352 --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 403 Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331 EEK ++P EI V+ GSG+HS RG+SP+K LV ++M +T SPMRIDRKN G F A GK Sbjct: 404 SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 463 Query: 1332 VREWLC 1349 V+EWLC Sbjct: 464 VKEWLC 469 >ref|NP_001324066.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] gb|ANM61874.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] Length = 501 Score = 335 bits (860), Expect = e-107 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%) Frame = +3 Query: 141 KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299 KH DR S+LS+ + DP A +R I+KF+AAS K AL+ LS +S S P FAL Sbjct: 87 KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 146 Query: 300 PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476 +Y I+EASWF+WNP L A +IALL KQ R E+ TL AV+RLKS RD LF C+L Sbjct: 147 SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 206 Query: 477 LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614 +ES S+ G Q E RLRE+ R AE + EM Sbjct: 207 VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 266 Query: 615 ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794 KP F+++ V+ GYG+LGLF +M + M G IDTVCSN+VLS YG H L Sbjct: 267 RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 326 Query: 795 SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974 +M SW+ K++ 
+ SIRT NSVLNSCP ++S++ + S P+S+ +L L E+ Sbjct: 327 PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 382 Query: 975 XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154 EALLV EL S VL + +EW+ KLDLHG H++++YLILL WM E RLRF Sbjct: 383 --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 434 Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331 EEK ++P EI V+ GSG+HS RG+SP+K LV ++M +T SPMRIDRKN G F A GK Sbjct: 435 SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 494 Query: 1332 VREWLC 1349 V+EWLC Sbjct: 495 VKEWLC 500 >ref|NP_565402.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gb|AEC06575.1| pentatricopeptide (PPR) repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 335 bits (860), Expect = e-107 Identities = 199/426 (46%), Positives = 257/426 (60%), Gaps = 23/426 (5%) Frame = +3 Query: 141 KHADRLFSALSTVS-TDDPLAADRLIRKFLAASSKPAALHSLSRFIS--LSSP----FAL 299 KH DR S+LS+ + DP A +R I+KF+AAS K AL+ LS +S S P FAL Sbjct: 90 KHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFAL 149 Query: 300 PIYERISEASWFNWNPTLAASVIALLEKQGRTAEARTLTMDAVARLKS-PRDLALFYCDL 476 +Y I+EASWF+WNP L A +IALL KQ R E+ TL AV+RLKS RD LF C+L Sbjct: 150 SLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNL 209 Query: 477 LESISEHGLKQSALETYARLREMPFRXXXXXXXXXXXX--------------AEDKLNEM 614 +ES S+ G Q E RLRE+ R AE + EM Sbjct: 210 VESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEM 269 Query: 615 ASLGCKPSPFQFRLVIQGYGQLGLFSEMKSALRSMEEVGIPIDTVCSNIVLSCYGQHGQL 794 KP F+++ V+ GYG+LGLF +M + M G IDTVCSN+VLS YG H L Sbjct: 270 RMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDAL 329 Query: 795 SEMASWMAKMRSTGIGISIRTLNSVLNSCPMVVSIVSSNTRSLPLSIDDLMKKLEEETAX 974 +M SW+ K++ + SIRT NSVLNSCP ++S++ + S P+S+ +L L E+ Sbjct: 330 
PQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISML-KDLDSCPVSLSELRTFLNED--- 385 Query: 975 XXXXXXXNEALLVGELIGSPVLADILEWSPSGSKLDLHGFHVTAAYLILLNWMQELRLRF 1154 EALLV EL S VL + +EW+ KLDLHG H++++YLILL WM E RLRF Sbjct: 386 --------EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRF 437 Query: 1155 CEEK-LVPLEISVICGSGRHSERRGQSPIKNLVSEMMFQTNSPMRIDRKNPGRFAASGKA 1331 EEK ++P EI V+ GSG+HS RG+SP+K LV ++M +T SPMRIDRKN G F A GK Sbjct: 438 SEEKCVIPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKT 497 Query: 1332 VREWLC 1349 V+EWLC Sbjct: 498 VKEWLC 503