BLASTX nr result
ID: Papaver31_contig00027633
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00027633
         (1515 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   458   e-126
ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   437   e-119
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   426   e-116
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116
ref|XP_010086846.1| hypothetical protein L484_006076 [Morus nota...   419   e-114
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   408   e-111
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   407   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           406   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containi...   401   e-109
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   401   e-109
ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containi...   398   e-108
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   397   e-107
ref|XP_013622437.1| PREDICTED: pentatricopeptide repeat-containi...
396 e-107 emb|CDP14534.1| unnamed protein product [Coffea canephora] 396 e-107 >ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Nelumbo nucifera] Length = 451 Score = 494 bits (1273), Expect = e-137 Identities = 261/444 (58%), Positives = 330/444 (74%), Gaps = 2/444 (0%) Frame = -1 Query: 1467 MLLQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288 + L FS L +HHHH + C+LSKKG R ++RLIRKFVASS Sbjct: 8 LALAFSADLL--HHHHHRRPLFLPWCALSKKGHRFFTSLAAAAGDSAAANRLIRKFVASS 65 Query: 1287 SKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114 SKS ALN LSHL+SS+ H SSL LPMY I E WF WNPKLV++VIA L+ QGQ + Sbjct: 66 SKSDALNALSHLISSNTTHFHLSSLVLPMYRRIAETPWFNWNPKLVASVIAYLDKQGQPE 125 Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNR 934 +E LIS+SVQKL QER++ALFYC+LI+SYSK S+ GVF+SY L+R Sbjct: 126 EAEALISESVQKLGFQERDVALFYCDLIDSYSKQRSRIGVFESYARLKQLFSDSSSSLSR 185 Query: 933 QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQ 754 +AYE++I LC++DLP DAE ++E+M + GFKPS FEFRS+V YGRLGL DMRRVL + Sbjct: 186 RAYETIICSLCSVDLPRDAENMVEEMTISGFKPSAFEFRSLVSGYGRLGLFTDMRRVLRK 245 Query: 753 MEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMA 574 ME +GY LDTI SN+VLSS+G + ELSEM W++KMK SN+ FS+RTYNSV+NSCPTI + Sbjct: 246 MEDAGYCLDTICSNMVLSSFGAHSELSEMASWLRKMKDSNISFSIRTYNSVMNSCPTITS 305 Query: 573 MLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTA 394 +LK+ +F+PLS+EDL ++K E +LV++LI GSSVL++ LKW EGKLDLHGMHL TA Sbjct: 306 LLKDLKFVPLSMEDLKGRLQKDETLLVEQLI-GSSVLMDALKWCPSEGKLDLHGMHLATA 364 Query: 393 YVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSP 214 Y+I+L+W+ VL+SRF G + VIP E RV+CG GKHSSV G+SP+KALV +MMVR+KSP Sbjct: 365 YLIMLQWVQVLRSRFSAG--NWVIPTEFRVICGSGKHSSVRGESPVKALVKQMMVRMKSP 422 Query: 213 MRIGRKNDVGSFVGKGKAVKDWLC 142 M+I R N+VG FVG+GKAV+DWLC Sbjct: 423 MKIDR-NNVGCFVGRGKAVRDWLC 445 >ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica] gi|462396130|gb|EMJ01929.1| hypothetical protein 
PRUPE_ppa021547mg [Prunus persica] Length = 447 Score = 458 bits (1179), Expect = e-126 Identities = 241/443 (54%), Positives = 318/443 (71%), Gaps = 3/443 (0%) Frame = -1 Query: 1461 LQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSK 1282 L FS++L W ++ I C+++K+GQR +++LI KF+ SS+K Sbjct: 9 LSFSIALPWNPLRPLPPLTSPIQCAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSSTK 68 Query: 1281 SIALNTLSHLLSSD--IRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVS 1108 SIALNTLS+LLS D + H SSLALP Y ITEASWFEWNPKLV+ ++A L+ QGQ + + Sbjct: 69 SIALNTLSYLLSPDTTLPHLSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEA 128 Query: 1107 EKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQ 931 E LIS+++ KL S+ERELALF+C L+ S+SK SK G SY + Sbjct: 129 EVLISETISKLGSRERELALFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNR 188 Query: 930 AYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQM 751 A+ESM++GLC +D P +A+ ++E+MR+ G KPS FEFRSVV YGRLGL DM +V++QM Sbjct: 189 AFESMVSGLCEMDRPREADNLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQM 248 Query: 750 EKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAM 571 E G +DTI SN+VLSSYG + EL+ M+VW++KMKS ++PFS+RTYNSVLNSC TIMAM Sbjct: 249 ENQGIAIDTICSNMVLSSYGAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAM 308 Query: 570 LKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAY 391 L+EP+ P SIE+L + E +LV+EL++ S+VL E + W+ LE KLDLHGMHLG+AY Sbjct: 309 LQEPKDFPCSIEELNGVLNGDEALLVKELVE-STVLDEVMVWEPLEAKLDLHGMHLGSAY 367 Query: 390 VILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPM 211 +ILLEW + ++ RF G D VIPAE+ V+CG GKHSSV G+SP+K LV +MM+R++SPM Sbjct: 368 LILLEWFEAMRCRFNSGKD--VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPM 425 Query: 210 RIGRKNDVGSFVGKGKAVKDWLC 142 RI RKN VG FV KG+AVKDWLC Sbjct: 426 RIDRKN-VGCFVAKGRAVKDWLC 447 >ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus domestica] Length = 461 Score = 444 bits (1143), Expect = e-122 Identities = 235/445 (52%), Positives = 315/445 (70%), Gaps = 5/445 (1%) Frame = -1 Query: 1461 
LQFSVSLKWGNHHHHNQS--SAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288 L FSV+ W +H ++++ C L+K+GQR +++LI KF++SS Sbjct: 18 LSFSVASPWKHHQPRPTPPLASSVQCVLTKQGQRFLTKLAANARDPKFTNKLISKFLSSS 77 Query: 1287 SKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114 KSIAL+TLS+LLS D H SSLA P+Y ITE SWFEWNPKLV++++A L+NQG + Sbjct: 78 PKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVALLDNQGLYS 137 Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937 SE LIS+++ KL S+ERELALF+C L+ S+SK SK G +Y Sbjct: 138 QSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLHNSSSVYVK 197 Query: 936 RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757 R+A+ESM+ GLC +D P +A+ ++E+M ++G KPS FEFRSVV YGRLGL +M +V++ Sbjct: 198 RRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLFEEMLKVVE 257 Query: 756 QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577 +ME G +DTI SN+VLSSYG EL+ MV+W++KMK +PFS+RTYNSVLNSCPTIM Sbjct: 258 KMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSVLNSCPTIM 317 Query: 576 AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397 AML++P+ +P SIE L + EG++V+EL+ GS+VL E + W+SLE KLDLHG+HLG+ Sbjct: 318 AMLQDPKDVPCSIEQLNGVLNGDEGLVVKELV-GSTVLEEVMVWESLEAKLDLHGLHLGS 376 Query: 396 AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217 AY+I+LEW + ++ RF G VIPAE+ +VCGLGKHSSV G+SP+K LV MM R+ S Sbjct: 377 AYLIMLEWFEAMRHRFNCG--ECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVMMHRMGS 434 Query: 216 PMRIGRKNDVGSFVGKGKAVKDWLC 142 PMRI RKN VG F+ KG+AVKDWLC Sbjct: 435 PMRIDRKN-VGCFIAKGRAVKDWLC 458 >ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Malus domestica] Length = 461 Score = 444 bits (1143), Expect = e-122 Identities = 235/445 (52%), Positives = 315/445 (70%), Gaps = 5/445 (1%) Frame = -1 Query: 1461 LQFSVSLKWGNHHHHNQS--SAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288 L FSV+ W +H ++++ C L+K+GQR +++LI KF++SS Sbjct: 18 LSFSVASPWKHHQPRPTPPLASSVQCVLTKQGQRFLTKLAANARDPKFTNKLISKFLSSS 77 Query: 1287 
SKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114 KSIAL+TLS+LLS D H SSLA P+Y ITE SWFEWNPKLV++++A L+NQG + Sbjct: 78 PKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVALLDNQGLYS 137 Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937 SE LIS+++ KL S+ERELALF+C L+ S+SK SK G +Y Sbjct: 138 QSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLHNSSSVYVK 197 Query: 936 RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757 R+A+ESM+ GLC +D P +A+ ++E+M ++G KPS FEFRSVV YGRLGL +M +V++ Sbjct: 198 RRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLFEEMLKVVE 257 Query: 756 QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577 +ME G +DTI SN+VLSSYG EL+ MV+W++KMK +PFS+RTYNSVLNSCPTIM Sbjct: 258 KMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSVLNSCPTIM 317 Query: 576 AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397 AML++P+ +P SIE L + EG++V+EL+ GS+VL E + W+SLE KLDLHG+HLG+ Sbjct: 318 AMLQDPKDVPCSIEQLNGVLNGDEGLVVKELV-GSTVLEEVMVWESLEAKLDLHGLHLGS 376 Query: 396 AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217 AY+I+LEW + ++ RF G VIPAE+ +VCGLGKHSSV G+SP+K LV MM R+ S Sbjct: 377 AYLIMLEWFEAMRHRFNCG--ECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVMMHRMGS 434 Query: 216 PMRIGRKNDVGSFVGKGKAVKDWLC 142 PMRI RKN VG F+ KG+AVKDWLC Sbjct: 435 PMRIDRKN-VGCFIAKGRAVKDWLC 458 >ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Fragaria vesca subsp. 
vesca] Length = 448 Score = 437 bits (1123), Expect = e-119 Identities = 234/445 (52%), Positives = 313/445 (70%), Gaps = 5/445 (1%) Frame = -1 Query: 1461 LQFSVSLKWGNHH-HHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSS 1285 L FSV+L W + H++ S I C+L+K+GQR +++LI KF+++S Sbjct: 9 LSFSVALPWRHDPPQHSKLSLQIQCALTKQGQRFLTKLAANAGNPSVANKLISKFLSTSP 68 Query: 1284 KSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDV 1111 KS AL TLS+LLS H SSLALPMY ITEASWFEWNPKLV+ ++A L QGQ Sbjct: 69 KSTALTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQ 128 Query: 1110 SEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL--N 937 SE LIS+++ KL ++EREL F+C L+ S+SK SK G FD + Sbjct: 129 SEALISETISKLGNKERELVQFHCQLVESHSKMSSKCG-FDRACTYLHQLLQNSSSVYVK 187 Query: 936 RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757 R+A+ESM+ GLC +D P +A+E++E+MR++G K S FEFRSVV YGRLG+ +M +++D Sbjct: 188 RRAFESMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVD 247 Query: 756 QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577 QMEK G+ DTI N+VLSSYG + EL+ M W++KMK S+VPFSVRTYNSVLNSCPTIM Sbjct: 248 QMEKQGFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIM 307 Query: 576 AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397 AML+EP+ +P S+ +L + E ++V+EL+ GS+V+ E + WDS E KLDLHGMHLG+ Sbjct: 308 AMLQEPKAVPCSVGELSGVLDGDEALVVKELV-GSAVVDEAMVWDSAEAKLDLHGMHLGS 366 Query: 396 AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217 AY+++LEW + + +RF V+PAE+ +VCGLGKHSSV G+SP+K LV EMM +++S Sbjct: 367 AYLVMLEWFEAMGNRFKSA--ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMES 424 Query: 216 PMRIGRKNDVGSFVGKGKAVKDWLC 142 PMRI RKN VG F+ KG+AVKDWLC Sbjct: 425 PMRIDRKN-VGCFIAKGRAVKDWLC 448 >ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] gi|568866680|ref|XP_006486677.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Citrus sinensis] gi|557524456|gb|ESR35762.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] Length = 451 Score = 426 bits 
(1094), Expect = e-116 Identities = 223/397 (56%), Positives = 291/397 (73%), Gaps = 5/397 (1%) Frame = -1 Query: 1317 RLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVI 1144 RLI KFVASS + IALN LSHLLS D H SSLA P+Y ITE SWF+WNPKLV+ +I Sbjct: 59 RLISKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEII 118 Query: 1143 ASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXX 964 A L+ QGQ + +E LI +++ KL S+EREL LFYCNLI+S+ K+ SK+G D+Y Sbjct: 119 AFLDKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQL 178 Query: 963 XXXXXXXL-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLG 787 RQA +SMI+GLC + PH+AE ++E+MR++G +PS FE++ ++ YGRLG Sbjct: 179 VNSSSSVYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLG 238 Query: 786 LLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYN 607 LL DM R+++QME G +DT+ SN+VLSSYGD+ ELS MV+W++KMK S +PFSVRTYN Sbjct: 239 LLEDMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYN 298 Query: 606 SVLNSCPTIMAMLKE--PEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLE 433 SVLNSC TIM+ML++ PLSI +L + + + E +V+EL D SSVL E +KWDS E Sbjct: 299 SVLNSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELED-SSVLDEAMKWDSGE 357 Query: 432 GKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIK 253 KLDLHGMHLG+AY I+L+WMD +++RF + VIPAEI VVCG GKHS+V G+S +K Sbjct: 358 TKLDLHGMHLGSAYFIILQWMDEMRNRF--NNEKHVIPAEITVVCGSGKHSTVRGESSVK 415 Query: 252 ALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 A+V +MMVR SPMR+ R N++G F+ KG VKDWLC Sbjct: 416 AMVKKMMVRTSSPMRVHR-NNIGCFIAKGHVVKDWLC 451 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 425 bits (1093), Expect = e-116 Identities = 234/443 (52%), Positives = 309/443 (69%), Gaps = 3/443 (0%) Frame = -1 Query: 1461 LQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSK 1282 LQ S W NH +S I C+LSK+GQ S+RLI KF+ASSSK Sbjct: 5 LQVSRPQPW-NH----RSPLLIQCALSKQGQ---LFLSSVARDPSASNRLICKFIASSSK 56 Query: 
1281 SIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVS 1108 SIALN LSHLLS H SSLALP+Y I+EASWF WNPKL+++VIA L QGQ + Sbjct: 57 SIALNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEA 116 Query: 1107 EKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNRQ- 931 E L+S+++ KL S+ER+L FYCNLI+S+SK+ S +GVFD ++ Sbjct: 117 ETLVSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKER 176 Query: 930 AYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQM 751 AY+SMI+ LC + LP +AE ++E+MR++G KPS FEFRSVV YGR+GL DM+R+L QM Sbjct: 177 AYKSMISSLCAVGLPLEAENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQM 236 Query: 750 EKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAM 571 G+ LDT+ SN+VLSSYG + SEMV W+++MK+S++PFS+RTYNSVLNSCP IM++ Sbjct: 237 GNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSI 296 Query: 570 LKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAY 391 L++ + P +I++L++ +K E +LV+ELI GS VL E ++WD EGKLDLHGMHLG+AY Sbjct: 297 LQDLKTFPPTIDELMETLKGDEALLVKELI-GSMVLAELMEWDCSEGKLDLHGMHLGSAY 355 Query: 390 VILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPM 211 +I+L+W + L+ R V+P EI VVCG GKHSSV G+SP+K +V EMM R +SPM Sbjct: 356 LIMLQWREELRYRLNAA--EYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPM 413 Query: 210 RIGRKNDVGSFVGKGKAVKDWLC 142 +I RKN +G FV K K VK+WLC Sbjct: 414 KIDRKN-IGCFVAKAKVVKNWLC 435 >ref|XP_010086846.1| hypothetical protein L484_006076 [Morus notabilis] gi|587833217|gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] Length = 517 Score = 419 bits (1078), Expect = e-114 Identities = 224/426 (52%), Positives = 302/426 (70%), Gaps = 3/426 (0%) Frame = -1 Query: 1410 SSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSKSIALNTLSHLLSSDIRH 1231 +S++I C+L+K+G R +++LI KFVASS KSI+LN LSHLLS D H Sbjct: 96 ASSSIQCALTKQGHRFLSTLSINAGNASAANKLIGKFVASSPKSISLNALSHLLSPDTTH 155 Query: 1230 H--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERE 1057 +S +L +Y I EASWF ++PKLV+ + A L+ QG++ +E LI+++V KL ++RE Sbjct: 156 
THLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRE 215 Query: 1056 LALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNR-QAYESMINGLCTLDLPHD 880 LA+FYC+L+ S+SK SK G SY + +A+E+M+ LCT+D P + Sbjct: 216 LAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCE 275 Query: 879 AEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLS 700 AE +ME+MR +G KPS FEFRS+V YGRLGL DM R ++QME G +DTI SN+VLS Sbjct: 276 AESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLS 335 Query: 699 SYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDN 520 SYG + EL +MV+W++KM++S++PFS+RTYNSVLN CPTI AML++ + +PLS+ +L Sbjct: 336 SYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNAT 395 Query: 519 VKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGG 340 ++ EG+LV EL+ GSSVL E L WDSLE KLDLHGMHLG+AY+I+LEWM+ + RF G Sbjct: 396 LRGDEGLLVMELV-GSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDG 454 Query: 339 GDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKA 160 IPAE+ VVCG GKHS+V G SP+K LV EMMV++KSPM+I RKN G F+ KGK Sbjct: 455 NHG--IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKN-AGCFLAKGKT 511 Query: 159 VKDWLC 142 V+DWLC Sbjct: 512 VRDWLC 517 >ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297329872|gb|EFH60291.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 504 Score = 408 bits (1049), Expect = e-111 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNV 1147 HR I+KFVA+S KS+ LN LSHLLS H S AL +Y ITEASWF+WNPKL++ + Sbjct: 112 HRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFFALSLYSEITEASWFDWNPKLIAEL 171 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970 +A L NQ +FD SE L+S +V +L+S ER+ ALF CNL+ S SK GS +G ++ + Sbjct: 172 VAVLNNQERFDESETLLSTAVSRLKSNERDFALFLCNLVESNSKQGSIQGFNEACFRLRE 231 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM+ GLC +D PHDAE ++E+MR+E KP FE +SV+ YGRL Sbjct: 232 RIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAERVIEEMRVEKIKPGSFEHKSVLYGYGRL 291 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL +DM RV+ +ME G+ +DT+ SN+VLSSYG + L +M W++K+K NVPFS+RTY Sbjct: 292 GLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTIM++LK+ P+S+ +L + + E +LV EL S+VL E ++W+++EG Sbjct: 352 NSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNEDEALLVLELTQ-STVLDEAIEWNAVEG 410 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+ILL+WMD ++ RF VIPAEI VV G GKHS+V G+SP+KA Sbjct: 411 KLDLHGMHLSSSYLILLQWMDEIRLRF--RDQKCVIPAEIVVVSGSGKHSNVRGESPVKA 468 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR +SPMRI RKN VGSF+ KGK VK+WLC Sbjct: 469 LVKKIMVRTESPMRIDRKN-VGSFIAKGKNVKEWLC 503 >ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative [Theobroma cacao] gi|508705664|gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative [Theobroma cacao] Length = 456 Score = 407 bits (1047), Expect = e-110 Identities = 216/441 (48%), Positives = 299/441 (67%), Gaps = 10/441 (2%) Frame = -1 Query: 1431 NHHHHNQSSAAIICS-----LSKKGQRXXXXXXXXXXXXXXS--HRLIRKFVASSSKSIA 1273 NH H + +I C L+K+G R + +RLI+KFVASS KSIA Sbjct: 14 
NHRHLRPTRPSIKCESGGVPLTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIA 73 Query: 1272 LNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKL 1099 LN LSHLLS + H S+LA P+Y I+E SW+ WNPKLV+ +IA L QG++D SE L Sbjct: 74 LNALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEAL 133 Query: 1098 ISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQAYE 922 IS +V KL+ +ER+L FYCN I S SK+ SK+G D+Y RQ Y+ Sbjct: 134 ISQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYK 193 Query: 921 SMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKS 742 SM++ LC +D P++AE ++E+MR G P+ FEFR + YG+LGL DM R++ +ME Sbjct: 194 SMVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIE 253 Query: 741 GYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKE 562 G+ +DTI SN+VLSSYG S+MV W++KMK+ +PFS+RTYNSVLNSCP IM++++ Sbjct: 254 GFEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQG 313 Query: 561 PEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVIL 382 + +PLS+ +L + + E +LVQEL+ SSVL E ++W+ EGKLDLHGMHLG+AY+I+ Sbjct: 314 LDSVPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIM 373 Query: 381 LEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIG 202 L+W++ +K RF + VIPA+I +VCG GKHSSV G+SP+K L+ +MMV++KSPM+I Sbjct: 374 LQWIEEMKCRF--KVEECVIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKID 431 Query: 201 RKNDVGSFVGKGKAVKDWLCP 139 RKN +G F+ KG+ VK+WL P Sbjct: 432 RKN-IGCFIAKGQVVKNWLIP 451 >ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 505 Score = 406 bits (1044), Expect = e-110 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147 +R I+KFVA+S KS+ALN LSHLLS H S AL +Y 
ITEASWF+WNPKL++ + Sbjct: 113 NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 172 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970 IA L Q +FD SE L+S +V +L+S ER+ LF CNL+ S SK GS +G + S+ Sbjct: 173 IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 232 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D PHDAE ++E+MR+E KP FE++SV+ YGRL Sbjct: 233 IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 292 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL +DM RV+ +M G+ +DT+ SN+VLSSYG + L +M W++K+K NVPFS+RTY Sbjct: 293 GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 352 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI++MLK+ + P+S+ +L + + E +LV EL SSVL E ++W+++EG Sbjct: 353 NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 411 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+ILL+WMD + RF + VIPAEI VV G GKHS+V G+SP+KA Sbjct: 412 KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 469 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR SPMRI RKN VGSF+ KGK VK+WLC Sbjct: 470 LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 504 >dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] Length = 501 Score = 406 bits (1044), Expect = e-110 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147 +R I+KFVA+S KS+ALN LSHLLS H S AL +Y ITEASWF+WNPKL++ + Sbjct: 109 NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 168 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970 IA L Q +FD SE L+S +V +L+S ER+ LF CNL+ S SK GS +G + S+ Sbjct: 169 IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 228 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D PHDAE ++E+MR+E KP FE++SV+ 
YGRL Sbjct: 229 IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 288 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL +DM RV+ +M G+ +DT+ SN+VLSSYG + L +M W++K+K NVPFS+RTY Sbjct: 289 GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 348 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI++MLK+ + P+S+ +L + + E +LV EL SSVL E ++W+++EG Sbjct: 349 NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 407 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+ILL+WMD + RF + VIPAEI VV G GKHS+V G+SP+KA Sbjct: 408 KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 465 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR SPMRI RKN VGSF+ KGK VK+WLC Sbjct: 466 LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 500 >ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 406 bits (1044), Expect = e-110 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147 +R I+KFVA+S KS+ALN LSHLLS H S AL +Y ITEASWF+WNPKL++ + Sbjct: 112 NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 171 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970 IA L Q +FD SE L+S +V +L+S ER+ LF CNL+ S SK GS +G + S+ Sbjct: 172 IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 231 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D PHDAE ++E+MR+E KP FE++SV+ YGRL Sbjct: 232 IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 291 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL 
+DM RV+ +M G+ +DT+ SN+VLSSYG + L +M W++K+K NVPFS+RTY Sbjct: 292 GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI++MLK+ + P+S+ +L + + E +LV EL SSVL E ++W+++EG Sbjct: 352 NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 410 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+ILL+WMD + RF + VIPAEI VV G GKHS+V G+SP+KA Sbjct: 411 KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 468 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR SPMRI RKN VGSF+ KGK VK+WLC Sbjct: 469 LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 503 >ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Erythranthe guttatus] gi|604333640|gb|EYU37991.1| hypothetical protein MIMGU_mgv1a006093mg [Erythranthe guttata] Length = 458 Score = 402 bits (1033), Expect = e-109 Identities = 213/433 (49%), Positives = 288/433 (66%), Gaps = 3/433 (0%) Frame = -1 Query: 1398 IICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSKSIALNTLSHLLSSDIRHH--S 1225 ++C L+K+GQR L+RKFVASSSK +AL+TLSHLLS H S Sbjct: 26 LVCVLTKQGQRLLSSIATSEQPSAAIS-LLRKFVASSSKHVALSTLSHLLSPSTSHPRLS 84 Query: 1224 SLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELALF 1045 SLA P+Y I + SWF WN KLV+++I+ L +FD ++ L ++V KL +ER+L F Sbjct: 85 SLAFPLYGIIEQESWFTWNSKLVADLISLLYKAERFDEADNLFGETVSKLGFKERDLCTF 144 Query: 1044 YCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNRQ-AYESMINGLCTLDLPHDAEEI 868 YCNL++S++K+ S++GV DS +Q YESMI G C + P AE + Sbjct: 145 YCNLVDSHAKHMSERGVSDSCTRLKQLILASSSVYVKQKGYESMIAGFCEIGSPDKAENL 204 Query: 867 MEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGD 688 ME+MR G KPS FE R++V YG++GLL DM+R + QMEK G+ LDT+ N+VLSS+G Sbjct: 205 MEEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKRSVGQMEKEGFELDTVCYNMVLSSFGA 264 Query: 687 NKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKV 508 E +M++W+KKM++S +PFS+RTYNSVLNSCPT++ +L++ + LPLS+ +L+DN+K Sbjct: 265 
RNEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCPTVILLLEDMKSLPLSVNELVDNLKTG 324 Query: 507 EGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSV 328 E LV EL+ S VL + ++W S E KLD+HGMHL TAY+ILL+W LK RFG G Sbjct: 325 EADLVLELMK-SDVLDQVMEWKSTELKLDMHGMHLSTAYLILLQWFKELKVRFGDGNHET 383 Query: 327 VIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDW 148 P EI VVCG GKHSS G+SP+K L EM+ R+K P+RI RKN +G F+GKGK KDW Sbjct: 384 --PTEILVVCGSGKHSSKRGESPVKVLAKEMVTRMKCPLRIDRKN-IGCFIGKGKTFKDW 440 Query: 147 LCPVVSNETLARL 109 LC SN+ A + Sbjct: 441 LCNEDSNKNPAEI 453 >ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Jatropha curcas] Length = 473 Score = 401 bits (1031), Expect = e-109 Identities = 224/447 (50%), Positives = 302/447 (67%), Gaps = 19/447 (4%) Frame = -1 Query: 1428 HHHHNQ--------SSAAII---CSLSKKGQRXXXXXXXXXXXXXXS--HRLIRKFVASS 1288 HHHH Q SS + +LSK+GQR S + LI+KFVA+S Sbjct: 29 HHHHIQVGPLETKLSSKWRVFECAALSKQGQRFLSSLATATAARDNSATNSLIKKFVAAS 88 Query: 1287 SKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114 KSIAL+ LSHLLS S H SSLA P+Y I EA WF+WNPKLV+ V+A L+ QGQ++ Sbjct: 89 PKSIALDALSHLLSPNSSYSHLSSLAFPLYLKIQEAHWFDWNPKLVAEVVALLDKQGQYN 148 Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937 S LISDS+ KL+ +ER+LALFYCNL+ S+SK +G DS+ Sbjct: 149 ESGTLISDSISKLKLRERDLALFYCNLVESHSKQNCVQGFEDSFARLNQLVFSSNSVYIK 208 Query: 936 RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757 +QAY+SMI+GLC + P +A++++E+MR +G KPS +EFR V+ AYG+LGL +M+ +LD Sbjct: 209 KQAYKSMISGLCEMGRPKEAQDLIEEMRGKGVKPSVYEFRCVLHAYGKLGLFQEMQMILD 268 Query: 756 QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577 QME G+ +DT+ SN+VLSSYG L E+V W+KKMK +PFS RT NSVLNSCPT+M Sbjct: 269 QMESGGFKVDTVCSNMVLSSYGVYNALPEIVSWLKKMKDLGIPFSSRTCNSVLNSCPTMM 328 Query: 576 AMLK--EPEFLPLSIEDLIDNVKKVEGMLVQELIDG-SSVLLENLKWDSLEGKLDLHGMH 406 + ++ P+SI++L+ ++ E M+V ELI G SSVL E ++WD+LE KLDLHGMH Sbjct: 329 STVQNSNANTYPISIQELMKILRGDEAMVVNELIIGSSSVLEEAMQWDALESKLDLHGMH 388 
Query: 405 LGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVR 226 L +AY+I+L W + +K+RF GG + VIPAEI VVCG G HS V G+SP+K ++ +MV+ Sbjct: 389 LCSAYLIMLLWFEEMKNRFNGG--NYVIPAEITVVCGSGNHSIVRGESPVKRMIKSIMVQ 446 Query: 225 LKSPMRIGRKNDVGSFVGKGKAVKDWL 145 +SPMR+ RKN +G F+ KGK VK+WL Sbjct: 447 TRSPMRVDRKN-LGCFIAKGKVVKEWL 472 >ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] gi|557110519|gb|ESQ50810.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] Length = 469 Score = 401 bits (1031), Expect = e-109 Identities = 206/396 (52%), Positives = 285/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147 +R I+KFVA+S KS++LN LSHLLS+ H S AL +Y ITEASWF+WNPKL++ + Sbjct: 77 NRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 136 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970 +A L Q + SE L+S++V +L+S ER++ALFYCNL+ S SK GS +G ++ Sbjct: 137 VALLNKQERSHESETLLSNAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLRE 196 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D PHDAE ++E+MR+ KP FE++SV+ YGRL Sbjct: 197 ITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRL 256 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL DM RV+ +ME G+ +DT+ SN+VLSSYG + L +M W++K+K SNVP S RTY Sbjct: 257 GLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTY 316 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI+++LK+ + P+S+ +L+ + K E +LV+ L SSVL E ++W SLEG Sbjct: 317 NSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKDEEVLVRGLTQ-SSVLDEAIEWSSLEG 375 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+I+++WMD ++ RF G V+PAEI +V G GKHS+V G+SP+KA Sbjct: 376 KLDLHGMHLSSSYLIMMQWMDEMRIRFSEG--KCVVPAEIVLVSGSGKHSNVRGESPVKA 433 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR SPMRI RKN +GSF+ KGK VK+WLC Sbjct: 434 
LVKKIMVRTGSPMRIDRKN-IGSFIAKGKTVKEWLC 468 >ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Tarenaya hassleriana] gi|729371006|ref|XP_010548125.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Tarenaya hassleriana] Length = 462 Score = 398 bits (1022), Expect = e-108 Identities = 206/396 (52%), Positives = 281/396 (70%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNV 1147 +R IRKFVA+S KS+ALN LSHLLS + H SS+AL +Y I EA WF+WNPKLV+++ Sbjct: 70 NRQIRKFVAASPKSVALNVLSHLLSPLNSHPHLSSIALNLYSEIAEAPWFDWNPKLVADL 129 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 967 +A L Q QF SE L+S +V +L+ ER LALF+CNL+ S SK GS +G DSY Sbjct: 130 VALLNKQEQFPESESLLSAAVSRLKPNERGLALFHCNLVESNSKQGSTRGFNDSYSCLRE 189 Query: 966 XXXXXXXXLNR-QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + Q Y+S+++GLC +D P+DAE ++ +M+ EG KP FE+RSV+ YGRL Sbjct: 190 IIQRSSSVYVKSQGYKSIVSGLCNMDRPYDAERVLAEMKTEGIKPELFEYRSVLYGYGRL 249 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL DM R + +ME G+ +DT+ SN+VLSSYG L EM W++K+K +P S+RTY Sbjct: 250 GLFFDMNRTVHEMESDGHKIDTVCSNMVLSSYGARDALPEMGSWLQKLKGFGIPLSIRTY 309 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI ++LK+ + P+S+ +L + + E +L +EL+ SSVL E ++W++LEG Sbjct: 310 NSVLNSCPTITSLLKDLDSCPVSLSELTGLLNEDEMLLTRELVQ-SSVLDEAMEWNALEG 368 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL ++Y+I+++WMD ++ RF G VIP EI +V G GKHS+V G+SP+KA Sbjct: 369 KLDLHGMHLSSSYLIMMQWMDKVRIRFEEG--KHVIPVEIVIVSGSGKHSNVRGESPVKA 426 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV ++MVR SPMRI RKN +GSF+ KGKAVK+WLC Sbjct: 427 LVKKIMVRTGSPMRIDRKN-IGSFIAKGKAVKEWLC 461 >ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] gi|482566151|gb|EOA30340.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] Length = 516 Score = 397 bits (1019), Expect = 
e-107 Identities = 207/396 (52%), Positives = 283/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRH-HSSLALP-MYEWITEASWFEWNPKLVSNV 1147 +RLI+KFVA+S KS+ALN LSHLLS + H H S P +Y ITEASWF+WNPKL+ + Sbjct: 123 NRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYFAPQLYLEITEASWFDWNPKLIGEL 182 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 967 ++ L Q +F SE L+S +V +LES ER+ ALF CNL+ S SK GS +G D+ Sbjct: 183 VSLLNKQERFVESETLLSTAVSRLESNERDFALFLCNLVESNSKQGSIQGFSDACSRLRE 242 Query: 966 XXXXXXXXLNR-QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D P DAE ++E+MR+E KP FE++SV+ YGRL Sbjct: 243 IIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAERVIEEMRMETIKPGLFEYKSVLYGYGRL 302 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL +DM R++ +ME G+ +DT+ SN+VLSSYG + L +M W++K+K NVP S+RTY Sbjct: 303 GLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGYNVPLSIRTY 362 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 NSVLNSCPTI+++LK+ + PLS+ +L+ + + E +LV+EL S VL E ++W+++EG Sbjct: 363 NSVLNSCPTIISLLKDLDSCPLSLSELLPILNEDEALLVRELTQ-SLVLDEAIEWNAVEG 421 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL +Y+I+L+WMD + RF V+PAEI VV G GKHS+V G+SP+KA Sbjct: 422 KLDLHGMHLSASYLIMLQWMDETRLRF-SEDKKCVVPAEIVVVSGSGKHSNVRGESPVKA 480 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 +V ++MVR KSPMRI RKN VGSF+ KGK VK+WLC Sbjct: 481 MVKKIMVRTKSPMRIDRKN-VGSFIAKGKNVKEWLC 515 >ref|XP_013622437.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Brassica oleracea var. oleracea] gi|922432660|ref|XP_013622438.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Brassica oleracea var. oleracea] gi|922432662|ref|XP_013622439.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Brassica oleracea var. 
oleracea] Length = 461 Score = 396 bits (1018), Expect = e-107 Identities = 205/396 (51%), Positives = 282/396 (71%), Gaps = 3/396 (0%) Frame = -1 Query: 1320 HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNV 1147 +R I+KFVA+S KS++LN LSHLLS + + H S AL +Y IT+ASWF+WNPKL++++ Sbjct: 69 NRHIKKFVAASPKSVSLNVLSHLLSPHTSLPHLSFFALSLYSEITDASWFDWNPKLIADL 128 Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970 +A L Q +F SE L+S +V L+S ER+ ALF CNL S SK GS +G ++ Sbjct: 129 VALLNKQERFHESETLLSTAVTNLKSNERDFALFLCNLAESNSKQGSAQGFKEACLRLRE 188 Query: 969 XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790 + QAY+SM++GLC +D P+DAE ++E+MRLE KP FE++SV+ YGRL Sbjct: 189 VLQTSSSVYVKTQAYKSMVSGLCNMDQPNDAETVIEEMRLEKLKPGVFEYKSVLYGYGRL 248 Query: 789 GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610 GL +DM R++ +ME G+ +DT+ SN+VLSSYG + L +M W++++K NVP S+RTY Sbjct: 249 GLFDDMNRIVHRMETEGHRVDTVCSNMVLSSYGAHDALPQMGSWLQRLKDFNVPLSLRTY 308 Query: 609 NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430 N+VLNSCPT+ +MLK+ PLS+ +++ + + E +LV+ L SSVL E ++W SLEG Sbjct: 309 NTVLNSCPTVTSMLKDLNSCPLSVSEVLTFLNEDEVVLVRALTQ-SSVLHEAMEWSSLEG 367 Query: 429 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250 KLDLHGMHL +AY+I+++WMD +K RF G+ V+PAEI VV G GKHSSV G+SP+KA Sbjct: 368 KLDLHGMHLSSAYLIMMQWMDEIKVRF--SGEKCVVPAEIVVVSGSGKHSSVRGESPVKA 425 Query: 249 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142 LV +MMVR SPMRI RKN VG F+ KGK VK+W C Sbjct: 426 LVKKMMVRTGSPMRIDRKN-VGCFIAKGKTVKEWFC 460 >emb|CDP14534.1| unnamed protein product [Coffea canephora] Length = 449 Score = 396 bits (1017), Expect = e-107 Identities = 213/424 (50%), Positives = 295/424 (69%), Gaps = 5/424 (1%) Frame = -1 Query: 1398 IICSLSKKGQRXXXXXXXXXXXXXXSH-RLIRKFVASSSKSIALNTLSHLLSSDIRH-HS 1225 + CSL K+GQR +H R +RKFV +SSK +AL+TLSHLLS H H Sbjct: 30 VCCSLCKQGQRFLSSLATTDESSSAAHHRSLRKFVKTSSKHVALDTLSHLLSPTTAHPHL 89 Query: 1224 S--LALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELA 1051 S 
LALP+Y I++ASWF WN KL+++V A + Q +F +E LI +++KL + +R+L Sbjct: 90 SYHLALPLYLIISQASWFSWNAKLLADVTALMYKQERFIEAEALILQALKKLPAHDRDLC 149 Query: 1050 LFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQAYESMINGLCTLDLPHDAE 874 FYC+L++S +K+ S+KGVFDS ++AYESMI+GLC + LP +AE Sbjct: 150 NFYCHLLHSNAKHRSRKGVFDSLTSLKQLLARSSSVYVQKRAYESMISGLCEIGLPGEAE 209 Query: 873 EIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSY 694 +ME+MR G KPS FEF+S+V AYGRLGL DM+R + QME +G LDT+ SN+VLSS Sbjct: 210 NLMEEMRGVGLKPSGFEFKSLVHAYGRLGLFEDMKRSVTQMEDAGVELDTVCSNMVLSSL 269 Query: 693 GDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVK 514 G +K SEMV W+++MK S V FS+RTYNSVLNSCPT++ +L++P+ +PLS+EDL+ N+ Sbjct: 270 GSHKVFSEMVSWLRRMKDSEVSFSIRTYNSVLNSCPTLILLLQDPKTIPLSMEDLMGNLS 329 Query: 513 KVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGD 334 + E LV+EL+ SSVL E ++ +S E KLDLHGMHL T+ +I L+W+D L+ RF G + Sbjct: 330 QEEADLVRELV-ASSVLDEAMECNSAELKLDLHGMHLSTSCLIFLQWIDRLRLRFSAGDN 388 Query: 333 SVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVK 154 ++P +I VVCG GKHS+ G+SP+K L+ EM++R+K P+RI R+N +G FV KGK Sbjct: 389 --MVPTQITVVCGSGKHSASRGESPVKGLLREMILRIKCPLRIDRRN-LGCFVAKGKVFS 445 Query: 153 DWLC 142 DWLC Sbjct: 446 DWLC 449