BLASTX nr result
ID: Rauwolfia21_contig00001061
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00001061
         (1645 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   462   e-127
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-123
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   445   e-122
gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein...   444   e-122
gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe...   426   e-116
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   417   e-114
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   414   e-113
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   410   e-112
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     409   e-111
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-111
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   405   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           405   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   402   e-109
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   392   e-106
gb|AGH33847.1| PPR [Cucumis melo]                                     390   e-106
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...
390 e-105 ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204... 390 e-105 >ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum tuberosum] Length = 459 Score = 468 bits (1203), Expect = e-129 Identities = 243/436 (55%), Positives = 316/436 (72%), Gaps = 4/436 (0%) Frame = +1 Query: 10 LSRRCSLDQRAL--CP-LALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVAL 180 LS R SL R CP +LSKQGHRFL++L A S D SAT+ RKFV SS KHVAL Sbjct: 14 LSHRLSLWNRRPRPCPRCSLSKQGHRFLSTLIAADS-EDISATRHLLRKFVASSSKHVAL 72 Query: 181 DXXXXXXXXXXXXXXXX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357 ++A PLYL I++ SWF+WN+KLVAD++A++YK E F +AETL Sbjct: 73 STLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETL 132 Query: 358 IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537 + ET+ KL +ERD+C FY LI S +KH + V D +K SSS+Y+K+R Y Sbjct: 133 VTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLL-RSSSVYLKQRGY 191 Query: 538 VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717 SM+ C IG PR+AEE+MEEM+ LGLK S FEFRSLVY+YG+ G + DMKR VV+M+S Sbjct: 192 ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251 Query: 718 QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897 GF+LDTV +NMVL S G+H EL ++VS L+ +++SG+PFSIRTYNSVLNSCP I L+++ Sbjct: 252 MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311 Query: 898 DIKSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIF 1077 D+KS+P+S EEL+ +L E+EA +V LVGSSVL+E ++W SELKLDLHGMHL+S+Y+I Sbjct: 312 DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371 Query: 1078 LQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARN 1257 LQW L+ + N++LP EI VVCG+GKHS VRG+SPVK LI+E++LR+ CPL+I R Sbjct: 372 LQWFHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRK 431 Query: 1258 NAGCFVAKGKVFMDWL 1305 N GCF+AKGK FM+WL Sbjct: 432 NIGCFIAKGKSFMEWL 447 >ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Solanum lycopersicum] Length = 459 Score = 462 
bits (1188), Expect = e-127 Identities = 240/436 (55%), Positives = 315/436 (72%), Gaps = 4/436 (0%) Frame = +1 Query: 10 LSRRCSLDQRALCP---LALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVAL 180 LS R SL R P +LSKQGHRFL++L AT S D SAT+ RKFV SS KHVAL Sbjct: 14 LSHRLSLWNRRPRPGPRCSLSKQGHRFLSTLIATDS-DDISATRHLLRKFVGSSSKHVAL 72 Query: 181 DXXXXXXXXXXXXXXXX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357 ++A PLYL I++ SWF+WN+KLVA+++A++YK E F +AETL Sbjct: 73 STLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETL 132 Query: 358 IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537 + E++ KL +ERD+C FY LI S +KH + V D +K SSS+Y+K+R Y Sbjct: 133 VTESVSKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLL-HSSSVYLKQRGY 191 Query: 538 VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717 SM+ C IG PR+AEE+MEEM+ LGLK S FEFRSLVY+YG+ G + DMKR VV+M+ Sbjct: 192 ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMER 251 Query: 718 QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897 GF+LDTV +NMVL S G+H EL ++VS L+ +++SG+ FSIRTYNSVLNSCP I L+++ Sbjct: 252 MGFQLDTVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQ 311 Query: 898 DIKSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIF 1077 D+KS+P+S EEL+ +L E+EA +V+ LVGSSVL+E ++W ELKLDLHGMHL+S+YLI Sbjct: 312 DLKSVPLSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLII 371 Query: 1078 LQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARN 1257 LQW L+ + N++LP EI VVCG+GKHS VRG+SPVK LI+E++LR+ CPL+I R Sbjct: 372 LQWFHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRK 431 Query: 1258 NAGCFVAKGKVFMDWL 1305 N GCF+AKGKVFM+WL Sbjct: 432 NVGCFIAKGKVFMEWL 447 >ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033 [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 447 bits (1151), Expect = e-123 Identities = 229/422 (54%), Positives = 303/422 (71%) Frame = +1 Query: 43 LCPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXX 222 L 
ALSKQG FL+S+A RD SA+ R KF+ SS K +AL+ Sbjct: 20 LIQCALSKQGQLFLSSVA-----RDPSASNRLICKFIASSSKSIALNALSHLLSPTTTHP 74 Query: 223 XXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDM 402 ++A PLY I++ SWF+WN KL+ADVIA++YK +AETL+ ET+ KL +ERD+ Sbjct: 75 YLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDL 134 Query: 403 CKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPRE 582 FYC LI+S +KH + V D+ + + + SSS+YVK+RAY SMISSLC +G P E Sbjct: 135 VSFYCNLIDSHSKHSSNQGVFDVISRLSRIVS-ESSSVYVKERAYKSMISSLCAVGLPLE 193 Query: 583 AEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLA 762 AE ++EEMR GLK S FEFRS+VY YGR+GL EDM+R +++M ++GFELDTV +NMVL+ Sbjct: 194 AENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLS 253 Query: 763 SLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNS 942 S GA+ + +MVSWL+ MK+S IPFSIRTYNSVLNSCP I+ +++D+K+ P + +EL+ + Sbjct: 254 SYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMET 313 Query: 943 LSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122 L DEA +V+EL+GS VL E +EW+ SE KLDLHGMHL S+YLI LQW + LR RL Sbjct: 314 LKGDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAE 373 Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302 ++P+EITVVCGSGKHS+VRG+SPVK ++REM+ R + P+KI R N GCFVAK KV +W Sbjct: 374 YVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNW 433 Query: 1303 LC 1308 LC Sbjct: 434 LC 435 >ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] gi|568866680|ref|XP_006486677.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Citrus sinensis] gi|557524456|gb|ESR35762.1| hypothetical protein CICLE_v10028424mg [Citrus clementina] Length = 451 Score = 445 bits (1144), Expect = e-122 Identities = 230/439 (52%), Positives = 304/439 (69%), Gaps = 7/439 (1%) Frame = +1 Query: 13 SRRCSLDQRAL----CPLA-LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVA 177 SR C L Q+ L C A L+KQG RFL+SLA + RD A R KFV SSP+ +A Sbjct: 15 
SRCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAVT-RDSKAASRLISKFVASSPQFIA 73 Query: 178 LDXXXXXXXXXXXXXXXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357 L+ ++AFPLY+ IT+ SWF WN KLVA++IA + K +AETL Sbjct: 74 LNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAETL 133 Query: 358 IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537 I+ET+ KL +ER++ FYC LI+S KH K D Y + Q SSSS+YVK++A Sbjct: 134 ILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQL-VNSSSSVYVKRQAL 192 Query: 538 VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717 SMIS LCE+GQP EAE ++EEMR GL+ SGFE++ ++Y YGR+GL+EDM+R V +M+S Sbjct: 193 KSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQMES 252 Query: 718 QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897 G +DTVC+NMVL+S G H EL +MV WL+ MK SGIPFS+RTYNSVLNSC I+ M++ Sbjct: 253 DGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSMLQ 312 Query: 898 DIKS--IPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYL 1071 D+ S P+S EL L+E+E ++V+EL SSVLDEA++W+S E KLDLHGMHL S+Y Sbjct: 313 DLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAYF 372 Query: 1072 IFLQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIA 1251 I LQW+D +R+R ++P EITVVCGSGKHS VRG+S VK ++++M++R P+++ Sbjct: 373 IILQWMDEMRNRFNNEKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMRVH 432 Query: 1252 RNNAGCFVAKGKVFMDWLC 1308 RNN GCF+AKG V DWLC Sbjct: 433 RNNIGCFIAKGHVVKDWLC 451 >gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative [Theobroma cacao] Length = 456 Score = 444 bits (1142), Expect = e-122 Identities = 220/417 (52%), Positives = 301/417 (72%), Gaps = 1/417 (0%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L+KQGHRF +SLAAT+ V D + R +KFV SSPK +AL+ A+ Sbjct: 34 LTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSAL 93 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 AFPLY I++TSW+NWN KLVA++IA++ K + ++E LI + + KL +ERD+ +FYC Sbjct: 94 
AFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQFYC 153 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 IES +KH KE +D Y C +SSS+YVK++ Y SM+SSLCE+ +P EAE ++ Sbjct: 154 NWIESCSKHNSKEGFNDAY-CYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLV 212 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR GL + FEFR + Y YG++GL EDM+R V +M+ +GFE+DT+C+NMVL+S GA+ Sbjct: 213 EEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAY 272 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 +MV WL+ MK+ IPFSIRTYNSVLNSCP I+ +++ + S+P+S EL L+EDE Sbjct: 273 NAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNEDE 332 Query: 958 ANMVRELV-GSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134 A +V+ELV SSVLDEA+EWN SE KLDLHGMHL S+YLI LQWI+ ++ R ++P Sbjct: 333 ALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKVEECVIP 392 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWL 1305 +IT+VCGSGKHS+VRG+SPVK L+R+M++++K P+KI R N GCF+AKG+V +WL Sbjct: 393 AQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449 >gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica] Length = 447 Score = 426 bits (1094), Expect = e-116 Identities = 213/418 (50%), Positives = 285/418 (68%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 A++KQG RFLT LAA + RD T + KF+ SS K +AL+ + Sbjct: 33 AVTKQGQRFLTKLAANA--RDAKVTNKLIAKFLTSSTKSIALNTLSYLLSPDTTLPHLSS 90 Query: 235 MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 +A P Y IT+ SWF WN KLVA ++A++ K +AE LI ET+ KL +ER++ F+ Sbjct: 91 LALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELALFH 150 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 C L+ES +K K Y + Q +SSS+YVK RA+ SM+S LCE+ +PREA+ + Sbjct: 151 CQLVESHSKLSSKHGFDSSYSYLYQLLH-NSSSVYVKNRAFESMVSGLCEMDRPREADNL 209 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 +EEMR GLK S FEFRS+VY YGR+GL EDM + V 
+M++QG +DT+C+NMVL+S GA Sbjct: 210 IEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSYGA 269 Query: 775 HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954 H EL M+ WLR MKS +PFSIRTYNSVLNSC I+ M+++ K P S EEL L+ D Sbjct: 270 HSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLNGD 329 Query: 955 EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134 EA +V+ELV S+VLDE + W E KLDLHGMHL S+YLI L+W + +R R G ++P Sbjct: 330 EALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKDVIP 389 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 E+ V+CGSGKHS+VRG+SPVKGL+++M+LR++ P++I R N GCFVAKG+ DWLC Sbjct: 390 AEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWLC 447 >ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223539507|gb|EEF41095.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 460 Score = 417 bits (1073), Expect = e-114 Identities = 206/418 (49%), Positives = 286/418 (68%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 ALSKQG RFL+SLA ++ D AT R +KFV +SPK +ALD + Sbjct: 44 ALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSS 103 Query: 235 MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 +AF LYL I + WF WN KLVADV+A + K + ++ TL+ +++ KL V+ERD+ +FY Sbjct: 104 LAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKERDLARFY 163 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 C L+ES +K + + Q +S+S+YVK++ Y SM++ LCE+G+PREAE + Sbjct: 164 CNLVESQSKQNSIRGFDNSVASLMQLVC-NSNSVYVKRQGYKSMVNGLCEMGRPREAETL 222 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 +EEM G++ S FEF+ +VYAYG +G E+M + + +M+ GF +DTVC+NM+LAS GA Sbjct: 223 IEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGA 282 Query: 775 HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954 H L +MV WL+ MK GIPFS+RT NS LNSCP I+ MM++ PIS +L+ LSED Sbjct: 283 
HNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSED 342 Query: 955 EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134 EA +V+E+V SSVLDEA++W+ +E KLDLHG HL S+YLI L WI+ +R R N + P Sbjct: 343 EALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNP 402 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EITVVCGSG HS VRG+SPVK ++++ ++R + P++I R N GCF+AKGKV +WLC Sbjct: 403 TEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWLC 460 >ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] gi|550331693|gb|EEE86893.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa] Length = 473 Score = 414 bits (1063), Expect = e-113 Identities = 213/423 (50%), Positives = 288/423 (68%), Gaps = 2/423 (0%) Frame = +1 Query: 46 CPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXX 225 C A+SKQ RF +++ T + D SAT R +KFV SSPK +ALD Sbjct: 52 CLAAISKQAQRFFSAVLPTVATSDTSATNRLIKKFVASSPKSIALDALSNLLSPDSTHHP 111 Query: 226 XX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDM 402 + PLYL I++ SWF+WN KLVA V+ ++ K + + L+ ET+ +L +ER++ Sbjct: 112 LLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKEREL 171 Query: 403 CKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPRE 582 FYC LI ++KH D Y + QF + S+S+YVKK+ Y +MIS LCE+G+ RE Sbjct: 172 VLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVS-DSNSVYVKKQGYKAMISGLCEMGRARE 230 Query: 583 AEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLA 762 AE+++ EMR GLK FEFR ++Y YGR+GL +DM+R + KM+S E+DTVCANMVLA Sbjct: 231 AEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVLA 290 Query: 763 SLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIK-SIPISTEELLN 939 S GAH L +M WLR MK+ GIP SIRT NSVLNSCP I+ +M+++ S P+S +ELL Sbjct: 291 SYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELLK 350 Query: 940 SLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119 LSE+EA +V+EL+ SSVL EA +W++SE KLDLHGMHL S+Y+I LQW++ R+RL+ G Sbjct: 351 
ILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSDG 410 Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299 ++P EITVVCGSG HS VRG+SPVK +I E++ + + P++I R N GCFVAKG V Sbjct: 411 EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVVKK 470 Query: 1300 WLC 1308 WLC Sbjct: 471 WLC 473 >ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] gi|557110519|gb|ESQ50810.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum] Length = 469 Score = 410 bits (1054), Expect = e-112 Identities = 208/435 (47%), Positives = 293/435 (67%) Frame = +1 Query: 4 TSLSRRCSLDQRALCPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALD 183 TS+ RC + L KQGHRFL+SL++ + D SAT R +KFV +SPK V+L+ Sbjct: 40 TSMEVRCKAGT-----VPLMKQGHRFLSSLSSPALAGDPSATNRHIKKFVAASPKSVSLN 94 Query: 184 XXXXXXXXXXXXXXXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIM 363 A LY IT+ SWF+WN KL+A+++A++ K E ++ETL+ Sbjct: 95 VLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLS 154 Query: 364 ETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVS 543 + +L ERD+ FYC L+ES++K + ++ +++ T S+S+YVK +AY S Sbjct: 155 NAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREI-TRRSTSVYVKTQAYKS 213 Query: 544 MISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQG 723 M+S LC + QP +AE V+EEMR +K FE++S++Y YGR+GL EDM R V +M+++G Sbjct: 214 MVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEG 273 Query: 724 FELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDI 903 ++DTVC+NMVL+S GAH L QM SWL+ +K S +P S RTYNSVLNSCP I+ ++KD+ Sbjct: 274 HKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDL 333 Query: 904 KSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQ 1083 S P+S ELL L++DE +VR L SSVLDEA+EW+S E KLDLHGMHLSSSYLI +Q Sbjct: 334 DSCPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQ 393 Query: 1084 WIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNA 1263 W+D +R R + G ++P EI +V GSGKHS VRG+SPVK L++++++R P++I R N Sbjct: 394 
WMDEMRIRFSEGKCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNI 453 Query: 1264 GCFVAKGKVFMDWLC 1308 G F+AKGK +WLC Sbjct: 454 GSFIAKGKTVKEWLC 468 >gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis] Length = 517 Score = 409 bits (1051), Expect = e-111 Identities = 211/418 (50%), Positives = 284/418 (67%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 AL+KQGHRFL++L+ + + SA + KFV SSPK ++L+ + Sbjct: 103 ALTKQGHRFLSTLSINAG--NASAANKLIGKFVASSPKSISLNALSHLLSPDTTHTHLTS 160 Query: 235 MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 + LY I + SWF ++ KLVA + A++ K + +AE LI E + KL ++R++ FY Sbjct: 161 HSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELAVFY 220 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 C L+ES +K K Y + Q SSS+ YVK RA+ +M+ +LC + +P EAE + Sbjct: 221 CSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSA-YVKCRAFETMVGALCTMDRPCEAESL 279 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 MEEMR GLK S FEFRSLVY YGR+GL EDM R+V +M+ +G +DT+C+NMVL+S GA Sbjct: 280 MEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSYGA 339 Query: 775 HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954 H EL QMV WL+ M++S IPFSIRTYNSVLN CP I M++D+K IP+S EL +L D Sbjct: 340 HNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNATLRGD 399 Query: 955 EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134 E +V ELVGSSVL+E + W+S E+KLDLHGMHL S+YLI L+W++ + R GN +P Sbjct: 400 EGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDGNHGIP 459 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 E+ VVCGSGKHS VRG SPVK L++EM++++K P+KI R NAGCF+AKGK DWLC Sbjct: 460 AEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNAGCFLAKGKTVRDWLC 517 >ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like [Fragaria vesca subsp. 
vesca] Length = 448 Score = 407 bits (1045), Expect = e-111 Identities = 202/418 (48%), Positives = 284/418 (67%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 AL+KQG RFLT LAA + + S + KF+++SPK AL + Sbjct: 34 ALTKQGQRFLTKLAANAG--NPSVANKLISKFLSTSPKSTALTTLSYLLSPHTAHPHLSS 91 Query: 235 MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 +A P+Y IT+ SWF WN KLVA ++A++ K Q+E LI ET+ KL +ER++ +F+ Sbjct: 92 LALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEALISETISKLGNKERELVQFH 151 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 C L+ES +K K + Q +SSS+YVK+RA+ SM+ LC + +P EA+E+ Sbjct: 152 CQLVESHSKMSSKCGFDRACTYLHQLLQ-NSSSVYVKRRAFESMVGGLCAMDRPGEADEL 210 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 +EEMR GLK S FEFRS+VY YGR+G+ E+M + V +M+ QGF DT+C NMVL+S GA Sbjct: 211 IEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQGFGDDTICCNMVLSSYGA 270 Query: 775 HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954 H EL M +WLR MK S +PFS+RTYNSVLNSCP I+ M+++ K++P S EL L D Sbjct: 271 HNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQEPKAVPCSVGELSGVLDGD 330 Query: 955 EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134 EA +V+ELVGS+V+DEA+ W+S+E KLDLHGMHL S+YL+ L+W + + +R ++P Sbjct: 331 EALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVMLEWFEAMGNRFKSAECVVP 390 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 E+ +VCG GKHS+VRG+SPVK L++EM+ +++ P++I R N GCF+AKG+ DWLC Sbjct: 391 AEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRKNVGCFIAKGRAVKDWLC 448 >ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] gi|482566151|gb|EOA30340.1| hypothetical protein CARUB_v10013465mg [Capsella rubella] Length = 516 Score = 405 bits (1042), Expect = e-110 Identities = 206/418 (49%), Positives = 286/418 (68%), Gaps = 1/418 (0%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L KQGH+FL+SL++ + D AT R +KFV +SPK VAL+ Sbjct: 99 
LMKQGHQFLSSLSSPALAGDPPATNRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYF 158 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LYL IT+ SWF+WN KL+ ++++++ K E F ++ETL+ + +L+ ERD F C Sbjct: 159 APQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFALFLC 218 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES++K + SD +++ SSS+YVK +AY SM+S LC + QP +AE V+ Sbjct: 219 NLVESNSKQGSIQGFSDACSRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPLDAERVI 277 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR +K FE++S++Y YGR+GL +DM R V +M++QG ++DTVC+NMVL+S GAH Sbjct: 278 EEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAH 337 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 L QM SWL+ +K +P SIRTYNSVLNSCP I+ ++KD+ S P+S ELL L+EDE Sbjct: 338 DALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNEDE 397 Query: 958 ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQ-MLP 1134 A +VREL S VLDEA+EWN+ E KLDLHGMHLS+SYLI LQW+D R R + + ++P Sbjct: 398 ALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCVVP 457 Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EI VV GSGKHS VRG+SPVK +++++++R K P++I R N G F+AKGK +WLC Sbjct: 458 AEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515 >ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 505 Score = 405 bits (1040), Expect = e-110 Identities = 207/417 (49%), Positives = 280/417 (67%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L K G RFL+SL++ + D SA R +KFV +SPK VAL+ Sbjct: 89 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148 Query: 238 
AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF+WN KL+A++IA++ K E F ++ETL+ + +L ERD F C Sbjct: 149 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES++K + S+ +++ SSS+YVK +AY SM+S LC + QP +AE V+ Sbjct: 209 NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 267 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR +K FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH Sbjct: 268 EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 327 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 L QM SWL+ +K +PFSIRTYNSVLNSCP I+ M+KD+ S P+S EL L+EDE Sbjct: 328 DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 387 Query: 958 ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137 A +V EL SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D R R + ++P Sbjct: 388 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 447 Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EI VV GSGKHS VRG+SPVK L++++++R P++I R N G F+AKGK +WLC Sbjct: 448 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504 >dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana] Length = 501 Score = 405 bits (1040), Expect = e-110 Identities = 207/417 (49%), Positives = 280/417 (67%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L K G RFL+SL++ + D SA R +KFV +SPK VAL+ Sbjct: 85 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF+WN KL+A++IA++ K E F ++ETL+ + +L ERD F C Sbjct: 145 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES++K + S+ +++ SSS+YVK +AY SM+S LC + QP +AE V+ Sbjct: 205 NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 263 
Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR +K FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH Sbjct: 264 EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 323 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 L QM SWL+ +K +PFSIRTYNSVLNSCP I+ M+KD+ S P+S EL L+EDE Sbjct: 324 DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 383 Query: 958 ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137 A +V EL SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D R R + ++P Sbjct: 384 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 443 Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EI VV GSGKHS VRG+SPVK L++++++R P++I R N G F+AKGK +WLC Sbjct: 444 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500 >ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown protein [Arabidopsis thaliana] gi|330251481|gb|AEC06575.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 405 bits (1040), Expect = e-110 Identities = 207/417 (49%), Positives = 280/417 (67%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L K G RFL+SL++ + D SA R +KFV +SPK VAL+ Sbjct: 88 LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF+WN KL+A++IA++ K E F ++ETL+ + +L ERD F C Sbjct: 148 ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES++K + S+ +++ SSS+YVK +AY SM+S LC + QP +AE V+ Sbjct: 208 NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 266 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR +K FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH Sbjct: 
267 EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 326 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 L QM SWL+ +K +PFSIRTYNSVLNSCP I+ M+KD+ S P+S EL L+EDE Sbjct: 327 DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 386 Query: 958 ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137 A +V EL SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D R R + ++P Sbjct: 387 ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 446 Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EI VV GSGKHS VRG+SPVK L++++++R P++I R N G F+AKGK +WLC Sbjct: 447 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503 >ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297329872|gb|EFH60291.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 504 Score = 402 bits (1032), Expect = e-109 Identities = 203/417 (48%), Positives = 281/417 (67%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L KQG RFL+SL++ + D SAT R +KFV +SPK V L+ Sbjct: 88 LMKQGDRFLSSLSSPALAGDPSATHRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFF 147 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF+WN KL+A+++AV+ E F ++ETL+ + +L ERD F C Sbjct: 148 ALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFALFLC 207 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES++K + ++ +++ SSS+YVK +AY SM++ LC + QP +AE V+ Sbjct: 208 NLVESNSKQGSIQGFNEACFRLRERIQ-RSSSVYVKTQAYKSMVAGLCNMDQPHDAERVI 266 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 EEMR +K FE +S++Y YGR+GL +DM R V +M+++G ++DTVC+NMVL+S GAH Sbjct: 267 EEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAH 326 Query: 778 GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957 L QM SWL+ +K +PFSIRTYNSVLNSCP I+ ++KD+ S P+S EL L+EDE Sbjct: 327 
DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNEDE 386 Query: 958 ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137 A +V EL S+VLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D +R R ++P Sbjct: 387 ALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRDQKCVIPA 446 Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308 EI VV GSGKHS VRG+SPVK L++++++R + P++I R N G F+AKGK +WLC Sbjct: 447 EIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503 >gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo] Length = 488 Score = 392 bits (1006), Expect = e-106 Identities = 199/422 (47%), Positives = 286/422 (67%), Gaps = 5/422 (1%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L+KQ HRFL++L+ T + D SAT R RKFV SSPK + L + Sbjct: 45 LTKQTHRFLSTLSTTGATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSA 104 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF WN+KLVAD++A + ++ + ++E LI E + KL QER + FY Sbjct: 105 ALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYS 164 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES +KH + D Y + + +S S+YVK+RAY SM++ LC + +P EAE ++ Sbjct: 165 QLVESQSKHGFERGFGDSYSRLFELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAESLV 223 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 +EMR G+ + +E+RS++YAYG +GL E+MKRS+ +M++ ELDTVC+NMVL+S GAH Sbjct: 224 KEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAH 283 Query: 778 GELMQMVSWLRIMK-SSGIPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSLS 948 +L M+ WL+ MK SS S+RTYNSVLNSCP+I M++D KS +P+ E+L+ L Sbjct: 284 NKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILD 343 Query: 949 EDE-ANMVREL-VGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122 DE A +V+EL VGSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI +R + Sbjct: 344 GDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDES 403 Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302 ++P 
++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK +W Sbjct: 404 NVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNW 463 Query: 1303 LC 1308 LC Sbjct: 464 LC 465 >gb|AGH33847.1| PPR [Cucumis melo] Length = 488 Score = 390 bits (1003), Expect = e-106 Identities = 198/422 (46%), Positives = 287/422 (68%), Gaps = 5/422 (1%) Frame = +1 Query: 58 LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237 L+KQ HRFL++L+ T++ D SAT R RKFV SSPK + L + Sbjct: 45 LTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSA 104 Query: 238 AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417 A LY IT+ SWF WN+KLVAD++A + ++ + ++E LI E + KL QER + FY Sbjct: 105 ALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYS 164 Query: 418 YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597 L+ES +KH + D Y + + +S S+YVK+RAY SM++ LC + +P EAE ++ Sbjct: 165 QLVESQSKHGFERGFGDSYSRLFELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAESLV 223 Query: 598 EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777 +EMR G+ + +E+RS++YAYG +GL E+MKRS+ +M++ ELDTVC+NMVL+S GAH Sbjct: 224 KEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAH 283 Query: 778 GELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSLS 948 +L M+ WL+ MK+S S+RTYNSVLNSCP+I M++D KS +P+ E+L+ L Sbjct: 284 NKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILD 343 Query: 949 EDE-ANMVREL-VGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122 DE A +V+EL VGSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI +R + Sbjct: 344 GDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDES 403 Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302 ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK +W Sbjct: 404 YVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNW 463 Query: 1303 LC 1308 LC Sbjct: 464 LC 465 >ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus] Length = 1296 Score = 390 bits (1001), Expect = e-105 Identities = 196/423 (46%), Positives = 
285/423 (67%), Gaps = 5/423 (1%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 +L+KQ HRFL++L+ T++ D SAT R RKFV SSPK + L + Sbjct: 44 SLTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCS 103 Query: 235 MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 A LY IT+ SWF WN+KLVAD++A + ++ + ++E LI E + KL QER + FY Sbjct: 104 AALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKLGSQERKLVNFY 163 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 L+ES +KH + D Y + + +S S+YVK+RAY SM++ LC + +P EAE + Sbjct: 164 SQLVESQSKHGFERGFVDSYSRLLELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAENL 222 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 ++EMR G+ + +E+RS++YAYG +GL E+MKRS+ +M++ ELDTVC+NMVL+S GA Sbjct: 223 VKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGA 282 Query: 775 HGELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSL 945 H +L MV WL+ MK+S S+RTYNSVLNSCP+I M++D KS +P+ E+L+ L Sbjct: 283 HNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVL 342 Query: 946 SEDEANMVRE--LVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119 DE ++ E L GSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI +R Sbjct: 343 DGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDE 402 Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299 + ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK + Sbjct: 403 SYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKN 462 Query: 1300 WLC 1308 WLC Sbjct: 463 WLC 465 >ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus] Length = 1913 Score = 390 bits (1001), Expect = e-105 Identities = 196/423 (46%), Positives = 285/423 (67%), Gaps = 5/423 (1%) Frame = +1 Query: 55 ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234 +L+KQ HRFL++L+ T++ D SAT R RKFV SSPK + L + Sbjct: 44 SLTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCS 103 Query: 235 
MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414 A LY IT+ SWF WN+KLVAD++A + ++ + ++E LI E + KL QER + FY Sbjct: 104 AALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKLGSQERKLVNFY 163 Query: 415 CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594 L+ES +KH + D Y + + +S S+YVK+RAY SM++ LC + +P EAE + Sbjct: 164 SQLVESQSKHGFERGFVDSYSRLLELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAENL 222 Query: 595 MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774 ++EMR G+ + +E+RS++YAYG +GL E+MKRS+ +M++ ELDTVC+NMVL+S GA Sbjct: 223 VKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGA 282 Query: 775 HGELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSL 945 H +L MV WL+ MK+S S+RTYNSVLNSCP+I M++D KS +P+ E+L+ L Sbjct: 283 HNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVL 342 Query: 946 SEDEANMVRE--LVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119 DE ++ E L GSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI +R Sbjct: 343 DGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDE 402 Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299 + ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK + Sbjct: 403 SYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKN 462 Query: 1300 WLC 1308 WLC Sbjct: 463 WLC 465