BLASTX nr result
ID: Catharanthus22_contig00035440
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00035440
         (701 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004238502.1| PREDICTED: pentatricopeptide repeat-containi...   249   5e-64
ref|XP_006359053.1| PREDICTED: pentatricopeptide repeat-containi...   248   1e-63
gb|EPS70180.1| hypothetical protein M569_04582 [Genlisea aurea]       232   1e-58
gb|ESW03397.1| hypothetical protein PHAVU_011G010900g [Phaseolus...   229   6e-58
ref|XP_004159440.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   226   5e-57
ref|XP_004140941.1| PREDICTED: pentatricopeptide repeat-containi...   226   5e-57
ref|XP_004515953.1| PREDICTED: pentatricopeptide repeat-containi...   224   2e-56
ref|XP_002306730.2| pentatricopeptide repeat-containing family p...   215   1e-53
gb|EOY17416.1| Tetratricopeptide repeat (TPR)-like superfamily p...   214   2e-53
gb|EMJ00471.1| hypothetical protein PRUPE_ppa022734mg [Prunus pe...   214   2e-53
ref|XP_006307085.1| hypothetical protein CARUB_v10008671mg [Caps...   211   2e-52
ref|XP_002527112.1| pentatricopeptide repeat-containing protein,...   209   7e-52
ref|XP_006415018.1| hypothetical protein EUTSA_v10010025mg [Eutr...   206   6e-51
ref|XP_006434562.1| hypothetical protein CICLE_v10003512mg [Citr...   204   2e-50
gb|EXC75282.1| hypothetical protein L484_000391 [Morus notabilis]     203   4e-50
ref|XP_006473158.1| PREDICTED: pentatricopeptide repeat-containi...   203   4e-50
ref|XP_002891080.1| pentatricopeptide repeat-containing protein ...   199   7e-49
gb|AAG12522.1|AC015446_3 Hypothetical Protein [Arabidopsis thali...   198   2e-48
ref|NP_174678.2| pentatricopeptide repeat-containing protein [Ar...   198   2e-48
ref|XP_002265412.1| PREDICTED: pentatricopeptide repeat-containi...   196   5e-48

>ref|XP_004238502.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Solanum lycopersicum]
          Length = 574

 Score = 249 bits (637), Expect = 5e-64
 Identities = 124/199 (62%), Positives = 153/199 (76%)
 Frame = +1

Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282
           AYV+ LL +CT F  +KQLQAHLI TG F +   RAK LDF A SSAG+  YA  +F  I
Sbjct: 2   AYVDSLLSKCTCFSKLKQLQAHLIITGNFQFYTCRAKFLDFCAVSSAGNLPYATHIFRHI 61

Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETF 462
             P  N+WNAIIRGLAQS +P++A+ FYVSM  + CKPDALTCSF LKAC+RALAR ET
Sbjct: 62  TSPFKNEWNAIIRGLAQSHKPIDALTFYVSMSRSLCKPDALTCSFTLKACARALARSETP 121

Query: 463 QMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGN 642
           Q+H+ V++FG  +D+LL+TTLLDAY+K GDL+ A  +FDEM +RDIASWN+LI+GLAQGN
Sbjct: 122 QLHTHVIRFGFDADVLLRTTLLDAYSKSGDLDYAYKVFDEMGVRDIASWNALIAGLAQGN 181

Query: 643 QPEEALELFKRMKENGPLP 699
           +P EAL LFK+M+E    P
Sbjct: 182 RPTEALLLFKKMREEDMEP 200

 Score = 101 bits (252), Expect = 2e-19
 Identities = 73/230 (31%), Positives = 109/230 (47%), Gaps = 1/230 (0%)
 Frame = +1

Query: 4   DCLTCWGAATNPTQSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTG 183
           D LT + + +     P + T  + +L AC            R  A     QL  H+I  G
Sbjct: 84  DALTFYVSMSRSLCKPDALT-CSFTLKACA-----------RALARSETPQLHTHVIRFG 131

Query: 184 LFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIF 363
                L R  LLD  A S +G   YA  VFD +       WNA+I GLAQ ++P  A++
Sbjct: 132 FDADVLLRTTLLD--AYSKSGDLDYAYKVFDEMGVRDIASWNALIAGLAQGNRPTEALLL 189

Query: 364 YVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAK 543
           +  M     +P+ +T    L ACS+  A  E   +H  +    +   +++   ++D YAK
Sbjct: 190 FKKMREEDMEPNEVTVLGALSACSQLGANKEGELVHEYIKSKNLDCKVIVCNAVIDMYAK 249

Query: 544 CGDLNSASNLFDEM-TMRDIASWNSLISGLAQGNQPEEALELFKRMKENG 690
           CG +  A  +F EM  +R   +WN++I  LA     E+ALELF+RM + G
Sbjct: 250 CGVVGRAYEVFSEMKCLRTRVTWNTMIMALAIYGDGEQALELFERMGQAG 299

>ref|XP_006359053.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Solanum
tuberosum] Length = 574 Score = 248 bits (634), Expect = 1e-63 Identities = 123/199 (61%), Positives = 151/199 (75%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 AYV+ LL +CT F +KQLQAHLI TG F + RAK LDF A SSAG+ YA +F I Sbjct: 2 AYVDTLLSKCTCFSKLKQLQAHLIITGNFQFYTCRAKFLDFCAVSSAGNLPYATHIFRHI 61 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETF 462 P N+WNAIIRGLAQS P++A+ FYVSM + CKPDALTCSF LKAC+RALAR ET Sbjct: 62 TSPYKNEWNAIIRGLAQSHNPIDALTFYVSMSRSLCKPDALTCSFTLKACARALARSETP 121 Query: 463 QMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGN 642 Q+H+ V++FG +D+LL+TTLLDAY+KC DL+ A +FDEM +RDIA WN+LI+GLAQGN Sbjct: 122 QLHAHVIRFGFAADVLLRTTLLDAYSKCSDLDYAYKVFDEMGVRDIAIWNALIAGLAQGN 181 Query: 643 QPEEALELFKRMKENGPLP 699 +P EAL LFK+M+E P Sbjct: 182 RPTEALLLFKKMREENMEP 200 Score = 100 bits (248), Expect = 6e-19 Identities = 73/230 (31%), Positives = 109/230 (47%), Gaps = 1/230 (0%) Frame = +1 Query: 4 DCLTCWGAATNPTQSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTG 183 D LT + + + P + T + +L AC R A QL AH+I G Sbjct: 84 DALTFYVSMSRSLCKPDALT-CSFTLKACA-----------RALARSETPQLHAHVIRFG 131 Query: 184 LFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIF 363 L R LLD A S YA VFD + WNA+I GLAQ ++P A++ Sbjct: 132 FAADVLLRTTLLD--AYSKCSDLDYAYKVFDEMGVRDIAIWNALIAGLAQGNRPTEALLL 189 Query: 364 YVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAK 543 + M + +P+ +T L ACS+ A E +H + + S +++ ++D YAK Sbjct: 190 FKKMREENMEPNEVTVLGALSACSQLGANKEGELVHEYIKSKNLDSKVIVCNAVIDMYAK 249 Query: 544 CGDLNSASNLFDEM-TMRDIASWNSLISGLAQGNQPEEALELFKRMKENG 690 CG + A +F+ M R +WN++I LA E+ALELF+RM + G Sbjct: 250 CGLVGRAYEVFNGMKCSRTRVTWNTMIMALAMYGDGEQALELFERMSQAG 299 >gb|EPS70180.1| hypothetical protein M569_04582 [Genlisea aurea] Length = 577 Score = 232 bits (591), Expect = 1e-58 Identities = 113/194 (58%), Positives = 143/194 (73%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 AYVE LL C +F H++Q+ AHL+ TGLF + RAK 
LD+ AT+ + +A F I Sbjct: 2 AYVESLLHNCASFSHVRQIHAHLLATGLFQFYPYRAKFLDYCATAFSSGLRHAVAAFPFI 61 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETF 462 P TNDWNAIIRG AQSD P AV +YVSM A CKPDALTCSF+ KAC+R+L+R+E Sbjct: 62 RLPGTNDWNAIIRGYAQSDSPNEAVAWYVSMSRAPCKPDALTCSFLFKACARSLSRIEAL 121 Query: 463 QMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGN 642 Q+H+ V ++G +D+LLQTTLLDAYAK DL+ A LFDEM+ RDI SWN+LI+GLAQG+ Sbjct: 122 QVHAHVRRYGFFADVLLQTTLLDAYAKFADLDDACKLFDEMSRRDIPSWNALIAGLAQGD 181 Query: 643 QPEEALELFKRMKE 684 +P +AL LF RM+E Sbjct: 182 RPSDALLLFNRMRE 195 Score = 79.3 bits (194), Expect = 1e-12 Identities = 60/216 (27%), Positives = 100/216 (46%), Gaps = 12/216 (5%) Frame = +1 Query: 88 CCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKF 267 C F+ L R A Q+ AH+ G F L + LLD +YAKF Sbjct: 104 CSFLFKACARSLSRIEAL----QVHAHVRRYGFFADVLLQTTLLD----------AYAKF 149 Query: 268 --------VFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASC---KPDALTCS 414 +FD + WNA+I GLAQ D+P +A++ + M + P+ +T Sbjct: 150 ADLDDACKLFDEMSRRDIPSWNALIAGLAQGDRPSDALLLFNRMREGNADDNSPNEVTVL 209 Query: 415 FVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM- 591 L ACS+ A E ++ +++ + ++++ ++D +AK G + A +F+ M Sbjct: 210 GALSACSQLGAIKEADRVFEYILQNNLHHNLIVCNAVIDMFAKSGQIEKAYGVFNSMKCG 269 Query: 592 RDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 R+I +WN++I G A AL LF+ ++ G P Sbjct: 270 RNIVTWNTMIMGFAIDGDGVNALRLFRLVEGRGLKP 305 >gb|ESW03397.1| hypothetical protein PHAVU_011G010900g [Phaseolus vulgaris] Length = 577 Score = 229 bits (584), Expect = 6e-58 Identities = 115/197 (58%), Positives = 148/197 (75%) Frame = +1 Query: 109 VEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPH 288 ++ LL++CT+ +KQLQAHLITTG F + SRAKLL+ + S AG S+A +F RI Sbjct: 7 LDSLLQKCTSLISMKQLQAHLITTGKFQFHPSRAKLLELCSISPAGDLSFAGQIFRRIQT 66 Query: 289 PATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQM 468 P+TNDWNA++RGLAQS +PM A+ +Y +M + K DALTCSF LK C+RALA E Q+ Sbjct: 67 
PSTNDWNAVLRGLAQSPEPMQALSWYRAMSRSPQKVDALTCSFALKGCARALAFSEATQI 126 Query: 469 HSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGNQP 648 HSQ+++FG ++DILL TTLLD YAK GDL++A +FD M RDIASWN++ISGLAQG+QP Sbjct: 127 HSQLLRFGFEADILLLTTLLDVYAKTGDLDAAHKVFDNMQKRDIASWNAMISGLAQGSQP 186 Query: 649 EEALELFKRMKENGPLP 699 EA+ LF RMKE G P Sbjct: 187 NEAIALFNRMKEEGWRP 203 Score = 92.4 bits (228), Expect = 1e-16 Identities = 60/192 (31%), Positives = 95/192 (49%), Gaps = 1/192 (0%) Frame = +1 Query: 127 RCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDW 306 R AF Q+ + L+ G L LLD A + G A VFD + W Sbjct: 116 RALAFSEATQIHSQLLRFGFEADILLLTTLLDVYAKT--GDLDAAHKVFDNMQKRDIASW 173 Query: 307 NAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVK 486 NA+I GLAQ QP A+ + M +P+ +T L ACS+ A +H+ VV Sbjct: 174 NAMISGLAQGSQPNEAIALFNRMKEEGWRPNEVTVLGALSACSQLGALKHGQIIHAYVVD 233 Query: 487 FGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMT-MRDIASWNSLISGLAQGNQPEEALE 663 + +++++ ++D YAKCG ++ A ++F MT + + +WN++I LA +ALE Sbjct: 234 EKLDTNVIVCNAVIDMYAKCGFVDKAYSVFVSMTCKKSLVTWNTMIMALAMNGDGNQALE 293 Query: 664 LFKRMKENGPLP 699 L +M +G +P Sbjct: 294 LLDKMVVDGVVP 305 >ref|XP_004159440.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g34160-like [Cucumis sativus] Length = 576 Score = 226 bits (576), Expect = 5e-57 Identities = 115/200 (57%), Positives = 147/200 (73%), Gaps = 2/200 (1%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 AY LL++C++F IKQLQA+LI G F++ SR KLL+ A SS G SYA +F I Sbjct: 2 AYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRYI 61 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASC--KPDALTCSFVLKACSRALARLE 456 P+P+TNDWNA+IRG A S P NAV +Y +M ++ + DALTCSF LKAC+RALAR E Sbjct: 62 PYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARSE 121 Query: 457 TFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQ 636 Q+HSQ+++FG +D+LLQTTLLDAYAK GDL+ A LFDEM DIASWN+LI+G AQ Sbjct: 122 
AIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFAQ 181 Query: 637 GNQPEEALELFKRMKENGPL 696 G++P +A+ FKRMK +G L Sbjct: 182 GSRPADAIMTFKRMKVDGNL 201 Score = 98.2 bits (243), Expect = 2e-18 Identities = 62/211 (29%), Positives = 105/211 (49%), Gaps = 2/211 (0%) Frame = +1 Query: 73 ISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSF 252 I C F L R A QL + L+ G L + LLD A + G Sbjct: 101 IDALTCSFALKACARALARSEAI----QLHSQLLRFGFNADVLLQTTLLD--AYAKIGDL 154 Query: 253 SYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSM-ICASCKPDALTCSFVLKA 429 A+ +FD +P P WNA+I G AQ +P +A++ + M + + +P+A+T L A Sbjct: 155 DLAQKLFDEMPQPDIASWNALIAGFAQGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLA 214 Query: 430 CSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIAS 606 CS+ A E +H +V+ + S++ + ++D YAKCG ++ A +F+ M + + + Sbjct: 215 CSQLGALKEGESVHKYIVEEKLDSNVQVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLIT 274 Query: 607 WNSLISGLAQGNQPEEALELFKRMKENGPLP 699 WN++I A +AL+LF+++ +G P Sbjct: 275 WNTMIMAFAMHGDGHKALDLFEKLGRSGMSP 305 >ref|XP_004140941.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Cucumis sativus] Length = 576 Score = 226 bits (576), Expect = 5e-57 Identities = 115/200 (57%), Positives = 147/200 (73%), Gaps = 2/200 (1%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 AY LL++C++F IKQLQA+LI G F++ SR KLL+ A SS G SYA +F I Sbjct: 2 AYFNLLLQKCSSFSQIKQLQANLIINGDFHFSSSRTKLLELCAISSFGDLSYALHIFRYI 61 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASC--KPDALTCSFVLKACSRALARLE 456 P+P+TNDWNA+IRG A S P NAV +Y +M ++ + DALTCSF LKAC+RALAR E Sbjct: 62 PYPSTNDWNAVIRGTALSSDPANAVFWYRAMAASNGLHRIDALTCSFALKACARALARSE 121 Query: 457 TFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQ 636 Q+HSQ+++FG +D+LLQTTLLDAYAK GDL+ A LFDEM DIASWN+LI+G AQ Sbjct: 122 AIQLHSQLLRFGFNADVLLQTTLLDAYAKIGDLDLAQKLFDEMPQPDIASWNALIAGFAQ 181 Query: 637 GNQPEEALELFKRMKENGPL 696 G++P +A+ FKRMK +G L Sbjct: 182 GSRPADAIMTFKRMKVDGNL 201 Score = 98.6 bits (244), Expect = 2e-18 Identities = 
62/211 (29%), Positives = 105/211 (49%), Gaps = 2/211 (0%) Frame = +1 Query: 73 ISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSF 252 I C F L R A QL + L+ G L + LLD A + G Sbjct: 101 IDALTCSFALKACARALARSEAI----QLHSQLLRFGFNADVLLQTTLLD--AYAKIGDL 154 Query: 253 SYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSM-ICASCKPDALTCSFVLKA 429 A+ +FD +P P WNA+I G AQ +P +A++ + M + + +P+A+T L A Sbjct: 155 DLAQKLFDEMPQPDIASWNALIAGFAQGSRPADAIMTFKRMKVDGNLRPNAVTVQGALLA 214 Query: 430 CSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIAS 606 CS+ A E +H +V+ + S++ + ++D YAKCG ++ A +F+ M + + + Sbjct: 215 CSQLGALKEGESVHKYIVEEKLNSNVQVCNVVIDMYAKCGSMDKAYWVFENMRCDKSLIT 274 Query: 607 WNSLISGLAQGNQPEEALELFKRMKENGPLP 699 WN++I A +AL+LF+++ +G P Sbjct: 275 WNTMIMAFAMHGDGHKALDLFEKLGRSGMSP 305 >ref|XP_004515953.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Cicer arietinum] Length = 577 Score = 224 bits (572), Expect = 2e-56 Identities = 110/198 (55%), Positives = 149/198 (75%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 +++ LL++C + H+KQLQAHLITTG F + SR KLL+ + S +G S+A +F +I Sbjct: 6 HIDSLLQKCNSLIHMKQLQAHLITTGKFQFHPSRTKLLELFSISPSGDLSFAGKLFRQIQ 65 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQ 465 +P+TND+NA++RGLAQS +P A+++Y SM+ K DALTCSF LK C+RALA E Q Sbjct: 66 NPSTNDYNAVLRGLAQSSEPTQAILWYRSMLRYLQKIDALTCSFALKGCARALAFSEATQ 125 Query: 466 MHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGNQ 645 +HSQ+++FG +D+LL TTLLD YAK G L+ A+ +FDEM RDIASWN++ISGLAQG++ Sbjct: 126 LHSQLLRFGFDADVLLVTTLLDVYAKTGYLDDATKVFDEMPQRDIASWNAMISGLAQGSR 185 Query: 646 PEEALELFKRMKENGPLP 699 P EAL+LF RMKE G P Sbjct: 186 PNEALDLFNRMKEEGWKP 203 Score = 86.7 bits (213), Expect = 7e-15 Identities = 60/192 (31%), Positives = 92/192 (47%), Gaps = 1/192 (0%) Frame = +1 Query: 127 RCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDW 306 R AF QL + L+ G L LLD A + G A VFD +P W Sbjct: 116 
RALAFSEATQLHSQLLRFGFDADVLLVTTLLDVYAKT--GYLDDATKVFDEMPQRDIASW 173 Query: 307 NAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVK 486 NA+I GLAQ +P A+ + M KP+ +T L ACS+ A + +H VV Sbjct: 174 NAMISGLAQGSRPNEALDLFNRMKEEGWKPNEVTVLGALSACSQLGALKQGEIVHGYVVD 233 Query: 487 FGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMR-DIASWNSLISGLAQGNQPEEALE 663 + ++++ ++D YAKCG ++ A ++F M+ R + +WN++I A +AL+ Sbjct: 234 EKLDVNVIVCNAVIDMYAKCGFVDKAYSVFSSMSCRKSLITWNTMIMAFAMNGDGYKALD 293 Query: 664 LFKRMKENGPLP 699 L M +G P Sbjct: 294 LLDGMFLDGTCP 305 >ref|XP_002306730.2| pentatricopeptide repeat-containing family protein, partial [Populus trichocarpa] gi|550339513|gb|EEE93726.2| pentatricopeptide repeat-containing family protein, partial [Populus trichocarpa] Length = 577 Score = 215 bits (548), Expect = 1e-53 Identities = 116/205 (56%), Positives = 145/205 (70%), Gaps = 4/205 (1%) Frame = +1 Query: 97 MAAYVEHLLRRCT--AFPHIKQLQAHLITTGLFNYQLS--RAKLLDFSATSSAGSFSYAK 264 MA+ ++ L +CT + PH KQL AHL TTG F +S R+KLL+ A S G+ S+A Sbjct: 1 MASSLDSFLSKCTTLSLPHTKQLHAHLFTTGQFRLPISPARSKLLELYALS-LGNLSFAI 59 Query: 265 FVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRAL 444 F +I P+TNDWNAIIRG QS P NA +Y SMI S K DALTCSFVLKAC+R L Sbjct: 60 LTFSQIRTPSTNDWNAIIRGFIQSPNPTNAFAWYKSMISKSRKVDALTCSFVLKACARVL 119 Query: 445 ARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLIS 624 ARLE+ Q+H+ +V+ G +D LL TTLLD YAK G+++SA +FDEM RDIASWN+LIS Sbjct: 120 ARLESIQIHTHIVRKGFIADALLGTTLLDVYAKVGEIDSAEKVFDEMVKRDIASWNALIS 179 Query: 625 GLAQGNQPEEALELFKRMKENGPLP 699 G AQG++P EAL LFKRM+ +G P Sbjct: 180 GFAQGSKPTEALSLFKRMEIDGFKP 204 Score = 89.0 bits (219), Expect = 1e-15 Identities = 65/233 (27%), Positives = 107/233 (45%), Gaps = 14/233 (6%) Frame = +1 Query: 43 QSPSSTTEIA-----------ISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLF 189 QSP+ T A + C F+ +L R + Q+ H++ G Sbjct: 82 QSPNPTNAFAWYKSMISKSRKVDALTCSFVLKACARVLARLESI----QIHTHIVRKGFI 137 Query: 190 NYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYV 369 L LLD A G A+ VFD + 
WNA+I G AQ +P A+ + Sbjct: 138 ADALLGTTLLDVYA--KVGEIDSAEKVFDEMVKRDIASWNALISGFAQGSKPTEALSLFK 195 Query: 370 SMICASCKPDALTCSFVLKACSRALARLETFQMHS--QVVKFGVKSDILLQTTLLDAYAK 543 M KP+ ++ L AC++ E ++H +V +F + + + ++D YAK Sbjct: 196 RMEIDGFKPNEISVLGALSACAQLGDFKEGEKIHGYIKVERFDMNAQVC--NVVIDMYAK 253 Query: 544 CGDLNSASNLFDEMTMR-DIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 CG ++ A +F+ M+ R DI +WN++I A + +ALELF++M ++G P Sbjct: 254 CGFVDKAYLVFESMSCRKDIVTWNTMIMAFAMHGEGCKALELFEKMDQSGVSP 306 >gb|EOY17416.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao] Length = 574 Score = 214 bits (546), Expect = 2e-53 Identities = 113/199 (56%), Positives = 140/199 (70%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 A +E L++RC AF HIKQLQA+ ITTG F +R+KLLD A + GS S+A +F +I Sbjct: 2 ANLESLVQRCAAFSHIKQLQAYFITTGNFQSCRTRSKLLDLCAVAPFGSLSFAIVIFRQI 61 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETF 462 P TND+NAIIRGL QS +P A +Y +M S + DALTCSF LKAC+R LA E Sbjct: 62 RSPFTNDFNAIIRGLIQSPEPSTAFQWYRTMQRGSFRLDALTCSFTLKACARVLAATEAL 121 Query: 463 QMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQGN 642 Q+H+ +V+FG +D LL TTLLD YAK GDL +A +F EM RDIASWNSLI GLAQG+ Sbjct: 122 QLHANIVRFGFMADALLATTLLDVYAKVGDLGNARKVFGEMPRRDIASWNSLILGLAQGD 181 Query: 643 QPEEALELFKRMKENGPLP 699 Q EAL+LFKRM+ +G P Sbjct: 182 QASEALDLFKRMEVDGLTP 200 Score = 85.9 bits (211), Expect = 1e-14 Identities = 56/192 (29%), Positives = 90/192 (46%), Gaps = 1/192 (0%) Frame = +1 Query: 127 RCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDW 306 R A QL A+++ G L LLD A G A+ VF +P W Sbjct: 113 RVLAATEALQLHANIVRFGFMADALLATTLLDVYA--KVGDLGNARKVFGEMPRRDIASW 170 Query: 307 NAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVK 486 N++I GLAQ DQ A+ + M P+ +T L ACSR E ++H + Sbjct: 171 NSLILGLAQGDQASEALDLFKRMEVDGLTPNEVTVLGALSACSRMGDFKEGEKIHGFIRN 230 Query: 487 FGVKSDILLQTTLLDAYAKCGDLNSASNLFDEM-TMRDIASWNSLISGLAQGNQPEEALE 663 ++ ++ + ++D YA CG ++ A +FD+M + + 
+WN+++ A +ALE Sbjct: 231 AKLELNVQVCNAVIDMYANCGFVDKAYGVFDDMGCNKSLVTWNTMVMAFAMDGDGHKALE 290 Query: 664 LFKRMKENGPLP 699 LF++M G P Sbjct: 291 LFEQMDGAGLQP 302 >gb|EMJ00471.1| hypothetical protein PRUPE_ppa022734mg [Prunus persica] Length = 576 Score = 214 bits (546), Expect = 2e-53 Identities = 110/195 (56%), Positives = 142/195 (72%), Gaps = 1/195 (0%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLS-RAKLLDFSATSSAGSFSYAKFVFDR 279 A +E LL++CT+ IKQLQ+HL+T+G F + S KL++ A S S+A +F + Sbjct: 2 ANLESLLQKCTSLARIKQLQSHLLTSGKFQFYPSLTTKLIELCALSPIADLSHAITLFHQ 61 Query: 280 IPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLET 459 + P+TN WNA++RGLAQS QP A+ +Y +M AS K DALTCSF LKAC+RALA E Sbjct: 62 LRKPSTNQWNAVVRGLAQSLQPTQAISWYKTMSKASQKVDALTCSFALKACARALAFSEA 121 Query: 460 FQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQG 639 Q+HSQ+V+FG D+LLQTTLLD YAK GDL A +FDEM+ RDIASWN+LI+GLAQG Sbjct: 122 MQIHSQIVRFGFGVDVLLQTTLLDVYAKVGDLGFAQKVFDEMSERDIASWNALIAGLAQG 181 Query: 640 NQPEEALELFKRMKE 684 ++P EA+ LFKRM E Sbjct: 182 SRPTEAIALFKRMSE 196 Score = 83.6 bits (205), Expect = 6e-14 Identities = 55/193 (28%), Positives = 93/193 (48%), Gaps = 2/193 (1%) Frame = +1 Query: 127 RCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDW 306 R AF Q+ + ++ G L + LLD A G +A+ VFD + W Sbjct: 114 RALAFSEAMQIHSQIVRFGFGVDVLLQTTLLDVYA--KVGDLGFAQKVFDEMSERDIASW 171 Query: 307 NAIIRGLAQSDQPMNAVIFYVSMICAS-CKPDALTCSFVLKACSRALARLETFQMHSQVV 483 NA+I GLAQ +P A+ + M KP+ +T L ACS+ ++H ++ Sbjct: 172 NALIAGLAQGSRPTEAIALFKRMSEEEGLKPNEVTVLGALSACSQLGGVKGGEKIHVYIM 231 Query: 484 KFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIASWNSLISGLAQGNQPEEAL 660 + + +++ ++D YAKCG ++ A +F M +++ +WN++I A +AL Sbjct: 232 EEKLDMHVIVCNAVIDMYAKCGFVDKAYWVFKNMKCGKNLITWNTMIMAFAMHGDGGKAL 291 Query: 661 ELFKRMKENGPLP 699 ELF M ++G P Sbjct: 292 ELFGEMAKSGVCP 304 >ref|XP_006307085.1| hypothetical protein CARUB_v10008671mg [Capsella rubella] gi|482575796|gb|EOA39983.1| hypothetical protein CARUB_v10008671mg [Capsella rubella] 
Length = 585 Score = 211 bits (537), Expect = 2e-52 Identities = 110/201 (54%), Positives = 136/201 (67%), Gaps = 6/201 (2%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 Y+E +++RC F IKQLQ+H +T G F R++LLD A S G S+A +F RIP Sbjct: 5 YMETMIQRCVTFSQIKQLQSHFLTAGHFQSSFLRSRLLDRCAVSPFGDLSFAVKIFRRIP 64 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMI------CASCKPDALTCSFVLKACSRALA 447 P TNDWNAIIRG A S QP A +Y SM+ A C+ DALTCSF LKAC+RAL Sbjct: 65 KPFTNDWNAIIRGFAASSQPSLAFSWYRSMLRQSSSSSALCRVDALTCSFTLKACARALC 124 Query: 448 RLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISG 627 L T Q+H Q+ G +D LL TTLLDAY+K GDL SA LFDEM++RD+ASWN+LISG Sbjct: 125 SLATVQIHGQISSRGFLADALLCTTLLDAYSKNGDLVSAQKLFDEMSVRDVASWNALISG 184 Query: 628 LAQGNQPEEALELFKRMKENG 690 L GN+ EAL+L+KRM+ G Sbjct: 185 LVSGNRANEALDLYKRMEVEG 205 Score = 82.4 bits (202), Expect = 1e-13 Identities = 58/221 (26%), Positives = 101/221 (45%), Gaps = 2/221 (0%) Frame = +1 Query: 43 QSPSSTTEIAISLFACCF-MAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLL 219 QS SS+ + C F + A L T Q+ + + G L LL Sbjct: 97 QSSSSSALCRVDALTCSFTLKACARALCSLATV-----QIHGQISSRGFLADALLCTTLL 151 Query: 220 DFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPD 399 D A S G A+ +FD + WNA+I GL ++ A+ Y M + + Sbjct: 152 D--AYSKNGDLVSAQKLFDEMSVRDVASWNALISGLVSGNRANEALDLYKRMEVEGIRRN 209 Query: 400 ALTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFD 579 +T L ACS A E +++S + ++++ +D Y+KCG ++ A +FD Sbjct: 210 EVTFVAALGACSHLGAIKEGEKIYSYFKNANLDHNVIVNNAAIDMYSKCGFVDKAFEVFD 269 Query: 580 EMT-MRDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 ++T + + +WN++I G A + A+E+F+ +++NG P Sbjct: 270 QITGKKSVVTWNTMIMGFAVHGEAHRAIEIFEELEKNGIKP 310 >ref|XP_002527112.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223533535|gb|EEF35275.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 364 Score = 209 bits (532), Expect = 7e-52 Identities = 112/203 (55%), Positives = 145/203 (71%), Gaps = 2/203 
(0%) Frame = +1 Query: 97 MAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLS--RAKLLDFSATSSAGSFSYAKFV 270 MA ++ LL +CT H +Q+Q HLITTG F +++S R+KLL+F A S + S A Sbjct: 1 MATLLDSLLPKCTTLSHAEQIQCHLITTGHFQFKISSSRSKLLEFFALS-LNNLSVAIKA 59 Query: 271 FDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALAR 450 F +I P+TNDWNA++RGL QS P+++ +Y +MI S K DALTCSFVLKAC+R LA Sbjct: 60 FYQILTPSTNDWNAVLRGLIQSPDPIDSFKWYKTMIRGSYKVDALTCSFVLKACARVLAF 119 Query: 451 LETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGL 630 E+ Q+HS +V+ G +D LL TTLLD YAK GDL+SA +FDEM ++DIASWN+LISG Sbjct: 120 SESTQLHSHIVRKGFVADALLGTTLLDLYAKTGDLDSAQKMFDEMIVKDIASWNALISGF 179 Query: 631 AQGNQPEEALELFKRMKENGPLP 699 AQGN+P EAL LFKRM+ G P Sbjct: 180 AQGNKPSEALGLFKRMEVLGFKP 202 Score = 90.5 bits (223), Expect = 5e-16 Identities = 58/200 (29%), Positives = 96/200 (48%), Gaps = 1/200 (0%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 ++V R AF QL +H++ G L LLD A + G A+ +FD + Sbjct: 107 SFVLKACARVLAFSESTQLHSHIVRKGFVADALLGTTLLDLYAKT--GDLDSAQKMFDEM 164 Query: 283 PHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETF 462 WNA+I G AQ ++P A+ + M KP+ +T L ACS+ A E Sbjct: 165 IVKDIASWNALISGFAQGNKPSEALGLFKRMEVLGFKPNEITVLGALSACSQLGAFKEGE 224 Query: 463 QMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIASWNSLISGLAQG 639 ++H + + ++ + +D YAKCG + A +F+ M+ + + +WN++I A Sbjct: 225 KIHEYIRSQKLDMNVQVCNAAIDMYAKCGFADKAYLVFESMSCGKSLVTWNTMIMAFAMH 284 Query: 640 NQPEEALELFKRMKENGPLP 699 ++AL+LFK M + G P Sbjct: 285 GDGDKALKLFKYMHQEGVSP 304 >ref|XP_006415018.1| hypothetical protein EUTSA_v10010025mg [Eutrema salsugineum] gi|557092789|gb|ESQ33371.1| hypothetical protein EUTSA_v10010025mg [Eutrema salsugineum] Length = 589 Score = 206 bits (524), Expect = 6e-51 Identities = 108/204 (52%), Positives = 140/204 (68%), Gaps = 9/204 (4%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 Y+E L++RC F IKQLQ+H +T G F R++LLD A S G S+A +F +IP Sbjct: 5 
YMETLIQRCVTFSQIKQLQSHFLTAGHFQSSFLRSRLLDRCAVSPFGDLSFAVQIFRQIP 64 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMICAS---------CKPDALTCSFVLKACSR 438 P TNDWNAIIRG A S QP A+ +Y SM+ + C+ DALTCSF LKAC+R Sbjct: 65 KPLTNDWNAIIRGFAASSQPSIALTWYRSMLFQASSSSSSSSLCRIDALTCSFTLKACAR 124 Query: 439 ALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSL 618 AL+ T Q+H+Q+ + G+ +D LL TT+LDAY+K GDL SA LFDEM +RD+ASWN+L Sbjct: 125 ALSSSFTPQLHAQINRRGLFADALLCTTMLDAYSKNGDLISARKLFDEMPVRDVASWNAL 184 Query: 619 ISGLAQGNQPEEALELFKRMKENG 690 I+GLA GN+ EALEL+KRM+ G Sbjct: 185 IAGLAFGNRAHEALELYKRMESEG 208 Score = 89.0 bits (219), Expect = 1e-15 Identities = 61/221 (27%), Positives = 102/221 (46%), Gaps = 1/221 (0%) Frame = +1 Query: 40 TQSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLL 219 + S SS++ I C F R + QL A + GLF L +L Sbjct: 99 SSSSSSSSLCRIDALTCSFTLK----ACARALSSSFTPQLHAQINRRGLFADALLCTTML 154 Query: 220 DFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPD 399 D A S G A+ +FD +P WNA+I GLA ++ A+ Y M + + Sbjct: 155 D--AYSKNGDLISARKLFDEMPVRDVASWNALIAGLAFGNRAHEALELYKRMESEGIRRN 212 Query: 400 ALTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFD 579 +T L ACS A E ++H V + ++++ +D YAKCG + A +FD Sbjct: 213 EITVVAALGACSHLGAVKEGEKIHGYVKDSNLDQNVIVCNATIDMYAKCGFVEKAFQVFD 272 Query: 580 EMT-MRDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 ++ + + +WN++I G A + A+E+F+++++N P Sbjct: 273 QIRGEKSVVTWNTMIMGFAVHGEANRAIEIFEKLEDNSIKP 313 >ref|XP_006434562.1| hypothetical protein CICLE_v10003512mg [Citrus clementina] gi|557536684|gb|ESR47802.1| hypothetical protein CICLE_v10003512mg [Citrus clementina] Length = 582 Score = 204 bits (519), Expect = 2e-50 Identities = 109/203 (53%), Positives = 143/203 (70%), Gaps = 7/203 (3%) Frame = +1 Query: 103 AYVEHLLRRCTA-----FPHIKQLQAHLITTGLFNYQLS--RAKLLDFSATSSAGSFSYA 261 A + LL++C++ HIKQLQAHL TTG F +L R+K+++F A S +YA Sbjct: 2 ANLNALLQKCSSNVPVSHIHIKQLQAHLTTTGQFQSKLCPVRSKIIEFYALSPLNELAYA 61 Query: 262 KFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRA 441 
+F +I P+TND+NAI+RGLA S +P NAV++Y M+ S + DALTCSF LKAC+R Sbjct: 62 HALFRQINAPSTNDFNAILRGLAHSSKPTNAVLWYRQMLRGSHRSDALTCSFALKACARV 121 Query: 442 LARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLI 621 LA ET Q+HS V++ G +D LL TTLLD YAK G++ SA +FDEM +RDIASWN+LI Sbjct: 122 LALFETLQIHSHVLRHGFLADALLGTTLLDVYAKVGEIVSAKKVFDEMGVRDIASWNALI 181 Query: 622 SGLAQGNQPEEALELFKRMKENG 690 +GLAQGN EA++LFKRMK G Sbjct: 182 AGLAQGNLASEAVDLFKRMKMEG 204 Score = 80.9 bits (198), Expect = 4e-13 Identities = 63/218 (28%), Positives = 97/218 (44%), Gaps = 2/218 (0%) Frame = +1 Query: 52 SSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSA 231 S + +L AC + A E L Q+ +H++ G L LLD A Sbjct: 106 SDALTCSFALKACARVLALFETL-----------QIHSHVLRHGFLADALLGTTLLDVYA 154 Query: 232 TSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASC-KPDALT 408 G AK VFD + WNA+I GLAQ + AV + M KP+ +T Sbjct: 155 --KVGEIVSAKKVFDEMGVRDIASWNALIAGLAQGNLASEAVDLFKRMKMEGVFKPNEVT 212 Query: 409 CSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMT 588 L AC A E ++H + + + ++++ ++D YAKCG L+ A +FD + Sbjct: 213 VLGALAACGHLGAWKEGDKIHEYIREERLDMNVVVCNAVIDMYAKCGLLDKAFEVFDNIK 272 Query: 589 MR-DIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 R + +WN+++ A ALELF++M G P Sbjct: 273 CRKSLVTWNTMVMAFAVHGDGPRALELFEQMGRAGVKP 310 >gb|EXC75282.1| hypothetical protein L484_000391 [Morus notabilis] Length = 581 Score = 203 bits (517), Expect = 4e-50 Identities = 104/195 (53%), Positives = 142/195 (72%), Gaps = 2/195 (1%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTG-LFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRI 282 Y++ LL +C +F IKQL +HL+T+G L + + AKLLD + S + SYA +F R+ Sbjct: 7 YLDILLHKCRSFCQIKQLHSHLLTSGQLHSSPSAAAKLLDLCSHSPSADLSYAALLFRRL 66 Query: 283 PH-PATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLET 459 P P+TN WNA++RGLA+S P A+ ++ M A K DALTCSF L+AC+RALA E Sbjct: 67 PTTPSTNAWNALVRGLARSPNPTRAISWFRDMSRAPQKADALTCSFALQACARALAGFEA 126 Query: 460 FQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLAQG 639 ++HS+VV+ GV +D+LL TTLLD YAK GDL A N+FDEM 
RDIA+WN+LI+GLAQG Sbjct: 127 REIHSRVVRLGVGADVLLMTTLLDVYAKVGDLECARNVFDEMPRRDIAAWNALIAGLAQG 186 Query: 640 NQPEEALELFKRMKE 684 ++P EAL+LF+R++E Sbjct: 187 SRPGEALDLFRRLRE 201 Score = 81.6 bits (200), Expect = 2e-13 Identities = 54/185 (29%), Positives = 93/185 (50%), Gaps = 2/185 (1%) Frame = +1 Query: 151 KQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLA 330 +++ + ++ G+ L LLD A G A+ VFD +P WNA+I GLA Sbjct: 127 REIHSRVVRLGVGADVLLMTTLLDVYA--KVGDLECARNVFDEMPRRDIAAWNALIAGLA 184 Query: 331 QSDQPMNAV-IFYVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVKFGVKSDI 507 Q +P A+ +F +P+ +T L ACS+ A E ++H V++ + + Sbjct: 185 QGSRPGEALDLFRRLREEEGLRPNEVTVLGGLSACSQLGAFREGEKIHDYVMEERLDMSV 244 Query: 508 LLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIASWNSLISGLAQGNQPEEALELFKRMKE 684 ++ ++D YAKCG + A +F M + + +WN++I A + +ALE+F +M+E Sbjct: 245 IVCNAVIDMYAKCGFVEKAFGVFRSMRCGKSLVTWNTMIMAFALHGEASKALEIFGQMRE 304 Query: 685 NGPLP 699 G P Sbjct: 305 AGLEP 309 >ref|XP_006473158.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Citrus sinensis] Length = 582 Score = 203 bits (517), Expect = 4e-50 Identities = 108/203 (53%), Positives = 143/203 (70%), Gaps = 7/203 (3%) Frame = +1 Query: 103 AYVEHLLRRCTA-----FPHIKQLQAHLITTGLFNYQLS--RAKLLDFSATSSAGSFSYA 261 A + LL++C++ HIKQLQAHL TTG F +L R+K+++F A S +YA Sbjct: 2 ANLNALLQKCSSNGAVSHIHIKQLQAHLTTTGQFQSKLFPVRSKIIEFYALSPLNELAYA 61 Query: 262 KFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRA 441 +F +I P+TND+NA++RGLA S +P NAV++Y M+ S + DALTCSF LKAC+R Sbjct: 62 HALFRQINAPSTNDFNAVLRGLAHSSKPTNAVLWYRQMLRGSHRSDALTCSFALKACARV 121 Query: 442 LARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLI 621 LA ET Q+HS V++ G +D LL TTLLD YAK G++ SA +FDEM +RDIASWN+LI Sbjct: 122 LALFETLQIHSHVLRHGFLADALLGTTLLDVYAKVGEIVSAKKVFDEMGVRDIASWNALI 181 Query: 622 SGLAQGNQPEEALELFKRMKENG 690 +GLAQGN EA++LFKRMK G Sbjct: 182 AGLAQGNLASEAVDLFKRMKMEG 204 Score = 80.9 bits (198), Expect = 4e-13 Identities = 63/218 (28%), Positives = 97/218 (44%), Gaps = 2/218 (0%) Frame 
= +1 Query: 52 SSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSA 231 S + +L AC + A E L Q+ +H++ G L LLD A Sbjct: 106 SDALTCSFALKACARVLALFETL-----------QIHSHVLRHGFLADALLGTTLLDVYA 154 Query: 232 TSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASC-KPDALT 408 G AK VFD + WNA+I GLAQ + AV + M KP+ +T Sbjct: 155 --KVGEIVSAKKVFDEMGVRDIASWNALIAGLAQGNLASEAVDLFKRMKMEGVFKPNEVT 212 Query: 409 CSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMT 588 L AC A E ++H + + + ++++ ++D YAKCG L+ A +FD + Sbjct: 213 VLGALAACGHLGAWKEGDKIHDYIREERLDMNVVVCNAVIDMYAKCGLLDKAFEVFDNIK 272 Query: 589 MR-DIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 R + +WN+++ A ALELF++M G P Sbjct: 273 CRKSLVTWNTMVMAFAVHGDGPRALELFEQMGRAGVKP 310 >ref|XP_002891080.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297336922|gb|EFH67339.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 562 Score = 199 bits (506), Expect = 7e-49 Identities = 104/200 (52%), Positives = 132/200 (66%), Gaps = 5/200 (2%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 Y+E +++ C F IKQLQ+H +T G F R++LL+ A S G S+A +F IP Sbjct: 5 YMETMIQNCVTFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVKIFRHIP 64 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMI-----CASCKPDALTCSFVLKACSRALAR 450 P TNDWNAIIRG A S P A +Y SM+ A C+ DALTCSF LKAC+RAL Sbjct: 65 KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQRSSSSALCRVDALTCSFTLKACARALCS 124 Query: 451 LETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGL 630 Q+H Q+ + G +D LL TTLLDAY+K GDL SA LFDEM++RD+ASWN+LI+GL Sbjct: 125 SAMVQIHCQISRRGFSADALLCTTLLDAYSKNGDLISALKLFDEMSVRDVASWNALIAGL 184 Query: 631 AQGNQPEEALELFKRMKENG 690 GN+ EALEL+KRM+ G Sbjct: 185 VAGNRASEALELYKRMEMEG 204 Score = 71.6 bits (174), Expect = 2e-10 Identities = 54/221 (24%), Positives = 98/221 (44%), Gaps = 2/221 (0%) Frame = +1 Query: 43 QSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLD 222 Q SS+ + C F L C++ + Q+ + G L LLD Sbjct: 96 
QRSSSSALCRVDALTCSFTLKACARAL--CSSA--MVQIHCQISRRGFSADALLCTTLLD 151 Query: 223 FSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDA 402 A S G A +FD + WNA+I GL ++ A+ Y M + Sbjct: 152 --AYSKNGDLISALKLFDEMSVRDVASWNALIAGLVAGNRASEALELYKRMEMEGIRRSE 209 Query: 403 LTCSFVLKACSRALARLETFQ-MHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFD 579 +T L ACS E + +H + + ++++ ++D Y+KCG ++ A +F+ Sbjct: 210 VTVVAALGACSHLGDVKEGEKILHGYIKDEKLDHNVIVSNAVIDMYSKCGFVDKAFQVFE 269 Query: 580 EMT-MRDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 + T + + +WN++I+G + + ALE+F++++ NG P Sbjct: 270 QFTGKKSVVTWNTMITGFSVHGEAHRALEIFEKLEHNGIKP 310 >gb|AAG12522.1|AC015446_3 Hypothetical Protein [Arabidopsis thaliana] Length = 539 Score = 198 bits (503), Expect = 2e-48 Identities = 103/201 (51%), Positives = 134/201 (66%), Gaps = 6/201 (2%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 Y+E ++++C +F IKQLQ+H +T G F R++LL+ A S G S+A +F IP Sbjct: 5 YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMI------CASCKPDALTCSFVLKACSRALA 447 P TNDWNAIIRG A S P A +Y SM+ A C+ DALTCSF LKAC+RAL Sbjct: 65 KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124 Query: 448 RLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISG 627 Q+H Q+ + G+ +D LL TTLLDAY+K GDL SA LFDEM +RD+ASWN+LI+G Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184 Query: 628 LAQGNQPEEALELFKRMKENG 690 L GN+ EA+EL+KRM+ G Sbjct: 185 LVSGNRASEAMELYKRMETEG 205 Score = 78.2 bits (191), Expect = 2e-12 Identities = 58/220 (26%), Positives = 97/220 (44%), Gaps = 1/220 (0%) Frame = +1 Query: 43 QSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLD 222 QS SS+ + C F L C++ + QL + GL L LLD Sbjct: 97 QSSSSSAICRVDALTCSFTLKACARAL--CSSA--MDQLHCQINRRGLSADSLLCTTLLD 152 Query: 223 FSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDA 402 A S G A +FD +P WNA+I GL ++ A+ Y M + Sbjct: 153 --AYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSE 210 
Query: 403 LTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDE 582 +T L ACS L + + ++++ +D Y+KCG ++ A +F++ Sbjct: 211 VTVVAALGACSH----LGDVKEGENIFHGYSNDNVIVSNAAIDMYSKCGFVDKAYQVFEQ 266 Query: 583 MT-MRDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 T + + +WN++I+G A + ALE+F ++++NG P Sbjct: 267 FTGKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKP 306 >ref|NP_174678.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|193806500|sp|Q9FX24.2|PPR71_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g34160 gi|332193557|gb|AEE31678.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 581 Score = 198 bits (503), Expect = 2e-48 Identities = 103/201 (51%), Positives = 134/201 (66%), Gaps = 6/201 (2%) Frame = +1 Query: 106 YVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIP 285 Y+E ++++C +F IKQLQ+H +T G F R++LL+ A S G S+A +F IP Sbjct: 5 YMETMIQKCVSFSQIKQLQSHFLTAGHFQSSFLRSRLLERCAISPFGDLSFAVQIFRYIP 64 Query: 286 HPATNDWNAIIRGLAQSDQPMNAVIFYVSMI------CASCKPDALTCSFVLKACSRALA 447 P TNDWNAIIRG A S P A +Y SM+ A C+ DALTCSF LKAC+RAL Sbjct: 65 KPLTNDWNAIIRGFAGSSHPSLAFSWYRSMLQQSSSSSAICRVDALTCSFTLKACARALC 124 Query: 448 RLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISG 627 Q+H Q+ + G+ +D LL TTLLDAY+K GDL SA LFDEM +RD+ASWN+LI+G Sbjct: 125 SSAMDQLHCQINRRGLSADSLLCTTLLDAYSKNGDLISAYKLFDEMPVRDVASWNALIAG 184 Query: 628 LAQGNQPEEALELFKRMKENG 690 L GN+ EA+EL+KRM+ G Sbjct: 185 LVSGNRASEAMELYKRMETEG 205 Score = 78.2 bits (191), Expect = 2e-12 Identities = 58/220 (26%), Positives = 97/220 (44%), Gaps = 1/220 (0%) Frame = +1 Query: 43 QSPSSTTEIAISLFACCFMAAYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLSRAKLLD 222 QS SS+ + C F L C++ + QL + GL L LLD Sbjct: 97 QSSSSSAICRVDALTCSFTLKACARAL--CSSA--MDQLHCQINRRGLSADSLLCTTLLD 152 Query: 223 FSATSSAGSFSYAKFVFDRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDA 402 A S G A +FD +P WNA+I GL ++ A+ Y M + Sbjct: 153 --AYSKNGDLISAYKLFDEMPVRDVASWNALIAGLVSGNRASEAMELYKRMETEGIRRSE 210 Query: 403 
LTCSFVLKACSRALARLETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDE 582 +T L ACS L + + ++++ +D Y+KCG ++ A +F++ Sbjct: 211 VTVVAALGACSH----LGDVKEGENIFHGYSNDNVIVSNAAIDMYSKCGFVDKAYQVFEQ 266 Query: 583 MT-MRDIASWNSLISGLAQGNQPEEALELFKRMKENGPLP 699 T + + +WN++I+G A + ALE+F ++++NG P Sbjct: 267 FTGKKSVVTWNTMITGFAVHGEAHRALEIFDKLEDNGIKP 306 >ref|XP_002265412.1| PREDICTED: pentatricopeptide repeat-containing protein At1g34160-like [Vitis vinifera] Length = 573 Score = 196 bits (499), Expect = 5e-48 Identities = 100/199 (50%), Positives = 145/199 (72%), Gaps = 3/199 (1%) Frame = +1 Query: 103 AYVEHLLRRCTAFPHIKQLQAHLITTGLFNYQLS--RAKLLDFSATSSAGSF-SYAKFVF 273 A+++ ++++CT HIKQ+QAHL+TTG FN ++S R +LL+ A S + ++ YA + Sbjct: 2 AFMDSIIQKCTTLSHIKQVQAHLLTTGQFNLRISPSRTRLLEHCALSPSPAYLPYAAHIH 61 Query: 274 DRIPHPATNDWNAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARL 453 IPHP+TND+NA++RGLA+ P +A+ F +++ PDALT SF L A +RALA Sbjct: 62 RHIPHPSTNDFNALLRGLARGPHPTHALTFLSTIL----HPDALTFSFSLIASARALALS 117 Query: 454 ETFQMHSQVVKFGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTMRDIASWNSLISGLA 633 ET Q+HS +++ G +DILL TTL+DAYAKCGDL+SA +FDE+ +RD+A+WN+LI+GLA Sbjct: 118 ETSQIHSHLLRRGCHADILLGTTLIDAYAKCGDLDSAQRVFDEIPLRDVAAWNALIAGLA 177 Query: 634 QGNQPEEALELFKRMKENG 690 QG++ EAL LF RM+ G Sbjct: 178 QGSKSSEALALFNRMRAEG 196 Score = 80.5 bits (197), Expect = 5e-13 Identities = 55/185 (29%), Positives = 85/185 (45%), Gaps = 1/185 (0%) Frame = +1 Query: 127 RCTAFPHIKQLQAHLITTGLFNYQLSRAKLLDFSATSSAGSFSYAKFVFDRIPHPATNDW 306 R A Q+ +HL+ G L L+D A + G A+ VFD IP W Sbjct: 112 RALALSETSQIHSHLLRRGCHADILLGTTLID--AYAKCGDLDSAQRVFDEIPLRDVAAW 169 Query: 307 NAIIRGLAQSDQPMNAVIFYVSMICASCKPDALTCSFVLKACSRALARLETFQMHSQVVK 486 NA+I GLAQ + A+ + M K + ++ L ACS+ A +H+ V K Sbjct: 170 NALIAGLAQGSKSSEALALFNRMRAEGEKINEISVLGALAACSQLGALRAGEGVHACVRK 229 Query: 487 FGVKSDILLQTTLLDAYAKCGDLNSASNLFDEMTM-RDIASWNSLISGLAQGNQPEEALE 663 + ++ + ++D YAKCG + +F MT + + +WN++I A ALE Sbjct: 230 MDLDINVQVCNAVIDMYAKCGFADKGFRVFSTMTCGKSVVTWNTMIMAFAMHGDGCRALE 289 Query: 664 LFKRM 678 LF+ 
M Sbjct: 290 LFEEM 294