BLASTX nr result
ID: Catharanthus23_contig00022689
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00022689
         (1621 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containi...   545   e-152
ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containi...   543   e-152
ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containi...   438   e-120
gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, put...   436   e-119
ref|XP_002512275.1| pentatricopeptide repeat-containing protein,...   428   e-117
ref|XP_006381622.1| pentatricopeptide repeat-containing family p...   426   e-116
ref|XP_002326124.1| predicted protein [Populus trichocarpa]           426   e-116
ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containi...   426   e-116
ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containi...   426   e-116
ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containi...   424   e-116
gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis]     418   e-114
ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containi...   417   e-114
ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citr...   416   e-113
ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containi...   403   e-109
ref|XP_002885810.1| pentatricopeptide repeat-containing protein ...   388   e-105
dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana]                      385   e-104
ref|NP_178657.1| pentatricopeptide repeat-containing protein [Ar...   385   e-104
ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Caps...   382   e-103
ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containi...
375 e-101 ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutr... 370 e-100 >ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Solanum tuberosum] gi|565370447|ref|XP_006351832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Solanum tuberosum] Length = 550 Score = 545 bits (1403), Expect = e-152 Identities = 271/471 (57%), Positives = 348/471 (73%) Frame = -3 Query: 1415 MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 1236 MPLW++RA + IAR FHG+++ +S P E++WF K VC LC ++++L +F Sbjct: 1 MPLWVQRASNILL---IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56 Query: 1235 GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 1056 GSDYFR+NL P IAF VI HIN N PRLAF F Q TR+NLNL+H I +FNLLLRSL Q Sbjct: 57 GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLVHCIGSFNLLLRSLSQ 116 Query: 1055 MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 876 MG D A LV ++M ADG L+ +LE +V ++A+AGKF I +EILISQA+L + I+ Sbjct: 117 MGFHDSAMLVFKFMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGRIV 176 Query: 875 NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 696 FVHN LSLL+K++RVDEA FF+ H+L+ DTC+FN VI GLC+ G +D+AFE Sbjct: 177 RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236 Query: 695 VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 516 FNDMGSFGCF D TYN++INGLC +G +RA LL ++ Q GLSPDV TYT++I+G+ Sbjct: 237 FFNDMGSFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYTSVIAGY 296 Query: 515 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 336 KLGR EA++L D+MT GI PN+ TFN+LI+GFG+ GD+ SA++++ M G PDV Sbjct: 297 CKLGRMDEAINLMDEMTTYGISPNLVTFNILINGFGKIGDMFSAIQMYGRMCAVGYPPDV 356 Query: 335 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 156 +TFT+L+ GYC+ GE+DQGLKLWDEMN R L PN YTFSI+I++L K NRLNEAR+LL Q Sbjct: 357 VTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKENRLNEARELLRQ 416 Query: 155 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 L+ R+DI+PQ 
F+YNPV+DGFCKAGNL AN I AEME + C DK TFTI Sbjct: 417 LKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITFTI 467 Score = 97.8 bits (242), Expect = 1e-17 Identities = 70/294 (23%), Positives = 135/294 (45%), Gaps = 2/294 (0%) Frame = -3 Query: 1145 AFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNA-DGLSLDGPLLEFL 969 AFEFF + T+N L+ LC +G ++ A ++ + DGLS D + Sbjct: 234 AFEFFNDMG-SFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYTSV 292 Query: 968 VSSVAHAGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDH- 792 ++ G+ ++ + + + + N F ++ + F + + Sbjct: 293 IAGYCKLGRMDEAINLMDEMTTYGISPNLVTFNILINGF-------GKIGDMFSAIQMYG 345 Query: 791 LLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLG 612 + + D +F +IDG C+ G++D+ +++++M + ++ T++ +I+ L K Sbjct: 346 RMCAVGYPPDVVTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKEN 405 Query: 611 DADRALELLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTF 432 + A ELLR+++S+ + P Y ++ GF K G +A + +M RG + TF Sbjct: 406 RLNEARELLRQLKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITF 465 Query: 431 NVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKL 270 +LI G G + A+ IF+ M GC PD IT + L S + G + + K+ Sbjct: 466 TILILGHCMKGRMLEAMAIFDKMLSLGCVPDDITVSCLTSCLLKAGMVKEAYKV 519 >ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Solanum lycopersicum] Length = 550 Score = 543 bits (1399), Expect = e-152 Identities = 274/471 (58%), Positives = 348/471 (73%) Frame = -3 Query: 1415 MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 1236 MPLW++RA NI+ IAR FHG+++ +S P E++WF K VC LC ++++L +F Sbjct: 1 MPLWVQRAS-NISL--IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56 Query: 1235 GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 1056 GSDYFR+NL P IAF VI HIN N PRLAF F Q TR+NLNLIH I +FNLLLRSL Q Sbjct: 57 GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLIHCIGSFNLLLRSLSQ 116 Query: 1055 MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 876 MG D A LV +YM ADG L+ +LE +V ++A+AGKF I +EILISQA+L + I+ Sbjct: 117 
MGFHDSAMLVFKYMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGSIV 176 Query: 875 NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 696 FVHN LSLL+K++RVDEA FF+ H+L+ DTC+FN VI GLC+ G +D+AFE Sbjct: 177 RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236 Query: 695 VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 516 FNDMGSFGC D TYN++INGLC +G +RA LL +Q Q GLSPDV TYT++ISG+ Sbjct: 237 FFNDMGSFGCSPDTVTYNTLINGLCAVGQVNRAQGLLGNLQLQDGLSPDVVTYTSLISGY 296 Query: 515 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 336 KL R EA++L D+M GI PN+ TFN+LI+GFG+ GD+ SA+K++ M G PDV Sbjct: 297 CKLSRMDEAINLMDEMITYGISPNLVTFNILINGFGKIGDMFSAIKMYGKMCAVGYPPDV 356 Query: 335 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 156 +TFT+L+ GYC+ GE+DQGLKLWD+MN+R L PN YTFS++I++L K NRLNEAR+LL Q Sbjct: 357 VTFTSLIDGYCRTGELDQGLKLWDDMNSRNLSPNLYTFSVLISALSKENRLNEARELLRQ 416 Query: 155 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 L+ R+DI+PQ F+YNPV+DGFCKAGNL AN I AEME K C DK TFTI Sbjct: 417 LKSRDDIVPQPFVYNPVLDGFCKAGNLSEANVIAAEMESKGCCHDKITFTI 467 Score = 104 bits (259), Expect = 1e-19 Identities = 86/332 (25%), Positives = 148/332 (44%), Gaps = 40/332 (12%) Frame = -3 Query: 1145 AFEFFQFTRLNLN-LIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFL 969 A +FF+ L L TFN ++R LC++G +D A M + G S D L Sbjct: 197 AVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFEFFNDMGSFGCSPDTVTYNTL 256 Query: 968 VSSVAHAGKFSIGREILISQAQLCLGKEEIINSF-----VHNKFLSLLVKQNRVDEAFIF 804 ++ + G+ +++AQ LG ++ + + +S K +R+DEA I Sbjct: 257 INGLCAVGQ--------VNRAQGLLGNLQLQDGLSPDVVTYTSLISGYCKLSRMDEA-IN 307 Query: 803 FRDHLL------KLRSFCL----------------------------DTCSFNIVIDGLC 726 D ++ L +F + D +F +IDG C Sbjct: 308 LMDEMITYGISPNLVTFNILINGFGKIGDMFSAIKMYGKMCAVGYPPDVVTFTSLIDGYC 367 Query: 725 KAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDV 546 + G++D+ ++++DM S ++ T++ +I+ L K + A ELLR+++S+ + P Sbjct: 368 RTGELDQGLKLWDDMNSRNLSPNLYTFSVLISALSKENRLNEARELLRQLKSRDDIVPQP 427 Query: 545 
KTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFES 366 Y ++ GF K G EA + +M +G + TF +LI G G + AL IF+ Sbjct: 428 FVYNPVLDGFCKAGNLSEANVIAAEMESKGCCHDKITFTILILGHCMKGRMLEALAIFDK 487 Query: 365 MSKFGCAPDVITFTNLLSGYCQIGEIDQGLKL 270 M GC PD IT + L S + G + + K+ Sbjct: 488 MLSLGCVPDDITISCLTSCLLKAGMVKEAYKV 519 >ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Vitis vinifera] Length = 641 Score = 438 bits (1127), Expect = e-120 Identities = 222/431 (51%), Positives = 301/431 (69%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 W +K +CTLC+R A DYF K L P IAF V++ +NN P LA +FFQ +R+ Sbjct: 76 WIVKVICTLCVRTHSLDACL--DYFSKTLTPSIAFEVVRGLNN----PELALKFFQLSRV 129 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 NLNL HS T++ LLRSL +MG + A V + MN DG S D +L FLVSS AGKF+ Sbjct: 130 NLNLCHSFRTYSFLLRSLSEMGFHESAKAVYDCMNIDGHSPDASVLGFLVSSATDAGKFN 189 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 I R + + L V+NK L+ LV+ N+VDEA FFR+ + F D+C Sbjct: 190 IART-WVDGVEFSL--------VVYNKLLNQLVRGNQVDEAVCFFREQMGLHGPF--DSC 238 Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 SFNI+I GLC+ G++D+AFE+FN+M FGC D+ TYN++ING C++ + DR +LL+E+ Sbjct: 239 SFNILIRGLCRIGKVDKAFELFNEMRGFGCSPDVITYNTLINGFCRVNEVDRGHDLLKEL 298 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 S+ LSPDV TYT+IISG+ KLG+ +A L+++M GI+PN +TFN+LI+GFG+ GD Sbjct: 299 LSKNDLSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAFTFNILINGFGKVGD 358 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SA ++E M GC PD+ITFT+L+ G+C+ G++++ LKLW E+NAR L PN YTF+I Sbjct: 359 MVSAENMYEEMLLLGCPPDIITFTSLIDGHCRTGKVERSLKLWHELNARNLSPNEYTFAI 418 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 + N+LCK NRL+EAR L L+WR+ I+ Q F+YNPVIDGFCKAGN+D AN I+AEMEEK Sbjct: 419 LTNALCKENRLHEARGFLRDLKWRH-IVAQPFMYNPVIDGFCKAGNVDEANVILAEMEEK 477 Query: 35 
KCNPDKYTFTI 3 +C PDK T+TI Sbjct: 478 RCKPDKITYTI 488 Score = 109 bits (272), Expect = 4e-21 Identities = 84/312 (26%), Positives = 143/312 (45%), Gaps = 39/312 (12%) Frame = -3 Query: 1088 TFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQ 909 +FN+L+R LC++G +D A + M G S D L++ + G ++L Sbjct: 239 SFNILIRGLCRIGKVDKAFELFNEMRGFGCSPDVITYNTLINGFCRVNEVDRGHDLL--- 295 Query: 908 AQLCLGKEEIINSFV-HNKFLSLLVKQNRVDEAFIFFRDHL---LKLRSFCLDTCSFNIV 741 + L K ++ V + +S K ++++A I F + + +K +F +FNI+ Sbjct: 296 -KELLSKNDLSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAF-----TFNIL 349 Query: 740 IDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGG 561 I+G K G + A ++ +M GC DI T+ S+I+G C+ G +R+L+L E+ ++ Sbjct: 350 INGFGKVGDMVSAENMYEEMLLLGCPPDIITFTSLIDGHCRTGKVERSLKLWHELNARN- 408 Query: 560 LSPDVKT-----------------------------------YTTIISGFFKLGRNHEAL 486 LSP+ T Y +I GF K G EA Sbjct: 409 LSPNEYTFAILTNALCKENRLHEARGFLRDLKWRHIVAQPFMYNPVIDGFCKAGNVDEAN 468 Query: 485 HLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGY 306 + +M + +P+ T+ +LI G G L A+ IF M GCAPD IT T+L+S Sbjct: 469 VILAEMEEKRCKPDKITYTILIIGHCMKGRLSEAISIFNRMLGTGCAPDSITMTSLISCL 528 Query: 305 CQIGEIDQGLKL 270 + G ++ ++ Sbjct: 529 LKAGMPNEAYRI 540 Score = 73.6 bits (179), Expect = 2e-10 Identities = 61/254 (24%), Positives = 113/254 (44%), Gaps = 3/254 (1%) Frame = -3 Query: 1109 NLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIG 930 +L + T+ ++ C++G ++ A+++ M + G+ + L++ G + Sbjct: 303 DLSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAFTFNILINGFGKVGDM-VS 361 Query: 929 REILISQAQLCLGKEEIINSFVHNKFLSLL---VKQNRVDEAFIFFRDHLLKLRSFCLDT 759 E + + L +II F SL+ + +V+ + + H L R+ + Sbjct: 362 AENMYEEMLLLGCPPDIIT------FTSLIDGHCRTGKVERSLKLW--HELNARNLSPNE 413 Query: 758 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 579 +F I+ + LCK ++ A D+ A YN VI+G CK G+ D A +L E Sbjct: 414 YTFAILTNALCKENRLHEARGFLRDLKWRHIVAQPFMYNPVIDGFCKAGNVDEANVILAE 473 Query: 578 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 399 ++ + PD TYT 
+I G GR EA+ +++ M G P+ T LI ++G Sbjct: 474 MEEKR-CKPDKITYTILIIGHCMKGRLSEAISIFNRMLGTGCAPDSITMTSLISCLLKAG 532 Query: 398 DLGSALKIFESMSK 357 A +I + S+ Sbjct: 533 MPNEAYRIMQIASE 546 >gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 592 Score = 436 bits (1122), Expect = e-119 Identities = 220/431 (51%), Positives = 301/431 (69%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 WF+K VCTL + + + L Y KNL P I F V++ +NN P L +F +F+R+ Sbjct: 91 WFVKVVCTLFVYS-QPLDDSCLSYLSKNLTPLIEFEVVKWLNN----PALGLKFLEFSRV 145 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 N N+ HS T+NLL+RS C MGL D A LV +YM DG D +L F++SS AG+F Sbjct: 146 NFNIAHSFWTYNLLMRSFCHMGLHDSAKLVFDYMRIDGHLPDTTILGFMISSFGRAGEFG 205 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 + +++L E +I+ F N L+++VKQN+++EA ++++L +F D Sbjct: 206 MAKKLLADVQS----DEVVISIFALNNLLNMMVKQNKLEEAVSLYKENLGS--NFYPDAW 259 Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 +FNI+I GLC+ G++D+AFE+FNDMGSFGCF DI TYN++INGLCK+ + DR +LL ++ Sbjct: 260 TFNILIRGLCRVGKVDQAFELFNDMGSFGCFPDIVTYNTIINGLCKVNEVDRGHKLLNQV 319 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 QS+ SPDV TYT++ISG+ KLG+ EA L+ +M G P V TFNVLI GFG+ GD Sbjct: 320 QSRDDCSPDVVTYTSVISGYCKLGKMDEASALFHEMISSGTVPTVVTFNVLIDGFGKVGD 379 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SA ++E M+ FGC DV+TFT+L+ GYC+IG+++Q L+LW+ M R L PN YTF+I Sbjct: 380 MVSAKSMYEQMASFGCIADVVTFTSLIDGYCRIGDVNQSLQLWNTMKGRDLSPNVYTFAI 439 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 IN+LCK NRL+EAR L +L+ RN I+P+ FI+NPVIDGFCKAGNLD AN IVAEMEEK Sbjct: 440 TINALCKENRLHEARGFLRELQCRN-IVPKPFIFNPVIDGFCKAGNLDEANLIVAEMEEK 498 Query: 35 KCNPDKYTFTI 3 +C+PDK TFTI Sbjct: 499 QCHPDKVTFTI 509 >ref|XP_002512275.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] 
gi|223548236|gb|EEF49727.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 532 Score = 428 bits (1100), Expect = e-117 Identities = 217/436 (49%), Positives = 302/436 (69%) Frame = -3 Query: 1310 KIESLWFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFF 1131 K + WF+K + L +R+ + A K P +AF VI+ +NNN P++ +F Sbjct: 24 KNQEAWFVKVIAILFVRSHCSDATSLGYLSEKLNDPLVAFEVIKRLNNN---PQVGLKFM 80 Query: 1130 QFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAH 951 +F RLN +LIH +T+ LL+RSLCQMGL DL +V+ YM +DG +D +L FLV+S A Sbjct: 81 EFCRLNFSLIHCFSTYELLIRSLCQMGLHDLVEMVIGYMRSDGHLIDSRVLGFLVTSFAQ 140 Query: 950 AGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSF 771 AGKF + ++++I G+E I+SFV+N L+ LVK +V EA F+++L Sbjct: 141 AGKFDLAKKLIIEVQ----GEEARISSFVYNYLLNELVKGGKVHEAIFLFKENLAFHSP- 195 Query: 770 CLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALE 591 +T +FNI+I GLC+ G++++ FE+FN M SFGC D+ TYN++I+GLCK + DRA + Sbjct: 196 -PNTWTFNILIRGLCRVGEVEKGFELFNAMQSFGCLPDVVTYNTLISGLCKANELDRACD 254 Query: 590 LLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGF 411 LL+E+QS+ SPDV TYT+IISGF KLG+ A L+++M GI P V TFNVLI GF Sbjct: 255 LLKEVQSRNDCSPDVMTYTSIISGFRKLGKLEAASVLFEEMIRSGIEPTVVTFNVLIDGF 314 Query: 410 GQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNA 231 G+ G++ +A + E M+ + C PDV+TFT+L+ GYC+ G+I GLK+WD M AR + PN Sbjct: 315 GKIGNMVAAEAMHEKMASYSCIPDVVTFTSLIDGYCRTGDIRLGLKVWDVMKARNVSPNI 374 Query: 230 YTFSIVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVA 51 YT+S++IN+LCK NR++EARDLL QL+ +D+ P+ FIYNPVIDGFCKAGN+D AN IV Sbjct: 375 YTYSVIINALCKDNRIHEARDLLRQLKC-SDVFPKPFIYNPVIDGFCKAGNVDEANVIVT 433 Query: 50 EMEEKKCNPDKYTFTI 3 EMEEK+C PDK TFTI Sbjct: 434 EMEEKRCRPDKVTFTI 449 Score = 67.0 bits (162), Expect = 2e-08 Identities = 67/294 (22%), Positives = 124/294 (42%), Gaps = 32/294 (10%) Frame = -3 Query: 1142 FEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLDLA-NLVVEYMNADGLSLDGPLLEFLV 966 FE F + + + + T+N L+ LC+ LD A +L+ E + + S D ++ Sbjct: 218 
FELFNAMQ-SFGCLPDVVTYNTLISGLCKANELDRACDLLKEVQSRNDCSPDVMTYTSII 276 Query: 965 SSVAHAGKFSIGREILISQAQLCLGKEEIINSF------------------VHNK----- 855 S GK ++ + + G E + +F +H K Sbjct: 277 SGFRKLGKLEAAS--VLFEEMIRSGIEPTVVTFNVLIDGFGKIGNMVAAEAMHEKMASYS 334 Query: 854 -------FLSLLVKQNRVDEAFIFFRD-HLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAF 699 F SL+ R + + + ++K R+ + +++++I+ LCK +I A Sbjct: 335 CIPDVVTFTSLIDGYCRTGDIRLGLKVWDVMKARNVSPNIYTYSVIINALCKDNRIHEAR 394 Query: 698 EVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISG 519 ++ + F YN VI+G CK G+ D A ++ E++ + PD T+T +I G Sbjct: 395 DLLRQLKCSDVFPKPFIYNPVIDGFCKAGNVDEANVIVTEMEEKR-CRPDKVTFTILIIG 453 Query: 518 FFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSK 357 GR EAL ++ M G P+ T + L+ ++G A I ++ S+ Sbjct: 454 HCMKGRMVEALDIFKKMLAIGCAPDNITISSLVACLLKAGKPSEAFHIVQTASE 507 >ref|XP_006381622.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550336330|gb|ERP59419.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 511 Score = 426 bits (1095), Expect = e-116 Identities = 213/408 (52%), Positives = 290/408 (71%) Frame = -3 Query: 1226 YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 1047 Y + L P IAF VI+ NN P++ F+F +F+RLNLN+ H +T+NLL+RSLCQMG Sbjct: 33 YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88 Query: 1046 LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 867 DL N+V +YM +DG D LL FLV+ +A A F + +++L ++ Q GKE INSF Sbjct: 89 HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144 Query: 866 VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 687 V+N LS+LVKQN+V EA F+++L DT +FNI+I GLC+ G +DRAFEVF Sbjct: 145 VYNNLLSVLVKQNQVHEAIYLFKEYLAMQSP---DTWTFNILIRGLCRVGGVDRAFEVFK 201 Query: 686 DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 507 DM SFGC D+ TYN++INGLCK + R EL +EIQS+ SPD+ TYT+IISGF K Sbjct: 202 DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 261 Query: 506 
GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 327 G+ EA +L+++M GI+PNV TFNVLI GFG+ G++ A ++ M+ F C+ DV+TF Sbjct: 262 GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 321 Query: 326 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 147 T+L+ GYC+ G+++ GLK W+ M R + P YT++++IN+LCK NRLNEARD L Q++ Sbjct: 322 TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 380 Query: 146 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 + I+P+ F+YNPVIDGFCKAGN+D N I+ EMEEK+C+PDK TFTI Sbjct: 381 NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTI 428 Score = 66.6 bits (161), Expect = 3e-08 Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 6/271 (2%) Frame = -3 Query: 1169 NNFCKP---RLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGL 999 N CK + E F+ + + I T+ ++ C+ G + A+ + E M G+ Sbjct: 220 NGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKSGKMKEASNLFEEMMRSGI 279 Query: 998 SLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLL---VKQN 828 + L+ G + + A + + F SL+ + Sbjct: 280 QPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVT-------FTSLIDGYCRAG 332 Query: 827 RVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITT 648 +V+ F+ +++K R+ ++ ++I+ LCK +++ A + + + Sbjct: 333 QVNHGLKFW--NVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIKNSSIIPKPFM 390 Query: 647 YNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDM 468 YN VI+G CK G+ D +L+E++ + PD T+T +I G GR EA+++++ M Sbjct: 391 YNPVIDGFCKAGNVDEGNVILKEMEEKR-CDPDKVTFTILIIGHCVKGRMFEAINIFNRM 449 Query: 467 THRGIRPNVYTFNVLIHGFGQSGDLGSALKI 375 P+ T N LI ++G A +I Sbjct: 450 LATRCAPDNITVNSLISCLLKAGMPNEAYRI 480 >ref|XP_002326124.1| predicted protein [Populus trichocarpa] Length = 512 Score = 426 bits (1095), Expect = e-116 Identities = 213/408 (52%), Positives = 291/408 (71%) Frame = -3 Query: 1226 YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 1047 Y + L P IAF VI+ NN P++ F+F +F+RLNLN+ H +T+NLL+RSLCQMG Sbjct: 33 YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88 Query: 1046 
LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 867 DL N+V +YM +DG D LL FLV+ +A A F + +++L ++ Q GKE INSF Sbjct: 89 HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144 Query: 866 VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 687 V+N LS+LVKQN+V EA F+++L+ DT +FNI+I GLC+ G +DRAFEVF Sbjct: 145 VYNNLLSVLVKQNQVHEAIYLFKEYLVMQSP--PDTWTFNILIRGLCRVGGVDRAFEVFK 202 Query: 686 DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 507 DM SFGC D+ TYN++INGLCK + R EL +EIQS+ SPD+ TYT+IISGF K Sbjct: 203 DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 262 Query: 506 GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 327 G+ EA +L+++M GI+PNV TFNVLI GFG+ G++ A ++ M+ F C+ DV+TF Sbjct: 263 GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 322 Query: 326 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 147 T+L+ GYC+ G+++ GLK W+ M R + P YT++++IN+LCK NRLNEARD L Q++ Sbjct: 323 TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 381 Query: 146 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 + I+P+ F+YNPVIDGFCKAGN+D N I+ EMEEK+C+PDK TFTI Sbjct: 382 NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTI 429 Score = 66.6 bits (161), Expect = 3e-08 Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 6/271 (2%) Frame = -3 Query: 1169 NNFCKP---RLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGL 999 N CK + E F+ + + I T+ ++ C+ G + A+ + E M G+ Sbjct: 221 NGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKSGKMKEASNLFEEMMRSGI 280 Query: 998 SLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLL---VKQN 828 + L+ G + + A + + F SL+ + Sbjct: 281 QPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVT-------FTSLIDGYCRAG 333 Query: 827 RVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITT 648 +V+ F+ +++K R+ ++ ++I+ LCK +++ A + + + Sbjct: 334 QVNHGLKFW--NVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIKNSSIIPKPFM 391 Query: 647 YNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDM 468 YN VI+G CK G+ D +L+E++ + PD T+T +I G GR 
EA+++++ M Sbjct: 392 YNPVIDGFCKAGNVDEGNVILKEMEEKR-CDPDKVTFTILIIGHCVKGRMFEAINIFNRM 450 Query: 467 THRGIRPNVYTFNVLIHGFGQSGDLGSALKI 375 P+ T N LI ++G A +I Sbjct: 451 LATRCAPDNITVNSLISCLLKAGMPNEAYRI 481 >ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Cucumis sativus] Length = 548 Score = 426 bits (1094), Expect = e-116 Identities = 217/431 (50%), Positives = 298/431 (69%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 W +K VCTL R+ A FG Y +NL P IAF VI+ F P L +FF+F+R Sbjct: 48 WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 +L++ H+ T++LL+R+LC++GL D A +V + M +DG+ D +LE LVSS A GK Sbjct: 102 HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 + L C G + ++ FV+N L++LVKQN VDEA + FR+HL F D Sbjct: 162 SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215 Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ + + +LL+E Sbjct: 216 SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 G+SPDV TYT+IISG+ KLG A L+D+M GI+PN +TFNVLI GFG+ G+ Sbjct: 276 MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SA+ ++E M GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM R L PN YT+++ Sbjct: 336 MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 +IN+LCK NR+ EAR+ L L+ ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK Sbjct: 396 LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454 Query: 35 KCNPDKYTFTI 3 KC PDK TFTI Sbjct: 455 KCRPDKITFTI 465 >ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containing 
protein At2g06000-like [Cucumis sativus] Length = 548 Score = 426 bits (1094), Expect = e-116 Identities = 217/431 (50%), Positives = 298/431 (69%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 W +K VCTL R+ A FG Y +NL P IAF VI+ F P L +FF+F+R Sbjct: 48 WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 +L++ H+ T++LL+R+LC++GL D A +V + M +DG+ D +LE LVSS A GK Sbjct: 102 HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 + L C G + ++ FV+N L++LVKQN VDEA + FR+HL F D Sbjct: 162 SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215 Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ + + +LL+E Sbjct: 216 SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 G+SPDV TYT+IISG+ KLG A L+D+M GI+PN +TFNVLI GFG+ G+ Sbjct: 276 MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SA+ ++E M GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM R L PN YT+++ Sbjct: 336 MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 +IN+LCK NR+ EAR+ L L+ ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK Sbjct: 396 LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454 Query: 35 KCNPDKYTFTI 3 KC PDK TFTI Sbjct: 455 KCRPDKITFTI 465 >ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Fragaria vesca subsp. 
vesca] Length = 583 Score = 424 bits (1090), Expect = e-116 Identities = 217/432 (50%), Positives = 301/432 (69%), Gaps = 1/432 (0%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 WF+K V TL +R+ + G Y KNL P +AF VI+ +NN P+L FF+ ++ Sbjct: 84 WFVKVVYTLFLRSHSLDSYVG--YLSKNLTPSLAFEVIKRLNN----PKLGLRFFELSKF 137 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 +LN+ H + T++ LLRSLCQMGL D A LV +YM DGLS + +LEFLVSS A G+ Sbjct: 138 SLNVNHGVWTYHYLLRSLCQMGLQDSAKLVFDYMRTDGLSPNESVLEFLVSSCAQMGRSD 197 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCL-DT 759 + +IL +G ++SFV+N ++LVK NRVDEA FR ++ S+C D+ Sbjct: 198 LAEKILDEVHCSVVG----LSSFVYNNLFNVLVKLNRVDEAVCLFRKYV---GSYCCPDS 250 Query: 758 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 579 +FNI+I GLC+ G +D+ E F+DM SFGC ++ TYN++I+GLC+ + DR +LLRE Sbjct: 251 WTFNILIRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCDLLRE 310 Query: 578 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 399 +Q + LSPDV T+T++ISG+ KLGR EA ++D+M G++P TFN LI G+G++G Sbjct: 311 VQFRSELSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAVTFNALIDGYGKAG 370 Query: 398 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 219 D+ SA ++ESM G DVITFT+L+ GYC+ G ++ GL+LW EMNA+ + P+AYTFS Sbjct: 371 DMSSAFSLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKNVSPSAYTFS 430 Query: 218 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 39 ++IN+LCK NRL EARDLL +L+ N ++P++F+YNPVIDG CKAGN+D AN IVAEMEE Sbjct: 431 VLINALCKGNRLCEARDLLRELKGSN-VVPKSFLYNPVIDGLCKAGNIDEANLIVAEMEE 489 Query: 38 KKCNPDKYTFTI 3 KKC PD+ TFTI Sbjct: 490 KKCTPDRVTFTI 501 Score = 112 bits (281), Expect = 3e-22 Identities = 83/308 (26%), Positives = 147/308 (47%), Gaps = 3/308 (0%) Frame = -3 Query: 1088 TFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQ 909 TFN+L+R LC+ G +D M + G S + L+S + A + G ++L Sbjct: 252 TFNILIRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCDLL--- 308 Query: 908 
AQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHL---LKLRSFCLDTCSFNIVI 738 ++ E + +S K R++EA F + + LK + +FN +I Sbjct: 309 REVQFRSELSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAV-----TFNALI 363 Query: 737 DGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGL 558 DG KAG + AF ++ M G AD+ T+ S+I+G C+ G + L+L E+ ++ + Sbjct: 364 DGYGKAGDMSSAFSLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKN-V 422 Query: 557 SPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALK 378 SP T++ +I+ K R EA L ++ + P + +N +I G ++G++ A Sbjct: 423 SPSAYTFSVLINALCKGNRLCEARDLLRELKGSNVVPKSFLYNPVIDGLCKAGNIDEANL 482 Query: 377 IFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLC 198 I M + C PD +TFT L+ G G + + + + +M + P+ T +I+ L Sbjct: 483 IVAEMEEKKCTPDRVTFTILILGNSMKGRMSEAIGNFSKMLSIGCAPDKITIDSLISCLS 542 Query: 197 KSNRLNEA 174 K+ +EA Sbjct: 543 KAGMPSEA 550 >gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis] Length = 570 Score = 418 bits (1074), Expect = e-114 Identities = 217/432 (50%), Positives = 289/432 (66%), Gaps = 1/432 (0%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 WF+K V TL +R+ FG Y K L P I+F VI+ +NNN P L +FF+ +R Sbjct: 75 WFVKVVSTLFVRSQSLNTFFG--YLSKKLTPSISFEVIKRLNNN---PNLGLKFFELSRA 129 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 NL++ HS +T+NLL+RSLCQMG D A V + M DG S D +EFLV A GK Sbjct: 130 NLSVNHSFSTYNLLIRSLCQMGFHDSAKFVFDCMRIDGHSPDNSTIEFLVCVFAKVGKLD 189 Query: 935 IGREILISQAQLCLGKEEI-INSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 759 ++L EEI + FV++ ++LVK N+V EA FR + F DT Sbjct: 190 SCEKLL----------EEIRASKFVYSSLFNVLVKNNKVYEAVCLFRKQIGS--HFVPDT 237 Query: 758 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 579 +FNI+I GLC G++ AFE FNDMG F C D+ TYN++I+GLC+ + DR +LLRE Sbjct: 238 WTFNILIGGLCGVGEVHSAFEFFNDMGKFRCSPDVVTYNTLISGLCRTNEVDRGCDLLRE 297 Query: 578 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 399 +Q +G SP+V+T+T++I G+ KLGR EA L+D+M G RP TFNVLI F + G Sbjct: 298 
VQLRGDFSPNVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTFNVLIDAFSKVG 357 Query: 398 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 219 D+ SA+ ++E M G PDV+TFT+L+ GYC++G++++GLKLW EM+ R + PN YT+S Sbjct: 358 DMASAIALYEKMLFHGYRPDVVTFTSLIDGYCRVGQLNRGLKLWCEMSVRNVSPNGYTYS 417 Query: 218 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 39 +VI++LCK NRL+EARDLL QL N I+P+ F+YNPVIDGFCKAGN+D AN IVAEMEE Sbjct: 418 VVIHALCKVNRLHEARDLLRQLNCTN-IVPKPFMYNPVIDGFCKAGNVDEANMIVAEMEE 476 Query: 38 KKCNPDKYTFTI 3 K+CNPDK TFTI Sbjct: 477 KRCNPDKMTFTI 488 Score = 60.5 bits (145), Expect = 2e-06 Identities = 50/247 (20%), Positives = 110/247 (44%) Frame = -3 Query: 1097 SIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREIL 918 ++ TF ++ C++G ++ A+ + + M G L+ + + G + I Sbjct: 307 NVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTFNVLIDAFSKVG--DMASAIA 364 Query: 917 ISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVI 738 + + L G + +F + + +++ + + + +R+ + ++++VI Sbjct: 365 LYEKMLFHGYRPDVVTFT--SLIDGYCRVGQLNRGLKLWCE--MSVRNVSPNGYTYSVVI 420 Query: 737 DGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGL 558 LCK ++ A ++ + YN VI+G CK G+ D A ++ E++ + Sbjct: 421 HALCKVNRLHEARDLLRQLNCTNIVPKPFMYNPVIDGFCKAGNVDEANMIVAEMEEKR-C 479 Query: 557 SPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALK 378 +PD T+T +I G GR +A+ ++ M G P+ T + L+ ++G A Sbjct: 480 NPDKMTFTILILGNCMKGRMVDAIGVFYKMLAVGCAPDKITVHCLMSCLLKAGMPNEAFH 539 Query: 377 IFESMSK 357 I E++ K Sbjct: 540 IKETVMK 546 >ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Citrus sinensis] gi|568841566|ref|XP_006474729.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Citrus sinensis] Length = 595 Score = 417 bits (1072), Expect = e-114 Identities = 220/467 (47%), Positives = 315/467 (67%), Gaps = 5/467 (1%) Frame = -3 Query: 1388 RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 1224 R T +IA FHG++N S P ++ WF+K VCTL +R++ L+ + Y 
Sbjct: 59 RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116 Query: 1223 FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 1044 + L P + VI+ ++N P+L +F +F+R+NL+L HS T+NL++RSLC+MGL Sbjct: 117 LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172 Query: 1043 DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 864 D +V +YM +DG + P++EF VSS AGK + +L +Q G E +++F+ Sbjct: 173 DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228 Query: 863 HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 684 +N L+ LVKQN DEA F+++ DT +FNI+I GL + G++ +AFE F D Sbjct: 229 YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIQGLSRIGEVKKAFEFFYD 286 Query: 683 MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 504 MGSFGC DI TYN++I+GLC++ + R ELL+E++ + SPDV TYT++ISG+ KLG Sbjct: 287 MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFSPDVVTYTSVISGYCKLG 346 Query: 503 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 324 + +A ++++M GI+P+ TFNVLI GFG+ G++ SA + E M FG PDV+TF+ Sbjct: 347 KMDKATGIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSFGYLPDVVTFS 406 Query: 323 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 144 +L+ GYC+ G+++QGLKL DEM + L PN YTF+I+IN+LCK NRLN+AR L QL+W Sbjct: 407 SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFTILINALCKENRLNDARRFLKQLKW- 465 Query: 143 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTI Sbjct: 466 NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTI 512 >ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citrus clementina] gi|557556032|gb|ESR66046.1| hypothetical protein CICLE_v10007804mg [Citrus clementina] Length = 595 Score = 416 bits (1069), Expect = e-113 Identities = 219/467 (46%), Positives = 314/467 (67%), Gaps = 5/467 (1%) Frame = -3 Query: 1388 RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 1224 R T +IA FHG++N S P ++ WF+K VCTL +R++ L+ + Y Sbjct: 59 RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116 Query: 1223 
FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 1044 + L P + VI+ ++N P+L +F +F+R+NL+L HS T+NL++RSLC+MGL Sbjct: 117 LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172 Query: 1043 DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 864 D +V +YM +DG + P++EF VSS AGK + +L +Q G E +++F+ Sbjct: 173 DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228 Query: 863 HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 684 +N L+ LVKQN DEA F+++ DT +FNI+I GLC+ G++ +AFE F D Sbjct: 229 YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIRGLCRIGEVKKAFEFFYD 286 Query: 683 MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 504 MGSFGC DI TYN++I+GLC++ + R ELL+E++ + PDV TYT++ISG+ KLG Sbjct: 287 MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFLPDVVTYTSVISGYCKLG 346 Query: 503 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 324 + +A ++++M GI+P+ TFNVLI GFG+ G++ SA + E M G PDV+TF+ Sbjct: 347 KMDKATSIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSLGYLPDVVTFS 406 Query: 323 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 144 +L+ GYC+ G+++QGLKL DEM + L PN YTF+I+IN+LCK NRLN+AR L QL+W Sbjct: 407 SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAILINALCKENRLNDARRFLKQLKW- 465 Query: 143 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTI Sbjct: 466 NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTI 512 Score = 60.5 bits (145), Expect = 2e-06 Identities = 45/194 (23%), Positives = 85/194 (43%) Frame = -3 Query: 1139 EFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSS 960 E+ + L+L + + TF+ L+ C+ G L+ + + M LS + L+++ Sbjct: 387 EYMRERMLSLGYLPDVVTFSSLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAILINA 446 Query: 959 VAHAGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKL 780 + + + R L + + + F++N + K VDEA + + ++ Sbjct: 447 LCKENRLNDARRFL----KQLKWNDLVPKPFMYNPVIDGFCKAGNVDEANVIVAE--MEE 500 Query: 779 RSFCLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADR 600 + D +F I+I G C G++ A +FN M GC D T NS+I+ L K G + 
Sbjct: 501 KRCKPDKVTFTILIIGHCMKGRMVEAISIFNKMLRIGCAPDDITVNSLISCLLKGGMPNE 560 Query: 599 ALELLREIQSQGGL 558 A +++ L Sbjct: 561 AFRIMQRASEDQNL 574 >ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Glycine max] Length = 544 Score = 403 bits (1035), Expect = e-109 Identities = 214/432 (49%), Positives = 303/432 (70%), Gaps = 1/432 (0%) Frame = -3 Query: 1295 WFIKFVCTLCI-RNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTR 1119 WF+K V TL + N+ + G YFR++L P V++ NN P L F+FF+FTR Sbjct: 46 WFVKIVSTLFLCSNSLDDRFLG--YFREHLTPSHVLEVVKRFNN----PNLGFKFFRFTR 99 Query: 1118 LNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKF 939 L++ HS T+N+LLRSLCQ GL + A L+ + M +DG D LL FLVSS A A +F Sbjct: 100 ERLSMSHSFWTYNMLLRSLCQAGLHNSAKLLYDSMRSDGQLPDSRLLGFLVSSFALADRF 159 Query: 938 SIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 759 + +E+L ++AQ C G + ++ V+N FL++L+K NR+D+A FR+ L++ S CLD Sbjct: 160 DVSKELL-AEAQ-CSGVQ--VDVIVYNNFLNILIKHNRLDDAICLFRE-LMRSHS-CLDA 213 Query: 758 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 579 +FNI+I GLC AG +D AFE+ DMGSFGC DI TYN +++GLC++ DRA +LL E Sbjct: 214 FTFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRIDQVDRARDLLEE 273 Query: 578 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 399 + + +P+V +YTT+ISG+ +L + EA L+ +M G +PNV+TF+ L+ GF ++G Sbjct: 274 VCLKCEFAPNVVSYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVFTFSALVDGFVKAG 333 Query: 398 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 219 D+ SAL + + + GCAP+VIT T+L++GYC+ G ++ GL LW EMNAR + N YT+S Sbjct: 334 DMASALGMHKKILFHGCAPNVITLTSLINGYCRAGWVNHGLDLWREMNARNIPANLYTYS 393 Query: 218 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 39 ++I++LCKSNRL EAR+LL L+ ++DI+P AF+YNPVIDG+CK+GN+D ANAIVAEMEE Sbjct: 394 VLISALCKSNRLQEARNLLRILK-QSDIVPLAFVYNPVIDGYCKSGNIDEANAIVAEMEE 452 Query: 38 KKCNPDKYTFTI 3 KC PDK TFTI Sbjct: 453 -KCKPDKLTFTI 463 Score = 65.5 bits (158), Expect = 6e-08 Identities = 70/310 (22%), Positives = 122/310 (39%), Gaps 
= 68/310 (21%) Frame = -3 Query: 1088 TFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQ 909 TFN+L+R LC G +D A ++ M + G S D L+ + + R++L Sbjct: 215 TFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRIDQVDRARDLL--- 271 Query: 908 AQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFR------------------DHLLK 783 ++CL E N + +S + +++DEA F D +K Sbjct: 272 EEVCLKCEFAPNVVSYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVFTFSALVDGFVK 331 Query: 782 L----------RSFCLDTCSFNIV-----IDGLCKAGQIDRAFEVFNDMGSFGCFADITT 648 + C+ N++ I+G C+AG ++ +++ +M + A++ T Sbjct: 332 AGDMASALGMHKKILFHGCAPNVITLTSLINGYCRAGWVNHGLDLWREMNARNIPANLYT 391 Query: 647 Y-----------------------------------NSVINGLCKLGDADRALELLREIQ 573 Y N VI+G CK G+ D A ++ E++ Sbjct: 392 YSVLISALCKSNRLQEARNLLRILKQSDIVPLAFVYNPVIDGYCKSGNIDEANAIVAEME 451 Query: 572 SQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDL 393 + PD T+T +I G GR EA+ ++ M G P+ T L +SG Sbjct: 452 EK--CKPDKLTFTILIIGHCMKGRTPEAIGIFYKMLASGCTPDDITIRTLSSCLLKSGMP 509 Query: 392 GSALKIFESM 363 G A +I E++ Sbjct: 510 GEAARIKETL 519 >ref|XP_002885810.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297331650|gb|EFH62069.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 536 Score = 388 bits (997), Expect = e-105 Identities = 211/469 (44%), Positives = 299/469 (63%), Gaps = 10/469 (2%) Frame = -3 Query: 1379 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 1221 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPIQNNTREKIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 1220 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 1041 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ G+ D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGMHD 120 Query: 1040 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 870 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGISPNSRLLGFLVSSFAEKGKLHCATALLLQSYEVEGCCM-------- 172 Query: 869 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 690 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFEEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKAVELL 229 Query: 689 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 510 M FGC DI TYN++I G CK + +A E+ +++S G SPDV TYT++ISG+ K Sbjct: 230 GGMSGFGCLPDIVTYNTLIKGFCKSNELKKANEMFDDVKSSSGCSPDVVTYTSMISGYCK 289 Query: 509 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 330 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMQEASVLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMHTAEEIRGKMISFGCFPDVVT 349 Query: 329 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 150 FT+L+ GYC++G+++QG +LW+EMNAR ++PNA+T+SI+IN+LCK NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVNQGFRLWEEMNARGMFPNAFTYSILINALCKENRLLKARELLGQLA 409 Query: 149 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 DI+PQ F+YNPVIDGFCKAG ++ A IV EME+KKC PDK TFTI Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEAIVIVEEMEKKKCKPDKITFTI 457 >dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana] Length = 536 Score = 385 bits (990), Expect = e-104 Identities = 211/469 (44%), Positives = 299/469 (63%), Gaps = 10/469 (2%) Frame = -3 Query: 1379 
TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 1221 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 1220 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 1041 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ GL D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120 Query: 1040 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 870 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172 Query: 869 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 690 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229 Query: 689 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 510 M FGC DI TYN++I G CK + ++A E+ ++++S SPDV TYT++ISG+ K Sbjct: 230 GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289 Query: 509 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 330 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349 Query: 329 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 150 FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409 Query: 149 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTI Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTI 457 >ref|NP_178657.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|42570711|ref|NP_973429.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75216767|sp|Q9ZUE9.1|PP149_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g06000 gi|4006835|gb|AAC95177.1| hypothetical protein [Arabidopsis thaliana] gi|110736272|dbj|BAF00106.1| hypothetical protein 
[Arabidopsis thaliana] gi|330250896|gb|AEC05990.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|330250897|gb|AEC05991.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 536 Score = 385 bits (990), Expect = e-104 Identities = 211/469 (44%), Positives = 299/469 (63%), Gaps = 10/469 (2%) Frame = -3 Query: 1379 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 1221 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 1220 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 1041 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ GL D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120 Query: 1040 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 870 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172 Query: 869 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 690 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229 Query: 689 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 510 M FGC DI TYN++I G CK + ++A E+ ++++S SPDV TYT++ISG+ K Sbjct: 230 GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289 Query: 509 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 330 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349 Query: 329 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 150 FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409 Query: 149 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTI Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTI 457 >ref|XP_006297396.1| hypothetical protein 
CARUB_v10013421mg [Capsella rubella] gi|565479514|ref|XP_006297397.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] gi|482566105|gb|EOA30294.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] gi|482566106|gb|EOA30295.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] Length = 535 Score = 382 bits (982), Expect = e-103 Identities = 212/469 (45%), Positives = 292/469 (62%), Gaps = 10/469 (2%) Frame = -3 Query: 1379 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 1221 TF + FH S+ ++ P NK E + W IK V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLHSNKREVMHCPEAWLIKIVSTLFVYRVPDSDLCFC-YL 63 Query: 1220 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 1041 KNL P IAF V++ ++NN P L F F++F+R LN+ HS T+N+L RSLC+ G+ D Sbjct: 64 SKNLNPFIAFEVVKKLDNNH--PHLGFRFWEFSRFKLNIRHSFWTYNVLTRSLCKAGMHD 121 Query: 1040 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 870 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 122 LAGQMFECMRSDGVSPNSRLLGFLVSSFAEKGKLQFATALLLQSYEVERCCM-------- 173 Query: 869 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 690 V N L+ LVK +RVD+A F HL C DT +FNI+I GLC G+ ++A E+ Sbjct: 174 -VVNSLLNTLVKLDRVDDAMKLFDKHLRF--QCCNDTKTFNILIRGLCSVGKGEKALELL 230 Query: 689 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 510 +M FGC DI TYN++I G CK + +A E+L +++S G SPDV TYT++ISG+ K Sbjct: 231 GEMSGFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCK 290 Query: 509 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 330 G+ EA L DDM GI P TFNVL+ G+ ++G++ SA I M FGC PDV+T Sbjct: 291 AGKMQEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISFGCFPDVVT 350 Query: 329 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 150 FT+L+ GYC+ G+++QG +LW+EMNA+ + PN +T+SI+IN+LCK N L +AR+LL QL Sbjct: 351 FTSLIDGYCRAGQVNQGFRLWEEMNAKGMLPNEFTYSILINALCKENSLLKARELLGQLA 410 Query: 149 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTI 3 DI+ + F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTI Sbjct: 411 
-SKDIITKPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTI 458 >ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Glycine max] gi|571448762|ref|XP_006577947.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Glycine max] gi|571448764|ref|XP_006577948.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X3 [Glycine max] gi|571448766|ref|XP_006577949.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X4 [Glycine max] Length = 510 Score = 375 bits (963), Expect = e-101 Identities = 191/431 (44%), Positives = 282/431 (65%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 WF+K CT+ +R+ G YF K+L P + + V+ ++ P L F+F +F R Sbjct: 10 WFVKIACTVFVRSNSLDPFVG--YFSKHLTPSLVYEVVNRLHI----PNLGFKFVEFCRH 63 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 L++ HS T++LLLRSLC+ L A +V ++M DG D LL FLV S A G+ Sbjct: 64 KLHMSHSYLTYSLLLRSLCRSNLHHTAKVVYDWMRCDGQIPDNRLLGFLVWSYAIVGRLD 123 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 + RE+L +G +N+ V+N ++L++QN+V +A + FR+ L++LR + T Sbjct: 124 VSRELLADVQCNNVG----VNAVVYNDLFNVLIRQNKVVDAVVLFRE-LIRLR-YKPVTY 177 Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 + NI++ GLC+AG+ID AF + ND+ SFGC D+ TYN++I+GLC++ + DRA LL+E+ Sbjct: 178 TVNILMRGLCRAGEIDEAFRLLNDLRSFGCLPDVITYNTLIHGLCRINEVDRARSLLKEV 237 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 G +PDV +YTTIISG+ K + E L+ +M G PN +TFN LI GFG+ GD Sbjct: 238 CLNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGKLGD 297 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SAL ++E M GC PDV TFT+L++GY ++G++ Q + +W +MN + + YTFS+ Sbjct: 298 MASALALYEKMLVQGCVPDVATFTSLINGYFRLGQVHQAMDMWHKMNDKNIGATLYTFSV 357 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 +++ LC +NRL++ARD+L L +DI+PQ 
FIYNPVIDG+CK+GN+D AN IVAEME Sbjct: 358 LVSGLCNNNRLHKARDILRLLN-ESDIVPQPFIYNPVIDGYCKSGNVDEANKIVAEMEVN 416 Query: 35 KCNPDKYTFTI 3 +C PDK TFTI Sbjct: 417 RCKPDKLTFTI 427 Score = 61.2 bits (147), Expect = 1e-06 Identities = 51/249 (20%), Positives = 103/249 (41%), Gaps = 1/249 (0%) Frame = -3 Query: 1118 LNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKF 939 LN + ++ ++ C+ ++ NL+ M G + + L+ G Sbjct: 239 LNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGKLGDM 298 Query: 938 SIGREILISQ-AQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLD 762 + + Q C+ S ++ F + +V +A + H + ++ Sbjct: 299 ASALALYEKMLVQGCVPDVATFTSLINGYF-----RLGQVHQAMDMW--HKMNDKNIGAT 351 Query: 761 TCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLR 582 +F++++ GLC ++ +A ++ + YN VI+G CK G+ D A +++ Sbjct: 352 LYTFSVLVSGLCNNNRLHKARDILRLLNESDIVPQPFIYNPVIDGYCKSGNVDEANKIVA 411 Query: 581 EIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQS 402 E++ PD T+T +I G GR EA+ ++ M G P+ T N L ++ Sbjct: 412 EMEVNR-CKPDKLTFTILIIGHCMKGRMPEAIGIFHKMLAVGCAPDEITVNNLRSCLLKA 470 Query: 401 GDLGSALKI 375 G G A ++ Sbjct: 471 GMPGEAARV 479 >ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum] gi|557096393|gb|ESQ36901.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum] Length = 535 Score = 370 bits (951), Expect = e-100 Identities = 193/431 (44%), Positives = 279/431 (64%) Frame = -3 Query: 1295 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 1116 W +K V TL + + + Y KNL P IAF V++ ++N P + F F++F+R Sbjct: 40 WLVKIVSTLFVYQVPDSDLCFC-YLSKNLNPFIAFEVVKKLDN----PHIGFRFWEFSRF 94 Query: 1115 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 936 LN+ HS T+NLL RSLC+ GL DLA + E M +DG+S + LL FLVSS A GK Sbjct: 95 KLNIRHSFWTYNLLTRSLCKAGLHDLAGKMFECMKSDGVSPNSRLLGFLVSSFAEKGKLH 154 Query: 935 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 756 +L+ ++ G ++NS +H LV+ +RV++A F HL C DT Sbjct: 155 FATALLLQSYEV-EGSSMVVNSLLHT-----LVRLDRVEDAMKLFDTHLRS--QSCNDTR 206 
Query: 755 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 576 +FNI+I GLC G+ A ++ +M SFG DI TYN++I G CK + ++A E+ E+ Sbjct: 207 TFNILIQGLCGIGKAHEALKLLGEMSSFGSSPDIVTYNTLIKGFCKSNELNKANEIFNEV 266 Query: 575 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 396 +S+ G DV TYT+++SG+ K G+ EA L D+M G+ P TFNVL++G+ ++G+ Sbjct: 267 KSRNGCFRDVVTYTSMMSGYCKAGKMREASLLLDEMVGLGMYPTNITFNVLVYGYVKAGE 326 Query: 395 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 216 + SA I M FGC PDV+TFT L+ GYC++G++++G LW+EM+A+ ++PNA+T+SI Sbjct: 327 MSSAEAIRRKMDSFGCFPDVVTFTTLIDGYCRVGQVNKGFSLWEEMSAKGMFPNAFTYSI 386 Query: 215 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 36 +IN+LCK NRL +AR+LL QL DI+P+ F+YNP+IDGFCKAG ++ AN IVAEME+ Sbjct: 387 LINALCKENRLLKARELLGQLACM-DIVPKPFLYNPIIDGFCKAGKVNEANVIVAEMEKF 445 Query: 35 KCNPDKYTFTI 3 +C PDK TFTI Sbjct: 446 RCKPDKITFTI 456
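The statistics that open each alignment in this report (Score, Expect, Identities, Positives, an optional Gaps field, and Frame) follow a fixed pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, not part of the BLAST output itself: the names HSP_STATS, parse_expect and parse_hsp_stats are hypothetical, and the pattern assumes the legacy BLASTX 2.2.25 layout seen here. Two details come from the report: small E-values are printed without the leading "1" ("e-100" stands for 1e-100), and the percentages are truncated rather than rounded (219/467 is printed as 46%).

```python
import re

# Pattern for the statistics block that opens each alignment in this report.
# "Gaps" is optional: the report omits it when an alignment contains no gaps.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<expect>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+) \(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+ \(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_expect(text):
    """Convert an E-value string to float; 'e-100' is shorthand for 1e-100."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_stats(report):
    """Return one dict per alignment statistics block found in the report text."""
    hsps = []
    for m in HSP_STATS.finditer(report):
        identities = int(m["ident"])
        length = int(m["length"])
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "expect": parse_expect(m["expect"]),
            "identities": identities,
            "positives": int(m["pos"]),
            "gaps": int(m["gaps"] or 0),  # missing Gaps field means zero gaps
            "align_length": length,
            # Floor division reproduces the truncated percentage in the report.
            "pct_identity": 100 * identities // length,
            "frame": int(m["frame"]),
        })
    return hsps


if __name__ == "__main__":
    # Two statistics blocks copied from the report above.
    sample = (
        "Length = 535 Score = 370 bits (951), Expect = e-100 "
        "Identities = 193/431 (44%), Positives = 279/431 (64%) Frame = -3 "
        "Score = 61.2 bits (147), Expect = 1e-06 "
        "Identities = 51/249 (20%), Positives = 103/249 (41%), "
        "Gaps = 1/249 (0%) Frame = -3"
    )
    for hsp in parse_hsp_stats(sample):
        print(hsp)
```

Because the pattern uses \s wherever the report has whitespace between fields, it should work both on this flattened copy and on a report that keeps its original line breaks. It does not handle the "Expect(n)" variant that BLAST prints for grouped alignments.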