BLASTX nr result
ID: Catharanthus22_contig00011419
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00011419 (3290 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containi... 637 e-179 ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containi... 630 e-178 gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, put... 522 e-145 ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containi... 522 e-145 ref|XP_006381622.1| pentatricopeptide repeat-containing family p... 519 e-144 ref|XP_002326124.1| predicted protein [Populus trichocarpa] 519 e-144 ref|XP_002512275.1| pentatricopeptide repeat-containing protein,... 514 e-143 ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containi... 508 e-141 ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containi... 506 e-140 gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis] 506 e-140 ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containi... 503 e-139 ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containi... 503 e-139 ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citr... 499 e-138 ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containi... 470 e-129 ref|XP_002885810.1| pentatricopeptide repeat-containing protein ... 464 e-128 dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana] 462 e-127 ref|NP_178657.1| pentatricopeptide repeat-containing protein [Ar... 462 e-127 ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Caps... 460 e-126 ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containi... 458 e-126 ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutr... 447 e-122 >ref|XP_006351831.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Solanum tuberosum] gi|565370447|ref|XP_006351832.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Solanum tuberosum] Length = 550 Score = 637 bits (1643), Expect = e-179 Identities = 318/554 (57%), Positives = 408/554 (73%) Frame = +2 Query: 236 MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 415 MPLW++RA + IAR FHG+++ +S P E++WF K VC LC ++++L +F Sbjct: 1 MPLWVQRASNILL---IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56 Query: 416 GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 595 GSDYFR+NL P IAF VI HIN N PRLAF F Q TR+NLNL+H I +FNLLLRSL Q Sbjct: 57 GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLVHCIGSFNLLLRSLSQ 116 Query: 596 MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 775 MG D A LV ++M ADG L+ +LE +V ++A+AGKF I +EILISQA+L + I+ Sbjct: 117 MGFHDSAMLVFKFMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGRIV 176 Query: 776 NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 955 FVHN LSLL+K++RVDEA FF+ H+L+ DTC+FN VI GLC+ G +D+AFE Sbjct: 177 RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236 Query: 956 VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 1135 FNDMGSFGCF D TYN++INGLC +G +RA LL ++ Q GLSPDV TYT++I+G+ Sbjct: 237 FFNDMGSFGCFPDTVTYNTLINGLCSVGQVNRARGLLGNLELQDGLSPDVVTYTSVIAGY 296 Query: 1136 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 1315 KLGR EA++L D+MT GI PN+ TFN+LI+GFG+ GD+ SA++++ M G PDV Sbjct: 297 CKLGRMDEAINLMDEMTTYGISPNLVTFNILINGFGKIGDMFSAIQMYGRMCAVGYPPDV 356 Query: 1316 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 1495 +TFT+L+ GYC+ GE+DQGLKLWDEMN R L PN YTFSI+I++L K NRLNEAR+LL Q Sbjct: 357 VTFTSLIDGYCRTGELDQGLKLWDEMNTRNLSPNLYTFSILISALSKENRLNEARELLRQ 416 Query: 1496 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKG 1675 L+ R+DI+PQ F+YNPV+DGFCKAGNL AN I AEME + C DK TFTILI+GHCMKG Sbjct: 417 LKSRDDIVPQPFVYNPVLDGFCKAGNLSKANVIAAEMESRGCCHDKITFTILILGHCMKG 476 Query: 1676 RMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGG 1855 RM +A+ +F+KM +GC PD ITV+CL SCLLKAGMV EAY+++ + L SS+ Sbjct: 477 RMLEAMAIFDKMLSLGCVPDDITVSCLTSCLLKAGMVKEAYKVRLTPSKDLNPDLSSSKQ 536 Query: 1856 PKPFRASMDITVAV 1897 PFR S+DI VAV Sbjct: 537 SVPFRTSLDIPVAV 550 >ref|XP_004230611.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Solanum lycopersicum] Length = 550 Score = 630 bits (1626), Expect = e-178 Identities = 319/554 (57%), Positives = 407/554 (73%) Frame = +2 Query: 236 MPLWIKRAPRNITFKSIARCFHGISNIESSPPTHNKIESLWFIKFVCTLCIRNAENLAIF 415 MPLW++RA NI+ IAR FHG+++ +S P E++WF K VC LC ++++L +F Sbjct: 1 MPLWVQRAS-NISL--IAR-FHGLTSSKSIPSYGPGPEAVWFTKVVCLLCFHHSQSLDVF 56 Query: 416 GSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQ 595 GSDYFR+NL P IAF VI HIN N PRLAF F Q TR+NLNLIH I +FNLLLRSL Q Sbjct: 57 GSDYFRQNLDPHIAFTVIHHINTNLNNPRLAFRFLQCTRINLNLIHCIGSFNLLLRSLSQ 116 Query: 596 MGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEII 775 MG D A LV +YM ADG L+ +LE +V ++A+AGKF I +EILISQA+L + I+ Sbjct: 117 MGFHDSAMLVFKYMKADGYLLENSILESVVLALANAGKFEIAKEILISQAELGREEGSIV 176 Query: 776 NSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFE 955 FVHN LSLL+K++RVDEA FF+ H+L+ DTC+FN VI GLC+ G +D+AFE Sbjct: 177 RPFVHNSLLSLLMKRSRVDEAVDFFKHHILRSERLFPDTCTFNTVIRGLCRVGGVDKAFE 236 Query: 956 VFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGF 1135 FNDMGSFGC D TYN++INGLC +G +RA LL +Q Q GLSPDV TYT++ISG+ Sbjct: 237 FFNDMGSFGCSPDTVTYNTLINGLCAVGQVNRAQGLLGNLQLQDGLSPDVVTYTSLISGY 296 Query: 1136 FKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDV 1315 KL R EA++L D+M GI PN+ TFN+LI+GFG+ GD+ SA+K++ M G PDV Sbjct: 297 CKLSRMDEAINLMDEMITYGISPNLVTFNILINGFGKIGDMFSAIKMYGKMCAVGYPPDV 356 Query: 1316 ITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQ 1495 +TFT+L+ GYC+ GE+DQGLKLWD+MN+R L PN YTFS++I++L K NRLNEAR+LL Q Sbjct: 357 VTFTSLIDGYCRTGELDQGLKLWDDMNSRNLSPNLYTFSVLISALSKENRLNEARELLRQ 416 Query: 1496 LRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKG 1675 L+ R+DI+PQ F+YNPV+DGFCKAGNL AN I AEME K C DK TFTILI+GHCMKG Sbjct: 417 LKSRDDIVPQPFVYNPVLDGFCKAGNLSEANVIAAEMESKGCCHDKITFTILILGHCMKG 476 Query: 1676 RMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGG 1855 RM +A+ +F+KM +GC PD IT++CL SCLLKAGMV EAY+++ + L S + Sbjct: 477 RMLEALAIFDKMLSLGCVPDDITISCLTSCLLKAGMVKEAYKVRLIPSKDLNPDLSPSKL 536 Query: 1856 PKPFRASMDITVAV 1897 PFR S+DI VAV Sbjct: 537 FIPFRTSLDIPVAV 550 >gb|EOY30252.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 592 Score = 522 bits (1345), Expect = e-145 Identities = 265/513 (51%), Positives = 356/513 (69%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 WF+K VCTL + + + L Y KNL P I F V++ +NN P L +F +F+R+ Sbjct: 91 WFVKVVCTLFVYS-QPLDDSCLSYLSKNLTPLIEFEVVKWLNN----PALGLKFLEFSRV 145 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 N N+ HS T+NLL+RS C MGL D A LV +YM DG D +L F++SS AG+F Sbjct: 146 NFNIAHSFWTYNLLMRSFCHMGLHDSAKLVFDYMRIDGHLPDTTILGFMISSFGRAGEFG 205 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 + +++L E +I+ F N L+++VKQN+++EA ++++L +F D Sbjct: 206 MAKKLLADVQS----DEVVISIFALNNLLNMMVKQNKLEEAVSLYKENLGS--NFYPDAW 259 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 +FNI+I GLC+ G++D+AFE+FNDMGSFGCF DI TYN++INGLCK+ + DR +LL ++ Sbjct: 260 TFNILIRGLCRVGKVDQAFELFNDMGSFGCFPDIVTYNTIINGLCKVNEVDRGHKLLNQV 319 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 QS+ SPDV TYT++ISG+ KLG+ EA L+ +M G P V TFNVLI GFG+ GD Sbjct: 320 QSRDDCSPDVVTYTSVISGYCKLGKMDEASALFHEMISSGTVPTVVTFNVLIDGFGKVGD 379 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SA ++E M+ FGC DV+TFT+L+ GYC+IG+++Q L+LW+ M R L PN YTF+I Sbjct: 380 MVSAKSMYEQMASFGCIADVVTFTSLIDGYCRIGDVNQSLQLWNTMKGRDLSPNVYTFAI 439 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 IN+LCK NRL+EAR L +L+ RN I+P+ FI+NPVIDGFCKAGNLD AN IVAEMEEK Sbjct: 440 TINALCKENRLHEARGFLRELQCRN-IVPKPFIFNPVIDGFCKAGNLDEANLIVAEMEEK 498 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 +C+PDK TFTILIIGHCMKGRM +AI +FNKM VGC PD +TVN L+SCLLKAGM +EA Sbjct: 499 QCHPDKVTFTILIIGHCMKGRMFEAISIFNKMLSVGCTPDDVTVNSLISCLLKAGMPSEA 558 Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894 RI K + ++ G S P R + + VA Sbjct: 559 SRITKMASEDMKLGSSLLENNSPLRINRGVPVA 591 >ref|XP_002278014.2| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Vitis vinifera] Length = 641 Score = 522 bits (1345), Expect = e-145 Identities = 266/513 (51%), Positives = 354/513 (69%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 W +K +CTLC+R A DYF K L P IAF V++ +NN P LA +FFQ +R+ Sbjct: 76 WIVKVICTLCVRTHSLDACL--DYFSKTLTPSIAFEVVRGLNN----PELALKFFQLSRV 129 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 NLNL HS T++ LLRSL +MG + A V + MN DG S D +L FLVSS AGKF+ Sbjct: 130 NLNLCHSFRTYSFLLRSLSEMGFHESAKAVYDCMNIDGHSPDASVLGFLVSSATDAGKFN 189 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 I R + + L V+NK L+ LV+ N+VDEA FFR+ + F D+C Sbjct: 190 IART-WVDGVEFSL--------VVYNKLLNQLVRGNQVDEAVCFFREQMGLHGPF--DSC 238 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 SFNI+I GLC+ G++D+AFE+FN+M FGC D+ TYN++ING C++ + DR +LL+E+ Sbjct: 239 SFNILIRGLCRIGKVDKAFELFNEMRGFGCSPDVITYNTLINGFCRVNEVDRGHDLLKEL 298 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 S+ LSPDV TYT+IISG+ KLG+ +A L+++M GI+PN +TFN+LI+GFG+ GD Sbjct: 299 LSKNDLSPDVVTYTSIISGYCKLGKMEKASILFNNMISSGIKPNAFTFNILINGFGKVGD 358 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SA ++E M GC PD+ITFT+L+ G+C+ G++++ LKLW E+NAR L PN YTF+I Sbjct: 359 MVSAENMYEEMLLLGCPPDIITFTSLIDGHCRTGKVERSLKLWHELNARNLSPNEYTFAI 418 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 + N+LCK NRL+EAR L L+WR+ I+ Q F+YNPVIDGFCKAGN+D AN I+AEMEEK Sbjct: 419 LTNALCKENRLHEARGFLRDLKWRH-IVAQPFMYNPVIDGFCKAGNVDEANVILAEMEEK 477 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 +C PDK T+TILIIGHCMKGR+ +AI +FN+M GCAPD IT+ L+SCLLKAGM EA Sbjct: 478 RCKPDKITYTILIIGHCMKGRLSEAISIFNRMLGTGCAPDSITMTSLISCLLKAGMPNEA 537 Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894 YRI + + G S P R + DI VA Sbjct: 538 YRIMQIASEDFNLGLKSLKRNVPLRTNTDIPVA 570 >ref|XP_006381622.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550336330|gb|ERP59419.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 511 Score = 519 bits (1336), Expect = e-144 Identities = 262/491 (53%), Positives = 348/491 (70%) Frame = +2 Query: 425 YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 604 Y + L P IAF VI+ NN P++ F+F +F+RLNLN+ H +T+NLL+RSLCQMG Sbjct: 33 YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88 Query: 605 LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 784 DL N+V +YM +DG D LL FLV+ +A A F + +++L ++ Q GKE INSF Sbjct: 89 HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144 Query: 785 VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 964 V+N LS+LVKQN+V EA F+++L DT +FNI+I GLC+ G +DRAFEVF Sbjct: 145 VYNNLLSVLVKQNQVHEAIYLFKEYLAMQSP---DTWTFNILIRGLCRVGGVDRAFEVFK 201 Query: 965 DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 1144 DM SFGC D+ TYN++INGLCK + R EL +EIQS+ SPD+ TYT+IISGF K Sbjct: 202 DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 261 Query: 1145 GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 1324 G+ EA +L+++M GI+PNV TFNVLI GFG+ G++ A ++ M+ F C+ DV+TF Sbjct: 262 GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 321 Query: 1325 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 1504 T+L+ GYC+ G+++ GLK W+ M R + P YT++++IN+LCK NRLNEARD L Q++ Sbjct: 322 TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 380 Query: 1505 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMH 1684 + I+P+ F+YNPVIDGFCKAGN+D N I+ EMEEK+C+PDK TFTILIIGHC+KGRM Sbjct: 381 NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMF 440 Query: 1685 DAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKP 1864 +AI++FN+M CAPD ITVN L+SCLLKAGM EAYRI+K L+ G SS P Sbjct: 441 EAINIFNRMLATRCAPDNITVNSLISCLLKAGMPNEAYRIRKMALEDRNLGLSSFEKAIP 500 Query: 1865 FRASMDITVAV 1897 R + DI VAV Sbjct: 501 LRTNTDIPVAV 511 >ref|XP_002326124.1| predicted protein [Populus trichocarpa] Length = 512 Score = 519 bits (1336), Expect = e-144 Identities = 262/491 (53%), Positives = 349/491 (71%) Frame = +2 Query: 425 YFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGL 604 Y + L P IAF VI+ NN P++ F+F +F+RLNLN+ H +T+NLL+RSLCQMG Sbjct: 33 YPDRQLTPLIAFEVIKRFNN----PKVGFKFLEFSRLNLNVNHCYSTYNLLMRSLCQMGH 88 Query: 605 LDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSF 784 DL N+V +YM +DG D LL FLV+ +A A F + +++L ++ Q GKE INSF Sbjct: 89 HDLVNIVFDYMGSDGHLPDSKLLGFLVTWMAQASDFDMVKKLL-AEVQ---GKEVRINSF 144 Query: 785 VHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFN 964 V+N LS+LVKQN+V EA F+++L+ DT +FNI+I GLC+ G +DRAFEVF Sbjct: 145 VYNNLLSVLVKQNQVHEAIYLFKEYLVMQSP--PDTWTFNILIRGLCRVGGVDRAFEVFK 202 Query: 965 DMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKL 1144 DM SFGC D+ TYN++INGLCK + R EL +EIQS+ SPD+ TYT+IISGF K Sbjct: 203 DMESFGCLPDVVTYNTLINGLCKANEVQRGCELFKEIQSRSDCSPDIVTYTSIISGFCKS 262 Query: 1145 GRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITF 1324 G+ EA +L+++M GI+PNV TFNVLI GFG+ G++ A ++ M+ F C+ DV+TF Sbjct: 263 GKMKEASNLFEEMMRSGIQPNVITFNVLIDGFGKIGNIAEAEAMYRKMAYFDCSADVVTF 322 Query: 1325 TNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRW 1504 T+L+ GYC+ G+++ GLK W+ M R + P YT++++IN+LCK NRLNEARD L Q++ Sbjct: 323 TSLIDGYCRAGQVNHGLKFWNVMKTRNVSPTVYTYAVLINALCKENRLNEARDFLGQIK- 381 Query: 1505 RNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMH 1684 + I+P+ F+YNPVIDGFCKAGN+D N I+ EMEEK+C+PDK TFTILIIGHC+KGRM Sbjct: 382 NSSIIPKPFMYNPVIDGFCKAGNVDEGNVILKEMEEKRCDPDKVTFTILIIGHCVKGRMF 441 Query: 1685 DAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKP 1864 +AI++FN+M CAPD ITVN L+SCLLKAGM EAYRI+K L+ G SS P Sbjct: 442 EAINIFNRMLATRCAPDNITVNSLISCLLKAGMPNEAYRIRKMALEDRNLGLSSFEKAIP 501 Query: 1865 FRASMDITVAV 1897 R + DI VAV Sbjct: 502 LRTNTDIPVAV 512 >ref|XP_002512275.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223548236|gb|EEF49727.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 532 Score = 514 bits (1324), Expect = e-143 Identities = 261/518 (50%), Positives = 358/518 (69%) Frame = +2 Query: 341 KIESLWFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFF 520 K + WF+K + L +R+ + A K P +AF VI+ +NNN P++ +F Sbjct: 24 KNQEAWFVKVIAILFVRSHCSDATSLGYLSEKLNDPLVAFEVIKRLNNN---PQVGLKFM 80 Query: 521 QFTRLNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAH 700 +F RLN +LIH +T+ LL+RSLCQMGL DL +V+ YM +DG +D +L FLV+S A Sbjct: 81 EFCRLNFSLIHCFSTYELLIRSLCQMGLHDLVEMVIGYMRSDGHLIDSRVLGFLVTSFAQ 140 Query: 701 AGKFSIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSF 880 AGKF + ++++I G+E I+SFV+N L+ LVK +V EA F+++L Sbjct: 141 AGKFDLAKKLIIEVQ----GEEARISSFVYNYLLNELVKGGKVHEAIFLFKENLAFHSP- 195 Query: 881 CLDTCSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALE 1060 +T +FNI+I GLC+ G++++ FE+FN M SFGC D+ TYN++I+GLCK + DRA + Sbjct: 196 -PNTWTFNILIRGLCRVGEVEKGFELFNAMQSFGCLPDVVTYNTLISGLCKANELDRACD 254 Query: 1061 LLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGF 1240 LL+E+QS+ SPDV TYT+IISGF KLG+ A L+++M GI P V TFNVLI GF Sbjct: 255 LLKEVQSRNDCSPDVMTYTSIISGFRKLGKLEAASVLFEEMIRSGIEPTVVTFNVLIDGF 314 Query: 1241 GQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNA 1420 G+ G++ +A + E M+ + C PDV+TFT+L+ GYC+ G+I GLK+WD M AR + PN Sbjct: 315 GKIGNMVAAEAMHEKMASYSCIPDVVTFTSLIDGYCRTGDIRLGLKVWDVMKARNVSPNI 374 Query: 1421 YTFSIVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVA 1600 YT+S++IN+LCK NR++EARDLL QL+ +D+ P+ FIYNPVIDGFCKAGN+D AN IV Sbjct: 375 YTYSVIINALCKDNRIHEARDLLRQLKC-SDVFPKPFIYNPVIDGFCKAGNVDEANVIVT 433 Query: 1601 EMEEKKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAG 1780 EMEEK+C PDK TFTILIIGHCMKGRM +A+D+F KM +GCAPD IT++ LV+CLLKAG Sbjct: 434 EMEEKRCRPDKVTFTILIIGHCMKGRMVEALDIFKKMLAIGCAPDNITISSLVACLLKAG 493 Query: 1781 MVTEAYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVA 1894 +EA+ I + + L FSS P R DI+VA Sbjct: 494 KPSEAFHIVQTASEDLNLSFSSLRKTFPMRVKTDISVA 531 >ref|XP_004141071.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Cucumis sativus] Length = 548 Score = 508 bits (1307), Expect = e-141 Identities = 264/514 (51%), Positives = 353/514 (68%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 W +K VCTL R+ A FG Y +NL P IAF VI+ F P L +FF+F+R Sbjct: 48 WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 +L++ H+ T++LL+R+LC++GL D A +V + M +DG+ D +LE LVSS A GK Sbjct: 102 HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 + L C G + ++ FV+N L++LVKQN VDEA + FR+HL F D Sbjct: 162 SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ + + +LL+E Sbjct: 216 SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 G+SPDV TYT+IISG+ KLG A L+D+M GI+PN +TFNVLI GFG+ G+ Sbjct: 276 MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SA+ ++E M GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM R L PN YT+++ Sbjct: 336 MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 +IN+LCK NR+ EAR+ L L+ ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK Sbjct: 396 LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 KC PDK TFTILIIG+CMKGRM +AI F KM + C PD IT+N L+SCLLKAGM EA Sbjct: 455 KCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEA 514 Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897 +IK+ LQ L G SS G P R S + VAV Sbjct: 515 SQIKQAALQKLNLGLSSLGSPLT-RKSSRVPVAV 547 >ref|XP_004157939.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Cucumis sativus] Length = 548 Score = 506 bits (1304), Expect = e-140 Identities = 260/507 (51%), Positives = 350/507 (69%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 W +K VCTL R+ A FG Y +NL P IAF VI+ F P L +FF+F+R Sbjct: 48 WLVKVVCTLFFRSHSLNACFG--YLSRNLNPSIAFEVIKR----FSDPLLGLKFFEFSRT 101 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 +L++ H+ T++LL+R+LC++GL D A +V + M +DG+ D +LE LVSS A GK Sbjct: 102 HLSINHTFNTYDLLMRNLCKVGLNDSAKIVFDCMRSDGILPDSSILELLVSSYARMGKLD 161 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 + L C G + ++ FV+N L++LVKQN VDEA + FR+HL F D Sbjct: 162 SAKNFL--NEVHCYGIK--VSPFVYNNLLNMLVKQNLVDEAVLLFREHLEPY--FVPDVY 215 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 SFNI+I GLC+ G+ID+AFE F +MG+FGCF DI +YN++ING C++ + + +LL+E Sbjct: 216 SFNILIRGLCRIGEIDKAFEFFQNMGNFGCFPDIVSYNTLINGFCRVNEISKGHDLLKED 275 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 G+SPDV TYT+IISG+ KLG A L+D+M GI+PN +TFNVLI GFG+ G+ Sbjct: 276 MLIKGVSPDVITYTSIISGYCKLGDMKAASELFDEMVSSGIKPNDFTFNVLIDGFGKVGN 335 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SA+ ++E M GC PDV+TFT+L+ GYC+ GE++QGLKLW+EM R L PN YT+++ Sbjct: 336 MRSAMVMYEKMLLLGCLPDVVTFTSLIDGYCREGEVNQGLKLWEEMKVRNLSPNVYTYAV 395 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 +IN+LCK NR+ EAR+ L L+ ++++P+ FIYNPVIDGFCKAG +D AN IVAEM+EK Sbjct: 396 LINALCKENRIREARNFLRHLK-SSEVVPKPFIYNPVIDGFCKAGKVDEANFIVAEMQEK 454 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 KC PDK TFTILIIG+CMKGRM +AI F KM + C PD IT+N L+SCLLKAGM EA Sbjct: 455 KCRPDKITFTILIIGNCMKGRMVEAISTFYKMIEINCVPDEITINSLISCLLKAGMPNEA 514 Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRAS 1876 +IK+ LQ L G SS G P ++S Sbjct: 515 SQIKQAALQKLNLGLSSLGSPLTRKSS 541 >gb|EXB38956.1| hypothetical protein L484_027391 [Morus notabilis] Length = 570 Score = 506 bits (1303), Expect = e-140 Identities = 266/515 (51%), Positives = 349/515 (67%), Gaps = 1/515 (0%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 WF+K V TL +R+ FG Y K L P I+F VI+ +NNN P L +FF+ +R Sbjct: 75 WFVKVVSTLFVRSQSLNTFFG--YLSKKLTPSISFEVIKRLNNN---PNLGLKFFELSRA 129 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 NL++ HS +T+NLL+RSLCQMG D A V + M DG S D +EFLV A GK Sbjct: 130 NLSVNHSFSTYNLLIRSLCQMGFHDSAKFVFDCMRIDGHSPDNSTIEFLVCVFAKVGKLD 189 Query: 716 IGREILISQAQLCLGKEEI-INSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 892 ++L EEI + FV++ ++LVK N+V EA FR + F DT Sbjct: 190 SCEKLL----------EEIRASKFVYSSLFNVLVKNNKVYEAVCLFRKQIGS--HFVPDT 237 Query: 893 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072 +FNI+I GLC G++ AFE FNDMG F C D+ TYN++I+GLC+ + DR +LLRE Sbjct: 238 WTFNILIGGLCGVGEVHSAFEFFNDMGKFRCSPDVVTYNTLISGLCRTNEVDRGCDLLRE 297 Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252 +Q +G SP+V+T+T++I G+ KLGR EA L+D+M G RP TFNVLI F + G Sbjct: 298 VQLRGDFSPNVRTFTSVILGYCKLGRMEEASALFDEMMDSGTRPTTVTFNVLIDAFSKVG 357 Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432 D+ SA+ ++E M G PDV+TFT+L+ GYC++G++++GLKLW EM+ R + PN YT+S Sbjct: 358 DMASAIALYEKMLFHGYRPDVVTFTSLIDGYCRVGQLNRGLKLWCEMSVRNVSPNGYTYS 417 Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612 +VI++LCK NRL+EARDLL QL N I+P+ F+YNPVIDGFCKAGN+D AN IVAEMEE Sbjct: 418 VVIHALCKVNRLHEARDLLRQLNCTN-IVPKPFMYNPVIDGFCKAGNVDEANMIVAEMEE 476 Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792 K+CNPDK TFTILI+G+CMKGRM DAI VF KM VGCAPD+ITV+CL+SCLLKAGM E Sbjct: 477 KRCNPDKMTFTILILGNCMKGRMVDAIGVFYKMLAVGCAPDKITVHCLMSCLLKAGMPNE 536 Query: 1793 AYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897 A+ IK+ V++ L G SS RA +I +AV Sbjct: 537 AFHIKETVMKSLNVGMSSLRS-NHMRAIAEIPMAV 570 >ref|XP_006474728.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Citrus sinensis] gi|568841566|ref|XP_006474729.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Citrus sinensis] Length = 595 Score = 503 bits (1295), Expect = e-139 Identities = 266/549 (48%), Positives = 369/549 (67%), Gaps = 5/549 (0%) Frame = +2 Query: 263 RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 427 R T +IA FHG++N S P ++ WF+K VCTL +R++ L+ + Y Sbjct: 59 RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116 Query: 428 FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 607 + L P + VI+ ++N P+L +F +F+R+NL+L HS T+NL++RSLC+MGL Sbjct: 117 LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172 Query: 608 DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 787 D +V +YM +DG + P++EF VSS AGK + +L +Q G E +++F+ Sbjct: 173 DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228 Query: 788 HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 967 +N L+ LVKQN DEA F+++ DT +FNI+I GL + G++ +AFE F D Sbjct: 229 YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIQGLSRIGEVKKAFEFFYD 286 Query: 968 MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 1147 MGSFGC DI TYN++I+GLC++ + R ELL+E++ + SPDV TYT++ISG+ KLG Sbjct: 287 MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFSPDVVTYTSVISGYCKLG 346 Query: 1148 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 1327 + +A ++++M GI+P+ TFNVLI GFG+ G++ SA + E M FG PDV+TF+ Sbjct: 347 KMDKATGIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSFGYLPDVVTFS 406 Query: 1328 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 1507 +L+ GYC+ G+++QGLKL DEM + L PN YTF+I+IN+LCK NRLN+AR L QL+W Sbjct: 407 SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFTILINALCKENRLNDARRFLKQLKW- 465 Query: 1508 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHD 1687 ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTILIIGHCMKGRM + Sbjct: 466 NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTILIIGHCMKGRMVE 525 Query: 1688 AIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKPF 1867 AI +FNKM +GCAPD ITVN L+SCLLK GM EA+RI + + L S P Sbjct: 526 AISIFNKMLTIGCAPDDITVNSLISCLLKGGMPNEAFRIMQRASEDLNLQLPSWKKAVPL 585 Query: 1868 RASMDITVA 1894 R + DI VA Sbjct: 586 RTNTDIPVA 594 >ref|XP_004301429.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Fragaria vesca subsp. vesca] Length = 583 Score = 503 bits (1294), Expect = e-139 Identities = 262/515 (50%), Positives = 359/515 (69%), Gaps = 1/515 (0%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 WF+K V TL +R+ + G Y KNL P +AF VI+ +NN P+L FF+ ++ Sbjct: 84 WFVKVVYTLFLRSHSLDSYVG--YLSKNLTPSLAFEVIKRLNN----PKLGLRFFELSKF 137 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 +LN+ H + T++ LLRSLCQMGL D A LV +YM DGLS + +LEFLVSS A G+ Sbjct: 138 SLNVNHGVWTYHYLLRSLCQMGLQDSAKLVFDYMRTDGLSPNESVLEFLVSSCAQMGRSD 197 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCL-DT 892 + +IL +G ++SFV+N ++LVK NRVDEA FR ++ S+C D+ Sbjct: 198 LAEKILDEVHCSVVG----LSSFVYNNLFNVLVKLNRVDEAVCLFRKYV---GSYCCPDS 250 Query: 893 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072 +FNI+I GLC+ G +D+ E F+DM SFGC ++ TYN++I+GLC+ + DR +LLRE Sbjct: 251 WTFNILIRGLCRTGAVDKGLEFFSDMRSFGCSPNVVTYNTLISGLCRAHEVDRGCDLLRE 310 Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252 +Q + LSPDV T+T++ISG+ KLGR EA ++D+M G++P TFN LI G+G++G Sbjct: 311 VQFRSELSPDVITFTSVISGYCKLGRMEEASAIFDEMIGCGLKPTAVTFNALIDGYGKAG 370 Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432 D+ SA ++ESM G DVITFT+L+ GYC+ G ++ GL+LW EMNA+ + P+AYTFS Sbjct: 371 DMSSAFSLYESMLFHGHCADVITFTSLIDGYCRAGHLNHGLQLWHEMNAKNVSPSAYTFS 430 Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612 ++IN+LCK NRL EARDLL +L+ N ++P++F+YNPVIDG CKAGN+D AN IVAEMEE Sbjct: 431 VLINALCKGNRLCEARDLLRELKGSN-VVPKSFLYNPVIDGLCKAGNIDEANLIVAEMEE 489 Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792 KKC PD+ TFTILI+G+ MKGRM +AI F+KM +GCAPD+IT++ L+SCL KAGM +E Sbjct: 490 KKCTPDRVTFTILILGNSMKGRMSEAIGNFSKMLSIGCAPDKITIDSLISCLSKAGMPSE 549 Query: 1793 AYRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897 A RIKK + L G S G P RA+ +I VAV Sbjct: 550 AGRIKKIAYEDLNMGAPSMGRPPHLRAN-EIPVAV 583 >ref|XP_006452806.1| hypothetical protein CICLE_v10007804mg [Citrus clementina] gi|557556032|gb|ESR66046.1| hypothetical protein CICLE_v10007804mg [Citrus clementina] Length = 595 Score = 499 bits (1285), Expect = e-138 Identities = 264/549 (48%), Positives = 367/549 (66%), Gaps = 5/549 (0%) Frame = +2 Query: 263 RNITFKSIARCFHGISNIESSPPTHNKIE-----SLWFIKFVCTLCIRNAENLAIFGSDY 427 R T +IA FHG++N S P ++ WF+K VCTL +R++ L+ + Y Sbjct: 59 RASTIAAIAH-FHGLANGGSRPFDEKEVNYRCSNEFWFVKVVCTLLLRSSY-LSDTCARY 116 Query: 428 FRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLL 607 + L P + VI+ ++N P+L +F +F+R+NL+L HS T+NL++RSLC+MGL Sbjct: 117 LCEKLSPLNSLEVIKRLDN----PKLGLKFLEFSRVNLSLNHSFKTYNLVMRSLCEMGLH 172 Query: 608 DLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQLCLGKEEIINSFV 787 D +V +YM +DG + P++EF VSS AGK + +L +Q G E +++F+ Sbjct: 173 DSVQVVFDYMRSDGHLPNSPMIEFFVSSCIRAGKCDAAKGLL---SQFRPG-EVTMSTFM 228 Query: 788 HNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVFND 967 +N L+ LVKQN DEA F+++ DT +FNI+I GLC+ G++ +AFE F D Sbjct: 229 YNSLLNALVKQNNADEAVYMFKEYFRLYSQ--PDTWTFNILIRGLCRIGEVKKAFEFFYD 286 Query: 968 MGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLG 1147 MGSFGC DI TYN++I+GLC++ + R ELL+E++ + PDV TYT++ISG+ KLG Sbjct: 287 MGSFGCSPDIVTYNTLISGLCRVNEVARGHELLKEVKFKSEFLPDVVTYTSVISGYCKLG 346 Query: 1148 RNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFT 1327 + +A ++++M GI+P+ TFNVLI GFG+ G++ SA + E M G PDV+TF+ Sbjct: 347 KMDKATSIYNEMNSCGIKPSAVTFNVLIDGFGKVGNMVSAEYMRERMLSLGYLPDVVTFS 406 Query: 1328 NLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWR 1507 +L+ GYC+ G+++QGLKL DEM + L PN YTF+I+IN+LCK NRLN+AR L QL+W Sbjct: 407 SLIDGYCRNGQLNQGLKLCDEMKGKNLSPNVYTFAILINALCKENRLNDARRFLKQLKW- 465 Query: 1508 NDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHD 1687 ND++P+ F+YNPVIDGFCKAGN+D AN IVAEMEEK+C PDK TFTILIIGHCMKGRM + Sbjct: 466 NDLVPKPFMYNPVIDGFCKAGNVDEANVIVAEMEEKRCKPDKVTFTILIIGHCMKGRMVE 525 Query: 1688 AIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQGLQSGFSSAGGPKPF 1867 AI +FNKM +GCAPD ITVN L+SCLLK GM EA+RI + + S P Sbjct: 526 AISIFNKMLRIGCAPDDITVNSLISCLLKGGMPNEAFRIMQRASEDQNLQLPSWKKAVPL 585 Query: 1868 RASMDITVA 1894 R + DI VA Sbjct: 586 RTNTDIPVA 594 >ref|XP_003550612.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like [Glycine max] Length = 544 Score = 470 bits (1210), Expect = e-129 Identities = 249/494 (50%), Positives = 346/494 (70%), Gaps = 1/494 (0%) Frame = +2 Query: 356 WFIKFVCTLCI-RNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTR 532 WF+K V TL + N+ + G YFR++L P V++ NN P L F+FF+FTR Sbjct: 46 WFVKIVSTLFLCSNSLDDRFLG--YFREHLTPSHVLEVVKRFNN----PNLGFKFFRFTR 99 Query: 533 LNLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKF 712 L++ HS T+N+LLRSLCQ GL + A L+ + M +DG D LL FLVSS A A +F Sbjct: 100 ERLSMSHSFWTYNMLLRSLCQAGLHNSAKLLYDSMRSDGQLPDSRLLGFLVSSFALADRF 159 Query: 713 SIGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDT 892 + +E+L ++AQ C G + ++ V+N FL++L+K NR+D+A FR+ L++ S CLD Sbjct: 160 DVSKELL-AEAQ-CSGVQ--VDVIVYNNFLNILIKHNRLDDAICLFRE-LMRSHS-CLDA 213 Query: 893 CSFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLRE 1072 +FNI+I GLC AG +D AFE+ DMGSFGC DI TYN +++GLC++ DRA +LL E Sbjct: 214 FTFNILIRGLCTAGDVDEAFELLGDMGSFGCSPDIVTYNILLHGLCRIDQVDRARDLLEE 273 Query: 1073 IQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSG 1252 + + +P+V +YTT+ISG+ +L + EA L+ +M G +PNV+TF+ L+ GF ++G Sbjct: 274 VCLKCEFAPNVVSYTTVISGYCRLSKMDEASSLFYEMVRSGTKPNVFTFSALVDGFVKAG 333 Query: 1253 DLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFS 1432 D+ SAL + + + GCAP+VIT T+L++GYC+ G ++ GL LW EMNAR + N YT+S Sbjct: 334 DMASALGMHKKILFHGCAPNVITLTSLINGYCRAGWVNHGLDLWREMNARNIPANLYTYS 393 Query: 1433 IVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEE 1612 ++I++LCKSNRL EAR+LL L+ ++DI+P AF+YNPVIDG+CK+GN+D ANAIVAEMEE Sbjct: 394 VLISALCKSNRLQEARNLLRILK-QSDIVPLAFVYNPVIDGYCKSGNIDEANAIVAEMEE 452 Query: 1613 KKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTE 1792 KC PDK TFTILIIGHCMKGR +AI +F KM GC PD IT+ L SCLLK+GM E Sbjct: 453 -KCKPDKLTFTILIIGHCMKGRTPEAIGIFYKMLASGCTPDDITIRTLSSCLLKSGMPGE 511 Query: 1793 AYRIKKDVLQGLQS 1834 A RIK+ + + +S Sbjct: 512 AARIKETLFENQES 525 >ref|XP_002885810.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297331650|gb|EFH62069.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 536 Score = 464 bits (1195), Expect = e-128 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%) Frame = +2 Query: 272 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPIQNNTREKIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 431 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ G+ D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGMHD 120 Query: 611 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGISPNSRLLGFLVSSFAEKGKLHCATALLLQSYEVEGCCM-------- 172 Query: 782 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFEEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKAVELL 229 Query: 962 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141 M FGC DI TYN++I G CK + +A E+ +++S G SPDV TYT++ISG+ K Sbjct: 230 GGMSGFGCLPDIVTYNTLIKGFCKSNELKKANEMFDDVKSSSGCSPDVVTYTSMISGYCK 289 Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMQEASVLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMHTAEEIRGKMISFGCFPDVVT 349 Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501 FT+L+ GYC++G+++QG +LW+EMNAR ++PNA+T+SI+IN+LCK NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVNQGFRLWEEMNARGMFPNAFTYSILINALCKENRLLKARELLGQLA 409 Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681 DI+PQ F+YNPVIDGFCKAG ++ A IV EME+KKC PDK TFTILIIGHCMKGRM Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEAIVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468 Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825 +A+ +F+KM +GC+PD+ITV+ L+SCLLKAGM EAY + + +G Sbjct: 469 FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIAHKG 516 >dbj|BAH19478.1| AT2G06000 [Arabidopsis thaliana] Length = 536 Score = 462 bits (1189), Expect = e-127 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%) Frame = +2 Query: 272 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 431 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ GL D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120 Query: 611 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172 Query: 782 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229 Query: 962 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141 M FGC DI TYN++I G CK + ++A E+ ++++S SPDV TYT++ISG+ K Sbjct: 230 GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289 Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349 Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501 FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409 Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681 DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468 Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825 +A+ +F+KM +GC+PD+ITV+ L+SCLLKAGM EAY + + +G Sbjct: 469 FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIARKG 516 >ref|NP_178657.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|42570711|ref|NP_973429.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75216767|sp|Q9ZUE9.1|PP149_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g06000 gi|4006835|gb|AAC95177.1| hypothetical protein [Arabidopsis thaliana] gi|110736272|dbj|BAF00106.1| hypothetical protein [Arabidopsis thaliana] gi|330250896|gb|AEC05990.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|330250897|gb|AEC05991.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 536 Score = 462 bits (1189), Expect = e-127 Identities = 246/528 (46%), Positives = 346/528 (65%), Gaps = 10/528 (1%) Frame = +2 Query: 272 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430 TF + FH S+ ++ P +N E + W +K V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLQNNTREVIHCPEAWLVKIVSTLFVYRVPDSDLCFC-YL 63 Query: 431 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610 KNL P I+F V++ ++NN P + F F++F+R LN+ HS T+NLL RSLC+ GL D Sbjct: 64 SKNLNPFISFEVVKKLDNN---PHIGFRFWEFSRFKLNIRHSFWTYNLLTRSLCKAGLHD 120 Query: 611 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 121 LAGQMFECMKSDGVSPNNRLLGFLVSSFAEKGKLHFATALLLQSFEVEGCCM-------- 172 Query: 782 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961 V N L+ LVK +RV++A F +HL + +S C DT +FNI+I GLC G+ ++A E+ Sbjct: 173 -VVNSLLNTLVKLDRVEDAMKLFDEHL-RFQS-CNDTKTFNILIRGLCGVGKAEKALELL 229 Query: 962 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141 M FGC DI TYN++I G CK + ++A E+ ++++S SPDV TYT++ISG+ K Sbjct: 230 GVMSGFGCEPDIVTYNTLIQGFCKSNELNKASEMFKDVKSGSVCSPDVVTYTSMISGYCK 289 Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321 G+ EA L DDM GI P TFNVL+ G+ ++G++ +A +I M FGC PDV+T Sbjct: 290 AGKMREASSLLDDMLRLGIYPTNVTFNVLVDGYAKAGEMLTAEEIRGKMISFGCFPDVVT 349 Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501 FT+L+ GYC++G++ QG +LW+EMNAR ++PNA+T+SI+IN+LC NRL +AR+LL QL Sbjct: 350 FTSLIDGYCRVGQVSQGFRLWEEMNARGMFPNAFTYSILINALCNENRLLKARELLGQLA 409 Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681 DI+PQ F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM Sbjct: 410 -SKDIIPQPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 468 Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVLQG 1825 +A+ +F+KM +GC+PD+ITV+ L+SCLLKAGM EAY + + +G Sbjct: 469 FEAVSIFHKMVAIGCSPDKITVSSLLSCLLKAGMAKEAYHLNQIARKG 516 >ref|XP_006297396.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] gi|565479514|ref|XP_006297397.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] gi|482566105|gb|EOA30294.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] gi|482566106|gb|EOA30295.1| hypothetical protein CARUB_v10013421mg [Capsella rubella] Length = 535 Score = 460 bits (1184), Expect = e-126 Identities = 247/523 (47%), Positives = 337/523 (64%), Gaps = 10/523 (1%) Frame = +2 Query: 272 TFKSIARCFHGISN--IESSPPTHNKIESL-----WFIKFVCTLCIRNAENLAIFGSDYF 430 TF + FH S+ ++ P NK E + W IK V TL + + + Y Sbjct: 5 TFATAIAHFHTHSHGGAQARPLHSNKREVMHCPEAWLIKIVSTLFVYRVPDSDLCFC-YL 63 Query: 431 RKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRLNLNLIHSIATFNLLLRSLCQMGLLD 610 KNL P IAF V++ ++NN P L F F++F+R LN+ HS T+N+L RSLC+ G+ D Sbjct: 64 SKNLNPFIAFEVVKKLDNNH--PHLGFRFWEFSRFKLNIRHSFWTYNVLTRSLCKAGMHD 121 Query: 611 LANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFSIGREILISQAQL---CLGKEEIINS 781 LA + E M +DG+S + LL FLVSS A GK +L+ ++ C+ Sbjct: 122 LAGQMFECMRSDGVSPNSRLLGFLVSSFAEKGKLQFATALLLQSYEVERCCM-------- 173 Query: 782 FVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTCSFNIVIDGLCKAGQIDRAFEVF 961 V N L+ LVK +RVD+A F HL C DT +FNI+I GLC G+ ++A E+ Sbjct: 174 -VVNSLLNTLVKLDRVDDAMKLFDKHLRF--QCCNDTKTFNILIRGLCSVGKGEKALELL 230 Query: 962 NDMGSFGCFADITTYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFK 1141 +M FGC DI TYN++I G CK + +A E+L +++S G SPDV TYT++ISG+ K Sbjct: 231 GEMSGFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCK 290 Query: 1142 LGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVIT 1321 G+ EA L DDM GI P TFNVL+ G+ ++G++ SA I M FGC PDV+T Sbjct: 291 AGKMQEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISFGCFPDVVT 350 Query: 1322 FTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLR 1501 FT+L+ GYC+ G+++QG +LW+EMNA+ + PN +T+SI+IN+LCK N L +AR+LL QL Sbjct: 351 FTSLIDGYCRAGQVNQGFRLWEEMNAKGMLPNEFTYSILINALCKENSLLKARELLGQLA 410 Query: 1502 WRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRM 1681 DI+ + F+YNPVIDGFCKAG ++ AN IV EME+KKC PDK TFTILIIGHCMKGRM Sbjct: 411 -SKDIITKPFMYNPVIDGFCKAGKVNEANVIVEEMEKKKCKPDKITFTILIIGHCMKGRM 469 Query: 1682 HDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKK 1810 +A+ +F+KM +GC+PD+ITVN L+SCLLKAGM EAY + + Sbjct: 470 FEAVSIFHKMVAIGCSPDKITVNSLLSCLLKAGMAEEAYHLNQ 512 Score = 131 bits (330), Expect = 2e-27 Identities = 77/272 (28%), Positives = 136/272 (50%) Frame = +2 Query: 1001 TYNSVINGLCKLGDADRALELLREIQSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDD 1180 TYN + LCK G D A ++ ++S G +SP+ + ++S F + G+ A L Sbjct: 106 TYNVLTRSLCKAGMHDLAGQMFECMRSDG-VSPNSRLLGFLVSSFAEKGKLQFATALL-- 162 Query: 1181 MTHRGIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGE 1360 + + N L++ + + A+K+F+ +F C D TF L+ G C +G+ Sbjct: 163 LQSYEVERCCMVVNSLLNTLVKLDRVDDAMKLFDKHLRFQCCNDTKTFNILIRGLCSVGK 222 Query: 1361 IDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYN 1540 ++ L+L EM+ P+ T++ +I CKSN L +A ++L+ ++ + P Y Sbjct: 223 GEKALELLGEMSGFGCSPDIVTYNTLIKGFCKSNELAKANEMLNDVKSSSGCSPDVVTYT 282 Query: 1541 PVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLV 1720 +I G+CKAG + A ++ +M P TF +L+ G+ G M A D+ KM Sbjct: 283 SMISGYCKAGKMQEAYLLLDDMLGLGIYPTTITFNVLVDGYAKAGEMTSAEDIRGKMISF 342 Query: 1721 GCAPDRITVNCLVSCLLKAGMVTEAYRIKKDV 1816 GC PD +T L+ +AG V + +R+ +++ Sbjct: 343 GCFPDVVTFTSLIDGYCRAGQVNQGFRLWEEM 374 Score = 84.0 bits (206), Expect = 4e-13 Identities = 63/231 (27%), Positives = 110/231 (47%), Gaps = 4/231 (1%) Frame = +2 Query: 1139 KLGRNHEAL--HLWDDMTHR-GIRPNVYTFNVLIHGFGQSGDLGSALKIFESMSKFGCAP 1309 KL NH L W+ + IR + +T+NVL ++G A ++FE M G +P Sbjct: 78 KLDNNHPHLGFRFWEFSRFKLNIRHSFWTYNVLTRSLCKAGMHDLAGQMFECMRSDGVSP 137 Query: 1310 DVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSIVINSLCKSNRLNEARDLL 1489 + L+S + + G++ L + + ++ + ++N+L K +R+++A L Sbjct: 138 NSRLLGFLVSSFAEKGKLQFATALL--LQSYEVERCCMVVNSLLNTLVKLDRVDDAMKLF 195 Query: 1490 SQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEKKCNPDKYTFTILIIGHCM 1669 + R +N +I G C G + A ++ EM C+PD T+ LI G C Sbjct: 196 DK-HLRFQCCNDTKTFNILIRGLCSVGKGEKALELLGEMSGFGCSPDIVTYNTLIKGFCK 254 Query: 1670 KGRMHDAIDVFNKM-SLVGCAPDRITVNCLVSCLLKAGMVTEAYRIKKDVL 1819 + A ++ N + S GC+PD +T ++S KAG + EAY + D+L Sbjct: 255 SNELAKANEMLNDVKSSSGCSPDVVTYTSMISGYCKAGKMQEAYLLLDDML 305 >ref|XP_006577946.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X1 [Glycine max] gi|571448762|ref|XP_006577947.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X2 [Glycine max] gi|571448764|ref|XP_006577948.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X3 [Glycine max] gi|571448766|ref|XP_006577949.1| PREDICTED: pentatricopeptide repeat-containing protein At2g06000-like isoform X4 [Glycine max] Length = 510 Score = 458 bits (1179), Expect = e-126 Identities = 239/514 (46%), Positives = 338/514 (65%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 WF+K CT+ +R+ G YF K+L P + + V+ ++ P L F+F +F R Sbjct: 10 WFVKIACTVFVRSNSLDPFVG--YFSKHLTPSLVYEVVNRLHI----PNLGFKFVEFCRH 63 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 L++ HS T++LLLRSLC+ L A +V ++M DG D LL FLV S A G+ Sbjct: 64 KLHMSHSYLTYSLLLRSLCRSNLHHTAKVVYDWMRCDGQIPDNRLLGFLVWSYAIVGRLD 123 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 + RE+L +G +N+ V+N ++L++QN+V +A + FR+ L++LR + T Sbjct: 124 VSRELLADVQCNNVG----VNAVVYNDLFNVLIRQNKVVDAVVLFRE-LIRLR-YKPVTY 177 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 + NI++ GLC+AG+ID AF + ND+ SFGC D+ TYN++I+GLC++ + DRA LL+E+ Sbjct: 178 TVNILMRGLCRAGEIDEAFRLLNDLRSFGCLPDVITYNTLIHGLCRINEVDRARSLLKEV 237 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 G +PDV +YTTIISG+ K + E L+ +M G PN +TFN LI GFG+ GD Sbjct: 238 CLNGEFAPDVVSYTTIISGYCKFSKMEEGNLLFGEMIRSGTAPNTFTFNALIGGFGKLGD 297 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SAL ++E M GC PDV TFT+L++GY ++G++ Q + +W +MN + + YTFS+ Sbjct: 298 MASALALYEKMLVQGCVPDVATFTSLINGYFRLGQVHQAMDMWHKMNDKNIGATLYTFSV 357 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 +++ LC +NRL++ARD+L L +DI+PQ FIYNPVIDG+CK+GN+D AN IVAEME Sbjct: 358 LVSGLCNNNRLHKARDILRLLN-ESDIVPQPFIYNPVIDGYCKSGNVDEANKIVAEMEVN 416 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 +C PDK TFTILIIGHCMKGRM +AI +F+KM VGCAPD ITVN L SCLLKAGM EA Sbjct: 417 RCKPDKLTFTILIIGHCMKGRMPEAIGIFHKMLAVGCAPDEITVNNLRSCLLKAGMPGEA 476 Query: 1796 YRIKKDVLQGLQSGFSSAGGPKPFRASMDITVAV 1897 R+KK + Q L G +S+ + I VAV Sbjct: 477 ARVKKVLAQNLTLGITSSKKSYHETTNESIPVAV 510 >ref|XP_006396122.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum] gi|557096393|gb|ESQ36901.1| hypothetical protein EUTSA_v10002477mg [Eutrema salsugineum] Length = 535 Score = 447 bits (1150), Expect = e-122 Identities = 229/490 (46%), Positives = 327/490 (66%) Frame = +2 Query: 356 WFIKFVCTLCIRNAENLAIFGSDYFRKNLCPPIAFYVIQHINNNFCKPRLAFEFFQFTRL 535 W +K V TL + + + Y KNL P IAF V++ ++N P + F F++F+R Sbjct: 40 WLVKIVSTLFVYQVPDSDLCFC-YLSKNLNPFIAFEVVKKLDN----PHIGFRFWEFSRF 94 Query: 536 NLNLIHSIATFNLLLRSLCQMGLLDLANLVVEYMNADGLSLDGPLLEFLVSSVAHAGKFS 715 LN+ HS T+NLL RSLC+ GL DLA + E M +DG+S + LL FLVSS A GK Sbjct: 95 KLNIRHSFWTYNLLTRSLCKAGLHDLAGKMFECMKSDGVSPNSRLLGFLVSSFAEKGKLH 154 Query: 716 IGREILISQAQLCLGKEEIINSFVHNKFLSLLVKQNRVDEAFIFFRDHLLKLRSFCLDTC 895 +L+ ++ G ++NS +H LV+ +RV++A F HL C DT Sbjct: 155 FATALLLQSYEV-EGSSMVVNSLLHT-----LVRLDRVEDAMKLFDTHLRS--QSCNDTR 206 Query: 896 SFNIVIDGLCKAGQIDRAFEVFNDMGSFGCFADITTYNSVINGLCKLGDADRALELLREI 1075 +FNI+I GLC G+ A ++ +M SFG DI TYN++I G CK + ++A E+ E+ Sbjct: 207 TFNILIQGLCGIGKAHEALKLLGEMSSFGSSPDIVTYNTLIKGFCKSNELNKANEIFNEV 266 Query: 1076 QSQGGLSPDVKTYTTIISGFFKLGRNHEALHLWDDMTHRGIRPNVYTFNVLIHGFGQSGD 1255 +S+ G DV TYT+++SG+ K G+ EA L D+M G+ P TFNVL++G+ ++G+ Sbjct: 267 KSRNGCFRDVVTYTSMMSGYCKAGKMREASLLLDEMVGLGMYPTNITFNVLVYGYVKAGE 326 Query: 1256 LGSALKIFESMSKFGCAPDVITFTNLLSGYCQIGEIDQGLKLWDEMNARKLYPNAYTFSI 1435 + SA I M FGC PDV+TFT L+ GYC++G++++G LW+EM+A+ ++PNA+T+SI Sbjct: 327 MSSAEAIRRKMDSFGCFPDVVTFTTLIDGYCRVGQVNKGFSLWEEMSAKGMFPNAFTYSI 386 Query: 1436 VINSLCKSNRLNEARDLLSQLRWRNDILPQAFIYNPVIDGFCKAGNLDGANAIVAEMEEK 1615 +IN+LCK NRL +AR+LL QL DI+P+ F+YNP+IDGFCKAG ++ AN IVAEME+ Sbjct: 387 LINALCKENRLLKARELLGQLACM-DIVPKPFLYNPIIDGFCKAGKVNEANVIVAEMEKF 445 Query: 1616 KCNPDKYTFTILIIGHCMKGRMHDAIDVFNKMSLVGCAPDRITVNCLVSCLLKAGMVTEA 1795 +C PDK TFTILIIGHCMKGRM +AI +F+KM +GC+PD+ITV+ L SCLLKAGM EA Sbjct: 446 RCKPDKITFTILIIGHCMKGRMCEAISIFHKMVAIGCSPDKITVSSLSSCLLKAGMAKEA 505 Query: 1796 YRIKKDVLQG 1825 Y++ + ++G Sbjct: 506 YQLNQFAVKG 515