BLASTX nr result
ID: Akebia25_contig00011578
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00011578 (1783 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits), E Value
ref|XP_004288257.1| PREDICTED: pentatricopeptide repeat-containi... 471 e-130
ref|XP_006466446.1| PREDICTED: pentatricopeptide repeat-containi... 451 e-124
ref|XP_006426111.1| hypothetical protein CICLE_v10027042mg [Citr... 451 e-124
gb|EXB94039.1| hypothetical protein L484_009383 [Morus notabilis] 450 e-123
ref|XP_007204940.1| hypothetical protein PRUPE_ppa001240mg [Prun... 447 e-123
ref|XP_002525278.1| GTP binding protein, putative [Ricinus commu... 442 e-121
ref|XP_003550974.1| PREDICTED: pentatricopeptide repeat-containi... 441 e-121
ref|XP_007155826.1| hypothetical protein PHAVU_003G234700g [Phas... 439 e-120
ref|XP_002310894.2| hypothetical protein POPTR_0007s14930g [Popu... 437 e-119
ref|XP_007047547.1| Tetratricopeptide repeat-like superfamily pr... 433 e-118
ref|XP_007047546.1| Tetratricopeptide repeat-like superfamily pr... 433 e-118
ref|XP_007047545.1| Tetratricopeptide repeat-like superfamily pr... 433 e-118
ref|XP_006841116.1| hypothetical protein AMTR_s00086p00094500 [A... 433 e-118
ref|XP_004509062.1| PREDICTED: pentatricopeptide repeat-containi... 431 e-118
emb|CAN83934.1| hypothetical protein VITISV_035768 [Vitis vinifera] 429 e-117
ref|XP_003608531.1| Pentatricopeptide repeat-containing protein ... 424 e-116
ref|XP_006393964.1| hypothetical protein EUTSA_v10003664mg [Eutr... 418 e-114
ref|XP_006363825.1| PREDICTED: pentatricopeptide repeat-containi... 407 e-110
ref|XP_004141982.1| PREDICTED: pentatricopeptide repeat-containi... 
407 e-110 ref|XP_006279544.1| hypothetical protein CARUB_v10028435mg [Caps... 405 e-110 >ref|XP_004288257.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 985 Score = 471 bits (1212), Expect = e-130 Identities = 257/500 (51%), Positives = 327/500 (65%), Gaps = 8/500 (1%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+ F KM++SG PKALTYKV+VRA W EGKVNEA+EAVRDMERRGVVG Sbjct: 512 VMLQSGKYDLVHELFRKMKKSGEAPKALTYKVIVRALWCEGKVNEAIEAVRDMERRGVVG 571 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 T+ VYYELACCLCK+GRW++A+ +FTGMI S M+GGH+DDC+SI Sbjct: 572 TSGVYYELACCLCKSGRWQDALLQVEKMKNVTNTKPLEVTFTGMIKSSMEGGHIDDCVSI 631 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 FEHMK+HCSPNIGTIN MLKV+G DMF KAKELFEE K S S+ ++ GGGS L D Sbjct: 632 FEHMKNHCSPNIGTINTMLKVFGHTDMFSKAKELFEETKAAKSDSDPSLEGGGSSLVPDE 691 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+S+L+ASA A QWEYFEYVYKEMAL YQ DQ+K+A +L+EASRA KG+LLEHAF+ Sbjct: 692 YTYTSMLKASASALQWEYFEYVYKEMALSGYQIDQSKNASILMEASRAGKGYLLEHAFDR 751 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 LEAGEIPHL FF EMV ATA+HDY+RA TL+N+MA+A F+VSE QWTD+FK +ED IS Sbjct: 752 TLEAGEIPHLLFFIEMVYQATARHDYKRAATLVNTMAYAPFQVSERQWTDVFKKNEDGIS 811 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDG 705 +DGL+KLLD L + DV +E T+ NL ++L+S+C S RD S+++ + S D Sbjct: 812 QDGLKKLLDALEHCDVTSEATLLNLKRSLQSLCWSYTSRDFSDSVSVSSLNDNDEGSDDN 871 Query: 704 KWKLN-------LIGRLGDGNTNPPNGAETNVYDSANDDVSLLSYSPSCXXXXXXXXXXX 546 + + + G++ G T+PP+ DS++ V+ + S Sbjct: 872 EGLITPNHYLGYINGKMSPG-TDPPD-------DSSDAPVNEFPHRSS------------ 911 Query: 545 XXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNSSDSVPSASE 366 DV + + R I G + ++ +PSA E Sbjct: 912 -----TRRDVAADIE---IVSRPLDYISDGGLESTEIDEEIEALIYKDDSHKSHLPSAKE 963 Query: 365 
ILEGWRESRNKDGIFLPFQL 306 I++ W+E R K GI +PFQL Sbjct: 964 IMKDWKERRKKGGILVPFQL 983 >ref|XP_006466446.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Citrus sinensis] Length = 901 Score = 451 bits (1160), Expect = e-124 Identities = 229/345 (66%), Positives = 266/345 (77%), Gaps = 2/345 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+FF KM +SG ALTYKVLVRAFWEEGK+NEAV AVR+ME+RGVVG Sbjct: 379 VMLQSGKYDLVHEFFRKMAKSGEAIGALTYKVLVRAFWEEGKINEAVAAVRNMEQRGVVG 438 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC NGRW++AM +FTG+I+S MDGGH+DDCISI Sbjct: 439 TASVYYELACCLCNNGRWQDAMLVVEKIKSLRHSKPLEITFTGLIISSMDGGHIDDCISI 498 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 F+HMKDHC PNIGT+NAMLKVY RNDMF KAKELFEE R NS T + G G+ L D Sbjct: 499 FQHMKDHCEPNIGTVNAMLKVYSRNDMFSKAKELFEETTRANSSGYTFLSGDGAPLKPDE 558 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYSS+LEASA AHQWEYFEYVYK MAL Q DQ KHAWLLVEASRA K HLLEHAF+ Sbjct: 559 YTYSSMLEASATAHQWEYFEYVYKGMALSGCQLDQTKHAWLLVEASRAGKCHLLEHAFDS 618 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 +LEAGEIPH FFTEM+ A Q +YE+AV LIN+MA+A F ++E QWT++F+ +EDRIS Sbjct: 619 LLEAGEIPHPLFFTEMLIQAIVQSNYEKAVALINAMAYAPFHITERQWTELFESNEDRIS 678 Query: 884 KDGLQKLLDTLGNTDVV-TETTVSNLSKALKSICGSNAPRDSLSS 753 +D L+KLL+ L N + +E TVSNLS+AL ++C S RD SS Sbjct: 679 RDKLEKLLNALCNCNAASSEITVSNLSRALHALCRSEKERDLSSS 723 >ref|XP_006426111.1| hypothetical protein CICLE_v10027042mg [Citrus clementina] gi|557528101|gb|ESR39351.1| hypothetical protein CICLE_v10027042mg [Citrus clementina] Length = 900 Score = 451 bits (1160), Expect = e-124 Identities = 229/345 (66%), Positives = 266/345 (77%), Gaps = 2/345 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+FF KM +SG ALTYKVLVRAFWEEGK+NEAV AVR+ME+RGVVG Sbjct: 
379 VMLQSGKYDLVHEFFRKMAKSGEAIGALTYKVLVRAFWEEGKINEAVAAVRNMEQRGVVG 438 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC NGRW++AM +FTG+I+S MDGGH+DDCISI Sbjct: 439 TASVYYELACCLCNNGRWQDAMLVVEKIKSLRHSKPLEITFTGLIISSMDGGHIDDCISI 498 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 F+HMKDHC PNIGT+NAMLKVY RNDMF KAKELFEE R NS T + G G+ L D Sbjct: 499 FQHMKDHCEPNIGTVNAMLKVYSRNDMFSKAKELFEETTRANSSGYTFLSGDGTPLKPDE 558 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYSS+LEASA AHQWEYFEYVYK MAL Q DQ KHAWLLVEASRA K HLLEHAF+ Sbjct: 559 YTYSSMLEASATAHQWEYFEYVYKGMALSGCQLDQTKHAWLLVEASRAGKCHLLEHAFDS 618 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 +LEAGEIPH FFTEM+ A Q +YE+AV LIN+MA+A F ++E QWT++F+ +EDRIS Sbjct: 619 LLEAGEIPHPLFFTEMLIQAIVQSNYEKAVALINAMAYAPFHITERQWTELFESNEDRIS 678 Query: 884 KDGLQKLLDTLGNTDVV-TETTVSNLSKALKSICGSNAPRDSLSS 753 +D L+KLL+ L N + +E TVSNLS+AL ++C S RD SS Sbjct: 679 RDKLEKLLNALCNCNAASSEITVSNLSRALHALCRSEKERDLSSS 723 >gb|EXB94039.1| hypothetical protein L484_009383 [Morus notabilis] Length = 910 Score = 450 bits (1157), Expect = e-123 Identities = 230/349 (65%), Positives = 270/349 (77%), Gaps = 4/349 (1%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH++F KMR+SG TPKALTYKVLVRAFW EGKVNEAVE VRDME+RGVVG Sbjct: 408 VMLQSGKYDLVHEYFRKMRKSGETPKALTYKVLVRAFWGEGKVNEAVEVVRDMEQRGVVG 467 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 +SVYYELACCLC N RW++AM +FTGMIMS M GGH+ DCISI Sbjct: 468 ASSVYYELACCLCSNRRWEDAMLEVEKMKKLSNSRPLEVAFTGMIMSSMQGGHISDCISI 527 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTL-DS 1245 FEHMK HCSPNIGT+N MLKVYGRNDMF KAKELFEEIK++NS S ++ GG + + D Sbjct: 528 FEHMKTHCSPNIGTLNIMLKVYGRNDMFSKAKELFEEIKKRNSDSCSSFDGGDTFLIPDE 587 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+++LEASA A QWEYFEYVYKEM L 
YQ DQNKHA LL EASRA K HLLEHAF+ Sbjct: 588 YTYNAMLEASASALQWEYFEYVYKEMVLSGYQLDQNKHASLLPEASRAGKWHLLEHAFDA 647 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 ILEAGEIP+ +FTEMV ATA+HDY+RAVTL+N+ A A F+V+E QW D F+ + +RIS Sbjct: 648 ILEAGEIPNSQYFTEMVLQATARHDYDRAVTLVNAAALAPFQVTEEQWKDFFEKNRERIS 707 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICG---SNAPRDSLSSLA 747 +D L+KLL +L N +V +E TV NLS+AL+ + S A RD SS+A Sbjct: 708 QDNLEKLLRSLDNCNVKSEATVVNLSRALRGLSDLSESGASRDFSSSIA 756 >ref|XP_007204940.1| hypothetical protein PRUPE_ppa001240mg [Prunus persica] gi|462400582|gb|EMJ06139.1| hypothetical protein PRUPE_ppa001240mg [Prunus persica] Length = 874 Score = 447 bits (1151), Expect = e-123 Identities = 262/493 (53%), Positives = 319/493 (64%), Gaps = 4/493 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+ F KM+ SG PKAL YKVLVRAFW EGKVNEAVEAVRDME+RGVVG Sbjct: 389 VMLQSGKYDLVHELFRKMKNSGEAPKALNYKVLVRAFWCEGKVNEAVEAVRDMEQRGVVG 448 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 T SVYYELACCLC NGRW++A+ +FTGMI S M+GGH+D CISI Sbjct: 449 TGSVYYELACCLCNNGRWQDALVEVEKMKNVSNTKPLEVTFTGMITSSMEGGHIDSCISI 508 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTL-DS 1245 F+HMK+ C+PNIGTIN MLKV+GR+DMF KAKELFEEIK + S+ ++ GGG+L + D Sbjct: 509 FKHMKNRCAPNIGTINTMLKVFGRSDMFFKAKELFEEIKTVRAESDFSLEGGGTLVVPDQ 568 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+S+L+ASA A QWEYFEYVYKEMAL YQ DQ KHA LLV+ASR+ K +LLEHAF+ Sbjct: 569 YTYTSMLKASASALQWEYFEYVYKEMALSGYQVDQTKHASLLVKASRSGKFYLLEHAFDT 628 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 LEAGEIPH FTEMV ATAQHDY+RAVTL+N+MA+A F+VSE QWTD+F+ + D I+ Sbjct: 629 SLEAGEIPHPLIFTEMVFQATAQHDYKRAVTLVNAMAYAPFQVSERQWTDLFEKNGDTIT 688 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDG 705 +DGL+KLLD L N DVV+E TV NLS++L +C S R SS T S S DG Sbjct: 689 
QDGLEKLLDALHNCDVVSEATVLNLSRSLLRLCRSYRSRGLSSSAPFGSGATETS-SLDG 747 Query: 704 KWKLNLIGRLGDGNTNPPNGAETNVYDSANDDVSLLSYSPSCXXXXXXXXXXXXXXXXXI 525 GN PN + ++ S N L S + + Sbjct: 748 D------NEEIYGNGIMPNHSLESIDGSHNPRREPLDKSTN--VPLDAFSVNHASTRRDV 799 Query: 524 EDVTFSVSGGCVDYRNSRPIL--PGXXXXXXXXXXXXXXTSQVNNSSDS-VPSASEILEG 354 ++VT +VS R+S I G V++S DS +PSA EIL+ Sbjct: 800 DEVTRTVS------RSSEYISDEDGEYSTEIDKEIEALIYKDVDDSHDSDLPSAPEILKV 853 Query: 353 WRESRNKDGIFLP 315 W+E R + LP Sbjct: 854 WKERRKEARDSLP 866 >ref|XP_002525278.1| GTP binding protein, putative [Ricinus communis] gi|223535436|gb|EEF37106.1| GTP binding protein, putative [Ricinus communis] Length = 1010 Score = 442 bits (1136), Expect = e-121 Identities = 257/505 (50%), Positives = 308/505 (60%), Gaps = 13/505 (2%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML SGKYDLVH+ F KM RSG PKALTYKVLVRAFWEEGKVNEA+EAVRDME RGVVG Sbjct: 515 VMLNSGKYDLVHELFRKMNRSGEAPKALTYKVLVRAFWEEGKVNEAMEAVRDMENRGVVG 574 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TAS+YYELACCLC G W++AM +FTG+IMS +DGGHV DCISI Sbjct: 575 TASLYYELACCLCYYGMWQDAMLEVKKMKNLRHSKPLEVTFTGLIMSSLDGGHVSDCISI 634 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTLDSY 1242 FE+MK +C PNIGTIN MLKVYGRND+F KAKELF EIK N+ G L D + Sbjct: 635 FEYMKAYCVPNIGTINIMLKVYGRNDLFSKAKELFGEIKGTNND-------GTYLVPDEF 687 Query: 1241 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1062 TYSS+LEASA A QWEYFE VYKEM C YQ DQ KHA LLVEASR K HLLEHAF+ Sbjct: 688 TYSSMLEASASALQWEYFELVYKEMTFCGYQLDQKKHASLLVEASRVGKYHLLEHAFDAA 747 Query: 1061 LEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRISK 882 LEAGEIPH FTEMV ATAQ +YERAV L+N++A A FK+SE QW D+F+ + D+I++ Sbjct: 748 LEAGEIPHHLLFTEMVFQATAQQNYERAVVLVNTLALAPFKISEKQWIDLFQKNGDKITQ 807 Query: 881 DGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDGK 702 DGL+KLLD L ++DV +E TV+NLS+ L S+CG S +L T S G Sbjct: 808 
DGLEKLLDALRSSDVASEPTVANLSRTLHSLCGRGRSEYLSGSTSLGIDVTNSSYLDSGS 867 Query: 701 WKL-----------NLIGRLGDGNTNPPNGAETNVYDSANDDVSLLSYSP-SCXXXXXXX 558 K+ LI + D + +N +DD S SP + Sbjct: 868 RKIMGDKGPEMHEDTLIDKT-DIAYGDLSVTRSNTGGEGSDDTDEASSSPRNYSTDRDGI 926 Query: 557 XXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNS-SDSV 381 +D S C+D+ +P +QV++S + Sbjct: 927 ASICTNVKIFGDDEASGASTDCLDFDEMEYGIP---------------INQVDDSCGTKL 971 Query: 380 PSASEILEGWRESRNKDGIFLPFQL 306 PSA EIL+ W+ESR K +F PFQL Sbjct: 972 PSADEILDIWKESR-KGRLFFPFQL 995 >ref|XP_003550974.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Glycine max] Length = 865 Score = 441 bits (1133), Expect = e-121 Identities = 219/340 (64%), Positives = 264/340 (77%), Gaps = 1/340 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SG YDLVH+FFGKM+RSG PKALTYKVLV+ FW+EGKVNEAV+AVRDMERRGV+G Sbjct: 368 VMLESGNYDLVHEFFGKMKRSGEVPKALTYKVLVKTFWKEGKVNEAVKAVRDMERRGVIG 427 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC NGRW++A+ +FTGMI S MDGGH++DCI I Sbjct: 428 TASVYYELACCLCNNGRWQDAILEVDNIRSLPHAKPLEVTFTGMIKSSMDGGHINDCICI 487 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGG-GSLTLDS 1245 FE+MK+HC PNIG IN MLKVYG+NDMF KAK LFEE+K S GG S+ D Sbjct: 488 FEYMKEHCVPNIGAINTMLKVYGQNDMFSKAKVLFEEVKVAKSEFYATPEGGYSSVVPDV 547 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 Y+Y+S+LEASA A QWEYFE+VY+EM + YQ DQ+KH LLV+ASRA K HLLEHAF+M Sbjct: 548 YSYNSMLEASATAQQWEYFEHVYREMIVSGYQLDQDKHLSLLVKASRAGKLHLLEHAFDM 607 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 ILEAGEIPH FF E+V A AQH+YERAV LIN+MA+A F+V+E QWT++FK SEDRIS Sbjct: 608 ILEAGEIPHHLFFFELVIQAIAQHNYERAVILINTMAYAPFRVTEKQWTNLFKESEDRIS 667 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRD 765 + L++LLD LGN D+V+E TVSNL+++L +CG R+ Sbjct: 668 LENLERLLDALGNCDIVSELTVSNLTRSLHVLCGLGTSRN 707 >ref|XP_007155826.1| 
hypothetical protein PHAVU_003G234700g [Phaseolus vulgaris] gi|561029180|gb|ESW27820.1| hypothetical protein PHAVU_003G234700g [Phaseolus vulgaris] Length = 870 Score = 439 bits (1129), Expect = e-120 Identities = 218/339 (64%), Positives = 261/339 (76%), Gaps = 1/339 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SG YDLVH+FFGKM+RSG PKALTYKVLVR FW+EGKV EAV+A+RDMERRGV+G Sbjct: 377 VMLESGNYDLVHEFFGKMKRSGEVPKALTYKVLVRTFWKEGKVEEAVKAIRDMERRGVIG 436 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TA VYYELACCLC GRW++A+ +FTGMI S M GGH+DD I I Sbjct: 437 TAGVYYELACCLCNCGRWRDAILEVDNIRNLPRAKPLEVTFTGMIKSSMGGGHIDDSIRI 496 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTL-DS 1245 FE+M+DHC+PNIG IN MLKVYG+NDMF KAK LFEE+K S S GG S + DS Sbjct: 497 FEYMRDHCAPNIGAINTMLKVYGQNDMFSKAKVLFEEVKAAKSESYATPGGGNSSAVPDS 556 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+S+LEASA A QWEYFE+VY+EM + YQ DQNKH LLV+ASRA K HLLEHAF M Sbjct: 557 YTYNSMLEASASAQQWEYFEHVYREMIVSGYQLDQNKHLLLLVKASRAGKLHLLEHAFNM 616 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 ILEAGEIPH FF E+V A QH+YERAV LIN++A+A F+VSE QWT++FK SEDRIS Sbjct: 617 ILEAGEIPHHLFFFELVIQAIVQHNYERAVILINTLAYAPFRVSEKQWTNLFKESEDRIS 676 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPR 768 + L++LLD LG+ DV++E+TVSNL+++L +CGS R Sbjct: 677 HENLERLLDALGSCDVISESTVSNLTRSLHVLCGSGISR 715 >ref|XP_002310894.2| hypothetical protein POPTR_0007s14930g [Populus trichocarpa] gi|550334917|gb|EEE91344.2| hypothetical protein POPTR_0007s14930g [Populus trichocarpa] Length = 879 Score = 437 bits (1123), Expect = e-119 Identities = 254/509 (49%), Positives = 312/509 (61%), Gaps = 26/509 (5%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML SGKY VH++F KM++SG + KALTYKVLVRAFWEEG+VNEAVEAVRDME+RGVVG Sbjct: 381 VMLLSGKYKSVHEYFRKMKKSGESLKALTYKVLVRAFWEEGRVNEAVEAVRDMEQRGVVG 440 
Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 ASVYYELACCLC NGRW++AM S TGMI S MDGGH+D+CISI Sbjct: 441 AASVYYELACCLCYNGRWQDAMLEVEKMKRLRYKKPLEVSLTGMIASSMDGGHIDNCISI 500 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTLDSY 1242 FEHMK HC PNIGTIN MLKVY R+D+F +AKELFE+IK + T I D Y Sbjct: 501 FEHMKAHCVPNIGTINTMLKVYSRSDLFSEAKELFEDIKGVDHSGTTIIP-------DGY 553 Query: 1241 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1062 TYSS+LE SARA QWEYFEYVYKEM+ YQ DQ KHA LLVEASR+ K HLLEHAF+ I Sbjct: 554 TYSSMLEVSARALQWEYFEYVYKEMSFSGYQLDQIKHAPLLVEASRSGKNHLLEHAFDEI 613 Query: 1061 LEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRISK 882 LEAGEIPH FTEMV ATAQ +YERAVTLIN+MAHASF++SE QWTD+F+ + ++IS+ Sbjct: 614 LEAGEIPHPLLFTEMVFQATAQENYERAVTLINTMAHASFQISERQWTDLFEKNGEKISQ 673 Query: 881 DGLQKLLDTLGNTDVVTETTVSNLSKALKSIC----GSNAPRDSLSSLALDD-------- 738 D L+KLLD +G+ + +E TVSNLS++L+S+C + PR + DD Sbjct: 674 DSLEKLLDAVGHCRMASEVTVSNLSRSLRSLCRPGSSGDLPRTNSCIEDTDDTHINTNSG 733 Query: 737 ----------VTTGRSLSHDGKWKLNLIGRLGDGNTNPP----NGAETNVYDSANDDVSL 600 VTT S++ DG +L+ + + P N + TN DD Sbjct: 734 EIAGNRSAYMVTTSASMA-DGNLELDEDTFVNKTSITPDMSLVNNSSTN---REGDDPEA 789 Query: 599 LSYSPSCXXXXXXXXXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXX 420 S + + +DV S C+D + S +L Sbjct: 790 ASSTGNSVNGLDVATNLLVKRDVFADDVASGASTDCLDKKLSNILLEESAKDAEEVELEI 849 Query: 419 XXTSQVNNSSDSVPSASEILEGWRESRNK 333 T + +PSA IL+ W+ESR K Sbjct: 850 GTTEANDLYRSELPSAHAILDVWKESRKK 878 >ref|XP_007047547.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 3 [Theobroma cacao] gi|508699808|gb|EOX91704.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 3 [Theobroma cacao] Length = 628 Score = 433 bits (1114), Expect = e-118 Identities = 215/335 (64%), Positives = 259/335 (77%), Gaps = 1/335 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+FF KM+RSG P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G Sbjct: 117 
VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 176 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLCKNGRW++A+ +FTG+IM+ +DGGH +DCISI Sbjct: 177 TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 236 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI + SG + G + L D Sbjct: 237 FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 296 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYS +L ASA A QWEYFEYVYKEM L Y DQ KHA LLVEASRARK +LLEHAF+ Sbjct: 297 YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 356 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 LE GEIPH FTEM+ ATAQ +YE+ VTL+N+MAHA ++VSE QWT+ F+ + DRIS Sbjct: 357 FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 416 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGS 780 L KLLD L N ++ +E T SNL ++L+ +CGS Sbjct: 417 HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 451 >ref|XP_007047546.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 2 [Theobroma cacao] gi|508699807|gb|EOX91703.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 2 [Theobroma cacao] Length = 596 Score = 433 bits (1114), Expect = e-118 Identities = 215/335 (64%), Positives = 259/335 (77%), Gaps = 1/335 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+FF KM+RSG P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G Sbjct: 85 VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 144 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLCKNGRW++A+ +FTG+IM+ +DGGH +DCISI Sbjct: 145 TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 204 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI + SG + G + L D Sbjct: 205 
FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 264 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYS +L ASA A QWEYFEYVYKEM L Y DQ KHA LLVEASRARK +LLEHAF+ Sbjct: 265 YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 324 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 LE GEIPH FTEM+ ATAQ +YE+ VTL+N+MAHA ++VSE QWT+ F+ + DRIS Sbjct: 325 FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 384 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGS 780 L KLLD L N ++ +E T SNL ++L+ +CGS Sbjct: 385 HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 419 >ref|XP_007047545.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] gi|508699806|gb|EOX91702.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1 [Theobroma cacao] Length = 897 Score = 433 bits (1114), Expect = e-118 Identities = 215/335 (64%), Positives = 259/335 (77%), Gaps = 1/335 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKYDLVH+FF KM+RSG P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G Sbjct: 386 VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 445 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLCKNGRW++A+ +FTG+IM+ +DGGH +DCISI Sbjct: 446 TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 505 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGS-LTLDS 1245 F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI + SG + G + L D Sbjct: 506 FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 565 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYS +L ASA A QWEYFEYVYKEM L Y DQ KHA LLVEASRARK +LLEHAF+ Sbjct: 566 YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 625 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 LE GEIPH FTEM+ ATAQ +YE+ VTL+N+MAHA ++VSE QWT+ F+ + DRIS Sbjct: 626 
FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 685 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGS 780 L KLLD L N ++ +E T SNL ++L+ +CGS Sbjct: 686 HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 720 >ref|XP_006841116.1| hypothetical protein AMTR_s00086p00094500 [Amborella trichopoda] gi|548843010|gb|ERN02791.1| hypothetical protein AMTR_s00086p00094500 [Amborella trichopoda] Length = 828 Score = 433 bits (1113), Expect = e-118 Identities = 222/351 (63%), Positives = 261/351 (74%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SGKYDLVHKFFG MRR G PKALTYKVLV W EGKVNEAVEAV DMERRGVVG Sbjct: 396 VMLKSGKYDLVHKFFGTMRRGGLAPKALTYKVLVSCLWAEGKVNEAVEAVEDMERRGVVG 455 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC NGRWKEAM +FTGMI SCMDGG+V D ISI Sbjct: 456 TASVYYELACCLCNNGRWKEAMTQIEKLKSLPLSRPLEVAFTGMIQSCMDGGYVRDGISI 515 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTLDSY 1242 FE+M+++C+ NIGTIN MLK+YG NDMF KAKELFE IK + + N+ G + D+Y Sbjct: 516 FENMQEYCTLNIGTINVMLKLYGCNDMFTKAKELFEGIKMPEARYDMNLDCHGVNSPDAY 575 Query: 1241 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1062 TYS +LEASA + QWEYFE+VYKEMAL +Q DQNKHAWLLVEASRA HLLEHAF+ Sbjct: 576 TYSLMLEASAISLQWEYFEHVYKEMALSGFQLDQNKHAWLLVEASRAGMMHLLEHAFDSA 635 Query: 1061 LEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRISK 882 LEAGE+PH S FTEM+C HD++RA+TL+NSMAH S +VSE QWT++FK + D+IS Sbjct: 636 LEAGELPHWSIFTEMICQTLICHDFKRAITLVNSMAHVSLQVSEKQWTNLFKRNSDKISI 695 Query: 881 DGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTT 729 + LQKL L + +++E V+NLSK+L +CGSN P + AL DVTT Sbjct: 696 EELQKLRQCLNDKGLMSEPIVTNLSKSLCYLCGSNIP----TEYALCDVTT 742 >ref|XP_004509062.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Cicer arietinum] Length = 883 Score = 431 bits (1109), Expect = e-118 Identities = 249/529 (47%), Positives = 320/529 (60%), Gaps = 35/529 (6%) Frame = -3 Query: 1781 
VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SG YDLVH+ FGKMRRSG P+ALTYKVLVR W+EGKV+EAV+ VRDMER+GV+G Sbjct: 385 VMLESGNYDLVHELFGKMRRSGEVPEALTYKVLVRTCWKEGKVDEAVKVVRDMERKGVMG 444 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC GRW++A+ +FTGMI S MDGGH+DDCISI Sbjct: 445 TASVYYELACCLCNCGRWQDAIPEVERIRRLSHARPLEVTFTGMIRSSMDGGHIDDCISI 504 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGG-SLTLDS 1245 FE+M+DHC+PN+GT+N MLKVYG+NDMF KAK LFEE+K S GG S+ D+ Sbjct: 505 FEYMEDHCTPNVGTVNIMLKVYGQNDMFSKAKVLFEEVKVAKSDIYDFPKGGSTSIVPDA 564 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYS +LEASARAHQWEYFE+VYKEM L Y DQNKH+ LLV+ASRA K HLLEHAF+M Sbjct: 565 YTYSLMLEASARAHQWEYFEHVYKEMILSGYHLDQNKHSSLLVKASRAGKLHLLEHAFDM 624 Query: 1064 ILEAGEIP-HLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRI 888 ILE GEIP HL FF E+V A AQH+YERAV L+++MA+A ++V+E QWT++FK ++DRI Sbjct: 625 ILEVGEIPCHLIFF-ELVIQAIAQHNYERAVILLSTMAYAPYRVTEKQWTELFKKNKDRI 683 Query: 887 SKDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHD 708 + + L++LLD LG +VV+E TVSNLS++L +CG + R+ S + Sbjct: 684 NHENLERLLDALGKCNVVSEATVSNLSRSLHVLCGLGSSRNISSIIPF------------ 731 Query: 707 GKWKLNLIGRL--GDGNTNPPN-GAETNVYDSANDDVSLLSYSPSCXXXXXXXXXXXXXX 537 G +N + + G GN N PN + + A ++L S Sbjct: 732 GSENVNGLNEIIDGGGNGNVPNISGRMTIIEGAESGNNILLGSDQA-------------- 777 Query: 536 XXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQV--------NNSSD-- 387 E TF+V+ +D N+ ++ +V + SSD Sbjct: 778 ----ESDTFTVNRNQIDRVNNNDVVVCTPQNCNIDDKVSLCADKVEFCDHLALDKSSDGS 833 Query: 386 --------------------SVPSASEILEGWRESRNKDGIFLPFQLKC 300 PSA +ILE W+E R +D L +L C Sbjct: 834 DDELSDDESYEDDDVDDGVIDKPSAYQILEAWKEMREEDKTLLHSELDC 882 >emb|CAN83934.1| hypothetical protein VITISV_035768 [Vitis vinifera] Length = 615 Score = 429 bits (1104), Expect = e-117 Identities = 245/463 (52%), Positives = 291/463 (62%), Gaps = 10/463 (2%) Frame = -3 Query: 1688 
VLVRAFWEEGKVNEAVEAVRDMERRGVVGTASVYYELACCLCKNGRWKEAMXXXXXXXXX 1509 VLVRAFWEEGKVNEAVE VRDMERRGVVG ASVYYELACCLC NGRW++A+ Sbjct: 12 VLVRAFWEEGKVNEAVEVVRDMERRGVVGIASVYYELACCLCNNGRWQDAIVEVEKLKKR 71 Query: 1508 XXXXXXXXSFTGMIMSCMDGGHVDDCISIFEHMKDHCSPNIGTINAMLKVYGRNDMFVKA 1329 +FTGMI S MDGGH+DDC+SIFEHMK HCSPNIGTINAMLKVYGRNDMF KA Sbjct: 72 PHSKPLEVTFTGMITSSMDGGHLDDCLSIFEHMKYHCSPNIGTINAMLKVYGRNDMFSKA 131 Query: 1328 KELFEEIKRKNSGSNTNIVGGG-SLTLDSYTYSSILEASARAHQWEYFEYVYKEMALCLY 1152 KELFEE KR SNT + G SL D YTYSS+LEASA AHQWE+FEYVYKEM L Y Sbjct: 132 KELFEETKRSTFASNTCMDDGSISLVPDLYTYSSMLEASASAHQWEFFEYVYKEMTLSGY 191 Query: 1151 QFDQNKHAWLLVEASRARKGHLLEHAFEMILEAGEIPHLSFFTEMVCLATAQHDYERAVT 972 Q DQ+KHA LL +ASRA K HLLEHAF+ ILEAGEIPH S FTEM+C ATAQH+YERAVT Sbjct: 192 QLDQSKHALLLGKASRAGKWHLLEHAFDTILEAGEIPHPSIFTEMICQATAQHNYERAVT 251 Query: 971 LINSMAHASFKVSENQWTDIFKGSEDRISKDGLQKLLDTLGNTDVVTETTVSNLSKALKS 792 LIN+MAHA F VSE QWTD+F ++DRIS+ L+KLLD+L N DV E TVSNL K+L+S Sbjct: 252 LINAMAHAPFVVSEKQWTDLFV-TDDRISRVNLEKLLDSLHNCDVAEEATVSNLYKSLQS 310 Query: 791 ICGSNAPRDSLSSLALDDVTTGRSLSHDGKWKLNLIGRLGDGNTNPPNGAETNVYDSAND 612 +CGS D SS+A D + + L G G+ + N + D+ Sbjct: 311 LCGSGTSMDQ-SSVAFGD---------EAMIRTPLNGNSGELDDNKKVFFQKFSADARGS 360 Query: 611 DVSLLSYSPSCXXXXXXXXXXXXXXXXXIED---------VTFSVSGGCVDYRNSRPILP 459 D+S P ED F+ + + ++ P Sbjct: 361 DLSPHENPPVKNSDVTFDIFSVNLTRSEEEDDDTDGEAISEAFNYACNGDEVASNEPNTL 420 Query: 458 GXXXXXXXXXXXXXXTSQVNNSSDSVPSASEILEGWRESRNKD 330 + ++ ++PSA+EILE W++SR +D Sbjct: 421 DGNSEGINKIELNMRAKEDDSHGSNLPSANEILETWKKSRERD 463 >ref|XP_003608531.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355509586|gb|AES90728.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 877 Score = 424 bits (1089), Expect = e-116 Identities = 206/339 (60%), Positives = 258/339 (76%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSG YDLVH+ F KM+R+G P+ALTYKV+VR FW+EGKV+EAV+AVRDMERRGV+G Sbjct: 390 
VMLQSGNYDLVHELFEKMQRNGEVPEALTYKVMVRTFWKEGKVDEAVKAVRDMERRGVMG 449 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 TASVYYELACCLC GRW++A +FTGMI S MDGGH+DDCI I Sbjct: 450 TASVYYELACCLCNCGRWQDATLEVEKIKRLPHAKPLEVTFTGMIRSSMDGGHIDDCICI 509 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTLDSY 1242 FE+M+DHC+PN+GT+N MLKVY +NDMF AK LFEE+K V L D+Y Sbjct: 510 FEYMQDHCAPNVGTVNTMLKVYSQNDMFSTAKVLFEEVK----------VAKSDLRPDAY 559 Query: 1241 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1062 TY+ +LEAS+R HQWEYFE+VYKEM L Y DQNKH LLV+ASRA K HLLEHAF+M+ Sbjct: 560 TYNLMLEASSRGHQWEYFEHVYKEMILSGYHLDQNKHLPLLVKASRAGKLHLLEHAFDMV 619 Query: 1061 LEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRISK 882 LEAGEIPH FF E+V A AQH+YERA+ L+++MAHA ++V+E QWT++FK +EDRI+ Sbjct: 620 LEAGEIPHHLFFFELVIQAIAQHNYERAIILLSTMAHAPYRVTEKQWTELFKENEDRINH 679 Query: 881 DGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRD 765 + L++LLD LGN +VV+E T+SNLS++L +CG + R+ Sbjct: 680 ENLKRLLDDLGNCNVVSEATISNLSRSLHDLCGLGSSRN 718 >ref|XP_006393964.1| hypothetical protein EUTSA_v10003664mg [Eutrema salsugineum] gi|557090603|gb|ESQ31250.1| hypothetical protein EUTSA_v10003664mg [Eutrema salsugineum] Length = 811 Score = 418 bits (1074), Expect = e-114 Identities = 215/376 (57%), Positives = 270/376 (71%), Gaps = 3/376 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SGKYD VH+FF KMR SG PKA+TYKVLVRA W E K+ EAVEAVRDME++GVVG Sbjct: 397 VMLESGKYDRVHEFFRKMRSSGEAPKAITYKVLVRALWRENKIEEAVEAVRDMEQKGVVG 456 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 T SVYYELACCLC NGRW++AM +FTG+I + ++GGHVDDC+SI Sbjct: 457 TGSVYYELACCLCNNGRWRDAMLEVGRMRRLENCRPLEITFTGLIAASLNGGHVDDCMSI 516 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTLDSY 1242 F++MKD C PNIGT+N ML+VYGRNDMF +AKELFEEI R+ L D Y Sbjct: 517 FQYMKDKCDPNIGTVNTMLRVYGRNDMFSEAKELFEEIVREKEAH---------LVPDEY 567 Query: 1241 
TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1062 TYS +LEASAR+ QWEYFE+VY+ M L YQ DQ KHA +L+EASRA K LLEHAF+ I Sbjct: 568 TYSFMLEASARSLQWEYFEHVYQTMILSGYQIDQTKHAPMLIEASRAGKWSLLEHAFDAI 627 Query: 1061 LEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRISK 882 LE GEIPH FFTEM+C ATA+ DY+RA+TLIN++A ASF++SE QWTD+F+ ++D +++ Sbjct: 628 LEDGEIPHPLFFTEMLCHATAKGDYQRAITLINTVALASFQISEEQWTDLFEENQDWLTQ 687 Query: 881 DGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDGK 702 + LQ L D + + D +E TV+NLSK+LKS+CG ++ S L DVTT H K Sbjct: 688 ENLQNLCDYILDCDYASEPTVANLSKSLKSLCGVSSSSSSTEPLLAIDVTT-----HSEK 742 Query: 701 WKLNLI---GRLGDGN 663 + +L+ R+ DGN Sbjct: 743 PEEDLLFHDTRMKDGN 758 >ref|XP_006363825.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Solanum tuberosum] Length = 864 Score = 407 bits (1045), Expect = e-110 Identities = 199/335 (59%), Positives = 252/335 (75%), Gaps = 1/335 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VMLQSGKY+LVH+FFGKM+RSG KAL+YKVLV++FWEEG+VNEA++AVR+ME+RGVVG Sbjct: 381 VMLQSGKYELVHEFFGKMKRSGEALKALSYKVLVKSFWEEGRVNEAIQAVREMEQRGVVG 440 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 +ASVYYELACCLC +G WKEA +FTGMI+S MDGGH+D CI I Sbjct: 441 SASVYYELACCLCYHGMWKEAFLEIEKLKMLRRTRPLAVTFTGMILSSMDGGHIDGCICI 500 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKNSGSNTNIVGGGSLTL-DS 1245 +EH K HC P+IG INAMLKVYG+NDMF KAKELFE K ++SG + S D+ Sbjct: 501 YEHSKKHCEPDIGIINAMLKVYGKNDMFYKAKELFEWAKTESSGPQLSQDDFSSARRPDA 560 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+S+LE+SA + QWEYFEYVYKEMAL Y DQ++HA+LLVEAS+A K HLLEHAF+ Sbjct: 561 YTYTSMLESSAFSLQWEYFEYVYKEMALAGYLLDQSRHAYLLVEASKAGKVHLLEHAFDA 620 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 ILE G+IPH SFF E++C AT QHD+ERA+ LI M H F+VS+ +W D+F + +R+S Sbjct: 621 ILEVGQIPHPSFFFEILCQATCQHDHERALALIKLMVHVPFQVSKQEWIDLFNSNNERLS 680 
Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGS 780 L+ LLD + + ++TT+ NL +AL+S+CGS Sbjct: 681 HSSLRGLLDVICRQSLGSDTTIVNLCRALESVCGS 715 >ref|XP_004141982.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Cucumis sativus] gi|449499902|ref|XP_004160949.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic-like [Cucumis sativus] Length = 860 Score = 407 bits (1045), Expect = e-110 Identities = 206/352 (58%), Positives = 267/352 (75%), Gaps = 1/352 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SGKY+ +H F KM+++G T KA TY+VLV+AFWEEG VN A+EAVRDME+RGVVG Sbjct: 387 VMLKSGKYEQLHNLFTKMKKNGQTLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVG 446 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 +ASVYYELACCLC NG+W++A+ +FTGMI S +GGH+DDCISI Sbjct: 447 SASVYYELACCLCYNGKWQDALVEVEKMKTLSHMKPLVVTFTGMISSSFNGGHIDDCISI 506 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRK-NSGSNTNIVGGGSLTLDS 1245 FE+MK C+PNIGTIN MLKVYGRNDM+ KAK+LFEEIKRK +S S+ + V SL D Sbjct: 507 FEYMKQICAPNIGTINTMLKVYGRNDMYSKAKDLFEEIKRKADSSSHDSAVP--SLVPDE 564 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTY+S+LEA+A + QWEYFE VY+EMAL YQ DQ+KHA LLVEAS+A K +LL+HAF+ Sbjct: 565 YTYASMLEAAASSLQWEYFESVYREMALSGYQLDQSKHALLLVEASKAGKWYLLDHAFDT 624 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 ILEAG+IPH FTEM+ T Q +YE+AVTL+ +M +A F+VSE QWT++F+G+ DRI Sbjct: 625 ILEAGQIPHPLLFTEMILQLTTQDNYEQAVTLVRTMGYAPFQVSERQWTELFEGNTDRIR 684 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTT 729 ++ L++LL LG+ D +E TVSNLS++L+S+C + P ++ S+A D T Sbjct: 685 RNNLKQLLHALGDCD-ASEATVSNLSRSLQSLCKFDIPENTSQSVACDHDAT 735 >ref|XP_006279544.1| hypothetical protein CARUB_v10028435mg [Capsella rubella] gi|482548248|gb|EOA12442.1| hypothetical protein CARUB_v10028435mg [Capsella rubella] Length = 801 Score = 405 bits (1040), Expect = e-110 Identities = 206/355 
(58%), Positives = 261/355 (73%), Gaps = 1/355 (0%) Frame = -3 Query: 1781 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1602 VML+SGKYD VH FF KM+ SG PKA+TYKVLVRA W EGK+ EAVEAVRDME++GV+G Sbjct: 391 VMLESGKYDRVHDFFRKMKSSGEAPKAITYKVLVRALWREGKIEEAVEAVRDMEQKGVIG 450 Query: 1601 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1422 T SVYYELACCLC NGRW +AM +FTG+I + ++GGHV DC++I Sbjct: 451 TGSVYYELACCLCNNGRWHDAMLEVGRMKRLENCKPLEITFTGLIAASLNGGHVGDCMAI 510 Query: 1421 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEI-KRKNSGSNTNIVGGGSLTLDS 1245 F++MKD C PNIGT+N ML+VYGRNDMF +AKELFEEI RK + L + Sbjct: 511 FQYMKDRCDPNIGTVNMMLRVYGRNDMFSEAKELFEEIVSRKET----------HLAPNE 560 Query: 1244 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1065 YTYS +LEASAR+ QWEYFE+VY+ M L YQ DQ KHA +L+EASRA K LLEHAF+ Sbjct: 561 YTYSFMLEASARSLQWEYFEHVYQTMILSGYQMDQTKHAPMLIEASRAGKWSLLEHAFDA 620 Query: 1064 ILEAGEIPHLSFFTEMVCLATAQHDYERAVTLINSMAHASFKVSENQWTDIFKGSEDRIS 885 +LE GEIPH FFTE++C ATA+ DY+RA+TLIN++A ASF++SE +WTD+F+ +D ++ Sbjct: 621 VLEDGEIPHPLFFTELLCHATAKGDYQRAITLINTVALASFQISEEEWTDLFEEHQDWLT 680 Query: 884 KDGLQKLLDTLGNTDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRS 720 ++ LQKL D L + D V E TVSNLSK+LKS+CGS++ + LA+D T +S Sbjct: 681 QENLQKLSDHLLDCDYVNEPTVSNLSKSLKSLCGSSS-SSTQPLLAIDVPTPSQS 734
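
The statistics lines in the report above ("Score = ... Expect = ... Identities = ... Frame = ...") can be read programmatically. The following is a minimal sketch, not part of the BLAST distribution: the helper names `parse_evalue`, `parse_stats` and `expected_frame` are hypothetical, and the regular expression only assumes the layout seen in this report. Two details are worth noting. Legacy BLAST prints very small E-values without a mantissa ("e-130" means 1e-130), and for a translated minus-strand hit the query coordinates run downward. The frame rule in `expected_frame` assumes the usual convention that frame -1 begins at the last nucleotide of the query.

```python
import re

def parse_evalue(text: str) -> float:
    """Convert a legacy BLAST E-value string to a float.

    Legacy reports omit the mantissa for tiny values, e.g. "e-130".
    """
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text  # "e-130" -> "1e-130"
    return float(text)

# Layout assumed from this report; the Gaps field is absent for ungapped hits.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<aln_len>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)"
)

def parse_stats(text: str) -> dict:
    """Extract the per-hit statistics from one alignment header."""
    m = STATS_RE.search(text)
    if m is None:
        raise ValueError("no BLASTX statistics line found")
    ident = int(m.group("ident"))
    aln_len = int(m.group("aln_len"))
    return {
        "bits": float(m.group("bits")),
        "raw": int(m.group("raw")),
        "evalue": parse_evalue(m.group("evalue")),
        "identities": ident,
        "aln_len": aln_len,
        "positives": int(m.group("pos")),
        "gaps": int(m.group("gaps") or 0),
        "frame": int(m.group("frame")),
        "pct_identity": 100.0 * ident / aln_len,
    }

def expected_frame(query_len: int, q_from: int, q_to: int) -> int:
    """Reading frame implied by the query coordinates of a translated hit.

    Assumes frame +1 starts at nucleotide 1 and frame -1 starts at the
    last nucleotide of the query (reverse complement).
    """
    if q_from > q_to:  # minus strand: coordinates decrease
        return -((query_len - q_from) % 3 + 1)
    return (q_from - 1) % 3 + 1

if __name__ == "__main__":
    header = (
        "Score = 471 bits (1212), Expect = e-130 Identities = 257/500 (51%), "
        "Positives = 327/500 (65%), Gaps = 8/500 (1%) Frame = -3"
    )
    stats = parse_stats(header)
    print(stats["evalue"], round(stats["pct_identity"], 1), stats["frame"])
    # First alignment row of the top hit: Query 1781 ... 1602, query length 1783
    print(expected_frame(1783, 1781, 1602))
```

Each alignment row in the report spans 180 nucleotides of the query (for example 1781 down to 1602), which corresponds to the 60 translated residues shown per row. For routine work an established parser is likely a better choice than a hand-written regular expression; this sketch is only meant to make the fields of the report explicit.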