BLASTX nr result

ID: Akebia27_contig00010171 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00010171
         (1761 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004288257.1| PREDICTED: pentatricopeptide repeat-containi...   456   e-125
ref|XP_002525278.1| GTP binding protein, putative [Ricinus commu...   439   e-120
ref|XP_006466446.1| PREDICTED: pentatricopeptide repeat-containi...   438   e-120
ref|XP_006426111.1| hypothetical protein CICLE_v10027042mg [Citr...   438   e-120
gb|EXB94039.1| hypothetical protein L484_009383 [Morus notabilis]     437   e-120
ref|XP_007204940.1| hypothetical protein PRUPE_ppa001240mg [Prun...   434   e-119
ref|XP_003550974.1| PREDICTED: pentatricopeptide repeat-containi...   434   e-119
ref|XP_002310894.2| hypothetical protein POPTR_0007s14930g [Popu...   433   e-118
ref|XP_007047547.1| Tetratricopeptide repeat-like superfamily pr...   430   e-117
ref|XP_007047546.1| Tetratricopeptide repeat-like superfamily pr...   430   e-117
ref|XP_007047545.1| Tetratricopeptide repeat-like superfamily pr...   430   e-117
ref|XP_007155826.1| hypothetical protein PHAVU_003G234700g [Phas...   429   e-117
ref|XP_006841116.1| hypothetical protein AMTR_s00086p00094500 [A...   429   e-117
ref|XP_004509062.1| PREDICTED: pentatricopeptide repeat-containi...   427   e-117
ref|XP_003608531.1| Pentatricopeptide repeat-containing protein ...   420   e-115
emb|CAN83934.1| hypothetical protein VITISV_035768 [Vitis vinifera]   419   e-114
ref|XP_006393964.1| hypothetical protein EUTSA_v10003664mg [Eutr...   417   e-113
ref|XP_006363825.1| PREDICTED: pentatricopeptide repeat-containi...   408   e-111
ref|XP_004233609.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-110
ref|XP_006279544.1| hypothetical protein CARUB_v10028435mg [Caps...   404   e-110

>ref|XP_004288257.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 985

 Score =  456 bits (1172), Expect = e-125
 Identities = 253/500 (50%), Positives = 319/500 (63%), Gaps = 8/500 (1%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+ F KM++SG  PKALTYKV+VRA W EGKVNEA+EAVRDMERRGVVG
Sbjct: 512  VMLQSGKYDLVHELFRKMKKSGEAPKALTYKVIVRALWCEGKVNEAIEAVRDMERRGVVG 571

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            T+ VYYELACCLCK+GRW++A+                 +FTGMI S M+GGH+DDC+SI
Sbjct: 572  TSGVYYELACCLCKSGRWQDALLQVEKMKNVTNTKPLEVTFTGMIKSSMEGGHIDDCVSI 631

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKR-KXXXXXXXXXXXXSLTLDS 1223
            FEHMK+HCSPNIGTIN MLKV+G  DMF KAKELFEE K  K            SL  D 
Sbjct: 632  FEHMKNHCSPNIGTINTMLKVFGHTDMFSKAKELFEETKAAKSDSDPSLEGGGSSLVPDE 691

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+S+L+ASA A QWEYFEYVYKEMAL  YQ DQ+K+A +L+EASRA KG+LLEHAF+ 
Sbjct: 692  YTYTSMLKASASALQWEYFEYVYKEMALSGYQIDQSKNASILMEASRAGKGYLLEHAFDR 751

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
             LEAGEIPHL FF EM+  ATA+HDY+RA TLVN+MA+A F+VSE QWTD+FK +ED IS
Sbjct: 752  TLEAGEIPHLLFFIEMVYQATARHDYKRAATLVNTMAYAPFQVSERQWTDVFKKNEDGIS 811

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDG 683
            +DGL+KLLD L + DV +E T+ NL ++L+S+C S   RD   S+++  +      S D 
Sbjct: 812  QDGLKKLLDALEHCDVTSEATLLNLKRSLQSLCWSYTSRDFSDSVSVSSLNDNDEGSDDN 871

Query: 682  KWKLN-------LIGRLGDGNTNPPNGAETNVYDSANDDVSLLSDSPSCXXXXXXXXXXX 524
            +  +        + G++  G T+PP+       DS++  V+      S            
Sbjct: 872  EGLITPNHYLGYINGKMSPG-TDPPD-------DSSDAPVNEFPHRSS------------ 911

Query: 523  XXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNSSDSVPSASE 344
                    DV   +    +  R    I  G                + ++    +PSA E
Sbjct: 912  -----TRRDVAADIE---IVSRPLDYISDGGLESTEIDEEIEALIYKDDSHKSHLPSAKE 963

Query: 343  ILEGWRESRNKDGIFLPFQL 284
            I++ W+E R K GI +PFQL
Sbjct: 964  IMKDWKERRKKGGILVPFQL 983


>ref|XP_002525278.1| GTP binding protein, putative [Ricinus communis]
            gi|223535436|gb|EEF37106.1| GTP binding protein, putative
            [Ricinus communis]
          Length = 1010

 Score =  439 bits (1128), Expect = e-120
 Identities = 256/505 (50%), Positives = 305/505 (60%), Gaps = 13/505 (2%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML SGKYDLVH+ F KM RSG  PKALTYKVLVRAFWEEGKVNEA+EAVRDME RGVVG
Sbjct: 515  VMLNSGKYDLVHELFRKMNRSGEAPKALTYKVLVRAFWEEGKVNEAMEAVRDMENRGVVG 574

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TAS+YYELACCLC  G W++AM                 +FTG+IMS +DGGHV DCISI
Sbjct: 575  TASLYYELACCLCYYGMWQDAMLEVKKMKNLRHSKPLEVTFTGLIMSSLDGGHVSDCISI 634

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            FE+MK +C PNIGTIN MLKVYGRND+F KAKELF EIK               L  D +
Sbjct: 635  FEYMKAYCVPNIGTINIMLKVYGRNDLFSKAKELFGEIK-------GTNNDGTYLVPDEF 687

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TYSS+LEASA A QWEYFE VYKEM  C YQ DQ KHA LLVEASR  K HLLEHAF+  
Sbjct: 688  TYSSMLEASASALQWEYFELVYKEMTFCGYQLDQKKHASLLVEASRVGKYHLLEHAFDAA 747

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LEAGEIPH   FTEM+  ATAQ +YERAV LVN++A A FK+SE QW D+F+ + D+I++
Sbjct: 748  LEAGEIPHHLLFTEMVFQATAQQNYERAVVLVNTLALAPFKISEKQWIDLFQKNGDKITQ 807

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDGK 680
            DGL+KLLD L +SDV +E TV+NLS+ L S+CG         S +L    T  S    G 
Sbjct: 808  DGLEKLLDALRSSDVASEPTVANLSRTLHSLCGRGRSEYLSGSTSLGIDVTNSSYLDSGS 867

Query: 679  WKL-----------NLIGRLGDGNTNPPNGAETNVYDSANDDVSLLSDSP-SCXXXXXXX 536
             K+            LI +  D      +   +N     +DD    S SP +        
Sbjct: 868  RKIMGDKGPEMHEDTLIDKT-DIAYGDLSVTRSNTGGEGSDDTDEASSSPRNYSTDRDGI 926

Query: 535  XXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNS-SDSV 359
                       +D     S  C+D+      +P                +QV++S    +
Sbjct: 927  ASICTNVKIFGDDEASGASTDCLDFDEMEYGIP---------------INQVDDSCGTKL 971

Query: 358  PSASEILEGWRESRNKDGIFLPFQL 284
            PSA EIL+ W+ESR K  +F PFQL
Sbjct: 972  PSADEILDIWKESR-KGRLFFPFQL 995


>ref|XP_006466446.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Citrus sinensis]
          Length = 901

 Score =  438 bits (1126), Expect = e-120
 Identities = 223/345 (64%), Positives = 259/345 (75%), Gaps = 2/345 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FF KM +SG    ALTYKVLVRAFWEEGK+NEAV AVR+ME+RGVVG
Sbjct: 379  VMLQSGKYDLVHEFFRKMAKSGEAIGALTYKVLVRAFWEEGKINEAVAAVRNMEQRGVVG 438

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC NGRW++AM                 +FTG+I+S MDGGH+DDCISI
Sbjct: 439  TASVYYELACCLCNNGRWQDAMLVVEKIKSLRHSKPLEITFTGLIISSMDGGHIDDCISI 498

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKR-KXXXXXXXXXXXXSLTLDS 1223
            F+HMKDHC PNIGT+NAMLKVY RNDMF KAKELFEE  R               L  D 
Sbjct: 499  FQHMKDHCEPNIGTVNAMLKVYSRNDMFSKAKELFEETTRANSSGYTFLSGDGAPLKPDE 558

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYSS+LEASA AHQWEYFEYVYK MAL   Q DQ KHAWLLVEASRA K HLLEHAF+ 
Sbjct: 559  YTYSSMLEASATAHQWEYFEYVYKGMALSGCQLDQTKHAWLLVEASRAGKCHLLEHAFDS 618

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            +LEAGEIPH  FFTEM+  A  Q +YE+AV L+N+MA+A F ++E QWT++F+ +EDRIS
Sbjct: 619  LLEAGEIPHPLFFTEMLIQAIVQSNYEKAVALINAMAYAPFHITERQWTELFESNEDRIS 678

Query: 862  KDGLQKLLDTLGNSDVV-TETTVSNLSKALKSICGSNAPRDSLSS 731
            +D L+KLL+ L N +   +E TVSNLS+AL ++C S   RD  SS
Sbjct: 679  RDKLEKLLNALCNCNAASSEITVSNLSRALHALCRSEKERDLSSS 723


>ref|XP_006426111.1| hypothetical protein CICLE_v10027042mg [Citrus clementina]
            gi|557528101|gb|ESR39351.1| hypothetical protein
            CICLE_v10027042mg [Citrus clementina]
          Length = 900

 Score =  438 bits (1126), Expect = e-120
 Identities = 223/345 (64%), Positives = 259/345 (75%), Gaps = 2/345 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FF KM +SG    ALTYKVLVRAFWEEGK+NEAV AVR+ME+RGVVG
Sbjct: 379  VMLQSGKYDLVHEFFRKMAKSGEAIGALTYKVLVRAFWEEGKINEAVAAVRNMEQRGVVG 438

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC NGRW++AM                 +FTG+I+S MDGGH+DDCISI
Sbjct: 439  TASVYYELACCLCNNGRWQDAMLVVEKIKSLRHSKPLEITFTGLIISSMDGGHIDDCISI 498

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKR-KXXXXXXXXXXXXSLTLDS 1223
            F+HMKDHC PNIGT+NAMLKVY RNDMF KAKELFEE  R               L  D 
Sbjct: 499  FQHMKDHCEPNIGTVNAMLKVYSRNDMFSKAKELFEETTRANSSGYTFLSGDGTPLKPDE 558

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYSS+LEASA AHQWEYFEYVYK MAL   Q DQ KHAWLLVEASRA K HLLEHAF+ 
Sbjct: 559  YTYSSMLEASATAHQWEYFEYVYKGMALSGCQLDQTKHAWLLVEASRAGKCHLLEHAFDS 618

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            +LEAGEIPH  FFTEM+  A  Q +YE+AV L+N+MA+A F ++E QWT++F+ +EDRIS
Sbjct: 619  LLEAGEIPHPLFFTEMLIQAIVQSNYEKAVALINAMAYAPFHITERQWTELFESNEDRIS 678

Query: 862  KDGLQKLLDTLGNSDVV-TETTVSNLSKALKSICGSNAPRDSLSS 731
            +D L+KLL+ L N +   +E TVSNLS+AL ++C S   RD  SS
Sbjct: 679  RDKLEKLLNALCNCNAASSEITVSNLSRALHALCRSEKERDLSSS 723


>gb|EXB94039.1| hypothetical protein L484_009383 [Morus notabilis]
          Length = 910

 Score =  437 bits (1123), Expect = e-120
 Identities = 225/349 (64%), Positives = 263/349 (75%), Gaps = 4/349 (1%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH++F KMR+SG TPKALTYKVLVRAFW EGKVNEAVE VRDME+RGVVG
Sbjct: 408  VMLQSGKYDLVHEYFRKMRKSGETPKALTYKVLVRAFWGEGKVNEAVEVVRDMEQRGVVG 467

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
             +SVYYELACCLC N RW++AM                 +FTGMIMS M GGH+ DCISI
Sbjct: 468  ASSVYYELACCLCSNRRWEDAMLEVEKMKKLSNSRPLEVAFTGMIMSSMQGGHISDCISI 527

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTL-DS 1223
            FEHMK HCSPNIGT+N MLKVYGRNDMF KAKELFEEIK++            +  + D 
Sbjct: 528  FEHMKTHCSPNIGTLNIMLKVYGRNDMFSKAKELFEEIKKRNSDSCSSFDGGDTFLIPDE 587

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+++LEASA A QWEYFEYVYKEM L  YQ DQNKHA LL EASRA K HLLEHAF+ 
Sbjct: 588  YTYNAMLEASASALQWEYFEYVYKEMVLSGYQLDQNKHASLLPEASRAGKWHLLEHAFDA 647

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            ILEAGEIP+  +FTEM+  ATA+HDY+RAVTLVN+ A A F+V+E QW D F+ + +RIS
Sbjct: 648  ILEAGEIPNSQYFTEMVLQATARHDYDRAVTLVNAAALAPFQVTEEQWKDFFEKNRERIS 707

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICG---SNAPRDSLSSLA 725
            +D L+KLL +L N +V +E TV NLS+AL+ +     S A RD  SS+A
Sbjct: 708  QDNLEKLLRSLDNCNVKSEATVVNLSRALRGLSDLSESGASRDFSSSIA 756


>ref|XP_007204940.1| hypothetical protein PRUPE_ppa001240mg [Prunus persica]
            gi|462400582|gb|EMJ06139.1| hypothetical protein
            PRUPE_ppa001240mg [Prunus persica]
          Length = 874

 Score =  434 bits (1117), Expect = e-119
 Identities = 258/493 (52%), Positives = 311/493 (63%), Gaps = 4/493 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+ F KM+ SG  PKAL YKVLVRAFW EGKVNEAVEAVRDME+RGVVG
Sbjct: 389  VMLQSGKYDLVHELFRKMKNSGEAPKALNYKVLVRAFWCEGKVNEAVEAVRDMEQRGVVG 448

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            T SVYYELACCLC NGRW++A+                 +FTGMI S M+GGH+D CISI
Sbjct: 449  TGSVYYELACCLCNNGRWQDALVEVEKMKNVSNTKPLEVTFTGMITSSMEGGHIDSCISI 508

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTL-DS 1223
            F+HMK+ C+PNIGTIN MLKV+GR+DMF KAKELFEEIK              +L + D 
Sbjct: 509  FKHMKNRCAPNIGTINTMLKVFGRSDMFFKAKELFEEIKTVRAESDFSLEGGGTLVVPDQ 568

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+S+L+ASA A QWEYFEYVYKEMAL  YQ DQ KHA LLV+ASR+ K +LLEHAF+ 
Sbjct: 569  YTYTSMLKASASALQWEYFEYVYKEMALSGYQVDQTKHASLLVKASRSGKFYLLEHAFDT 628

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
             LEAGEIPH   FTEM+  ATAQHDY+RAVTLVN+MA+A F+VSE QWTD+F+ + D I+
Sbjct: 629  SLEAGEIPHPLIFTEMVFQATAQHDYKRAVTLVNAMAYAPFQVSERQWTDLFEKNGDTIT 688

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDG 683
            +DGL+KLLD L N DVV+E TV NLS++L  +C S   R   SS       T  S S DG
Sbjct: 689  QDGLEKLLDALHNCDVVSEATVLNLSRSLLRLCRSYRSRGLSSSAPFGSGATETS-SLDG 747

Query: 682  KWKLNLIGRLGDGNTNPPNGAETNVYDSANDDVSLLSDSPSCXXXXXXXXXXXXXXXXXI 503
                        GN   PN +  ++  S N     L  S +                  +
Sbjct: 748  D------NEEIYGNGIMPNHSLESIDGSHNPRREPLDKSTN--VPLDAFSVNHASTRRDV 799

Query: 502  EDVTFSVSGGCVDYRNSRPIL--PGXXXXXXXXXXXXXXTSQVNNSSDS-VPSASEILEG 332
            ++VT +VS      R+S  I    G                 V++S DS +PSA EIL+ 
Sbjct: 800  DEVTRTVS------RSSEYISDEDGEYSTEIDKEIEALIYKDVDDSHDSDLPSAPEILKV 853

Query: 331  WRESRNKDGIFLP 293
            W+E R +    LP
Sbjct: 854  WKERRKEARDSLP 866


>ref|XP_003550974.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Glycine max]
          Length = 865

 Score =  434 bits (1117), Expect = e-119
 Identities = 215/340 (63%), Positives = 262/340 (77%), Gaps = 1/340 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SG YDLVH+FFGKM+RSG  PKALTYKVLV+ FW+EGKVNEAV+AVRDMERRGV+G
Sbjct: 368  VMLESGNYDLVHEFFGKMKRSGEVPKALTYKVLVKTFWKEGKVNEAVKAVRDMERRGVIG 427

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC NGRW++A+                 +FTGMI S MDGGH++DCI I
Sbjct: 428  TASVYYELACCLCNNGRWQDAILEVDNIRSLPHAKPLEVTFTGMIKSSMDGGHINDCICI 487

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIK-RKXXXXXXXXXXXXSLTLDS 1223
            FE+MK+HC PNIG IN MLKVYG+NDMF KAK LFEE+K  K            S+  D 
Sbjct: 488  FEYMKEHCVPNIGAINTMLKVYGQNDMFSKAKVLFEEVKVAKSEFYATPEGGYSSVVPDV 547

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            Y+Y+S+LEASA A QWEYFE+VY+EM +  YQ DQ+KH  LLV+ASRA K HLLEHAF+M
Sbjct: 548  YSYNSMLEASATAQQWEYFEHVYREMIVSGYQLDQDKHLSLLVKASRAGKLHLLEHAFDM 607

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            ILEAGEIPH  FF E++  A AQH+YERAV L+N+MA+A F+V+E QWT++FK SEDRIS
Sbjct: 608  ILEAGEIPHHLFFFELVIQAIAQHNYERAVILINTMAYAPFRVTEKQWTNLFKESEDRIS 667

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRD 743
             + L++LLD LGN D+V+E TVSNL+++L  +CG    R+
Sbjct: 668  LENLERLLDALGNCDIVSELTVSNLTRSLHVLCGLGTSRN 707


>ref|XP_002310894.2| hypothetical protein POPTR_0007s14930g [Populus trichocarpa]
            gi|550334917|gb|EEE91344.2| hypothetical protein
            POPTR_0007s14930g [Populus trichocarpa]
          Length = 879

 Score =  433 bits (1113), Expect = e-118
 Identities = 250/509 (49%), Positives = 311/509 (61%), Gaps = 26/509 (5%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML SGKY  VH++F KM++SG + KALTYKVLVRAFWEEG+VNEAVEAVRDME+RGVVG
Sbjct: 381  VMLLSGKYKSVHEYFRKMKKSGESLKALTYKVLVRAFWEEGRVNEAVEAVRDMEQRGVVG 440

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
             ASVYYELACCLC NGRW++AM                 S TGMI S MDGGH+D+CISI
Sbjct: 441  AASVYYELACCLCYNGRWQDAMLEVEKMKRLRYKKPLEVSLTGMIASSMDGGHIDNCISI 500

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            FEHMK HC PNIGTIN MLKVY R+D+F +AKELFE+IK              ++  D Y
Sbjct: 501  FEHMKAHCVPNIGTINTMLKVYSRSDLFSEAKELFEDIK-------GVDHSGTTIIPDGY 553

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TYSS+LE SARA QWEYFEYVYKEM+   YQ DQ KHA LLVEASR+ K HLLEHAF+ I
Sbjct: 554  TYSSMLEVSARALQWEYFEYVYKEMSFSGYQLDQIKHAPLLVEASRSGKNHLLEHAFDEI 613

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LEAGEIPH   FTEM+  ATAQ +YERAVTL+N+MAHASF++SE QWTD+F+ + ++IS+
Sbjct: 614  LEAGEIPHPLLFTEMVFQATAQENYERAVTLINTMAHASFQISERQWTDLFEKNGEKISQ 673

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSIC----GSNAPRDSLSSLALDD-------- 716
            D L+KLLD +G+  + +E TVSNLS++L+S+C      + PR +      DD        
Sbjct: 674  DSLEKLLDAVGHCRMASEVTVSNLSRSLRSLCRPGSSGDLPRTNSCIEDTDDTHINTNSG 733

Query: 715  ----------VTTGRSLSHDGKWKLNLIGRLGDGNTNPP----NGAETNVYDSANDDVSL 578
                      VTT  S++ DG  +L+    +   +  P     N + TN      DD   
Sbjct: 734  EIAGNRSAYMVTTSASMA-DGNLELDEDTFVNKTSITPDMSLVNNSSTN---REGDDPEA 789

Query: 577  LSDSPSCXXXXXXXXXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXX 398
             S + +                   +DV    S  C+D + S  +L              
Sbjct: 790  ASSTGNSVNGLDVATNLLVKRDVFADDVASGASTDCLDKKLSNILLEESAKDAEEVELEI 849

Query: 397  XXTSQVNNSSDSVPSASEILEGWRESRNK 311
              T   +     +PSA  IL+ W+ESR K
Sbjct: 850  GTTEANDLYRSELPSAHAILDVWKESRKK 878


>ref|XP_007047547.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 3
            [Theobroma cacao] gi|508699808|gb|EOX91704.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 3 [Theobroma cacao]
          Length = 628

 Score =  430 bits (1105), Expect = e-117
 Identities = 216/335 (64%), Positives = 256/335 (76%), Gaps = 1/335 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FF KM+RSG  P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G
Sbjct: 117  VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 176

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLCKNGRW++A+                 +FTG+IM+ +DGGH +DCISI
Sbjct: 177  TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 236

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEI-KRKXXXXXXXXXXXXSLTLDS 1223
            F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI K K            +L  D 
Sbjct: 237  FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 296

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYS +L ASA A QWEYFEYVYKEM L  Y  DQ KHA LLVEASRARK +LLEHAF+ 
Sbjct: 297  YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 356

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
             LE GEIPH   FTEMI  ATAQ +YE+ VTLVN+MAHA ++VSE QWT+ F+ + DRIS
Sbjct: 357  FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 416

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGS 758
               L KLLD L N ++ +E T SNL ++L+ +CGS
Sbjct: 417  HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 451


>ref|XP_007047546.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 2
            [Theobroma cacao] gi|508699807|gb|EOX91703.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 2 [Theobroma cacao]
          Length = 596

 Score =  430 bits (1105), Expect = e-117
 Identities = 216/335 (64%), Positives = 256/335 (76%), Gaps = 1/335 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FF KM+RSG  P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G
Sbjct: 85   VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 144

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLCKNGRW++A+                 +FTG+IM+ +DGGH +DCISI
Sbjct: 145  TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 204

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEI-KRKXXXXXXXXXXXXSLTLDS 1223
            F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI K K            +L  D 
Sbjct: 205  FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 264

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYS +L ASA A QWEYFEYVYKEM L  Y  DQ KHA LLVEASRARK +LLEHAF+ 
Sbjct: 265  YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 324

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
             LE GEIPH   FTEMI  ATAQ +YE+ VTLVN+MAHA ++VSE QWT+ F+ + DRIS
Sbjct: 325  FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 384

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGS 758
               L KLLD L N ++ +E T SNL ++L+ +CGS
Sbjct: 385  HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 419


>ref|XP_007047545.1| Tetratricopeptide repeat-like superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508699806|gb|EOX91702.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative isoform 1 [Theobroma cacao]
          Length = 897

 Score =  430 bits (1105), Expect = e-117
 Identities = 216/335 (64%), Positives = 256/335 (76%), Gaps = 1/335 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FF KM+RSG  P+AL+Y+VLV+AFWEEGK+NEAVEAVRDME+RGV+G
Sbjct: 386  VMLQSGKYDLVHEFFRKMKRSGEAPRALSYRVLVKAFWEEGKINEAVEAVRDMEQRGVIG 445

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLCKNGRW++A+                 +FTG+IM+ +DGGH +DCISI
Sbjct: 446  TASVYYELACCLCKNGRWRDAIIEVDKMKKLSQRKPLEITFTGLIMASLDGGHFNDCISI 505

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEI-KRKXXXXXXXXXXXXSLTLDS 1223
            F++MKDHC+PNIGTINAMLKVYG+NDMF KAKELFEEI K K            +L  D 
Sbjct: 506  FQYMKDHCAPNIGTINAMLKVYGQNDMFSKAKELFEEINKAKSGPYDSQNGKSTNLIPDG 565

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYS +L ASA A QWEYFEYVYKEM L  Y  DQ KHA LLVEASRARK +LLEHAF+ 
Sbjct: 566  YTYSLMLGASASALQWEYFEYVYKEMTLSGYHLDQTKHAILLVEASRARKWYLLEHAFDT 625

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
             LE GEIPH   FTEMI  ATAQ +YE+ VTLVN+MAHA ++VSE QWT+ F+ + DRIS
Sbjct: 626  FLEVGEIPHPLLFTEMIIQATAQSNYEKVVTLVNTMAHALYQVSEKQWTEAFEENGDRIS 685

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGS 758
               L KLLD L N ++ +E T SNL ++L+ +CGS
Sbjct: 686  HGSLSKLLDALSNCELSSEITASNLIRSLQYLCGS 720


>ref|XP_007155826.1| hypothetical protein PHAVU_003G234700g [Phaseolus vulgaris]
            gi|561029180|gb|ESW27820.1| hypothetical protein
            PHAVU_003G234700g [Phaseolus vulgaris]
          Length = 870

 Score =  429 bits (1104), Expect = e-117
 Identities = 243/504 (48%), Positives = 312/504 (61%), Gaps = 12/504 (2%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SG YDLVH+FFGKM+RSG  PKALTYKVLVR FW+EGKV EAV+A+RDMERRGV+G
Sbjct: 377  VMLESGNYDLVHEFFGKMKRSGEVPKALTYKVLVRTFWKEGKVEEAVKAIRDMERRGVIG 436

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TA VYYELACCLC  GRW++A+                 +FTGMI S M GGH+DD I I
Sbjct: 437  TAGVYYELACCLCNCGRWRDAILEVDNIRNLPRAKPLEVTFTGMIKSSMGGGHIDDSIRI 496

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKR-KXXXXXXXXXXXXSLTLDS 1223
            FE+M+DHC+PNIG IN MLKVYG+NDMF KAK LFEE+K  K            S   DS
Sbjct: 497  FEYMRDHCAPNIGAINTMLKVYGQNDMFSKAKVLFEEVKAAKSESYATPGGGNSSAVPDS 556

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+S+LEASA A QWEYFE+VY+EM +  YQ DQNKH  LLV+ASRA K HLLEHAF M
Sbjct: 557  YTYNSMLEASASAQQWEYFEHVYREMIVSGYQLDQNKHLLLLVKASRAGKLHLLEHAFNM 616

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            ILEAGEIPH  FF E++  A  QH+YERAV L+N++A+A F+VSE QWT++FK SEDRIS
Sbjct: 617  ILEAGEIPHHLFFFELVIQAIVQHNYERAVILINTLAYAPFRVSEKQWTNLFKESEDRIS 676

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDG 683
             + L++LLD LG+ DV++E+TVSNL+++L  +CGS   R          +  G   S +G
Sbjct: 677  HENLERLLDALGSCDVISESTVSNLTRSLHVLCGSGISR---------IIPFGSKDSVNG 727

Query: 682  KWKLNLIGRLGDGNTNPPNGAETNVY---DSAND------DVSLLSDSPSCXXXXXXXXX 530
            + +   I    D + N PN + T +    +S ND      +  L++ + +          
Sbjct: 728  QGRNERI----DDDQNVPNFSTTMMIEGTESENDIYVGSYNTELVTSTCTSDGVNEGDNN 783

Query: 529  XXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNSS--DSVP 356
                      D+   +S        +  +                 +S+ +N     + P
Sbjct: 784  DVMVFRPQNSDIEDGMSSQADRLECTDNLALDESSDELDKELSDDGSSEDDNGEGVTNKP 843

Query: 355  SASEILEGWRESRNKDGIFLPFQL 284
            +A EILE W+E R +DG  L  +L
Sbjct: 844  TAYEILELWKELREEDGSLLHSEL 867


>ref|XP_006841116.1| hypothetical protein AMTR_s00086p00094500 [Amborella trichopoda]
            gi|548843010|gb|ERN02791.1| hypothetical protein
            AMTR_s00086p00094500 [Amborella trichopoda]
          Length = 828

 Score =  429 bits (1102), Expect = e-117
 Identities = 222/351 (63%), Positives = 256/351 (72%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SGKYDLVHKFFG MRR G  PKALTYKVLV   W EGKVNEAVEAV DMERRGVVG
Sbjct: 396  VMLKSGKYDLVHKFFGTMRRGGLAPKALTYKVLVSCLWAEGKVNEAVEAVEDMERRGVVG 455

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC NGRWKEAM                 +FTGMI SCMDGG+V D ISI
Sbjct: 456  TASVYYELACCLCNNGRWKEAMTQIEKLKSLPLSRPLEVAFTGMIQSCMDGGYVRDGISI 515

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            FE+M+++C+ NIGTIN MLK+YG NDMF KAKELFE IK                + D+Y
Sbjct: 516  FENMQEYCTLNIGTINVMLKLYGCNDMFTKAKELFEGIKMPEARYDMNLDCHGVNSPDAY 575

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TYS +LEASA + QWEYFE+VYKEMAL  +Q DQNKHAWLLVEASRA   HLLEHAF+  
Sbjct: 576  TYSLMLEASAISLQWEYFEHVYKEMALSGFQLDQNKHAWLLVEASRAGMMHLLEHAFDSA 635

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LEAGE+PH S FTEMIC     HD++RA+TLVNSMAH S +VSE QWT++FK + D+IS 
Sbjct: 636  LEAGELPHWSIFTEMICQTLICHDFKRAITLVNSMAHVSLQVSEKQWTNLFKRNSDKISI 695

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTT 707
            + LQKL   L +  +++E  V+NLSK+L  +CGSN P    +  AL DVTT
Sbjct: 696  EELQKLRQCLNDKGLMSEPIVTNLSKSLCYLCGSNIP----TEYALCDVTT 742


>ref|XP_004509062.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Cicer arietinum]
          Length = 883

 Score =  427 bits (1098), Expect = e-117
 Identities = 246/529 (46%), Positives = 318/529 (60%), Gaps = 35/529 (6%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SG YDLVH+ FGKMRRSG  P+ALTYKVLVR  W+EGKV+EAV+ VRDMER+GV+G
Sbjct: 385  VMLESGNYDLVHELFGKMRRSGEVPEALTYKVLVRTCWKEGKVDEAVKVVRDMERKGVMG 444

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC  GRW++A+                 +FTGMI S MDGGH+DDCISI
Sbjct: 445  TASVYYELACCLCNCGRWQDAIPEVERIRRLSHARPLEVTFTGMIRSSMDGGHIDDCISI 504

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIK-RKXXXXXXXXXXXXSLTLDS 1223
            FE+M+DHC+PN+GT+N MLKVYG+NDMF KAK LFEE+K  K            S+  D+
Sbjct: 505  FEYMEDHCTPNVGTVNIMLKVYGQNDMFSKAKVLFEEVKVAKSDIYDFPKGGSTSIVPDA 564

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTYS +LEASARAHQWEYFE+VYKEM L  Y  DQNKH+ LLV+ASRA K HLLEHAF+M
Sbjct: 565  YTYSLMLEASARAHQWEYFEHVYKEMILSGYHLDQNKHSSLLVKASRAGKLHLLEHAFDM 624

Query: 1042 ILEAGEIP-HLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRI 866
            ILE GEIP HL FF E++  A AQH+YERAV L+++MA+A ++V+E QWT++FK ++DRI
Sbjct: 625  ILEVGEIPCHLIFF-ELVIQAIAQHNYERAVILLSTMAYAPYRVTEKQWTELFKKNKDRI 683

Query: 865  SKDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHD 686
            + + L++LLD LG  +VV+E TVSNLS++L  +CG  + R+  S +              
Sbjct: 684  NHENLERLLDALGKCNVVSEATVSNLSRSLHVLCGLGSSRNISSIIPF------------ 731

Query: 685  GKWKLNLIGRL--GDGNTNPPN-GAETNVYDSANDDVSLLSDSPSCXXXXXXXXXXXXXX 515
            G   +N +  +  G GN N PN      + + A    ++L  S                 
Sbjct: 732  GSENVNGLNEIIDGGGNGNVPNISGRMTIIEGAESGNNILLGSDQA-------------- 777

Query: 514  XXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQV--------NNSSD-- 365
                E  TF+V+   +D  N+  ++                  +V        + SSD  
Sbjct: 778  ----ESDTFTVNRNQIDRVNNNDVVVCTPQNCNIDDKVSLCADKVEFCDHLALDKSSDGS 833

Query: 364  --------------------SVPSASEILEGWRESRNKDGIFLPFQLKC 278
                                  PSA +ILE W+E R +D   L  +L C
Sbjct: 834  DDELSDDESYEDDDVDDGVIDKPSAYQILEAWKEMREEDKTLLHSELDC 882


>ref|XP_003608531.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355509586|gb|AES90728.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 877

 Score =  420 bits (1080), Expect = e-115
 Identities = 204/339 (60%), Positives = 257/339 (75%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSG YDLVH+ F KM+R+G  P+ALTYKV+VR FW+EGKV+EAV+AVRDMERRGV+G
Sbjct: 390  VMLQSGNYDLVHELFEKMQRNGEVPEALTYKVMVRTFWKEGKVDEAVKAVRDMERRGVMG 449

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            TASVYYELACCLC  GRW++A                  +FTGMI S MDGGH+DDCI I
Sbjct: 450  TASVYYELACCLCNCGRWQDATLEVEKIKRLPHAKPLEVTFTGMIRSSMDGGHIDDCICI 509

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            FE+M+DHC+PN+GT+N MLKVY +NDMF  AK LFEE+K               L  D+Y
Sbjct: 510  FEYMQDHCAPNVGTVNTMLKVYSQNDMFSTAKVLFEEVK----------VAKSDLRPDAY 559

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TY+ +LEAS+R HQWEYFE+VYKEM L  Y  DQNKH  LLV+ASRA K HLLEHAF+M+
Sbjct: 560  TYNLMLEASSRGHQWEYFEHVYKEMILSGYHLDQNKHLPLLVKASRAGKLHLLEHAFDMV 619

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LEAGEIPH  FF E++  A AQH+YERA+ L+++MAHA ++V+E QWT++FK +EDRI+ 
Sbjct: 620  LEAGEIPHHLFFFELVIQAIAQHNYERAIILLSTMAHAPYRVTEKQWTELFKENEDRINH 679

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRD 743
            + L++LLD LGN +VV+E T+SNLS++L  +CG  + R+
Sbjct: 680  ENLKRLLDDLGNCNVVSEATISNLSRSLHDLCGLGSSRN 718


>emb|CAN83934.1| hypothetical protein VITISV_035768 [Vitis vinifera]
          Length = 615

 Score =  419 bits (1077), Expect = e-114
 Identities = 240/463 (51%), Positives = 286/463 (61%), Gaps = 10/463 (2%)
 Frame = -3

Query: 1666 VLVRAFWEEGKVNEAVEAVRDMERRGVVGTASVYYELACCLCKNGRWKEAMXXXXXXXXX 1487
            VLVRAFWEEGKVNEAVE VRDMERRGVVG ASVYYELACCLC NGRW++A+         
Sbjct: 12   VLVRAFWEEGKVNEAVEVVRDMERRGVVGIASVYYELACCLCNNGRWQDAIVEVEKLKKR 71

Query: 1486 XXXXXXXXSFTGMIMSCMDGGHVDDCISIFEHMKDHCSPNIGTINAMLKVYGRNDMFVKA 1307
                    +FTGMI S MDGGH+DDC+SIFEHMK HCSPNIGTINAMLKVYGRNDMF KA
Sbjct: 72   PHSKPLEVTFTGMITSSMDGGHLDDCLSIFEHMKYHCSPNIGTINAMLKVYGRNDMFSKA 131

Query: 1306 KELFEEIKRKXXXXXXXXXXXXS-LTLDSYTYSSILEASARAHQWEYFEYVYKEMALCLY 1130
            KELFEE KR               L  D YTYSS+LEASA AHQWE+FEYVYKEM L  Y
Sbjct: 132  KELFEETKRSTFASNTCMDDGSISLVPDLYTYSSMLEASASAHQWEFFEYVYKEMTLSGY 191

Query: 1129 QFDQNKHAWLLVEASRARKGHLLEHAFEMILEAGEIPHLSFFTEMICLATAQHDYERAVT 950
            Q DQ+KHA LL +ASRA K HLLEHAF+ ILEAGEIPH S FTEMIC ATAQH+YERAVT
Sbjct: 192  QLDQSKHALLLGKASRAGKWHLLEHAFDTILEAGEIPHPSIFTEMICQATAQHNYERAVT 251

Query: 949  LVNSMAHASFKVSENQWTDIFKGSEDRISKDGLQKLLDTLGNSDVVTETTVSNLSKALKS 770
            L+N+MAHA F VSE QWTD+F  ++DRIS+  L+KLLD+L N DV  E TVSNL K+L+S
Sbjct: 252  LINAMAHAPFVVSEKQWTDLFV-TDDRISRVNLEKLLDSLHNCDVAEEATVSNLYKSLQS 310

Query: 769  ICGSNAPRDSLSSLALDDVTTGRSLSHDGKWKLNLIGRLGDGNTNPPNGAETNVYDSAND 590
            +CGS    D  SS+A  D         +   +  L G  G+ + N     +    D+   
Sbjct: 311  LCGSGTSMDQ-SSVAFGD---------EAMIRTPLNGNSGELDDNKKVFFQKFSADARGS 360

Query: 589  DVSLLSDSPSCXXXXXXXXXXXXXXXXXIED---------VTFSVSGGCVDYRNSRPILP 437
            D+S   + P                    ED           F+ +    +  ++ P   
Sbjct: 361  DLSPHENPPVKNSDVTFDIFSVNLTRSEEEDDDTDGEAISEAFNYACNGDEVASNEPNTL 420

Query: 436  GXXXXXXXXXXXXXXTSQVNNSSDSVPSASEILEGWRESRNKD 308
                             + ++   ++PSA+EILE W++SR +D
Sbjct: 421  DGNSEGINKIELNMRAKEDDSHGSNLPSANEILETWKKSRERD 463


>ref|XP_006393964.1| hypothetical protein EUTSA_v10003664mg [Eutrema salsugineum]
            gi|557090603|gb|ESQ31250.1| hypothetical protein
            EUTSA_v10003664mg [Eutrema salsugineum]
          Length = 811

 Score =  417 bits (1071), Expect = e-113
 Identities = 214/376 (56%), Positives = 270/376 (71%), Gaps = 3/376 (0%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SGKYD VH+FF KMR SG  PKA+TYKVLVRA W E K+ EAVEAVRDME++GVVG
Sbjct: 397  VMLESGKYDRVHEFFRKMRSSGEAPKAITYKVLVRALWRENKIEEAVEAVRDMEQKGVVG 456

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            T SVYYELACCLC NGRW++AM                 +FTG+I + ++GGHVDDC+SI
Sbjct: 457  TGSVYYELACCLCNNGRWRDAMLEVGRMRRLENCRPLEITFTGLIAASLNGGHVDDCMSI 516

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            F++MKD C PNIGT+N ML+VYGRNDMF +AKELFEEI R+             L  D Y
Sbjct: 517  FQYMKDKCDPNIGTVNTMLRVYGRNDMFSEAKELFEEIVREKEAH---------LVPDEY 567

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TYS +LEASAR+ QWEYFE+VY+ M L  YQ DQ KHA +L+EASRA K  LLEHAF+ I
Sbjct: 568  TYSFMLEASARSLQWEYFEHVYQTMILSGYQIDQTKHAPMLIEASRAGKWSLLEHAFDAI 627

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LE GEIPH  FFTEM+C ATA+ DY+RA+TL+N++A ASF++SE QWTD+F+ ++D +++
Sbjct: 628  LEDGEIPHPLFFTEMLCHATAKGDYQRAITLINTVALASFQISEEQWTDLFEENQDWLTQ 687

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRSLSHDGK 680
            + LQ L D + + D  +E TV+NLSK+LKS+CG ++   S   L   DVTT     H  K
Sbjct: 688  ENLQNLCDYILDCDYASEPTVANLSKSLKSLCGVSSSSSSTEPLLAIDVTT-----HSEK 742

Query: 679  WKLNLI---GRLGDGN 641
             + +L+    R+ DGN
Sbjct: 743  PEEDLLFHDTRMKDGN 758


>ref|XP_006363825.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Solanum tuberosum]
          Length = 864

 Score =  408 bits (1049), Expect = e-111
 Identities = 234/509 (45%), Positives = 304/509 (59%), Gaps = 17/509 (3%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKY+LVH+FFGKM+RSG   KAL+YKVLV++FWEEG+VNEA++AVR+ME+RGVVG
Sbjct: 381  VMLQSGKYELVHEFFGKMKRSGEALKALSYKVLVKSFWEEGRVNEAIQAVREMEQRGVVG 440

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            +ASVYYELACCLC +G WKEA                  +FTGMI+S MDGGH+D CI I
Sbjct: 441  SASVYYELACCLCYHGMWKEAFLEIEKLKMLRRTRPLAVTFTGMILSSMDGGHIDGCICI 500

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTL-DS 1223
            +EH K HC P+IG INAMLKVYG+NDMF KAKELFE  K +            S    D+
Sbjct: 501  YEHSKKHCEPDIGIINAMLKVYGKNDMFYKAKELFEWAKTESSGPQLSQDDFSSARRPDA 560

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+S+LE+SA + QWEYFEYVYKEMAL  Y  DQ++HA+LLVEAS+A K HLLEHAF+ 
Sbjct: 561  YTYTSMLESSAFSLQWEYFEYVYKEMALAGYLLDQSRHAYLLVEASKAGKVHLLEHAFDA 620

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            ILE G+IPH SFF E++C AT QHD+ERA+ L+  M H  F+VS+ +W D+F  + +R+S
Sbjct: 621  ILEVGQIPHPSFFFEILCQATCQHDHERALALIKLMVHVPFQVSKQEWIDLFNSNNERLS 680

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDD---VTTGRSLS 692
               L+ LLD +    + ++TT+ NL +AL+S+CGS     + S L +++   +T   +L+
Sbjct: 681  HSSLRGLLDVICRQSLGSDTTIVNLCRALESVCGS----CTSSMLIINEPAKLTDASALA 736

Query: 691  HDGKWKLNLIGRLGDGN---TNPPNGAETNV----YDSAND------DVSLLSDSPSCXX 551
             D            DG+    N P  AE  +     D A D      D  L+SD      
Sbjct: 737  AD-----------KDGSPYRCNAPANAELPLQHVQVDEAYDEREKGADRELVSDMSHLSH 785

Query: 550  XXXXXXXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNS 371
                            +++TF      +D  +   +                     N S
Sbjct: 786  REDMRAGTNTIFELSDDELTFDDQSDYLDDIDQLEL-------------GMSSDEDDNFS 832

Query: 370  SDSVPSASEILEGWRESRNKDGIFLPFQL 284
               VPSA EIL+ W + R KD  F  FQL
Sbjct: 833  ETKVPSAYEILKTWEDMRKKDATFFNFQL 861


>ref|XP_004233609.1| PREDICTED: pentatricopeptide repeat-containing protein At5g67570,
            chloroplastic-like [Solanum lycopersicum]
          Length = 1092

 Score =  407 bits (1045), Expect = e-110
 Identities = 225/505 (44%), Positives = 303/505 (60%), Gaps = 13/505 (2%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VMLQSGKYDLVH+FFGKM++SG   KAL+YK+LV++FWEEG+VNEA++AVR+ME+RGVVG
Sbjct: 602  VMLQSGKYDLVHEFFGKMKKSGEALKALSYKILVKSFWEEGRVNEAIQAVREMEQRGVVG 661

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            +ASVYYELACCLC +G WKEA                  +F+GMI+S MDGGH+D CI I
Sbjct: 662  SASVYYELACCLCYHGMWKEAFLEVRKLKMLRRTRPLAVTFSGMILSSMDGGHIDGCICI 721

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXS-LTLDS 1223
            +++ K HC P+IG INAMLKVYG+NDMF KAKELFE  K +            S L+ D+
Sbjct: 722  YDYSKKHCKPDIGIINAMLKVYGKNDMFYKAKELFEWAKTESHGRQLSKDDFSSSLSPDA 781

Query: 1222 YTYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEM 1043
            YTY+S+LE+SA + QWEYFEYVYKEMAL  +  DQ++HA+LLVEAS+A K HLLEHAF+ 
Sbjct: 782  YTYTSMLESSACSLQWEYFEYVYKEMALAGHLLDQSRHAYLLVEASKAGKVHLLEHAFDA 841

Query: 1042 ILEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRIS 863
            ILE G IPH SFF E++C AT QHD+ERA+ L+ SM H  F+VS+ +W D+F  +  RIS
Sbjct: 842  ILEVGHIPHPSFFFEILCQATCQHDHERALALIKSMVHVPFQVSKQEWIDLFNSNNGRIS 901

Query: 862  KDGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDD---VTTGRSLS 692
               L++LLD + +  + ++ T+ NL +AL+S+CGS     + S L +D+   +T   +++
Sbjct: 902  HSSLRELLDVICSHSLGSDATIVNLCRALRSVCGS----CTSSMLIIDEPAKLTDASAMT 957

Query: 691  HDGKWKLNLIGRLGDGNTNPPNGAETNVYDSAND---------DVSLLSDSPSCXXXXXX 539
             D    L       + +  P    + +  D +++         D  L+SD          
Sbjct: 958  ADKDGSLYRCSVPANTDELPLQHVQVDEDDCSDEAYDEREKGADGELVSDMSHLSHREDE 1017

Query: 538  XXXXXXXXXXXIEDVTFSVSGGCVDYRNSRPILPGXXXXXXXXXXXXXXTSQVNNSSDSV 359
                        +++TF                P                   N+S   V
Sbjct: 1018 RAGTNTMFELADDELTFDDQ-------------PDYLDDIDQLELGMSSDEDDNSSETKV 1064

Query: 358  PSASEILEGWRESRNKDGIFLPFQL 284
            PSA EIL+ W + R KD  F  FQL
Sbjct: 1065 PSAYEILKTWEDMRKKDATFFNFQL 1089


>ref|XP_006279544.1| hypothetical protein CARUB_v10028435mg [Capsella rubella]
            gi|482548248|gb|EOA12442.1| hypothetical protein
            CARUB_v10028435mg [Capsella rubella]
          Length = 801

 Score =  404 bits (1038), Expect = e-110
 Identities = 203/354 (57%), Positives = 259/354 (73%)
 Frame = -3

Query: 1759 VMLQSGKYDLVHKFFGKMRRSGATPKALTYKVLVRAFWEEGKVNEAVEAVRDMERRGVVG 1580
            VML+SGKYD VH FF KM+ SG  PKA+TYKVLVRA W EGK+ EAVEAVRDME++GV+G
Sbjct: 391  VMLESGKYDRVHDFFRKMKSSGEAPKAITYKVLVRALWREGKIEEAVEAVRDMEQKGVIG 450

Query: 1579 TASVYYELACCLCKNGRWKEAMXXXXXXXXXXXXXXXXXSFTGMIMSCMDGGHVDDCISI 1400
            T SVYYELACCLC NGRW +AM                 +FTG+I + ++GGHV DC++I
Sbjct: 451  TGSVYYELACCLCNNGRWHDAMLEVGRMKRLENCKPLEITFTGLIAASLNGGHVGDCMAI 510

Query: 1399 FEHMKDHCSPNIGTINAMLKVYGRNDMFVKAKELFEEIKRKXXXXXXXXXXXXSLTLDSY 1220
            F++MKD C PNIGT+N ML+VYGRNDMF +AKELFEEI  +             L  + Y
Sbjct: 511  FQYMKDRCDPNIGTVNMMLRVYGRNDMFSEAKELFEEIVSRKETH---------LAPNEY 561

Query: 1219 TYSSILEASARAHQWEYFEYVYKEMALCLYQFDQNKHAWLLVEASRARKGHLLEHAFEMI 1040
            TYS +LEASAR+ QWEYFE+VY+ M L  YQ DQ KHA +L+EASRA K  LLEHAF+ +
Sbjct: 562  TYSFMLEASARSLQWEYFEHVYQTMILSGYQMDQTKHAPMLIEASRAGKWSLLEHAFDAV 621

Query: 1039 LEAGEIPHLSFFTEMICLATAQHDYERAVTLVNSMAHASFKVSENQWTDIFKGSEDRISK 860
            LE GEIPH  FFTE++C ATA+ DY+RA+TL+N++A ASF++SE +WTD+F+  +D +++
Sbjct: 622  LEDGEIPHPLFFTELLCHATAKGDYQRAITLINTVALASFQISEEEWTDLFEEHQDWLTQ 681

Query: 859  DGLQKLLDTLGNSDVVTETTVSNLSKALKSICGSNAPRDSLSSLALDDVTTGRS 698
            + LQKL D L + D V E TVSNLSK+LKS+CGS++   +   LA+D  T  +S
Sbjct: 682  ENLQKLSDHLLDCDYVNEPTVSNLSKSLKSLCGSSS-SSTQPLLAIDVPTPSQS 734

