BLASTX nr result

ID: Rehmannia25_contig00017314 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00017314
         (1126 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containi...   477   e-132
ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containi...   471   e-130
ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   452   e-124
gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily p...   415   e-113
gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus pe...   415   e-113
ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citr...   404   e-110
ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containi...   403   e-110
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   400   e-109
gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]     393   e-107
ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Caps...   391   e-106
ref|XP_002326162.1| predicted protein [Populus trichocarpa]           389   e-106
ref|XP_006381507.1| pentatricopeptide repeat-containing family p...   389   e-105
ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containi...   388   e-105
ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutr...   385   e-104
ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containi...   385   e-104
ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutr...   379   e-103
gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   376   e-102
gb|ESW34706.1| hypothetical protein PHAVU_001G174000g [Phaseolus...   376   e-102
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   375   e-101
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                373   e-101

>ref|XP_006367266.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum tuberosum]
          Length = 859

 Score =  477 bits (1228), Expect = e-132
 Identities = 235/358 (65%), Positives = 295/358 (82%), Gaps = 2/358 (0%)
 Frame = -3

Query: 1121 RVPLRPYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILI 942
            R     + H + RVPFRPTTSTYNIL+KACG+DYYRAKALM+EMK +GLSPNHI+W+ILI
Sbjct: 499  RTSASSHGHFSTRVPFRPTTSTYNILIKACGSDYYRAKALMEEMKEVGLSPNHITWTILI 558

Query: 941  DICGGSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIK 762
            DICGGSGNV GA+QILR+MRE+GIQPDV+TYTT IK+CVE+K+ K AF+LF  M++YQIK
Sbjct: 559  DICGGSGNVEGALQILRAMREAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIK 618

Query: 761  PNLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEH 582
            PN+VTYNT+LRAR RYGSLQ+VQQCLA+YQHMRKAGYKP DYYLK+L+E+WCEGVIQN +
Sbjct: 619  PNMVTYNTLLRARSRYGSLQEVQQCLAIYQHMRKAGYKPNDYYLKQLIEQWCEGVIQNGN 678

Query: 581  QNKSQFASR-LTEFGPQSLLLEKVAEHLQ-DSAESLSIDLRGLSKVEARIVVLAVLRMIK 408
            Q K  F++R  T+ GP+S++L+KVAEHLQ DSA S+SI+LRGLSKVEARIVVLAVLRMI+
Sbjct: 679  QRKYNFSTRNRTDLGPESMILDKVAEHLQKDSANSISINLRGLSKVEARIVVLAVLRMIR 738

Query: 407  EKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDV 228
            EK+ AG+SIK+DV I LG++E+G  A   ES VKEA+V+LLQHDLGL V ++  R GND 
Sbjct: 739  EKYTAGDSIKEDVQIFLGVQEVGIRAVGQESVVKEAIVKLLQHDLGLEVISAASRIGNDR 798

Query: 227  RRDKESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRRTRSS 54
             +D  +    + N+EE  E+      + SPTR+PVVLQ++++T++SL  WL RR  +S
Sbjct: 799  NQDGINHPDKHSNMEENAERVILRANVHSPTRKPVVLQKMRITKESLQSWLTRRLDAS 856


>ref|XP_004246707.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Solanum lycopersicum]
          Length = 857

 Score =  471 bits (1213), Expect = e-130
 Identities = 235/358 (65%), Positives = 293/358 (81%), Gaps = 2/358 (0%)
 Frame = -3

Query: 1121 RVPLRPYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILI 942
            R     + HI+ RVPF PTTSTYNILMKACG+DYYRAKALM+EMK +GLSPNHI+W+ILI
Sbjct: 501  RTSASSHRHISTRVPFIPTTSTYNILMKACGSDYYRAKALMEEMKEVGLSPNHITWTILI 560

Query: 941  DICGGSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIK 762
            DICGGSGNV GA+QILR MRE+GIQPDV+TYTT IK+CVE+K+ K AF+LF  M++YQIK
Sbjct: 561  DICGGSGNVEGALQILRVMREAGIQPDVVTYTTIIKVCVENKDFKSAFSLFAAMKRYQIK 620

Query: 761  PNLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEH 582
            PN+VTYNT+LRAR RYGSLQ+VQQCLA+YQ MRKAGYKP DYYLK+L+E+WCEGVIQN +
Sbjct: 621  PNMVTYNTLLRARSRYGSLQEVQQCLAIYQDMRKAGYKPNDYYLKQLIEQWCEGVIQNAN 680

Query: 581  QNKSQFASR-LTEFGPQSLLLEKVAEHLQ-DSAESLSIDLRGLSKVEARIVVLAVLRMIK 408
            Q K  F++R  T+ GPQS++LEKVAEHLQ DSA S+SI+LRGL+KVEARIVVLAVLRMI+
Sbjct: 681  QRKYNFSTRNRTDLGPQSMILEKVAEHLQKDSANSISINLRGLTKVEARIVVLAVLRMIR 740

Query: 407  EKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDV 228
            EK+ AG+SIKDDV I LG++E+G  A K ES VKEA+++LLQHDLGL V ++    GN +
Sbjct: 741  EKYTAGDSIKDDVQIFLGVKEVGIRAVKQESVVKEAIIQLLQHDLGLEVISAASTIGNGI 800

Query: 227  RRDKESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRRTRSS 54
                  P + + N+EE  E+     ++ SPTR+PVVLQ++++T++SL  WL RR  +S
Sbjct: 801  NH----PDNKHSNMEENAERVILRPSVYSPTRKPVVLQKMRITKESLQSWLTRRLDAS 854


>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic [Vitis vinifera]
            gi|297741486|emb|CBI32618.3| unnamed protein product
            [Vitis vinifera]
          Length = 842

 Score =  452 bits (1162), Expect = e-124
 Identities = 228/341 (66%), Positives = 282/341 (82%), Gaps = 3/341 (0%)
 Frame = -3

Query: 1079 PFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMGAIQ 900
            PF PTT+TYNILMKACGTDYYRAKALMDEMKT GLSPNHISWSILIDICGG+GN++GA++
Sbjct: 496  PFTPTTTTYNILMKACGTDYYRAKALMDEMKTAGLSPNHISWSILIDICGGTGNIVGAVR 555

Query: 899  ILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILRARD 720
            IL++MRE+GI+PDV+ YTTAIK CVE KN+K+AF+LF EM++YQI+PNLVTYNT+LRAR 
Sbjct: 556  ILKTMREAGIKPDVVAYTTAIKYCVESKNLKIAFSLFAEMKRYQIQPNLVTYNTLLRARS 615

Query: 719  RYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQFAS-RLTEF 543
            RYGSL +VQQCLA+YQHMRKAGYK  DYYLK L+EEWCEGVIQ+ + N+S+F+S    ++
Sbjct: 616  RYGSLHEVQQCLAIYQHMRKAGYKSNDYYLKELIEEWCEGVIQDNNLNQSKFSSVNRADW 675

Query: 542  G-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAGNSIKDDV 369
            G PQSLLLEKVA HLQ S AESL+IDL+GL++VEARIVVLAVLRMIKE +I G+ IKDD+
Sbjct: 676  GRPQSLLLEKVAAHLQKSVAESLAIDLQGLTQVEARIVVLAVLRMIKENYILGHPIKDDI 735

Query: 368  SIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKESPAHSNLN 189
             IILG++++  +  + ES VK A+++LLQ +LGL V  +G +   D R +   P  S+ +
Sbjct: 736  LIILGIKKVDANLVEHESPVKGAIIKLLQDELGLEVAFAGPKIALDKRINLGGPPGSDPD 795

Query: 188  LEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
             +E   ++  P  LES TRRP VLQR KVTRKSL+HWLQRR
Sbjct: 796  WQEALGRNRLPTELESSTRRPAVLQRFKVTRKSLDHWLQRR 836


>gb|EOY07712.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao]
          Length = 858

 Score =  415 bits (1067), Expect = e-113
 Identities = 218/344 (63%), Positives = 270/344 (78%), Gaps = 3/344 (0%)
 Frame = -3

Query: 1088 ERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMG 909
            ++  F PTT+TYNILMKAC TDYYRAKALMDEMK++GLSPNH+SWSILIDIC GSGNV G
Sbjct: 510  KKFSFTPTTATYNILMKACCTDYYRAKALMDEMKSVGLSPNHVSWSILIDICRGSGNVEG 569

Query: 908  AIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILR 729
            AIQIL++M  +GI+PDV+ YTTAIK+CV  KN+KLAF+LF+EM++Y+++PNLVTYNT+LR
Sbjct: 570  AIQILKTMHVTGIKPDVVAYTTAIKVCVGSKNLKLAFSLFEEMKRYRVQPNLVTYNTLLR 629

Query: 728  ARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVI-QNEHQNKSQFASRL 552
            AR RYGSL +VQQCLA+YQ MRKAGYK  D YLK L+EEWCEGVI +N H+ +   + + 
Sbjct: 630  ARSRYGSLHEVQQCLAIYQDMRKAGYKSNDIYLKELIEEWCEGVIKENNHKREGLSSCKR 689

Query: 551  TEF-GPQSLLLEKVAEHLQ-DSAESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAGNSIK 378
            T+   P SLLLEK+A HLQ  +AES +IDLRGL+KVEARIVVLAVLRMIKE  I G+S+K
Sbjct: 690  TDLERPHSLLLEKIAVHLQMSTAESPAIDLRGLTKVEARIVVLAVLRMIKENHILGHSVK 749

Query: 377  DDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKESPAHS 198
            DD+ IILG+ E   +A K +S VK+AV++LLQ +LGL V     +  N +  D ++P  +
Sbjct: 750  DDMLIILGVSERHANAAKQKSEVKDAVMKLLQDELGLEVLLVEPQVKNGL-VDLQTPIDA 808

Query: 197  NLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
            +  L ET  K+       S TRRPV+LQRLKVTRKSLNHWL RR
Sbjct: 809  DPVLLETVGKNSLSSKPLSSTRRPVILQRLKVTRKSLNHWLWRR 852


>gb|EMJ09280.1| hypothetical protein PRUPE_ppa001520mg [Prunus persica]
          Length = 809

 Score =  415 bits (1066), Expect = e-113
 Identities = 209/345 (60%), Positives = 272/345 (78%), Gaps = 3/345 (0%)
 Frame = -3

Query: 1088 ERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMG 909
            +R+ F+PTT+TYN LMKACGTDYY AKAL+DEM+ +GL PN ISWSIL DICGGSGNV G
Sbjct: 463  KRLSFKPTTTTYNTLMKACGTDYYHAKALLDEMRAVGLYPNQISWSILADICGGSGNVEG 522

Query: 908  AIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILR 729
            A+QIL++MR +G++PDV+ YTTAIK+CVE++N++LA +LF EM+KYQI PNLVTYNT+LR
Sbjct: 523  ALQILKNMRAAGMKPDVVAYTTAIKVCVENENLELALSLFGEMKKYQIHPNLVTYNTLLR 582

Query: 728  ARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQFAS-RL 552
            AR RYGS+ +VQQCLA+YQ MRKAGYK  DYYL++L+EEWCEGVIQ+ +  + +F+S   
Sbjct: 583  ARSRYGSVSEVQQCLAIYQDMRKAGYKSNDYYLEQLIEEWCEGVIQDSNAKQEEFSSCNK 642

Query: 551  TEFG-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAGNSIK 378
            T+ G P SLLLEKVAEHLQ   AE+L++DL+GL+KVEARIVVLAVLRMIKE +  G+S+K
Sbjct: 643  TDIGRPGSLLLEKVAEHLQTHIAETLAVDLQGLTKVEARIVVLAVLRMIKENYTLGHSVK 702

Query: 377  DDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKESPAHS 198
            DD+ I++G  E+ G +T     VK+A+ +LLQ +LGL V A+G + G D   ++ +   S
Sbjct: 703  DDMLIVVG--EVDGGSTTQNLEVKDAITKLLQDELGLKVLAAGAKVGLDTTIERGNTTDS 760

Query: 197  NLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRRT 63
            + +L+E   +   P  L   TRRPV L+RLKVTR SL HWL+RR+
Sbjct: 761  DQDLDEMSGRDELPAELIYSTRRPVALERLKVTRGSLQHWLRRRS 805



 Score = 72.0 bits (175), Expect = 4e-10
 Identities = 46/170 (27%), Positives = 76/170 (44%), Gaps = 32/170 (18%)
 Frame = -3

Query: 1058 TYNILMK--ACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMGAIQILRSM 885
            TY+ ++K  A    ++ A  + ++M + G++PN ++WS LI  C  +G V  AIQ+   M
Sbjct: 368  TYSTIVKVFADAKLWHMALNVKEDMLSAGVTPNTVTWSSLISACANAGIVEKAIQLFEEM 427

Query: 884  RESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILRA------- 726
              +G +P+   +   +  CVE      AF LF  +++   KP   TYNT+++A       
Sbjct: 428  LLAGSEPNSQCFNILLHACVEANQYDRAFRLFQSLKRLSFKPTTTTYNTLMKACGTDYYH 487

Query: 725  -----------------------RDRYGSLQQVQQCLAVYQHMRKAGYKP 645
                                    D  G    V+  L + ++MR AG KP
Sbjct: 488  AKALLDEMRAVGLYPNQISWSILADICGGSGNVEGALQILKNMRAAGMKP 537


>ref|XP_006428907.1| hypothetical protein CICLE_v10011055mg [Citrus clementina]
            gi|568853887|ref|XP_006480569.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Citrus sinensis]
            gi|557530964|gb|ESR42147.1| hypothetical protein
            CICLE_v10011055mg [Citrus clementina]
          Length = 850

 Score =  404 bits (1039), Expect = e-110
 Identities = 213/363 (58%), Positives = 265/363 (73%), Gaps = 8/363 (2%)
 Frame = -3

Query: 1118 VPLRPYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILID 939
            VP   YS   +R  F+PTT+TYNILMKAC TDYYR KALMDEM+T+GLSPNHISW+ILID
Sbjct: 497  VPNSHYSSFDKRFSFKPTTTTYNILMKACCTDYYRVKALMDEMRTVGLSPNHISWTILID 556

Query: 938  ICGGSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKP 759
             CGGSGNV GA+QIL+ MRE G+ PDV+ YTTAIK+CV  K +KLAF+LF+EM+ YQI+P
Sbjct: 557  ACGGSGNVEGALQILKIMREDGMSPDVVAYTTAIKVCVRSKRLKLAFSLFEEMKHYQIQP 616

Query: 758  NLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQ 579
            NLVTY T+LRAR RYGSL +VQQCLAVYQ M KAGYK  D YLK ++EEWCEGVIQ+++Q
Sbjct: 617  NLVTYITLLRARSRYGSLHEVQQCLAVYQDMWKAGYKANDTYLKEVIEEWCEGVIQDKNQ 676

Query: 578  NKSQ--FASRLTEFGPQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIK 408
            N+ +     R     PQSLLLEKVA HLQ S AE+L+IDL+GL+KVEARIVVLAVL+M+K
Sbjct: 677  NQGEVTLCRRTNSQRPQSLLLEKVAVHLQKSAAENLAIDLQGLTKVEARIVVLAVLQMMK 736

Query: 407  EKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDV 228
            E +  G  +KDD+ I+LG  ++     K +  VK+A+ +LLQ DLGL VF  G       
Sbjct: 737  ENYSLGVPVKDDLMIVLGPNKVNKIQAKHDLEVKDAITKLLQDDLGLKVFLDG------- 789

Query: 227  RRDKESPAHSNLNLEETREKSGSPQ-----ALESPTRRPVVLQRLKVTRKSLNHWLQRRT 63
                 S  H N ++++  +   +        L+S TRRP +LQRLKV +KSL+HWLQRR 
Sbjct: 790  ----PSIQHKNAHMQKLLDSESNMAKTLHIELKSSTRRPKILQRLKVPKKSLHHWLQRRV 845

Query: 62   RSS 54
             S+
Sbjct: 846  GST 848



 Score = 58.5 bits (140), Expect = 5e-06
 Identities = 39/155 (25%), Positives = 80/155 (51%), Gaps = 4/155 (2%)
 Frame = -3

Query: 1055 YNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMGAIQI---LRSM 885
            +N LM     D      +   M+ LG+  +  S++IL+  C  +GN + A +I   ++ +
Sbjct: 294  FNSLMNVNAHDLKFTLEVYKNMQKLGVMADMASYNILLKACCLAGNTVLAQEIYGEVKHL 353

Query: 884  RESGI-QPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILRARDRYGS 708
               G+ + DV TY+T +K+  + K  ++A  + ++M    + PN +T+++++ A    G 
Sbjct: 354  EAKGVLKLDVFTYSTIVKVFADAKWWQMALKVKEDMLSAGVTPNTITWSSLINACANAG- 412

Query: 707  LQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCE 603
               V+Q + +++ MR+AG +P       L++   E
Sbjct: 413  --LVEQAMHLFEEMRQAGCEPNSQCCNILLQACVE 445


>ref|XP_004305248.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 847

 Score =  403 bits (1035), Expect = e-110
 Identities = 209/350 (59%), Positives = 268/350 (76%), Gaps = 4/350 (1%)
 Frame = -3

Query: 1100 SHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSG 921
            S+  E + F+PTT+TYN LMKACG+DYY AKALMDEMKT+GL PN I+WSIL DICG SG
Sbjct: 494  SNFAEGLSFKPTTTTYNTLMKACGSDYYHAKALMDEMKTVGLLPNQITWSILADICGSSG 553

Query: 920  NVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYN 741
            NV GA+QIL+SMR +GIQPDV+ YTTAIKICVE +N+ LA  LF EM+KYQI PNLVTYN
Sbjct: 554  NVQGALQILKSMRVAGIQPDVVAYTTAIKICVESENLDLALLLFAEMKKYQIHPNLVTYN 613

Query: 740  TILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQFA 561
            T+LRAR RYGS+ +VQQCLA+YQ MRKAGYKP DYYL++L+EEWCEGVIQ+    + +F+
Sbjct: 614  TLLRARSRYGSVSEVQQCLAIYQDMRKAGYKPNDYYLEQLIEEWCEGVIQDSCPKQGEFS 673

Query: 560  -SRLTEFG-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAG 390
                 + G P SLLLEKVAEHLQ   A++L++DL+GL+KVEARIVVLAVLRMIKE +I G
Sbjct: 674  YGDKADIGRPGSLLLEKVAEHLQQHIADTLAVDLQGLTKVEARIVVLAVLRMIKENYILG 733

Query: 389  NSIKDDVSIILGL-EELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKE 213
            +S+KDD+ I++G+ +E+ G +T     VK+A+ +LLQ +LGL V ++  +   D     +
Sbjct: 734  DSVKDDMLIMVGVHDEVDGGSTAHNLEVKDAITKLLQDELGLKVLSTVPKVALDTTIVSQ 793

Query: 212  SPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRRT 63
            +   S+ NL+E   +      L   TRRPVVL+RLKV+RKSL  WL++R+
Sbjct: 794  NTIDSDQNLDEKPLRKELQPELIYSTRRPVVLERLKVSRKSLQQWLRKRS 843


>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223535493|gb|EEF37162.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score =  400 bits (1029), Expect = e-109
 Identities = 209/344 (60%), Positives = 266/344 (77%), Gaps = 3/344 (0%)
 Frame = -3

Query: 1088 ERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMG 909
            ++ PF P+++TYN LMKACG+DY RAKALMDEM+ +GLSPNHISWSILIDICG SGN+ G
Sbjct: 440  KKFPFTPSSATYNTLMKACGSDYNRAKALMDEMQAVGLSPNHISWSILIDICGSSGNMEG 499

Query: 908  AIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILR 729
            AIQIL++MR +GI+PDVI YTTAIK+ VE KN+K+AF+LF EM++YQ+KPNLVTY+T+LR
Sbjct: 500  AIQILKNMRMAGIEPDVIAYTTAIKVSVESKNLKMAFSLFAEMKRYQLKPNLVTYDTLLR 559

Query: 728  ARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQF-ASRL 552
            AR RYGSL++VQQCLA+YQ MRKAGYK  D YLK+L+EEWCEGVIQ+  Q +  F   + 
Sbjct: 560  ARTRYGSLKEVQQCLAIYQDMRKAGYKSNDNYLKQLIEEWCEGVIQDNDQCQDDFKPCKR 619

Query: 551  TEFG-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAGNSIK 378
             EFG P SLLLEKVA HL  + AESLS+DL+GL+KVEARIVVLAVLRM+KE +I G+ +K
Sbjct: 620  AEFGRPHSLLLEKVAAHLHHNVAESLSVDLQGLTKVEARIVVLAVLRMVKENYIQGHLVK 679

Query: 377  DDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKESPAHS 198
            DD+SI LG++++       ++ VK+A+ +LL ++LGL V     R   D+  D E P +S
Sbjct: 680  DDMSITLGIDKVDVLPATQKAEVKDAIFKLLHNELGLEVLIVVPRYTADLETDLEIPLNS 739

Query: 197  NLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
              N  ++   SG      S  RRP+VLQRLKVTR SL+ WLQR+
Sbjct: 740  YQNWSKS---SGRENIRVSSARRPLVLQRLKVTRNSLHSWLQRK 780


>gb|EXB84820.1| hypothetical protein L484_003853 [Morus notabilis]
          Length = 822

 Score =  393 bits (1010), Expect = e-107
 Identities = 198/347 (57%), Positives = 265/347 (76%), Gaps = 3/347 (0%)
 Frame = -3

Query: 1097 HITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGN 918
            +    +PF PTT+TYNILMKACG+DYY AKAL++EM+ +GLSPN I+WSILIDICG  GN
Sbjct: 477  NFARELPFTPTTTTYNILMKACGSDYYHAKALIEEMEAVGLSPNQITWSILIDICGDLGN 536

Query: 917  VMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNT 738
            V GA+QIL++MR +GI+PDV+ YTT IK+CVE K++K AF LF EM++YQI+PNLVTYNT
Sbjct: 537  VEGALQILKTMRATGIEPDVVAYTTVIKVCVESKDLKQAFELFAEMKRYQIQPNLVTYNT 596

Query: 737  ILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQFAS 558
            +LRAR+RYGSLQ+V+QCLAVYQ MR+AGY   DYYLK+L+EEWCEGVIQ  +QN+ + +S
Sbjct: 597  LLRARNRYGSLQEVKQCLAVYQDMRRAGYNSNDYYLKQLIEEWCEGVIQGNNQNREESSS 656

Query: 557  --RLTEFGPQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIKEKFIAGN 387
              +  +  PQSLLLEKVAEHL+   AE+L++D++GL KVEARIVVLAVLRM+KE +  G 
Sbjct: 657  FNKTDKKRPQSLLLEKVAEHLEKHIAETLTVDVQGLKKVEARIVVLAVLRMVKENYTMGY 716

Query: 386  SIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDKESP 207
             +KDD+ II+G  ++     + E  VK+A+ +LL+ +LGL V ++G +   + + D +S 
Sbjct: 717  LVKDDMLIIIGACKVDAVPDEQELEVKDAITKLLKDELGLEVLSTGLKIEPNRQVDSDSL 776

Query: 206  AHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
              S+ + E           ++  TRRPVV+QRLKVT++SL HWLQR+
Sbjct: 777  GSSDFSGE-----------MKYSTRRPVVIQRLKVTKESLQHWLQRK 812


>ref|XP_006287051.1| hypothetical protein CARUB_v10000200mg [Capsella rubella]
            gi|482555757|gb|EOA19949.1| hypothetical protein
            CARUB_v10000200mg [Capsella rubella]
          Length = 858

 Score =  391 bits (1004), Expect = e-106
 Identities = 199/350 (56%), Positives = 268/350 (76%), Gaps = 3/350 (0%)
 Frame = -3

Query: 1106 PYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGG 927
            PY   + R  F+PTT+TYNIL+KACGTDYYR K LMDEMK+LGL+PN I+WS LID+CGG
Sbjct: 512  PYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLTPNQITWSTLIDMCGG 571

Query: 926  SGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVT 747
            SG+V GA++ILR+M  +G +PDV+ YTTAIKIC E+K++KLAF+LF+EM++YQIKPN VT
Sbjct: 572  SGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVT 631

Query: 746  YNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQ 567
            YNT+L+AR +YGSL +V+QCLA+YQ MRKAGYKP D++LK L+EEWCEGVIQ   Q++++
Sbjct: 632  YNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENGQSQNK 691

Query: 566  FASRLTEFG--PQSLLLEKVAEHLQD-SAESLSIDLRGLSKVEARIVVLAVLRMIKEKFI 396
             + +  +    P SLL+EKVA HLQ+ +A +L+IDL+GL+KVEAR+VVLAVLRMIKE ++
Sbjct: 692  ISDQEGDHAGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYM 751

Query: 395  AGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDK 216
             G+ + DDV IILG  E    + K +  VKEA+V+LLQ +L L V  +G R   ++++D 
Sbjct: 752  RGDVVIDDVLIILGTSEANTDSGKQDIAVKEALVKLLQEELSLVVLPAGQR---NIKQDA 808

Query: 215  ESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
                 +N + E T E + S  ++ S TRRP +L+RL VT+ SL  WLQR+
Sbjct: 809  HCVDDANQDTEHTLENTKSFISISS-TRRPAILERLMVTKASLYQWLQRK 857


>ref|XP_002326162.1| predicted protein [Populus trichocarpa]
          Length = 828

 Score =  389 bits (1000), Expect = e-106
 Identities = 206/354 (58%), Positives = 269/354 (75%), Gaps = 3/354 (0%)
 Frame = -3

Query: 1118 VPLRPYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILID 939
            VP   + +  ++ PF PT +TY++LMKACG+DY+RAKALMDEMKT+G+SPNHISWSILID
Sbjct: 494  VPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYHRAKALMDEMKTVGISPNHISWSILID 553

Query: 938  ICGGSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKP 759
            ICG SGNV GA+QIL++MR +G++PDV+ YTTAIK+CVE KN+KLAF+LF EM++ QI P
Sbjct: 554  ICGVSGNVSGAVQILKNMRMAGVEPDVVAYTTAIKVCVETKNLKLAFSLFAEMKRCQINP 613

Query: 758  NLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQ 579
            NLVTYNT+LRAR RYGSL++VQQCLA+YQ MRKAGYK  DYYLK+L+EEWCEGVIQ+ +Q
Sbjct: 614  NLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKAGYKSNDYYLKQLIEEWCEGVIQDNNQ 673

Query: 578  NKSQFAS-RLTEFG-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIK 408
             +  FAS + T+ G P+SLLLEKVA HLQ++ +E+L+IDL+GL+KVEARIVVLAVLRMIK
Sbjct: 674  IQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISENLAIDLQGLTKVEARIVVLAVLRMIK 733

Query: 407  EKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDV 228
            E +  G S+K+D+ I L + ++   A+K +S VK A++ LL+++LGL V  +     +D+
Sbjct: 734  ENYTLGYSVKEDMWITLDVSKV-DPASKRDSEVKNAIIELLRNELGLEVLVAVPGHLDDI 792

Query: 227  RRDKESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
            + D +S                        +  PVV QRLKV RKSL+ WLQRR
Sbjct: 793  KTDSKS------------------------SLDPVVTQRLKVRRKSLHEWLQRR 822


>ref|XP_006381507.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550336211|gb|ERP59304.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 828

 Score =  389 bits (999), Expect = e-105
 Identities = 206/354 (58%), Positives = 269/354 (75%), Gaps = 3/354 (0%)
 Frame = -3

Query: 1118 VPLRPYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILID 939
            VP   + +  ++ PF PT +TY++LMKACG+DY+RAKALMDEMKT+G+SPNHISWSILID
Sbjct: 494  VPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYHRAKALMDEMKTVGISPNHISWSILID 553

Query: 938  ICGGSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKP 759
            ICG SGNV GA+QIL++MR +G++PDV+ YTTAIK+CVE KN+KLAF+LF EM++ QI P
Sbjct: 554  ICGVSGNVSGAVQILKNMRLAGVEPDVVAYTTAIKVCVETKNLKLAFSLFAEMKRCQINP 613

Query: 758  NLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQ 579
            NLVTYNT+LRAR RYGSL++VQQCLA+YQ MRKAGYK  DYYLK+L+EEWCEGVIQ+ +Q
Sbjct: 614  NLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKAGYKSNDYYLKQLIEEWCEGVIQDNNQ 673

Query: 578  NKSQFAS-RLTEFG-PQSLLLEKVAEHLQDS-AESLSIDLRGLSKVEARIVVLAVLRMIK 408
             +  FAS + T+ G P+SLLLEKVA HLQ++ +E+L+IDL+GL+KVEARIVVLAVLRMIK
Sbjct: 674  IQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISENLAIDLQGLTKVEARIVVLAVLRMIK 733

Query: 407  EKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDV 228
            E +  G S+K+D+ I L + ++   A+K +S VK A++ LL+++LGL V  +     +D+
Sbjct: 734  ENYTLGYSVKEDMWITLDVSKV-DPASKRDSEVKNAIIELLRNELGLEVLVAVPGHLDDI 792

Query: 227  RRDKESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
            + D +S                        +  PVV QRLKV RKSL+ WLQRR
Sbjct: 793  KTDSKS------------------------SLDPVVTQRLKVRRKSLHEWLQRR 822


>ref|XP_003554352.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Glycine max]
          Length = 811

 Score =  388 bits (997), Expect = e-105
 Identities = 203/351 (57%), Positives = 261/351 (74%), Gaps = 6/351 (1%)
 Frame = -3

Query: 1100 SHI---TERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICG 930
            SHI    ER PF PTT+TYNIL+KACGTDYY AKAL+ EM+T+GLSPN ISWSILIDICG
Sbjct: 465  SHILNFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSILIDICG 524

Query: 929  GSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLV 750
             S NV GAI+IL++M ++GI+PDVI YTTAIK+CVE KN   A TL++EM+ YQI+PN V
Sbjct: 525  ASSNVEGAIEILKTMGDAGIKPDVIAYTTAIKVCVESKNFMQALTLYEEMKCYQIRPNWV 584

Query: 749  TYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKS 570
            TYNT+L+AR +YG L +VQQCLA+YQ MRKAGYKP DYYL+ L+EEWCEGVIQN  + + 
Sbjct: 585  TYNTLLKARSKYGFLHEVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQNNREKQG 644

Query: 569  QFAS--RLTEFGPQSLLLEKVAEH-LQDSAESLSIDLRGLSKVEARIVVLAVLRMIKEKF 399
            +F+S  +     PQSLLLEK+A H L+  A+ L+ID++GL+KVEAR+VVLAVLRMIKE +
Sbjct: 645  EFSSSNKSESERPQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKENY 704

Query: 398  IAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRD 219
              G+S+ DD+ II+G  ++  + +K    V+EA+++LL+++LGL VF +  R       +
Sbjct: 705  GLGHSVNDDILIIIGATKVDENPSKHILEVQEAIIKLLRNELGLEVFPAKTRLALSDTAN 764

Query: 218  KESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
             E P  SNL++    E      AL   TRRP VL RLKVT+KSL  WL R+
Sbjct: 765  LEYPNFSNLSI----EAQPGENALGFQTRRPGVLVRLKVTKKSLYRWLHRK 811


>ref|XP_006398739.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099829|gb|ESQ40192.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 858

 Score =  385 bits (990), Expect = e-104
 Identities = 199/350 (56%), Positives = 263/350 (75%), Gaps = 3/350 (0%)
 Frame = -3

Query: 1106 PYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGG 927
            PY   + R  F+PTT+TYNIL+KACGTDYYR K LMDEM++LGL+PN I+WS LIDICGG
Sbjct: 509  PYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGG 568

Query: 926  SGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVT 747
            SG+V GA+ ILR+M  +G +PDV+ YTTAIKIC E+K++KLAF+LF+EM++YQIKPN VT
Sbjct: 569  SGDVEGAVGILRTMHSAGTRPDVVAYTTAIKICAENKSLKLAFSLFEEMRRYQIKPNWVT 628

Query: 746  YNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQ 567
            YNT+L+AR +YGSL +V+QCLA+YQ MRKAGYKP D++LK L+EEWCEGVIQ   Q++ +
Sbjct: 629  YNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENSQSQIK 688

Query: 566  FASRL-TEFG-PQSLLLEKVAEHLQD-SAESLSIDLRGLSKVEARIVVLAVLRMIKEKFI 396
             + +  T  G P SLL+EKVA HLQ+ +A +L+IDL+GL+KVEAR+VVLAVLRMIKE +I
Sbjct: 689  TSDQEGTNLGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMIKEDYI 748

Query: 395  AGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDK 216
             G+ + DD+ IILG  E      K E  VK+ +V+LL+ +L L V  +G R   D+  D 
Sbjct: 749  RGDVVTDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVVLPAGHRHVLDITLDA 808

Query: 215  ESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
                 ++  +E T E + S   + S TRRP +L+RL VT+ SL+ WLQR+
Sbjct: 809  RCVDDADQGIELTSENTKSIVGISS-TRRPAILERLMVTKASLHQWLQRK 857


>ref|XP_004493936.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
            chloroplastic-like [Cicer arietinum]
          Length = 799

 Score =  385 bits (990), Expect = e-104
 Identities = 202/351 (57%), Positives = 266/351 (75%), Gaps = 6/351 (1%)
 Frame = -3

Query: 1100 SHI---TERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICG 930
            SHI   TER PF+PTTSTYN L+KACGT+YY AKAL++EMKT+GLSPN ISWSILI+ICG
Sbjct: 456  SHIMSFTERFPFKPTTSTYNTLLKACGTNYYHAKALINEMKTVGLSPNQISWSILINICG 515

Query: 929  GSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLV 750
            GS NV GAI+ILR+M ++G++PDV+ YTTAIK+CVE KN   A TL++EM+ Y+ +PNLV
Sbjct: 516  GSENVEGAIEILRTMIDAGVKPDVVAYTTAIKVCVESKNFTKALTLYEEMKSYETQPNLV 575

Query: 749  TYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKS 570
            TYNT+LRAR +YGSL++VQQCLA+YQ MRKAGYKP DYYL+ L+EEWCEGVIQ+  + + 
Sbjct: 576  TYNTLLRARSKYGSLREVQQCLAIYQDMRKAGYKPNDYYLEELIEEWCEGVIQDNEEYEV 635

Query: 569  QF-ASRLTEF-GPQSLLLEKVAEH-LQDSAESLSIDLRGLSKVEARIVVLAVLRMIKEKF 399
            +F +S+  E   P+SLLLEK+A H L+  A+ L+ID++GLSKVEAR+V+LAVLRMIKE +
Sbjct: 636  EFSSSKKPEIERPESLLLEKIAAHLLKRVADILAIDVQGLSKVEARLVILAVLRMIKENY 695

Query: 398  IAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRD 219
              G+S+ DD+ II+G  +      K+   V+EAV++LL+++LGL    +  R        
Sbjct: 696  AFGHSVNDDILIIIGATKADESPAKEILEVQEAVIKLLRNELGLEALPAKTRFA-----P 750

Query: 218  KESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
             +SP      L+ T+E +  P  +   TRRP VLQRLKVT++SL+ WLQRR
Sbjct: 751  SDSP-----KLQNTKE-NALPTTMVFHTRRPAVLQRLKVTKQSLHRWLQRR 795


>ref|XP_006398740.1| hypothetical protein EUTSA_v10012661mg [Eutrema salsugineum]
            gi|557099830|gb|ESQ40193.1| hypothetical protein
            EUTSA_v10012661mg [Eutrema salsugineum]
          Length = 863

 Score =  379 bits (974), Expect = e-103
 Identities = 199/355 (56%), Positives = 263/355 (74%), Gaps = 8/355 (2%)
 Frame = -3

Query: 1106 PYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGG 927
            PY   + R  F+PTT+TYNIL+KACGTDYYR K LMDEM++LGL+PN I+WS LIDICGG
Sbjct: 509  PYIQASNRFFFKPTTATYNILLKACGTDYYRGKELMDEMRSLGLAPNQITWSTLIDICGG 568

Query: 926  SGNVMGAIQILRSMRESGIQPDVITYTTAIK-----ICVEHKNIKLAFTLFDEMQKYQIK 762
            SG+V GA+ ILR+M  +G +PDV+ YTTAIK     IC E+K++KLAF+LF+EM++YQIK
Sbjct: 569  SGDVEGAVGILRTMHSAGTRPDVVAYTTAIKHAIFQICAENKSLKLAFSLFEEMRRYQIK 628

Query: 761  PNLVTYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEH 582
            PN VTYNT+L+AR +YGSL +V+QCLA+YQ MRKAGYKP D++LK L+EEWCEGVIQ   
Sbjct: 629  PNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRKAGYKPNDHFLKELIEEWCEGVIQENS 688

Query: 581  QNKSQFASRL-TEFG-PQSLLLEKVAEHLQD-SAESLSIDLRGLSKVEARIVVLAVLRMI 411
            Q++ + + +  T  G P SLL+EKVA HLQ+ +A +L+IDL+GL+KVEAR+VVLAVLRMI
Sbjct: 689  QSQIKTSDQEGTNLGRPVSLLIEKVATHLQERTAGNLAIDLQGLTKVEARLVVLAVLRMI 748

Query: 410  KEKFIAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGND 231
            KE +I G+ + DD+ IILG  E      K E  VK+ +V+LL+ +L L V  +G R   D
Sbjct: 749  KEDYIRGDVVTDDLLIILGTGEANIDPGKQEIAVKDVLVQLLKDELSLVVLPAGHRHVLD 808

Query: 230  VRRDKESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
            +  D      ++  +E T E + S   + S TRRP +L+RL VT+ SL+ WLQR+
Sbjct: 809  ITLDARCVDDADQGIELTSENTKSIVGISS-TRRPAILERLMVTKASLHQWLQRK 862


>gb|ESW34707.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 809

 Score =  376 bits (966), Expect = e-102
 Identities = 193/351 (54%), Positives = 256/351 (72%), Gaps = 6/351 (1%)
 Frame = -3

Query: 1100 SHI---TERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICG 930
            SHI    ER PF PTT+TYNIL+KACGTDYY AKAL+ EM+T+GLSPN ISWS LIDICG
Sbjct: 459  SHILSFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDICG 518

Query: 929  GSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLV 750
             S NV GAI+IL++M ++GI+PDVI YTTAIK+CVE KN   A  L+ EM+ Y I+PNL+
Sbjct: 519  ASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPNLI 578

Query: 749  TYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKS 570
            TYNT+L+AR +YGSL +VQQCLA+YQ MRKAGYKP D YL+ L+EEWCEGVIQ+  + + 
Sbjct: 579  TYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQG 638

Query: 569  QFAS--RLTEFGPQSLLLEKVAEH-LQDSAESLSIDLRGLSKVEARIVVLAVLRMIKEKF 399
            +F+S  +      QSLLLEK+A H L+  A+ L+ID++GL+KVEAR+VVLAVLRMIKE +
Sbjct: 639  EFSSSNKSELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKENY 698

Query: 398  IAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRD 219
              G+SI DD+ I++G  ++  +  K    V+EA+++LL+++LGL  F +  R        
Sbjct: 699  SLGHSINDDILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPARTRLALSDTPK 758

Query: 218  KESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
             ++P  +NL +E    +   P ++   TRRP +L RLK+TRKSL  WL R+
Sbjct: 759  LKNPTLANLKIEAVPAEDALPTSMGFQTRRPGILVRLKITRKSLYSWLHRK 809



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 40/146 (27%), Positives = 72/146 (49%), Gaps = 4/146 (2%)
 Frame = -3

Query: 1070 PTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMGAIQILR 891
            P    +N LM     D      L   M+ LGL P+  S++IL+  C  +G V  A  I R
Sbjct: 254  PNIYVFNSLMNVNAHDLSYTLNLYQNMQNLGLKPDMTSYNILLKGCCVAGRVDLAQDIYR 313

Query: 890  SMRE----SGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILRAR 723
             ++       ++ DV TY+T IK+  + +  ++A T+  +M    +  N+V +++++ A 
Sbjct: 314  ELKHLESVGQLKLDVFTYSTIIKVFADARLWQMALTIKQDMLSAGVSLNIVAWSSLINAC 373

Query: 722  DRYGSLQQVQQCLAVYQHMRKAGYKP 645
               G    V+Q + +++ M  AG +P
Sbjct: 374  AHAG---LVEQAIQLFEEMLLAGREP 396


>gb|ESW34706.1| hypothetical protein PHAVU_001G174000g [Phaseolus vulgaris]
          Length = 594

 Score =  376 bits (966), Expect = e-102
 Identities = 193/351 (54%), Positives = 256/351 (72%), Gaps = 6/351 (1%)
 Frame = -3

Query: 1100 SHI---TERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICG 930
            SHI    ER PF PTT+TYNIL+KACGTDYY AKAL+ EM+T+GLSPN ISWS LIDICG
Sbjct: 244  SHILSFAERFPFTPTTTTYNILLKACGTDYYHAKALIKEMETVGLSPNQISWSTLIDICG 303

Query: 929  GSGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLV 750
             S NV GAI+IL++M ++GI+PDVI YTTAIK+CVE KN   A  L+ EM+ Y I+PNL+
Sbjct: 304  ASANVEGAIEILKNMGDAGIKPDVIAYTTAIKVCVESKNFMQALALYKEMKSYHIRPNLI 363

Query: 749  TYNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKS 570
            TYNT+L+AR +YGSL +VQQCLA+YQ MRKAGYKP D YL+ L+EEWCEGVIQ+  + + 
Sbjct: 364  TYNTLLKARSKYGSLHEVQQCLAIYQDMRKAGYKPNDCYLEELIEEWCEGVIQDNREIQG 423

Query: 569  QFAS--RLTEFGPQSLLLEKVAEH-LQDSAESLSIDLRGLSKVEARIVVLAVLRMIKEKF 399
            +F+S  +      QSLLLEK+A H L+  A+ L+ID++GL+KVEAR+VVLAVLRMIKE +
Sbjct: 424  EFSSSNKSELEKSQSLLLEKIAAHLLKRVADILAIDVQGLTKVEARLVVLAVLRMIKENY 483

Query: 398  IAGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRD 219
              G+SI DD+ I++G  ++  +  K    V+EA+++LL+++LGL  F +  R        
Sbjct: 484  SLGHSINDDILIVIGATKVDENPAKRILEVQEAILKLLRNELGLEAFPARTRLALSDTPK 543

Query: 218  KESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
             ++P  +NL +E    +   P ++   TRRP +L RLK+TRKSL  WL R+
Sbjct: 544  LKNPTLANLKIEAVPAEDALPTSMGFQTRRPGILVRLKITRKSLYSWLHRK 594



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 40/146 (27%), Positives = 72/146 (49%), Gaps = 4/146 (2%)
 Frame = -3

Query: 1070 PTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGGSGNVMGAIQILR 891
            P    +N LM     D      L   M+ LGL P+  S++IL+  C  +G V  A  I R
Sbjct: 39   PNIYVFNSLMNVNAHDLSYTLNLYQNMQNLGLKPDMTSYNILLKGCCVAGRVDLAQDIYR 98

Query: 890  SMRE----SGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVTYNTILRAR 723
             ++       ++ DV TY+T IK+  + +  ++A T+  +M    +  N+V +++++ A 
Sbjct: 99   ELKHLESVGQLKLDVFTYSTIIKVFADARLWQMALTIKQDMLSAGVSLNIVAWSSLINAC 158

Query: 722  DRYGSLQQVQQCLAVYQHMRKAGYKP 645
               G    V+Q + +++ M  AG +P
Sbjct: 159  AHAG---LVEQAIQLFEEMLLAGREP 181


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g02830, chloroplastic; Flags: Precursor
            gi|332003140|gb|AED90523.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  375 bits (962), Expect = e-101
 Identities = 192/350 (54%), Positives = 262/350 (74%), Gaps = 3/350 (0%)
 Frame = -3

Query: 1106 PYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGG 927
            PY   ++R  F+PTT+TYNIL+KACGTDYYR K LMDEMK+LGLSPN I+WS LID+CGG
Sbjct: 512  PYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGG 571

Query: 926  SGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVT 747
            SG+V GA++ILR+M  +G +PDV+ YTTAIKIC E+K +KLAF+LF+EM++YQIKPN VT
Sbjct: 572  SGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVT 631

Query: 746  YNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQ 567
            YNT+L+AR +YGSL +V+QCLA+YQ MR AGYKP D++LK L+EEWCEGVIQ   Q++ +
Sbjct: 632  YNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGQSQDK 691

Query: 566  FASRLTEFG--PQSLLLEKVAEHLQD-SAESLSIDLRGLSKVEARIVVLAVLRMIKEKFI 396
             + +  +    P SLL+EKVA H+Q+ +A +L+IDL+GL+K+EAR+VVLAVLRMIKE ++
Sbjct: 692  ISDQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYM 751

Query: 395  AGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDK 216
             G+ + DDV II+G +E    + K E  V+EA+V+LL+ +L L V  +G       +R+ 
Sbjct: 752  RGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVVLPAG-------QRNI 804

Query: 215  ESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
               AH    +++  +++       S TRRP +L+RL VT+ SL  WLQRR
Sbjct: 805  IQDAHC---VDDADQENTKSFVSISSTRRPAILERLMVTKASLYQWLQRR 851


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  373 bits (958), Expect = e-101
 Identities = 191/350 (54%), Positives = 262/350 (74%), Gaps = 3/350 (0%)
 Frame = -3

Query: 1106 PYSHITERVPFRPTTSTYNILMKACGTDYYRAKALMDEMKTLGLSPNHISWSILIDICGG 927
            PY   ++R  F+PTT+TYNIL+KACGTDYYR K LMDEMK+LGLSPN I+WS LID+CGG
Sbjct: 512  PYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSPNQITWSTLIDMCGG 571

Query: 926  SGNVMGAIQILRSMRESGIQPDVITYTTAIKICVEHKNIKLAFTLFDEMQKYQIKPNLVT 747
            SG+V GA++ILR+M  +G +PDV+ YTTAIKIC E+K +KLAF+LF+EM++YQIKPN VT
Sbjct: 572  SGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLFEEMRRYQIKPNWVT 631

Query: 746  YNTILRARDRYGSLQQVQQCLAVYQHMRKAGYKPKDYYLKRLVEEWCEGVIQNEHQNKSQ 567
            YNT+L+AR +YGSL +V+QCLA+YQ MR AGYKP D++LK L+EEWCEGVIQ   +++ +
Sbjct: 632  YNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEWCEGVIQENGRSQDK 691

Query: 566  FASRLTEFG--PQSLLLEKVAEHLQD-SAESLSIDLRGLSKVEARIVVLAVLRMIKEKFI 396
             + +  +    P SLL+EKVA H+Q+ +A +L+IDL+GL+K+EAR+VVLAVLRMIKE ++
Sbjct: 692  ISDQEGDNAGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIEARLVVLAVLRMIKEDYM 751

Query: 395  AGNSIKDDVSIILGLEELGGHATKDESGVKEAVVRLLQHDLGLHVFASGFRTGNDVRRDK 216
             G+ + DDV II+G +E    + K E  V+EA+V+LL+ +L L V  +G       +R+ 
Sbjct: 752  RGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSLVVLPAG-------QRNI 804

Query: 215  ESPAHSNLNLEETREKSGSPQALESPTRRPVVLQRLKVTRKSLNHWLQRR 66
               AH    +++  +++       S TRRP +L+RL VT+ SL  WLQRR
Sbjct: 805  IQDAHC---VDDADQENTKSFVSISSTRRPAILERLMVTKASLYQWLQRR 851

