BLASTX nr result

ID: Mentha26_contig00033213 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00033213
         (953 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU38829.1| hypothetical protein MIMGU_mgv1a001151mg [Mimulus...   389   e-105
ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containi...   310   7e-82
ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containi...   308   3e-81
ref|XP_007017649.1| Pentatricopeptide repeat (PPR-like) superfam...   293   7e-77
ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citr...   291   3e-76
ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutr...   287   4e-75
ref|XP_003615696.1| Pentatricopeptide repeat-containing protein ...   286   8e-75
ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containi...   285   2e-74
ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containi...   284   3e-74
ref|XP_007142200.1| hypothetical protein PHAVU_008G260600g [Phas...   282   2e-73
ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containi...   281   4e-73
ref|XP_006386200.1| pentatricopeptide repeat-containing family p...   279   1e-72
ref|XP_007227217.1| hypothetical protein PRUPE_ppa019183mg [Prun...   276   1e-71
ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containi...   266   7e-69
emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana]                   264   4e-68
ref|NP_173402.2| pentatricopeptide repeat-containing protein [Ar...   264   4e-68
gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis]     257   5e-66
ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containi...   246   7e-63
ref|XP_006828626.1| hypothetical protein AMTR_s00129p00082590 [A...   233   1e-58
ref|XP_002893064.1| hypothetical protein ARALYDRAFT_472198 [Arab...   226   1e-56

>gb|EYU38829.1| hypothetical protein MIMGU_mgv1a001151mg [Mimulus guttatus]
          Length = 876

 Score =  389 bits (998), Expect = e-105
 Identities = 198/318 (62%), Positives = 244/318 (76%), Gaps = 12/318 (3%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            MPS+DIITWNT+ T YVLHGC+ EAIELFE M +   +PNR TFAS+ISAYGLAKKV+EG
Sbjct: 561  MPSVDIITWNTMTTGYVLHGCADEAIELFEHMTRQECRPNRGTFASVISAYGLAKKVEEG 620

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELG--VQIWSAVLTA 600
            +RVFS MT+EYQI+PCLDHYVA+VNLYGRSG +DEAFEF+  +  E    V IW A+LT 
Sbjct: 621  KRVFSNMTEEYQIVPCLDHYVAVVNLYGRSGKVDEAFEFVANMASEESEDVSIWRALLTC 680

Query: 599  CRRQGNVRLAIHAGEVLLELEPDNS----LTQRLLSQLYELRRVTRDSSKRSRKVPNG-P 435
            CRR GNV+LAIHAGE LLELEPDN+      ++L+ QLY+LR ++++S K  RK   G  
Sbjct: 681  CRRHGNVKLAIHAGEKLLELEPDNNNDTLFVRKLVLQLYDLRGISKESLKMKRKETTGYS 740

Query: 434  TGCSWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEEN 255
             G SWI  E +NTVH FV+GD  Q+DGK L SWI R+E   + S+Y DML+I+EEE+EE 
Sbjct: 741  LGRSWI--EEKNTVHTFVSGDLRQLDGKSLRSWIERVESCNKESQYRDMLSIEEEEEEEE 798

Query: 254  S---GILSEKLALAYAVMKFRRPL--RTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHD 90
                GI SEKLALA+A++K  R    RTIR+VKN+RMC +CHRFA+L+SK++GCEIY+ D
Sbjct: 799  EESVGIHSEKLALAFALIKSCRESTPRTIRVVKNVRMCGNCHRFAKLVSKRHGCEIYISD 858

Query: 89   TICLHHFKNGNCSCGDYW 36
            +  LHHFKNG CSC DYW
Sbjct: 859  SKSLHHFKNGVCSCRDYW 876


>ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Solanum tuberosum]
          Length = 884

 Score =  310 bits (793), Expect = 7e-82
 Identities = 161/308 (52%), Positives = 215/308 (69%), Gaps = 2/308 (0%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DII+WNTL   YVLHG SSEA +LF +M +   KPNR TF+S+IS+YGLAK V+EG
Sbjct: 579  MSTKDIISWNTLIAGYVLHGFSSEATKLFHQMEEAGLKPNRGTFSSMISSYGLAKMVEEG 638

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            +R+FS M +EY+I+P L+HYVAMV LYGRSG L+EA +FI  + +E  + IW A+LTA R
Sbjct: 639  KRMFSSMYEEYRIVPGLEHYVAMVTLYGRSGKLEEAIDFIDNMTMEHDISIWGALLTASR 698

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDS--SKRSRKVPNGPTGCSW 420
              GN+ LAIHAGE LL+L+P N +  +LL QL  LR ++ +S    R RK  +     SW
Sbjct: 699  VHGNLNLAIHAGEQLLKLDPGNVVIHQLLLQLNVLRGISEESVTVMRPRKRNHHEEPLSW 758

Query: 419  IQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILS 240
               E  N VHAF +G   Q + +   SWI R E+   GS   + L I+EEE+E+ + + S
Sbjct: 759  SWTEINNVVHAFASG--QQSNSEVPDSWIKRKEVKMEGSSSCNRLCIKEEENEDITRVHS 816

Query: 239  EKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNG 60
            EKLAL++A++   +  R IR+VKNLRMC+ CHR A+L+S+KY  EIY+HD+ CLHHFK+G
Sbjct: 817  EKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYEREIYIHDSKCLHHFKDG 876

Query: 59   NCSCGDYW 36
             CSCG+YW
Sbjct: 877  YCSCGNYW 884


>ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Solanum lycopersicum]
          Length = 884

 Score =  308 bits (788), Expect = 3e-81
 Identities = 158/308 (51%), Positives = 214/308 (69%), Gaps = 2/308 (0%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DII+WNTL   YVLHG SSE+ +LF +M +   KPNR TF+S+I +YGLAK V+EG
Sbjct: 579  MSTKDIISWNTLIAGYVLHGFSSESTKLFHQMEEAGLKPNRGTFSSVILSYGLAKMVEEG 638

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            +R+FS M+++Y+I+P L+H VAMVNLYGRSG L+EA  FI  + +E  + IW A+LTA R
Sbjct: 639  KRMFSSMSEKYRIVPGLEHCVAMVNLYGRSGKLEEAINFIDNMTMEHDISIWGALLTASR 698

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSS--KRSRKVPNGPTGCSW 420
              GN+ LAIHAGE L +L+P N +  +LL QLY LR ++ +S    R RK  +     SW
Sbjct: 699  VHGNLNLAIHAGEQLFKLDPGNVVIHQLLLQLYVLRGISEESETVMRPRKRNHHEEPLSW 758

Query: 419  IQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILS 240
               E  N VHAF +G   Q + +   SWI R E+   GS   + L I+EEE+E+ + + S
Sbjct: 759  SWTEINNVVHAFASG--QQCNSEVPDSWIKRKEVKMEGSSSCNRLCIKEEENEDITRVHS 816

Query: 239  EKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNG 60
            EKLAL++A++   +  R IR+VKNLRMC+ CHR A+L+S+KY  EIY+HD+ CLHHFK+G
Sbjct: 817  EKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYEREIYIHDSKCLHHFKDG 876

Query: 59   NCSCGDYW 36
             CSCG+YW
Sbjct: 877  YCSCGNYW 884


>ref|XP_007017649.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1
            [Theobroma cacao] gi|590593723|ref|XP_007017650.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao] gi|508722977|gb|EOY14874.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao] gi|508722978|gb|EOY14875.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 890

 Score =  293 bits (750), Expect = 7e-77
 Identities = 148/310 (47%), Positives = 200/310 (64%), Gaps = 4/310 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DII+WN++   YVLHGCS  A++LF +MRK+  KPNR TF SII A+G+A  VDEG
Sbjct: 583  MSTRDIISWNSIIGGYVLHGCSDAALDLFNQMRKLGLKPNRGTFLSIILAHGIAGMVDEG 642

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            +++FS ++D Y+I+P ++HY AM+++YGRSG L EA EFI  +P+E    +W+++LTA R
Sbjct: 643  KQIFSSISDNYEIIPAVEHYAAMIDVYGRSGRLGEAVEFIEDMPIEPDSSVWTSLLTASR 702

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSK----RSRKVPNGPTGC 426
               ++ LA+ AGE LL+LEP N L  R++ Q+Y L     D  K        +     G 
Sbjct: 703  IHRDIALAVLAGERLLDLEPANILINRVMFQIYVLSGKLDDPLKVRKLEKENILRRSLGH 762

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGI 246
            SWI  E  NTVH FVTGD S+     L+SW+  I        +     ++EEE EE  G+
Sbjct: 763  SWI--EVRNTVHKFVTGDQSKPCADLLYSWVKSIAREVNIHDHHGRFFLEEEEKEETGGV 820

Query: 245  LSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFK 66
             SEKL LA+A++      R+IR+VKN RMC +CH  A+ IS K+GCEIY+ D  C HHFK
Sbjct: 821  HSEKLTLAFALIGLPYSPRSIRIVKNTRMCSNCHLTAKYISLKFGCEIYLSDRKCFHHFK 880

Query: 65   NGNCSCGDYW 36
            NG CSCGDYW
Sbjct: 881  NGQCSCGDYW 890


>ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citrus clementina]
            gi|557537195|gb|ESR48313.1| hypothetical protein
            CICLE_v10000229mg [Citrus clementina]
          Length = 889

 Score =  291 bits (745), Expect = 3e-76
 Identities = 155/311 (49%), Positives = 205/311 (65%), Gaps = 5/311 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M S DIITWN+L   YVLHG    A++LF++M+    KPNR TF SII A+ LA  VD G
Sbjct: 582  MSSKDIITWNSLICGYVLHGFWHAALDLFDQMKSFGLKPNRGTFLSIILAHSLAGMVDLG 641

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VF  +T+ YQI+P ++HY AM++LYGRSG L+EA EFI  +P+E    IW A+LTACR
Sbjct: 642  KQVFCSITECYQIIPMIEHYSAMIDLYGRSGKLEEAMEFIEDMPIEPDSSIWEALLTACR 701

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGPT-----G 429
              GN+ LA+ A E L +LEP + L QRL+ Q+Y +     D+ K  RK+    T     G
Sbjct: 702  IHGNIDLAVLAIERLFDLEPGDVLIQRLILQIYAICGKPEDALK-VRKLEKENTRRNSFG 760

Query: 428  CSWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSG 249
             SWI  E +N V+ FVTG +S+     L+SW+  +  N         L I+EEE EE SG
Sbjct: 761  QSWI--EVKNLVYTFVTGGWSESYSDLLYSWLQNVPENVTARSCHSGLCIEEEEKEEISG 818

Query: 248  ILSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHF 69
            I SEKLALA+A++   +   TIR+VKN+RMC HCH+ A+ +SK + CEI++ D+ CLHHF
Sbjct: 819  IHSEKLALAFALIGSSQAPHTIRIVKNIRMCVHCHKTAKYVSKMHHCEIFLADSKCLHHF 878

Query: 68   KNGNCSCGDYW 36
            KNG CSCGDYW
Sbjct: 879  KNGQCSCGDYW 889


>ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum]
            gi|557094240|gb|ESQ34822.1| hypothetical protein
            EUTSA_v10006756mg [Eutrema salsugineum]
          Length = 893

 Score =  287 bits (735), Expect = 4e-75
 Identities = 144/311 (46%), Positives = 202/311 (64%), Gaps = 5/311 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DIITWN+L   YVLHG    A++LF +M+    KPNR T +SII A+GL   VDEG
Sbjct: 585  METKDIITWNSLIGGYVLHGRYGPALDLFNQMKTQGIKPNRGTLSSIILAHGLMGNVDEG 644

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VFS + D+Y I+P L+H  AM++LYGRS  L+EA +FI+ + ++    IW + LT CR
Sbjct: 645  KKVFSSIADDYNIIPALEHCSAMISLYGRSNRLEEAVQFIQEMNVQSETPIWESFLTGCR 704

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLY----ELRRVTRDSSKRSRKVPNGPTGC 426
              G++ LAIHA E L  LEP+N +T+ ++SQ+Y    +L R       R   +   P G 
Sbjct: 705  IHGDIDLAIHAAEHLFSLEPENPITENVVSQIYALGAKLGRSLEGKKPRRDNLLKKPLGH 764

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRI-EMNTRGSKYDDMLNIQEEEDEENSG 249
            SWI  E  N++H F TGD SQ+    L+ W+ ++  ++ R  +Y+  L I+EE  EE  G
Sbjct: 765  SWI--EVRNSIHTFTTGDKSQLCTDVLYPWVEKLCRLDDRNDQYNGELLIEEEGREETCG 822

Query: 248  ILSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHF 69
            I SEK A+A+ ++   R  +TIR++KNLRMC  CH  A+ IS++YGC+I + DT CLHHF
Sbjct: 823  IHSEKFAMAFGLISSSRAHKTIRILKNLRMCRDCHNTAKYISRRYGCDILLEDTRCLHHF 882

Query: 68   KNGNCSCGDYW 36
            KNG+CSC DYW
Sbjct: 883  KNGDCSCKDYW 893


>ref|XP_003615696.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355517031|gb|AES98654.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 887

 Score =  286 bits (732), Expect = 8e-75
 Identities = 147/302 (48%), Positives = 194/302 (64%)
 Frame = -1

Query: 941  DIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEGQRVF 762
            D ++WN++ +SYVLHGCS  A++LF +MRK   +PNR TFASI+ AYG A  VDEG+ VF
Sbjct: 591  DAVSWNSMLSSYVLHGCSESALDLFYQMRKQGLQPNRGTFASILLAYGHAGMVDEGKSVF 650

Query: 761  SLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQGN 582
            S +T +Y +   ++HY AMV L GRSG L EA +FI+++P+E    +W A+LTACR   N
Sbjct: 651  SCITKDYLVRQGMEHYSAMVYLLGRSGKLAEALDFIQSMPIEPNSSVWGALLTACRIHRN 710

Query: 581  VRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGPTGCSWIQGEGE 402
              +A+ AG+ +LE EP N++T+ LLSQ Y L            K  N P G SWI  E  
Sbjct: 711  FGVAVLAGKRMLEFEPGNNITRHLLSQAYSL---CGKFEPEGEKAVNKPIGQSWI--ERN 765

Query: 401  NTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILSEKLALA 222
            N VH FV GD S      LHSW+ R+ +N +    D+ L I+EEE E  S + SEKLA A
Sbjct: 766  NVVHTFVVGDQSNPYLDKLHSWLKRVAVNVKTHVSDNELYIEEEEKENTSSVHSEKLAFA 825

Query: 221  YAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNGNCSCGD 42
            +A++      + +R+VK LRMC  CH  A+ IS  YGCEIY+ D+ CLHHFK G+CSC D
Sbjct: 826  FALIDPHNKPQILRIVKKLRMCRDCHDTAKYISMAYGCEIYLSDSNCLHHFKGGHCSCRD 885

Query: 41   YW 36
            YW
Sbjct: 886  YW 887


>ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Glycine max]
          Length = 896

 Score =  285 bits (729), Expect = 2e-74
 Identities = 150/307 (48%), Positives = 200/307 (65%), Gaps = 5/307 (1%)
 Frame = -1

Query: 941  DIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEGQRVF 762
            DII+WN+L + YVLHGCS  A++LF++MRK    P+R T  SIISAY  A+ VDEG+  F
Sbjct: 592  DIISWNSLLSGYVLHGCSESALDLFDQMRKDGLHPSRVTLTSIISAYSHAEMVDEGKHAF 651

Query: 761  SLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQGN 582
            S +++EYQI   L+HY AMV L GRSG L +A EFI+ +P+E    +W+A+LTACR   N
Sbjct: 652  SNISEEYQIRLDLEHYSAMVYLLGRSGKLAKALEFIQNMPVEPNSSVWAALLTACRIHKN 711

Query: 581  VRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSR----KVPNGPTGCSWIQ 414
              +AI AGE +LEL+P+N +TQ LLSQ Y +   + ++ K ++    K    P G SWI 
Sbjct: 712  FGMAIFAGEHMLELDPENIITQHLLSQAYSVCGKSWEAQKMTKLEKEKFVKMPVGQSWI- 770

Query: 413  GEGENTVHAFVTGDFSQIDG-KCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILSE 237
             E  N VH FV GD   I     +HSW+ R+  N +    D+ L I+EEE E    + SE
Sbjct: 771  -EMNNMVHTFVVGDDQSIPYLDKIHSWLKRVGENVKAHISDNGLRIEEEEKENIGSVHSE 829

Query: 236  KLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNGN 57
            KLA A+ ++ F    + +R+VKNLRMC  CH  A+ IS  YGCEIY+ D+ CLHHFK+G+
Sbjct: 830  KLAFAFGLIDFHHTPQILRIVKNLRMCRDCHDTAKYISLAYGCEIYLSDSNCLHHFKDGH 889

Query: 56   CSCGDYW 36
            CSC DYW
Sbjct: 890  CSCRDYW 896


>ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Cicer arietinum]
          Length = 888

 Score =  284 bits (727), Expect = 3e-74
 Identities = 149/302 (49%), Positives = 195/302 (64%)
 Frame = -1

Query: 941  DIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEGQRVF 762
            D+++ N++ + YVL+GCS  AI+LF +MRK   +PNR TFA+I+ AYG    VDEG+ VF
Sbjct: 592  DVVSLNSMLSGYVLNGCSESAIDLFHQMRKEGIRPNRGTFATILLAYGHTGMVDEGKHVF 651

Query: 761  SLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQGN 582
            S MT+EY I P ++HY AMV + GRSG L EA EFI+ +P+E    +W A+LTAC+   N
Sbjct: 652  SCMTNEYLIRPGMEHYSAMVYMLGRSGKLAEALEFIQNMPIEPNSLVWDALLTACKIHRN 711

Query: 581  VRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGPTGCSWIQGEGE 402
              +A+ AG+ LLELEP N++T+ LLSQ Y L       +    K  N P G  WI  E  
Sbjct: 712  FGMAVLAGKRLLELEPGNNITRYLLSQAYSL---CGKFTLEEEKAVNKPVGQCWI--ERN 766

Query: 401  NTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILSEKLALA 222
            NTVH FV GD S      L SW+ R+ +N +   +D+ L I+EEE E NS + SEKLA A
Sbjct: 767  NTVHTFVVGDQSYTYLDKLRSWLKRVAVNVKTHVFDNGLCIEEEERENNSIVHSEKLAFA 826

Query: 221  YAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNGNCSCGD 42
            +A +      R + +VKNLRMC  CH  A+ IS  YGCEIY+ D+ CLHHFK G+CSC D
Sbjct: 827  FAFIDPHNTPRILHIVKNLRMCRDCHDTAKYISLAYGCEIYLSDSNCLHHFKGGHCSCRD 886

Query: 41   YW 36
            YW
Sbjct: 887  YW 888


>ref|XP_007142200.1| hypothetical protein PHAVU_008G260600g [Phaseolus vulgaris]
            gi|561015333|gb|ESW14194.1| hypothetical protein
            PHAVU_008G260600g [Phaseolus vulgaris]
          Length = 893

 Score =  282 bits (721), Expect = 2e-73
 Identities = 151/304 (49%), Positives = 203/304 (66%), Gaps = 2/304 (0%)
 Frame = -1

Query: 941  DIITWNTLATSYVLHGCSSEAIELFERMRKM-RYKPNRSTFASIISAYGLAKKVDEGQRV 765
            DII+WN+L + YVLHG S  A++LF++M K  R  PNR T ASIISAY  A  VDEG+  
Sbjct: 592  DIISWNSLLSGYVLHGSSESALDLFDQMNKDDRLHPNRVTLASIISAYSHAGMVDEGKHA 651

Query: 764  FSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQG 585
            FS M+++++I+  L+HY AMV L GRSG L EA EFI  +P+E  + +W+A LTACR   
Sbjct: 652  FSNMSEDFKIILDLEHYSAMVYLLGRSGKLAEAQEFILNMPIEPNISVWTAFLTACRIHR 711

Query: 584  NVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSR-KVPNGPTGCSWIQGE 408
            N  +AI AGE LLEL+P+N +TQ LLSQ Y L     ++ K ++ +    P G SWI  E
Sbjct: 712  NFGMAIFAGERLLELDPENIITQHLLSQAYSLCGKYWEAPKMTKLEKEKIPVGQSWI--E 769

Query: 407  GENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILSEKLA 228
              N VH FV GD S+     LHSW+ R+ +N +    D+ L I+EEE E+ + + SEKLA
Sbjct: 770  MNNMVHTFVVGDQSKPYLDKLHSWLKRVHVNVKAHISDNGLCIEEEEKEDINSVHSEKLA 829

Query: 227  LAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNGNCSC 48
            +A+A++      + +R+VKNLR+C  CH  A+ IS  YGCEIY+ D+ CLHHFK+G+CSC
Sbjct: 830  IAFALIDSHHRPQILRIVKNLRVCKDCHDTAKYISLAYGCEIYLSDSNCLHHFKDGHCSC 889

Query: 47   GDYW 36
             DYW
Sbjct: 890  RDYW 893


>ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            isoform X1 [Glycine max] gi|571441335|ref|XP_006575413.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g19720-like isoform X2 [Glycine max]
          Length = 896

 Score =  281 bits (718), Expect = 4e-73
 Identities = 150/307 (48%), Positives = 198/307 (64%), Gaps = 5/307 (1%)
 Frame = -1

Query: 941  DIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEGQRVF 762
            DII+WN+L + YVLHGCS  A++LF++MRK    PNR T  SIISAY  A  VDEG+  F
Sbjct: 592  DIISWNSLLSGYVLHGCSESALDLFDQMRKDGVHPNRVTLTSIISAYSHAGMVDEGKHAF 651

Query: 761  SLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQGN 582
            S +++EYQI   L+HY AMV L GRSG L +A EFI+ +P+E    +W+A++TACR   N
Sbjct: 652  SNISEEYQIRLDLEHYSAMVYLLGRSGKLAKALEFIQNMPVEPNSSVWAALMTACRIHKN 711

Query: 581  VRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSR----KVPNGPTGCSWIQ 414
              +AI AGE + EL+P+N +TQ LLSQ Y +   + ++ K ++    K  N P G SWI 
Sbjct: 712  FGMAIFAGERMHELDPENIITQHLLSQAYSVCGKSLEAPKMTKLEKEKFVNIPVGQSWI- 770

Query: 413  GEGENTVHAFVTGDFSQIDG-KCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILSE 237
             E  N VH FV GD         LHSW+ R+  N +    D+ L I+EEE E  S + SE
Sbjct: 771  -EMNNMVHTFVVGDDQSTPYLDKLHSWLKRVGANVKAHISDNGLCIEEEEKENISSVHSE 829

Query: 236  KLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFKNGN 57
            KLA A+ ++      + +R+VKNLRMC  CH  A+ IS  YGCEIY+ D+ CLHHFK+G+
Sbjct: 830  KLAFAFGLIDSHHTPQILRIVKNLRMCRDCHDSAKYISLAYGCEIYLSDSNCLHHFKDGH 889

Query: 56   CSCGDYW 36
            CSC DYW
Sbjct: 890  CSCRDYW 896


>ref|XP_006386200.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344175|gb|ERP63997.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 810

 Score =  279 bits (713), Expect = 1e-72
 Identities = 153/313 (48%), Positives = 198/313 (63%), Gaps = 7/313 (2%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            +PS D IT N++ T YVLHGCS  A+ L ++MR++  KPNR T  +II A+ LA  VDEG
Sbjct: 501  IPSKDFITVNSMITGYVLHGCSDSALGLLDQMRELGLKPNRGTLVNIILAHSLAGMVDEG 560

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VFS MT+++QI+P  +HY AMV+LYGRSG L EA E I  +P++    +W A+LTACR
Sbjct: 561  RQVFSSMTEDFQIIPASEHYAAMVDLYGRSGRLKEAIELIDNMPIKPQSSVWYALLTACR 620

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSK----RSRKVPNGPTGC 426
              GN  LAI A E LL+LEP NS   + + Q Y +     D+ K      R     P G 
Sbjct: 621  NHGNSDLAIRARENLLDLEPWNSSIHQSILQSYAMHGKYEDAPKVKKLEKRNEVQKPKGQ 680

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENS-- 252
            SWI  E  NTVH+FV GD S      L SW+ RI M  +         I+EEE+EE    
Sbjct: 681  SWI--EVNNTVHSFVAGDQS-TSYSDLFSWVERISMEAKVHDLHCGCCIEEEEEEEKEEI 737

Query: 251  -GILSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLH 75
             GI SEKLALA+A+++     ++IR+VKNLR C  CHR A+ IS K+GCEIY+ D+   H
Sbjct: 738  VGIHSEKLALAFAIIRSPSAPQSIRIVKNLRTCADCHRMAKYISAKHGCEIYLSDSNFFH 797

Query: 74   HFKNGNCSCGDYW 36
            HFK+G CSCGDYW
Sbjct: 798  HFKSGCCSCGDYW 810


>ref|XP_007227217.1| hypothetical protein PRUPE_ppa019183mg [Prunus persica]
            gi|462424153|gb|EMJ28416.1| hypothetical protein
            PRUPE_ppa019183mg [Prunus persica]
          Length = 882

 Score =  276 bits (705), Expect = 1e-71
 Identities = 149/312 (47%), Positives = 199/312 (63%), Gaps = 6/312 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M S D ITWN+  + YVLHG S  A++LF++M+K  ++PNR TFA+II AY LA KVDEG
Sbjct: 576  MSSKDTITWNSAISGYVLHGRSDVALDLFDQMKKSGFEPNRGTFANIIHAYSLAGKVDEG 635

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
             + F  +T++YQI+P L+HY AMV+LYGRSG L EA EFI  +P+E    +W A+ TACR
Sbjct: 636  TQAFHSITEDYQIIPGLEHYSAMVDLYGRSGRLQEAMEFIEGMPIEPDSSVWGALFTACR 695

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGP----TGC 426
              GN+ LA+ AGE LL  EP N L Q+L+ Q Y L   + D SK  +   + P     G 
Sbjct: 696  IYGNLALAVRAGEHLLVSEPGNVLIQQLMLQAYALCGKSEDISKLRKFGKDYPKKKFLGQ 755

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLN--IQEEEDEENS 252
             WI  E +N++H F++GD  ++    L+ W+  IE     +K  D+ N    EEE+EE  
Sbjct: 756  CWI--EVKNSLHTFISGDRLKLCSIFLNLWLQNIE---EKAKTPDLCNELCVEEEEEEIG 810

Query: 251  GILSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHH 72
             I SEKLA A+A+       ++IR++KNLRMC  CHR A+ IS  +GC+IY+ D    HH
Sbjct: 811  WIHSEKLAFAFALSGSPSVPQSIRIMKNLRMCGDCHRIAKYISVAFGCDIYLSDVKSFHH 870

Query: 71   FKNGNCSCGDYW 36
            F NG CSCGDYW
Sbjct: 871  FSNGRCSCGDYW 882


>ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Vitis vinifera]
          Length = 1545

 Score =  266 bits (681), Expect = 7e-69
 Identities = 144/295 (48%), Positives = 187/295 (63%), Gaps = 4/295 (1%)
 Frame = -1

Query: 947  SMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEGQR 768
            S DII+WN+L   YVLHGCS  A++LF++M KM  KP+R TF SII A+ L+  VD+G++
Sbjct: 591  SKDIISWNSLIAGYVLHGCSDSALDLFDQMTKMGVKPSRGTFLSIIYAFSLSGMVDKGKQ 650

Query: 767  VFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACRRQ 588
            VFS M ++YQILP L+H+ AM++L GRSG L EA EFI  + +E    IW+A+LTA +  
Sbjct: 651  VFSSMMEDYQILPGLEHHSAMIDLLGRSGKLGEAIEFIEDMAIEPDSCIWAALLTASKIH 710

Query: 587  GNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSK----RSRKVPNGPTGCSW 420
            GN+ LAI AGE LLELEP N    + + Q+Y L     D SK      R     P GCSW
Sbjct: 711  GNIGLAIRAGECLLELEPSNFSIHQQILQMYALSGKFEDVSKLRKSEKRSETKQPLGCSW 770

Query: 419  IQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGILS 240
            I  E +N VH FV  D S+     LHSWI  +    +     D L I+EEE EE  G+ S
Sbjct: 771  I--EAKNIVHTFVADDRSRPYFDFLHSWIENVARKVKAPDQHDRLFIEEEEKEEIGGVHS 828

Query: 239  EKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLH 75
            EKLALA+A++      R++R+VKNLRMC  CH  A+ +S  Y CEIY+ D+ CLH
Sbjct: 829  EKLALAFALIDPSCAPRSVRIVKNLRMCGDCHGTAKFLSMLYSCEIYLSDSKCLH 883


>emb|CAA06829.1| DYW7 protein [Arabidopsis thaliana]
          Length = 406

 Score =  264 bits (674), Expect = 4e-68
 Identities = 137/312 (43%), Positives = 196/312 (62%), Gaps = 6/312 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DIITWN+L   YVLHG    A+ LF +M+     PNR T +SII A+GL   VDEG
Sbjct: 97   METKDIITWNSLIGGYVLHGSYGPALALFNQMKTQGITPNRGTLSSIILAHGLMGNVDEG 156

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VF  + ++Y I+P L+H  AMV LYGR+  L+EA +FI+ + ++    IW + LT CR
Sbjct: 157  KKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEALQFIQEMNIQSETPIWESFLTGCR 216

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLY----ELRRVTRDSSKRSRKVPNGPTGC 426
              G++ +AIHA E L  LEP+N+ T+ ++SQ+Y    +L R    +  R   +   P G 
Sbjct: 217  IHGDIDMAIHAAENLFSLEPENTATESIVSQIYALGAKLGRSLEGNKPRRDNLLKKPLGQ 276

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRI-EMNTRGSKYDDMLNIQEEEDEENSG 249
            SWI  E  N +H F TGD S++    L+  + ++  ++ R  +Y+  L I+EE  EE  G
Sbjct: 277  SWI--EVRNLIHTFTTGDQSKLCTDVLYPLVEKMSRLDNRSDQYNGELWIEEEGREETCG 334

Query: 248  ILSEKLALAYAVMKFRRPLR-TIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHH 72
            I SEK A+A+ ++      + TIR++KNLRMC  CH  A+ +SK+YGC+I + DT CLHH
Sbjct: 335  IHSEKFAMAFGLISSSGASKTTIRILKNLRMCRDCHDTAKYVSKRYGCDILLEDTRCLHH 394

Query: 71   FKNGNCSCGDYW 36
            FKNG+CSC DYW
Sbjct: 395  FKNGDCSCKDYW 406


>ref|NP_173402.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75263158|sp|Q9FXH1.1|PPR52_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g19720; AltName: Full=Protein DYW7
            gi|10086495|gb|AAG12555.1|AC007797_15 Unknown Protein
            [Arabidopsis thaliana] gi|332191770|gb|AEE29891.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 894

 Score =  264 bits (674), Expect = 4e-68
 Identities = 137/312 (43%), Positives = 196/312 (62%), Gaps = 6/312 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DIITWN+L   YVLHG    A+ LF +M+     PNR T +SII A+GL   VDEG
Sbjct: 585  METKDIITWNSLIGGYVLHGSYGPALALFNQMKTQGITPNRGTLSSIILAHGLMGNVDEG 644

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VF  + ++Y I+P L+H  AMV LYGR+  L+EA +FI+ + ++    IW + LT CR
Sbjct: 645  KKVFYSIANDYHIIPALEHCSAMVYLYGRANRLEEALQFIQEMNIQSETPIWESFLTGCR 704

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLY----ELRRVTRDSSKRSRKVPNGPTGC 426
              G++ +AIHA E L  LEP+N+ T+ ++SQ+Y    +L R    +  R   +   P G 
Sbjct: 705  IHGDIDMAIHAAENLFSLEPENTATESIVSQIYALGAKLGRSLEGNKPRRDNLLKKPLGQ 764

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRI-EMNTRGSKYDDMLNIQEEEDEENSG 249
            SWI  E  N +H F TGD S++    L+  + ++  ++ R  +Y+  L I+EE  EE  G
Sbjct: 765  SWI--EVRNLIHTFTTGDQSKLCTDVLYPLVEKMSRLDNRSDQYNGELWIEEEGREETCG 822

Query: 248  ILSEKLALAYAVMKFRRPLR-TIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHH 72
            I SEK A+A+ ++      + TIR++KNLRMC  CH  A+ +SK+YGC+I + DT CLHH
Sbjct: 823  IHSEKFAMAFGLISSSGASKTTIRILKNLRMCRDCHDTAKYVSKRYGCDILLEDTRCLHH 882

Query: 71   FKNGNCSCGDYW 36
            FKNG+CSC DYW
Sbjct: 883  FKNGDCSCKDYW 894


>gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis]
          Length = 880

 Score =  257 bits (656), Expect = 5e-66
 Identities = 140/310 (45%), Positives = 191/310 (61%), Gaps = 4/310 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M S DIITWN++   YVLHG S+ A++LF+ M K   KPNR TF SII +  L+  VD+G
Sbjct: 576  MLSKDIITWNSIIAGYVLHGFSNAALDLFDDMTKSGLKPNRGTFLSIIYSCSLSGLVDKG 635

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            +  FS +T++Y I+P L+HY A+V+LYGR G L EA EFI  +P+E    +W+A+LTA R
Sbjct: 636  RLAFSSITEDYNIVPGLEHYAAVVDLYGRPGRLGEAMEFIENMPVEPDSSVWAALLTASR 695

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGPT----GC 426
               N+   + A + +L+LEP N L QRL +Q   L   + +  K  +      T    G 
Sbjct: 696  NHRNIGFTVRALDKILDLEPGNYLIQRLRAQADALVAKSENDPKMRKLEKENATKRHLGR 755

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEEDEENSGI 246
             WI  E +N V+ FV GD S+     L+ WI  I        + + L I+EEE EE   +
Sbjct: 756  CWI--ELQNRVYTFVNGDQSE---PYLYPWIHDIAGKASKYGFHEGLCIEEEEKEEVGRV 810

Query: 245  LSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFK 66
              EK+A+A+A++ F R  + IR+VK+LRMC +CH  A+ ISK YGCEIYV D+ CLH F 
Sbjct: 811  HCEKIAIAFALIGFPRKAQCIRIVKSLRMCGNCHETAKYISKTYGCEIYVTDSKCLHRFS 870

Query: 65   NGNCSCGDYW 36
            NG+CSC DYW
Sbjct: 871  NGHCSCKDYW 880


>ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Cucumis sativus]
          Length = 1463

 Score =  246 bits (629), Expect = 7e-63
 Identities = 136/293 (46%), Positives = 185/293 (63%), Gaps = 11/293 (3%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M S DIITWN++   Y+LHGCS  A +LF++MR +  +PNR T ASII AYG+A  VD+G
Sbjct: 584  MSSKDIITWNSIIAGYILHGCSDSAFQLFDQMRNLGIRPNRGTLASIIHAYGIAGMVDKG 643

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            + VFS +T+E+QILP LDHY+AMV+LYGRSG L +A EFI  +P+E  V IW+++LTACR
Sbjct: 644  RHVFSSITEEHQILPTLDHYLAMVDLYGRSGRLADAIEFIEDMPIEPDVSIWTSLLTACR 703

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYEL----------RRVTRDSSKRSRKVP 444
              GN+ LA+ A + L ELEPDN +  RLL Q Y L          R++ ++S+ +     
Sbjct: 704  FHGNLNLAVLAAKRLHELEPDNHVIYRLLVQAYALYGKFEQTLKVRKLGKESAMKK---- 759

Query: 443  NGPTGCSWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRIEMNTRGSKYDDMLNIQEEED 264
               T   W+  E  N VH FVTGD S++D   L++WI  IE   +       L+I+EEE 
Sbjct: 760  --CTAQCWV--EVRNKVHLFVTGDQSKLD--VLNTWIKSIEGKVKKFNNHHQLSIEEEEK 813

Query: 263  EEN-SGILSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGC 108
            EE   G   EK A A+ ++      ++I++VKNLRMC  CH+ A+ IS  Y C
Sbjct: 814  EEKIGGFHCEKFAFAFGLIGSSHTRKSIKIVKNLRMCVDCHQMAKYISAAYEC 866


>ref|XP_006828626.1| hypothetical protein AMTR_s00129p00082590 [Amborella trichopoda]
            gi|548833416|gb|ERM96042.1| hypothetical protein
            AMTR_s00129p00082590 [Amborella trichopoda]
          Length = 646

 Score =  233 bits (593), Expect = 1e-58
 Identities = 131/310 (42%), Positives = 190/310 (61%), Gaps = 4/310 (1%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DII++N++ +  V++G    AI++FE+M+ +R  PN+ T  S+I+A+GL K V EG
Sbjct: 343  MLTKDIISYNSIISGLVMNGKGRNAIDIFEQMKLLRIAPNQRTILSVINAFGLEKMVPEG 402

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            +++FS M +E+Q+ P ++H  AMV L GR+ LL EA +FI  +PLE    IWSA L+ACR
Sbjct: 403  EKLFSTMAEEFQLFPTIEHCSAMVGLLGRARLLREAMDFIDNMPLEPDASIWSAFLSACR 462

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLYELRRVTRDSSKRSRKVPNGPTGCSWIQ 414
              GN+RLA HA + LLELEP++ L  R+LSQ+Y+  R +R+  K    V     G +WI 
Sbjct: 463  CNGNIRLATHAIKKLLELEPEDPLINRILSQIYDGSR-SRNHVKNQAMVK--AFGHTWI- 518

Query: 413  GEGENTVHAFVTGDFSQIDGKCLHSWIGRIE--MNTRGSKYDD--MLNIQEEEDEENSGI 246
             E +N VH FV G  S      L   +  +     ++G K  D   L ++EEE EE  G 
Sbjct: 519  -EVKNLVHEFVAGMVSLPSSDSLSIKLKSLAEIYKSKGRKLPDAGFLQLEEEEKEEIIGT 577

Query: 245  LSEKLALAYAVMKFRRPLRTIRMVKNLRMCDHCHRFAELISKKYGCEIYVHDTICLHHFK 66
               KLA+ + ++    P ++IR+VKN+R+C  CH FA+ +S+    EI + D   LH FK
Sbjct: 578  HCVKLAITFGIINIHAP-QSIRVVKNIRVCQECHDFAKFVSRNCSQEILLKDPKTLHCFK 636

Query: 65   NGNCSCGDYW 36
            NG CSC DYW
Sbjct: 637  NGQCSCKDYW 646


>ref|XP_002893064.1| hypothetical protein ARALYDRAFT_472198 [Arabidopsis lyrata subsp.
            lyrata] gi|297338906|gb|EFH69323.1| hypothetical protein
            ARALYDRAFT_472198 [Arabidopsis lyrata subsp. lyrata]
          Length = 1490

 Score =  226 bits (575), Expect = 1e-56
 Identities = 121/280 (43%), Positives = 173/280 (61%), Gaps = 6/280 (2%)
 Frame = -1

Query: 953  MPSMDIITWNTLATSYVLHGCSSEAIELFERMRKMRYKPNRSTFASIISAYGLAKKVDEG 774
            M + DIITWN+L   YVLHG    A+ELF +M+    KPNR T +SII A+GL   VDEG
Sbjct: 585  METKDIITWNSLIGGYVLHGSYGPALELFNQMKTQGIKPNRGTLSSIILAHGLMGNVDEG 644

Query: 773  QRVFSLMTDEYQILPCLDHYVAMVNLYGRSGLLDEAFEFIRTIPLELGVQIWSAVLTACR 594
            ++VF  + ++Y I+P L+H  AMV+LYGRS  L+EA +FI+ + ++    IW + LT CR
Sbjct: 645  KKVFYSIANDYHIIPALEHCSAMVSLYGRSNRLEEALQFIQEMNIQSETPIWESFLTGCR 704

Query: 593  RQGNVRLAIHAGEVLLELEPDNSLTQRLLSQLY----ELRRVTRDSSKRSRKVPNGPTGC 426
              G++ +AIHA E L  LEP+N++T+ ++SQ+Y    +L R       R   +   P G 
Sbjct: 705  IHGDIDMAIHAAENLFSLEPENTVTENIVSQIYALGAKLGRSLEGKKPRRDNLLKKPLGQ 764

Query: 425  SWIQGEGENTVHAFVTGDFSQIDGKCLHSWIGRI-EMNTRGSKYDDMLNIQEEEDEENSG 249
            SWI  E  N +H F TGD S++    L+ W+ ++  ++ R  +Y+  L I+EE  EE  G
Sbjct: 765  SWI--EVRNLIHTFTTGDQSKLCTDLLYPWVEKMCRVDNRSDQYNGELLIEEEGREETCG 822

Query: 248  ILSEKLALAYA-VMKFRRPLRTIRMVKNLRMCDHCHRFAE 132
            I SEK A+A+  +   R P  TIR++KNLRMC  CH  A+
Sbjct: 823  IHSEKFAMAFGLISSSRAPKATIRILKNLRMCRDCHNTAK 862

