BLASTX nr result

ID: Akebia24_contig00018571 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00018571
         (3132 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containi...  1106   0.0  
ref|XP_007017649.1| Pentatricopeptide repeat (PPR-like) superfam...  1051   0.0  
ref|XP_007227217.1| hypothetical protein PRUPE_ppa019183mg [Prun...  1035   0.0  
ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citr...  1026   0.0  
gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis]    1023   0.0  
ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutr...   986   0.0  
ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containi...   983   0.0  
ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containi...   977   0.0  
ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containi...   972   0.0  
ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containi...   969   0.0  
ref|NP_173402.2| pentatricopeptide repeat-containing protein [Ar...   964   0.0  
ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containi...   957   0.0  
ref|XP_006386200.1| pentatricopeptide repeat-containing family p...   954   0.0  
ref|XP_007142200.1| hypothetical protein PHAVU_008G260600g [Phas...   942   0.0  
ref|XP_003615696.1| Pentatricopeptide repeat-containing protein ...   939   0.0  
ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containi...   930   0.0  
ref|XP_004168675.1| PREDICTED: pentatricopeptide repeat-containi...   929   0.0  
ref|XP_004301846.1| PREDICTED: pentatricopeptide repeat-containi...   927   0.0  
ref|XP_002893064.1| hypothetical protein ARALYDRAFT_472198 [Arab...   922   0.0  
gb|EYU38829.1| hypothetical protein MIMGU_mgv1a001151mg [Mimulus...   888   0.0  

>ref|XP_002280968.2| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Vitis vinifera]
          Length = 1545

 Score = 1106 bits (2861), Expect = 0.0
 Identities = 553/890 (62%), Positives = 686/890 (77%), Gaps = 6/890 (0%)
 Frame = -2

Query: 3026 MENSIILQKFISKPPPVADPWKQQVSTEFSSKPIKHTVSFTK------NSATPLPSEIHL 2865
            MEN I+  K     PP+A P KQ  S E SS+ I+  VSFTK          P  ++ HL
Sbjct: 1    MENLILPCK---SRPPLATPSKQGTSFECSSRIIQPRVSFTKIHQPLTPKLKPKVTDAHL 57

Query: 2864 NYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDP 2685
            N+LC+NG+L  AI  LD IA+ G  +KP TY+ LLQSCID  S E GR LHARIGL+++ 
Sbjct: 58   NHLCKNGRLADAIACLDAIAQGGSNVKPNTYMQLLQSCIDQGSAELGRKLHARIGLLEEM 117

Query: 2684 NPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMM 2505
            NPFV+TKL+SMYAKCG L +AR+VF  M ERNL+ W+AMIG Y+REQ W+E+++ FF MM
Sbjct: 118  NPFVETKLVSMYAKCGSLGEARKVFGEMRERNLYAWSAMIGAYSREQMWREVVQHFFFMM 177

Query: 2504 MEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGEL 2325
             EDGI PDEFLLPKIL+AC N GDA+TGKLIHS +IR G++  + V+NS+L++YAKCG L
Sbjct: 178  -EDGIVPDEFLLPKILQACGNCGDAETGKLIHSLVIRCGMNFNIRVSNSILAVYAKCGRL 236

Query: 2324 NSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASY 2145
            + A+ FFE MD +D V+WNSII+GYCQ G+ EK+ +LF++MQ EG+EPGLVTWNILI SY
Sbjct: 237  SCARRFFENMDYRDRVSWNSIITGYCQKGELEKSHQLFEKMQEEGIEPGLVTWNILINSY 296

Query: 2144 NQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNG 1965
            +QSG CD AMELM KM    I PDVFTWT MISGFAQNNR +QALELFREM+L G+EPNG
Sbjct: 297  SQSGKCDDAMELMKKMESFRIVPDVFTWTSMISGFAQNNRRSQALELFREMLLAGIEPNG 356

Query: 1964 VTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEM 1785
            VTV            L KG ELH++ VK+G + ++LVGNSLIDMYSK G+ E ARRVF+M
Sbjct: 357  VTVTSGISACASLKALKKGMELHSVAVKIGCVEDLLVGNSLIDMYSKSGELEDARRVFDM 416

Query: 1784 ILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQA 1605
            IL+KDV TWNS+IGGY QAGYCGKAYDLF+KM  SDVPPNVVTWN M SGY+Q GDEDQA
Sbjct: 417  ILKKDVYTWNSMIGGYCQAGYCGKAYDLFIKMHESDVPPNVVTWNAMISGYIQNGDEDQA 476

Query: 1604 MVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILP 1425
            M LF RME +G++KR+TASWN LIAG LQNG KNKALGIFRQMQ  C +PNS+T+LSILP
Sbjct: 477  MDLFHRMEKDGLIKRDTASWNSLIAGYLQNGHKNKALGIFRQMQSFCIRPNSVTMLSILP 536

Query: 1424 ACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVIS 1245
            ACANL++AKKVKEIHGC+LRRNL S +S+ N LIDTYAKSG+IV A+ +F+ + S+D+IS
Sbjct: 537  ACANLVAAKKVKEIHGCILRRNLGSELSVANCLIDTYAKSGNIVYAQTIFQGISSKDIIS 596

Query: 1244 WNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMT 1065
            WN+LIAGYVLHG  + +LDLF++M  +G  P+RGTF S I A+SL+ MV++GKQ FSSM 
Sbjct: 597  WNSLIAGYVLHGCSDSALDLFDQMTKMGVKPSRGTFLSIIYAFSLSGMVDKGKQVFSSMM 656

Query: 1064 KDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLA 885
            +DY+ILPGLEH+SAM+ LLGRSG+  EA EFIE+M I+PDS +W ALLTA +IHGNIGLA
Sbjct: 657  EDYQILPGLEHHSAMIDLLGRSGKLGEAIEFIEDMAIEPDSCIWAALLTASKIHGNIGLA 716

Query: 884  IHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNK 705
            I A E L+ LEP N+ +H+ +LQ+YAL GK ED S++RK+  R+ T   LGCS I   N 
Sbjct: 717  IRAGECLLELEPSNFSIHQQILQMYALSGKFEDVSKLRKSEKRSETKQPLGCSWIEAKNI 776

Query: 704  EHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAI 525
             HTF+  DRS P  D +++ I+++  ++K  APD  + +L I+EEEKE IGG+HSEKLA+
Sbjct: 777  VHTFVADDRSRPYFDFLHSWIENVARKVK--APDQHD-RLFIEEEEKEEIGGVHSEKLAL 833

Query: 524  SFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFH 375
            +FALI      +S+RI+KN RMC DCH TAK +S++Y  EIYL DSKC H
Sbjct: 834  AFALIDPSCAPRSVRIVKNLRMCGDCHGTAKFLSMLYSCEIYLSDSKCLH 883


>ref|XP_007017649.1| Pentatricopeptide repeat (PPR-like) superfamily protein isoform 1
            [Theobroma cacao] gi|590593723|ref|XP_007017650.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao] gi|508722977|gb|EOY14874.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao] gi|508722978|gb|EOY14875.1|
            Pentatricopeptide repeat (PPR-like) superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 890

 Score = 1051 bits (2719), Expect = 0.0
 Identities = 519/897 (57%), Positives = 670/897 (74%)
 Frame = -2

Query: 3026 MENSIILQKFISKPPPVADPWKQQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRN 2847
            MEN +I     +  PPV  P K +  +EFS  P K   S TK +  P  S+ HLNYL RN
Sbjct: 1    MENLMIP---CTSKPPVIIPTKHENLSEFSQTPTKLAFSNTKKTNNPKISDSHLNYLSRN 57

Query: 2846 GQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQT 2667
            G+L +AITALD+IA+ G +++  T+I+LLQ+CID  S+E GR LHAR+ LV++ +PFV+T
Sbjct: 58   GRLTEAITALDSIAQSGSQVRANTFINLLQACIDFGSLELGRKLHARVHLVKESDPFVET 117

Query: 2666 KLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGIT 2487
            KL+SMYAKCG   DAR+VFD M ERNL+ W+AMIG  +RE RWKE++ELFFLMM EDG+ 
Sbjct: 118  KLVSMYAKCGSFVDARKVFDKMKERNLYAWSAMIGACSRELRWKEVVELFFLMM-EDGVL 176

Query: 2486 PDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWF 2307
            PDE L PK L+ACAN GD +TG+L+HS +IR G+     V+NS+L++YAKCG+L+SA+ F
Sbjct: 177  PDEILFPKFLQACANCGDVRTGRLLHSLVIRLGMVCFARVSNSVLAVYAKCGKLSSARRF 236

Query: 2306 FEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNC 2127
            FE M+ +D VTWNS+I  YCQ G+N++A  LF  M  +G++P LVTWNILI SYNQ G C
Sbjct: 237  FENMNERDIVTWNSMILAYCQKGENDEAYGLFYGMWKDGIQPCLVTWNILINSYNQLGQC 296

Query: 2126 DLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXX 1947
            D+AM LM +M    I PDVFTWT MISG AQN R  QAL LF+EM+L G++PNGVT+   
Sbjct: 297  DVAMGLMKEMEISRIIPDVFTWTSMISGLAQNGRRWQALCLFKEMLLAGIKPNGVTITSA 356

Query: 1946 XXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDV 1767
                     LN G+E+H++ +K G + NVLVGNSLIDMY+KCG+ E AR+VF+ I E+DV
Sbjct: 357  VSACASLRVLNMGREIHSIALKKGIIDNVLVGNSLIDMYAKCGELEAARQVFDKIEERDV 416

Query: 1766 VTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQR 1587
             TWNS++ GY QAGYCGKAY+LFMKM+ SD+ PNV+TWN M SGY+Q GDED+AM LFQR
Sbjct: 417  YTWNSMVAGYCQAGYCGKAYELFMKMRESDLKPNVITWNTMISGYIQNGDEDRAMDLFQR 476

Query: 1586 METEGILKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLL 1407
            ME +G ++RNTASWN  IAG +Q G+ +KA G+FRQMQ      NS+T+LSILP CANL+
Sbjct: 477  MEQDGKIRRNTASWNAFIAGYVQLGEIDKAFGVFRQMQSCSVSSNSVTILSILPGCANLV 536

Query: 1406 SAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIA 1227
            +AKKVKEIHGCVLRRNLE  +SI NSLIDTYAKSG+I+ +R +F+ + +RD+ISWN++I 
Sbjct: 537  AAKKVKEIHGCVLRRNLEFVLSISNSLIDTYAKSGNILYSRIIFDGMSTRDIISWNSIIG 596

Query: 1226 GYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEIL 1047
            GYVLHG  + +LDLFN+MR LG  PNRGTF S ILA+ +A MV+EGKQ FSS++ +YEI+
Sbjct: 597  GYVLHGCSDAALDLFNQMRKLGLKPNRGTFLSIILAHGIAGMVDEGKQIFSSISDNYEII 656

Query: 1046 PGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQ 867
            P +EHY+AM+ + GRSGR  EA EFIE+M I+PDS+VWT+LLTA RIH +I LA+ A E+
Sbjct: 657  PAVEHYAAMIDVYGRSGRLGEAVEFIEDMPIEPDSSVWTSLLTASRIHRDIALAVLAGER 716

Query: 866  LITLEPENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMT 687
            L+ LEP N +++R++ Q+Y L GK +D  ++RK    N    SLG S I V N  H F+T
Sbjct: 717  LLDLEPANILINRVMFQIYVLSGKLDDPLKVRKLEKENILRRSLGHSWIEVRNTVHKFVT 776

Query: 686  GDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIA 507
            GD+S P +D +Y+ + SI  E+ +        +  ++EEEKE  GG+HSEKL ++FALI 
Sbjct: 777  GDQSKPCADLLYSWVKSIAREVNI---HDHHGRFFLEEEEKEETGGVHSEKLTLAFALIG 833

Query: 506  SPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
             PY+ +SIRI+KN RMC +CH TAK +SL +G EIYL D KCFHHFKNGQCSC DYW
Sbjct: 834  LPYSPRSIRIVKNTRMCSNCHLTAKYISLKFGCEIYLSDRKCFHHFKNGQCSCGDYW 890


>ref|XP_007227217.1| hypothetical protein PRUPE_ppa019183mg [Prunus persica]
            gi|462424153|gb|EMJ28416.1| hypothetical protein
            PRUPE_ppa019183mg [Prunus persica]
          Length = 882

 Score = 1035 bits (2676), Expect = 0.0
 Identities = 507/851 (59%), Positives = 646/851 (75%)
 Frame = -2

Query: 2888 PLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHA 2709
            P  ++ HLNYLC+NGQ  +AIT LD+IA+ G K+ P TY++LLQSCID++SI+ GR LH 
Sbjct: 37   PKFTDTHLNYLCKNGQFSEAITVLDSIAQIGSKVPPTTYMNLLQSCIDTNSIQLGRKLHE 96

Query: 2708 RIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEI 2529
             I LV++ NPFV+TKL+SMYAKCG L+DAR+VF  M ERNL+TW+AMIG   R+QRWKE+
Sbjct: 97   HIDLVEEINPFVETKLVSMYAKCGFLDDARKVFHAMRERNLYTWSAMIGACLRDQRWKEV 156

Query: 2528 IELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLS 2349
            +ELFF  MM+DG+ PD FL PKIL+AC N  + +  KLIHS  +R  L SC+HVNNS+L+
Sbjct: 157  VELFF-SMMKDGVLPDYFLFPKILQACGNCSNIEATKLIHSIAVRCNLTSCIHVNNSILA 215

Query: 2348 MYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVT 2169
            +YAKCG L  A+ FF+ MD +D V+WN+IISGYC  G++E+A RLFD M  EG+EPGLVT
Sbjct: 216  VYAKCGILEWARRFFDNMDERDGVSWNAIISGYCHKGESEEARRLFDAMSKEGIEPGLVT 275

Query: 2168 WNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMM 1989
            WN LIAS+NQ  +CD+AMELM +M   GITPDV+TWT MISGFAQNNR +Q+L+ F++M+
Sbjct: 276  WNTLIASHNQLRHCDVAMELMRRMESCGITPDVYTWTSMISGFAQNNRKHQSLDFFKKML 335

Query: 1988 LVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPE 1809
            L GV+PNG+T+            LN+G E+++L +K+G + +VLVGNSLIDM+SKCG+ E
Sbjct: 336  LAGVQPNGITITSAISACTSLKSLNQGLEIYSLAIKMGFIDDVLVGNSLIDMFSKCGEVE 395

Query: 1808 VARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYL 1629
             A+++F MI +KDV TWNS+IGGY QA YCGKAY+LF KMQ SDV PN VTWNVM +GY+
Sbjct: 396  AAQKIFSMIPDKDVYTWNSMIGGYCQAKYCGKAYELFTKMQESDVHPNAVTWNVMITGYM 455

Query: 1628 QKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNS 1449
            Q GD DQAM LFQRME +G +KRNTASWN L++G LQ G+KNKA G+FRQMQ  C  PNS
Sbjct: 456  QNGDADQAMDLFQRMEKDGKIKRNTASWNSLVSGYLQLGEKNKAFGVFRQMQAYCVNPNS 515

Query: 1448 ITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFED 1269
            +T+LS+LP+CANL++ KKVKEIHG VLRRNLES + + N+LIDTYAKSG+I  +R +F+ 
Sbjct: 516  VTILSVLPSCANLVAMKKVKEIHGSVLRRNLESEIPVANALIDTYAKSGNIAYSRIIFDT 575

Query: 1268 LLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEG 1089
            + S+D I+WN+ I+GYVLHG  +++LDLF++M+  GF PNRGTFA+ I AYSLA  V+EG
Sbjct: 576  MSSKDTITWNSAISGYVLHGRSDVALDLFDQMKKSGFEPNRGTFANIIHAYSLAGKVDEG 635

Query: 1088 KQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACR 909
             Q F S+T+DY+I+PGLEHYSAMV L GRSGR +EA EFIE M I+PDS+VW AL TACR
Sbjct: 636  TQAFHSITEDYQIIPGLEHYSAMVDLYGRSGRLQEAMEFIEGMPIEPDSSVWGALFTACR 695

Query: 908  IHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGC 729
            I+GN+ LA+ A E L+  EP N ++ +L+LQ YAL GKSED S++RK          LG 
Sbjct: 696  IYGNLALAVRAGEHLLVSEPGNVLIQQLMLQAYALCGKSEDISKLRKFGKDYPKKKFLGQ 755

Query: 728  SRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGG 549
              I V N  HTF++GDR      SI+  +     E K   PD    +LC++EEE+E IG 
Sbjct: 756  CWIEVKNSLHTFISGDRL--KLCSIFLNLWLQNIEEKAKTPDLC-NELCVEEEEEE-IGW 811

Query: 548  IHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHF 369
            IHSEKLA +FAL  SP   QSIRI+KN RMC DCH+ AK +S+ +G +IYL D K FHHF
Sbjct: 812  IHSEKLAFAFALSGSPSVPQSIRIMKNLRMCGDCHRIAKYISVAFGCDIYLSDVKSFHHF 871

Query: 368  KNGQCSCRDYW 336
             NG+CSC DYW
Sbjct: 872  SNGRCSCGDYW 882


>ref|XP_006435073.1| hypothetical protein CICLE_v10000229mg [Citrus clementina]
            gi|557537195|gb|ESR48313.1| hypothetical protein
            CICLE_v10000229mg [Citrus clementina]
          Length = 889

 Score = 1026 bits (2653), Expect = 0.0
 Identities = 514/884 (58%), Positives = 662/884 (74%), Gaps = 2/884 (0%)
 Frame = -2

Query: 2981 PVADPWKQQV--STEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTI 2808
            PV  P K +   S+ FS     +T S TK S  P   + HL++LC NG+L +AIT LD+I
Sbjct: 11   PVIIPQKHKPDSSSGFSPHSNNYTRSLTKKS-NPRFRDTHLDFLCGNGRLNEAITVLDSI 69

Query: 2807 AKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLE 2628
            A  G K++  TYI+LLQ+CIDS+SI   R LHA + LV + + FV+TKLLS+YAKCG L+
Sbjct: 70   ATQGAKVRRNTYINLLQACIDSNSIHLARKLHAFLNLVTEIDVFVKTKLLSVYAKCGCLD 129

Query: 2627 DARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKAC 2448
            DAR VF+ M ERNL+TW+AMIG Y+R+QRW+E++ELFFLM+ +DG+ PD+FL PKIL+AC
Sbjct: 130  DAREVFEDMRERNLYTWSAMIGAYSRDQRWREVVELFFLMV-QDGLFPDDFLFPKILQAC 188

Query: 2447 ANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWN 2268
             N GD + GKL+HS +I+ G+     V NS+L++Y KCG+L  A+ FFE MD KD V WN
Sbjct: 189  GNCGDFEAGKLMHSLVIKLGMSCVRRVRNSVLAVYVKCGKLIWARRFFESMDEKDGVAWN 248

Query: 2267 SIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGR 2088
            S+ISGY Q G+N++A RLFD+M  E ++ G+VT+NILI SYNQ G CD+AME++ +M   
Sbjct: 249  SMISGYFQIGENDEAHRLFDKMCREEIKLGVVTFNILIRSYNQLGQCDVAMEMVKRMESL 308

Query: 2087 GITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKG 1908
            GITPDVFTWTCMISGFAQN R +QAL+LF+EM  VGV PNGVT+            L  G
Sbjct: 309  GITPDVFTWTCMISGFAQNGRTSQALDLFKEMSFVGVMPNGVTITSAISACTDLKALAMG 368

Query: 1907 KELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQA 1728
             E+H+L VK+G   +VLVGNSLI+MYSKC + E A RVF+MI +KDV +WNS+I GY QA
Sbjct: 369  MEIHSLAVKMGFTDDVLVGNSLINMYSKCEELEAAERVFDMIKDKDVYSWNSMIAGYCQA 428

Query: 1727 GYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTAS 1548
            GYCGKAY+LF+KMQ SDVPPNV+TWNV+ SGY+Q G+ED+A+ LFQRM     +KRNTAS
Sbjct: 429  GYCGKAYELFIKMQESDVPPNVITWNVLISGYIQNGNEDEAVDLFQRMGKNDKVKRNTAS 488

Query: 1547 WNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVL 1368
            WN LIAG  Q GQKN ALG+FR+MQ  C  PN +T+LS+LPACA L+++ KVKEIHGCVL
Sbjct: 489  WNSLIAGYQQLGQKNNALGVFRKMQSSCFYPNCVTILSVLPACAYLVASNKVKEIHGCVL 548

Query: 1367 RRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLD 1188
            RR+LES++ ++NSLIDTYAKSG+IV +R +F+++ S+D+I+WN+LI GYVLHG+ + +LD
Sbjct: 549  RRSLESSLPVMNSLIDTYAKSGNIVYSRTIFDEMSSKDIITWNSLICGYVLHGFWHAALD 608

Query: 1187 LFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALL 1008
            LF++M+  G  PNRGTF S ILA+SLA MV+ GKQ F S+T+ Y+I+P +EHYSAM+ L 
Sbjct: 609  LFDQMKSFGLKPNRGTFLSIILAHSLAGMVDLGKQVFCSITECYQIIPMIEHYSAMIDLY 668

Query: 1007 GRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHR 828
            GRSG+ +EA EFIE+M I+PDS++W ALLTACRIHGNI LA+ A E+L  LEP + ++ R
Sbjct: 669  GRSGKLEEAMEFIEDMPIEPDSSIWEALLTACRIHGNIDLAVLAIERLFDLEPGDVLIQR 728

Query: 827  LLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYA 648
            L+LQ+YA+ GK EDA ++RK    N   NS G S I V N  +TF+TG  S   SD +Y+
Sbjct: 729  LILQIYAICGKPEDALKVRKLEKENTRRNSFGQSWIEVKNLVYTFVTGGWSESYSDLLYS 788

Query: 647  RIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKN 468
             + ++      V   S  + LCI+EEEKE I GIHSEKLA++FALI S     +IRI+KN
Sbjct: 789  WLQNVPEN---VTARSCHSGLCIEEEEKEEISGIHSEKLALAFALIGSSQAPHTIRIVKN 845

Query: 467  FRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
             RMC  CHKTAK VS ++  EI+L DSKC HHFKNGQCSC DYW
Sbjct: 846  IRMCVHCHKTAKYVSKMHHCEIFLADSKCLHHFKNGQCSCGDYW 889


>gb|EXB97347.1| hypothetical protein L484_024210 [Morus notabilis]
          Length = 880

 Score = 1023 bits (2644), Expect = 0.0
 Identities = 522/904 (57%), Positives = 662/904 (73%), Gaps = 7/904 (0%)
 Frame = -2

Query: 3026 MENSIILQKFISKPPPVAD--PWKQQV--STEFSSKPIKHTVSFTKNSATPLPSEIHLNY 2859
            MEN II      KPPPV    P K  +   +EFS+     T+SF          + HL+ 
Sbjct: 1    MENVIIPCNL--KPPPVLPIIPTKAGIIQPSEFST-----TISF----------DSHLDK 43

Query: 2858 LCRNGQLKQAITALDTIAKHG--YKLKPRTYISLLQSCIDSDSIEQGRMLHARI-GLVQD 2688
            LCR+G+L  A+ ALD IA+ G   KLKPRTY++LLQSCID++SIE GR LHAR+ GLVQ 
Sbjct: 44   LCRDGRLSDAVAALDAIAERGSKVKLKPRTYMNLLQSCIDTNSIELGRKLHARMMGLVQY 103

Query: 2687 PNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLM 2508
             NPFV+TKL+SMYAKCG L DARRVFD M ERNLFTW+AMIG  +REQRWKE+++LF+LM
Sbjct: 104  VNPFVETKLVSMYAKCGCLHDARRVFDGMRERNLFTWSAMIGACSREQRWKEVLKLFYLM 163

Query: 2507 MMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGE 2328
            M  DGI PD+FLLPKIL+AC N  D +T K+IHS ++R G    + V NS+L++YAKCG+
Sbjct: 164  M-GDGILPDKFLLPKILEACGNCADFKTAKVIHSMVVRCGFCGSIRVINSILAVYAKCGK 222

Query: 2327 LNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIAS 2148
            LN A+ FFE MD +D V+WN+IISG+CQ+G+ E+A RLFD ++ EG EPGLVTWNI+IAS
Sbjct: 223  LNWARRFFESMDKRDLVSWNAIISGFCQNGRMEEATRLFDAVREEGTEPGLVTWNIMIAS 282

Query: 2147 YNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPN 1968
            YNQ G  D+AM LM KM   GI PDVFTWT +ISGFAQNNR NQAL+LF+EM+L GV+PN
Sbjct: 283  YNQLGQTDVAMGLMKKMESLGIVPDVFTWTSLISGFAQNNRRNQALDLFKEMLLAGVKPN 342

Query: 1967 GVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFE 1788
             VT+            L KG E+HA  +K+G + +VLVGNSLIDMYSKCG+ E A+ VF+
Sbjct: 343  AVTITSAVSACASLKSLGKGLEIHAFSIKIGLIEDVLVGNSLIDMYSKCGELEAAQEVFD 402

Query: 1787 MILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQ 1608
            MI+EKDV TWNSLIGGY QAGYCGKA +LFMKMQ SDV PNV+TWNVM SGY+Q GDED+
Sbjct: 403  MIIEKDVFTWNSLIGGYCQAGYCGKACELFMKMQESDVAPNVITWNVMISGYIQNGDEDE 462

Query: 1607 AMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSIL 1428
            AM LF+RME +G +KRNTASWN L+AG L  G+K+KALGIFRQMQ  C  PN +T+LS+L
Sbjct: 463  AMDLFRRMEKDGKVKRNTASWNSLVAGYLHVGEKDKALGIFRQMQSYCVIPNLVTMLSVL 522

Query: 1427 PACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVI 1248
            P CANLL+ KKV+EIH C+LRR L+S + + NSL+DTYAK+G++  +R +F+ +LS+D+I
Sbjct: 523  PTCANLLAEKKVREIHCCILRRVLDSELPVANSLLDTYAKAGNMTYSRTIFDRMLSKDII 582

Query: 1247 SWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSM 1068
            +WN++IAGYVLHG+ N +LDLF+ M   G  PNRGTF S I + SL+ +V++G+  FSS+
Sbjct: 583  TWNSIIAGYVLHGFSNAALDLFDDMTKSGLKPNRGTFLSIIYSCSLSGLVDKGRLAFSSI 642

Query: 1067 TKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGL 888
            T+DY I+PGLEHY+A+V L GR GR  EA EFIE M ++PDS+VW ALLTA R H NIG 
Sbjct: 643  TEDYNIVPGLEHYAAVVDLYGRPGRLGEAMEFIENMPVEPDSSVWAALLTASRNHRNIGF 702

Query: 887  AIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNN 708
             + A ++++ LEP NY++ RL  Q  AL  KSE+  +MRK    N T   LG   I + N
Sbjct: 703  TVRALDKILDLEPGNYLIQRLRAQADALVAKSENDPKMRKLEKENATKRHLGRCWIELQN 762

Query: 707  KEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLA 528
            + +TF+ GD+S P    +Y  I  I  +    +       LCI+EEEKE +G +H EK+A
Sbjct: 763  RVYTFVNGDQSEP---YLYPWIHDIAGK---ASKYGFHEGLCIEEEEKEEVGRVHCEKIA 816

Query: 527  ISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSC 348
            I+FALI  P  +Q IRI+K+ RMC +CH+TAK +S  YG EIY+ DSKC H F NG CSC
Sbjct: 817  IAFALIGFPRKAQCIRIVKSLRMCGNCHETAKYISKTYGCEIYVTDSKCLHRFSNGHCSC 876

Query: 347  RDYW 336
            +DYW
Sbjct: 877  KDYW 880


>ref|XP_006416469.1| hypothetical protein EUTSA_v10006756mg [Eutrema salsugineum]
            gi|557094240|gb|ESQ34822.1| hypothetical protein
            EUTSA_v10006756mg [Eutrema salsugineum]
          Length = 893

 Score =  986 bits (2548), Expect = 0.0
 Identities = 486/882 (55%), Positives = 636/882 (72%), Gaps = 4/882 (0%)
 Frame = -2

Query: 2969 PWKQQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYK 2790
            P K + S E   K  K T+SFTK +   +  +  L YLCRNG L +A  ALD++ + G K
Sbjct: 19   PAKVENSPEVHPKSRKKTLSFTKRNEPIIIPDEQLEYLCRNGSLLEAEKALDSMFQQGSK 78

Query: 2789 LKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVF 2610
            +K  TY++LL+SCIDS S+  GR+LH+R GL+  P+ F++TKLLSMYAKCG L DAR+VF
Sbjct: 79   VKRSTYLNLLESCIDSGSVHLGRILHSRFGLLPQPDVFLETKLLSMYAKCGCLVDARKVF 138

Query: 2609 DTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDA 2430
            D+M ERNL+TW+AMIG Y+RE RWKE+ +LF LMM  DG+ PD+FLLPKIL+ CAN GD 
Sbjct: 139  DSMRERNLYTWSAMIGAYSREHRWKEVSKLFRLMM-GDGVLPDDFLLPKILQGCANCGDV 197

Query: 2429 QTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGY 2250
            +TGKLIHS +I+ G+ SCL V+NS+L++YAKCGEL+ A  FF +M+ +D V WNS++  Y
Sbjct: 198  ETGKLIHSVVIKLGMTSCLRVSNSILAVYAKCGELSLATKFFRRMEERDVVAWNSVLLAY 257

Query: 2249 CQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDV 2070
            CQ+GK+E+A+ L ++M+ EG+ PGLVTWNILI  YNQ G CD AM+LM KM   G+T DV
Sbjct: 258  CQNGKHEEAVELVEEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMESFGVTADV 317

Query: 2069 FTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHAL 1890
            FTWT MISG   N +  QAL+ FR M L GV PNGVT+            LN G E+H++
Sbjct: 318  FTWTAMISGLIHNGKRYQALDTFRRMFLAGVVPNGVTIMSAVSACSCLKVLNLGSEVHSI 377

Query: 1889 GVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKA 1710
             VK+G M +VLVGNSL+DMYSKCGK E AR+VF+ +  KDV TWNS+I GY  A YCGKA
Sbjct: 378  AVKMGFMDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCHAEYCGKA 437

Query: 1709 YDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIA 1530
            Y+LF +MQ ++V PN++TWN M SGY++ GDE +AM LFQRME +G ++RNTASWNL+IA
Sbjct: 438  YELFTRMQDANVKPNIITWNTMISGYIKNGDEGEAMDLFQRMEKDGKVQRNTASWNLIIA 497

Query: 1529 GLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLES 1350
            G +QNG+K++AL +FR+MQ     PNS+T+LS+LPACANLL+ K V+EIHGCVLRRNL++
Sbjct: 498  GYIQNGKKDEALELFRKMQFSRFTPNSVTILSLLPACANLLATKMVREIHGCVLRRNLDA 557

Query: 1349 NVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMR 1170
              ++ N+L DTYAKSGDI  AR +F+ + ++D+I+WN+LI GYVLHG    +LDLFN+M+
Sbjct: 558  VHAVKNALTDTYAKSGDIAYARTIFKGMETKDIITWNSLIGGYVLHGRYGPALDLFNQMK 617

Query: 1169 LLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRF 990
              G  PNRGT +S ILA+ L   V+EGK+ FSS+  DY I+P LEH SAM++L GRS R 
Sbjct: 618  TQGIKPNRGTLSSIILAHGLMGNVDEGKKVFSSIADDYNIIPALEHCSAMISLYGRSNRL 677

Query: 989  KEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLY 810
            +EA +FI+EMN+Q ++ +W + LT CRIHG+I LAIHAAE L +LEPEN +   ++ Q+Y
Sbjct: 678  EEAVQFIQEMNVQSETPIWESFLTGCRIHGDIDLAIHAAEHLFSLEPENPITENVVSQIY 737

Query: 809  ALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIG 630
            ALG K   +   +K    N     LG S I V N  HTF TGD+S   +D +Y  ++   
Sbjct: 738  ALGAKLGRSLEGKKPRRDNLLKKPLGHSWIEVRNSIHTFTTGDKSQLCTDVLYPWVE--- 794

Query: 629  NEIKVVAPDSRETQ----LCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFR 462
               K+   D R  Q    L I+EE +E   GIHSEK A++F LI+S    ++IRI+KN R
Sbjct: 795  ---KLCRLDDRNDQYNGELLIEEEGREETCGIHSEKFAMAFGLISSSRAHKTIRILKNLR 851

Query: 461  MCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
            MCRDCH TAK +S  YG +I L D++C HHFKNG CSC+DYW
Sbjct: 852  MCRDCHNTAKYISRRYGCDILLEDTRCLHHFKNGDCSCKDYW 893


>ref|XP_006596427.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Glycine max]
          Length = 896

 Score =  983 bits (2540), Expect = 0.0
 Identities = 481/863 (55%), Positives = 641/863 (74%), Gaps = 1/863 (0%)
 Frame = -2

Query: 2921 HTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDS 2742
            ++VS T+ S   L  +  LN LC NG L +A+  LD++A+ G K++P T+++LLQ+CID 
Sbjct: 39   NSVSMTQRSHPKLV-DTQLNQLCANGSLSEAVAILDSLAQQGSKVRPITFMNLLQACIDK 97

Query: 2741 DSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIG 2562
            D I  GR LH RIGLV+  NPFV+TKL+SMYAKCG L++AR+VFD M ERNLFTW+AMIG
Sbjct: 98   DCILVGRELHTRIGLVRKVNPFVETKLVSMYAKCGHLDEARKVFDEMRERNLFTWSAMIG 157

Query: 2561 GYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLD 2382
              +R+ +W+E++ELF+  MM+ G+ PD+FLLPK+LKAC    D +TG+LIHS +IR G+ 
Sbjct: 158  ACSRDLKWEEVVELFY-DMMQHGVLPDDFLLPKVLKACGKFRDIETGRLIHSLVIRGGMC 216

Query: 2381 SCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQM 2202
            S LHVNNS+L++YAKCGE++ A+  F +MD ++ V+WN II+GYCQ G+ E+A + FD M
Sbjct: 217  SSLHVNNSILAVYAKCGEMSCAEKIFRRMDERNCVSWNVIITGYCQRGEIEQAQKYFDAM 276

Query: 2201 QAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRI 2022
            Q EG+EPGLVTWNILIASY+Q G+CD+AM+LM KM   GITPDV+TWT MISGF Q  RI
Sbjct: 277  QEEGMEPGLVTWNILIASYSQLGHCDIAMDLMRKMESFGITPDVYTWTSMISGFTQKGRI 336

Query: 2021 NQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSL 1842
            N+A +L R+M++VGVEPN +T+            L+ G E+H++ VK   + ++L+GNSL
Sbjct: 337  NEAFDLLRDMLIVGVEPNSITIASAASACASVKSLSMGSEIHSIAVKTSMVDDILIGNSL 396

Query: 1841 IDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNV 1662
            IDMY+K G  E A+ +F+++LE+DV +WNS+IGGY QAG+CGKA++LFMKMQ SD PPNV
Sbjct: 397  IDMYAKGGDLEAAQSIFDVMLERDVYSWNSIIGGYCQAGFCGKAHELFMKMQESDSPPNV 456

Query: 1661 VTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFR 1482
            VTWNVM +G++Q GDED+A+ LF R+E +G +K N ASWN LI+G LQN QK+KAL IFR
Sbjct: 457  VTWNVMITGFMQNGDEDEALNLFLRIEKDGKIKPNVASWNSLISGFLQNRQKDKALQIFR 516

Query: 1481 QMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSG 1302
            QMQ     PN +T+L+ILPAC NL++AKKVKEIH C  RRNL S +S+ N+ ID+YAKSG
Sbjct: 517  QMQFSNMAPNLVTVLTILPACTNLVAAKKVKEIHCCATRRNLVSELSVSNTFIDSYAKSG 576

Query: 1301 DIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTIL 1122
            +I+ +R +F+ L  +D+ISWN+L++GYVLHG    +LDLF++MR  G  P+R T  S I 
Sbjct: 577  NIMYSRKVFDGLSPKDIISWNSLLSGYVLHGCSESALDLFDQMRKDGLHPSRVTLTSIIS 636

Query: 1121 AYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDS 942
            AYS A+MV+EGK  FS+++++Y+I   LEHYSAMV LLGRSG+  +A EFI+ M ++P+S
Sbjct: 637  AYSHAEMVDEGKHAFSNISEEYQIRLDLEHYSAMVYLLGRSGKLAKALEFIQNMPVEPNS 696

Query: 941  TVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKTI 762
            +VW ALLTACRIH N G+AI A E ++ L+PEN +   LL Q Y++ GKS +A +M K  
Sbjct: 697  SVWAALLTACRIHKNFGMAIFAGEHMLELDPENIITQHLLSQAYSVCGKSWEAQKMTKLE 756

Query: 761  NRNGTANSLGCSRITVNNKEHTFMTG-DRSMPNSDSIYARIDSIGNEIKVVAPDSRETQL 585
                    +G S I +NN  HTF+ G D+S+P  D I++ +  +G  +K    D+    L
Sbjct: 757  KEKFVKMPVGQSWIEMNNMVHTFVVGDDQSIPYLDKIHSWLKRVGENVKAHISDN---GL 813

Query: 584  CIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGRE 405
             I+EEEKENIG +HSEKLA +F LI   +T Q +RI+KN RMCRDCH TAK +SL YG E
Sbjct: 814  RIEEEEKENIGSVHSEKLAFAFGLIDFHHTPQILRIVKNLRMCRDCHDTAKYISLAYGCE 873

Query: 404  IYLYDSKCFHHFKNGQCSCRDYW 336
            IYL DS C HHFK+G CSCRDYW
Sbjct: 874  IYLSDSNCLHHFKDGHCSCRDYW 896


>ref|XP_006575412.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            isoform X1 [Glycine max] gi|571441335|ref|XP_006575413.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g19720-like isoform X2 [Glycine max]
          Length = 896

 Score =  977 bits (2526), Expect = 0.0
 Identities = 484/892 (54%), Positives = 656/892 (73%), Gaps = 6/892 (0%)
 Frame = -2

Query: 2993 SKP-PPVADPWKQQVSTEF--SSKPI--KHTVSFTKNSATPLPSEIHLNYLCRNGQLKQA 2829
            SKP PP+  P    +  E+  S++ +   ++VS T+ S  P   +  LN LC NG L +A
Sbjct: 10   SKPWPPLFIPSHCSIQLEWHGSTRVLANSNSVSITQRS-NPKLIDTQLNQLCANGPLSEA 68

Query: 2828 ITALDTIAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMY 2649
            +  LD++A+ G K++P T+++LLQ+CID D I  GR LHARIGLV   NPFV+TKL+SMY
Sbjct: 69   VAILDSLAQQGSKVRPITFMNLLQACIDKDCILVGRELHARIGLVGKVNPFVETKLVSMY 128

Query: 2648 AKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLL 2469
            AKCG L++A +VFD M ERNLFTW+AMIG  +R+ +W+E+++LF+  MM+ G+ PDEFLL
Sbjct: 129  AKCGHLDEAWKVFDEMRERNLFTWSAMIGACSRDLKWEEVVKLFY-DMMQHGVLPDEFLL 187

Query: 2468 PKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDN 2289
            PK+LKAC    D +TG+LIHS  IR G+ S LHVNNS+L++YAKCGE++ A+ FF +MD 
Sbjct: 188  PKVLKACGKCRDIETGRLIHSVAIRGGMCSSLHVNNSILAVYAKCGEMSCAEKFFRRMDE 247

Query: 2288 KDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMEL 2109
            ++ ++WN II+GYCQ G+ E+A + FD M+ EG++PGLVTWNILIASY+Q G+CD+AM+L
Sbjct: 248  RNCISWNVIITGYCQRGEIEQAQKYFDAMREEGMKPGLVTWNILIASYSQLGHCDIAMDL 307

Query: 2108 MNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXX 1929
            + KM   GITPDV+TWT MISGF+Q  RIN+A +L R+M++VGVEPN +T+         
Sbjct: 308  IRKMESFGITPDVYTWTSMISGFSQKGRINEAFDLLRDMLIVGVEPNSITIASAASACAS 367

Query: 1928 XXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSL 1749
               L+ G E+H++ VK   +G++L+ NSLIDMY+K G  E A+ +F+++L++DV +WNS+
Sbjct: 368  VKSLSMGSEIHSIAVKTSLVGDILIANSLIDMYAKGGNLEAAQSIFDVMLQRDVYSWNSI 427

Query: 1748 IGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGI 1569
            IGGY QAG+CGKA++LFMKMQ SD PPNVVTWNVM +G++Q GDED+A+ LFQR+E +G 
Sbjct: 428  IGGYCQAGFCGKAHELFMKMQESDSPPNVVTWNVMITGFMQNGDEDEALNLFQRIENDGK 487

Query: 1568 LKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVK 1389
            +K N ASWN LI+G LQN QK+KAL IFR+MQ     PN +T+L+ILPAC NL++AKKVK
Sbjct: 488  IKPNVASWNSLISGFLQNRQKDKALQIFRRMQFSNMAPNLVTVLTILPACTNLVAAKKVK 547

Query: 1388 EIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHG 1209
            EIH C +RRNL S +S+ N+ ID+YAKSG+I+ +R +F+ L  +D+ISWN+L++GYVLHG
Sbjct: 548  EIHCCAIRRNLVSELSVSNTFIDSYAKSGNIMYSRKVFDGLSPKDIISWNSLLSGYVLHG 607

Query: 1208 YPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHY 1029
                +LDLF++MR  G  PNR T  S I AYS A MV+EGK  FS+++++Y+I   LEHY
Sbjct: 608  CSESALDLFDQMRKDGVHPNRVTLTSIISAYSHAGMVDEGKHAFSNISEEYQIRLDLEHY 667

Query: 1028 SAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEP 849
            SAMV LLGRSG+  +A EFI+ M ++P+S+VW AL+TACRIH N G+AI A E++  L+P
Sbjct: 668  SAMVYLLGRSGKLAKALEFIQNMPVEPNSSVWAALMTACRIHKNFGMAIFAGERMHELDP 727

Query: 848  ENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTG-DRSM 672
            EN +   LL Q Y++ GKS +A +M K          +G S I +NN  HTF+ G D+S 
Sbjct: 728  ENIITQHLLSQAYSVCGKSLEAPKMTKLEKEKFVNIPVGQSWIEMNNMVHTFVVGDDQST 787

Query: 671  PNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIASPYTS 492
            P  D +++ +  +G  +K    D+    LCI+EEEKENI  +HSEKLA +F LI S +T 
Sbjct: 788  PYLDKLHSWLKRVGANVKAHISDN---GLCIEEEEKENISSVHSEKLAFAFGLIDSHHTP 844

Query: 491  QSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
            Q +RI+KN RMCRDCH +AK +SL YG EIYL DS C HHFK+G CSCRDYW
Sbjct: 845  QILRIVKNLRMCRDCHDSAKYISLAYGCEIYLSDSNCLHHFKDGHCSCRDYW 896


>ref|XP_006341986.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Solanum tuberosum]
          Length = 884

 Score =  973 bits (2514), Expect = 0.0
 Identities = 484/864 (56%), Positives = 632/864 (73%), Gaps = 1/864 (0%)
 Frame = -2

Query: 2924 KHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCID 2745
            K  ++F  N+     ++ HL+YLC+ G+L +AIT L++I+++GYK+K  T+  L++SCI+
Sbjct: 27   KVPINFVPNTEQSRFTDTHLDYLCKKGRLSEAITTLESISQYGYKVKTETFSRLIESCIN 86

Query: 2744 SDSIEQGRMLHARIG-LVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAM 2568
              S+  GR LH  +  L+   +PF++TKLL MY+KCG L++A  +FD M +R+LF W+AM
Sbjct: 87   EKSLYLGRKLHKEMNFLLAKVDPFIETKLLGMYSKCGSLQEAYEMFDKMRKRDLFAWSAM 146

Query: 2567 IGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSG 2388
            IG  +R+ RW E++ELF+ MMM DG+ PD FL PKIL+ACAN GD +TG LIHS  IR G
Sbjct: 147  IGACSRDCRWSEVMELFY-MMMGDGVVPDSFLFPKILQACANCGDVETGILIHSIAIRCG 205

Query: 2387 LDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFD 2208
            + S + VNNSLL++YAKCG L+ AK  FE  + +D V+WNSII  YC  G   +A RL +
Sbjct: 206  MISEIRVNNSLLAVYAKCGLLDCAKRIFESTEMRDTVSWNSIIMAYCHKGDIVEARRLLN 265

Query: 2207 QMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNN 2028
             M+ EGVEPGL+TWNILIASYNQ G CD A+E+M +M G GI PDVFTWTC+ISG +Q+N
Sbjct: 266  LMRLEGVEPGLITWNILIASYNQLGRCDEALEVMKEMEGNGIMPDVFTWTCLISGMSQHN 325

Query: 2027 RINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGN 1848
            R ++ALELFREM+L GV P+ VT+            L KG+ELH+L VKLG  G V+VGN
Sbjct: 326  RNSRALELFREMILNGVTPSEVTLTSTVSACASLKDLRKGRELHSLVVKLGFDGGVIVGN 385

Query: 1847 SLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPP 1668
            +L+D+YSKCGK E AR VF+MI EKDV +WNSLIGGY QAG CGKAYDLFMKM   DV P
Sbjct: 386  ALVDLYSKCGKLEAARLVFDMIPEKDVYSWNSLIGGYCQAGCCGKAYDLFMKMHEFDVSP 445

Query: 1667 NVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGI 1488
            NV+TWNV+ +G++Q GDEDQA+ LF RME +G ++R+ ASWN LIAG L NGQK+KALGI
Sbjct: 446  NVITWNVLITGHMQNGDEDQALDLFWRMEKDGNVERDAASWNALIAGYLHNGQKDKALGI 505

Query: 1487 FRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAK 1308
            FR+MQ    KPN++T+LSILPACANL+ AKKVKEIH CVLR NLE+ +SI NSLIDTY+K
Sbjct: 506  FRKMQSFGFKPNTVTILSILPACANLIGAKKVKEIHCCVLRCNLENELSIANSLIDTYSK 565

Query: 1307 SGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFAST 1128
            SG +  ++ +F+ + ++D+ISWNTLIAGYVLHG+ + +  LF++M   G  PNRGTF+S 
Sbjct: 566  SGGLQYSKTIFDGMSTKDIISWNTLIAGYVLHGFSSEATKLFHQMEEAGLKPNRGTFSSM 625

Query: 1127 ILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQP 948
            I +Y LAKMV EGK+ FSSM ++Y I+PGLEHY AMV L GRSG+ +EA +FI+ M ++ 
Sbjct: 626  ISSYGLAKMVEEGKRMFSSMYEEYRIVPGLEHYVAMVTLYGRSGKLEEAIDFIDNMTMEH 685

Query: 947  DSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRK 768
            D ++W ALLTA R+HGN+ LAIHA EQL+ L+P N ++H+LLLQL  L G SE++  + +
Sbjct: 686  DISIWGALLTASRVHGNLNLAIHAGEQLLKLDPGNVVIHQLLLQLNVLRGISEESVTVMR 745

Query: 767  TINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQ 588
               RN     L  S   +NN  H F +G +S       + +      E+K+    S   +
Sbjct: 746  PRKRNHHEEPLSWSWTEINNVVHAFASGQQSNSEVPDSWIK----RKEVKMEGSSSC-NR 800

Query: 587  LCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGR 408
            LCI EEE E+I  +HSEKLA+SFALI SP +S+ IRI+KN RMC DCH+ AKLVS  Y R
Sbjct: 801  LCIKEEENEDITRVHSEKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYER 860

Query: 407  EIYLYDSKCFHHFKNGQCSCRDYW 336
            EIY++DSKC HHFK+G CSC +YW
Sbjct: 861  EIYIHDSKCLHHFKDGYCSCGNYW 884


>ref|XP_004238610.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Solanum lycopersicum]
          Length = 884

 Score =  969 bits (2506), Expect = 0.0
 Identities = 484/864 (56%), Positives = 628/864 (72%), Gaps = 1/864 (0%)
 Frame = -2

Query: 2924 KHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCID 2745
            K  ++F  N+     ++ HL+YLC+NG+L +AIT L++I+++GYK+K  T+  L++SCI+
Sbjct: 27   KVPINFVPNTEESRLTDTHLDYLCKNGRLSEAITTLESISQYGYKVKTETFSRLIESCIN 86

Query: 2744 SDSIEQGRMLHARIG-LVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAM 2568
              S+  GR LH  +  L++  +PF++TKLL MY+KCG L++A  +FD M +R+LF W+AM
Sbjct: 87   EKSLYLGRKLHKEMNILLEKVDPFIETKLLGMYSKCGSLQEAYEMFDKMRKRDLFAWSAM 146

Query: 2567 IGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSG 2388
            IG  +R+ RW E++ELF+ MMM DG+ PD FL P+IL+A AN GD +TG LIHS  IR G
Sbjct: 147  IGACSRDSRWSEVMELFY-MMMGDGVVPDSFLFPRILQASANCGDVETGMLIHSIAIRCG 205

Query: 2387 LDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFD 2208
            + S + VNNSLL++YAKCG L  AK  FE M+ +D V+WNS+I  YC  G    A RL +
Sbjct: 206  MSSEIRVNNSLLAVYAKCGLLGCAKRIFESMEMRDTVSWNSMIMAYCHKGDIVVARRLLN 265

Query: 2207 QMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNN 2028
             M  EGVEPGL+TWNILIASYNQ G CD A+E+M +M G GI PDVFTWT +ISG +Q+N
Sbjct: 266  LMPLEGVEPGLITWNILIASYNQLGRCDEALEVMKEMEGNGIMPDVFTWTSLISGMSQHN 325

Query: 2027 RINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGN 1848
            R +QALELFREM+L GV P+ VT+            L KGKELH+L VKLG  G V+VGN
Sbjct: 326  RNSQALELFREMILNGVTPSEVTLTSTVSACASLKDLRKGKELHSLVVKLGFDGGVIVGN 385

Query: 1847 SLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPP 1668
            +L+D+YSKCGK E AR+VF+MI EKDV +WNSLIGGY QAG CGKAYDLFMKM    V P
Sbjct: 386  ALVDLYSKCGKLEAARQVFDMIPEKDVYSWNSLIGGYCQAGCCGKAYDLFMKMHEFAVSP 445

Query: 1667 NVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGI 1488
            NV+TWNV+ +G++Q GDEDQA+ LF RME +G ++R+ ASWN LIAG L NGQK+KALGI
Sbjct: 446  NVITWNVLITGHMQNGDEDQALDLFWRMEKDGNVERDAASWNALIAGYLHNGQKDKALGI 505

Query: 1487 FRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAK 1308
            FR+MQ    KPN++T+LSILPACANL+ AKKVKEIH CVLR NLE+ +SI NSLIDTY+K
Sbjct: 506  FRKMQSSGLKPNTVTILSILPACANLIGAKKVKEIHCCVLRCNLENELSIANSLIDTYSK 565

Query: 1307 SGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFAST 1128
            SG +  ++ +F+ + ++D+ISWNTLIAGYVLHG+ + S  LF++M   G  PNRGTF+S 
Sbjct: 566  SGGLQYSKTIFDVMSTKDIISWNTLIAGYVLHGFSSESTKLFHQMEEAGLKPNRGTFSSV 625

Query: 1127 ILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQP 948
            IL+Y LAKMV EGK+ FSSM++ Y I+PGLEH  AMV L GRSG+ +EA  FI+ M ++ 
Sbjct: 626  ILSYGLAKMVEEGKRMFSSMSEKYRIVPGLEHCVAMVNLYGRSGKLEEAINFIDNMTMEH 685

Query: 947  DSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRK 768
            D ++W ALLTA R+HGN+ LAIHA EQL  L+P N ++H+LLLQLY L G SE++  + +
Sbjct: 686  DISIWGALLTASRVHGNLNLAIHAGEQLFKLDPGNVVIHQLLLQLYVLRGISEESETVMR 745

Query: 767  TINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQ 588
               RN     L  S   +NN  H F +G +        + +      E+K+    S   +
Sbjct: 746  PRKRNHHEEPLSWSWTEINNVVHAFASGQQCNSEVPDSWIK----RKEVKMEGSSSC-NR 800

Query: 587  LCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGR 408
            LCI EEE E+I  +HSEKLA+SFALI SP +S+ IRI+KN RMC DCH+ AKLVS  Y R
Sbjct: 801  LCIKEEENEDITRVHSEKLALSFALINSPQSSRVIRIVKNLRMCEDCHRIAKLVSQKYER 860

Query: 407  EIYLYDSKCFHHFKNGQCSCRDYW 336
            EIY++DSKC HHFK+G CSC +YW
Sbjct: 861  EIYIHDSKCLHHFKDGYCSCGNYW 884


>ref|NP_173402.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75263158|sp|Q9FXH1.1|PPR52_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g19720; AltName: Full=Protein DYW7
            gi|10086495|gb|AAG12555.1|AC007797_15 Unknown Protein
            [Arabidopsis thaliana] gi|332191770|gb|AEE29891.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 894

 Score =  964 bits (2491), Expect = 0.0
 Identities = 477/879 (54%), Positives = 626/879 (71%), Gaps = 1/879 (0%)
 Frame = -2

Query: 2969 PWKQQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYK 2790
            P K + S E   K  K  +SFTK     +  +   +YLCRNG L +A  ALD++ + G K
Sbjct: 19   PAKVENSPELHPKSRKKNLSFTKKKEPNIIPDEQFDYLCRNGSLLEAEKALDSLFQQGSK 78

Query: 2789 LKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVF 2610
            +K  TY+ LL+SCIDS SI  GR+LHAR GL  +P+ FV+TKLLSMYAKCG + DAR+VF
Sbjct: 79   VKRSTYLKLLESCIDSGSIHLGRILHARFGLFTEPDVFVETKLLSMYAKCGCIADARKVF 138

Query: 2609 DTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDA 2430
            D+M ERNLFTW+AMIG Y+RE RW+E+ +LF LMM +DG+ PD+FL PKIL+ CAN GD 
Sbjct: 139  DSMRERNLFTWSAMIGAYSRENRWREVAKLFRLMM-KDGVLPDDFLFPKILQGCANCGDV 197

Query: 2429 QTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGY 2250
            + GK+IHS +I+ G+ SCL V+NS+L++YAKCGEL+ A  FF +M  +D + WNS++  Y
Sbjct: 198  EAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGELDFATKFFRRMRERDVIAWNSVLLAY 257

Query: 2249 CQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDV 2070
            CQ+GK+E+A+ L  +M+ EG+ PGLVTWNILI  YNQ G CD AM+LM KM   GIT DV
Sbjct: 258  CQNGKHEEAVELVKEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMETFGITADV 317

Query: 2069 FTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHAL 1890
            FTWT MISG   N    QAL++FR+M L GV PN VT+            +N+G E+H++
Sbjct: 318  FTWTAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSCLKVINQGSEVHSI 377

Query: 1889 GVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKA 1710
             VK+G + +VLVGNSL+DMYSKCGK E AR+VF+ +  KDV TWNS+I GY QAGYCGKA
Sbjct: 378  AVKMGFIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKA 437

Query: 1709 YDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIA 1530
            Y+LF +MQ +++ PN++TWN M SGY++ GDE +AM LFQRME +G ++RNTA+WNL+IA
Sbjct: 438  YELFTRMQDANLRPNIITWNTMISGYIKNGDEGEAMDLFQRMEKDGKVQRNTATWNLIIA 497

Query: 1529 GLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLES 1350
            G +QNG+K++AL +FR+MQ     PNS+T+LS+LPACANLL AK V+EIHGCVLRRNL++
Sbjct: 498  GYIQNGKKDEALELFRKMQFSRFMPNSVTILSLLPACANLLGAKMVREIHGCVLRRNLDA 557

Query: 1349 NVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMR 1170
              ++ N+L DTYAKSGDI  +R +F  + ++D+I+WN+LI GYVLHG    +L LFN+M+
Sbjct: 558  IHAVKNALTDTYAKSGDIEYSRTIFLGMETKDIITWNSLIGGYVLHGSYGPALALFNQMK 617

Query: 1169 LLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRF 990
              G  PNRGT +S ILA+ L   V+EGK+ F S+  DY I+P LEH SAMV L GR+ R 
Sbjct: 618  TQGITPNRGTLSSIILAHGLMGNVDEGKKVFYSIANDYHIIPALEHCSAMVYLYGRANRL 677

Query: 989  KEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLY 810
            +EA +FI+EMNIQ ++ +W + LT CRIHG+I +AIHAAE L +LEPEN     ++ Q+Y
Sbjct: 678  EEALQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTATESIVSQIY 737

Query: 809  ALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIG 630
            ALG K   +    K    N     LG S I V N  HTF TGD+S   +D +Y  ++ + 
Sbjct: 738  ALGAKLGRSLEGNKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDVLYPLVEKMS 797

Query: 629  NEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIASPYTSQ-SIRIIKNFRMCR 453
                    D    +L I+EE +E   GIHSEK A++F LI+S   S+ +IRI+KN RMCR
Sbjct: 798  RLDN--RSDQYNGELWIEEEGREETCGIHSEKFAMAFGLISSSGASKTTIRILKNLRMCR 855

Query: 452  DCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
            DCH TAK VS  YG +I L D++C HHFKNG CSC+DYW
Sbjct: 856  DCHDTAKYVSKRYGCDILLEDTRCLHHFKNGDCSCKDYW 894


>ref|XP_004152769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Cucumis sativus]
          Length = 1463

 Score =  957 bits (2474), Expect = 0.0
 Identities = 481/860 (55%), Positives = 627/860 (72%), Gaps = 3/860 (0%)
 Frame = -2

Query: 2984 PPVADPWK--QQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDT 2811
            PP++ P    +    +FSSKPIK ++ FT    +    + HL+YLC NG L++AITA+D+
Sbjct: 12   PPISGPASVIKPRPLKFSSKPIKTSIFFTYKLTSKFNDD-HLSYLCSNGLLREAITAIDS 70

Query: 2810 IAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGL 2631
            I+K G KL   TYI+LLQ+CID  SIE GR LH R+GLV   NPFV+TKL+SMYAKCG L
Sbjct: 71   ISKRGSKLSTNTYINLLQTCIDVGSIELGRELHVRMGLVHRVNPFVETKLVSMYAKCGCL 130

Query: 2630 EDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKA 2451
            +DAR+VFD M ERNL+TW+AMIG Y+REQRWKE++ELFFLMM  DG+ PD FL PKIL+A
Sbjct: 131  KDARKVFDGMQERNLYTWSAMIGAYSREQRWKEVVELFFLMM-GDGVLPDAFLFPKILQA 189

Query: 2450 CANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTW 2271
            C N  D +T KLIHS +IR GL   + ++NS+L+ + KCG+L+ A+ FF  MD +D V+W
Sbjct: 190  CGNCEDLETVKLIHSLVIRCGLSCYMRLSNSILTAFVKCGKLSLARKFFGNMDERDGVSW 249

Query: 2270 NSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNG 2091
            N +I+GYCQ G  ++A RL D M  +G +PGLVT+NI+IASY+Q G+CDL ++L  KM  
Sbjct: 250  NVMIAGYCQKGNGDEARRLLDTMSNQGFKPGLVTYNIMIASYSQLGDCDLVIDLKKKMES 309

Query: 2090 RGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNK 1911
             G+ PDV+TWT MISGF+Q++RI+QAL+ F++M+L GVEPN +T+            L  
Sbjct: 310  VGLAPDVYTWTSMISGFSQSSRISQALDFFKKMILAGVEPNTITIASATSACASLKSLQN 369

Query: 1910 GKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQ 1731
            G E+H   +K+G     LVGNSLIDMYSKCGK E AR VF+ ILEKDV TWNS+IGGY Q
Sbjct: 370  GLEIHCFAIKMGIARETLVGNSLIDMYSKCGKLEAARHVFDTILEKDVYTWNSMIGGYCQ 429

Query: 1730 AGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTA 1551
            AGY GKAY+LFM+++ S V PNVVTWN M SG +Q GDEDQAM LFQ ME +G +KRNTA
Sbjct: 430  AGYGGKAYELFMRLRESTVMPNVVTWNAMISGCIQNGDEDQAMDLFQIMEKDGGVKRNTA 489

Query: 1550 SWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCV 1371
            SWN LIAG  Q G+KNKAL IFRQMQ L   PNS+T+LSILPACAN+++ KK+KEIHGCV
Sbjct: 490  SWNSLIAGYHQLGEKNKALAIFRQMQSLNFSPNSVTILSILPACANVMAEKKIKEIHGCV 549

Query: 1370 LRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISL 1191
            LRRNLES +++ NSL+DTYAKSG+I  +R +F  + S+D+I+WN++IAGY+LHG  + + 
Sbjct: 550  LRRNLESELAVANSLVDTYAKSGNIKYSRTVFNGMSSKDIITWNSIIAGYILHGCSDSAF 609

Query: 1190 DLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVAL 1011
             LF++MR LG  PNRGT AS I AY +A MV++G+  FSS+T++++ILP L+HY AMV L
Sbjct: 610  QLFDQMRNLGIRPNRGTLASIIHAYGIAGMVDKGRHVFSSITEEHQILPTLDHYLAMVDL 669

Query: 1010 LGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVH 831
             GRSGR  +A EFIE+M I+PD ++WT+LLTACR HGN+ LA+ AA++L  LEP+N++++
Sbjct: 670  YGRSGRLADAIEFIEDMPIEPDVSIWTSLLTACRFHGNLNLAVLAAKRLHELEPDNHVIY 729

Query: 830  RLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIY 651
            RLL+Q YAL GK E   ++RK    +          + V NK H F+TGD+S    D + 
Sbjct: 730  RLLVQAYALYGKFEQTLKVRKLGKESAMKKCTAQCWVEVRNKVHLFVTGDQS--KLDVLN 787

Query: 650  ARIDSIGNEIKVVAPDSRETQLCIDEEEK-ENIGGIHSEKLAISFALIASPYTSQSIRII 474
              I SI  ++K     +   QL I+EEEK E IGG H EK A +F LI S +T +SI+I+
Sbjct: 788  TWIKSIEGKVKKF---NNHHQLSIEEEEKEEKIGGFHCEKFAFAFGLIGSSHTRKSIKIV 844

Query: 473  KNFRMCRDCHKTAKLVSLIY 414
            KN RMC DCH+ AK +S  Y
Sbjct: 845  KNLRMCVDCHQMAKYISAAY 864


>ref|XP_006386200.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344175|gb|ERP63997.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 810

 Score =  954 bits (2466), Expect = 0.0
 Identities = 482/815 (59%), Positives = 605/815 (74%), Gaps = 3/815 (0%)
 Frame = -2

Query: 2771 ISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLER 2592
            ++LLQSCID++SI  GR  HARI +VQ+ +P ++TKL+SMYAKCG L DAR+VFD M ER
Sbjct: 1    MNLLQSCIDTNSINLGRKFHARISVVQEKSPVIETKLVSMYAKCGYLRDARKVFDEMSER 60

Query: 2591 NLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLI 2412
            +LFTW+AMIG   RE+RWKE++EL++ MMM+D + PD FLLPKIL+A  N  D +TG+L+
Sbjct: 61   SLFTWSAMIGACCREKRWKEVVELYY-MMMKDNVLPDGFLLPKILQAVGNCRDVKTGELL 119

Query: 2411 HSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKN 2232
            HSF++R G+ S   VNNS+L++Y+KCG+L+ A+ FFE MD +D V WN+++SGYC  G+ 
Sbjct: 120  HSFVVRCGMGSSPRVNNSILAVYSKCGKLSLARRFFESMDERDIVAWNAMMSGYCLKGEV 179

Query: 2231 EKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCM 2052
            E+A RLFD M  EG+EPGLVTWNILIA YNQ G CD+AM LM KM   G++PDV  WT M
Sbjct: 180  EEAHRLFDAMCEEGIEPGLVTWNILIAGYNQKGQCDVAMNLMKKMVSFGVSPDVVAWTSM 239

Query: 2051 ISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGT 1872
            ISGFAQNNR  QAL+L++EM+L GVEPNGVT+            LN G  +H+L VK+  
Sbjct: 240  ISGFAQNNRNGQALDLYKEMILAGVEPNGVTITSALSACASLKVLNTGLGIHSLAVKMSF 299

Query: 1871 MGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMK 1692
            + +VLVGNSLIDMYSKCG+   A+ VF+++ EKD+ TWNS+IGGY QAGYCGKAY LF K
Sbjct: 300  VNDVLVGNSLIDMYSKCGQLGAAQLVFDLMSEKDLYTWNSMIGGYCQAGYCGKAYVLFTK 359

Query: 1691 MQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNG 1512
            MQ S V PNVVTWN M SGY+Q GDEDQAM LF RME EG +KR+ ASWN LIAG +Q  
Sbjct: 360  MQKSQVQPNVVTWNTMISGYIQSGDEDQAMDLFHRMEKEGEIKRDNASWNSLIAGFMQIR 419

Query: 1511 QKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVN 1332
            +K+KALGIFRQMQ  C  PN +T+LS+LPACA+L++ KKVKEIHGCVLRRNL S +SI N
Sbjct: 420  KKDKALGIFRQMQSFCISPNPVTILSMLPACASLVALKKVKEIHGCVLRRNLVSVLSISN 479

Query: 1331 SLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLP 1152
            SLIDTYAKSG I  +RA+F+ + S+D I+ N++I GYVLHG  + +L L ++MR LG  P
Sbjct: 480  SLIDTYAKSGKIEYSRAIFDRIPSKDFITVNSMITGYVLHGCSDSALGLLDQMRELGLKP 539

Query: 1151 NRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEF 972
            NRGT  + ILA+SLA MV+EG+Q FSSMT+D++I+P  EHY+AMV L GRSGR KEA E 
Sbjct: 540  NRGTLVNIILAHSLAGMVDEGRQVFSSMTEDFQIIPASEHYAAMVDLYGRSGRLKEAIEL 599

Query: 971  IEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKS 792
            I+ M I+P S+VW ALLTACR HGN  LAI A E L+ LEP N  +H+ +LQ YA+ GK 
Sbjct: 600  IDNMPIKPQSSVWYALLTACRNHGNSDLAIRARENLLDLEPWNSSIHQSILQSYAMHGKY 659

Query: 791  EDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVV 612
            EDA +++K   RN      G S I VNN  H+F+ GD+S   SD +++ ++ I  E KV 
Sbjct: 660  EDAPKVKKLEKRNEVQKPKGQSWIEVNNTVHSFVAGDQSTSYSD-LFSWVERISMEAKV- 717

Query: 611  APDSRETQLCI---DEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHK 441
                     CI   +EEEKE I GIHSEKLA++FA+I SP   QSIRI+KN R C DCH+
Sbjct: 718  --HDLHCGCCIEEEEEEEKEEIVGIHSEKLALAFAIIRSPSAPQSIRIVKNLRTCADCHR 775

Query: 440  TAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
             AK +S  +G EIYL DS  FHHFK+G CSC DYW
Sbjct: 776  MAKYISAKHGCEIYLSDSNFFHHFKSGCCSCGDYW 810


>ref|XP_007142200.1| hypothetical protein PHAVU_008G260600g [Phaseolus vulgaris]
            gi|561015333|gb|ESW14194.1| hypothetical protein
            PHAVU_008G260600g [Phaseolus vulgaris]
          Length = 893

 Score =  942 bits (2436), Expect = 0.0
 Identities = 473/863 (54%), Positives = 628/863 (72%), Gaps = 1/863 (0%)
 Frame = -2

Query: 2921 HTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDS 2742
            ++VS T N + P   +  LN LC NG L +A+  LD++A+ G K++P T+I+LLQ+CID 
Sbjct: 39   NSVSMT-NLSHPKLIDTQLNELCVNGHLSEAVGILDSLAQQGSKVRPITFINLLQACIDR 97

Query: 2741 DSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIG 2562
            D I  GR LHAR+GLV+  NPFV+TKL+SMYAKCG LE+AR+VFD M ERNLFTW+AMIG
Sbjct: 98   DCIWVGRELHARVGLVRKVNPFVETKLVSMYAKCGLLEEARKVFDEMHERNLFTWSAMIG 157

Query: 2561 GYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLD 2382
              +R+ +W E++ELF+  MM+ G+ PD+FLLPKILKAC      + G+LIHS +IR G  
Sbjct: 158  ACSRDLKWDEVVELFY-NMMQHGVLPDDFLLPKILKACGKCRAFEAGRLIHSMVIRRGRC 216

Query: 2381 SCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQM 2202
            S L V NS+L++YAKCGE+  A+  F +M+ ++ V+WN II+GYCQ G+ E+A + FD M
Sbjct: 217  SSLRVINSILAVYAKCGEMTYAEKLFRRMEERNYVSWNVIITGYCQKGEIEEARKYFDAM 276

Query: 2201 QAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRI 2022
            Q EG++PGLVTWNILIASY+Q G  ++A++LM  M   GITPDV+TWT +ISGF Q  RI
Sbjct: 277  QGEGIDPGLVTWNILIASYSQCGQSEIAIDLMRMMESFGITPDVYTWTSLISGFTQKGRI 336

Query: 2021 NQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSL 1842
            N A +L REM +VGVEPN +T+            L+ G E+H++ VK   + ++L+GNSL
Sbjct: 337  NDAFDLLREMFIVGVEPNSITIASAVSACASVKSLSMGSEVHSIAVKTSLVDDMLIGNSL 396

Query: 1841 IDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNV 1662
            IDMY+K G  E A+R+F+++L++DV +WNS+IGGY QAG+CGKA++LFMKMQ SD PPNV
Sbjct: 397  IDMYAKGGNLEAAQRIFDVMLKRDVYSWNSIIGGYCQAGFCGKAHELFMKMQESDSPPNV 456

Query: 1661 VTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFR 1482
            VTWNVM +G++Q G ED+A+ LFQR+E +G +K N ASWN LI+G LQ+ QK KAL IFR
Sbjct: 457  VTWNVMITGFMQNGAEDEALDLFQRIEKDGNIKPNVASWNSLISGFLQSRQKEKALQIFR 516

Query: 1481 QMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSG 1302
            +MQ     PN +T+L+ILPACANL++AKKVKEIH C +RRNL S + + N+ ID YAKSG
Sbjct: 517  RMQFSNMAPNLVTVLTILPACANLVAAKKVKEIHCCAIRRNLVSELYVSNTFIDNYAKSG 576

Query: 1301 DIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFL-PNRGTFASTI 1125
            +I+ +R +F+ L  +D+ISWN+L++GYVLHG    +LDLF++M     L PNR T AS I
Sbjct: 577  NIMYSRKVFDGLSPKDIISWNSLLSGYVLHGSSESALDLFDQMNKDDRLHPNRVTLASII 636

Query: 1124 LAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPD 945
             AYS A MV+EGK  FS+M++D++I+  LEHYSAMV LLGRSG+  EA EFI  M I+P+
Sbjct: 637  SAYSHAGMVDEGKHAFSNMSEDFKIILDLEHYSAMVYLLGRSGKLAEAQEFILNMPIEPN 696

Query: 944  STVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKT 765
             +VWTA LTACRIH N G+AI A E+L+ L+PEN +   LL Q Y+L GK  +A +M K 
Sbjct: 697  ISVWTAFLTACRIHRNFGMAIFAGERLLELDPENIITQHLLSQAYSLCGKYWEAPKMTKL 756

Query: 764  INRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQL 585
                     +G S I +NN  HTF+ GD+S P  D +++ +  +   +K    D+    L
Sbjct: 757  EKEK---IPVGQSWIEMNNMVHTFVVGDQSKPYLDKLHSWLKRVHVNVKAHISDN---GL 810

Query: 584  CIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGRE 405
            CI+EEEKE+I  +HSEKLAI+FALI S +  Q +RI+KN R+C+DCH TAK +SL YG E
Sbjct: 811  CIEEEEKEDINSVHSEKLAIAFALIDSHHRPQILRIVKNLRVCKDCHDTAKYISLAYGCE 870

Query: 404  IYLYDSKCFHHFKNGQCSCRDYW 336
            IYL DS C HHFK+G CSCRDYW
Sbjct: 871  IYLSDSNCLHHFKDGHCSCRDYW 893


>ref|XP_003615696.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355517031|gb|AES98654.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 887

 Score =  939 bits (2426), Expect = 0.0
 Identities = 471/885 (53%), Positives = 627/885 (70%), Gaps = 3/885 (0%)
 Frame = -2

Query: 2981 PVADPWKQQVSTEFSSKPIK---HTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDT 2811
            P++ P K       SSK +    + VS TK S   L  +  LN LC NG L +A+T LD+
Sbjct: 15   PLSFPNKPTKFDCISSKRVNANSNNVSTTKPSIRKL-IDSQLNQLCINGSLSEAVTILDS 73

Query: 2810 IAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGL 2631
            +A+ G ++KP TY++LLQSCID D I  G+ LH+RIGLV++ NPFV+TKL+SMYAKCG L
Sbjct: 74   LAEQGCRVKPITYMNLLQSCIDKDCIFIGKELHSRIGLVENVNPFVETKLVSMYAKCGLL 133

Query: 2630 EDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKA 2451
              AR+VF+ M  RNLFTW+AMIGG +R + W E++ LF+ MM  DG+ PDEFLLPK+L+A
Sbjct: 134  GMARKVFNEMSVRNLFTWSAMIGGCSRNKSWGEVVGLFYAMM-RDGVLPDEFLLPKVLQA 192

Query: 2450 CANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTW 2271
            C    D +TG+LIHS +IR G+    H+ NS++++YAKCGE++ AK  F+ MD +D+V W
Sbjct: 193  CGKCRDLETGRLIHSMVIRRGMRWSKHLRNSIMAVYAKCGEMDCAKKIFDCMDERDSVAW 252

Query: 2270 NSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNG 2091
            N++ISG+CQ+G+  +A + FD MQ +GVEP LVTWNILI+ YNQ G+CDLA++LM KM  
Sbjct: 253  NAMISGFCQNGEIGQAQKYFDAMQKDGVEPSLVTWNILISCYNQLGHCDLAIDLMRKMEW 312

Query: 2090 RGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNK 1911
             GI PDV+TWT MISGF Q  RI+ AL+L +EM L GVE N +T+            L+ 
Sbjct: 313  FGIAPDVYTWTSMISGFTQKGRISHALDLLKEMFLAGVEANNITIASAASACAALKSLSM 372

Query: 1910 GKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQ 1731
            G E+H++ VK+  + NVLVGNSLIDMY KCG  + A+ +F+M+ E+DV +WNS+IGGY Q
Sbjct: 373  GLEIHSIAVKMNLVDNVLVGNSLIDMYCKCGDLKAAQHIFDMMSERDVYSWNSIIGGYFQ 432

Query: 1730 AGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTA 1551
            AG+CGKA++LFMKMQ SD PPN++TWN+M +GY+Q G EDQA+ LF+ +E +G  KRN A
Sbjct: 433  AGFCGKAHELFMKMQESDSPPNIITWNIMITGYMQSGAEDQALDLFKSIEKDGKTKRNAA 492

Query: 1550 SWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCV 1371
            SWN LI+G +Q+GQK+KAL IFR MQ     PNS+T+LSILP CANL+++KKVKEIH   
Sbjct: 493  SWNSLISGFVQSGQKDKALQIFRNMQFCHILPNSVTILSILPVCANLVASKKVKEIHCFA 552

Query: 1370 LRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISL 1191
            +RR L S +S+ N LID+YAKSG+++ ++ +F +L  +D +SWN++++ YVLHG    +L
Sbjct: 553  VRRILVSELSVSNLLIDSYAKSGNLMYSKNIFNELSWKDAVSWNSMLSSYVLHGCSESAL 612

Query: 1190 DLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVAL 1011
            DLF +MR  G  PNRGTFAS +LAY  A MV+EGK  FS +TKDY +  G+EHYSAMV L
Sbjct: 613  DLFYQMRKQGLQPNRGTFASILLAYGHAGMVDEGKSVFSCITKDYLVRQGMEHYSAMVYL 672

Query: 1010 LGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVH 831
            LGRSG+  EA +FI+ M I+P+S+VW ALLTACRIH N G+A+ A ++++  EP N +  
Sbjct: 673  LGRSGKLAEALDFIQSMPIEPNSSVWGALLTACRIHRNFGVAVLAGKRMLEFEPGNNITR 732

Query: 830  RLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIY 651
             LL Q Y+L GK E      K +N+      +G S I  NN  HTF+ GD+S P  D ++
Sbjct: 733  HLLSQAYSLCGKFEPEG--EKAVNK-----PIGQSWIERNNVVHTFVVGDQSNPYLDKLH 785

Query: 650  ARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIK 471
            + +  +   +K    D+   +L I+EEEKEN   +HSEKLA +FALI      Q +RI+K
Sbjct: 786  SWLKRVAVNVKTHVSDN---ELYIEEEEKENTSSVHSEKLAFAFALIDPHNKPQILRIVK 842

Query: 470  NFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
              RMCRDCH TAK +S+ YG EIYL DS C HHFK G CSCRDYW
Sbjct: 843  KLRMCRDCHDTAKYISMAYGCEIYLSDSNCLHHFKGGHCSCRDYW 887


>ref|XP_004490605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Cicer arietinum]
          Length = 888

 Score =  930 bits (2404), Expect = 0.0
 Identities = 477/900 (53%), Positives = 628/900 (69%), Gaps = 3/900 (0%)
 Frame = -2

Query: 3026 MENSIILQKFISKPPPVADPWKQQVSTEFSSKPIK---HTVSFTKNSATPLPSEIHLNYL 2856
            MEN  I+    SKPP     +         SK +    + VS TK S  P   +  LN L
Sbjct: 1    MENIHIIIPSKSKPPLSFSSYNATQFDCIVSKRVNANSNNVSITKTS-NPKLMDAQLNQL 59

Query: 2855 CRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPF 2676
            C NG L + +T LD IA+ G K++P TY++LLQSCID D I  G+ LHARIGLV+  NPF
Sbjct: 60   CINGSLSEVVTYLDAIAEQGSKVRPITYMNLLQSCIDKDCIFVGKELHARIGLVEKVNPF 119

Query: 2675 VQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMED 2496
            V+TKL+SMYAKCG L+ AR+VFD M  RNLFTW+AMIG  +R + WKE++ LF+  MME 
Sbjct: 120  VETKLVSMYAKCGYLDKARKVFDEMHVRNLFTWSAMIGACSRNKSWKEVVGLFY-EMMEH 178

Query: 2495 GITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSA 2316
            G+ PDEFLLPK+L+AC    D +T +LIHS +IR G+     V+NS++++YAKCGE++ A
Sbjct: 179  GVLPDEFLLPKVLQACGKCRDLETARLIHSMMIRRGMCWNERVHNSIMAVYAKCGEMDCA 238

Query: 2315 KWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQS 2136
            K  F+ MD K++V WN++ISG+CQ+G+ E+A + FD MQ EG+EPGLVTWNILIA YNQ 
Sbjct: 239  KKIFDCMDRKNSVVWNAMISGFCQNGEIEQAHKYFDAMQKEGIEPGLVTWNILIACYNQL 298

Query: 2135 GNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTV 1956
            G CDLA++LM KM   GI PDV+TWT MISGF+Q  RI+ AL+L REM L GVEPN +T+
Sbjct: 299  GFCDLAIDLMRKMECLGIAPDVYTWTSMISGFSQKGRISHALDLLREMFLAGVEPNSITI 358

Query: 1955 XXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILE 1776
                        L+ G E+H++ VK+  +GN+L+GNSLIDMYSKCG  + A+ +F+M+L 
Sbjct: 359  ASAASACASLKSLSMGLEIHSIAVKMNLVGNLLIGNSLIDMYSKCGDLKAAQCIFDMMLV 418

Query: 1775 KDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVL 1596
            +DV +WNS+IGGY QAG+CGKA++LF KMQ S+ PPN+VTWNVM +GY+Q G ED+A+ L
Sbjct: 419  RDVYSWNSIIGGYFQAGFCGKAHELFRKMQESNSPPNIVTWNVMITGYMQSGAEDRALDL 478

Query: 1595 FQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACA 1416
            F  +E +G +KRN ASWN LI+G LQ GQK+KAL +FR MQ      NS+T+LSILPACA
Sbjct: 479  FTSIEKDGKIKRNVASWNSLISGFLQIGQKDKALQLFRNMQFFHIALNSVTILSILPACA 538

Query: 1415 NLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNT 1236
            NL+++KKVKEIH C +RRNL S + + + LID+YAKSG+++ +R +F  L  +DV+S N+
Sbjct: 539  NLVASKKVKEIHCCSVRRNLVSELPVSHLLIDSYAKSGNLMYSRNIFYGLSWKDVVSLNS 598

Query: 1235 LIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDY 1056
            +++GYVL+G    ++DLF++MR  G  PNRGTFA+ +LAY    MV+EGK  FS MT +Y
Sbjct: 599  MLSGYVLNGCSESAIDLFHQMRKEGIRPNRGTFATILLAYGHTGMVDEGKHVFSCMTNEY 658

Query: 1055 EILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHA 876
             I PG+EHYSAMV +LGRSG+  EA EFI+ M I+P+S VW ALLTAC+IH N G+A+ A
Sbjct: 659  LIRPGMEHYSAMVYMLGRSGKLAEALEFIQNMPIEPNSLVWDALLTACKIHRNFGMAVLA 718

Query: 875  AEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHT 696
             ++L+ LEP N +   LL Q Y+L GK        K +N+      +G   I  NN  HT
Sbjct: 719  GKRLLELEPGNNITRYLLSQAYSLCGKF--TLEEEKAVNK-----PVGQCWIERNNTVHT 771

Query: 695  FMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFA 516
            F+ GD+S    D + + +  +   +K    D+    LCI+EEE+EN   +HSEKLA +FA
Sbjct: 772  FVVGDQSYTYLDKLRSWLKRVAVNVKTHVFDN---GLCIEEEERENNSIVHSEKLAFAFA 828

Query: 515  LIASPYTSQSIRIIKNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
             I    T + + I+KN RMCRDCH TAK +SL YG EIYL DS C HHFK G CSCRDYW
Sbjct: 829  FIDPHNTPRILHIVKNLRMCRDCHDTAKYISLAYGCEIYLSDSNCLHHFKGGHCSCRDYW 888


>ref|XP_004168675.1| PREDICTED: pentatricopeptide repeat-containing protein
            At1g19720-like, partial [Cucumis sativus]
          Length = 1090

 Score =  929 bits (2401), Expect = 0.0
 Identities = 469/839 (55%), Positives = 612/839 (72%), Gaps = 3/839 (0%)
 Frame = -2

Query: 2984 PPVADPWK--QQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDT 2811
            PP++ P    +    +FSSKPIK ++ FT    +    + HL+YLC NG L++AITA+D+
Sbjct: 12   PPISGPASVIKPRPLKFSSKPIKTSIFFTYKLTSKFNDD-HLSYLCSNGLLREAITAIDS 70

Query: 2810 IAKHGYKLKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGL 2631
            I+K G KL   TYI+LLQ+CID  SIE GR LH R+GLV   NPFV+TKL+SMYAKCG L
Sbjct: 71   ISKRGSKLSTNTYINLLQTCIDVGSIELGRELHVRMGLVHRVNPFVETKLVSMYAKCGCL 130

Query: 2630 EDARRVFDTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKA 2451
            +DAR+VFD M ERNL+TW+AMIG Y+REQRWKE++ELFFLMM  DG+ PD FL PKIL+A
Sbjct: 131  KDARKVFDGMQERNLYTWSAMIGAYSREQRWKEVVELFFLMM-GDGVLPDAFLFPKILQA 189

Query: 2450 CANSGDAQTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTW 2271
            C N  D +T KLIHS +IR GL   + ++NS+L+ + KCG+L+ A+ FF  MD +D V+W
Sbjct: 190  CGNCEDLETVKLIHSLVIRCGLSCYMRLSNSILTAFVKCGKLSLARKFFGNMDERDGVSW 249

Query: 2270 NSIISGYCQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNG 2091
            N +I+GYCQ G  ++A RL D M  +G +PGLVT+NI+IASY+Q G+CDL ++L  KM  
Sbjct: 250  NVMIAGYCQKGNGDEARRLLDTMSNQGFKPGLVTYNIMIASYSQLGDCDLVIDLKKKMES 309

Query: 2090 RGITPDVFTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNK 1911
             G+ PDV+TWT MISGF+Q++RI+QAL+ F++M+L GVEPN +T+            L  
Sbjct: 310  VGLAPDVYTWTSMISGFSQSSRISQALDFFKKMILAGVEPNTITIASATSACASLKSLQN 369

Query: 1910 GKELHALGVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQ 1731
            G E+H   +K+G     LVGNSLIDMYSKCGK E AR VF+ ILEKDV TWNS+IGGY Q
Sbjct: 370  GLEIHCFAIKMGIARETLVGNSLIDMYSKCGKLEAARHVFDTILEKDVYTWNSMIGGYCQ 429

Query: 1730 AGYCGKAYDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTA 1551
            AGY GKAY+LFM+++ S V PNVVTWN M SG +Q GDEDQAM LFQ ME +G +KRNTA
Sbjct: 430  AGYGGKAYELFMRLRESTVMPNVVTWNAMISGCIQNGDEDQAMDLFQIMEKDGGVKRNTA 489

Query: 1550 SWNLLIAGLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCV 1371
            SWN LIAG  Q G+KNKAL IFRQMQ L   PNS+T+LSILPACAN+++ KK+KEIHGCV
Sbjct: 490  SWNSLIAGYHQLGEKNKALAIFRQMQSLNFSPNSVTILSILPACANVMAEKKIKEIHGCV 549

Query: 1370 LRRNLESNVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISL 1191
            LRRNLES +++ NSL+DTYAKSG+I  +R +F  + S+D+I+WN++IAGY+LHG  + + 
Sbjct: 550  LRRNLESELAVANSLVDTYAKSGNIKYSRTVFNGMSSKDIITWNSIIAGYILHGCSDSAF 609

Query: 1190 DLFNRMRLLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVAL 1011
             LF++MR LG  PNRGT AS I AY +A MV++G+  FSS+T++++ILP L+HY AMV L
Sbjct: 610  QLFDQMRNLGIRPNRGTLASIIHAYGIAGMVDKGRHVFSSITEEHQILPTLDHYLAMVDL 669

Query: 1010 LGRSGRFKEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVH 831
             GRSGR  +A EFIE+M I+PD ++WT+LLTACR HGN+ LA+ AA++L  LEP+N++++
Sbjct: 670  YGRSGRLADAIEFIEDMPIEPDVSIWTSLLTACRFHGNLNLAVLAAKRLHELEPDNHVIY 729

Query: 830  RLLLQLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIY 651
            RLL+Q YAL GK E   ++RK    +          + V NK H F+TGD+S    D + 
Sbjct: 730  RLLVQAYALYGKFEQTLKVRKLGKESAMKKCTAQCWVEVRNKVHLFVTGDQS--KLDVLN 787

Query: 650  ARIDSIGNEIKVVAPDSRETQLCIDEEEK-ENIGGIHSEKLAISFALIASPYTSQSIRI 477
              I SI  ++K     +   QL I+EEEK E IGG H EK A +F LI S +T +SI+I
Sbjct: 788  TWIKSIEGKVKKF---NNHHQLSIEEEEKEEKIGGFHCEKFAFAFGLIGSSHTRKSIKI 843


>ref|XP_004301846.1| PREDICTED: pentatricopeptide repeat-containing protein At1g19720-like
            [Fragaria vesca subsp. vesca]
          Length = 892

 Score =  927 bits (2395), Expect = 0.0
 Identities = 472/835 (56%), Positives = 612/835 (73%), Gaps = 1/835 (0%)
 Frame = -2

Query: 2924 KHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRTYISLLQSCID 2745
            K T++F+       P   +L+ LC+ GQL  A+  LD+IA+ G KL   TY++LLQSCID
Sbjct: 27   KPTLTFSNKPRQNPPLVQNLHLLCKTGQLADAVAVLDSIAQTGSKLPAATYMNLLQSCID 86

Query: 2744 SDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVFDTMLERNLFTWAAMI 2565
            S+SI  GR LH  I  V D  PFV+TKL+SMYAKCG LEDAR+VFD M ERNL+TW+AMI
Sbjct: 87   SNSIHLGRKLHRVIHAVDDVTPFVETKLVSMYAKCGCLEDARKVFDEMRERNLYTWSAMI 146

Query: 2564 GGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGKLIHSFIIRSGL 2385
            G   RE+RW E++ELF LM+  DG+ PD FL+PK+L+AC N GD    +++HS ++RSGL
Sbjct: 147  GACLRERRWGEVVELFALMV-RDGVLPDWFLVPKVLQACGNCGDFAAARMVHSMVVRSGL 205

Query: 2384 DSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSGKNEKAMRLFDQ 2205
               L V+N+LL++YAKCGEL SA+ FF+KM+ +D V+WNSI+SGYCQ+G N +A RL D+
Sbjct: 206  IGNLRVSNALLAVYAKCGELESARRFFDKMEVRDGVSWNSIVSGYCQNGDNVEARRLIDE 265

Query: 2204 MQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWTCMISGFAQNNR 2025
            M  +G+EPGLVTWNILI+S N+SG CD+AMELM KM   GI PDV+TWT MISGFAQNNR
Sbjct: 266  MIRQGIEPGLVTWNILISSCNKSGQCDVAMELMKKMESCGIIPDVYTWTAMISGFAQNNR 325

Query: 2024 INQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKLGTMGNVLVGNS 1845
             NQAL+L+++M+L+GV PNG+T+            L KG E++A  VK+G   +VLVGNS
Sbjct: 326  TNQALDLWKKMILLGVLPNGITIASAILACTSLKSLTKGLEVYAFAVKIGLTDDVLVGNS 385

Query: 1844 LIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLFMKMQGSDVPPN 1665
            LIDM+SKCG  E A +VF ++ EKDV +WNS+IGGY QA YCGKAY+LFMKMQ SDV PN
Sbjct: 386  LIDMFSKCGDLEAAEQVFNVMSEKDVYSWNSMIGGYCQARYCGKAYELFMKMQESDVRPN 445

Query: 1664 VVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLLQNGQKNKALGIF 1485
             +T+NVM +GY+Q GD DQAM LFQ ME +G +KRNTASWN LIAG  Q G+ N+AL IF
Sbjct: 446  AITYNVMITGYIQNGDADQAMDLFQMMERDGKVKRNTASWNSLIAGYAQLGEINEALRIF 505

Query: 1484 RQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVSIVNSLIDTYAKS 1305
            R+MQ     PN++TLLSILPACA+L + KKVKEIHG V RRNLE  + + NSLIDTYAKS
Sbjct: 506  RKMQTFGVSPNAVTLLSILPACASLAAMKKVKEIHGSVFRRNLEFELPVANSLIDTYAKS 565

Query: 1304 GDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLGFLPNRGTFASTI 1125
            G+I  +R +F+ + S+D+I+WN+ I+GYVLHG+P+++LDLF+RM+ LG  PNRGTFA+ +
Sbjct: 566  GNIEYSRTIFDRMASKDIITWNSAISGYVLHGHPDVALDLFDRMKQLGLKPNRGTFAAVL 625

Query: 1124 LAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEATEFIEEMNIQPD 945
             AYSLAKMVNEG +  SS++++Y+I+PG EHYSA+V L GRSGR +EA EFIE+M I+PD
Sbjct: 626  YAYSLAKMVNEGIEALSSISEEYQIIPGPEHYSAIVDLYGRSGRLQEAVEFIEDMPIEPD 685

Query: 944  STVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLYALGGKSEDASRMRKT 765
            S+VW ALLTACR HGN+ LAIHA E+LI LE  N ++ + +LQ YAL GK +D S++R+ 
Sbjct: 686  SSVWAALLTACRNHGNLSLAIHAGERLIDLEQGNVLIQQFVLQAYALSGKPDDTSKLRRL 745

Query: 764  INRNGT-ANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIGNEIKVVAPDSRETQ 588
               N T   SLG   + VNN  HTF++GDRS   S  + + +  I    K   PD R   
Sbjct: 746  GKENATIKRSLGQCWMLVNNTVHTFISGDRSKLCSKYVNSWLQDIAE--KANGPDFR-CG 802

Query: 587  LCIDEEEKENIGGIHSEKLAISFALIASPYTSQSIRIIKNFRMCRDCHKTAKLVS 423
            L ++EEE E I  +H EKLA++FALI S    +  R++     C    +TA++ S
Sbjct: 803  LAVEEEE-EGISMVHCEKLALAFALIGSQSVPKRDRVLVKGSSCVLKEETAEVPS 856


>ref|XP_002893064.1| hypothetical protein ARALYDRAFT_472198 [Arabidopsis lyrata subsp.
            lyrata] gi|297338906|gb|EFH69323.1| hypothetical protein
            ARALYDRAFT_472198 [Arabidopsis lyrata subsp. lyrata]
          Length = 1490

 Score =  922 bits (2383), Expect = 0.0
 Identities = 463/851 (54%), Positives = 608/851 (71%), Gaps = 5/851 (0%)
 Frame = -2

Query: 2969 PWKQQVSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYK 2790
            P K + S E   K  K  +SFTK     +  +  L+YLCRNG L +A  ALD++ + G K
Sbjct: 19   PAKVENSPEVHPKSRKKNLSFTKKKEPNIIPDEQLDYLCRNGSLLEAEKALDSLFQQGSK 78

Query: 2789 LKPRTYISLLQSCIDSDSIEQGRMLHARIGLVQDPNPFVQTKLLSMYAKCGGLEDARRVF 2610
            +K  TY++LL+SCIDS SI  GR+LHAR GL  +P+ FV+TKLLSMYAKCG L DAR+VF
Sbjct: 79   VKRSTYLNLLESCIDSGSIHLGRILHARFGLFPEPDVFVETKLLSMYAKCGCLVDARKVF 138

Query: 2609 DTMLERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDA 2430
            D+M ERNL+TW+AMIG Y+RE RW+E+ +LF LMM E+G+ PD+FL PKIL+ CAN GD 
Sbjct: 139  DSMRERNLYTWSAMIGAYSRENRWREVSKLFRLMM-EEGVLPDDFLFPKILQGCANCGDV 197

Query: 2429 QTGKLIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGY 2250
            +TGKLIHS +I+ G+ SCL V+NS+L++YAKCGE + A  FF +M  +D V WNS++  Y
Sbjct: 198  ETGKLIHSVVIKLGMSSCLRVSNSILAVYAKCGEWDFATKFFRRMKERDVVAWNSVLLAY 257

Query: 2249 CQSGKNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDV 2070
            CQ+GK+E+A+ L ++M+ EG+ PGLVTWNILI  YNQ G CD AM+LM KM   GIT DV
Sbjct: 258  CQNGKHEEAVELVEEMEKEGISPGLVTWNILIGGYNQLGKCDAAMDLMQKMENFGITADV 317

Query: 2069 FTWTCMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHAL 1890
            FTWT MISG   N    QAL++FR+M L GV PN VT+            +N G E+H++
Sbjct: 318  FTWTAMISGLIHNGMRYQALDMFRKMFLAGVVPNAVTIMSAVSACSYLKVINLGSEVHSI 377

Query: 1889 GVKLGTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKA 1710
             VK+G + +VLVGNSL+DMYSKCGK E AR+VF+ +  KDV TWNS+I GY QAGYCGKA
Sbjct: 378  AVKMGFIDDVLVGNSLVDMYSKCGKLEDARKVFDSVKNKDVYTWNSMITGYCQAGYCGKA 437

Query: 1709 YDLFMKMQGSDVPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIA 1530
            Y+LF +MQ ++V PN++TWN M SGY++ GDE +AM LFQRME +G ++RNTA+WNL+IA
Sbjct: 438  YELFTRMQDANVRPNIITWNTMISGYIKNGDEGEAMDLFQRMEKDGKVQRNTATWNLIIA 497

Query: 1529 GLLQNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLES 1350
            G +QNG+K+ AL IFR+MQ     PNS+T+LS+LPACANLL  K V+EIHGCVLRRNL++
Sbjct: 498  GYIQNGKKDDALEIFRKMQFSRFMPNSVTILSLLPACANLLGTKMVREIHGCVLRRNLDA 557

Query: 1349 NVSIVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMR 1170
              ++ N+L DTYAKSGDI  ++ +F  + ++D+I+WN+LI GYVLHG    +L+LFN+M+
Sbjct: 558  IHAVKNALTDTYAKSGDIGYSKTIFMGMETKDIITWNSLIGGYVLHGSYGPALELFNQMK 617

Query: 1169 LLGFLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRF 990
              G  PNRGT +S ILA+ L   V+EGK+ F S+  DY I+P LEH SAMV+L GRS R 
Sbjct: 618  TQGIKPNRGTLSSIILAHGLMGNVDEGKKVFYSIANDYHIIPALEHCSAMVSLYGRSNRL 677

Query: 989  KEATEFIEEMNIQPDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPENYMVHRLLLQLY 810
            +EA +FI+EMNIQ ++ +W + LT CRIHG+I +AIHAAE L +LEPEN +   ++ Q+Y
Sbjct: 678  EEALQFIQEMNIQSETPIWESFLTGCRIHGDIDMAIHAAENLFSLEPENTVTENIVSQIY 737

Query: 809  ALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYARIDSIG 630
            ALG K   +   +K    N     LG S I V N  HTF TGD+S   +D +Y  ++   
Sbjct: 738  ALGAKLGRSLEGKKPRRDNLLKKPLGQSWIEVRNLIHTFTTGDQSKLCTDLLYPWVE--- 794

Query: 629  NEIKVVAPDSRETQ----LCIDEEEKENIGGIHSEKLAISFALIASPYTSQ-SIRIIKNF 465
               K+   D+R  Q    L I+EE +E   GIHSEK A++F LI+S    + +IRI+KN 
Sbjct: 795  ---KMCRVDNRSDQYNGELLIEEEGREETCGIHSEKFAMAFGLISSSRAPKATIRILKNL 851

Query: 464  RMCRDCHKTAK 432
            RMCRDCH TAK
Sbjct: 852  RMCRDCHNTAK 862


>gb|EYU38829.1| hypothetical protein MIMGU_mgv1a001151mg [Mimulus guttatus]
          Length = 876

 Score =  888 bits (2294), Expect = 0.0
 Identities = 461/886 (52%), Positives = 627/886 (70%), Gaps = 13/886 (1%)
 Frame = -2

Query: 2954 VSTEFSSKPIKHTVSFTKNSATPLPSEIHLNYLCRNGQLKQAITALDTIAKHGYKLKPRT 2775
            + ++ S  P K   +F++ +   L ++ +L  LC +G+L +AI++LD             
Sbjct: 13   IPSKISEYPSKFK-AFSRIAQQKLANDAYLKGLCNHGRLTEAISSLD------------- 58

Query: 2774 YISLLQSCIDSDSIEQGRMLHARIGL-VQDPNPFVQTKLLSMYAKCGGLEDARRVFDTML 2598
              SL++SCIDS+S++    LHA +   V++P+PF++TKL+ MYAKCG L+DA  VF+ M 
Sbjct: 59   --SLIESCIDSNSLDLCYKLHATVKKWVKEPDPFLETKLVGMYAKCGSLDDAFIVFEEMR 116

Query: 2597 ERNLFTWAAMIGGYNREQRWKEIIELFFLMMMEDGITPDEFLLPKILKACANSGDAQTGK 2418
            +RNL+TW+A+IG  +RE+RW +++ELF+ MM +  + PD FL PKIL+AC+NS DA+TG+
Sbjct: 117  QRNLYTWSAIIGACSREKRWGDVVELFYWMMKDGDVIPDNFLFPKILQACSNSRDAETGR 176

Query: 2417 LIHSFIIRSGLDSCLHVNNSLLSMYAKCGELNSAKWFFEKMDNKDNVTWNSIISGYCQSG 2238
            LIH   I+ GL   L VNNS+LS+YAKCG L+ A+ FFE+M+  D V+WN++I+GYC +G
Sbjct: 177  LIHGMAIKLGLSRELRVNNSILSVYAKCGLLSLAEKFFERMEVNDRVSWNAMITGYCHAG 236

Query: 2237 KNEKAMRLFDQMQAEGVEPGLVTWNILIASYNQSGNCDLAMELMNKMNGRGITPDVFTWT 2058
            +  +A RL + M+ EG+EP  +TWN+LI+S N  G CD+A +LMN M   G+ PDVFTWT
Sbjct: 237  QINEAERLIESMKEEGLEPDEITWNVLISSCNHLGKCDVAKKLMNAMETCGVKPDVFTWT 296

Query: 2057 CMISGFAQNNRINQALELFREMMLVGVEPNGVTVXXXXXXXXXXXXLNKGKELHALGVKL 1878
             MI GFAQNNR  +A++LFREM+L GV PNG+TV            + KGKE+H + +KL
Sbjct: 297  SMILGFAQNNRRLEAVKLFREMLLSGVVPNGITVMSAISACSSLKDVRKGKEVHLVAIKL 356

Query: 1877 GTMGNVLVGNSLIDMYSKCGKPEVARRVFEMILEKDVVTWNSLIGGYTQAGYCGKAYDLF 1698
            G   +VLVGNSL+DMYSKCGK + ARRVF+ + EKDV TWNS+IGGY QAGYCG A+DLF
Sbjct: 357  GHGEDVLVGNSLVDMYSKCGKLDSARRVFDTMSEKDVYTWNSMIGGYCQAGYCGVAHDLF 416

Query: 1697 MKMQGSD-VPPNVVTWNVMASGYLQKGDEDQAMVLFQRMETEGILKRNTASWNLLIAGLL 1521
             +MQ S  + PNVVTWNVM +GY+Q GDED+AM +F  ME  G +KR+TA+WN LIAGLL
Sbjct: 417  KQMQESGFILPNVVTWNVMITGYIQNGDEDEAMDMFNTMEKIGGVKRDTATWNALIAGLL 476

Query: 1520 QNGQKNKALGIFRQMQRLCAKPNSITLLSILPACANLLSAKKVKEIHGCVLRRNLESNVS 1341
             +GQKNKALGIFRQMQ    KPNS+T+LSILPACANL++ KK+KEIH CV++R+LES +S
Sbjct: 477  DHGQKNKALGIFRQMQSCGVKPNSVTVLSILPACANLIAVKKLKEIHCCVVKRSLESELS 536

Query: 1340 IVNSLIDTYAKSGDIVSARALFEDLLSRDVISWNTLIAGYVLHGYPNISLDLFNRMRLLG 1161
            + NS+IDTYAK+G+I  ++ +F ++ S D+I+WNT+  GYVLHG  + +++LF  M    
Sbjct: 537  VANSMIDTYAKAGEIEYSKKIFANMPSVDIITWNTMTTGYVLHGCADEAIELFEHMTRQE 596

Query: 1160 FLPNRGTFASTILAYSLAKMVNEGKQTFSSMTKDYEILPGLEHYSAMVALLGRSGRFKEA 981
              PNRGTFAS I AY LAK V EGK+ FS+MT++Y+I+P L+HY A+V L GRSG+  EA
Sbjct: 597  CRPNRGTFASVISAYGLAKKVEEGKRVFSNMTEEYQIVPCLDHYVAVVNLYGRSGKVDEA 656

Query: 980  TEFIEEMNIQ--PDSTVWTALLTACRIHGNIGLAIHAAEQLITLEPEN----YMVHRLLL 819
             EF+  M  +   D ++W ALLT CR HGN+ LAIHA E+L+ LEP+N      V +L+L
Sbjct: 657  FEFVANMASEESEDVSIWRALLTCCRRHGNVKLAIHAGEKLLELEPDNNNDTLFVRKLVL 716

Query: 818  QLYALGGKSEDASRMRKTINRNGTANSLGCSRITVNNKEHTFMTGDRSMPNSDSIYA--- 648
            QLY L G S+++ +M++   +  T  SLG S I   N  HTF++GD    +  S+ +   
Sbjct: 717  QLYDLRGISKESLKMKR---KETTGYSLGRSWIEEKNTVHTFVSGDLRQLDGKSLRSWIE 773

Query: 647  RIDSIGNEIKVVAPDSRETQLCIDEEEKENIGGIHSEKLAISFALIAS--PYTSQSIRII 474
            R++S   E +     S E +   +EEE+E   GIHSEKLA++FALI S    T ++IR++
Sbjct: 774  RVESCNKESQYRDMLSIEEE---EEEEEEESVGIHSEKLALAFALIKSCRESTPRTIRVV 830

Query: 473  KNFRMCRDCHKTAKLVSLIYGREIYLYDSKCFHHFKNGQCSCRDYW 336
            KN RMC +CH+ AKLVS  +G EIY+ DSK  HHFKNG CSCRDYW
Sbjct: 831  KNVRMCGNCHRFAKLVSKRHGCEIYISDSKSLHHFKNGVCSCRDYW 876

