BLASTX nr result

ID: Catharanthus23_contig00016841 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00016841
         (2441 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279589.1| PREDICTED: pentatricopeptide repeat-containi...  1066   0.0  
gb|EOX97442.1| Pentatricopeptide repeat (PPR) superfamily protei...  1046   0.0  
gb|EMJ01512.1| hypothetical protein PRUPE_ppa001249mg [Prunus pe...  1040   0.0  
ref|XP_002527150.1| pentatricopeptide repeat-containing protein,...  1028   0.0  
ref|XP_006350361.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  1026   0.0  
ref|XP_004290060.1| PREDICTED: pentatricopeptide repeat-containi...  1016   0.0  
ref|XP_006384843.1| hypothetical protein POPTR_0004s21560g [Popu...  1002   0.0  
ref|XP_006422433.1| hypothetical protein CICLE_v10027787mg [Citr...  1001   0.0  
ref|XP_004231526.1| PREDICTED: pentatricopeptide repeat-containi...   995   0.0  
ref|XP_006486600.1| PREDICTED: pentatricopeptide repeat-containi...   995   0.0  
gb|EXC31542.1| hypothetical protein L484_006574 [Morus notabilis]     976   0.0  
ref|XP_002328265.1| predicted protein [Populus trichocarpa]           965   0.0  
ref|XP_004154721.1| PREDICTED: pentatricopeptide repeat-containi...   956   0.0  
ref|XP_004144886.1| PREDICTED: pentatricopeptide repeat-containi...   956   0.0  
ref|NP_179305.2| pentatricopeptide repeat-containing protein [Ar...   940   0.0  
ref|XP_006296960.1| hypothetical protein CARUB_v10012952mg, part...   940   0.0  
ref|XP_002884041.1| binding protein [Arabidopsis lyrata subsp. l...   936   0.0  
ref|XP_006409347.1| hypothetical protein EUTSA_v10022542mg [Eutr...   924   0.0  
gb|ESW31227.1| hypothetical protein PHAVU_002G220400g [Phaseolus...   906   0.0  
ref|XP_004504955.1| PREDICTED: pentatricopeptide repeat-containi...   905   0.0  

>ref|XP_002279589.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140
            [Vitis vinifera] gi|297744485|emb|CBI37747.3| unnamed
            protein product [Vitis vinifera]
          Length = 878

 Score = 1066 bits (2758), Expect = 0.0
 Identities = 516/789 (65%), Positives = 632/789 (80%), Gaps = 8/789 (1%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVIS--------INYVTLITRILIGSGMFTEIEALHNLLLSRPCE 2188
            P LAW LFKR  +SIP  S        +  + +IT ILI + M ++I+ L  LLL +P E
Sbjct: 18   PTLAWHLFKRI-LSIPTSSSSISSRSILRSIPIITHILIRAKMISQIDHLQQLLLQQPQE 76

Query: 2187 THRVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELIS 2008
               VS  A+++ILA  G  D A SQFQ  R+  P++PPP+ LYN ++ESS++ +  +  S
Sbjct: 77   VSHVSLIALIRILAKSGLSDLAFSQFQSFRSQVPANPPPVYLYNMVLESSLREDKVDSFS 136

Query: 2007 WLYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGY 1828
            WLYKDM+ AG+ P+TYT NLLI  LCD GR EDAR++FDKM  KGC PNEF+FGILVRGY
Sbjct: 137  WLYKDMVVAGVSPETYTLNLLIAGLCDSGRFEDAREVFDKMGVKGCRPNEFSFGILVRGY 196

Query: 1827 CRXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNV 1648
            CR          LD M   GV PN VIYNTLIS FCREG+ E AERLVE+M+EDG+ P+V
Sbjct: 197  CRAGLSMRALELLDGMGSFGVQPNKVIYNTLISSFCREGRNEEAERLVERMREDGLFPDV 256

Query: 1647 VTFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQK 1468
            VTFNSRISALC AG ILEASRIF DMQ+D+E GLP+PN  T+NLMLEGFCKEGMLEE + 
Sbjct: 257  VTFNSRISALCSAGKILEASRIFRDMQIDEELGLPRPNITTFNLMLEGFCKEGMLEEAKT 316

Query: 1467 LIETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLC 1288
            L+E+MKRNG    ++SYNIWL GLV+NGKL +AQ  LK+MVD+GI+PN Y+FN V+DGLC
Sbjct: 317  LVESMKRNGNLMELESYNIWLLGLVRNGKLLEAQLALKEMVDKGIEPNIYSFNTVMDGLC 376

Query: 1287 KNGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTY 1108
            KNG++ DARM+MGLM ++GI PDTVTYSTLLHG CS GK+++AN +LH MM  GC+PNTY
Sbjct: 377  KNGLISDARMIMGLMISSGIGPDTVTYSTLLHGCCSTGKVLKANNILHEMMRRGCSPNTY 436

Query: 1107 TCNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEM 928
            TCN+LLHSLWKEG+I EAEKLLQ+MNER YDLD V+CNIVIDGLCK+GK+D+AVEIV  M
Sbjct: 437  TCNILLHSLWKEGRIFEAEKLLQKMNERSYDLDNVTCNIVIDGLCKSGKLDEAVEIVEGM 496

Query: 927  WNHGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQ 748
            W HGSAALG+LGNSFIGLVD  +N KKC+PDLITYS IINGLC  GRLDEA+KKFIEM+ 
Sbjct: 497  WIHGSAALGNLGNSFIGLVDSSSNGKKCLPDLITYSIIINGLCKAGRLDEARKKFIEMVG 556

Query: 747  KNLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYE 568
            K+L+PDSI+YD F++S CK GK+SSAF+VLKDMEKRGC +SLQTYNSLILGLG+KNQI+E
Sbjct: 557  KSLHPDSIIYDTFIHSFCKHGKISSAFRVLKDMEKRGCNKSLQTYNSLILGLGSKNQIFE 616

Query: 567  MCGLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLII 388
            + GL+D+M+E+GI PN+ TYN +ISCLCEGGR+++A SL++EMLQ+GISPNI SF+LLI 
Sbjct: 617  IYGLLDDMKEKGITPNICTYNNMISCLCEGGRIKDATSLLDEMLQKGISPNISSFRLLIK 676

Query: 387  SFCRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLG 208
            +FC+  +F   +EVF++AL ICGHKE LY+ +FNE+L GGE  EAKELF+A LDRCFDLG
Sbjct: 677  AFCKASDFGVVKEVFEIALSICGHKEALYSLMFNELLIGGEVSEAKELFDAALDRCFDLG 736

Query: 207  SFLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSE 28
            +F Y DL+++LCK+E  E A DIL KM+ KGY FDP SFMPVID L K G KH+A+EL+E
Sbjct: 737  NFQYNDLIEKLCKDEMLENASDILHKMIDKGYRFDPASFMPVIDGLGKRGKKHDADELAE 796

Query: 27   SMLNMASGG 1
             M++MAS G
Sbjct: 797  RMMDMASEG 805



 Score =  152 bits (385), Expect = 5e-34
 Identities = 105/420 (25%), Positives = 183/420 (43%), Gaps = 57/420 (13%)
 Frame = -3

Query: 1992 MIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXX 1813
            MI +G+ P T T++ L+   C  G++  A  +  +M  +GC PN +T  IL+    +   
Sbjct: 391  MISSGIGPDTVTYSTLLHGCCSTGKVLKANNILHEMMRRGCSPNTYTCNILLHSLWKEGR 450

Query: 1812 XXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDG---------- 1663
                   L  M E     + V  N +I   C+ GK + A  +VE M   G          
Sbjct: 451  IFEAEKLLQKMNERSYDLDNVTCNIVIDGLCKSGKLDEAVEIVEGMWIHGSAALGNLGNS 510

Query: 1662 -------------IVPNVVTFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTY 1522
                          +P+++T++  I+ LC+AG + EA + F +M          P+++ Y
Sbjct: 511  FIGLVDSSSNGKKCLPDLITYSIIINGLCKAGRLDEARKKFIEMVGKSLH----PDSIIY 566

Query: 1521 NLMLEGFCKEGMLEEVQKLIETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVD 1342
            +  +  FCK G +    ++++ M++ G   ++ +YN  + GL    ++F+   +L DM +
Sbjct: 567  DTFIHSFCKHGKISSAFRVLKDMEKRGCNKSLQTYNSLILGLGSKNQIFEIYGLLDDMKE 626

Query: 1341 EGIKPNNYTFNIVIDGLCKNGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSR----- 1177
            +GI PN  T+N +I  LC+ G + DA  L+  M   GI+P+  ++  L+  +C       
Sbjct: 627  KGITPNICTYNNMISCLCEGGRIKDATSLLDEMLQKGISPNISSFRLLIKAFCKASDFGV 686

Query: 1176 -----------------------------GKIIEANKVLHAMMMSGCTPNTYTCNVLLHS 1084
                                         G++ EA ++  A +        +  N L+  
Sbjct: 687  VKEVFEIALSICGHKEALYSLMFNELLIGGEVSEAKELFDAALDRCFDLGNFQYNDLIEK 746

Query: 1083 LWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAAL 904
            L K+  +  A  +L +M ++GY  D  S   VIDGL K GK   A E+   M +  S  +
Sbjct: 747  LCKDEMLENASDILHKMIDKGYRFDPASFMPVIDGLGKRGKKHDADELAERMMDMASEGM 806


>gb|EOX97442.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1
            [Theobroma cacao]
          Length = 872

 Score = 1046 bits (2704), Expect = 0.0
 Identities = 501/784 (63%), Positives = 632/784 (80%), Gaps = 3/784 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVIS--INYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRVSS 2170
            P LAWQLFKR   S+P  S  +  V  I+RILI S M  EI+ LH+LLLS   + + +SS
Sbjct: 17   PKLAWQLFKRIQ-SLPTDSSFLPSVPTISRILIRSNMLQEIDHLHHLLLSSQPQLNPLSS 75

Query: 2169 Y-AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKD 1993
              ++VK+LA  G+ D+A SQFQ +R  FP +PP I LYN L E  +K    + + WLYKD
Sbjct: 76   LISLVKLLARSGFFDRAFSQFQSIRTKFPQNPPSICLYNVLFECCIKERCSDYVLWLYKD 135

Query: 1992 MIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXX 1813
            M+ AG+ PQTYTFNLLIC LCDLG L+DAR+LFDKM +KGC+PNEF+FGILVRGYCR   
Sbjct: 136  MVGAGVSPQTYTFNLLICGLCDLGHLDDARELFDKMSEKGCVPNEFSFGILVRGYCRFGL 195

Query: 1812 XXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNS 1633
                   LD M+   + PN V+YNTLIS FC+EGKT+ AE+LVE+M+EDG+ P+VVTFNS
Sbjct: 196  ADKGVELLDDMRRFEIRPNRVVYNTLISSFCKEGKTDDAEKLVERMREDGLFPDVVTFNS 255

Query: 1632 RISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETM 1453
            RISALC AG ILEASRIF DMQMD+E GLP+PN +TYNLMLEGFCK+GMLEE + L+E+M
Sbjct: 256  RISALCRAGKILEASRIFRDMQMDEELGLPRPNVITYNLMLEGFCKQGMLEEAKTLVESM 315

Query: 1452 KRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGML 1273
            ++ G   N++SYNIWL GL++N KL +AQ VLKDM+ +G++PN Y++N+V+DGLCKNGML
Sbjct: 316  EKKGDLMNLESYNIWLLGLLRNAKLVEAQLVLKDMIYKGVEPNIYSYNVVMDGLCKNGML 375

Query: 1272 GDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVL 1093
             DARM+MG + ++G++PDTVT+STLLHGYC +G++  AN +LH MM +GC PNTYTCN+L
Sbjct: 376  SDARMVMGFIISSGLSPDTVTFSTLLHGYCCKGRLYAANSILHEMMRNGCFPNTYTCNIL 435

Query: 1092 LHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGS 913
            LHSLWKEGKISEAE LLQ+MNE+GY +DTV+CNIVIDGLCK+GK+DKA+EI +EMW HGS
Sbjct: 436  LHSLWKEGKISEAEDLLQKMNEKGYGVDTVTCNIVIDGLCKSGKLDKAMEIGNEMWTHGS 495

Query: 912  AALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYP 733
            AALG+LGNSFIGLVD+ N+ K+C+PDL+TYS II+ LC  GRLDEAKKKF EMM KNL P
Sbjct: 496  AALGNLGNSFIGLVDDANSSKQCIPDLVTYSIIISALCKAGRLDEAKKKFKEMMGKNLQP 555

Query: 732  DSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLM 553
            DS+++DIF++  CK GK+SSAF+VLKDMEK+GC +SLQTYNSLILGLG+KNQI+E+ GL+
Sbjct: 556  DSVIFDIFIHIFCKEGKISSAFRVLKDMEKKGCNKSLQTYNSLILGLGSKNQIFEIYGLV 615

Query: 552  DEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRI 373
            DEMRERGI PNV TYN II CLCE G++++  S+++EMLQ+GI+PNI SF++LI +FC+ 
Sbjct: 616  DEMRERGITPNVCTYNNIIRCLCENGKMQDTTSILDEMLQKGINPNISSFRMLIEAFCKA 675

Query: 372  GEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYK 193
             +F  AQE+F++AL ICGHKE LY  +FNE+L GG+  EAK +FEA L R F LG FLYK
Sbjct: 676  CDFGVAQELFEIALSICGHKEALYKLMFNELLVGGQLSEAKLVFEAALYRSFHLGGFLYK 735

Query: 192  DLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNM 13
            DL+++LCK+++ EEA  IL KM+ KGY FDP +FMPV+D L K GNKHEA+EL+E M+ M
Sbjct: 736  DLIEKLCKDKKLEEASRILHKMINKGYKFDPATFMPVVDELGKRGNKHEADELAEKMMEM 795

Query: 12   ASGG 1
            AS G
Sbjct: 796  ASDG 799


>gb|EMJ01512.1| hypothetical protein PRUPE_ppa001249mg [Prunus persica]
          Length = 872

 Score = 1040 bits (2690), Expect = 0.0
 Identities = 498/787 (63%), Positives = 636/787 (80%), Gaps = 6/787 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR-----SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLL-SRPCETH 2182
            P LAW LFKR     +S S   + +  + ++TRILI S M  EI++L  LLL S+P ET 
Sbjct: 18   PKLAWHLFKRILSSPTSSSSSDLCLRSLPIVTRILIDSKMHHEIDSLRQLLLVSQPSETL 77

Query: 2181 RVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWL 2002
            R    ++V+ LA     D A+S F+D+R+ FP  PP + LYN L+ESS++  + + + WL
Sbjct: 78   RPCLVSLVRFLAKSSLSDMAVSCFKDLRSRFPDEPPSVYLYNLLVESSLREKHVDFVLWL 137

Query: 2001 YKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCR 1822
            YKDMI +G+ P+TYTFNLLIC+LC+  RL+DAR++FDKMR+KGC PNE++ GILVRGYCR
Sbjct: 138  YKDMIVSGMKPETYTFNLLICSLCESDRLDDAREVFDKMREKGCQPNEYSVGILVRGYCR 197

Query: 1821 XXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVT 1642
                      LD M+   + PN V+YNTLIS FC++ KT+ AE+LVE+M+EDG++P+ VT
Sbjct: 198  AGLAVRGLEVLDQMRSCNLLPNRVVYNTLISSFCKQSKTDDAEKLVERMREDGMLPDAVT 257

Query: 1641 FNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLI 1462
            FNSRISALC AG ILEASRIF DM +DQE GLP+PN VTYNLML+GFC+E MLEE + L 
Sbjct: 258  FNSRISALCSAGKILEASRIFRDMHIDQEMGLPQPNVVTYNLMLQGFCREDMLEEAENLF 317

Query: 1461 ETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKN 1282
            ++M++ G F N++SYNIWL GLVKNGKL +A+ VLK+MVD+GI+PN Y++NIVI+GLCKN
Sbjct: 318  KSMEKAGNFINLESYNIWLLGLVKNGKLLEARLVLKEMVDKGIEPNIYSYNIVINGLCKN 377

Query: 1281 GMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTC 1102
            GML DARM+M LM    I+PDTVTYSTLLHG+C++GK+ EA+ +LH MMM+ C PNT+TC
Sbjct: 378  GMLRDARMVMTLMVRNNISPDTVTYSTLLHGFCNKGKVFEASNILHEMMMNNCFPNTHTC 437

Query: 1101 NVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWN 922
            N+LLHSLWKEG+ SEAE+LLQ+MNERGY LDTV+CNIVIDGLC  GK+DKA+EIVS MW 
Sbjct: 438  NILLHSLWKEGRTSEAEELLQKMNERGYGLDTVTCNIVIDGLCNDGKLDKAIEIVSGMWT 497

Query: 921  HGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKN 742
            HGSAALG+LGNSFIGLVD+ NN KKC+PDLITYSTII+GLC  GRLDEAKKKF+EMM KN
Sbjct: 498  HGSAALGNLGNSFIGLVDDSNNGKKCIPDLITYSTIISGLCKAGRLDEAKKKFMEMMGKN 557

Query: 741  LYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMC 562
            L+PDS++YD+F+ S CK+G++SSAF+VLKDMEK+GC +S+QTYNSL+LGLG+K QI+E+ 
Sbjct: 558  LHPDSVIYDMFINSFCKQGRISSAFRVLKDMEKKGCNKSIQTYNSLVLGLGSKKQIFEIY 617

Query: 561  GLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISF 382
            GLMDEMRERG+ P+V TYN +++CLCEG R+++A SL++EMLQ+GISPNI +F++LI +F
Sbjct: 618  GLMDEMRERGVTPDVCTYNYMMNCLCEGERVKDATSLLDEMLQKGISPNISTFRILIKAF 677

Query: 381  CRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSF 202
            C+  +F    EVFD+AL +CGHKEVLY+ +FNE+LAGGE L+AK LFE  LDR F LG+F
Sbjct: 678  CKACDFGVTHEVFDIALSVCGHKEVLYSLMFNELLAGGEILKAKALFEVALDRYFYLGNF 737

Query: 201  LYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESM 22
            LYKDL+DRLCK+E+ E+A  IL  M  KGYGFDP SF+PVID L+K GNK EA+EL+E+M
Sbjct: 738  LYKDLIDRLCKDEKLEDASSILHTMKNKGYGFDPASFLPVIDGLSKRGNKQEADELAEAM 797

Query: 21   LNMASGG 1
            ++M S G
Sbjct: 798  MDMESEG 804


>ref|XP_002527150.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223533489|gb|EEF35232.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 874

 Score = 1028 bits (2659), Expect = 0.0
 Identities = 494/783 (63%), Positives = 625/783 (79%), Gaps = 4/783 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVISINY---VTLITRILIGSGMFTEIEALHNLLLSRPC-ETHRV 2176
            P LAW LFKR  +S+P+ S +    + +I+RILI S MF E++ LH LLL+ P  E+   
Sbjct: 18   PKLAWHLFKRI-LSLPISSNHRSQSIPIISRILIRSKMFNELDDLHQLLLNSPSLESSDS 76

Query: 2175 SSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYK 1996
            S   +V +LA  G+ +KA+S F+ +R++FP   P I LYN L++S ++ N  EL+SWLYK
Sbjct: 77   SLENLVTVLAKSGFFNKAISHFKSLRSNFPEKQPSIYLYNVLLKSCIRENRVELVSWLYK 136

Query: 1995 DMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXX 1816
            DM+ A + P+ YTFNLLI  LCD G LEDAR+LFDKM  +GC PNEFTFGILVRGYCR  
Sbjct: 137  DMVLARVSPEAYTFNLLIGLLCDSGHLEDARELFDKMPARGCEPNEFTFGILVRGYCRAG 196

Query: 1815 XXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFN 1636
                    L  M+ +G+ PN V+YNTLIS FC+EGKT  AE+LV++M+EDG+VP+V TFN
Sbjct: 197  LASKGLELLGQMRTMGILPNNVLYNTLISSFCKEGKTHDAEKLVDKMREDGLVPHVETFN 256

Query: 1635 SRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIET 1456
            SRISALC +G ILEASRIF DMQ+D+E GLP PN +TY LML GFCKEGMLEE + L++T
Sbjct: 257  SRISALCGSGKILEASRIFRDMQIDEELGLPHPNVITYKLMLMGFCKEGMLEEAKTLVDT 316

Query: 1455 MKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGM 1276
            MKRN  F N++SYNIWL GL++NGKL +A  VLK+M+  GI+P+ Y++NIV+DGLCKNGM
Sbjct: 317  MKRNANFINLESYNIWLLGLIRNGKLLEAWIVLKEMLGIGIEPDIYSYNIVMDGLCKNGM 376

Query: 1275 LGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNV 1096
            L DARMLMGLM   GI PDTVTYSTLLHGYCS+GK+ EAN +LH M+ + C+PNTYTCNV
Sbjct: 377  LSDARMLMGLMIRNGILPDTVTYSTLLHGYCSKGKVFEANNLLHEMISNNCSPNTYTCNV 436

Query: 1095 LLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHG 916
            LLHSLWKEG+ISEAE LLQ+MNE+GY +DTV+CNI+I+ LC  G++DKA+EIV+ MW HG
Sbjct: 437  LLHSLWKEGRISEAENLLQKMNEKGYGVDTVTCNIIINALCNNGQLDKAIEIVNGMWTHG 496

Query: 915  SAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLY 736
            SAALG+LGNSFIGLVD+  + KKC PDL+TYSTII+GLC  GRLD+AKKKFIEMM K L 
Sbjct: 497  SAALGNLGNSFIGLVDDTISGKKCTPDLVTYSTIISGLCKAGRLDDAKKKFIEMMSKGLQ 556

Query: 735  PDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGL 556
            PDS +YD F++S C+ GK+SSAFQVLKDMEKRGC ++LQTYNSLILGLG+KNQI+E+ GL
Sbjct: 557  PDSAIYDTFIHSFCREGKISSAFQVLKDMEKRGCNKTLQTYNSLILGLGSKNQIFELYGL 616

Query: 555  MDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCR 376
            +DEMRE+G+ P+V TYN +++CLCEGGR+ +A S+++EMLQ+GISPNI SF++LI +FC+
Sbjct: 617  IDEMREKGVSPDVCTYNHMLNCLCEGGRINDAPSVLDEMLQKGISPNISSFRILIKAFCK 676

Query: 375  IGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLY 196
              +F+ + EVF++AL +CGHKE LY  +FNE+L GG+  EAKELFE  LDR FD+G+FLY
Sbjct: 677  ACDFKASHEVFEIALNVCGHKEALYTLMFNELLVGGKVAEAKELFETALDRSFDIGNFLY 736

Query: 195  KDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLN 16
            KDL+DRLCK+E+ E A D+L +++ KGY FDP SFMPVID   K GNKH A+EL+E M+ 
Sbjct: 737  KDLIDRLCKDEKLEAASDVLHRLIDKGYQFDPASFMPVIDGFGKMGNKHVADELAERMME 796

Query: 15   MAS 7
            MAS
Sbjct: 797  MAS 799



 Score =  208 bits (529), Expect = 1e-50
 Identities = 175/666 (26%), Positives = 290/666 (43%), Gaps = 81/666 (12%)
 Frame = -3

Query: 2271 LITRILIGSGMFTEIEALHNLLLSRPCETHRVSSYAVVKILANCGYVDKALSQFQDVRAH 2092
            L+  +L  SG   +   L + + +R CE +  +   +V+     G   K L     +R  
Sbjct: 152  LLIGLLCDSGHLEDARELFDKMPARGCEPNEFTFGILVRGYCRAGLASKGLELLGQMRTM 211

Query: 2091 FPSHPPPISLYNFLIESSMKGNNHELISWLYKDMIFAGLPPQTYTFNLLICALCDLGRLE 1912
                 P   LYN LI S  K         L   M   GL P   TFN  I ALC  G++ 
Sbjct: 212  --GILPNNVLYNTLISSFCKEGKTHDAEKLVDKMREDGLVPHVETFNSRISALCGSGKIL 269

Query: 1911 DARKLFDKMRDKGCM----PNEFTFGILVRGYCRXXXXXXXXXXLDVMKEIGVFPNLVIY 1744
            +A ++F  M+    +    PN  T+ +++ G+C+          +D MK    F NL  Y
Sbjct: 270  EASRIFRDMQIDEELGLPHPNVITYKLMLMGFCKEGMLEEAKTLVDTMKRNANFINLESY 329

Query: 1743 NTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRISALCEAGSILEASRIFSDMQM 1564
            N  +    R GK   A  ++++M   GI P++ ++N  +  LC+ G + +A  +   M  
Sbjct: 330  NIWLLGLIRNGKLLEAWIVLKEMLGIGIEPDIYSYNIVMDGLCKNGMLSDARMLMGLMIR 389

Query: 1563 DQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKRNGVFSNVDSYNIWLHGLVKNG 1384
            +       P+TVTY+ +L G+C +G + E   L+  M  N    N  + N+ LH L K G
Sbjct: 390  NGIL----PDTVTYSTLLHGYCSKGKVFEANNLLHEMISNNCSPNTYTCNVLLHSLWKEG 445

Query: 1383 KLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGDARMLMGLMTTAG--------- 1231
            ++ +A+++L+ M ++G   +  T NI+I+ LC NG L  A  ++  M T G         
Sbjct: 446  RISEAENLLQKMNEKGYGVDTVTCNIIINALCNNGQLDKAIEIVNGMWTHGSAALGNLGN 505

Query: 1230 --------------ITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVL 1093
                           TPD VTYST++ G C  G++ +A K    MM  G  P++   +  
Sbjct: 506  SFIGLVDDTISGKKCTPDLVTYSTIISGLCKAGRLDDAKKKFIEMMSKGLQPDSAIYDTF 565

Query: 1092 LHSLWKEGKISEA-------EK----------------------------LLQRMNERGY 1018
            +HS  +EGKIS A       EK                            L+  M E+G 
Sbjct: 566  IHSFCREGKISSAFQVLKDMEKRGCNKTLQTYNSLILGLGSKNQIFELYGLIDEMREKGV 625

Query: 1017 DLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSA----ALGSLGNSFIGLVDEENNKK 850
              D  + N +++ LC+ G+++ A  ++ EM   G +    +   L  +F    D + + +
Sbjct: 626  SPDVCTYNHMLNCLCEGGRINDAPSVLDEMLQKGISPNISSFRILIKAFCKACDFKASHE 685

Query: 849  K-------CMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDSIVYDIFLYSLCK 691
                    C      Y+ + N L + G++ EAK+ F   + ++    + +Y   +  LCK
Sbjct: 686  VFEIALNVCGHKEALYTLMFNELLVGGKVAEAKELFETALDRSFDIGNFLYKDLIDRLCK 745

Query: 690  RGKVSSAFQVLKDMEKRGCKRSLQTYNSLILG---LGNKNQIYEMCGLMDEM-----RER 535
              K+ +A  VL  +  +G +    ++  +I G   +GNK+   E+   M EM     +E 
Sbjct: 746  DEKLEAASDVLHRLIDKGYQFDPASFMPVIDGFGKMGNKHVADELAERMMEMASESNKEN 805

Query: 534  GIPPNV 517
               PNV
Sbjct: 806  KAYPNV 811


>ref|XP_006350361.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g17140-like [Solanum tuberosum]
          Length = 849

 Score = 1026 bits (2652), Expect = 0.0
 Identities = 504/778 (64%), Positives = 608/778 (78%), Gaps = 1/778 (0%)
 Frame = -3

Query: 2337 LAWQLFKRSSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLS-RPCETHRVSSYAV 2161
            +AWQL +R+ +S P    + +  +TRIL+   M  +I  LH+ LLS  P      S Y +
Sbjct: 5    VAWQLLRRT-LSTPNPPFHSILPLTRILLRYQMLPQIHTLHSFLLSLSPHHISLPSQYTI 63

Query: 2160 VKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKDMIFA 1981
            VK+LA+ GY+  A+S F+  R+H P  PP +SLYNFLI  S K N    I WLY+DMI A
Sbjct: 64   VKLLASHGYIHDAISLFRSTRSHHP--PPKLSLYNFLIHKSFKFNFPNFIFWLYQDMISA 121

Query: 1980 GLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXXXXXX 1801
             + P TYTFNLLI  LC   RL DAR+LFD M  KGC PNEFTFGIL+RGYC+       
Sbjct: 122  SVSPVTYTFNLLIHGLCHSDRLGDARQLFDVMPHKGCHPNEFTFGILIRGYCKFGLSLQG 181

Query: 1800 XXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRISA 1621
               LD MK + V PN++IYNTLI+ FCR+G  + AERLVE+M+EDG++P+VVTFNSRISA
Sbjct: 182  LKLLDTMKMMNVRPNVIIYNTLIASFCRKGDVDEAERLVERMREDGLLPDVVTFNSRISA 241

Query: 1620 LCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKRNG 1441
            LC +G ILEASRIF DMQ+D+ F LP+PN VT+NLML+GFC++GMLEE + L E+MK++ 
Sbjct: 242  LCNSGKILEASRIFRDMQIDEVFELPRPNVVTFNLMLQGFCQKGMLEEARTLTESMKKDD 301

Query: 1440 VFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGDAR 1261
            +F NV SYNIWL GLV+NGKL +AQ+VLK++   G+ P  Y++NI+IDGLCKNGMLGDA+
Sbjct: 302  IFFNVQSYNIWLCGLVRNGKLLEAQTVLKELPQNGVDPTIYSYNILIDGLCKNGMLGDAK 361

Query: 1260 MLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLLHSL 1081
            MLM LM   GI PDTVTYSTLLHGYC++ K+ EA  +L  MM  GC PN YTCN LLHS+
Sbjct: 362  MLMSLMINDGIFPDTVTYSTLLHGYCTKSKVTEAKNILREMMKRGCIPNKYTCNTLLHSM 421

Query: 1080 WKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAALG 901
            WKEGK+SEA++LLQ+MNERGY LDTVSCNIVI GLC+TG+VDKAVEIVSEMW+HGS ALG
Sbjct: 422  WKEGKVSEAQQLLQKMNERGYGLDTVSCNIVIHGLCQTGEVDKAVEIVSEMWSHGSVALG 481

Query: 900  SLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDSIV 721
             LGNSF+ LV+E++N +KC+PDLITYSTIIN L   G+LDEAKKKF+EMM+K LYPDSI+
Sbjct: 482  DLGNSFMSLVNEDDNGRKCLPDLITYSTIINSLFREGKLDEAKKKFVEMMRKKLYPDSII 541

Query: 720  YDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMDEMR 541
            Y+  L+ LCKRGKVSSAFQVLKDME + CK+SL+TYNSLILGLGNKNQI+EMCGLMDEMR
Sbjct: 542  YNTILHHLCKRGKVSSAFQVLKDMETKDCKKSLRTYNSLILGLGNKNQIFEMCGLMDEMR 601

Query: 540  ERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRIGEFR 361
            E+GI PNV+TYN +I CLC+ GR EEA  L+NEMLQ+GI PN+ +F+LLI S+CR GEFR
Sbjct: 602  EKGISPNVYTYNIMIGCLCKSGRTEEAIPLLNEMLQKGIIPNMNTFELLIKSYCRTGEFR 661

Query: 360  PAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYKDLVD 181
            PAQEVFD+A  ICGH E LYA +FNE LAGGE +EAK+  E  +D+ FDLGSFLYKDL+D
Sbjct: 662  PAQEVFDIASSICGHSEALYALMFNEFLAGGEIMEAKQFLETAIDKHFDLGSFLYKDLID 721

Query: 180  RLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNMAS 7
            +LCK E  E AHDIL KMM  GYGFDP SFMPVID L K G KH A+ELSE ML M S
Sbjct: 722  KLCKVENLEGAHDILIKMMHIGYGFDPASFMPVIDGLNKLGQKHVADELSERMLEMVS 779


>ref|XP_004290060.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Fragaria vesca subsp. vesca]
          Length = 871

 Score = 1016 bits (2627), Expect = 0.0
 Identities = 492/788 (62%), Positives = 628/788 (79%), Gaps = 7/788 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR-----SSVSIPVISINY-VTLITRILIGSGMFTEIEALHNLLL-SRPCET 2185
            P LAW LFKR     +S S    S +  + +I RILI S M  EI+ LH LLL S+P +T
Sbjct: 17   PKLAWHLFKRILSSPTSTSTSTSSSHLSLPIIARILISSKMHPEIDTLHRLLLHSQPPQT 76

Query: 2184 HRVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISW 2005
             R S  ++V+I A      KALS F+ +R+ FP+ PPP+ LYN L+ES+++ N+ + + W
Sbjct: 77   LRPSLLSLVRIFAKSNLPYKALSHFKSLRSRFPNDPPPVYLYNLLLESTLRNNDVDYVLW 136

Query: 2004 LYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYC 1825
            LYKDMI +G+ PQTYTFNLLICALCD  RLEDAR++FDKMRDKGC+PNE++  ILVRGYC
Sbjct: 137  LYKDMIASGVRPQTYTFNLLICALCDCSRLEDARQVFDKMRDKGCVPNEYSVAILVRGYC 196

Query: 1824 RXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVV 1645
            R          LD M+  GV  N V+YNTL+S FC+EG+T+ AE+LVE+M+E+G+ P+V+
Sbjct: 197  RAGFGSDALDVLDEMRGCGVGVNRVVYNTLVSSFCKEGRTDEAEKLVERMREEGMFPDVI 256

Query: 1644 TFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKL 1465
            TFNSRISALC AG ILEASRIF DM MDQ  GLP+PN VTYNLML+GFCKEGMLEE + L
Sbjct: 257  TFNSRISALCSAGKILEASRIFRDMHMDQALGLPQPNVVTYNLMLQGFCKEGMLEEAESL 316

Query: 1464 IETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCK 1285
             ++M++ G   N++SYNIWL GLV+N KL +A+ VLK+MV +GI+ N Y++NI+I+GLCK
Sbjct: 317  FKSMEKAGGLINLESYNIWLLGLVRNKKLLEARLVLKEMVHKGIELNIYSYNILINGLCK 376

Query: 1284 NGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYT 1105
            NGML DARM+M LM    I+PDTVTYSTLLHGYC++GK+ EA+ VL  M+M  C PNT+T
Sbjct: 377  NGMLRDARMVMDLMARNNISPDTVTYSTLLHGYCNKGKVFEASNVLQEMIMKNCFPNTHT 436

Query: 1104 CNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMW 925
            CN+LLHSLWKEG+ISEAE+LLQ+MNE+GY LDTV+CNIVIDGLC  GK+DKA+EIVS MW
Sbjct: 437  CNILLHSLWKEGRISEAEELLQKMNEKGYGLDTVTCNIVIDGLCNDGKLDKAIEIVSGMW 496

Query: 924  NHGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQK 745
             HG AALG+LGNSF+GLVD+ NN  KC+PDLI+YSTII+GLC  GRLDEAKKKF+EMM +
Sbjct: 497  THGGAALGNLGNSFVGLVDDCNNGNKCLPDLISYSTIISGLCKAGRLDEAKKKFMEMMGR 556

Query: 744  NLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEM 565
            NL+PDS++YD F+ + CK GK+SSAF+VLKDMEK+GC +S+QTYNSLILG+G+K QI+E+
Sbjct: 557  NLHPDSVIYDTFIRTFCKEGKISSAFRVLKDMEKKGCNKSIQTYNSLILGIGSKKQIFEI 616

Query: 564  CGLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIIS 385
             GLMDEM+ERG+PP+V TYN +++CLCE  R+++A SL++EMLQ+GISPNI +F++LI +
Sbjct: 617  YGLMDEMKERGVPPDVCTYNNMMTCLCEVERVKDATSLLDEMLQKGISPNISTFRILIKA 676

Query: 384  FCRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGS 205
            FC+  +F  AQEVFD+AL +CGHKE LY+ +FNE+LAGGE  +A ELF+  LD+ F LG+
Sbjct: 677  FCKGFDFAVAQEVFDIALSVCGHKEALYSMMFNELLAGGEVAKATELFKEALDKYFYLGN 736

Query: 204  FLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSES 25
            FLYKDL+DRLC +++ ++A  IL  MM KGYGFD  SF+PVID L K GNKHEA+EL+E 
Sbjct: 737  FLYKDLMDRLCMDQKLDDACSILHNMMNKGYGFDSASFLPVIDGLGKKGNKHEADELAER 796

Query: 24   MLNMASGG 1
            M+ MAS G
Sbjct: 797  MMEMASEG 804


>ref|XP_006384843.1| hypothetical protein POPTR_0004s21560g [Populus trichocarpa]
            gi|550341611|gb|ERP62640.1| hypothetical protein
            POPTR_0004s21560g [Populus trichocarpa]
          Length = 874

 Score = 1002 bits (2590), Expect = 0.0
 Identities = 482/784 (61%), Positives = 619/784 (78%), Gaps = 3/784 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVISI--NYVTLITRILIGSGMFTEIEALHNLLL-SRPCETHRVS 2173
            P ++W LFKR  +S+PV       + +ITRILI + M  E++ L  LL+ S+P ET   S
Sbjct: 19   PKISWYLFKRI-LSLPVTQQCPQSIPIITRILIRAKMLNELDDLPQLLIASQPQETLHTS 77

Query: 2172 SYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKD 1993
              + + +LA  G   KA+SQF+ +R  FP +PP I LYN L+ S  K    + +SWL KD
Sbjct: 78   LVSFITVLAKSGLFGKAISQFKSLRFRFPENPPSIYLYNVLLRSCTKEGRVDCVSWLCKD 137

Query: 1992 MIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXX 1813
            M+ +G+ P+TYTFN+LI  LCD G L+DAR+LFDKM +KGC PNE++FGILVRGYCR   
Sbjct: 138  MVASGVSPETYTFNVLIGLLCDSGCLDDARELFDKMPEKGCEPNEYSFGILVRGYCRAGF 197

Query: 1812 XXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNS 1633
                   L  M+ +G  PN V+YNTLIS FC+EGKT+ AE+LV++M++DG+ P+VVTFN+
Sbjct: 198  TSKGLELLGEMRRLGFSPNKVVYNTLISSFCKEGKTDDAEKLVDEMRKDGLSPDVVTFNA 257

Query: 1632 RISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETM 1453
            RISALC +G +LEASRIF DMQ+D+  GLP+PN +TYNLML GFCKEGMLEE + L E M
Sbjct: 258  RISALCSSGKVLEASRIFRDMQIDEVLGLPQPNIITYNLMLGGFCKEGMLEEARALFEKM 317

Query: 1452 KRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGML 1273
            K +    N +SYNIWL GLV+ GKL +AQ VLK+MVD G++PN Y++NIV+DGLCKNG+L
Sbjct: 318  KVSENLMNRESYNIWLLGLVRIGKLLEAQLVLKEMVDMGMEPNVYSYNIVMDGLCKNGVL 377

Query: 1272 GDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVL 1093
             DARMLM LMT++G+ PDTVTY+TLLHGYC  GK+ EAN VL  MM  GC+PN YTCN+L
Sbjct: 378  FDARMLMRLMTSSGVLPDTVTYTTLLHGYCHTGKVSEANNVLREMMRDGCSPNNYTCNIL 437

Query: 1092 LHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGS 913
            L+SLWKEG+ISEAE+LLQ+MNE+GY +DTV+CNIVIDGLC  GK+DKA+EIV+ MW HGS
Sbjct: 438  LYSLWKEGRISEAEELLQKMNEKGYVIDTVTCNIVIDGLCNNGKLDKAIEIVNGMWTHGS 497

Query: 912  AALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYP 733
            AALG+LGNS+IGLVD+ +++KKCMPDLI+YSTII+GLC  GR+ EAKKKFIEMM KNL P
Sbjct: 498  AALGNLGNSYIGLVDDSDSRKKCMPDLISYSTIISGLCKAGRVGEAKKKFIEMMGKNLQP 557

Query: 732  DSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLM 553
            DS +YD+F++S CK GK+SSAF+VLKDMEK+GC ++LQTYNSLI+GLG+KNQI+E+ GL+
Sbjct: 558  DSAIYDVFIHSFCKEGKISSAFRVLKDMEKKGCNKTLQTYNSLIMGLGSKNQIFEIYGLI 617

Query: 552  DEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRI 373
            DEMRERG+ P+V  YN ++S LCEGGR+++A S+++EMLQ+GISPNI SF +LI +FC+ 
Sbjct: 618  DEMRERGVSPDVSIYNNVLSSLCEGGRVKDAPSVLDEMLQKGISPNISSFSILIKAFCKA 677

Query: 372  GEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYK 193
             +F    E+F++AL +CGHKE LY+  FNE+L GGE ++AKELFE  LDR FD+G+FLYK
Sbjct: 678  CDFSAVDEIFEIALNVCGHKEALYSLTFNELLVGGEVVKAKELFETALDRSFDVGNFLYK 737

Query: 192  DLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNM 13
            DL+D LCK+E+ ++A  IL K++ KGY FDP SFMPVID L K GNKHEA+EL+E M+ M
Sbjct: 738  DLIDHLCKDEKLDDASGILHKLIDKGYWFDPASFMPVIDGLGKRGNKHEADELAEKMMEM 797

Query: 12   ASGG 1
            AS G
Sbjct: 798  ASEG 801


>ref|XP_006422433.1| hypothetical protein CICLE_v10027787mg [Citrus clementina]
            gi|557524367|gb|ESR35673.1| hypothetical protein
            CICLE_v10027787mg [Citrus clementina]
          Length = 889

 Score = 1001 bits (2588), Expect = 0.0
 Identities = 483/784 (61%), Positives = 616/784 (78%), Gaps = 3/784 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVIS--INYVTLITRILIGSGMFTEIEALHNLLLS-RPCETHRVS 2173
            P LAWQ+F+R  ++ P  +  ++ V  I RILI S M  EI  LH+LLLS +P      S
Sbjct: 22   PKLAWQIFRRVILNSPTTNPPLDSVPTIARILIRSKMLEEIHTLHSLLLSLKPRGNSHSS 81

Query: 2172 SYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKD 1993
              ++VKILA  G +D+A SQF+ +R HF   PP I LYN L E  ++  + + + WLYKD
Sbjct: 82   LTSLVKILAKSGLLDEAFSQFKSIRVHFHGDPPCIYLYNVLFECCVRKQHMDYLMWLYKD 141

Query: 1992 MIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXX 1813
            M+ A + P+TYTFNLLI ALCD GRLEDARKLFDKM DKGC PNEF+F ILVRGYCR   
Sbjct: 142  MVVAKVSPETYTFNLLIRALCDSGRLEDARKLFDKMSDKGCRPNEFSFAILVRGYCRAGL 201

Query: 1812 XXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNS 1633
                   +++M+ +G  PN V+YNTLIS FCR+GKTE AE++VE+M+EDG+ P+VVTFNS
Sbjct: 202  ADEGLELMELMRSLGFSPNRVVYNTLISSFCRDGKTEDAEKMVERMREDGMFPDVVTFNS 261

Query: 1632 RISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETM 1453
            RISALC  G ILEASRIF DMQ+D++ GLP+PN +TYNLMLEGFCKEGMLEE + L++ +
Sbjct: 262  RISALCRTGKILEASRIFRDMQLDEDLGLPRPNIITYNLMLEGFCKEGMLEEAKTLVDAV 321

Query: 1452 KRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGML 1273
             +NG   N++SYNIWL GLV+NGKL +AQ+VL++MV++G++P+ +++NI+IDGLCKN ML
Sbjct: 322  TKNGGSLNLESYNIWLMGLVRNGKLVEAQAVLQEMVEKGVEPSIHSYNILIDGLCKNRML 381

Query: 1272 GDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVL 1093
             DARM+M LM   GI+PDT+TYSTLLHGYCS GK+ EAN VL+ M+ +GC+P  +TCN+L
Sbjct: 382  SDARMVMDLMVDRGISPDTITYSTLLHGYCSNGKVFEANNVLYEMIRNGCSPTIFTCNIL 441

Query: 1092 LHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGS 913
            LHSL+KEG+ISEAE LLQ+MNERGYDLDTV+CNI+I GLC  GK+DKA+EIV+EMW  GS
Sbjct: 442  LHSLYKEGRISEAEDLLQKMNERGYDLDTVTCNIIIHGLCNCGKLDKAIEIVNEMWTKGS 501

Query: 912  AALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYP 733
            AALG+LGN F+   D+ N+KKKC+PDLITYST+I+ LC  G+LDEAKKKF EMM KN+ P
Sbjct: 502  AALGNLGNFFVVPADDTNSKKKCLPDLITYSTVISALCKAGKLDEAKKKFTEMMGKNVRP 561

Query: 732  DSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLM 553
            DS VYD F++S CK GK+SSAF+VLKDMEK GC ++LQTYNSLILGLG+K QI+E+ GL+
Sbjct: 562  DSFVYDNFIHSYCKEGKLSSAFRVLKDMEKNGCSKTLQTYNSLILGLGSKAQIFEIHGLL 621

Query: 552  DEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRI 373
            +EMRERG+ PNV TYN I+ CLCEG ++E+A S ++EM Q+GI  N+ SF++L+  FC+ 
Sbjct: 622  NEMRERGVSPNVCTYNNILKCLCEGSKIEDAISCLDEMQQRGIL-NVSSFRMLVRVFCKA 680

Query: 372  GEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYK 193
            GEFR AQE F  AL +CGHKE +Y+ +FNE+L  GE  EAKE+FEA L++ +DLG+FLYK
Sbjct: 681  GEFRVAQEAFKTALSMCGHKEGIYSLMFNELLLAGEVSEAKEIFEAALEKSYDLGNFLYK 740

Query: 192  DLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNM 13
             L+D LCK+E+ EEA  I+ KM+ KGYGFDP SFMPVIDAL + G KHEA+EL+E M+ M
Sbjct: 741  VLIDGLCKDEKLEEATGIIYKMIDKGYGFDPASFMPVIDALGERGKKHEADELAEKMMEM 800

Query: 12   ASGG 1
             S G
Sbjct: 801  TSDG 804


>ref|XP_004231526.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Solanum lycopersicum]
          Length = 837

 Score =  995 bits (2573), Expect = 0.0
 Identities = 485/779 (62%), Positives = 602/779 (77%)
 Frame = -3

Query: 2337 LAWQLFKRSSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRVSSYAVV 2158
            +AWQL +R+ +S P   ++ +  +TRIL+   M  +I +LH+ LLS     H    Y +V
Sbjct: 9    VAWQLLRRT-LSTPNPPLHSILPLTRILLRYHMLPQIHSLHSFLLS--LSPHLTCQYTIV 65

Query: 2157 KILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKDMIFAG 1978
            K+LA+ G++  A+  F+  R+H P  PP +SLYNFLI  S K N    ISWLY+DMI A 
Sbjct: 66   KLLASHGHIHDAIFLFRSTRSHHP--PPRLSLYNFLIYKSFKFNYSNFISWLYQDMISAS 123

Query: 1977 LPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXXXXXXX 1798
            + P TYTFNLLI  LC+  RL DAR LFD M  KGC PN FTFGIL+R YC+        
Sbjct: 124  VSPVTYTFNLLIHGLCNSDRLRDARHLFDLMPHKGCHPNHFTFGILIRAYCKFGLSLQGL 183

Query: 1797 XXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRISAL 1618
              LD MK + V PN++IYNTL++ FCR+G  + AERLV++M++DG++P+VVTFNSRISAL
Sbjct: 184  KLLDTMKMMNVCPNIIIYNTLVASFCRKGDVDEAERLVQRMRDDGLLPDVVTFNSRISAL 243

Query: 1617 CEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKRNGV 1438
            C +G ILEASRIF DMQ+D+ FGLP+PN VT+NLML+GFC++GMLEE + L E+MK++ +
Sbjct: 244  CNSGKILEASRIFRDMQIDEVFGLPRPNIVTFNLMLQGFCQKGMLEEARTLTESMKKDDI 303

Query: 1437 FSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGDARM 1258
            F NV SYNIWL GLV+NGKL +AQ+VLK+M   G+ P  Y++NI+I GLCK+GMLGDA+M
Sbjct: 304  FFNVQSYNIWLCGLVRNGKLLEAQTVLKEMPQNGVDPTIYSYNILIHGLCKHGMLGDAKM 363

Query: 1257 LMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLLHSLW 1078
            LM LM   GI PDTVTYSTLLHGYC++ ++ EA  +L  MM  GC PN YTCN LLHS+W
Sbjct: 364  LMSLMINDGIFPDTVTYSTLLHGYCTKSEVTEAKNILREMMKRGCIPNKYTCNTLLHSMW 423

Query: 1077 KEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAALGS 898
            KEGK+SEA++LLQ+MNERGY LDTVSCNIVI GLC+ G+VDKAVEIVSEMW+HGS ALG 
Sbjct: 424  KEGKVSEAQQLLQKMNERGYGLDTVSCNIVIHGLCQIGEVDKAVEIVSEMWSHGSIALGD 483

Query: 897  LGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDSIVY 718
             GNS + LV+E+++ +KC+PDLITYS IIN L   G+LDEAKKKF+EMM+K LYPDS++Y
Sbjct: 484  FGNSLMSLVNEDDHGRKCLPDLITYSIIINSLFREGKLDEAKKKFVEMMRKKLYPDSVIY 543

Query: 717  DIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMDEMRE 538
            +  L+ LCKRGK+SSAFQVLKDME + CK+SL+TYNSLILGLG+KNQI+EMCGLMDEMRE
Sbjct: 544  NTILHHLCKRGKISSAFQVLKDMETKDCKKSLRTYNSLILGLGDKNQIFEMCGLMDEMRE 603

Query: 537  RGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRIGEFRP 358
            +GI P+V+TYN +I CLC+ GR E+A  L+NEMLQ+GI PN  +F+LLI S+CR GEFRP
Sbjct: 604  KGISPSVYTYNIMIGCLCKSGRTEKAIPLLNEMLQKGIIPNTNTFELLIKSYCRTGEFRP 663

Query: 357  AQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYKDLVDR 178
            AQEVFD+A  ICGH E LYA +FNE LAG E +EAK+  E  +D+ FDLGSFLYKDL+D+
Sbjct: 664  AQEVFDIASTICGHTEALYALMFNEFLAGDEIVEAKQFLETAIDKHFDLGSFLYKDLIDK 723

Query: 177  LCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNMASGG 1
            LCK E  E AHDIL KMM  GYGFDP SFMPVID L K G KH A+EL+E ML M S G
Sbjct: 724  LCKVENLEGAHDILIKMMHIGYGFDPASFMPVIDGLIKLGQKHVADELTERMLEMVSEG 782


>ref|XP_006486600.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Citrus sinensis] gi|568866524|ref|XP_006486605.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g17140-like [Citrus sinensis]
          Length = 889

 Score =  995 bits (2572), Expect = 0.0
 Identities = 480/784 (61%), Positives = 613/784 (78%), Gaps = 3/784 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVIS--INYVTLITRILIGSGMFTEIEALHNLLLS-RPCETHRVS 2173
            P LAWQ+F+R  ++ P  +  ++ V  I RILI S M  EI  LH+LLLS +P      S
Sbjct: 22   PKLAWQIFRRVILNSPTTNPPLDSVPTIARILIRSKMLEEIHTLHSLLLSLKPRGNSHSS 81

Query: 2172 SYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKD 1993
              ++VKILA  G +D+A SQF+ +R HF   PP I LYN L E  ++  + + + WLYKD
Sbjct: 82   LTSLVKILAKSGLLDEAFSQFKSIRVHFHGDPPCIYLYNVLFECCVRKQHMDYLMWLYKD 141

Query: 1992 MIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXX 1813
            M+ A + P+TYTFNLLI ALCD GRLEDARKLFDKM DKGC PNEF+F ILVRGYCR   
Sbjct: 142  MVVAKVSPETYTFNLLIRALCDSGRLEDARKLFDKMSDKGCRPNEFSFAILVRGYCRAGL 201

Query: 1812 XXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNS 1633
                   +++M+ +G  PN V+YNTLIS FCR+GKT+ AE++VE+M+EDG+ P+VVTFNS
Sbjct: 202  ADEGLELMELMRSLGFSPNRVVYNTLISSFCRDGKTDDAEKMVERMREDGMFPDVVTFNS 261

Query: 1632 RISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETM 1453
            RISALC  G ILEASRIF DMQ+D++ GLP+PN +TYNLMLEGFCKEGMLEE + L++ +
Sbjct: 262  RISALCRTGKILEASRIFRDMQLDEDLGLPRPNIITYNLMLEGFCKEGMLEEAKTLVDAV 321

Query: 1452 KRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGML 1273
             +NG   N++SYNIWL GLV+NGKL +AQ+VL++MV++G++P+ +++NI+IDGLCKN ML
Sbjct: 322  TKNGGSLNLESYNIWLMGLVRNGKLVEAQAVLQEMVEKGVEPSIHSYNILIDGLCKNRML 381

Query: 1272 GDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVL 1093
             DARM+M LM   GI+PDT+TYSTLLHGYCS GK+ EAN VL+ M+ +GC+P  +TCN+L
Sbjct: 382  SDARMVMDLMVDRGISPDTITYSTLLHGYCSNGKVFEANNVLYEMIRNGCSPTIFTCNIL 441

Query: 1092 LHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGS 913
            LHSL+KEG+ISEAE LLQ+MNERGY LDTV+CNI+I GLC  GK+DKA+EIV+EMW  GS
Sbjct: 442  LHSLYKEGRISEAEDLLQKMNERGYGLDTVTCNIIIHGLCNCGKLDKAIEIVNEMWTKGS 501

Query: 912  AALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYP 733
            AALG+LGN F+   D+ N+KKKC PDLITYST+I+ LC  G+LDEAKKKF EMM KN+ P
Sbjct: 502  AALGNLGNFFVVPADDTNSKKKCFPDLITYSTVISALCKAGKLDEAKKKFTEMMGKNVRP 561

Query: 732  DSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLM 553
            DS VYD F++S CK GK+SSAF+VLKDMEK GC ++LQTYNSLILG G+K QI+E+ GL+
Sbjct: 562  DSFVYDNFIHSYCKEGKLSSAFRVLKDMEKNGCSKTLQTYNSLILGFGSKAQIFEIHGLL 621

Query: 552  DEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRI 373
            +EMRERG+ PNV TYN I+ CLCEG ++E+A S ++EM Q+GI  N+ SF++L+  FC+ 
Sbjct: 622  NEMRERGVSPNVCTYNNILKCLCEGSKIEDAISCLDEMQQRGIL-NVSSFRMLVRVFCKA 680

Query: 372  GEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYK 193
            GEFR AQE F  AL +CGHKE +Y+ +FNE+L  GE  EAKE+FEA L++ +DLG+FLYK
Sbjct: 681  GEFRVAQEAFKTALSMCGHKEGIYSLMFNELLLAGEVSEAKEIFEAALEKSYDLGNFLYK 740

Query: 192  DLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNM 13
             L+D LCK+E+ EEA  I+ KM+ KGYGFDP SFMPVIDAL + G KHEA+EL+E M+ M
Sbjct: 741  VLIDGLCKDEKLEEATGIIYKMIDKGYGFDPASFMPVIDALGERGKKHEADELAEKMMEM 800

Query: 12   ASGG 1
             S G
Sbjct: 801  TSDG 804


>gb|EXC31542.1| hypothetical protein L484_006574 [Morus notabilis]
          Length = 864

 Score =  976 bits (2524), Expect = 0.0
 Identities = 475/782 (60%), Positives = 608/782 (77%), Gaps = 1/782 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLS-RPCETHRVSSY 2167
            P+LAW LFKRS   + + S   V ++ RILI + M  +I+ALH LLLS  P ET      
Sbjct: 18   PSLAWLLFKRSPSHLRLRS---VPVVARILIAAKMRRQIDALHQLLLSSEPPETAHACLL 74

Query: 2166 AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKDMI 1987
            ++V++LAN G+   A+S F+ +R+ FP  P    LYN L +SS+   + +  SWLYKDMI
Sbjct: 75   SLVRMLANSGFSGMAVSHFKALRSRFPDKPYSAFLYNSLFKSSLAEKSVDSFSWLYKDMI 134

Query: 1986 FAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXXXX 1807
             +G+ P+TYTFNLLI ALC+ G LE+AR++FDKM +KGC PNE++ GILVRGYCR     
Sbjct: 135  VSGVVPETYTFNLLISALCESGHLENAREMFDKMSEKGCRPNEYSVGILVRGYCRAGLVD 194

Query: 1806 XXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRI 1627
                  +   ++ + PN ++YNTLIS FCR G+T+ AE+LVE+M+++ + P+VVTFNSRI
Sbjct: 195  EALEFFNKTSDV-LPPNRIVYNTLISSFCRAGRTDEAEKLVERMRDNNMTPDVVTFNSRI 253

Query: 1626 SALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKR 1447
            SALC++G +LEA RIF DMQ+DQE GLP+PN +TYNLMLEGFCKEGM EE + L ETMKR
Sbjct: 254  SALCKSGQVLEACRIFRDMQIDQELGLPRPNIITYNLMLEGFCKEGMFEEARSLFETMKR 313

Query: 1446 NGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGD 1267
            N  F N++S+NIWL  L+ +GKL +A+ +LK+MVD+G  P+ +T+NI++DGLCKNGM  D
Sbjct: 314  NSEFVNLESFNIWLRALIMSGKLLEARLLLKEMVDKGTGPSIFTYNIMMDGLCKNGMFSD 373

Query: 1266 ARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLLH 1087
            ARM+MGLM ++GI+PDTVTY+TLLHG+C++G++ EA KVL  MMM+ C PNT TCN+LLH
Sbjct: 374  ARMVMGLMISSGISPDTVTYTTLLHGHCNKGRVFEAKKVLQEMMMNNCHPNTRTCNILLH 433

Query: 1086 SLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAA 907
            SLWKEGK SEAE+LLQ+M ERGY +D V+CNIVIDGLC  GK+DKA+EIVSEMW HGSAA
Sbjct: 434  SLWKEGKTSEAEELLQKMYERGYGIDIVTCNIVIDGLCNIGKMDKAIEIVSEMWTHGSAA 493

Query: 906  LGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDS 727
            LG LGNSFIGLVD+ NN + C PDLITYSTII+ LC  G+LDEAKKKF EMM K L PDS
Sbjct: 494  LGHLGNSFIGLVDDNNNGRNCRPDLITYSTIISALCKVGKLDEAKKKFAEMMGKRLCPDS 553

Query: 726  IVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMDE 547
            ++YD F+ S CK+GK+S AF+VLKDMEK+GC +SLQTYNSLILG+G+KNQI+E+ GL+DE
Sbjct: 554  VIYDTFIRSYCKQGKISLAFRVLKDMEKKGCNKSLQTYNSLILGVGSKNQIFEIYGLLDE 613

Query: 546  MRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRIGE 367
            MRERG+  +V TYN +ISCLCE GR++EA SL++EM+Q+ ISPNI SF  LI +FC+  E
Sbjct: 614  MRERGVSADVCTYNNVISCLCEEGRIKEATSLLDEMVQKDISPNIGSFGKLIKAFCKACE 673

Query: 366  FRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYKDL 187
            F   QEVF +AL ICGH+E LY+ +FNE+ AGGE  +AKE+F A LDR   +G+FLYKDL
Sbjct: 674  FEDMQEVFAIALNICGHREALYSLMFNELFAGGEFSKAKEIFIAALDRNLYVGNFLYKDL 733

Query: 186  VDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNMAS 7
            +D+LCK+E+ +EA  I+  M+ KGYGFDP SFMPVID L K GNKHEA  L+E M+ M+S
Sbjct: 734  IDKLCKDEKLDEASSIIYYMLGKGYGFDPASFMPVIDGLGKKGNKHEAELLAERMMEMSS 793

Query: 6    GG 1
             G
Sbjct: 794  EG 795


>ref|XP_002328265.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  965 bits (2494), Expect = 0.0
 Identities = 454/709 (64%), Positives = 577/709 (81%)
 Frame = -3

Query: 2127 KALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKDMIFAGLPPQTYTFNL 1948
            KA+SQF+ +R  FP +PP I LYN L+ S  K    + +SWL KDM+ +G+ P+TYTFN+
Sbjct: 2    KAISQFKSLRFRFPENPPSIYLYNVLLRSCTKEGRVDCVSWLCKDMVASGVSPETYTFNV 61

Query: 1947 LICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXXXXXXXXXLDVMKEIG 1768
            LI  LCD G L+DAR+LFDKM +KGC PNE++FGILVRGYCR          L  M+ +G
Sbjct: 62   LIGLLCDSGCLDDARELFDKMPEKGCEPNEYSFGILVRGYCRAGFTSKGLELLGEMRRLG 121

Query: 1767 VFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRISALCEAGSILEAS 1588
              PN V+YNTLIS FC+EGKT+ AE+LV++M++DG+ P+VVTFN+RISALC +G +LEAS
Sbjct: 122  FSPNKVVYNTLISSFCKEGKTDDAEKLVDEMRKDGLSPDVVTFNARISALCSSGKVLEAS 181

Query: 1587 RIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKRNGVFSNVDSYNIW 1408
            RIF DMQ+D+  GLP+PN +TYNLML GFCKEGMLEE + L E MK +    N +SYNIW
Sbjct: 182  RIFRDMQIDEVLGLPQPNIITYNLMLGGFCKEGMLEEARALFEKMKVSENLMNRESYNIW 241

Query: 1407 LHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGDARMLMGLMTTAGI 1228
            L GLV+ GKL +AQ VLK+MVD G++PN Y++NIV+DGLCKNG+L DARMLM LMT++G+
Sbjct: 242  LLGLVRIGKLLEAQLVLKEMVDMGMEPNVYSYNIVMDGLCKNGVLFDARMLMRLMTSSGV 301

Query: 1227 TPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLLHSLWKEGKISEAEK 1048
             PDTVTY+TLLHGYC  GK+ EAN VL  MM  GC+PN YTCN+LL+SLWKEG+ISEAE+
Sbjct: 302  LPDTVTYTTLLHGYCHTGKVSEANNVLREMMRDGCSPNNYTCNILLYSLWKEGRISEAEE 361

Query: 1047 LLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAALGSLGNSFIGLVD 868
            LLQ+MNE+GY +DTV+CNIVIDGLC  GK+DKA+EIV+ MW HGSAALG+LGNS+IGLVD
Sbjct: 362  LLQKMNEKGYVIDTVTCNIVIDGLCNNGKLDKAIEIVNGMWTHGSAALGNLGNSYIGLVD 421

Query: 867  EENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDSIVYDIFLYSLCKR 688
            + +++KKCMPDLI+YSTII+GLC  GR+ EAKKKFIEMM KNL PDS +YD+F++S CK 
Sbjct: 422  DSDSRKKCMPDLISYSTIISGLCKAGRVGEAKKKFIEMMGKNLQPDSAIYDVFIHSFCKE 481

Query: 687  GKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMDEMRERGIPPNVFTY 508
            GK+SSAF+VLKDMEK+GC ++LQTYNSLI+GLG+KNQI+E+ GL+DEMRERG+ P+V  Y
Sbjct: 482  GKISSAFRVLKDMEKKGCNKTLQTYNSLIMGLGSKNQIFEIYGLIDEMRERGVSPDVSIY 541

Query: 507  NTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRIGEFRPAQEVFDVALG 328
            N ++S LCEGGR+++A S+++EMLQ+GISPNI SF +LI +FC+  +F    E+F++AL 
Sbjct: 542  NNVLSSLCEGGRVKDAPSVLDEMLQKGISPNISSFSILIKAFCKACDFSAVDEIFEIALN 601

Query: 327  ICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYKDLVDRLCKEERYEEA 148
            +CGHKE LY+  FNE+L GGE ++AKELFE  LDR FD+G+FLYKDL+D LCK+E+ ++A
Sbjct: 602  VCGHKEALYSLTFNELLVGGEVVKAKELFETALDRSFDVGNFLYKDLIDHLCKDEKLDDA 661

Query: 147  HDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESMLNMASGG 1
              IL K++ KGY FDP SFMPVID L K GNKHEA+EL+E M+ MAS G
Sbjct: 662  SGILHKLIDKGYWFDPASFMPVIDGLGKRGNKHEADELAEKMMEMASEG 710


>ref|XP_004154721.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Cucumis sativus]
          Length = 875

 Score =  956 bits (2472), Expect = 0.0
 Identities = 468/786 (59%), Positives = 600/786 (76%), Gaps = 7/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR-------SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCET 2185
            PNLAW LFKR       +S S    S+  V  I RILI + M  +I+ LH LLLS+  + 
Sbjct: 20   PNLAWLLFKRILSSPIPASSSFFKPSLQSVPAIARILITAKMHPQIDHLHQLLLSQHRDF 79

Query: 2184 HRVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISW 2005
               S +++V+ LA+ G ++ A+SQF+ +R  FP  PPPIS YN L   S+K +  + + W
Sbjct: 80   AHPSGFSLVRTLADLGLLENAISQFRSLRDRFPHDPPPISFYNLLFRCSLKESRVDCVIW 139

Query: 2004 LYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYC 1825
            LYKDM  A + PQTYTFNLLI ALC++G LE+AR++FDKM +KGC PNEF+ GILVRGYC
Sbjct: 140  LYKDMAVAKVKPQTYTFNLLISALCEMGYLENAREVFDKMSEKGCKPNEFSLGILVRGYC 199

Query: 1824 RXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVV 1645
            R          LD M+  G  PN V YNT+IS  C EG+T  AE+LVE+M+E G+ P++V
Sbjct: 200  RAGLHSHGIDLLDEMRSSGALPNRVAYNTVISSLCGEGQTVEAEKLVEKMREVGLSPDIV 259

Query: 1644 TFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKL 1465
            TFN RI+ALC++G ILEASRIF DMQ+D+E GLPKPNTVTYNLMLEGFC EGM EE + +
Sbjct: 260  TFNCRIAALCKSGQILEASRIFRDMQIDEEMGLPKPNTVTYNLMLEGFCSEGMFEEARAI 319

Query: 1464 IETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCK 1285
             ++MK +   S + SYNIW+ GLV++GKL +A  +L +M ++ IKPN Y++NI++ GLCK
Sbjct: 320  FDSMKNSETLS-LRSYNIWMLGLVRSGKLLEAHLILNEMAEKNIKPNLYSYNILVHGLCK 378

Query: 1284 NGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYT 1105
             GM  DAR ++GLM  +G+ PDTVTYSTLLHGYC RGKI+EAN VL  M+  GC PN YT
Sbjct: 379  YGMFSDARSILGLMRESGVAPDTVTYSTLLHGYCRRGKILEANYVLREMIQVGCFPNMYT 438

Query: 1104 CNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMW 925
            CN+LLHSLWKEG+ SEAE LLQ MNERGY LD V+CN +I+GLCK G +DKA+EIVS MW
Sbjct: 439  CNILLHSLWKEGRASEAEDLLQMMNERGYGLDNVTCNTMINGLCKAGNLDKAIEIVSGMW 498

Query: 924  NHGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQK 745
              GSA+LG+LGNSFI L D  NN KKC+PD ITY+TII GLC  GR+DEAKKK +EM+ K
Sbjct: 499  TRGSASLGNLGNSFIDLFDIRNNGKKCLPDSITYATIIGGLCKVGRVDEAKKKLLEMIGK 558

Query: 744  NLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEM 565
             L PDS+++D F+Y+ CK+GK+SSAF+VLK+MEK+GC +SL+TYNSLI GLG++NQI+E+
Sbjct: 559  KLSPDSLIFDTFIYNYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLGSENQIFEI 618

Query: 564  CGLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIIS 385
             GLMDEM+ERGI PNV+TYN IISCL EGG+L++A  L++EMLQ+GISPNIY+F++LI +
Sbjct: 619  YGLMDEMKERGIFPNVYTYNNIISCLSEGGKLKDATCLLDEMLQKGISPNIYTFRILIGA 678

Query: 384  FCRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGS 205
            F +  +F  AQE+F++AL +CGHKE LY+ +FNE+LAGGETL+AKELFEA LDR   L +
Sbjct: 679  FFKACDFGAAQELFEIALSLCGHKESLYSFMFNELLAGGETLKAKELFEAALDRSLALKN 738

Query: 204  FLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSES 25
            FLY+DL+++LCK+ + ++A  IL KMM K Y FDP SFMPVID L K G+KH A+E +E 
Sbjct: 739  FLYRDLIEKLCKDGKLDDASFILHKMMDKQYSFDPASFMPVIDELGKRGSKHAADEFAER 798

Query: 24   MLNMAS 7
            M+ MAS
Sbjct: 799  MMEMAS 804


>ref|XP_004144886.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Cucumis sativus] gi|449472527|ref|XP_004153621.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g17140-like [Cucumis sativus]
          Length = 875

 Score =  956 bits (2472), Expect = 0.0
 Identities = 468/786 (59%), Positives = 600/786 (76%), Gaps = 7/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR-------SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCET 2185
            PNLAW LFKR       +S S    S+  V  I RILI + M  +I+ LH LLLS+  + 
Sbjct: 20   PNLAWLLFKRILSSPIPASSSFFKPSLQSVPAIARILITAKMHPQIDHLHQLLLSQHRDF 79

Query: 2184 HRVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISW 2005
               S +++V+ LA+ G ++ A+SQF+ +R  FP  PPPIS YN L   S+K +  + + W
Sbjct: 80   AHPSGFSLVRTLADLGLLENAISQFRSLRDRFPHDPPPISFYNLLFRCSLKESRVDCVIW 139

Query: 2004 LYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYC 1825
            LYKDM  A + PQTYTFNLLI ALC++G LE+AR++FDKM +KGC PNEF+ GILVRGYC
Sbjct: 140  LYKDMAVARVKPQTYTFNLLISALCEMGYLENAREVFDKMSEKGCKPNEFSLGILVRGYC 199

Query: 1824 RXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVV 1645
            R          LD M+  G  PN V YNT+IS  C EG+T  AE+LVE+M+E G+ P++V
Sbjct: 200  RAGLHSHGIDLLDEMRSSGALPNRVAYNTVISSLCGEGQTVEAEKLVEKMREVGLSPDIV 259

Query: 1644 TFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKL 1465
            TFN RI+ALC++G ILEASRIF DMQ+D+E GLPKPNTVTYNLMLEGFC EGM EE + +
Sbjct: 260  TFNCRIAALCKSGQILEASRIFRDMQIDEEMGLPKPNTVTYNLMLEGFCSEGMFEEARAI 319

Query: 1464 IETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCK 1285
             ++MK +   S + SYNIW+ GLV++GKL +A  +L +M ++ IKPN Y++NI++ GLCK
Sbjct: 320  FDSMKNSETLS-LRSYNIWMLGLVRSGKLLEAHLILNEMAEKNIKPNLYSYNILVHGLCK 378

Query: 1284 NGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYT 1105
             GM  DAR ++GLM  +G+ PDTVTYSTLLHGYC RGKI+EAN VL  M+  GC PN YT
Sbjct: 379  YGMFSDARSILGLMRESGVAPDTVTYSTLLHGYCRRGKILEANYVLREMIQVGCFPNMYT 438

Query: 1104 CNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMW 925
            CN+LLHSLWKEG+ SEAE LLQ MNERGY LD V+CN +I+GLCK G +DKA+EIVS MW
Sbjct: 439  CNILLHSLWKEGRASEAEDLLQMMNERGYGLDNVTCNTMINGLCKAGNLDKAIEIVSGMW 498

Query: 924  NHGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQK 745
              GSA+LG+LGNSFI L D  NN KKC+PD ITY+TII GLC  GR+DEAKKK +EM+ K
Sbjct: 499  TRGSASLGNLGNSFIDLFDIRNNGKKCLPDSITYATIIGGLCKVGRVDEAKKKLLEMIGK 558

Query: 744  NLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEM 565
             L PDS+++D F+Y+ CK+GK+SSAF+VLK+MEK+GC +SL+TYNSLI GLG++NQI+E+
Sbjct: 559  KLSPDSLIFDTFIYNYCKQGKLSSAFRVLKEMEKKGCNKSLRTYNSLIQGLGSENQIFEI 618

Query: 564  CGLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIIS 385
             GLMDEM+ERGI PNV+TYN IISCL EGG+L++A  L++EMLQ+GISPNIY+F++LI +
Sbjct: 619  YGLMDEMKERGIFPNVYTYNNIISCLSEGGKLKDATCLLDEMLQKGISPNIYTFRILIGA 678

Query: 384  FCRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGS 205
            F +  +F  AQE+F++AL +CGHKE LY+ +FNE+LAGGETL+AKELFEA LDR   L +
Sbjct: 679  FFKACDFGAAQELFEIALSLCGHKESLYSFMFNELLAGGETLKAKELFEAALDRSLALKN 738

Query: 204  FLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSES 25
            FLY+DL+++LCK+ + ++A  IL KMM K Y FDP SFMPVID L K G+KH A+E +E 
Sbjct: 739  FLYRDLIEKLCKDGKLDDASFILHKMMDKQYSFDPASFMPVIDELGKRGSKHAADEFAER 798

Query: 24   MLNMAS 7
            M+ MAS
Sbjct: 799  MMEMAS 804


>ref|NP_179305.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122223754|sp|Q0WPZ6.1|PP158_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17140 gi|110737729|dbj|BAF00803.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|330251496|gb|AEC06590.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 874

 Score =  940 bits (2430), Expect = 0.0
 Identities = 450/786 (57%), Positives = 599/786 (76%), Gaps = 5/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR----SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRV 2176
            P LAW++FKR     S     IS++    I RIL+ + M  EI+ LHNL+LS   +  ++
Sbjct: 16   PRLAWRIFKRIFSSPSEESHGISLDATPTIARILVRAKMHEEIQELHNLILSSSIQKTKL 75

Query: 2175 SSY-AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLY 1999
            SS  +VV I A   ++DKA  QFQ VR+ FP + P + LYN L+ES +K    E +SWLY
Sbjct: 76   SSLLSVVSIFAKSNHIDKAFPQFQLVRSRFPENKPSVYLYNLLLESCIKERRVEFVSWLY 135

Query: 1998 KDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRX 1819
            KDM+  G+ PQTYTFNLLI ALCD   ++ AR+LFD+M +KGC PNEFTFGILVRGYC+ 
Sbjct: 136  KDMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCKA 195

Query: 1818 XXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTF 1639
                     L+ M+  GV PN VIYNT++S FCREG+ + +E++VE+M+E+G+VP++VTF
Sbjct: 196  GLTDKGLELLNAMESFGVLPNKVIYNTIVSSFCREGRNDDSEKMVEKMREEGLVPDIVTF 255

Query: 1638 NSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIE 1459
            NSRISALC+ G +L+ASRIFSDM++D+  GLP+PN++TYNLML+GFCK G+LE+ + L E
Sbjct: 256  NSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSITYNLMLKGFCKVGLLEDAKTLFE 315

Query: 1458 TMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNG 1279
            +++ N   +++ SYNIWL GLV++GK  +A++VLK M D+GI P+ Y++NI++DGLCK G
Sbjct: 316  SIRENDDLASLQSYNIWLQGLVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLG 375

Query: 1278 MLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCN 1099
            ML DA+ ++GLM   G+ PD VTY  LLHGYCS GK+  A  +L  MM + C PN YTCN
Sbjct: 376  MLSDAKTIVGLMKRNGVCPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCN 435

Query: 1098 VLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNH 919
            +LLHSLWK G+ISEAE+LL++MNE+GY LDTV+CNI++DGLC +G++DKA+EIV  M  H
Sbjct: 436  ILLHSLWKMGRISEAEELLRKMNEKGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGMRVH 495

Query: 918  GSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNL 739
            GSAALG+LGNS+IGLVD+   +  C+PDLITYST++NGLC  GR  EAK  F EMM + L
Sbjct: 496  GSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFAEAKNLFAEMMGEKL 555

Query: 738  YPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCG 559
             PDS+ Y+IF++  CK+GK+SSAF+VLKDMEK+GC +SL+TYNSLILGLG KNQI+E+ G
Sbjct: 556  QPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLILGLGIKNQIFEIHG 615

Query: 558  LMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFC 379
            LMDEM+E+GI PN+ TYNT I  LCEG ++E+A +L++EM+Q+ I+PN++SFK LI +FC
Sbjct: 616  LMDEMKEKGISPNICTYNTAIQYLCEGEKVEDATNLLDEMMQKNIAPNVFSFKYLIEAFC 675

Query: 378  RIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFL 199
            ++ +F  AQEVF+ A+ ICG KE LY+ +FNE+LA G+ L+A EL EAVLDR F+LG+FL
Sbjct: 676  KVPDFDMAQEVFETAVSICGQKEGLYSLMFNELLAAGQLLKATELLEAVLDRGFELGTFL 735

Query: 198  YKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESML 19
            YKDLV+ LCK++  E A  IL KM+ +GYGFDP + MPVID L K GNK EAN  ++ M+
Sbjct: 736  YKDLVESLCKKDELEVASGILHKMIDRGYGFDPAALMPVIDGLGKMGNKKEANSFADKMM 795

Query: 18   NMASGG 1
             MAS G
Sbjct: 796  EMASVG 801


>ref|XP_006296960.1| hypothetical protein CARUB_v10012952mg, partial [Capsella rubella]
            gi|482565669|gb|EOA29858.1| hypothetical protein
            CARUB_v10012952mg, partial [Capsella rubella]
          Length = 881

 Score =  940 bits (2429), Expect = 0.0
 Identities = 448/786 (56%), Positives = 597/786 (75%), Gaps = 5/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKRSSVSIP----VISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRV 2176
            P LAW +FKR S S      VIS+     I RIL+ + M  EIE +HNL+LS   E  R+
Sbjct: 23   PRLAWSIFKRISSSPSEESHVISLAAAPTIARILVRAKMHDEIEEIHNLILSSSIEKTRL 82

Query: 2175 SSY-AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLY 1999
            SS  +VV I A   Y+DKA  QFQ VR+ FP   P I LYN L+ES ++    E +SWLY
Sbjct: 83   SSLLSVVSIFAKSNYIDKAFPQFQFVRSRFPEKKPGIYLYNVLLESCIRERRVEFVSWLY 142

Query: 1998 KDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRX 1819
            KDM+  G+ PQTYTFNLLI ALCD   ++ AR+LFD+M +KGC PNEFTFGIL+RGYC+ 
Sbjct: 143  KDMVLCGIAPQTYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILIRGYCKA 202

Query: 1818 XXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTF 1639
                     L+ M+  G+ PN V+YNT++S FCREG+ + +E+LVE+M+E+G+VP++VTF
Sbjct: 203  GMSDKGLELLNSMESFGILPNKVVYNTIVSSFCREGRNDESEKLVEKMREEGLVPDIVTF 262

Query: 1638 NSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIE 1459
            NSRISALC+ G +L+ASRIF DM++D+  GLP+PN++TYNLML+GFCK G LE+ + L +
Sbjct: 263  NSRISALCKEGKVLDASRIFRDMELDEYLGLPRPNSITYNLMLKGFCKVGFLEDAKTLFD 322

Query: 1458 TMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNG 1279
            +++ N   +++ SYNIWL GLV++GK  +A++VLK M+D+GI P+ +++NI++DGLCK G
Sbjct: 323  SIRENDELASLQSYNIWLQGLVRHGKFIEAETVLKQMIDKGIGPSIFSYNILMDGLCKLG 382

Query: 1278 MLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCN 1099
            ML DA+ + GLM   G++PD VTY  LLHGYCS GK+  A ++L  MM + C PN YTCN
Sbjct: 383  MLSDAKTIFGLMKQNGVSPDAVTYGCLLHGYCSVGKVDAAKRLLQEMMRNNCLPNAYTCN 442

Query: 1098 VLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNH 919
            +LLHSLWK G+ISEAE+LL++MNE+GY LDTV+CNI+IDGLC++G++DKA+EIV  M  H
Sbjct: 443  ILLHSLWKMGRISEAEELLRQMNEKGYGLDTVTCNIIIDGLCESGELDKAIEIVKGMRVH 502

Query: 918  GSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNL 739
            GSAALG+LGNS+IGLVD+   +  C+PDLITYST++NGLC  GR  EAK  F EMM + L
Sbjct: 503  GSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFGEAKNLFAEMMGEKL 562

Query: 738  YPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCG 559
             PDS+ Y+IF++  CK GK+SSAF+VLK+MEK+GC +SL+TYN+LILGLG KNQI+E+ G
Sbjct: 563  QPDSVAYNIFIHHFCKHGKLSSAFRVLKEMEKKGCHKSLETYNALILGLGIKNQIFEIHG 622

Query: 558  LMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFC 379
            LMDEM+E+GI PN+ TYNT I  LCEGG +E+A +L++EM+Q+ ++PN++SFK LI +FC
Sbjct: 623  LMDEMKEKGILPNICTYNTAIKYLCEGGEVEDATNLLDEMMQKNVAPNVFSFKYLIEAFC 682

Query: 378  RIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFL 199
            ++ +F  AQEVF+ A  ICG KE LY+ IFNE++A  + L+A E+ EAVLDR F+LG+FL
Sbjct: 683  KVPDFDMAQEVFETAASICGQKEALYSLIFNELVAARQLLKATEVLEAVLDRGFELGTFL 742

Query: 198  YKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESML 19
            YKDL++ LCK++  E A +IL KM+ KGYGFDP + MPVID L K GNK EAN  +E M+
Sbjct: 743  YKDLIESLCKKDELEVASEILHKMIDKGYGFDPAALMPVIDGLGKMGNKKEANNFAEKMM 802

Query: 18   NMASGG 1
             MAS G
Sbjct: 803  EMASVG 808


>ref|XP_002884041.1| binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329881|gb|EFH60300.1| binding protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 874

 Score =  936 bits (2418), Expect = 0.0
 Identities = 451/786 (57%), Positives = 597/786 (75%), Gaps = 5/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR----SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRV 2176
            P LAW++FKR     S     IS+     +  IL+ + M  EIE LHNL+LS   +  ++
Sbjct: 16   PRLAWRIFKRIFSSPSEESHGISLAATPTMACILVRAKMHEEIEELHNLILSSSIQKTKL 75

Query: 2175 SSY-AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLY 1999
            SS  +VV I A   ++DKA  QFQ VR+ FP + P I LYN L+ES ++    E +SWLY
Sbjct: 76   SSLLSVVSIFAKSNHIDKAFPQFQFVRSRFPENKPGIYLYNVLLESCIRERRVEFVSWLY 135

Query: 1998 KDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRX 1819
            KDM+  G+ P+TYTFNLLI ALCD   ++ AR+LFD+M +KGC PNEFTFGILVRGYC+ 
Sbjct: 136  KDMVLCGISPETYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNEFTFGILVRGYCKA 195

Query: 1818 XXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTF 1639
                     L+ M+  GV PN V+YNT++S FCREG+ + +E+LVE+M+E+G+VP++VTF
Sbjct: 196  GLTDKGLELLNSMESFGVLPNKVVYNTIVSSFCREGRNDDSEKLVEKMREEGLVPDIVTF 255

Query: 1638 NSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIE 1459
            NSRISALC+ G +L+ASRIFSDM++D+  GLP+PN++TYNLML+GFCK G+LE+ + L E
Sbjct: 256  NSRISALCKEGKVLDASRIFSDMELDEYLGLPRPNSITYNLMLKGFCKVGLLEDAKTLFE 315

Query: 1458 TMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNG 1279
            +++ N    ++ SYNIWL GLV++GK  +A++VLK M+D+GI P+ Y++NI++DGLCK G
Sbjct: 316  SIRENDDLVSLQSYNIWLQGLVRHGKFIEAETVLKQMIDKGIGPSIYSYNILMDGLCKLG 375

Query: 1278 MLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCN 1099
            ML DA+ ++GLM   G++PD VTY  LLHGYCS GK+  A  +L  MM + C PN YTCN
Sbjct: 376  MLSDAKTIVGLMKRNGVSPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCN 435

Query: 1098 VLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNH 919
            +LLHSLW  G+ISEAE+LL++MNE+GY LDTV+CNI++DGLC +G++DKA+EIV  M  H
Sbjct: 436  ILLHSLWNMGRISEAEELLRKMNEKGYGLDTVTCNIIVDGLCGSGELDKAIEIVKGMRVH 495

Query: 918  GSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNL 739
            GSAALG+LGNS+IGLVD+   +  C+PDLITYST++NGLC  GR  EAK  F EMM + L
Sbjct: 496  GSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFAEAKTLFAEMMGEKL 555

Query: 738  YPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCG 559
             PDS+ Y+IF++  CK+GK+SSAF+VLKDMEK+GC +SL+TYNSLILGLG KNQI+E+ G
Sbjct: 556  QPDSLAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLILGLGIKNQIFEIHG 615

Query: 558  LMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFC 379
            LMDEMRE+GI PN+ TYNT I  LCEGG++E+A +L++EM+Q+ I+PN++SFK LI +FC
Sbjct: 616  LMDEMREKGISPNICTYNTAIQYLCEGGKVEDATNLLDEMMQKNIAPNVFSFKYLIGAFC 675

Query: 378  RIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFL 199
            ++ +F  AQEVF+ A+ ICG KE LY+ +FNE+LA G+ L+A EL EAVLDR F+LG+FL
Sbjct: 676  KVPDFDMAQEVFETAVSICGQKEGLYSLMFNELLAAGQLLKATELLEAVLDRGFELGTFL 735

Query: 198  YKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESML 19
            YKDLV  LCK++  E A  IL KM+ KGYGFDP + MPVID L K GNK EAN  +E M+
Sbjct: 736  YKDLVVSLCKKDELEVASGILHKMIDKGYGFDPAALMPVIDGLGKMGNKKEANNFAEKMM 795

Query: 18   NMASGG 1
             MAS G
Sbjct: 796  EMASVG 801


>ref|XP_006409347.1| hypothetical protein EUTSA_v10022542mg [Eutrema salsugineum]
            gi|557110509|gb|ESQ50800.1| hypothetical protein
            EUTSA_v10022542mg [Eutrema salsugineum]
          Length = 877

 Score =  924 bits (2388), Expect = 0.0
 Identities = 446/786 (56%), Positives = 588/786 (74%), Gaps = 5/786 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR----SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRV 2176
            P LAW +FKR     S     +S++    I RIL+ + M  EI  LH+L+LS   +    
Sbjct: 19   PRLAWSIFKRIFSSPSEESHGMSLSAAPTIARILVRAKMREEINELHSLMLSSSVQNAEF 78

Query: 2175 SSY-AVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLY 1999
            S+  +VV I A   ++DKA SQFQ +R+ FP + P I LYN L+E  +KG   E +SWLY
Sbjct: 79   STLLSVVSIFAKSDHIDKAFSQFQFLRSRFPENHPGIYLYNVLLEGCIKGRRVEFVSWLY 138

Query: 1998 KDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRX 1819
            KDM    + P+TYTFNLLI ALCD   ++ AR+LFD+M +KGC PN+FTFGILVRGYCR 
Sbjct: 139  KDMFLCRIAPETYTFNLLIRALCDSSCVDAARELFDEMPEKGCKPNDFTFGILVRGYCRV 198

Query: 1818 XXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTF 1639
                     L+ M+  GV PN V+YNT+IS FC+EG+ + +E+LVE+M+E+G++P++VTF
Sbjct: 199  GLPDKGLELLNSMRSSGVLPNKVVYNTIISSFCKEGRNDDSEKLVEKMREEGLMPDIVTF 258

Query: 1638 NSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIE 1459
            NSRISALC+ G + +ASRIFSDM++D+  GLP+PN +TYNLML+GFCK GMLEE + L E
Sbjct: 259  NSRISALCKEGKVRDASRIFSDMELDEYMGLPRPNRITYNLMLKGFCKVGMLEEAKTLFE 318

Query: 1458 TMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNG 1279
            ++  N   S + SYNIWL GLV++GK  +A++VLK M+D+G+ P+ Y++NI++DGLCK G
Sbjct: 319  SISENDDLSGLQSYNIWLQGLVRHGKFIEAETVLKQMIDKGVWPSIYSYNILLDGLCKLG 378

Query: 1278 MLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCN 1099
            ML DA  ++GLM   GI+PD VTY  LLHGYCS GK+  A  +L  MM + C PN YTCN
Sbjct: 379  MLSDANTIVGLMKRNGISPDAVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCN 438

Query: 1098 VLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNH 919
            +LLHSLWK G++SEAE LL++MNE+GY +DTV+CNI++DGLC +G +DKA+EIV  M  H
Sbjct: 439  ILLHSLWKMGRMSEAEDLLRKMNEKGYGIDTVTCNIIVDGLCGSGDLDKAIEIVKGMRVH 498

Query: 918  GSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNL 739
            GSAALG+LGNS+IGLVD+   +  C+PDLITYST++NGLC  GR  EAKK F EMM + L
Sbjct: 499  GSAALGNLGNSYIGLVDDSLIENNCLPDLITYSTLLNGLCKAGRFAEAKKLFAEMMGEKL 558

Query: 738  YPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCG 559
             PDS+ Y+IF++  CK+GK+SSAF+VLKDMEK+GC +SL+TYNSLILGLG +NQI+E+ G
Sbjct: 559  QPDSVAYNIFIHHFCKQGKISSAFRVLKDMEKKGCHKSLETYNSLILGLGIQNQIFEIHG 618

Query: 558  LMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFC 379
            LMDEM+E+GI PN+ TYNT I  LCEGG++E+A +L++EM+Q+ ISPN++SF  LI +FC
Sbjct: 619  LMDEMKEKGISPNICTYNTAIKYLCEGGKVEDATNLLDEMMQKNISPNVFSFTYLIGAFC 678

Query: 378  RIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFL 199
            ++ +F  AQE F+ A+ ICG KE LY+ +FNE+LA G+ L+A EL E VLDR F+LG+FL
Sbjct: 679  KVPDFDMAQEAFETAVSICGQKEGLYSLMFNELLAAGQLLKATELLETVLDRGFELGTFL 738

Query: 198  YKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESML 19
            YKDLV+ LCK++  E A  IL KM+ KGYGFDP + MPVID L K GNK EAN  +E M+
Sbjct: 739  YKDLVESLCKKDELEVASGILHKMIDKGYGFDPAALMPVIDGLGKMGNKTEANNFAEKMM 798

Query: 18   NMASGG 1
             MAS G
Sbjct: 799  EMASVG 804


>gb|ESW31227.1| hypothetical protein PHAVU_002G220400g [Phaseolus vulgaris]
          Length = 886

 Score =  906 bits (2342), Expect = 0.0
 Identities = 444/794 (55%), Positives = 589/794 (74%), Gaps = 16/794 (2%)
 Frame = -3

Query: 2343 PNLAWQLFKR------SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLL-SRPCET 2185
            P LAWQL KR      S+ S+   + + VT+ TRIL G+ M  E+  LH LLL S+P   
Sbjct: 18   PKLAWQLVKRVLSSPSSASSVTNQTQHLVTVTTRILAGANMHLELHGLHKLLLASQPYHI 77

Query: 2184 HRVSSYAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISW 2005
               S  ++V++LA  G++D+ALS F+ +RA FPS PP +  YN L+ S+++ N   L++W
Sbjct: 78   AHPSIVSMVRVLAQWGHIDEALSHFKSLRAQFPSSPPSLPFYNLLLRSTIQHNRPNLVTW 137

Query: 2004 LYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYC 1825
            LY DMI AG  PQTYTFNLLI +LCD    + A +LFDKM  KGC PNEFT GILVRG C
Sbjct: 138  LYTDMIAAGFTPQTYTFNLLIRSLCDSRAFDHALQLFDKMSQKGCHPNEFTLGILVRGLC 197

Query: 1824 RXXXXXXXXXXLDVMKEIGV---------FPNLVIYNTLISCFCREGKTEVAERLVEQMK 1672
            R          ++                  N V+YNTL+S FCR+   + AE+LVE+M 
Sbjct: 198  RAGRVRQALELVNNSYSNNNDNNSNNSCRIANRVVYNTLVSAFCRDELNDEAEKLVERMS 257

Query: 1671 EDGIVPNVVTFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKE 1492
            E G++P+VVTFNSRISALC+AG + EASRIF DMQMD+  GLP+PN VTYNLML+GFCK 
Sbjct: 258  ELGVLPDVVTFNSRISALCKAGKVHEASRIFRDMQMDEALGLPRPNVVTYNLMLKGFCKH 317

Query: 1491 GMLEEVQKLIETMKRNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTF 1312
            GM+E+ + L++TMK+ G F ++ SYNIWL GL++NG+L +A+ VL +M  +GI+PN YT+
Sbjct: 318  GMIEDARGLVDTMKKAGNFVSLKSYNIWLLGLLRNGELLEARLVLDEMTAKGIEPNAYTY 377

Query: 1311 NIVIDGLCKNGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMM 1132
            NI++DGLC+N ML DAR LM +M + G+ PDTVTYSTLLHGYC +GK+ EA  VLH M+ 
Sbjct: 378  NIMVDGLCRNHMLSDARGLMHVMKSNGVFPDTVTYSTLLHGYCIKGKVSEAKHVLHEMIR 437

Query: 1131 SGCTPNTYTCNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDK 952
            +GC PNTYTCN LLHSLWKEG+  EAE++LQ+MNE+ Y  DTV+CNIV++GL + G++DK
Sbjct: 438  NGCQPNTYTCNTLLHSLWKEGRTLEAEEMLQKMNEKCYQPDTVTCNIVVNGLSRNGELDK 497

Query: 951  AVEIVSEMWNHGSAALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAK 772
            A+EIVSEMW +G  +L   GN+F  L++  +N   C+PD ITY+T+INGLC  GRL+EAK
Sbjct: 498  AMEIVSEMWTNGPTSLDK-GNTFATLINSISNVSNCLPDGITYTTLINGLCKAGRLEEAK 556

Query: 771  KKFIEMMQKNLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGL 592
            KKFIEM+ KNL+PDS+ YD F++S  K+GK+SSAF+VLKDME+ GC ++LQTYN+LILGL
Sbjct: 557  KKFIEMLAKNLHPDSVTYDTFIWSFSKQGKISSAFRVLKDMERNGCSKTLQTYNALILGL 616

Query: 591  GNKNQIYEMCGLMDEMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNI 412
            G+K QI+E+ GLMDEM+E+GI P++FTYN IISCLCEGG+  +A +L++EML +GISPNI
Sbjct: 617  GSKKQIFEIYGLMDEMKEKGINPDIFTYNNIISCLCEGGKANDAFTLLHEMLDKGISPNI 676

Query: 411  YSFKLLIISFCRIGEFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAV 232
             SFK+LI + C+  +F+ A E+F+VAL ICGHKE LY+++FNE+L GG+  EAKELFEA 
Sbjct: 677  SSFKILIKALCKSSDFKVACELFEVALSICGHKEALYSSLFNELLGGGQLSEAKELFEAS 736

Query: 231  LDRCFDLGSFLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNK 52
            LDR   L +F+YKD ++RLC++ER  +A  +L K++ KGYG D  SFMPVID L K G K
Sbjct: 737  LDRYLTLKNFMYKDFIERLCQDERLADASSLLHKLIDKGYGVDHASFMPVIDGLNKRGQK 796

Query: 51   HEANELSESMLNMA 10
             +A+EL++ M+ +A
Sbjct: 797  QKADELAKRMMELA 810


>ref|XP_004504955.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17140-like
            [Cicer arietinum]
          Length = 867

 Score =  905 bits (2339), Expect = 0.0
 Identities = 436/776 (56%), Positives = 587/776 (75%), Gaps = 2/776 (0%)
 Frame = -3

Query: 2343 PNLAWQLFKR--SSVSIPVISINYVTLITRILIGSGMFTEIEALHNLLLSRPCETHRVSS 2170
            P LAW LFKR  SS S    + + +  ITRIL+ + M  +I+ L  L+ +        S 
Sbjct: 18   PKLAWHLFKRILSSPSSSTSTHHLLPTITRILLTANMHRQIDNLTQLITNNHPNIAHSSL 77

Query: 2169 YAVVKILANCGYVDKALSQFQDVRAHFPSHPPPISLYNFLIESSMKGNNHELISWLYKDM 1990
             +++++LA   ++D A S F+ +R+ FPS P P+ LY+ L+ SS+  N    ++ LY DM
Sbjct: 78   ISILRVLAQSPHIDFAFSHFKSLRSQFPSTPLPLQLYHILLRSSLHHNRPHFVTSLYTDM 137

Query: 1989 IFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEFTFGILVRGYCRXXXX 1810
            I AG+ PQTYTFNLL+ +LC+   L+ A +LFD+M +KGC PNEFT GILVRG+CR    
Sbjct: 138  IQAGVHPQTYTFNLLLQSLCESNALDHALQLFDRMSEKGCHPNEFTVGILVRGFCRAGKT 197

Query: 1809 XXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSR 1630
                  +D   +     N V+YNTL+S FC++   + AE+LVE+M++ G+ P+VVTFNSR
Sbjct: 198  QQALEFID--NKFCKNVNRVVYNTLVSSFCKQDMNDEAEKLVERMRDQGLFPDVVTFNSR 255

Query: 1629 ISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMK 1450
            ISALC AG + EASRIF DMQMD E GLPKPN VT+NLM++GFC++GM++E   L+ETMK
Sbjct: 256  ISALCRAGKVFEASRIFRDMQMDGELGLPKPNVVTFNLMVKGFCQQGMMKEASSLVETMK 315

Query: 1449 RNGVFSNVDSYNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLG 1270
            + G F  ++SYN WL G ++NGKL +A+  L +MVD G++PN Y++NIV+DGLC+N M+ 
Sbjct: 316  KAGNFVTLESYNTWLLGFLRNGKLLEARLFLDEMVDNGVEPNIYSYNIVMDGLCRNHMML 375

Query: 1269 DARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLL 1090
            DAR LM LM + G+ PDTVTY+TLLHGYCS+GK+ EA  VL+ M+  GC PNTYTCN LL
Sbjct: 376  DARRLMDLMVSNGVCPDTVTYTTLLHGYCSKGKVFEAKAVLNEMIRKGCHPNTYTCNTLL 435

Query: 1089 HSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSA 910
            HSLWKEG+ SEAE++L +MNE+ Y LDTV+CNIV++GLCK G+++KA+E+VSEMW  G+ 
Sbjct: 436  HSLWKEGRKSEAEEMLHKMNEKCYQLDTVTCNIVVNGLCKNGELEKAIEVVSEMWTDGTN 495

Query: 909  ALGSLGNSFIGLVDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPD 730
            +L    NSF GLV   +N    MPD+ITY+T+INGLC  GRL+EAKKKFIEMM KNL+PD
Sbjct: 496  SLDK-ENSFAGLVSLIHNVSTNMPDVITYTTLINGLCKVGRLEEAKKKFIEMMAKNLHPD 554

Query: 729  SIVYDIFLYSLCKRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMD 550
            S+ YD F+ S CK+GK+SSA +VLKDME+ GC +++QTYNSLI+GLG+K QI+EM GLMD
Sbjct: 555  SVTYDTFVSSFCKQGKISSALRVLKDMERNGCGKTIQTYNSLIMGLGSKGQIFEMYGLMD 614

Query: 549  EMRERGIPPNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFKLLIISFCRIG 370
            EMRERGI P++ TYN +ISCLC+GG+ ++A SL++EML +GISPN+ SFK+LI++FC+ G
Sbjct: 615  EMRERGIRPDICTYNNMISCLCKGGKAKDATSLLHEMLDKGISPNVTSFKILILAFCKSG 674

Query: 369  EFRPAQEVFDVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDRCFDLGSFLYKD 190
            +F+ A E+FDVAL +CGHKE LY+ +FNE+LAGG+  +AKELFEA LD    + +F+YKD
Sbjct: 675  DFKVACELFDVALSVCGHKEALYSLMFNELLAGGKLSDAKELFEASLDSSLLVKNFMYKD 734

Query: 189  LVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEANELSESM 22
            L+DRLCK++R ++AH +LQK++ KGYGFDP SF+PVID L+K GNK +A+EL+  M
Sbjct: 735  LIDRLCKDKRLDDAHSLLQKLIDKGYGFDPSSFIPVIDGLSKRGNKQQADELARIM 790



 Score =  218 bits (554), Expect = 1e-53
 Identities = 159/610 (26%), Positives = 283/610 (46%), Gaps = 29/610 (4%)
 Frame = -3

Query: 1755 LVIYNTLISCFCREGKTEVAERLVEQMKEDGIVPNVVTFNSRISALCEAGSILEASRIFS 1576
            L +Y+ L+       +      L   M + G+ P   TFN  + +LCE+ ++  A ++F 
Sbjct: 111  LQLYHILLRSSLHHNRPHFVTSLYTDMIQAGVHPQTYTFNLLLQSLCESNALDHALQLFD 170

Query: 1575 DMQMDQEFGLPKPNTVTYNLMLEGFCKEGMLEEVQKLIETMKRNGVFSNVDS--YNIWLH 1402
             M    E G   PN  T  +++ GFC+ G  ++  + I+    N    NV+   YN  + 
Sbjct: 171  RMS---EKGC-HPNEFTVGILVRGFCRAGKTQQALEFID----NKFCKNVNRVVYNTLVS 222

Query: 1401 GLVKNGKLFDAQSVLKDMVDEGIKPNNYTFNIVIDGLCKNGMLGDARMLMGLMTTAGIT- 1225
               K     +A+ +++ M D+G+ P+  TFN  I  LC+ G + +A  +   M   G   
Sbjct: 223  SFCKQDMNDEAEKLVERMRDQGLFPDVVTFNSRISALCRAGKVFEASRIFRDMQMDGELG 282

Query: 1224 ---PDTVTYSTLLHGYCSRGKIIEANKVLHAMMMSGCTPNTYTCNVLLHSLWKEGKISEA 1054
               P+ VT++ ++ G+C +G + EA+ ++  M  +G      + N  L    + GK+ EA
Sbjct: 283  LPKPNVVTFNLMVKGFCQQGMMKEASSLVETMKKAGNFVTLESYNTWLLGFLRNGKLLEA 342

Query: 1053 EKLLQRMNERGYDLDTVSCNIVIDGLCKTGKVDKAVEIVSEMWNHGSAALGSLGNSFIGL 874
               L  M + G + +  S NIV+DGLC+   +  A  ++  M ++G              
Sbjct: 343  RLFLDEMVDNGVEPNIYSYNIVMDGLCRNHMMLDARRLMDLMVSNGVC------------ 390

Query: 873  VDEENNKKKCMPDLITYSTIINGLCIHGRLDEAKKKFIEMMQKNLYPDSIVYDIFLYSLC 694
                       PD +TY+T+++G C  G++ EAK    EM++K  +P++   +  L+SL 
Sbjct: 391  -----------PDTVTYTTLLHGYCSKGKVFEAKAVLNEMIRKGCHPNTYTCNTLLHSLW 439

Query: 693  KRGKVSSAFQVLKDMEKRGCKRSLQTYNSLILGLGNKNQIYEMCGLMDEMRERGIP---- 526
            K G+ S A ++L  M ++  +    T N ++ GL    ++ +   ++ EM   G      
Sbjct: 440  KEGRKSEAEEMLHKMNEKCYQLDTVTCNIVVNGLCKNGELEKAIEVVSEMWTDGTNSLDK 499

Query: 525  ------------------PNVFTYNTIISCLCEGGRLEEAASLINEMLQQGISPNIYSFK 400
                              P+V TY T+I+ LC+ GRLEEA     EM+ + + P+  ++ 
Sbjct: 500  ENSFAGLVSLIHNVSTNMPDVITYTTLINGLCKVGRLEEAKKKFIEMMAKNLHPDSVTYD 559

Query: 399  LLIISFCRIGEFRPAQEVF-DVALGICGHKEVLYATIFNEMLAGGETLEAKELFEAVLDR 223
              + SFC+ G+   A  V  D+    CG     Y ++   + + G+  E   L + + +R
Sbjct: 560  TFVSSFCKQGKISSALRVLKDMERNGCGKTIQTYNSLIMGLGSKGQIFEMYGLMDEMRER 619

Query: 222  CFDLGSFLYKDLVDRLCKEERYEEAHDILQKMMQKGYGFDPVSFMPVIDALTKSGNKHEA 43
                    Y +++  LCK  + ++A  +L +M+ KG   +  SF  +I A  KSG+   A
Sbjct: 620  GIRPDICTYNNMISCLCKGGKAKDATSLLHEMLDKGISPNVTSFKILILAFCKSGDFKVA 679

Query: 42   NELSESMLNM 13
             EL +  L++
Sbjct: 680  CELFDVALSV 689



 Score =  185 bits (469), Expect = 9e-44
 Identities = 153/580 (26%), Positives = 239/580 (41%), Gaps = 107/580 (18%)
 Frame = -3

Query: 2064 LYNFLIESSMKGNNHELISWLYKDMIFAGLPPQTYTFNLLICALCDLGRLEDARKLFDKM 1885
            +YN L+ S  K + ++    L + M   GL P   TFN  I ALC  G++ +A ++F  M
Sbjct: 216  VYNTLVSSFCKQDMNDEAEKLVERMRDQGLFPDVVTFNSRISALCRAGKVFEASRIFRDM 275

Query: 1884 RDKGCM----PNEFTFGILVRGYCRXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCR 1717
            +  G +    PN  TF ++V+G+C+          ++ MK+ G F  L  YNT +  F R
Sbjct: 276  QMDGELGLPKPNVVTFNLMVKGFCQQGMMKEASSLVETMKKAGNFVTLESYNTWLLGFLR 335

Query: 1716 EGKTEVAERLVEQMKEDGIVPNVVTFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKP 1537
             GK   A   +++M ++G+ PN+ ++N  +  LC    +L+A R+   M  +       P
Sbjct: 336  NGKLLEARLFLDEMVDNGVEPNIYSYNIVMDGLCRNHMMLDARRLMDLMVSNGVC----P 391

Query: 1536 NTVTYNLMLEGFCKEGMLEEVQKLIETMKRNGVFSNVDSYNIWLH--------------- 1402
            +TVTY  +L G+C +G + E + ++  M R G   N  + N  LH               
Sbjct: 392  DTVTYTTLLHGYCSKGKVFEAKAVLNEMIRKGCHPNTYTCNTLLHSLWKEGRKSEAEEML 451

Query: 1401 --------------------GLVKNGKLFDAQSVLKDMVDEGIK---------------- 1330
                                GL KNG+L  A  V+ +M  +G                  
Sbjct: 452  HKMNEKCYQLDTVTCNIVVNGLCKNGELEKAIEVVSEMWTDGTNSLDKENSFAGLVSLIH 511

Query: 1329 ------PNNYTFNIVIDGLCKNGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKI 1168
                  P+  T+  +I+GLCK G L +A+     M    + PD+VTY T +  +C +GKI
Sbjct: 512  NVSTNMPDVITYTTLINGLCKVGRLEEAKKKFIEMMAKNLHPDSVTYDTFVSSFCKQGKI 571

Query: 1167 IEANKVLHAMMMSGCTPNTYTCNVLLHSLWKEGKISEAEKLLQRMNERGYDLDTVSCNIV 988
              A +VL  M  +GC     T N L+  L  +G+I E   L+  M ERG   D  + N +
Sbjct: 572  SSALRVLKDMERNGCGKTIQTYNSLIMGLGSKGQIFEMYGLMDEMRERGIRPDICTYNNM 631

Query: 987  IDGLCKTGKVDKAVEIVSEMWN-------------------------------------- 922
            I  LCK GK   A  ++ EM +                                      
Sbjct: 632  ISCLCKGGKAKDATSLLHEMLDKGISPNVTSFKILILAFCKSGDFKVACELFDVALSVCG 691

Query: 921  HGSAALGSLGNSFIG---LVD-----EENNKKKCMPDLITYSTIINGLCIHGRLDEAKKK 766
            H  A    + N  +    L D     E +     +     Y  +I+ LC   RLD+A   
Sbjct: 692  HKEALYSLMFNELLAGGKLSDAKELFEASLDSSLLVKNFMYKDLIDRLCKDKRLDDAHSL 751

Query: 765  FIEMMQKNLYPDSIVYDIFLYSLCKRGKVSSAFQVLKDME 646
              +++ K    D   +   +  L KRG    A ++ + ME
Sbjct: 752  LQKLIDKGYGFDPSSFIPVIDGLSKRGNKQQADELARIME 791



 Score =  122 bits (305), Expect = 1e-24
 Identities = 82/303 (27%), Positives = 148/303 (48%), Gaps = 11/303 (3%)
 Frame = -3

Query: 2004 LYKDMIFAGLP----------PQTYTFNLLICALCDLGRLEDARKLFDKMRDKGCMPNEF 1855
            L K+  FAGL           P   T+  LI  LC +GRLE+A+K F +M  K   P+  
Sbjct: 497  LDKENSFAGLVSLIHNVSTNMPDVITYTTLINGLCKVGRLEEAKKKFIEMMAKNLHPDSV 556

Query: 1854 TFGILVRGYCRXXXXXXXXXXLDVMKEIGVFPNLVIYNTLISCFCREGKTEVAERLVEQM 1675
            T+   V  +C+          L  M+  G    +  YN+LI     +G+      L+++M
Sbjct: 557  TYDTFVSSFCKQGKISSALRVLKDMERNGCGKTIQTYNSLIMGLGSKGQIFEMYGLMDEM 616

Query: 1674 KEDGIVPNVVTFNSRISALCEAGSILEASRIFSDMQMDQEFGLPKPNTVTYNLMLEGFCK 1495
            +E GI P++ T+N+ IS LC+ G   +A+ +  +M +D+      PN  ++ +++  FCK
Sbjct: 617  RERGIRPDICTYNNMISCLCKGGKAKDATSLLHEM-LDKGIS---PNVTSFKILILAFCK 672

Query: 1494 EGMLEEVQKLIETMKRNGVFSNVDS-YNIWLHGLVKNGKLFDAQSVLKDMVDEGIKPNNY 1318
             G  +   +L +      V  + ++ Y++  + L+  GKL DA+ + +  +D  +   N+
Sbjct: 673  SGDFKVACELFDVAL--SVCGHKEALYSLMFNELLAGGKLSDAKELFEASLDSSLLVKNF 730

Query: 1317 TFNIVIDGLCKNGMLGDARMLMGLMTTAGITPDTVTYSTLLHGYCSRGKIIEANKVLHAM 1138
             +  +ID LCK+  L DA  L+  +   G   D  ++  ++ G   RG   +A+++   M
Sbjct: 731  MYKDLIDRLCKDKRLDDAHSLLQKLIDKGYGFDPSSFIPVIDGLSKRGNKQQADELARIM 790

Query: 1137 MMS 1129
             ++
Sbjct: 791  ELA 793

