BLASTX nr result

ID: Akebia23_contig00027673 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00027673
         (2174 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279193.1| PREDICTED: putative pentatricopeptide repeat...   776   0.0  
gb|EXB55995.1| hypothetical protein L484_018781 [Morus notabilis]     768   0.0  
ref|XP_002515231.1| pentatricopeptide repeat-containing protein,...   759   0.0  
ref|XP_007051214.1| Pentatricopeptide repeat superfamily protein...   744   0.0  
ref|XP_006492377.1| PREDICTED: putative pentatricopeptide repeat...   733   0.0  
ref|XP_006444562.1| hypothetical protein CICLE_v10019795mg [Citr...   731   0.0  
ref|XP_003518677.1| PREDICTED: putative pentatricopeptide repeat...   730   0.0  
ref|XP_004495883.1| PREDICTED: putative pentatricopeptide repeat...   728   0.0  
ref|XP_004150840.1| PREDICTED: putative pentatricopeptide repeat...   727   0.0  
ref|XP_004163923.1| PREDICTED: putative pentatricopeptide repeat...   726   0.0  
ref|XP_007132128.1| hypothetical protein PHAVU_011G069000g [Phas...   724   0.0  
ref|XP_004295891.1| PREDICTED: putative pentatricopeptide repeat...   721   0.0  
ref|XP_003591356.1| Pentatricopeptide repeat-containing protein ...   716   0.0  
ref|XP_002310039.2| hypothetical protein POPTR_0007s06780g [Popu...   714   0.0  
ref|XP_006340302.1| PREDICTED: putative pentatricopeptide repeat...   707   0.0  
ref|XP_004251431.1| PREDICTED: putative pentatricopeptide repeat...   706   0.0  
ref|XP_006418341.1| hypothetical protein EUTSA_v10007463mg [Eutr...   685   0.0  
ref|XP_002889402.1| hypothetical protein ARALYDRAFT_470202 [Arab...   684   0.0  
gb|AAG00894.1|AC064879_12 Hypothetical protein [Arabidopsis thal...   677   0.0  
ref|NP_171744.1| pentatricopeptide repeat-containing protein [Ar...   677   0.0  

>ref|XP_002279193.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Vitis vinifera]
          Length = 526

 Score =  776 bits (2004), Expect = 0.0
 Identities = 378/507 (74%), Positives = 440/507 (86%), Gaps = 4/507 (0%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANPILNHSLNSITQNPNSLLFRFFCSVN----SESNQDVETIFQI 173
            M L+  S +S +R YA  I    LNS  QNP+S+  R + S      +  N DVET+F+I
Sbjct: 8    MILKPLSLQSSSR-YAPSISTIGLNSNPQNPSSVFSRLYWSETIAETTVDNGDVETVFRI 66

Query: 174  INSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSS 353
            ++S+ ST+NLK+SLK+S +FLSNDLIDKVLKR+RFSH NP QAL FFNYT++R+GFYH+ 
Sbjct: 67   VSSASSTRNLKQSLKSSGVFLSNDLIDKVLKRVRFSHGNPFQALAFFNYTNKRKGFYHTP 126

Query: 354  FSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFG 533
            FS DT+LYILGR+R+FDQIWE+L++MRRKD+SLI+PR+VQ+VL RIAKVCSV+QTVESF 
Sbjct: 127  FSLDTMLYILGRSRRFDQIWELLVDMRRKDQSLISPRSVQVVLGRIAKVCSVKQTVESFR 186

Query: 534  KFRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEE 713
            KF+KLVPEFDT CFNALLRTLCQEKSM DARNVYHSLKH F+P+L+TFNILLSGWKS+EE
Sbjct: 187  KFKKLVPEFDTACFNALLRTLCQEKSMRDARNVYHSLKHDFRPDLRTFNILLSGWKSAEE 246

Query: 714  AEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIG 893
            AEGFF+EM +MGV+PD+VSYNCL+DVYCK RE+++AYKVIDKMRDE+ISPDVI+YTSIIG
Sbjct: 247  AEGFFDEMREMGVEPDVVSYNCLIDVYCKGREIERAYKVIDKMRDEQISPDVISYTSIIG 306

Query: 894  GLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDP 1073
            GLGL+GQPDKARDVLKEMKEYG YPDVAAYNA IRNFCIA RLGD   LMDEMVGKGL P
Sbjct: 307  GLGLVGQPDKARDVLKEMKEYGCYPDVAAYNAAIRNFCIANRLGDADGLMDEMVGKGLSP 366

Query: 1074 NATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLW 1253
            NATTYNLFFRCFYWSNDL  S  LY+RM  +GCLPNTQSCMFL RLFR++EK+EMAL LW
Sbjct: 367  NATTYNLFFRCFYWSNDLGRSCGLYQRMKKTGCLPNTQSCMFLTRLFRRQEKVEMALELW 426

Query: 1254 NDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELA 1433
            NDMVEKGFGSY+LVSDVLFD+LCD+GKL E EKC LQM+EKG KPSNVSF+RIKVLMELA
Sbjct: 427  NDMVEKGFGSYILVSDVLFDMLCDMGKLVEVEKCCLQMIEKGHKPSNVSFRRIKVLMELA 486

Query: 1434 KREEALQCLSEKMAIFGPVVQVHERED 1514
             + EALQ L+EKMA+FGP  QV ER +
Sbjct: 487  NKHEALQNLTEKMAMFGPSTQVQERAE 513


>gb|EXB55995.1| hypothetical protein L484_018781 [Morus notabilis]
          Length = 486

 Score =  768 bits (1982), Expect = 0.0
 Identities = 372/469 (79%), Positives = 424/469 (90%), Gaps = 2/469 (0%)
 Frame = +3

Query: 90   QNPNSLLFR-FFCS-VNSESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVL 263
            QNPN  + R FFCS  N  +N  V+T+F+II+SS S++N+K+SLK+S +FLSNDLIDKVL
Sbjct: 15   QNPNLFISRLFFCSETNPATNDPVDTVFRIISSSTSSKNMKQSLKSSGVFLSNDLIDKVL 74

Query: 264  KRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKD 443
            KR+RFSHANPLQ L+FFNYT  R+GFYH+ FS DTILYILGR+R F++IW+VL++ + KD
Sbjct: 75   KRVRFSHANPLQTLDFFNYTGNRKGFYHTPFSLDTILYILGRSRMFEKIWDVLVDTKFKD 134

Query: 444  RSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSDA 623
            R+LITPRTV +VLARIAKVCSVRQTVESF KF+KLVPEFDT CFNALLRTLCQEKSM+DA
Sbjct: 135  RNLITPRTVMVVLARIAKVCSVRQTVESFRKFKKLVPEFDTNCFNALLRTLCQEKSMADA 194

Query: 624  RNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKS 803
            RNVYHSLKH F+PNLQTFNILLSGWKS EEAEGFFEEM +MGVKPD+VSYNCLVDVYCK 
Sbjct: 195  RNVYHSLKHSFRPNLQTFNILLSGWKSCEEAEGFFEEMREMGVKPDVVSYNCLVDVYCKG 254

Query: 804  REMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAY 983
            RE++KA+KV+ KMRDE+I PDVITYTSIIGGLGL+GQPDKARDVLKEMKE G YPDVAAY
Sbjct: 255  REIEKAFKVVAKMRDEDIQPDVITYTSIIGGLGLVGQPDKARDVLKEMKEDGCYPDVAAY 314

Query: 984  NAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMD 1163
            NA IRNFCIAKRLG  Y+LMDEMV KGL+ NATTYNLFFR FYWSNDL SSW+LY RMM+
Sbjct: 315  NAAIRNFCIAKRLGVAYSLMDEMVSKGLNANATTYNLFFRVFYWSNDLTSSWNLYGRMME 374

Query: 1164 SGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDE 1343
            +GCLPNTQSCMFLIRLFR++EK+EMAL LWNDMVEKGFGSYVLVSDVLFDLLCD GKL E
Sbjct: 375  TGCLPNTQSCMFLIRLFRRQEKVEMALQLWNDMVEKGFGSYVLVSDVLFDLLCDAGKLME 434

Query: 1344 AEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPV 1490
            AE+CFLQMVEKGQKPSNVS++RIKVLMELA ++++L  LSEKMA+FGPV
Sbjct: 435  AERCFLQMVEKGQKPSNVSYRRIKVLMELANKQDSLHILSEKMALFGPV 483


>ref|XP_002515231.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223545711|gb|EEF47215.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 505

 Score =  759 bits (1961), Expect = 0.0
 Identities = 369/478 (77%), Positives = 419/478 (87%), Gaps = 1/478 (0%)
 Frame = +3

Query: 84   ITQNPNSLLFRFFCSVNSES-NQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKV 260
            I++ P SLL   F S N  S N DVE +++II SS S QNLK+SL ++ IFLSNDLIDKV
Sbjct: 15   ISKTPISLLANLFFSTNPTSLNDDVEVVYRIITSSSSVQNLKQSLTSTGIFLSNDLIDKV 74

Query: 261  LKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRK 440
            LKR+RFSH NPLQALEFF YT  R+GFYH+ +S DTILYILGR+RKFD IW+VLI+M+RK
Sbjct: 75   LKRVRFSHGNPLQALEFFYYTDNRKGFYHTPYSLDTILYILGRSRKFDHIWDVLIKMKRK 134

Query: 441  DRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSD 620
            DR LI+PRT+QIVLAR+AK+CSVRQTVESF KF+K V   DTTCFNALLRTLCQEKSM+D
Sbjct: 135  DRFLISPRTMQIVLARVAKLCSVRQTVESFRKFKKFVSVLDTTCFNALLRTLCQEKSMTD 194

Query: 621  ARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCK 800
            ARNVYH LK +FKPNLQTFNILLSGWK SEEAE FFEEM ++G+KPD+VSYN L+DVYCK
Sbjct: 195  ARNVYHRLKKEFKPNLQTFNILLSGWKQSEEAELFFEEMRELGIKPDVVSYNSLIDVYCK 254

Query: 801  SREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAA 980
             REM+KAYKV++KMR+E+ISPDVITYTSIIGGLGL+GQPDKARD+L EMKEYG YPDVAA
Sbjct: 255  DREMEKAYKVVEKMREEDISPDVITYTSIIGGLGLVGQPDKARDILNEMKEYGCYPDVAA 314

Query: 981  YNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMM 1160
            YNAVIRN+CIAKRLGD  NLMDEM  KGL PNATTYNLFFR FYWSNDL +SWSLY+RMM
Sbjct: 315  YNAVIRNYCIAKRLGDASNLMDEMASKGLSPNATTYNLFFRVFYWSNDLRNSWSLYRRMM 374

Query: 1161 DSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLD 1340
            +SGCLPNTQSCMFLIRLFRK EK+EMAL LWNDMVEKGFGSY+LVSDVLFDLLCD+GKL 
Sbjct: 375  ESGCLPNTQSCMFLIRLFRKHEKVEMALTLWNDMVEKGFGSYILVSDVLFDLLCDMGKLV 434

Query: 1341 EAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHERED 1514
            EAEKCFLQM+EKG KPSNVSF+RIKVLMEL  + +AL  L +KMAIFG  +Q+ ERE+
Sbjct: 435  EAEKCFLQMIEKGHKPSNVSFRRIKVLMELVNKHDALLNLQKKMAIFGSSIQLPEREE 492



 Score = 82.8 bits (203), Expect = 6e-13
 Identities = 105/458 (22%), Positives = 179/458 (39%), Gaps = 29/458 (6%)
 Frame = +3

Query: 768  SYNCLVDVYCKSREMDKAYKVIDKMRDEE---ISPDVITYTSIIGGLGLIGQPDKARDVL 938
            S + ++ +  +SR+ D  + V+ KM+ ++   ISP   T   ++  +  +    +  +  
Sbjct: 107  SLDTILYILGRSRKFDHIWDVLIKMKRKDRFLISPR--TMQIVLARVAKLCSVRQTVESF 164

Query: 939  KEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWS 1118
            ++ K++ S  D   +NA++R  C  K + D  N+   +  K   PN  T+N+    +  S
Sbjct: 165  RKFKKFVSVLDTTCFNALLRTLCQEKSMTDARNVYHRLK-KEFKPNLQTFNILLSGWKQS 223

Query: 1119 NDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVS 1298
             + E     ++ M + G  P+  S   LI ++ K  ++E A  +   M E+     V+  
Sbjct: 224  EEAEL---FFEEMRELGIKPDVVSYNSLIDVYCKDREMEKAYKVVEKMREEDISPDVITY 280

Query: 1299 DVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAI 1478
              +   L  +G+ D+A     +M E G  P   ++  +     +AKR      L ++MA 
Sbjct: 281  TSIIGGLGLVGQPDKARDILNEMKEYGCYPDVAAYNAVIRNYCIAKRLGDASNLMDEMAS 340

Query: 1479 FGPVVQVHEREDTGAYSDSRLC*IKYGIYF*FGKTIHEWIKD*TFGEIAHPKLLYSRSVA 1658
             G                       Y ++F                     ++ Y  +  
Sbjct: 341  KGLSPNA----------------TTYNLFF---------------------RVFYWSNDL 363

Query: 1659 RQHLSRYNYFTRMMGSRCLPNTQGGGDGA*AVE**GQCFWVIH----------------- 1787
            R   S Y    RMM S CLPNTQ              C ++I                  
Sbjct: 364  RNSWSLYR---RMMESGCLPNTQS-------------CMFLIRLFRKHEKVEMALTLWND 407

Query: 1788 -----FG----IGFVI*YLLCDMGKFVMQSGVFY*ILRIRL**YGDEIWMVDKECMSSDV 1940
                 FG    +  V+  LLCDMGK V     F              + M++K    S+V
Sbjct: 408  MVEKGFGSYILVSDVLFDLLCDMGKLVEAEKCF--------------LQMIEKGHKPSNV 453

Query: 1941 SSRGIKFLVERVNGQATILNMLVKKVVFGSAVKCMERK 2054
            S R IK L+E VN    +LN+  K  +FGS+++  ER+
Sbjct: 454  SFRRIKVLMELVNKHDALLNLQKKMAIFGSSIQLPERE 491


>ref|XP_007051214.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508703475|gb|EOX95371.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 511

 Score =  744 bits (1921), Expect = 0.0
 Identities = 365/488 (74%), Positives = 423/488 (86%), Gaps = 2/488 (0%)
 Frame = +3

Query: 57   PILNHSLNSITQNPNSLL-FRFFCS-VNSESNQDVETIFQIINSSESTQNLKESLKTSQI 230
            P L+H   S   +  S L F+FFCS  N  S+ DV+ +++II SS S++NL +SLK++ I
Sbjct: 5    PSLSHQSPSRYVSSFSFLPFKFFCSDSNPPSSDDVDIVYRIIASSTSSKNLTQSLKSTGI 64

Query: 231  FLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQI 410
            FLSN LIDKVLKR+RFSH NPL A E F YT +R+GFYH++FS DT+LYILGR+RKF QI
Sbjct: 65   FLSNGLIDKVLKRVRFSHGNPLLAFELFKYTGKRKGFYHTAFSLDTMLYILGRSRKFYQI 124

Query: 411  WEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLR 590
            WEVLI+++RKD+SLITPRT+Q+VLARIAKVCSVR+TV+SF +F+K V EFDT CFNALLR
Sbjct: 125  WEVLIDIKRKDQSLITPRTMQVVLARIAKVCSVRETVDSFRRFKKFVSEFDTACFNALLR 184

Query: 591  TLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVS 770
            TLCQEK M DARNVYHSLKH F+PNLQTFNILLSGWKSSEEAEGFF EM  +GVKPD+VS
Sbjct: 185  TLCQEKCMKDARNVYHSLKHDFRPNLQTFNILLSGWKSSEEAEGFFNEMRGLGVKPDVVS 244

Query: 771  YNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMK 950
            YNCL+DVYCK+R++DKAY+V+++M DEEI PDVITYTSIIGGLGL+GQPDKA+DVLKEMK
Sbjct: 245  YNCLIDVYCKNRDIDKAYRVVERMTDEEIWPDVITYTSIIGGLGLVGQPDKAKDVLKEMK 304

Query: 951  EYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLE 1130
            E+G YPDVAAYNA IRNFCIAKRLGD YNLMDEMVGKGL PNATTYNLFFR FYWSNDL 
Sbjct: 305  EHGCYPDVAAYNAAIRNFCIAKRLGDAYNLMDEMVGKGLSPNATTYNLFFRVFYWSNDLR 364

Query: 1131 SSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLF 1310
            SS SLY+RMMDSGCLPNTQSCMFLIRLFR+ EK+ MAL LWNDMVEKGFGSYVLVSDVLF
Sbjct: 365  SSCSLYQRMMDSGCLPNTQSCMFLIRLFRRHEKVGMALQLWNDMVEKGFGSYVLVSDVLF 424

Query: 1311 DLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPV 1490
            DLLCD+GKL EAEKCF +M+EK  KPSNVSF+RIKVLMELA + EA++ L EKMA+FG  
Sbjct: 425  DLLCDMGKLVEAEKCFSEMIEKRHKPSNVSFRRIKVLMELANKHEAVKNLKEKMAVFGSS 484

Query: 1491 VQVHERED 1514
            +Q+   E+
Sbjct: 485  IQLPGGEE 492


>ref|XP_006492377.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Citrus sinensis]
          Length = 506

 Score =  733 bits (1892), Expect = 0.0
 Identities = 351/481 (72%), Positives = 424/481 (88%), Gaps = 1/481 (0%)
 Frame = +3

Query: 75   LNSITQNPNSLLFR-FFCSVNSESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLI 251
            L+ +++N  SL+ R +F S  S  N DV+TIF+II SS S ++LK+SLK+S++ +SNDL+
Sbjct: 14   LSFLSKNATSLISRLYFSSDISPKNDDVDTIFRIITSSTSCKHLKQSLKSSEVNISNDLV 73

Query: 252  DKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEM 431
            DK+LKR+RFSH NPLQAL+F+ Y   RRGF+H++FS DT+LY+LGR R+FD IW+VL + 
Sbjct: 74   DKILKRVRFSHGNPLQALDFYRYIDNRRGFFHTAFSLDTMLYMLGRGRRFDIIWDVLADT 133

Query: 432  RRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKS 611
            +RKD+SLI+PRT+Q+VLAR+AKVCSVRQTVESF KF+KLVP+FD TCFNALLRTLCQEKS
Sbjct: 134  KRKDQSLISPRTIQVVLARVAKVCSVRQTVESFKKFKKLVPDFDITCFNALLRTLCQEKS 193

Query: 612  MSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDV 791
            M+DARNVYHSLK+ F+PNLQTFNILLSGWKS +EAEGF EEM +MGVKPDIVSYNCL+DV
Sbjct: 194  MTDARNVYHSLKYDFRPNLQTFNILLSGWKSVDEAEGFLEEMREMGVKPDIVSYNCLIDV 253

Query: 792  YCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPD 971
            YCK R+++KAYK+++KMRDE+ISPDVI+YTSIIGGLGL+GQPDKARDVLKEMKEYG YPD
Sbjct: 254  YCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLVGQPDKARDVLKEMKEYGCYPD 313

Query: 972  VAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYK 1151
             AAYNA IRN+CIAKRL D   LMDEMV KGL PNATTYNLFFR FYWSNDL SSW+LY 
Sbjct: 314  AAAYNAAIRNYCIAKRLRDASGLMDEMVEKGLSPNATTYNLFFRVFYWSNDLRSSWNLYC 373

Query: 1152 RMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLG 1331
            RMM +GCLPNTQSCMFL++L +++EK+E+AL LWNDMVEKGFGSY+LVSDVLFDLLCD+G
Sbjct: 374  RMMGTGCLPNTQSCMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYILVSDVLFDLLCDMG 433

Query: 1332 KLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHERE 1511
            KL EAEK FL+M+EKG KPS VSF+RIKVLMELA ++EALQ LS KMA+FGP + + +RE
Sbjct: 434  KLVEAEKSFLEMIEKGHKPSQVSFRRIKVLMELANKQEALQNLSNKMALFGPSM-IPKRE 492

Query: 1512 D 1514
            +
Sbjct: 493  E 493



 Score = 59.7 bits (143), Expect = 5e-06
 Identities = 93/442 (21%), Positives = 163/442 (36%), Gaps = 29/442 (6%)
 Frame = +3

Query: 798  KSREMDKAYKVI--DKMRDEE-ISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYP 968
            + R  D  + V+   K +D+  ISP  I    ++  +  +    +  +  K+ K+     
Sbjct: 119  RGRRFDIIWDVLADTKRKDQSLISPRTIQV--VLARVAKVCSVRQTVESFKKFKKLVPDF 176

Query: 969  DVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLY 1148
            D+  +NA++R  C  K + D  N+   +      PN  T+N+    +     ++ +    
Sbjct: 177  DITCFNALLRTLCQEKSMTDARNVYHSLK-YDFRPNLQTFNILLSGW---KSVDEAEGFL 232

Query: 1149 KRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDL 1328
            + M + G  P+  S   LI ++ K  ++E A  +   M ++     V+    +   L  +
Sbjct: 233  EEMREMGVKPDIVSYNCLIDVYCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLV 292

Query: 1329 GKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHER 1508
            G+ D+A     +M E G  P   ++        +AKR      L ++M            
Sbjct: 293  GQPDKARDVLKEMKEYGCYPDAAAYNAAIRNYCIAKRLRDASGLMDEMV----------- 341

Query: 1509 EDTGAYSDSRLC*IKYGIYF*FGKTIHEWIKD*TFGEIAHPKLLYSRSVARQHLSRYNYF 1688
             + G   ++      Y ++F     +  W  D                      S +N +
Sbjct: 342  -EKGLSPNATT----YNLFF----RVFYWSNDLR--------------------SSWNLY 372

Query: 1689 TRMMGSRCLPNTQGGGDGA*AVE**GQCFWVIH----------------------FG--- 1793
             RMMG+ CLPNTQ              C +++                       FG   
Sbjct: 373  CRMMGTGCLPNTQS-------------CMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYI 419

Query: 1794 -IGFVI*YLLCDMGKFVMQSGVFY*ILRIRL**YGDEIWMVDKECMSSDVSSRGIKFLVE 1970
             +  V+  LLCDMGK V     F              + M++K    S VS R IK L+E
Sbjct: 420  LVSDVLFDLLCDMGKLVEAEKSF--------------LEMIEKGHKPSQVSFRRIKVLME 465

Query: 1971 RVNGQATILNMLVKKVVFGSAV 2036
              N Q  + N+  K  +FG ++
Sbjct: 466  LANKQEALQNLSNKMALFGPSM 487


>ref|XP_006444562.1| hypothetical protein CICLE_v10019795mg [Citrus clementina]
            gi|557546824|gb|ESR57802.1| hypothetical protein
            CICLE_v10019795mg [Citrus clementina]
          Length = 506

 Score =  731 bits (1888), Expect = 0.0
 Identities = 350/481 (72%), Positives = 423/481 (87%), Gaps = 1/481 (0%)
 Frame = +3

Query: 75   LNSITQNPNSLLFR-FFCSVNSESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLI 251
            L+ +++N  SL+ R +F S  S  N DV+TIF+II SS S ++LK+SLK+S++ +SNDL+
Sbjct: 14   LSFLSKNATSLISRLYFSSDISPKNDDVDTIFRIITSSTSCKHLKQSLKSSEVNISNDLV 73

Query: 252  DKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEM 431
            DK+LKR+RFSH NPLQAL+F+ Y   RRGF+H++FS DT+LY+LGR R+FD IW+VL + 
Sbjct: 74   DKILKRVRFSHGNPLQALDFYRYIDNRRGFFHTAFSLDTMLYMLGRGRRFDIIWDVLADT 133

Query: 432  RRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKS 611
            +RKD+SLI+PRT+Q+VLAR+AKVCSVRQTVESF KF+KLVP+FD TCFNALLRTLCQEKS
Sbjct: 134  KRKDQSLISPRTIQVVLARVAKVCSVRQTVESFKKFKKLVPDFDITCFNALLRTLCQEKS 193

Query: 612  MSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDV 791
            M+DARNVYHSLK+ F+PNLQTFNILLSGWKS +EAEGF EEM +MGVKPDIVSYNCL+DV
Sbjct: 194  MTDARNVYHSLKYDFRPNLQTFNILLSGWKSVDEAEGFLEEMREMGVKPDIVSYNCLIDV 253

Query: 792  YCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPD 971
            YCK R+++KAYK+++KMRDE+ISPDVI+YTSIIGGLGL+GQPDKARDVLKEMKEYG YPD
Sbjct: 254  YCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLVGQPDKARDVLKEMKEYGCYPD 313

Query: 972  VAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYK 1151
             AAYNA IRN+CIAKRL D   LMDEMV KGL PNATTYNLFFR FYWSNDL SSW+LY 
Sbjct: 314  AAAYNAAIRNYCIAKRLRDASGLMDEMVEKGLSPNATTYNLFFRVFYWSNDLRSSWNLYC 373

Query: 1152 RMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLG 1331
            RMM +GCLPNTQSCMFL++L +++EK+E+AL LWNDMVEKGFGSY+LVSDVLFDLLCD+G
Sbjct: 374  RMMGTGCLPNTQSCMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYILVSDVLFDLLCDMG 433

Query: 1332 KLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHERE 1511
            KL EAEK FL+M+EKG KPS VSF+RIK LMELA ++EALQ LS KMA+FGP + + +RE
Sbjct: 434  KLVEAEKSFLEMIEKGHKPSQVSFRRIKALMELANKQEALQNLSNKMALFGPSM-IPKRE 492

Query: 1512 D 1514
            +
Sbjct: 493  E 493



 Score = 59.3 bits (142), Expect = 7e-06
 Identities = 93/442 (21%), Positives = 163/442 (36%), Gaps = 29/442 (6%)
 Frame = +3

Query: 798  KSREMDKAYKVI--DKMRDEE-ISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYP 968
            + R  D  + V+   K +D+  ISP  I    ++  +  +    +  +  K+ K+     
Sbjct: 119  RGRRFDIIWDVLADTKRKDQSLISPRTIQV--VLARVAKVCSVRQTVESFKKFKKLVPDF 176

Query: 969  DVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLY 1148
            D+  +NA++R  C  K + D  N+   +      PN  T+N+    +     ++ +    
Sbjct: 177  DITCFNALLRTLCQEKSMTDARNVYHSLK-YDFRPNLQTFNILLSGW---KSVDEAEGFL 232

Query: 1149 KRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDL 1328
            + M + G  P+  S   LI ++ K  ++E A  +   M ++     V+    +   L  +
Sbjct: 233  EEMREMGVKPDIVSYNCLIDVYCKDRQVEKAYKIVEKMRDEDISPDVISYTSIIGGLGLV 292

Query: 1329 GKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHER 1508
            G+ D+A     +M E G  P   ++        +AKR      L ++M            
Sbjct: 293  GQPDKARDVLKEMKEYGCYPDAAAYNAAIRNYCIAKRLRDASGLMDEMV----------- 341

Query: 1509 EDTGAYSDSRLC*IKYGIYF*FGKTIHEWIKD*TFGEIAHPKLLYSRSVARQHLSRYNYF 1688
             + G   ++      Y ++F     +  W  D                      S +N +
Sbjct: 342  -EKGLSPNATT----YNLFF----RVFYWSNDLR--------------------SSWNLY 372

Query: 1689 TRMMGSRCLPNTQGGGDGA*AVE**GQCFWVIH----------------------FG--- 1793
             RMMG+ CLPNTQ              C +++                       FG   
Sbjct: 373  CRMMGTGCLPNTQS-------------CMFLVKLCKRQEKVEIALQLWNDMVEKGFGSYI 419

Query: 1794 -IGFVI*YLLCDMGKFVMQSGVFY*ILRIRL**YGDEIWMVDKECMSSDVSSRGIKFLVE 1970
             +  V+  LLCDMGK V     F              + M++K    S VS R IK L+E
Sbjct: 420  LVSDVLFDLLCDMGKLVEAEKSF--------------LEMIEKGHKPSQVSFRRIKALME 465

Query: 1971 RVNGQATILNMLVKKVVFGSAV 2036
              N Q  + N+  K  +FG ++
Sbjct: 466  LANKQEALQNLSNKMALFGPSM 487


>ref|XP_003518677.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Glycine max]
          Length = 500

 Score =  730 bits (1884), Expect = 0.0
 Identities = 357/502 (71%), Positives = 421/502 (83%), Gaps = 2/502 (0%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANP--ILNHSLNSITQNPNSLLFRFFCSVNSESNQDVETIFQIIN 179
            M LRRFSF+SP+  Y  P  ++ H L                 ++S  N DV+ +F I++
Sbjct: 1    MLLRRFSFQSPSN-YIPPSTLIRHRL-----------------LSSNQNDDVQKVFGILS 42

Query: 180  SSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFS 359
            S+ + + LK+SLK+S +FLSND+ID+VLKR+RFSH NP Q LEFF YT RR+GFYHSSFS
Sbjct: 43   STSTPEQLKQSLKSSGVFLSNDVIDQVLKRVRFSHGNPSQTLEFFRYTGRRKGFYHSSFS 102

Query: 360  YDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKF 539
             DT+LYILGR+R F Q+WE+LIE RRKD++ IT RTV +VL RIAKVCSVRQTVESF KF
Sbjct: 103  LDTMLYILGRSRMFGQVWELLIEARRKDQTAITARTVMVVLGRIAKVCSVRQTVESFRKF 162

Query: 540  RKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAE 719
            RKLV EFDT CFNALLRTLCQEKSM+DARNVYHSLKH+F+PNLQTFNILLSGWK+ E+A+
Sbjct: 163  RKLVQEFDTNCFNALLRTLCQEKSMADARNVYHSLKHRFRPNLQTFNILLSGWKTPEDAD 222

Query: 720  GFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGL 899
             FF+EM +MGV PD+V+YN L+DVYCK RE++KAYK++D+MRD++ SPDVITYT IIGGL
Sbjct: 223  LFFKEMKEMGVTPDVVTYNSLMDVYCKGREIEKAYKMLDEMRDQDFSPDVITYTCIIGGL 282

Query: 900  GLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNA 1079
            GLIGQPDKAR+VLKEMKEYG YPD AAYNA IRNFCIAKRLGD + L++EMV KGL PNA
Sbjct: 283  GLIGQPDKARNVLKEMKEYGCYPDAAAYNAAIRNFCIAKRLGDAHGLVEEMVTKGLSPNA 342

Query: 1080 TTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWND 1259
            TTYNLFFR FYWSNDL+SSW++Y+RMM  GCLPNTQSCMFLIRLFR+ EK+EMAL  W D
Sbjct: 343  TTYNLFFRVFYWSNDLQSSWNMYQRMMVEGCLPNTQSCMFLIRLFRRHEKVEMALQFWGD 402

Query: 1260 MVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKR 1439
            MVEKGFGSY LVSDVLFDLLCD+GKL+EAEKCFL+MVEKGQKPS+VSF+RIKVLMELA R
Sbjct: 403  MVEKGFGSYTLVSDVLFDLLCDMGKLEEAEKCFLEMVEKGQKPSHVSFRRIKVLMELANR 462

Query: 1440 EEALQCLSEKMAIFGPVVQVHE 1505
             EALQ L +KMA+FG  +QV +
Sbjct: 463  HEALQSLMQKMAMFGRPLQVDQ 484


>ref|XP_004495883.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cicer arietinum]
          Length = 502

 Score =  728 bits (1880), Expect = 0.0
 Identities = 354/500 (70%), Positives = 416/500 (83%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANPILNHSLNSITQNPNSLLFRFFCSVNSESNQDVETIFQIINSS 185
            MFLR ++  SP R                 P++L+ R   S N   N DV  +F I+++S
Sbjct: 1    MFLRHYTSESPFRSIL--------------PSNLILRRLFSFNP--NDDVNKVFNILSNS 44

Query: 186  ESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYD 365
             S ++L+++LK+S IFLSN+LID+VLKR+RF HANP Q LEFFNYT RR+GFYH++FS D
Sbjct: 45   SSPEHLQQTLKSSGIFLSNELIDQVLKRVRFGHANPSQTLEFFNYTGRRKGFYHTAFSLD 104

Query: 366  TILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRK 545
            T+LYILGR+R F+ +W++L E RRKDR++ITPRTV +VLAR+AKVCSV+QTVESF KF+K
Sbjct: 105  TMLYILGRSRMFNHVWDLLTEARRKDRTVITPRTVMVVLARVAKVCSVKQTVESFRKFKK 164

Query: 546  LVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGF 725
            +VP+F T CFN+LLRTLCQEKSM+DARNVYHSLKH F PNLQTFNILLSGWK+ E+AE F
Sbjct: 165  IVPDFGTDCFNSLLRTLCQEKSMTDARNVYHSLKHSFHPNLQTFNILLSGWKTPEDAESF 224

Query: 726  FEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGL 905
            F+EM +MGV+PD+V+YN LVDVYCK RE+DKAYKV D+MR+ ++SPDVITYT IIGGLGL
Sbjct: 225  FKEMKEMGVEPDVVTYNSLVDVYCKGREIDKAYKVFDEMRERDLSPDVITYTCIIGGLGL 284

Query: 906  IGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATT 1085
            IGQPDKARDVLKEMKE+G YPDV AYNA IRNFCIAKRLGD Y+L+DEM  KGL PNATT
Sbjct: 285  IGQPDKARDVLKEMKEFGIYPDVPAYNAAIRNFCIAKRLGDAYDLVDEMTNKGLSPNATT 344

Query: 1086 YNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMV 1265
            YNLFFR +YWSNDL SSWSLYKRMM  GCLPNTQSCMFLIRL +K EK EMAL LW DMV
Sbjct: 345  YNLFFRIYYWSNDLPSSWSLYKRMMVEGCLPNTQSCMFLIRLLKKHEKAEMALQLWGDMV 404

Query: 1266 EKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREE 1445
            EKGFGSY LVSDVLFDLLCD+GKL EAEKCFL+MVEKGQKPSNVSF+RIKVLMELA R E
Sbjct: 405  EKGFGSYTLVSDVLFDLLCDMGKLLEAEKCFLEMVEKGQKPSNVSFRRIKVLMELANRHE 464

Query: 1446 ALQCLSEKMAIFGPVVQVHE 1505
            A+Q L++KM +FG  +QV E
Sbjct: 465  AIQNLTQKMGVFGQTLQVRE 484


>ref|XP_004150840.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cucumis sativus]
          Length = 495

 Score =  727 bits (1876), Expect = 0.0
 Identities = 356/492 (72%), Positives = 423/492 (85%), Gaps = 3/492 (0%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANPILNHSLNSITQNPNSLLFR--FFCSVNSES-NQDVETIFQII 176
            M LRR +F     RY +PI           P S+ F   F  S +++S +Q++ET+F+II
Sbjct: 1    MILRRPNFFQSPLRYFSPI-----------PLSIFFSHPFSSSTDNQSLHQNIETVFRII 49

Query: 177  NSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSF 356
             +S S+ ++K SL++S++FLSN+LID VLKR+RFSH NPLQALEFFNYT++RRGFYH+SF
Sbjct: 50   TTSSSSTDMKHSLESSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTAKRRGFYHTSF 109

Query: 357  SYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGK 536
            S DT+LYILGR+RKFD+IW+VL++++ KD SLI+ RTV +VL RIAKVCSVRQTVESF K
Sbjct: 110  SVDTMLYILGRSRKFDKIWDVLLDVKFKDPSLISLRTVMVVLGRIAKVCSVRQTVESFRK 169

Query: 537  FRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEA 716
            F+K VPEFD TCFNALLRTLCQEKSM DARNVYH LK  F+PNLQTFNILLSGWKSSEEA
Sbjct: 170  FKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHGLKSMFRPNLQTFNILLSGWKSSEEA 229

Query: 717  EGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGG 896
            EGFF+EM++MGVKPD+VSYNCLVDVYCK+REMDKA+KV+ KMRDE+I  DVITYTSIIGG
Sbjct: 230  EGFFDEMIEMGVKPDVVSYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVITYTSIIGG 289

Query: 897  LGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPN 1076
            LGL+GQPDKAR++LKEMKEYG YPDVAAYNA IRNFCIAKRL + ++L+DEMV KGL PN
Sbjct: 290  LGLVGQPDKARNILKEMKEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMVNKGLSPN 349

Query: 1077 ATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWN 1256
            ATTYNLFFR F+WSNDL+S+W+LY+RMMD+GCLPNTQSC+FL+RLF+K EK EMAL LWN
Sbjct: 350  ATTYNLFFRIFFWSNDLQSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEEMALELWN 409

Query: 1257 DMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAK 1436
            DM++KGFGSY+LVS+ LFDLLCDLGKL EAE CFLQMV+KG KPS  SFKRIKVLMELA 
Sbjct: 410  DMIQKGFGSYILVSEELFDLLCDLGKLIEAESCFLQMVDKGHKPSYTSFKRIKVLMELAN 469

Query: 1437 REEALQCLSEKM 1472
            + EALQ LS+KM
Sbjct: 470  KHEALQNLSKKM 481


>ref|XP_004163923.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Cucumis sativus]
          Length = 495

 Score =  726 bits (1873), Expect = 0.0
 Identities = 355/492 (72%), Positives = 423/492 (85%), Gaps = 3/492 (0%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANPILNHSLNSITQNPNSLLFR--FFCSVNSES-NQDVETIFQII 176
            M LRR +F     RY +PI           P S+ +   F  S +++S +Q++ET+F+II
Sbjct: 1    MILRRPNFFQSPLRYFSPI-----------PLSIFYSHPFSSSTDNQSLHQNIETVFRII 49

Query: 177  NSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSF 356
             +S S+ ++K SL++S++FLSN+LID VLKR+RFSH NPLQALEFFNYT++RRGFYH+SF
Sbjct: 50   TTSSSSTDMKHSLESSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTAKRRGFYHTSF 109

Query: 357  SYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGK 536
            S DT+LYILGR+RKFD+IW+VL++++ KD SLI+ RTV +VL RIAKVCSVRQTVESF K
Sbjct: 110  SVDTMLYILGRSRKFDKIWDVLLDVKFKDPSLISLRTVMVVLGRIAKVCSVRQTVESFRK 169

Query: 537  FRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEA 716
            F+K VPEFD TCFNALLRTLCQEKSM DARNVYH LK  F+PNLQTFNILLSGWKSSEEA
Sbjct: 170  FKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHGLKSMFRPNLQTFNILLSGWKSSEEA 229

Query: 717  EGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGG 896
            EGFF+EM++MGVKPD+VSYNCLVDVYCK+REMDKA+KV+ KMRDE+I  DVITYTSIIGG
Sbjct: 230  EGFFDEMIEMGVKPDVVSYNCLVDVYCKNREMDKAFKVVGKMRDEDIPADVITYTSIIGG 289

Query: 897  LGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPN 1076
            LGL+GQPDKAR++LKEMKEYG YPDVAAYNA IRNFCIAKRL + ++L+DEMV KGL PN
Sbjct: 290  LGLVGQPDKARNILKEMKEYGCYPDVAAYNATIRNFCIAKRLHEAFDLLDEMVNKGLSPN 349

Query: 1077 ATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWN 1256
            ATTYNLFFR F+WSNDL+S+W+LY+RMMD+GCLPNTQSC+FL+RLF+K EK EMAL LWN
Sbjct: 350  ATTYNLFFRIFFWSNDLQSAWNLYRRMMDTGCLPNTQSCLFLVRLFKKYEKEEMALELWN 409

Query: 1257 DMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAK 1436
            DM++KGFGSY+LVS+ LFDLLCDLGKL EAE CFLQMV+KG KPS  SFKRIKVLMELA 
Sbjct: 410  DMIQKGFGSYILVSEELFDLLCDLGKLIEAESCFLQMVDKGHKPSYTSFKRIKVLMELAN 469

Query: 1437 REEALQCLSEKM 1472
            + EALQ LS+KM
Sbjct: 470  KHEALQNLSKKM 481


>ref|XP_007132128.1| hypothetical protein PHAVU_011G069000g [Phaseolus vulgaris]
            gi|561005128|gb|ESW04122.1| hypothetical protein
            PHAVU_011G069000g [Phaseolus vulgaris]
          Length = 500

 Score =  724 bits (1869), Expect = 0.0
 Identities = 353/500 (70%), Positives = 415/500 (83%)
 Frame = +3

Query: 6    MFLRRFSFRSPTRRYANPILNHSLNSITQNPNSLLFRFFCSVNSESNQDVETIFQIINSS 185
            M +RRFS +SP        LN +L+S             C  +S  N DV  +F I++S+
Sbjct: 1    MLVRRFSGKSP--------LNCALSSTVIR--------HCFSSSNENDDVRKVFGILSST 44

Query: 186  ESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYD 365
             + + LK+SLK S +FLSN+LID+VLKR+RFSH NP Q LEFF YT RR+GFYH++FS D
Sbjct: 45   STPEQLKQSLKASGVFLSNELIDQVLKRVRFSHGNPSQTLEFFRYTGRRKGFYHTAFSLD 104

Query: 366  TILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRK 545
            T+LYILGR+R F  +W++LIE RRKD++ IT RTV +VL R+AKVCSVRQTV+SF KFRK
Sbjct: 105  TMLYILGRSRMFGHVWDLLIECRRKDQTAITARTVMVVLGRVAKVCSVRQTVDSFRKFRK 164

Query: 546  LVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGF 725
            LV EFDT CFNALLRTLCQEKSM+DARNVYHSLKH+F+PNLQTFNILLSGWK+ E+A+GF
Sbjct: 165  LVAEFDTNCFNALLRTLCQEKSMTDARNVYHSLKHRFRPNLQTFNILLSGWKTPEDADGF 224

Query: 726  FEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGL 905
            F+EM +MGV PD+V+YN LVDVYCK RE++KAYKV+D+MRD ++SPDVITYT IIGGLGL
Sbjct: 225  FKEMKEMGVTPDVVTYNSLVDVYCKGREIEKAYKVLDEMRDRDLSPDVITYTCIIGGLGL 284

Query: 906  IGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATT 1085
            IGQPDKAR VLKEMKEYG YPD AAYNA IRNFCIAKRLGD + L+ EMV  GL PNATT
Sbjct: 285  IGQPDKARGVLKEMKEYGCYPDAAAYNAAIRNFCIAKRLGDAHGLVKEMVSMGLCPNATT 344

Query: 1086 YNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMV 1265
            YNLFFR FYWSNDL SSW +YKRMM  GCLPNTQSCMFLIRLFRK EK+EMAL LW +MV
Sbjct: 345  YNLFFRVFYWSNDLHSSWIMYKRMMVEGCLPNTQSCMFLIRLFRKHEKVEMALQLWENMV 404

Query: 1266 EKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREE 1445
            EKGFGSY LVSDVLFDLLCD+GKL+EAEKCFL+M+EKGQKPSNVSF+RIKVLMELA R E
Sbjct: 405  EKGFGSYTLVSDVLFDLLCDMGKLEEAEKCFLEMIEKGQKPSNVSFRRIKVLMELANRHE 464

Query: 1446 ALQCLSEKMAIFGPVVQVHE 1505
            AL+ L++KM+IFG  +Q+H+
Sbjct: 465  ALESLTQKMSIFGRPLQLHQ 484


>ref|XP_004295891.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Fragaria vesca subsp. vesca]
          Length = 504

 Score =  721 bits (1861), Expect = 0.0
 Identities = 344/463 (74%), Positives = 403/463 (87%)
 Frame = +3

Query: 126  SVNSESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQAL 305
            S  S +  DV+T++ I++SS  ++NLK+SLK+  +FL+NDL D+VLKR RFSH NPLQAL
Sbjct: 30   SSQSSTPNDVDTVYCIVSSSAHSKNLKQSLKSCGVFLTNDLTDEVLKRARFSHGNPLQAL 89

Query: 306  EFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLA 485
            EFFNYT  RRGFYH+SFS DT+LY+LGR+R F ++WEVL+E + KDRSLITPRTV +VLA
Sbjct: 90   EFFNYTGNRRGFYHTSFSLDTMLYMLGRSRMFKKMWEVLVETKHKDRSLITPRTVMVVLA 149

Query: 486  RIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPN 665
            RIAKVCSVR+TVE F KF+KLVPEFDT CFN+LLRTLCQEKSM+DARNVYH LKH F+PN
Sbjct: 150  RIAKVCSVRETVECFKKFKKLVPEFDTACFNSLLRTLCQEKSMTDARNVYHKLKHSFEPN 209

Query: 666  LQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMR 845
            LQTFNILLSGWKSSEEAEGFFEEM ++G+KPD+VSYNCL+DVY K+REM+K +KV++KMR
Sbjct: 210  LQTFNILLSGWKSSEEAEGFFEEMRELGLKPDVVSYNCLIDVYSKNREMEKVFKVMEKMR 269

Query: 846  DEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLG 1025
            DEEI PD ITYT +IGG GL+GQPDKARDVLKEMKE G YPDVAAYNA IRNFCIAKRLG
Sbjct: 270  DEEIWPDKITYTCVIGGFGLVGQPDKARDVLKEMKELGCYPDVAAYNAAIRNFCIAKRLG 329

Query: 1026 DGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLI 1205
            D   LM+EM+  GL PNATTYNLFFR F+WS+DL++SWSLY RMM  GCLPNTQSCMFLI
Sbjct: 330  DANGLMEEMMSNGLSPNATTYNLFFRVFFWSSDLQNSWSLYGRMMYMGCLPNTQSCMFLI 389

Query: 1206 RLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQK 1385
            RLFRK EK+++AL LWNDM+E+GFGSY+LVSDVLFDLLCD+GKL EAE CFLQMVEKG K
Sbjct: 390  RLFRKLEKVDLALQLWNDMIERGFGSYILVSDVLFDLLCDMGKLTEAETCFLQMVEKGHK 449

Query: 1386 PSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHERED 1514
            PSNVSF+RIKVLMELA + EAL+ L+EKMA+FG  + + E  D
Sbjct: 450  PSNVSFRRIKVLMELANKHEALKNLTEKMALFGSSIHLPESMD 492


>ref|XP_003591356.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355480404|gb|AES61607.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 518

 Score =  716 bits (1847), Expect = 0.0
 Identities = 339/477 (71%), Positives = 411/477 (86%)
 Frame = +3

Query: 87   TQNPNSLLFRFFCSVNSESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLK 266
            +++P + + R   + N   N DV  ++ I+ ++ S + LK+SLK++QIFLSN+LID+VLK
Sbjct: 8    SKSPFTSILRRLSTFNP--NDDVHKVYTILTTTSSPETLKQSLKSTQIFLSNELIDQVLK 65

Query: 267  RIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDR 446
            R+RF HANP Q LEFF YT RR+GFYH+++S DT+LYILGR+R FD +WE+LIE RRKD+
Sbjct: 66   RVRFGHANPNQTLEFFRYTGRRKGFYHTAYSLDTMLYILGRSRMFDHVWELLIEARRKDQ 125

Query: 447  SLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSDAR 626
            ++ITPRTV +VL R+AKVCSVRQTVE+F KF+K+VP++   CFNALLRTLCQEKSM+DAR
Sbjct: 126  NVITPRTVMVVLGRVAKVCSVRQTVETFRKFKKIVPDYGVNCFNALLRTLCQEKSMTDAR 185

Query: 627  NVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSR 806
            NVYHSLKH F+PNLQTFNILLSGWK+ E+AE F  EM +MGV+PD+V+YN LVDVYCK R
Sbjct: 186  NVYHSLKHNFRPNLQTFNILLSGWKNVEDAELFVNEMKEMGVEPDVVTYNSLVDVYCKGR 245

Query: 807  EMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYN 986
            E++KAYKV D+MR++++SPDVITYTS+IGGLGL+GQPDKARDVLKEMKEYG YPDV AYN
Sbjct: 246  EIEKAYKVFDEMREKDLSPDVITYTSVIGGLGLVGQPDKARDVLKEMKEYGVYPDVPAYN 305

Query: 987  AVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDS 1166
            A IRN+CIAKRLG  + L+DEMV KGL PNATTYNLFFR FYWSNDL+SSW+LYKRMM  
Sbjct: 306  AAIRNYCIAKRLGIAFELVDEMVNKGLSPNATTYNLFFRVFYWSNDLQSSWNLYKRMMGE 365

Query: 1167 GCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEA 1346
            GCLP TQSCMFLIRLF++ EK+EMAL LW +MVEKGFGSY LVSDVLFD+LCD+GKL EA
Sbjct: 366  GCLPYTQSCMFLIRLFKRHEKMEMALQLWGEMVEKGFGSYTLVSDVLFDMLCDMGKLMEA 425

Query: 1347 EKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHEREDT 1517
            EKCFL+M+EKGQ+PSNVSFKRIKVLMELA + EA+Q L++KMAIFG  +QVHER  T
Sbjct: 426  EKCFLEMIEKGQRPSNVSFKRIKVLMELANKHEAIQNLTQKMAIFGRPLQVHERVAT 482


>ref|XP_002310039.2| hypothetical protein POPTR_0007s06780g [Populus trichocarpa]
            gi|550334290|gb|EEE90489.2| hypothetical protein
            POPTR_0007s06780g [Populus trichocarpa]
          Length = 509

 Score =  714 bits (1842), Expect = 0.0
 Identities = 354/511 (69%), Positives = 423/511 (82%), Gaps = 4/511 (0%)
 Frame = +3

Query: 12   LRRFSFRSPTRRYAN----PILNHSLNSITQNPNSLLFRFFCSVNSESNQDVETIFQIIN 179
            L+  SF     RY +    P L  SL++  QN N+             +  V+ I+ II+
Sbjct: 8    LKSLSFELQKSRYVSSSKIPFL--SLHTNPQNHNN------------KDIQVDAIYNIIS 53

Query: 180  SSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFS 359
            +S S+QNLK+SLK++ +FLSNDLIDKVLKR+RFSH NPLQAL+FFN+T+ RRGFYHSS+S
Sbjct: 54   NSTSSQNLKQSLKSTGVFLSNDLIDKVLKRVRFSHGNPLQALDFFNFTADRRGFYHSSYS 113

Query: 360  YDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKF 539
             DT+LYILGR+RKFD IW+VLI+++RKDR+LITPRT+Q+VL R+AKVCSVR TVESF KF
Sbjct: 114  LDTMLYILGRSRKFDHIWDVLIDIKRKDRNLITPRTLQVVLGRVAKVCSVRMTVESFWKF 173

Query: 540  RKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAE 719
            ++LVP FDT+CFNALLRTLCQEKSMSDARNVYH LK  F+PNLQTFNILLSGWKSSEEAE
Sbjct: 174  KRLVPVFDTSCFNALLRTLCQEKSMSDARNVYHHLKKGFRPNLQTFNILLSGWKSSEEAE 233

Query: 720  GFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGL 899
             F+EEM ++GVKPDIV+YN L+DV+CK RE++KAY V+ +MR+E+I PDVITYTSIIGGL
Sbjct: 234  LFYEEMKELGVKPDIVTYNSLIDVFCKGRELEKAYGVVARMREEDILPDVITYTSIIGGL 293

Query: 900  GLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNA 1079
            GL+GQPDKARD+LKEMKE+G YPDVAAYNAVIRN+CIAKRL   Y+LM EM  KG+ PNA
Sbjct: 294  GLVGQPDKARDMLKEMKEHGCYPDVAAYNAVIRNYCIAKRLDAAYSLMAEMESKGMSPNA 353

Query: 1080 TTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWND 1259
            T+YNLFFR F WSNDL +SW  Y RMMD+GCLPNTQSCMFLI+LF++ EK+EMAL LWND
Sbjct: 354  TSYNLFFRVFSWSNDLRNSWDFYGRMMDAGCLPNTQSCMFLIKLFKRHEKVEMALQLWND 413

Query: 1260 MVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKR 1439
            MVEKGFGSY+LVSDVL  +LCD+GKL EAEKCFLQMVEKG KPSNVSF+RIKVLMELA +
Sbjct: 414  MVEKGFGSYILVSDVLLGMLCDMGKLVEAEKCFLQMVEKGHKPSNVSFRRIKVLMELANK 473

Query: 1440 EEALQCLSEKMAIFGPVVQVHEREDTGAYSD 1532
             +A++ LSEKMAIFG  ++  E  D    SD
Sbjct: 474  HDAIRNLSEKMAIFGSSIRAPEGMDEKECSD 504



 Score = 75.9 bits (185), Expect = 7e-11
 Identities = 102/456 (22%), Positives = 179/456 (39%), Gaps = 29/456 (6%)
 Frame = +3

Query: 768  SYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLK-- 941
            S + ++ +  +SR+ D  + V+  ++ ++   ++IT  ++   LG + +    R  ++  
Sbjct: 113  SLDTMLYILGRSRKFDHIWDVLIDIKRKD--RNLITPRTLQVVLGRVAKVCSVRMTVESF 170

Query: 942  -EMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWS 1118
             + K      D + +NA++R  C  K + D  N+   +  KG  PN  T+N+    +  S
Sbjct: 171  WKFKRLVPVFDTSCFNALLRTLCQEKSMSDARNVYHHLK-KGFRPNLQTFNILLSGWKSS 229

Query: 1119 NDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVS 1298
             + E     Y+ M + G  P+  +   LI +F K  +LE A  +   M E+     V+  
Sbjct: 230  EEAEL---FYEEMKELGVKPDIVTYNSLIDVFCKGRELEKAYGVVARMREEDILPDVITY 286

Query: 1299 DVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAI 1478
              +   L  +G+ D+A     +M E G  P   ++  +     +AKR +A   L  +M  
Sbjct: 287  TSIIGGLGLVGQPDKARDMLKEMKEHGCYPDVAAYNAVIRNYCIAKRLDAAYSLMAEM-- 344

Query: 1479 FGPVVQVHEREDTGAYSDSRLC*IKYGIYF*FGKTIHEWIKD*TFGEIAHPKLLYSRSVA 1658
                      E  G   ++      Y ++F     +  W  D                  
Sbjct: 345  ----------ESKGMSPNAT----SYNLFF----RVFSWSNDLR---------------- 370

Query: 1659 RQHLSRYNYFTRMMGSRCLPNTQGGGDGA*AVE**GQCFWVIH----------------- 1787
                + ++++ RMM + CLPNTQ              C ++I                  
Sbjct: 371  ----NSWDFYGRMMDAGCLPNTQS-------------CMFLIKLFKRHEKVEMALQLWND 413

Query: 1788 -----FG----IGFVI*YLLCDMGKFVMQSGVFY*ILRIRL**YGDEIWMVDKECMSSDV 1940
                 FG    +  V+  +LCDMGK V     F              + MV+K    S+V
Sbjct: 414  MVEKGFGSYILVSDVLLGMLCDMGKLVEAEKCF--------------LQMVEKGHKPSNV 459

Query: 1941 SSRGIKFLVERVNGQATILNMLVKKVVFGSAVKCME 2048
            S R IK L+E  N    I N+  K  +FGS+++  E
Sbjct: 460  SFRRIKVLMELANKHDAIRNLSEKMAIFGSSIRAPE 495


>ref|XP_006340302.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Solanum tuberosum]
          Length = 505

 Score =  707 bits (1825), Expect = 0.0
 Identities = 338/474 (71%), Positives = 413/474 (87%), Gaps = 4/474 (0%)
 Frame = +3

Query: 126  SVNSESN----QDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANP 293
            S++S+S+    ++VET+++II  ++S + LK++LK+SQI LSNDLIDKVLKR+RFSH+NP
Sbjct: 27   SLHSQSSAPIDKEVETLYRIITVTQSPEGLKQALKSSQIKLSNDLIDKVLKRVRFSHSNP 86

Query: 294  LQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQ 473
            LQALEFF Y  +R+GFYH+ FS DTILY++GRNRKFD+IWEVL+EM+RKD+SLITPRTVQ
Sbjct: 87   LQALEFFKYADKRKGFYHTGFSLDTILYVIGRNRKFDKIWEVLVEMKRKDQSLITPRTVQ 146

Query: 474  IVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHK 653
            +VL R+AKVCSVR+TVESF  F++L+ EF   CFNALLR LCQEKSMSDARNVYH LK+K
Sbjct: 147  VVLGRVAKVCSVRETVESFWGFKRLLNEFGVDCFNALLRALCQEKSMSDARNVYHRLKYK 206

Query: 654  FKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVI 833
            F+PN QTFNILLSGWKSSE+AE FF+EM  +GV+PD+VS+NCLVDVYCK REM+KA+ V+
Sbjct: 207  FRPNNQTFNILLSGWKSSEDAEVFFKEMRDLGVEPDVVSFNCLVDVYCKGREMEKAFTVV 266

Query: 834  DKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIA 1013
            ++MR+++I+PDVITYTS+IGGLGL GQPDKAR +LKEM+EYG YPDVAAYNA IRNFCIA
Sbjct: 267  EEMREKDITPDVITYTSLIGGLGLAGQPDKARHILKEMREYGCYPDVAAYNAAIRNFCIA 326

Query: 1014 KRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSC 1193
            KR+GD Y+LMDEMV  GL PNATTYN+F R F+W NDL+SSW+LY+RM ++GCLP+TQSC
Sbjct: 327  KRIGDAYSLMDEMVRNGLSPNATTYNVFLRSFFWINDLKSSWTLYQRMKETGCLPSTQSC 386

Query: 1194 MFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVE 1373
            MFLIRL R+ EK+EMAL LW+DM+E+GFGSY+LVSDVLFDLLCDLGKL EAE+CFLQMV 
Sbjct: 387  MFLIRLSRRHEKVEMALELWDDMMERGFGSYILVSDVLFDLLCDLGKLAEAERCFLQMVN 446

Query: 1374 KGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHEREDTGAYSDS 1535
            KGQKPSNVSF+RIKVLMELA ++E L+ LSEKMA FG   Q+ +  D   Y  S
Sbjct: 447  KGQKPSNVSFRRIKVLMELANKQETLKLLSEKMAAFGSSTQLIQ-HDKAEYEQS 499


>ref|XP_004251431.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g02420-like [Solanum lycopersicum]
          Length = 500

 Score =  706 bits (1821), Expect = 0.0
 Identities = 333/466 (71%), Positives = 412/466 (88%), Gaps = 4/466 (0%)
 Frame = +3

Query: 126  SVNSESN----QDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANP 293
            S++S+S+    ++VET+++II  +++ + LK++LK+SQI LSNDLIDKVLKR+RFSH+NP
Sbjct: 27   SLHSQSSAPIDKEVETLYRIITITQTPEGLKQALKSSQIKLSNDLIDKVLKRVRFSHSNP 86

Query: 294  LQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQ 473
            LQALEFF Y  +R+GFYH+ FS DTILY+LGRNRKFD+IWEVL+EM+RKD+SLITPRTVQ
Sbjct: 87   LQALEFFKYADKRKGFYHTGFSLDTILYVLGRNRKFDKIWEVLVEMKRKDQSLITPRTVQ 146

Query: 474  IVLARIAKVCSVRQTVESFGKFRKLVPEFDTTCFNALLRTLCQEKSMSDARNVYHSLKHK 653
            +VL R+AKVCSVR+TVESF  F++L+ EF   CFNALLR LCQEKSMSDARNVYH LK+K
Sbjct: 147  VVLGRVAKVCSVRETVESFWGFKRLLNEFGVDCFNALLRALCQEKSMSDARNVYHRLKYK 206

Query: 654  FKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVI 833
            F+PN QTFNILLSGWKSSE+AE FF+EM  +GV+PD+VS+NCLVDVYCK REM+KA++V+
Sbjct: 207  FRPNNQTFNILLSGWKSSEDAEVFFKEMRDLGVEPDVVSFNCLVDVYCKGREMEKAFRVV 266

Query: 834  DKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIA 1013
            ++MR+++I+PDVITYTS+IGGLGL+GQPDKAR +LKEM+EYG YPD AAYNA +RNFCIA
Sbjct: 267  EEMREKDITPDVITYTSLIGGLGLVGQPDKARHILKEMREYGCYPDAAAYNAAVRNFCIA 326

Query: 1014 KRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSC 1193
            KR+GD Y+LMDEMV  GL PNATTYN+F R F+W NDL+SSW+LY+RM ++GCLP+TQSC
Sbjct: 327  KRIGDAYSLMDEMVRNGLSPNATTYNVFLRSFFWINDLKSSWTLYQRMKETGCLPSTQSC 386

Query: 1194 MFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVE 1373
            MFLIRL R+ EK+EMAL LW+DM+E+GFGSY+LVSDVLFDLLCDLGKL EAE+CFLQMV 
Sbjct: 387  MFLIRLSRRHEKVEMALELWDDMMERGFGSYILVSDVLFDLLCDLGKLAEAERCFLQMVN 446

Query: 1374 KGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAIFGPVVQVHERE 1511
            KGQKPSNVSF+RIKVLMELA ++EAL+ LSEKMA F    Q+ + +
Sbjct: 447  KGQKPSNVSFRRIKVLMELANKQEALKLLSEKMAAFRSSTQLIQHD 492


>ref|XP_006418341.1| hypothetical protein EUTSA_v10007463mg [Eutrema salsugineum]
            gi|557096112|gb|ESQ36694.1| hypothetical protein
            EUTSA_v10007463mg [Eutrema salsugineum]
          Length = 494

 Score =  685 bits (1768), Expect = 0.0
 Identities = 337/482 (69%), Positives = 402/482 (83%), Gaps = 2/482 (0%)
 Frame = +3

Query: 45   RYANPILNHS-LNSITQNPNSLLFRFFCSVNSESNQDVETIFQIINSSESTQNLKESLKT 221
            R++N  L+ S L+S+  + N  L        +E   D ET+F++IN S     LKESL +
Sbjct: 15   RFSNLRLSASFLHSVAISSNEKL-------PAEEADDAETVFRMINGSNLQGELKESLSS 67

Query: 222  SQIFLSNDLIDKVLKRIRFSHANPLQALEFFNYTSRRRGFYHSSFSYDTILYILGRNRKF 401
            S I LS DLI++VLKR+RFSH NPLQALEF+ Y   RRGFYHS+FS DT+LYILGRNRKF
Sbjct: 68   SGIHLSKDLIERVLKRVRFSHGNPLQALEFYRYAGARRGFYHSAFSLDTMLYILGRNRKF 127

Query: 402  DQIWEVLIEMRRKDRSLITPRTVQIVLARIAKVCSVRQTVESFGKFRKLVPEF-DTTCFN 578
            DQIWE+LIE +RKDRSLI+PRT+Q+VL R+AK+CSVRQTVESF KF++LVP+F DT CFN
Sbjct: 128  DQIWEILIEAKRKDRSLISPRTMQVVLGRVAKLCSVRQTVESFWKFKRLVPDFFDTACFN 187

Query: 579  ALLRTLCQEKSMSDARNVYHSLKHKFKPNLQTFNILLSGWKSSEEAEGFFEEMMQMGVKP 758
            ALLRTLCQEKSM+DARN YH+LKH+F+P+LQTFNILLSGW+SSEEAE FFEEM + G+KP
Sbjct: 188  ALLRTLCQEKSMTDARNAYHTLKHQFQPDLQTFNILLSGWRSSEEAEAFFEEMREKGLKP 247

Query: 759  DIVSYNCLVDVYCKSREMDKAYKVIDKMRDEEISPDVITYTSIIGGLGLIGQPDKARDVL 938
            D+V+YN L+DVYCK REM+KAY++IDKMR+E+ +PDVITYT++IGGLGLIGQPDKARDVL
Sbjct: 248  DVVTYNSLIDVYCKDREMEKAYRLIDKMREEDETPDVITYTTVIGGLGLIGQPDKARDVL 307

Query: 939  KEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGYNLMDEMVGKGLDPNATTYNLFFRCFYWS 1118
            KEMKEYG YPDV AYNA IRN+CIA+RLGD   L+DEMV KGL PNATTYNLFFR    +
Sbjct: 308  KEMKEYGCYPDVPAYNAAIRNYCIARRLGDADMLVDEMVKKGLTPNATTYNLFFRVLSLA 367

Query: 1119 NDLESSWSLYKRMMDSGCLPNTQSCMFLIRLFRKREKLEMALVLWNDMVEKGFGSYVLVS 1298
            NDL  SW LY+RM+ +GCLPNTQSCMFLI++F++ EK++MA+ LW DMV KGFGSY LVS
Sbjct: 368  NDLGRSWELYERMLANGCLPNTQSCMFLIKMFKRHEKVDMAMRLWEDMVVKGFGSYSLVS 427

Query: 1299 DVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSNVSFKRIKVLMELAKREEALQCLSEKMAI 1478
            DVL DLLCDL K+DEAEKC LQMVEKG +PSNVSFKRIK+LMELA + E L  L +KMAI
Sbjct: 428  DVLLDLLCDLAKVDEAEKCLLQMVEKGHRPSNVSFKRIKLLMELANKHEELDNLKQKMAI 487

Query: 1479 FG 1484
            FG
Sbjct: 488  FG 489


>ref|XP_002889402.1| hypothetical protein ARALYDRAFT_470202 [Arabidopsis lyrata subsp.
            lyrata] gi|297335244|gb|EFH65661.1| hypothetical protein
            ARALYDRAFT_470202 [Arabidopsis lyrata subsp. lyrata]
          Length = 490

 Score =  684 bits (1764), Expect = 0.0
 Identities = 329/450 (73%), Positives = 388/450 (86%), Gaps = 1/450 (0%)
 Frame = +3

Query: 138  ESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFN 317
            E + D ET+F++IN S     LKESL +S I LS DLID+VLKR+RFSH NP+Q LEF+ 
Sbjct: 36   EEDVDAETVFRMINGSNLQGELKESLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYR 95

Query: 318  YTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAK 497
            Y   RRGFYHSSFS DT+LYILGRNRKFDQIWE+LIE +RKDRSLI+PRT+Q+VL R+AK
Sbjct: 96   YAGARRGFYHSSFSLDTMLYILGRNRKFDQIWEILIETKRKDRSLISPRTMQVVLGRVAK 155

Query: 498  VCSVRQTVESFGKFRKLVPEF-DTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQT 674
            +CSVRQTVESF KF++LVP+F DT CFNALLRTLCQEKSM+DARNVYHSLKH+F+P+LQT
Sbjct: 156  LCSVRQTVESFWKFKRLVPDFFDTACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQT 215

Query: 675  FNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEE 854
            FNILLSGWKSSEEAE FFEEM   G+KPD+V+YN L+DVYCK RE++KAYK+IDKMR+E+
Sbjct: 216  FNILLSGWKSSEEAEAFFEEMKGKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREED 275

Query: 855  ISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGY 1034
             +PDVITYT+IIGGLGLIGQPDKAR+VLKEMKEYG YPDVAAYNA IRN+CIA+RLGD  
Sbjct: 276  ETPDVITYTTIIGGLGLIGQPDKAREVLKEMKEYGCYPDVAAYNAAIRNYCIARRLGDAD 335

Query: 1035 NLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLF 1214
             L+DEMV KGL PNATTYNLFFR    +NDL  SW LY+RM+ +GCLPNTQSCMFLI++F
Sbjct: 336  KLVDEMVKKGLSPNATTYNLFFRVLSLANDLGRSWELYERMLGNGCLPNTQSCMFLIKMF 395

Query: 1215 RKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSN 1394
            ++ EK++MA+ LW DMV KGFGSY LVSDVL DLLCDL K++EAEKC L+MVEKG +PSN
Sbjct: 396  KRHEKVDMAMRLWEDMVVKGFGSYSLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSN 455

Query: 1395 VSFKRIKVLMELAKREEALQCLSEKMAIFG 1484
            VSFKRIK+LMELA + + +  L +KMAIFG
Sbjct: 456  VSFKRIKLLMELANKHDEVNNLIQKMAIFG 485


>gb|AAG00894.1|AC064879_12 Hypothetical protein [Arabidopsis thaliana]
          Length = 490

 Score =  677 bits (1747), Expect = 0.0
 Identities = 328/449 (73%), Positives = 384/449 (85%), Gaps = 1/449 (0%)
 Frame = +3

Query: 138  ESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFN 317
            E   D ET+F++IN S     LKESL +S I LS DLID+VLKR+RFSH NP+Q LEF+ 
Sbjct: 36   EEGDDAETVFRMINGSNLQVELKESLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYR 95

Query: 318  YTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAK 497
            Y S  RGFYHSSFS DT+LYILGRNRKFDQIWE+LIE +RKDRSLI+PRT+Q+VL R+AK
Sbjct: 96   YASAIRGFYHSSFSLDTMLYILGRNRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAK 155

Query: 498  VCSVRQTVESFGKFRKLVPEF-DTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQT 674
            +CSVRQTVESF KF++LVP+F DT CFNALLRTLCQEKSM+DARNVYHSLKH+F+P+LQT
Sbjct: 156  LCSVRQTVESFWKFKRLVPDFFDTACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQT 215

Query: 675  FNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEE 854
            FNILLSGWKSSEEAE FFEEM   G+KPD+V+YN L+DVYCK RE++KAYK+IDKMR+EE
Sbjct: 216  FNILLSGWKSSEEAEAFFEEMKGKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEE 275

Query: 855  ISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGY 1034
             +PDVITYT++IGGLGLIGQPDKAR+VLKEMKEYG YPDVAAYNA IRNFCIA+RLGD  
Sbjct: 276  ETPDVITYTTVIGGLGLIGQPDKAREVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDAD 335

Query: 1035 NLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLF 1214
             L+DEMV KGL PNATTYNLFFR    +NDL  SW LY RM+ + CLPNTQSCMFLI++F
Sbjct: 336  KLVDEMVKKGLSPNATTYNLFFRVLSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMF 395

Query: 1215 RKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSN 1394
            ++ EK++MA+ LW DMV KGFGSY LVSDVL DLLCDL K++EAEKC L+MVEKG +PSN
Sbjct: 396  KRHEKVDMAMRLWEDMVVKGFGSYSLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSN 455

Query: 1395 VSFKRIKVLMELAKREEALQCLSEKMAIF 1481
            VSFKRIK+LMELA + + +  L +KMAIF
Sbjct: 456  VSFKRIKLLMELANKHDEVNNLIQKMAIF 484


>ref|NP_171744.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806504|sp|Q9FZ19.2|PPR5_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At1g02420 gi|332189307|gb|AEE27428.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 491

 Score =  677 bits (1747), Expect = 0.0
 Identities = 328/449 (73%), Positives = 384/449 (85%), Gaps = 1/449 (0%)
 Frame = +3

Query: 138  ESNQDVETIFQIINSSESTQNLKESLKTSQIFLSNDLIDKVLKRIRFSHANPLQALEFFN 317
            E   D ET+F++IN S     LKESL +S I LS DLID+VLKR+RFSH NP+Q LEF+ 
Sbjct: 37   EEGDDAETVFRMINGSNLQVELKESLSSSGIHLSKDLIDRVLKRVRFSHGNPIQTLEFYR 96

Query: 318  YTSRRRGFYHSSFSYDTILYILGRNRKFDQIWEVLIEMRRKDRSLITPRTVQIVLARIAK 497
            Y S  RGFYHSSFS DT+LYILGRNRKFDQIWE+LIE +RKDRSLI+PRT+Q+VL R+AK
Sbjct: 97   YASAIRGFYHSSFSLDTMLYILGRNRKFDQIWELLIETKRKDRSLISPRTMQVVLGRVAK 156

Query: 498  VCSVRQTVESFGKFRKLVPEF-DTTCFNALLRTLCQEKSMSDARNVYHSLKHKFKPNLQT 674
            +CSVRQTVESF KF++LVP+F DT CFNALLRTLCQEKSM+DARNVYHSLKH+F+P+LQT
Sbjct: 157  LCSVRQTVESFWKFKRLVPDFFDTACFNALLRTLCQEKSMTDARNVYHSLKHQFQPDLQT 216

Query: 675  FNILLSGWKSSEEAEGFFEEMMQMGVKPDIVSYNCLVDVYCKSREMDKAYKVIDKMRDEE 854
            FNILLSGWKSSEEAE FFEEM   G+KPD+V+YN L+DVYCK RE++KAYK+IDKMR+EE
Sbjct: 217  FNILLSGWKSSEEAEAFFEEMKGKGLKPDVVTYNSLIDVYCKDREIEKAYKLIDKMREEE 276

Query: 855  ISPDVITYTSIIGGLGLIGQPDKARDVLKEMKEYGSYPDVAAYNAVIRNFCIAKRLGDGY 1034
             +PDVITYT++IGGLGLIGQPDKAR+VLKEMKEYG YPDVAAYNA IRNFCIA+RLGD  
Sbjct: 277  ETPDVITYTTVIGGLGLIGQPDKAREVLKEMKEYGCYPDVAAYNAAIRNFCIARRLGDAD 336

Query: 1035 NLMDEMVGKGLDPNATTYNLFFRCFYWSNDLESSWSLYKRMMDSGCLPNTQSCMFLIRLF 1214
             L+DEMV KGL PNATTYNLFFR    +NDL  SW LY RM+ + CLPNTQSCMFLI++F
Sbjct: 337  KLVDEMVKKGLSPNATTYNLFFRVLSLANDLGRSWELYVRMLGNECLPNTQSCMFLIKMF 396

Query: 1215 RKREKLEMALVLWNDMVEKGFGSYVLVSDVLFDLLCDLGKLDEAEKCFLQMVEKGQKPSN 1394
            ++ EK++MA+ LW DMV KGFGSY LVSDVL DLLCDL K++EAEKC L+MVEKG +PSN
Sbjct: 397  KRHEKVDMAMRLWEDMVVKGFGSYSLVSDVLLDLLCDLAKVEEAEKCLLEMVEKGHRPSN 456

Query: 1395 VSFKRIKVLMELAKREEALQCLSEKMAIF 1481
            VSFKRIK+LMELA + + +  L +KMAIF
Sbjct: 457  VSFKRIKLLMELANKHDEVNNLIQKMAIF 485

