BLASTX nr result

ID: Achyranthes23_contig00019892 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00019892
         (922 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   270   6e-70
gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus pe...   267   4e-69
ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citr...   265   2e-68
gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily p...   259   8e-67
ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containi...   259   1e-66
gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]     256   6e-66
ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containi...   252   2e-64
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   252   2e-64
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   248   2e-63
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   247   5e-63
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   246   1e-62
ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutr...   245   1e-62
ref|XP_002324000.1| pentatricopeptide repeat-containing family p...   245   2e-62
ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Caps...   243   1e-61
ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containi...   243   1e-61
gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus...   241   4e-61
gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlise...   233   1e-58
ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [A...   216   1e-53
gb|EAZ20176.1| hypothetical protein OsJ_35776 [Oryza sativa Japo...   178   2e-42
ref|NP_001066581.1| Os12g0283900 [Oryza sativa Japonica Group] g...   178   2e-42

>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
           [Vitis vinifera]
          Length = 763

 Score =  270 bits (690), Expect = 6e-70
 Identities = 145/292 (49%), Positives = 191/292 (65%), Gaps = 2/292 (0%)
 Frame = +3

Query: 48  RRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEKETQLRD-DSVEAPLSKSLSSGD- 221
           +R   G  F   WAL +Q I +    E+   I      T+  D D ++   ++     D 
Sbjct: 96  KRGSFGASFALAWALEQQAIGNEFVKEDSNSIHSLAGNTETVDIDCLKVDGARDGDENDN 155

Query: 222 VGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGF 401
             +                     LA  L  A   +DVEEV+KD+ ELPLQV+S++IRGF
Sbjct: 156 EEEKEAEKNGEVIEEKSRNVDVRALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGF 215

Query: 402 GRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEA 581
           G DK+L+AAMA+V+WL++K +E+ G  GPNLF+YNSLLGAVKQS +   V+ V++ M   
Sbjct: 216 GTDKRLDAAMALVEWLKRK-KETNGSKGPNLFVYNSLLGAVKQSEKFALVEKVMNDMARE 274

Query: 582 GVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGA 761
           G+  NVVTYNT+M IYL++GR+VEALN+ EEI + GL PSP SYSTALL YR MEDG GA
Sbjct: 275 GILPNVVTYNTLMSIYLEQGRSVEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGA 334

Query: 762 LNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           L FF++ +E Y+ GE+ KD  +DW+NEF+KL+NFT RICYQVMRRWLVK+GN
Sbjct: 335 LKFFIELRENYLKGEIGKDADEDWENEFVKLKNFTIRICYQVMRRWLVKEGN 386


>gb|EMJ21432.1| hypothetical protein PRUPE_ppa001979mg [Prunus persica]
          Length = 734

 Score =  267 bits (683), Expect = 4e-69
 Identities = 150/318 (47%), Positives = 198/318 (62%), Gaps = 13/318 (4%)
 Frame = +3

Query: 3    FIFGCP--------KNGNFVSGNRRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEK 158
            F FGC         K        +RS  G  F   WAL EQ I + +      +I+    
Sbjct: 74   FDFGCGCFSGYSKLKPARICQSKKRSF-GASFVVAWALEEQAIGNDI------VIEESTS 126

Query: 159  ETQLRDDS----VEAPLSKSLSSG-DVGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKL 323
            E +L  +     V+  +      G D  +                     LA  L  AK 
Sbjct: 127  EHRLSGEGESKGVDHLIVDEAEGGEDKNEVDVRNGGANWEQKNEKIDVRALALSLQFAKT 186

Query: 324  ENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIY 503
             +DVE V+KD+G+LPLQVFSS+IRGFGRD+ +++A AVV+WL++KSEE+ G   PNLFIY
Sbjct: 187  ADDVEVVLKDKGDLPLQVFSSMIRGFGRDRLMDSAFAVVEWLKRKSEETNGSITPNLFIY 246

Query: 504  NSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITR 683
            NSLLGAVKQS +  ++D VLSAM E GV  NVVTYNT M IY+++G + +AL++ E+I +
Sbjct: 247  NSLLGAVKQSKQFGEMDKVLSAMTEEGVELNVVTYNTKMAIYIEQGLSTKALDVLEDIEK 306

Query: 684  TGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENF 863
             GL PS  SYSTALLAY+ MEDG GAL FF++F+EKY  G++ K+  +DW++EF++LENF
Sbjct: 307  KGLIPSSVSYSTALLAYQRMEDGNGALQFFIEFREKYHKGDISKESVEDWEHEFIQLENF 366

Query: 864  TSRICYQVMRRWLVKDGN 917
            T R+CYQVMRRWLVKD N
Sbjct: 367  TKRVCYQVMRRWLVKDDN 384


>ref|XP_006447217.1| hypothetical protein CICLE_v10014357mg [Citrus clementina]
           gi|568831365|ref|XP_006469938.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At3g46610-like [Citrus sinensis]
           gi|557549828|gb|ESR60457.1| hypothetical protein
           CICLE_v10014357mg [Citrus clementina]
          Length = 768

 Score =  265 bits (676), Expect = 2e-68
 Identities = 141/297 (47%), Positives = 193/297 (64%), Gaps = 7/297 (2%)
 Frame = +3

Query: 48  RRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEKETQLRDDSVE-APLSKSLSSGDV 224
           ++S  G      W++ +Q I + +  E P   DG   ET+   D V+   + +   +GD 
Sbjct: 96  KKSYFGASVMFAWSMEQQEIGNGLLVEEPNSADGLLVETE--SDIVDYRSVHRVEDTGDN 153

Query: 225 GKXXXXXXXXXXXXXXXXXXXXG------LAQRLSNAKLENDVEEVMKDEGELPLQVFSS 386
           G                     G      LAQ L + K  +DVEEV+KD GELP QV SS
Sbjct: 154 GNQVESEEVEIIGERGVGKQKSGRVDVKALAQSLWHTKTADDVEEVLKDMGELPPQVHSS 213

Query: 387 IIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLS 566
           +IRGFG++K+ + AMA+V+WL++K  E+ G  GPNLF+YNSLLGAVKQS + +++D +++
Sbjct: 214 MIRGFGKEKRTDCAMALVEWLKRKKRETGGFIGPNLFVYNSLLGAVKQSQKFEEMDRIMN 273

Query: 567 AMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNME 746
            M E GV  NVVTYNT+M IY+++G   +ALN+ EEI + GL+PS  SYS ALLAYR ME
Sbjct: 274 DMAEEGVNPNVVTYNTLMAIYIEQGEGTKALNVLEEIKKKGLTPSAVSYSQALLAYRRME 333

Query: 747 DGMGALNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           DG GAL FF++ +EKY+ GE+ K D ++W+NEF+KL++F  RICYQVMRRWLVKD N
Sbjct: 334 DGNGALKFFVELREKYLKGEIGKGDDENWENEFVKLKDFIIRICYQVMRRWLVKDEN 390


>gb|EOY02618.1| Pentatricopeptide repeat (PPR-like) superfamily protein, putative
           [Theobroma cacao]
          Length = 741

 Score =  259 bits (663), Expect = 8e-67
 Identities = 142/292 (48%), Positives = 189/292 (64%)
 Frame = +3

Query: 42  GNRRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEKETQLRDDSVEAPLSKSLSSGD 221
           G+ R LV +     WAL +Q I + +  E     DG +   + +++ ++A      S G+
Sbjct: 94  GSSRGLVALA----WALEQQEIGNELEREESHSRDG-DNGNEDKNEEMDAS-----SEGE 143

Query: 222 VGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGF 401
           V                       LA  L  AK  +D+E+V+KD  ELPLQV SS+I+GF
Sbjct: 144 V-----------ELEESARLDVRALASSLQFAKTADDIEKVLKDMDELPLQVHSSMIKGF 192

Query: 402 GRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEA 581
           GRD  ++AAMA+V+WL++K  +S G  GPNLFIYNSLLGAVK S +  +++ +L  M+E 
Sbjct: 193 GRDNYMDAAMALVEWLKRKKNDSGGSVGPNLFIYNSLLGAVKHSKQFREMEKILKDMEEE 252

Query: 582 GVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGA 761
           GV  N+VTYN +M IYL++G A +ALN+ EEI   G SPSP SYSTALLAYR MEDG GA
Sbjct: 253 GVIPNIVTYNVLMAIYLEQGEATKALNVLEEIQEKGFSPSPVSYSTALLAYRRMEDGNGA 312

Query: 762 LNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           L FF++ +EKY+ G+L KD  ++W+ EF+KLENFT RIC QVMRRWLVKD N
Sbjct: 313 LKFFIELREKYVKGDLGKDADENWEYEFVKLENFTVRICQQVMRRWLVKDEN 364


>ref|XP_004308618.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Fragaria vesca subsp. vesca]
          Length = 657

 Score =  259 bits (661), Expect = 1e-66
 Identities = 136/280 (48%), Positives = 185/280 (66%)
 Frame = +3

Query: 72  FPTGWALGEQGIDDSVNGENPKLIDGCEKETQLRDDSVEAPLSKSLSSGDVGKXXXXXXX 251
           F + WAL EQ I D V+ EN    +G   E   R+  +E        SG  G        
Sbjct: 38  FVSAWALEEQDIGDEVSVENSTSGNGLLAECGSREVGMEGSDEVDGRSGGEG-------- 89

Query: 252 XXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAM 431
                         LA RL  AK  +DVEEV+K+ G+LPLQVFSS+IRGFGRDK +++A 
Sbjct: 90  GNWEEKSEVVDVRALASRLQFAKTADDVEEVLKEMGDLPLQVFSSMIRGFGRDKLMDSAF 149

Query: 432 AVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYN 611
           AVV+WL+++ EE+ G+  PNLFI+NSLLGAVKQ  +  ++D VL+ M + GV  N+VTYN
Sbjct: 150 AVVEWLKRRGEETNGMVAPNLFIFNSLLGAVKQCKQFGEMDKVLADMTQEGVEPNIVTYN 209

Query: 612 TIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEK 791
           T M IY+++G + +AL++ EEI + G+  SP +YSTAL AY+ M+DG+GAL FF++F+EK
Sbjct: 210 TKMAIYVEQGLSTKALDVLEEIQKKGMIASPVTYSTALQAYQRMQDGIGALEFFVEFREK 269

Query: 792 YMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKD 911
           Y NG++     +DW++EFLKLE+FT R+CYQVMR WLV D
Sbjct: 270 YRNGDICNVSEEDWESEFLKLESFTKRVCYQVMRWWLVMD 309


>gb|EXC31403.1| hypothetical protein L484_017686 [Morus notabilis]
          Length = 737

 Score =  256 bits (655), Expect = 6e-66
 Identities = 124/208 (59%), Positives = 162/208 (77%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LA  L  AK  +DV+EV+KD+GELP QVFS++IRG GR+K L+ A A+++WL++K EE+ 
Sbjct: 186 LASSLRFAKTADDVDEVLKDKGELPPQVFSTMIRGLGREKLLDPAFALLEWLKRKKEENN 245

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
           G+   NLFIYNSLLGAVKQS +  +++ VL+ M + GV  NVVTYNT+M I+L+ G   +
Sbjct: 246 GLISLNLFIYNSLLGAVKQSEQFGEMEKVLNYMAQEGVVPNVVTYNTMMAIHLENGEGTK 305

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           AL++ EEI + GL+PSP SYSTALLAYR MEDG GAL FF++ +EKY  GE+ KDD +DW
Sbjct: 306 ALSVLEEIRKKGLTPSPVSYSTALLAYRRMEDGHGALKFFVEIREKYQKGEMGKDDDEDW 365

Query: 834 DNEFLKLENFTSRICYQVMRRWLVKDGN 917
           +NEF+KLENFT R+CYQVMR WLV + N
Sbjct: 366 ENEFVKLENFTIRVCYQVMRHWLVNEDN 393


>ref|XP_006338641.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Solanum tuberosum]
          Length = 740

 Score =  252 bits (643), Expect = 2e-64
 Identities = 122/208 (58%), Positives = 163/208 (78%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LAQ L   K  ++V+EV+KD+ ELPLQV+SS+IRGFG+DKKL +AMA+V+WLR++S+++ 
Sbjct: 157 LAQSLHFVKTADEVDEVLKDKIELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRSKDNI 216

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
           G    N+FIYNSLLGA+K++G+ D VD V+  M   GV  NVVTYNT+M IY+++GR +E
Sbjct: 217 GSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRIYIEQGRELE 276

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           ALNLF  + + GLSPSP SYSTAL AYR +EDG GA+ FF++ +EKY NGE+   + ++W
Sbjct: 277 ALNLFRLMPKKGLSPSPASYSTALFAYRRLEDGFGAITFFVETREKYQNGEIGNIEEENW 336

Query: 834 DNEFLKLENFTSRICYQVMRRWLVKDGN 917
           ++EF KLENF  RICYQVMR+WLVK  N
Sbjct: 337 EDEFAKLENFIVRICYQVMRQWLVKGEN 364


>ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like
            [Glycine max]
          Length = 808

 Score =  252 bits (643), Expect = 2e-64
 Identities = 149/359 (41%), Positives = 205/359 (57%), Gaps = 54/359 (15%)
 Frame = +3

Query: 3    FIFGC--PKNGNFVSGNRRSLVGMCFPTGWALGEQGI------------DDSVNGE---- 128
            F+ GC  PK G  +  ++  +  +  P GWAL E G+            D SVN E    
Sbjct: 73   FLLGCSRPKLGIILKPHKSHVGDLAPPLGWALEEDGVGSELVDEQIDSNDASVNRESEGV 132

Query: 129  ---NPKLIDGCEKETQLR---DDSVEA----------PLSKSLSSGDV------------ 224
               N   +   + E Q+R   DDS E+            + +L +GD+            
Sbjct: 133  KSLNLDQVQDSDFEGQIRGYDDDSKESGGNELVEEQTDSNDALVNGDLEGVKSLNLDQVK 192

Query: 225  -----GKXXXXXXXXXXXXXXXXXXXX--GLAQRLSNAKLENDVEEVMKDEGELPLQVFS 383
                 GK                       LA  L   K   DV  ++KD+G+LPLQVFS
Sbjct: 193  DSDCEGKMCGDDNSKEGGEEESDGKVDVRALALSLQTVKTVEDVGGILKDKGDLPLQVFS 252

Query: 384  SIIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVL 563
            +II GFG++K++++A+ + +W++K+  E+ G  GPNLFIYN LLG VKQSG+  +++++L
Sbjct: 253  TIISGFGKEKRMDSALILFNWMKKRKIETNGSFGPNLFIYNGLLGVVKQSGQFAEMEVIL 312

Query: 564  SAMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNM 743
            + M E G+ +NVVTYNT+M IY+++G   +ALN+ EEI R GL+PSP SYS ALLAYR M
Sbjct: 313  NEMAEDGIAYNVVTYNTLMAIYIEKGECDKALNMLEEIRRNGLTPSPVSYSQALLAYRRM 372

Query: 744  EDGMGALNFFLDFKEKYMNGELRKDDH-DDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
            EDG GALNFF++F+EKY  GE+ KDD  +DW+ E LKLE FT R+CYQVMR WLV   N
Sbjct: 373  EDGYGALNFFVEFREKYRQGEIGKDDDGEDWEKECLKLEKFTIRVCYQVMRCWLVSRDN 431


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score =  248 bits (633), Expect = 2e-63
 Identities = 129/285 (45%), Positives = 186/285 (65%), Gaps = 6/285 (2%)
 Frame = +3

Query: 84  WALGEQGIDDSVNGENPKLIDGCEKETQLRDDS------VEAPLSKSLSSGDVGKXXXXX 245
           WAL +Q I    +G  P L DG   +++  D +      +E     + +  D  +     
Sbjct: 12  WALQKQDISSEFHGVEPSLDDGLLGKSEKEDVNPHNLGRLEDSDDDNNNQEDNIELDLRS 71

Query: 246 XXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEA 425
                           LA+ L +A+  +DVEEV+KD+GELPLQV+SS+I+ FG D K+E+
Sbjct: 72  KEGVGEEKCRSIDVRSLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNKMES 131

Query: 426 AMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVT 605
           A+A+V+WL+++ E    + GPNLFIYNSLL AVK+S   ++ + +L+ M + G+  NVVT
Sbjct: 132 ALALVEWLKRRKEIGSSI-GPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPNVVT 190

Query: 606 YNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFK 785
           YNT+MGIY+++G+A +ALN+ E++   G  P+  SYSTALLAYR MEDG GAL FF+D K
Sbjct: 191 YNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFVDIK 250

Query: 786 EKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGNF 920
           +KY+ G++ K+  ++W+NEF+KLE F  RICYQVMRRWLV+  NF
Sbjct: 251 DKYLKGKIGKNSDENWENEFVKLETFIIRICYQVMRRWLVRHDNF 295


>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At3g46610 gi|6523064|emb|CAB62331.1| hypothetical
           protein [Arabidopsis thaliana]
           gi|332644660|gb|AEE78181.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 665

 Score =  247 bits (630), Expect = 5e-63
 Identities = 142/308 (46%), Positives = 187/308 (60%), Gaps = 12/308 (3%)
 Frame = +3

Query: 33  FVSGNR---------RSLVGMCFPTGWALGEQGID---DSVNGENPKLIDGCEKETQLRD 176
           FVS NR         RSL+G  F  GWA  ++ ++   + V+ E+    +G EK   LR 
Sbjct: 58  FVSSNRKVLFLCEPKRSLLGSSFGVGWATEQRELELGEEEVSTEDLSSANGGEKNN-LRV 116

Query: 177 DSVEAPLSKSLSSGDVGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDE 356
           D  E                                   LA  L  AK  +DV+ V+KD+
Sbjct: 117 DVRE-----------------------------------LAFSLRAAKTADDVDAVLKDK 141

Query: 357 GELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSG 536
           GELPLQVF ++I+GFG+DK+L+ A+AVVDWL++K  ES GV GPNLFIYNSLLGA++  G
Sbjct: 142 GELPLQVFCAMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMRGFG 201

Query: 537 ELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYS 716
           E +K+   L  M+E G+  N+VTYNT+M IY++EG  ++AL + +     G  P+P +YS
Sbjct: 202 EAEKI---LKDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYS 258

Query: 717 TALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRR 896
           TALL YR MEDGMGAL FF++ +EKY   E+  D   DW+ EF+KLENF  RICYQVMRR
Sbjct: 259 TALLVYRRMEDGMGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRR 318

Query: 897 WLVKDGNF 920
           WLVKD N+
Sbjct: 319 WLVKDDNW 326


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319497|gb|EFH49919.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 674

 Score =  246 bits (627), Expect = 1e-62
 Identities = 140/301 (46%), Positives = 187/301 (62%), Gaps = 5/301 (1%)
 Frame = +3

Query: 33  FVSGNRRSLVGMCFPTGWALGEQGIDDSVNGEN---PKLIDGCEKETQLRDDSVEAPLSK 203
           F+   +R+L G     GWA  ++ + + V+ E+   P+ ++G EK T  R D  E     
Sbjct: 74  FLCEPKRNLSGSSVGVGWATEQRELGEEVSTEDSSYPQTVNGGEK-TNSRVDVRE----- 127

Query: 204 SLSSGDVGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFS 383
                                         LA  L  AK  +DV+ V+K+ GELPLQV+ 
Sbjct: 128 ------------------------------LAYSLRAAKTADDVDIVIKEMGELPLQVYC 157

Query: 384 SIIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQS--GELDKVDI 557
           ++IRGFG+DK+L+ A+AVVDWLR+K  ES GV GPNLFIYNSLLGA+KQS  GE +K+  
Sbjct: 158 AMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLLGAMKQSSVGEAEKI-- 215

Query: 558 VLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYR 737
            LS M+E G+  N+VTYNT+M IY+++G   +AL + + +   G  P+P +YSTALL YR
Sbjct: 216 -LSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKALGILDLVKEKGFEPNPITYSTALLVYR 274

Query: 738 NMEDGMGALNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
            MEDGMGAL FF++ +EKY   E+  D   DW+ EF+KLENF  RICYQVMRRWLVKD N
Sbjct: 275 RMEDGMGALEFFVELREKYSKREIGNDADYDWEFEFVKLENFIGRICYQVMRRWLVKDEN 334

Query: 918 F 920
           +
Sbjct: 335 W 335


>ref|XP_006399946.1| hypothetical protein EUTSA_v10015672mg [Eutrema salsugineum]
           gi|557101036|gb|ESQ41399.1| hypothetical protein
           EUTSA_v10015672mg [Eutrema salsugineum]
          Length = 688

 Score =  245 bits (626), Expect = 1e-62
 Identities = 130/295 (44%), Positives = 183/295 (62%)
 Frame = +3

Query: 33  FVSGNRRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEKETQLRDDSVEAPLSKSLS 212
           F+   ++SL G     GWA  ++ + + V+ E+   +   + +            S++++
Sbjct: 74  FLCEPKKSLSGSSVGVGWATEQRELGEEVSREDSSSVTASDSDHSK---------SQAVT 124

Query: 213 SGDVGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDEGELPLQVFSSII 392
            G+                        LA  L  AK  +DV+ V+K++GELPLQV+ ++I
Sbjct: 125 GGEKTNARVDVRE--------------LAYSLRAAKTADDVDVVLKEKGELPLQVYCAMI 170

Query: 393 RGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAM 572
           RGFG+DK+L+ AMAVVDWL++K  ES G+ GPNLFIYNSLLGA+K+S    + + +LS M
Sbjct: 171 RGFGKDKRLKPAMAVVDWLKRKKIESGGLIGPNLFIYNSLLGAMKESRGFGETEKILSDM 230

Query: 573 DEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDG 752
           +E G+  N+VTYNT+M IY++EG   +AL + + +   G  PSP +YSTALL YR +EDG
Sbjct: 231 EEEGIVPNIVTYNTLMVIYMEEGEFHKALGILDLVKEKGFEPSPVTYSTALLVYRRLEDG 290

Query: 753 MGALNFFLDFKEKYMNGELRKDDHDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           MGAL FF + +EKY   E+  D   DW+ EF+KLENF  RICYQVMRRWLVKD N
Sbjct: 291 MGALEFFAELREKYSKREIGNDADYDWEFEFVKLENFIGRICYQVMRRWLVKDEN 345


>ref|XP_002324000.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222867002|gb|EEF04133.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 709

 Score =  245 bits (625), Expect = 2e-62
 Identities = 120/205 (58%), Positives = 161/205 (78%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LAQ L  AK  +D+EEV+KD+GELP+QV+ S+I+GFG DKK+E A+A+VDWL+ K +E++
Sbjct: 128 LAQSLYFAKTVDDIEEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIK-KETD 186

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
           G   PNLFIYNSLL AVKQS + ++ + +L  M + GV  NVVTYN +M IY+ +G+A +
Sbjct: 187 GTIVPNLFIYNSLLSAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKK 246

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           AL++ EE+ R G +PS  SYS+ALLAYR MEDG GAL FF++ K+KYM GE+ KD  +DW
Sbjct: 247 ALDVLEEMRRNGFTPSAASYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDADEDW 306

Query: 834 DNEFLKLENFTSRICYQVMRRWLVK 908
           + E++KLENFT R+CYQVMRRWLV+
Sbjct: 307 EREYVKLENFTIRVCYQVMRRWLVR 331


>ref|XP_006292935.1| hypothetical protein CARUB_v10019206mg [Capsella rubella]
           gi|482561642|gb|EOA25833.1| hypothetical protein
           CARUB_v10019206mg [Capsella rubella]
          Length = 673

 Score =  243 bits (619), Expect = 1e-61
 Identities = 119/209 (56%), Positives = 153/209 (73%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LA  L  AK  +DV+ V+K++GELPLQVF ++I GFG+DK+LE A+AVVDWL++K  ES 
Sbjct: 126 LAFSLRAAKTADDVDAVLKEKGELPLQVFCAMISGFGKDKRLEPAVAVVDWLKRKKSESG 185

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
            V GPNLFIYNSLLGA+KQ     + + VLS M+E G+  N+VTYNT+M IY++EG  ++
Sbjct: 186 SVIGPNLFIYNSLLGAMKQLSAFGEAEKVLSDMEEEGIVPNIVTYNTLMVIYMEEGEFLK 245

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           AL + + +   G  P+P +YSTALL YR MEDGMGAL FF++ +EKY   E+  D   DW
Sbjct: 246 ALGILDLVKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDPDYDW 305

Query: 834 DNEFLKLENFTSRICYQVMRRWLVKDGNF 920
             EF KLENF  RICYQVMRRWLVK+ N+
Sbjct: 306 KFEFFKLENFIGRICYQVMRRWLVKNENW 334


>ref|XP_004231824.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Solanum lycopersicum]
          Length = 742

 Score =  243 bits (619), Expect = 1e-61
 Identities = 118/209 (56%), Positives = 161/209 (77%), Gaps = 1/209 (0%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKK-SEES 470
           LAQ L   K  ++V+EV+KD+ ELPLQV+SS+IRGFG+DKKL +AMA+V+WLR++  +++
Sbjct: 158 LAQSLHFVKTADEVDEVLKDKVELPLQVYSSMIRGFGKDKKLNSAMALVEWLRRRRGKDN 217

Query: 471 EGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAV 650
            G    N+FIYNSLLGA+K++G+ D VD V+  M   GV  NVVTYNT+M  Y+++GR +
Sbjct: 218 IGSISLNVFIYNSLLGAIKEAGKYDFVDKVMDDMVSEGVQPNVVTYNTLMRTYIEQGREL 277

Query: 651 EALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDD 830
           EAL LF E+ + GL+PSP SYSTAL AYR +EDG GA+ FF++ +E+Y NGE+   + ++
Sbjct: 278 EALKLFREMPKKGLTPSPASYSTALFAYRRLEDGFGAITFFVETRERYQNGEIGNIEEEN 337

Query: 831 WDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           W++EF KLENF  RICYQVMR+WLVK  N
Sbjct: 338 WEDEFAKLENFIVRICYQVMRQWLVKGEN 366


>gb|ESW12830.1| hypothetical protein PHAVU_008G145600g [Phaseolus vulgaris]
          Length = 752

 Score =  241 bits (614), Expect = 4e-61
 Identities = 130/308 (42%), Positives = 186/308 (60%), Gaps = 8/308 (2%)
 Frame = +3

Query: 18  PKNGNFVSGNRRSLVGMCFPTGWALGEQGIDDSVNGENPKLIDGCEKET-------QLRD 176
           PK G  +  N+  +  +  P GWAL ++G+   +  EN  +    E E        Q++D
Sbjct: 80  PKFGIILKQNKSHIGDLAPPLGWALEDEGVVSELVEEN--IDSNGESEVIKSLNLGQVQD 137

Query: 177 DSVEAPLSKSLSSGDVGKXXXXXXXXXXXXXXXXXXXXGLAQRLSNAKLENDVEEVMKDE 356
              E  +    +S + GK                     LA RL  A   +DV E++ D+
Sbjct: 138 SDCEPKMGVGENSKEGGKEESFGKVDVR----------ALALRLQTALTVDDVREILVDK 187

Query: 357 GELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESEGVDGPNLFIYNSLLGAVKQSG 536
            +LPLQVFS+II  FG++K++++A+ + +W++K+  E+ G  GPNLFIYN LLG VKQSG
Sbjct: 188 RDLPLQVFSTIINSFGKEKRMDSALILFEWMKKRKIETNGSFGPNLFIYNGLLGVVKQSG 247

Query: 537 ELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVEALNLFEEITRTGLSPSPPSYS 716
           +  +++ +L+ M + G+ +NVVTYNT+M IY+++G    ALN+ EEI   G +PSP SYS
Sbjct: 248 QFAQMETILNEMAKDGISYNVVTYNTLMAIYIEKGEFDRALNVLEEIHGNGFTPSPVSYS 307

Query: 717 TALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDH-DDWDNEFLKLENFTSRICYQVMR 893
            ALLAYR MED  GALNFF++ +E Y  GE+ +DD  +DW+ E +KLE FT RICYQVMR
Sbjct: 308 QALLAYRRMEDCNGALNFFVELRENYHRGEIGEDDDGEDWEEELMKLEKFTIRICYQVMR 367

Query: 894 RWLVKDGN 917
            WLV   N
Sbjct: 368 CWLVSSDN 375


>gb|EPS70491.1| hypothetical protein M569_04265, partial [Genlisea aurea]
          Length = 557

 Score =  233 bits (593), Expect = 1e-58
 Identities = 111/209 (53%), Positives = 156/209 (74%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LA +L  A   +DVE+++K +  LPLQV+S++IRG G++K++++AMA+ +WL++KS+ES 
Sbjct: 11  LALKLQLATTADDVEQLLKGKENLPLQVYSTVIRGLGKEKRIQSAMALFEWLQRKSKESG 70

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
                NLF+YNSLLGA+KQ+   D V+ V++ M   GV  NVVT+N +MGI++++G  + 
Sbjct: 71  SKLKLNLFVYNSLLGAMKQAEAFDLVEEVMTKMGAEGVHPNVVTFNALMGIHIEQGNELR 130

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           AL LF E+   G+SPSP SYST L AYR ME+G GA++FF++ + KY NG++  DD +DW
Sbjct: 131 ALELFREMLMMGISPSPASYSTVLNAYRRMENGSGAVSFFIETRNKYRNGDMANDDDEDW 190

Query: 834 DNEFLKLENFTSRICYQVMRRWLVKDGNF 920
           + E  KLENFT RICYQVMRRWLVK GNF
Sbjct: 191 ELEISKLENFTLRICYQVMRRWLVKRGNF 219


>ref|XP_006852234.1| hypothetical protein AMTR_s00049p00149530 [Amborella trichopoda]
           gi|548855838|gb|ERN13701.1| hypothetical protein
           AMTR_s00049p00149530 [Amborella trichopoda]
          Length = 754

 Score =  216 bits (550), Expect = 1e-53
 Identities = 108/207 (52%), Positives = 146/207 (70%)
 Frame = +3

Query: 294 LAQRLSNAKLENDVEEVMKDEGELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKSEESE 473
           LA  L  A+  +DVEEV+ D  +LP  V+SS+IRGFG  ++L+ A+A+V+WL++  + + 
Sbjct: 165 LAMSLQFAERADDVEEVLGDM-DLPPSVYSSMIRGFGMAERLKPAIALVEWLKRGKKSTN 223

Query: 474 GVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEGRAVE 653
           G    NL+IYNSLLGA K S   +KV  ++  M++ G+  N+VT NT+M +YL++G+  E
Sbjct: 224 GGAILNLYIYNSLLGAAKASHSYEKVGKIIEDMEKQGILPNIVTLNTLMSVYLEQGKTQE 283

Query: 654 ALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDDHDDW 833
           A ++F EI R GLSPSP +YST L  YR MED  GAL FF++ +EKY  GE+  D  +DW
Sbjct: 284 ARDIFSEIPRNGLSPSPVTYSTVLQIYRKMEDAKGALEFFVESREKYKKGEIENDSCEDW 343

Query: 834 DNEFLKLENFTSRICYQVMRRWLVKDG 914
           +NEF KLENFT RICYQVMR WLVK G
Sbjct: 344 ENEFAKLENFTIRICYQVMRGWLVKGG 370


>gb|EAZ20176.1| hypothetical protein OsJ_35776 [Oryza sativa Japonica Group]
          Length = 642

 Score =  178 bits (452), Expect = 2e-42
 Identities = 93/212 (43%), Positives = 135/212 (63%), Gaps = 8/212 (3%)
 Frame = +3

Query: 306 LSNAKLENDVEEVMK---DEG-----ELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKS 461
           L +A+  ++VE ++K   D+G      LPLQV++S+IRG G++++L+AA AVV+ L++ S
Sbjct: 50  LRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGLGKERRLDAAFAVVEHLKRGS 109

Query: 462 EESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEG 641
               G  G N F+YN LLGAVK SGE  ++  VL+ M+  GVP N+VT+NT+M IY+++G
Sbjct: 110 GSGGGGGGVNQFVYNCLLGAVKNSGEFGRIHDVLADMEAQGVPPNIVTFNTLMSIYVEQG 169

Query: 642 RAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDD 821
           +  E   +F+ I  +GL P+  +YST + AY+   D   AL F    +E Y  GEL   +
Sbjct: 170 KIDEVFRVFDTIEGSGLVPTAATYSTVMSAYKKAGDAFAALKFITKLREMYNKGELAV-N 228

Query: 822 HDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           H+DWD EF+K E  T R+CY  MRR LV   N
Sbjct: 229 HEDWDREFVKFEKLTVRVCYMAMRRSLVGGEN 260


>ref|NP_001066581.1| Os12g0283900 [Oryza sativa Japonica Group]
           gi|113649088|dbj|BAF29600.1| Os12g0283900 [Oryza sativa
           Japonica Group]
          Length = 675

 Score =  178 bits (452), Expect = 2e-42
 Identities = 93/212 (43%), Positives = 135/212 (63%), Gaps = 8/212 (3%)
 Frame = +3

Query: 306 LSNAKLENDVEEVMK---DEG-----ELPLQVFSSIIRGFGRDKKLEAAMAVVDWLRKKS 461
           L +A+  ++VE ++K   D+G      LPLQV++S+IRG G++++L+AA AVV+ L++ S
Sbjct: 83  LRDARTADEVETLVKGFLDDGGGGEEHLPLQVYTSVIRGLGKERRLDAAFAVVEHLKRGS 142

Query: 462 EESEGVDGPNLFIYNSLLGAVKQSGELDKVDIVLSAMDEAGVPWNVVTYNTIMGIYLDEG 641
               G  G N F+YN LLGAVK SGE  ++  VL+ M+  GVP N+VT+NT+M IY+++G
Sbjct: 143 GSGGGGGGVNQFVYNCLLGAVKNSGEFGRIHDVLADMEAQGVPPNIVTFNTLMSIYVEQG 202

Query: 642 RAVEALNLFEEITRTGLSPSPPSYSTALLAYRNMEDGMGALNFFLDFKEKYMNGELRKDD 821
           +  E   +F+ I  +GL P+  +YST + AY+   D   AL F    +E Y  GEL   +
Sbjct: 203 KIDEVFRVFDTIEGSGLVPTAATYSTVMSAYKKAGDAFAALKFITKLREMYNKGELAV-N 261

Query: 822 HDDWDNEFLKLENFTSRICYQVMRRWLVKDGN 917
           H+DWD EF+K E  T R+CY  MRR LV   N
Sbjct: 262 HEDWDREFVKFEKLTVRVCYMAMRRSLVGGEN 293


Top