BLASTX nr result

ID: Akebia24_contig00008810 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00008810
         (1544 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007037187.1| Pentatricopeptide repeat-containing protein,...   298   4e-78
ref|XP_007210374.1| hypothetical protein PRUPE_ppa001256mg [Prun...   297   8e-78
ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containi...   297   1e-77
emb|CBI15289.3| unnamed protein product [Vitis vinifera]              277   8e-72
ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containi...   276   2e-71
ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containi...   275   3e-71
ref|XP_002511505.1| pentatricopeptide repeat-containing protein,...   273   1e-70
ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citr...   271   4e-70
gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis]     264   9e-68
ref|XP_002321537.1| pentatricopeptide repeat-containing family p...   261   8e-67
ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containi...   260   1e-66
ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containi...   259   2e-66
ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containi...   259   3e-66
ref|XP_006578590.1| PREDICTED: pentatricopeptide repeat-containi...   259   3e-66
ref|XP_007157658.1| hypothetical protein PHAVU_002G087700g [Phas...   251   6e-64
ref|XP_004513211.1| PREDICTED: pentatricopeptide repeat-containi...   248   5e-63
ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containi...   247   1e-62
ref|XP_004138146.1| PREDICTED: pentatricopeptide repeat-containi...   246   3e-62
ref|XP_004154991.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   244   6e-62
gb|EYU35898.1| hypothetical protein MIMGU_mgv1a019674mg [Mimulus...   233   2e-58

>ref|XP_007037187.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao] gi|508774432|gb|EOY21688.1| Pentatricopeptide
            repeat-containing protein, putative [Theobroma cacao]
          Length = 859

 Score =  298 bits (763), Expect = 4e-78
 Identities = 176/397 (44%), Positives = 227/397 (57%), Gaps = 25/397 (6%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKA-S 944
            MLR KQI  LS SARSF  SGSR                  +SR +   NEV +      
Sbjct: 1    MLRAKQIGNLSSSARSFLFSGSRCSASDGNSCTCPEDESC-VSRKRSIRNEVLSKSSGRG 59

Query: 943  ALASKSSTIVVAQHSGDSVGACVSI---VSGDGRTSQEYSVHSSTLVGN--------FVK 797
             LA  +++  V  H  +     VS    +   G  + + ++ ++ L G         FVK
Sbjct: 60   TLALGTASKAVGSHEAERAPQLVSSPIPLHRSGNVNYDVNIDAAQLDGQASAPISDQFVK 119

Query: 796  ASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSPHPKPHKVEKFSQVH 617
            A IA   FLSD++N+K P SDG     +P+NC+V+SSR +  +KSP  KP K E F++V+
Sbjct: 120  AGIAAVSFLSDMMNYKLPLSDGGVMLSSPKNCVVESSRQLPNIKSPAVKPIKKENFAKVY 179

Query: 616  GKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATTNSAHS-----DTQGRELP 452
             KPS EI  G K    YH TK +  K   V+G    S  A+  S+ +     +T  +  P
Sbjct: 180  PKPSSEIAAGPKSTVSYHGTKDRGNKPNFVRGYKQVSNAASVGSSETHRTSANTCDKGKP 239

Query: 451  IKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRMPTETAPFAK----- 287
            + QR K +S+RF +   SN   S+ K        E F+K  R M+MPT   P  +     
Sbjct: 240  MPQRVKAHSHRFMSNFNSNVLPSDAKFSDS--GTEGFKKSFRDMKMPTGVVPMTRPLAGT 297

Query: 286  ---LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVALGFFYWLKRRA 116
                E V H+L+QL WGP AEQ L+NLN  +DAYQANQ+LKQ+QDHTVALGFFYWLK+RA
Sbjct: 298  RHVTESVSHILQQLNWGPAAEQALENLNFSMDAYQANQVLKQIQDHTVALGFFYWLKQRA 357

Query: 115  GFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            GFKHDGH+YTTMVGILGRA+QFGAINRLLD+MV DGC
Sbjct: 358  GFKHDGHTYTTMVGILGRARQFGAINRLLDQMVKDGC 394


>ref|XP_007210374.1| hypothetical protein PRUPE_ppa001256mg [Prunus persica]
            gi|462406109|gb|EMJ11573.1| hypothetical protein
            PRUPE_ppa001256mg [Prunus persica]
          Length = 870

 Score =  297 bits (761), Expect = 8e-78
 Identities = 185/410 (45%), Positives = 234/410 (57%), Gaps = 38/410 (9%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR K IS LS SARSFF++G R                  +S+ +Q  N  P     S 
Sbjct: 1    MLRAKHISNLSSSARSFFLNGPRCSATEGSSCTCSEDETC-VSQRQQTRNGGPLAQTPST 59

Query: 940  LASKSS----TI-------VVAQHSGDSVGACVSIVS------GDGRT------SQEYSV 830
            + SK S    TI       V + H  +SV    +I          GR+      S   +V
Sbjct: 60   MVSKPSAGAGTIITGDAVKVASSHKAESVEHTTNIKQVTTAPRSFGRSATVTYSSSTDAV 119

Query: 829  HSSTLV-GNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSPHP 653
            HSS LV   F +A +A   FLSD++N K P SDG+G    PQNCMVD +R +S +K  H 
Sbjct: 120  HSSPLVVDQFARAGVAAVNFLSDIVNGKLPLSDGLGLLNLPQNCMVDPTRPLSSIKPSHV 179

Query: 652  KPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATTNS---- 485
            K  K E F  VH KPS E    SK  +  H +KGK EK   VKGL+H       NS    
Sbjct: 180  KQIKREHFISVHPKPSTETAAASKHTSNNHGSKGKGEKPSFVKGLNHVPYTRKENSVVAH 239

Query: 484  -AHSDT-QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRMP 311
             A SDT   R +P  ++SK +SN F     SN QTS+ + +G  R  + F +P R M+MP
Sbjct: 240  TASSDTFDKRSMP--RKSKGHSNNFIPNYSSNVQTSDAESMG--RVTKGFNRPTRDMKMP 295

Query: 310  TETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHT 155
            T   P  +        ++ V H+L+Q++WGP AE  L NLNC +DAYQANQILKQLQDH+
Sbjct: 296  TGITPINRQFVHTGNVVQNVSHILQQMRWGPAAEAALLNLNCSMDAYQANQILKQLQDHS 355

Query: 154  VALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            VAL FFYWLKR+AGFKHDGH+YTTMVGILGR++QFGAIN+LL++MV +GC
Sbjct: 356  VALSFFYWLKRQAGFKHDGHTYTTMVGILGRSRQFGAINKLLNQMVKEGC 405


>ref|XP_002266698.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750
            [Vitis vinifera]
          Length = 875

 Score =  297 bits (760), Expect = 1e-77
 Identities = 180/415 (43%), Positives = 231/415 (55%), Gaps = 43/415 (10%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLRTKQI  LS SARS  ISG+R                  +S  +   NEV    K + 
Sbjct: 1    MLRTKQIGPLSNSARSILISGTRSSTPDGNSCPCSEDETC-VSTKQHARNEVLIMQKQTT 59

Query: 940  LASKSSTIVVAQHSGDSV-----------------------------GACVSIVSGDGRT 848
            LASK++  V     GD+V                               CVS VS +   
Sbjct: 60   LASKTAARVGPLFLGDAVKVVGSQKVESVEHATSLAQVVAAPRSVVGSDCVSYVSDNVGV 119

Query: 847  SQEYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVV 668
            + + +VH+  +   F++A I    FLSDL+N+K P SDG G    PQNCMVD ++ +S +
Sbjct: 120  NND-AVHAPPISDQFIRAGIVAVNFLSDLVNYKIPMSDGSGMLKLPQNCMVDPTKPLSKI 178

Query: 667  KSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDH-----TSA 503
            KS + KP +  KFS+V  + S  I   S   + YH T+GK +K GSVKG  H     T  
Sbjct: 179  KSTNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGKGDKSGSVKGCSHVGDTWTRN 238

Query: 502  KATTNSAHSDTQG-RELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIR 326
               T S  SDT   R +P K ++  N +  ++   SN + SE + VGGI     F KP+R
Sbjct: 239  TVDTRSLSSDTHNKRSMPQKSKAYSNYSTSNSNFNSNVRNSEPRFVGGIAG--GFSKPLR 296

Query: 325  SMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQ 170
              +M    AP ++        +E V  +LRQL WGP AE+ L+NLNC +DAYQANQ+LKQ
Sbjct: 297  DTKM-IGIAPVSRQFGSSGHVVENVSRILRQLSWGPAAEEALRNLNCLMDAYQANQVLKQ 355

Query: 169  LQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            +QDH VALGFFYWLKR+ GFKHDGH+YTTMVGILGRA+QFGAIN+LL EMV DGC
Sbjct: 356  IQDHPVALGFFYWLKRQTGFKHDGHTYTTMVGILGRARQFGAINKLLAEMVRDGC 410


>emb|CBI15289.3| unnamed protein product [Vitis vinifera]
          Length = 793

 Score =  277 bits (709), Expect = 8e-72
 Identities = 172/407 (42%), Positives = 219/407 (53%), Gaps = 35/407 (8%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLRTKQI  LS SARS  ISG+R                  +S  +   NEV    K + 
Sbjct: 28   MLRTKQIGPLSNSARSILISGTRSSTPDGNSCPCSEDETC-VSTKQHARNEVLIMQKQTT 86

Query: 940  LASKSSTIVVAQHSGDSV-----------------------------GACVSIVSGDGRT 848
            LASK++  V     GD+V                               CVS VS +   
Sbjct: 87   LASKTAARVGPLFLGDAVKVVGSQKVESVEHATSLAQVVAAPRSVVGSDCVSYVSDNVGV 146

Query: 847  SQEYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVV 668
            + + +VH+  +   F++A I    FLSDL+N+K P SDG G    PQNCMVD ++ +S +
Sbjct: 147  NND-AVHAPPISDQFIRAGIVAVNFLSDLVNYKIPMSDGSGMLKLPQNCMVDPTKPLSKI 205

Query: 667  KSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDH-----TSA 503
            KS + KP +  KFS+V  + S  I   S   + YH T+GK +K GSVKG  H     T  
Sbjct: 206  KSTNIKPIRKGKFSKVRAESSANIAAASNSTSSYHSTRGKGDKSGSVKGCSHVGDTWTRN 265

Query: 502  KATTNSAHSDTQGRELPIKQRSKINSNRFHAKSKSNGQ-TSETKVVGGIRSVENFQKPIR 326
               T S  SDT  +   + Q+SK  SN   + S  N     +TK++G       F     
Sbjct: 266  TVDTRSLSSDTHNKR-SMPQKSKAYSNYSTSNSNFNSNPLRDTKMIGIAPVSRQF----- 319

Query: 325  SMRMPTETAPFAKLERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVAL 146
                    +    +E V  +LRQL WGP AE+ L+NLNC +DAYQANQ+LKQ+QDH VAL
Sbjct: 320  -------GSSGHVVENVSRILRQLSWGPAAEEALRNLNCLMDAYQANQVLKQIQDHPVAL 372

Query: 145  GFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            GFFYWLKR+ GFKHDGH+YTTMVGILGRA+QFGAIN+LL EMV DGC
Sbjct: 373  GFFYWLKRQTGFKHDGHTYTTMVGILGRARQFGAINKLLAEMVRDGC 419


>ref|XP_006476670.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Citrus sinensis]
          Length = 856

 Score =  276 bits (706), Expect = 2e-71
 Identities = 167/395 (42%), Positives = 222/395 (56%), Gaps = 23/395 (5%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR K I+ LS +ARSFF++GSR                  +SR +  +  V  P +A  
Sbjct: 1    MLRAKHITNLSSTARSFFLNGSRCSASDGSSCTCSEDETC-VSRRQHNAQMVYTPARAGV 59

Query: 940  LAS----------KSSTIVVAQHSGDSVGACVSIVSGDGRTSQEYSVHSSTLVGNFVKAS 791
            + S          KS  + V   S       VS  S      ++  + SS +   FVKA 
Sbjct: 60   VVSGEAAKAAGLQKSERVSVPSPSSLGRSDHVSYASSVDAVPKDV-LTSSPISDQFVKAG 118

Query: 790  IATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSPHPKPHKVEKFSQVHGK 611
            +A   FLSDL+N+K P  DG G + +P N MVD +R +S +K  + K  + E  S+V+  
Sbjct: 119  VAAVCFLSDLVNYKLPALDGSGTANSPTNFMVDPTRPLSNIKPANVKTIRRENVSKVYPN 178

Query: 610  PSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKAT-----TNSAHSDTQGRELPIK 446
             S E   GS P T YH+ K K +     +     S  +      T++  SD   R   ++
Sbjct: 179  SSAESTVGSNPSTGYHNAKDKGDNSNIARRFKRVSNASNGTSLETHNVSSDNSDRRRIVQ 238

Query: 445  QRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRMPTETAPFAK------- 287
             RSK +SNR ++  KSN Q S+ KVV  +   E F KP R M++P   APF++       
Sbjct: 239  PRSKAHSNRLNSNFKSNLQPSDAKVVECVS--ERFSKPSREMKIPAGLAPFSRHFASTGN 296

Query: 286  -LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVALGFFYWLKRRAGF 110
             +E V  +LRQ KWGP+AE+ L N N  +DAYQANQ+LKQLQDHTVALGFF WL+R+AGF
Sbjct: 297  VVESVSRILRQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVALGFFNWLRRQAGF 356

Query: 109  KHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            KHD H+YTTMVGILGRA+QFGAIN+LLD+MV DGC
Sbjct: 357  KHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGC 391


>ref|XP_004299605.1| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Fragaria vesca subsp. vesca]
          Length = 879

 Score =  275 bits (704), Expect = 3e-71
 Identities = 171/413 (41%), Positives = 228/413 (55%), Gaps = 41/413 (9%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNK-------------- 983
            +LR K +S LS SARSFFISGSR                    R +              
Sbjct: 5    LLRAKHLSNLSSSARSFFISGSRCSATEGNSCTCSEDENCGPQRQQAINRGLLAQNPSSS 64

Query: 982  -------------QESNEVPNPHKASALASKSSTIVVAQHSGDSVGACVSIVSGDGRTSQ 842
                         Q++ +V +  K+ ++  ++S   VA      V A     + +    Q
Sbjct: 65   VSKPAAGAGILISQDAVKVADSGKSKSVDQRTSIKQVATAPTPFVRADTVSYATNVDAIQ 124

Query: 841  EYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKS 662
            +    S      FVKA +A   FLSD++++K P SDG+G    P+N MV  +  VS VKS
Sbjct: 125  KDISSSPPTTEQFVKAGVAAVNFLSDIVSYKIPLSDGMGMLTLPKNTMVRPTVGVSSVKS 184

Query: 661  PHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATTNSA 482
             + K    E F  VH KPS E    S+  + +  +KG  +K  SVKGL+H     T NSA
Sbjct: 185  SNVKQINRENFISVHPKPSTETAAASERTSNHQGSKGNYDKSNSVKGLNHVPYTRTENSA 244

Query: 481  --HS----DTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSM 320
              HS    +T+ R   + ++SK   N F    KSN Q S+ +        + F KP+R M
Sbjct: 245  VAHSVQTLETRDRRA-LPRKSKAQPNHFVPDFKSNVQISDAETTRC--GSKGFSKPVREM 301

Query: 319  RMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQ 164
            +MPT  APF +        ++ V H+L+QLKWGP AE +L+NLNC +DAYQANQILKQLQ
Sbjct: 302  KMPTAIAPFNRQFVHNGNVVQNVSHILQQLKWGPSAEASLRNLNCSMDAYQANQILKQLQ 361

Query: 163  DHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            DHTVALGFF WLKR+AGF+HDGH+YTTMVGILGRA+QFGAIN+LL++MV++GC
Sbjct: 362  DHTVALGFFNWLKRQAGFRHDGHTYTTMVGILGRARQFGAINKLLNQMVNEGC 414


>ref|XP_002511505.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223550620|gb|EEF52107.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 876

 Score =  273 bits (699), Expect = 1e-70
 Identities = 175/414 (42%), Positives = 224/414 (54%), Gaps = 42/414 (10%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPH---- 953
            MLR KQ+S LS +ARSFF+SGSR                    R +  +N V        
Sbjct: 1    MLRAKQLSNLSSNARSFFLSGSRCSTSDGSSCTCSEDESCLPRRQQTRNNAVLAQRGPAL 60

Query: 952  --KASALASKSSTI-----VVAQHSGDSV-----------------GACVSIVSGDGRTS 845
              KASA  S++S +     ++  H  +SV                   CVS  SG     
Sbjct: 61   VPKASARVSQTSLLGDAGKLLVPHKVESVECPTLPQVVSAPISIRKSDCVSYASGIDAVE 120

Query: 844  QEYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVK 665
             +    S  +   F KA IA   FLSDL+N+K P +DG G + +P+NCMVD +R  S V+
Sbjct: 121  NDIPYSSPPISDQFFKAGIAAVSFLSDLVNYKLPITDGSGIN-SPKNCMVDPTRPQSTVR 179

Query: 664  SPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATTNS 485
            S + KP + E  S+V+ K S E    S   + Y  T+ K+EK   +KG    S     NS
Sbjct: 180  SSNVKPIRRENCSKVYPKASPE-AAVSSSTSNYDSTRDKSEKSSFIKGSKRVSNTPAGNS 238

Query: 484  AH-----SDTQGRELPIKQRSKINSNRFHAKSKSNGQTSETK-VVGGIRSVENFQKPIRS 323
                   SDT  R + I Q+SK  SNR  A   +N QT +T     G    E+++KP R 
Sbjct: 239  VKTCSIASDTCDRRI-IPQKSKGQSNRSTANFNANVQTVQTSDTKYGEYVAEDYRKPPRE 297

Query: 322  MRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQL 167
             +MP    P  +        +E V H+LRQ++WGP AE+ L NLN  +D YQANQ+LKQL
Sbjct: 298  TKMPVVRVPSTRRFASNGHIVENVAHILRQIRWGPAAEEALANLNYSMDPYQANQVLKQL 357

Query: 166  QDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            QDHTVAL FFYWLKR+ GF HDGH+YTTMVGILGRAKQFGAIN+LLD+MV DGC
Sbjct: 358  QDHTVALNFFYWLKRQPGFNHDGHTYTTMVGILGRAKQFGAINKLLDQMVKDGC 411


>ref|XP_006439668.1| hypothetical protein CICLE_v10018829mg [Citrus clementina]
            gi|557541930|gb|ESR52908.1| hypothetical protein
            CICLE_v10018829mg [Citrus clementina]
          Length = 856

 Score =  271 bits (694), Expect = 4e-70
 Identities = 165/395 (41%), Positives = 221/395 (55%), Gaps = 23/395 (5%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR K I+ LS +ARSFF++GSR                  +SR +  +  V  P +A  
Sbjct: 1    MLRAKHITNLSSTARSFFLNGSRCSASDGSSCTCSEDETC-VSRRQHNAQMVYTPARAGV 59

Query: 940  LAS----------KSSTIVVAQHSGDSVGACVSIVSGDGRTSQEYSVHSSTLVGNFVKAS 791
            + S          KS  + V   S       VS  S      ++  + SS +   FVKA 
Sbjct: 60   VVSGEAVKAAGLQKSERVSVPSPSSLGRSDHVSYASTVDAVPKDV-LTSSPISDQFVKAG 118

Query: 790  IATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSPHPKPHKVEKFSQVHGK 611
            +A   FLSDL+N+K P  DG G + +P N MVD +R +S +K  + K  + E  S+V+  
Sbjct: 119  VAAVSFLSDLVNYKLPALDGSGTANSPTNFMVDPTRPLSNIKPANVKTIRRENVSKVYPN 178

Query: 610  PSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKAT-----TNSAHSDTQGRELPIK 446
             S E   GS P T  H+ K K +     +     S  +      T++  SD   R   ++
Sbjct: 179  SSAESTVGSNPSTGCHNAKDKGDNSNIARRFKRVSNASNGTSLETHNVSSDNSDRRRIVQ 238

Query: 445  QRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRMPTETAPFAK------- 287
             RSK +SNR ++  KSN Q S+ KVV  +   E F KP R M++P   APF++       
Sbjct: 239  PRSKAHSNRLNSNFKSNLQPSDAKVVECVS--ERFSKPSREMKIPAGLAPFSRHFASTGN 296

Query: 286  -LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVALGFFYWLKRRAGF 110
             +E V  +L+Q KWGP+AE+ L N N  +DAYQANQ+LKQLQDHTVALGFF WL+R+AGF
Sbjct: 297  VVESVSRILQQWKWGPLAEEALGNTNYSMDAYQANQVLKQLQDHTVALGFFNWLRRQAGF 356

Query: 109  KHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            KHD H+YTTMVGILGRA+QFGAIN+LLD+MV DGC
Sbjct: 357  KHDEHTYTTMVGILGRARQFGAINKLLDQMVRDGC 391


>gb|EXC34220.1| hypothetical protein L484_010090 [Morus notabilis]
          Length = 872

 Score =  264 bits (674), Expect = 9e-68
 Identities = 175/420 (41%), Positives = 228/420 (54%), Gaps = 48/420 (11%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR KQI  LS SARSFF+SGSR                T +SR +   +        S 
Sbjct: 1    MLRAKQIGNLSNSARSFFLSGSRCNAADGSSSCTCSEDETCVSRRQNLRHGGILAQNPST 60

Query: 940  LASKSSTIVVAQHSGDSVGA----------------------------CVSIVSGDGRTS 845
            LAS++S  V    SGD+V A                            CVS  S    T 
Sbjct: 61   LASRTSARVGTLISGDAVKAVSTEKASMHNPTSLKQVIISPKSLGRSECVSYAS----TV 116

Query: 844  QEYSVHSSTLVGN-FVKASIATAGFLSDLLNFKFPTSDGIGA--SFTPQNCMVDSSRTVS 674
            ++   HSS +  + FVKA +A   FLSD++N+KFP SDGIG   +  PQNCMVD +R  +
Sbjct: 117  EKNVEHSSPVFSDQFVKAGVAAVNFLSDVMNYKFPLSDGIGIFNNNLPQNCMVDPARLST 176

Query: 673  VVKSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYH---DTKGKAEKYGSVKGLDHT-- 509
             ++S H    K + FS VH +PSVE         QY+    TK K  K  SVKG+++   
Sbjct: 177  SIRSSHVNHVKRKNFSGVHPRPSVEAA------VQYNSTSSTKSKDSKSSSVKGVNNVPN 230

Query: 508  ----SAKATTNSAHSDTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENF 341
                ++ AT +        R +P + ++ +NS +    S SN  T    V  G +    F
Sbjct: 231  TRNGNSWATRSVPAEARDRRAIPNRTKACLNSFKADFSSDSNQSTDGGNVGFGNK---GF 287

Query: 340  QKPIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQAN 185
             +P R M  PT  AP  +        +ERV H+L  L+WG  AE+ L+NLN  +DA+QAN
Sbjct: 288  NRPPREMNFPTGYAPIKRPYANTANVVERVSHMLHGLRWGRAAEEALENLNYAMDAFQAN 347

Query: 184  QILKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            Q+LKQLQDH VALGFFYWLKR+AGFKHDGH+YTTMVGILGR+++FGAIN+LL EMV +GC
Sbjct: 348  QVLKQLQDHNVALGFFYWLKRQAGFKHDGHTYTTMVGILGRSREFGAINKLLHEMVKEGC 407


>ref|XP_002321537.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222868533|gb|EEF05664.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 834

 Score =  261 bits (666), Expect = 8e-67
 Identities = 163/397 (41%), Positives = 208/397 (52%), Gaps = 25/397 (6%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR KQ+  LS SARSFF+SGSR                T +S  +Q  N +    K S 
Sbjct: 1    MLRAKQLGNLSSSARSFFLSGSRCSATDGSSSCTCSEDETCVSTRQQPRNSILLAQKPSN 60

Query: 940  LASKSSTIVVAQHSGDS------------VGACVSIVSGDGRTSQEYSVHSSTLVGNFVK 797
              SK+S  V A  SGD             +  CVS   G     ++    S  +   FV+
Sbjct: 61   FGSKTSARVEASVSGDGSSFLLPQKSSCGMSGCVSYAIGIDIAEKDVGHSSPPISDQFVR 120

Query: 796  ASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSPHPKPHKVEKFSQVH 617
              IA   FLSDL+N+K PTSDG   + T  NCM+D +R +S +KS + KP + E F++ +
Sbjct: 121  VGIAAVSFLSDLVNYKLPTSDGTVINSTI-NCMIDPTRQLSNIKSSNVKPIRRENFTKAY 179

Query: 616  GKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKAT-----TNSAHSDTQGRELP 452
               S EI  GS     Y+  K +  K   V+G    S+ A      ++S  SD   +   
Sbjct: 180  PNSSAEIPVGSNAAVNYNSMKDRGNKSSFVRGFKQVSSIAADSSLDSHSLPSDAFDKRRT 239

Query: 451  IKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRMPTETAPFAK----- 287
            I QR K   NR                           +P R  +MP   A  A+     
Sbjct: 240  IPQRLKAQPNR---------------------------RPSRDTKMPAVVARSARQFVST 272

Query: 286  ---LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVALGFFYWLKRRA 116
               +E V  +LRQL+WGP AE+ L NLNC +DAYQANQ+LKQLQDHTVALGFF+WLK+  
Sbjct: 273  GHVVENVSQILRQLRWGPSAEEALVNLNCHMDAYQANQVLKQLQDHTVALGFFHWLKQLP 332

Query: 115  GFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            GFKHDG++YTTMVGILGRAKQF AIN+LLD+MV DGC
Sbjct: 333  GFKHDGYTYTTMVGILGRAKQFVAINKLLDQMVRDGC 369


>ref|XP_003527053.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Glycine max]
          Length = 882

 Score =  260 bits (664), Expect = 1e-66
 Identities = 171/418 (40%), Positives = 226/418 (54%), Gaps = 46/418 (11%)
 Frame = -2

Query: 1120 MLRTKQIS-TLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKAS 944
            MLR K IS TLS +ARS  +SGSR                    R ++++NE     K  
Sbjct: 4    MLRAKAISSTLSSNARSILLSGSRCNAADGNSCNCPEDETCVSKRQQRKNNEDLLAPKPP 63

Query: 943  ALASKSSTIVV------------AQHSGDSVG--ACVSIV--------SGDGRTS----- 845
            +L SK+++ VV            A H+  SVG   CV  V          +  TS     
Sbjct: 64   SLVSKATSQVVGTLVSGNLANGPASHNVGSVGQSGCVQKVRPTSYAPSKSESVTSACVVD 123

Query: 844  --QEYSVHSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVS 674
              QE+  HSS+L    F +A IA   F+SD++N+K P SDG+G     +NCMVD +R + 
Sbjct: 124  GVQEHVAHSSSLNADQFYRAGIAAVNFISDVVNYKLPLSDGMGILNYSKNCMVDPARALP 183

Query: 673  VVKSPHPKPHKVEKFSQVHGKPSVEIGGG-SKPKTQYHDTKGKAEKYGSVKGLDHTSAKA 497
             ++S + +  + E F+ VH KP V    G SK     H  KGKA K    KG  + +A  
Sbjct: 184  KIRSSNVQQIRTENFTSVHPKPPVPAHPGPSKHTNNNHGAKGKANKSNLAKGFKYVAASG 243

Query: 496  TTNSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQK 335
            T  S  +          R LP  QR++ NSN F     SN Q+S  ++    +  E+F K
Sbjct: 244  TEKSGAAPNIPVNNHDRRALP--QRTRTNSNHFVTNFGSNMQSSNPQMARPFK--ESFNK 299

Query: 334  PIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQI 179
              R + MP   AP  +        +E V  +L+QL+WGP  E+ L NLN  +DAYQANQI
Sbjct: 300  HTRDLNMPAGIAPTRRHFTNSGHVVEGVKDILKQLRWGPATEKALYNLNFSIDAYQANQI 359

Query: 178  LKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            LKQLQDH+VAL FFYWLKR+ GF HDGH+YTTMVGILGRA++FGAIN+LL++MV DGC
Sbjct: 360  LKQLQDHSVALSFFYWLKRQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGC 417


>ref|XP_006578589.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            isoform X2 [Glycine max]
          Length = 898

 Score =  259 bits (662), Expect = 2e-66
 Identities = 168/421 (39%), Positives = 227/421 (53%), Gaps = 46/421 (10%)
 Frame = -2

Query: 1129 EDKMLRTKQIS-TLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPH 953
            +  MLR KQIS TLS +ARS  + GSR                    R ++++NE     
Sbjct: 17   QKNMLRAKQISSTLSSNARSILLGGSRCNAADGNSCTCPEDETCVSKRQQRKNNEDLLAL 76

Query: 952  KASALASKSSTIVV------------AQHSGDSVGACVSIVS----------GDGRTS-- 845
            K  +L SK+++ VV            A H   SVG    +             D  TS  
Sbjct: 77   KPPSLVSKATSQVVGTLVSGNLANGPASHKAGSVGQSGRVQQVQPTSYAPSKSDSATSAC 136

Query: 844  -----QEYSVHSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSR 683
                 Q++  HSS+L    F +A IA   F+SD++N+K P SDG+G     +N MVD +R
Sbjct: 137  VVDGVQDHVAHSSSLNADQFYRAGIAAVNFISDVVNYKLPLSDGMGILNYSKNYMVDPAR 196

Query: 682  TVSVVKSPHPKPHKVEKFSQVHGKPSVEIGGG-SKPKTQYHDTKGKAEKYGSVKGLDHTS 506
             +  ++S + +  K E F+ VH KP V    G SK    +H  KGKA+K    KG  H +
Sbjct: 197  ALPKIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKGKADKSNLAKGFKHVA 256

Query: 505  AKATTNSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVEN 344
            +  T  S  +          R LP  QR++ +SN F A   SN Q+S  ++ G  +  E+
Sbjct: 257  SSGTEKSGAAPNIPVNNHDRRALP--QRTRTHSNHFVANFGSNMQSSNPQMAGPFK--ES 312

Query: 343  FQKPIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQA 188
            F K  R + MP    P  +        +E V  +L+QL+WGP  E+TL NLN  +DAYQA
Sbjct: 313  FNKHTRDLNMPAGIVPTKRHFTNSGHVVEVVKDILKQLRWGPATEKTLYNLNFSIDAYQA 372

Query: 187  NQILKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDG 8
            NQILKQLQDH+VA+GFF WLKR+ GF HDGH+YTTMVGILGRA++FGAIN+LL++MV DG
Sbjct: 373  NQILKQLQDHSVAVGFFCWLKRQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDG 432

Query: 7    C 5
            C
Sbjct: 433  C 433


>ref|XP_003523047.2| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            isoform X1 [Glycine max]
          Length = 882

 Score =  259 bits (661), Expect = 3e-66
 Identities = 168/418 (40%), Positives = 226/418 (54%), Gaps = 46/418 (11%)
 Frame = -2

Query: 1120 MLRTKQIS-TLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKAS 944
            MLR KQIS TLS +ARS  + GSR                    R ++++NE     K  
Sbjct: 4    MLRAKQISSTLSSNARSILLGGSRCNAADGNSCTCPEDETCVSKRQQRKNNEDLLALKPP 63

Query: 943  ALASKSSTIVV------------AQHSGDSVGACVSIVS----------GDGRTS----- 845
            +L SK+++ VV            A H   SVG    +             D  TS     
Sbjct: 64   SLVSKATSQVVGTLVSGNLANGPASHKAGSVGQSGRVQQVQPTSYAPSKSDSATSACVVD 123

Query: 844  --QEYSVHSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVS 674
              Q++  HSS+L    F +A IA   F+SD++N+K P SDG+G     +N MVD +R + 
Sbjct: 124  GVQDHVAHSSSLNADQFYRAGIAAVNFISDVVNYKLPLSDGMGILNYSKNYMVDPARALP 183

Query: 673  VVKSPHPKPHKVEKFSQVHGKPSVEIGGG-SKPKTQYHDTKGKAEKYGSVKGLDHTSAKA 497
             ++S + +  K E F+ VH KP V    G SK    +H  KGKA+K    KG  H ++  
Sbjct: 184  KIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKGKADKSNLAKGFKHVASSG 243

Query: 496  TTNSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQK 335
            T  S  +          R LP  QR++ +SN F A   SN Q+S  ++ G  +  E+F K
Sbjct: 244  TEKSGAAPNIPVNNHDRRALP--QRTRTHSNHFVANFGSNMQSSNPQMAGPFK--ESFNK 299

Query: 334  PIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQI 179
              R + MP    P  +        +E V  +L+QL+WGP  E+TL NLN  +DAYQANQI
Sbjct: 300  HTRDLNMPAGIVPTKRHFTNSGHVVEVVKDILKQLRWGPATEKTLYNLNFSIDAYQANQI 359

Query: 178  LKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            LKQLQDH+VA+GFF WLKR+ GF HDGH+YTTMVGILGRA++FGAIN+LL++MV DGC
Sbjct: 360  LKQLQDHSVAVGFFCWLKRQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGC 417


>ref|XP_006578590.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            isoform X3 [Glycine max]
          Length = 879

 Score =  259 bits (661), Expect = 3e-66
 Identities = 168/418 (40%), Positives = 226/418 (54%), Gaps = 46/418 (11%)
 Frame = -2

Query: 1120 MLRTKQIS-TLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKAS 944
            MLR KQIS TLS +ARS  + GSR                    R ++++NE     K  
Sbjct: 1    MLRAKQISSTLSSNARSILLGGSRCNAADGNSCTCPEDETCVSKRQQRKNNEDLLALKPP 60

Query: 943  ALASKSSTIVV------------AQHSGDSVGACVSIVS----------GDGRTS----- 845
            +L SK+++ VV            A H   SVG    +             D  TS     
Sbjct: 61   SLVSKATSQVVGTLVSGNLANGPASHKAGSVGQSGRVQQVQPTSYAPSKSDSATSACVVD 120

Query: 844  --QEYSVHSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVS 674
              Q++  HSS+L    F +A IA   F+SD++N+K P SDG+G     +N MVD +R + 
Sbjct: 121  GVQDHVAHSSSLNADQFYRAGIAAVNFISDVVNYKLPLSDGMGILNYSKNYMVDPARALP 180

Query: 673  VVKSPHPKPHKVEKFSQVHGKPSVEIGGG-SKPKTQYHDTKGKAEKYGSVKGLDHTSAKA 497
             ++S + +  K E F+ VH KP V    G SK    +H  KGKA+K    KG  H ++  
Sbjct: 181  KIRSSNVQQIKKENFTAVHPKPPVPTHPGPSKHTNNHHGAKGKADKSNLAKGFKHVASSG 240

Query: 496  TTNSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQK 335
            T  S  +          R LP  QR++ +SN F A   SN Q+S  ++ G  +  E+F K
Sbjct: 241  TEKSGAAPNIPVNNHDRRALP--QRTRTHSNHFVANFGSNMQSSNPQMAGPFK--ESFNK 296

Query: 334  PIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQI 179
              R + MP    P  +        +E V  +L+QL+WGP  E+TL NLN  +DAYQANQI
Sbjct: 297  HTRDLNMPAGIVPTKRHFTNSGHVVEVVKDILKQLRWGPATEKTLYNLNFSIDAYQANQI 356

Query: 178  LKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            LKQLQDH+VA+GFF WLKR+ GF HDGH+YTTMVGILGRA++FGAIN+LL++MV DGC
Sbjct: 357  LKQLQDHSVAVGFFCWLKRQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGC 414


>ref|XP_007157658.1| hypothetical protein PHAVU_002G087700g [Phaseolus vulgaris]
            gi|561031073|gb|ESW29652.1| hypothetical protein
            PHAVU_002G087700g [Phaseolus vulgaris]
          Length = 881

 Score =  251 bits (641), Expect = 6e-64
 Identities = 166/418 (39%), Positives = 220/418 (52%), Gaps = 46/418 (11%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESN-EVPNPHKAS 944
            MLR KQISTLS +ARSF + GSR                  +SR +Q  N E     K S
Sbjct: 4    MLRAKQISTLSSNARSFLLGGSRCNGADGNSCTCPEDETC-ISRGQQRKNSEDLVVQKPS 62

Query: 943  ALASKSSTIVV------------AQHSGDSVG--ACVSIVSGDGRTS------------- 845
            +L SK+++ VV            A H    VG   CV  +                    
Sbjct: 63   SLVSKTTSQVVGTLVSGSLANGPASHKAGDVGQSGCVQQIRSTSFAPSRPDSVTYACVVD 122

Query: 844  --QEYSVHSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVS 674
              Q++  HSS++    F +A IA   F+SD++N+KFP SDG+G     +N MVD  R + 
Sbjct: 123  GVQDHVAHSSSVNADQFYRAGIAAVNFISDVVNYKFPLSDGMGILNYSKNYMVDPGRALP 182

Query: 673  VVKSPHPKPHKVEKFSQVHGKPSVEIGGG-SKPKTQYHDTKGKAEKYGSVKGLDHTSAKA 497
             ++S + K  + E F+ VH KP V    G SKP   +H  KGK +K    KG    ++  
Sbjct: 183  SIRSSNVKQIRKESFTAVHPKPPVSTHPGPSKPTNNHHGAKGKGDKSNLAKGFKPVASPG 242

Query: 496  TTNSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQK 335
               S  +          R LP  QR++   NRF     SN  +S  ++ G  +  E+F K
Sbjct: 243  IEKSGEAPNIPVNSHDRRALP--QRTRTRPNRFVTNFGSNMPSSNPQMAGSFK--ESFCK 298

Query: 334  PIRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQI 179
              R++ M    AP  +        ++ V  +LRQLKWGP  E+ L NLN  +DAYQANQI
Sbjct: 299  YTRNVNMAAGIAPSNRHFTNSGHVVDMVKDMLRQLKWGPATEKALCNLNFSIDAYQANQI 358

Query: 178  LKQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            LKQLQDH+VAL FFYWLK + GF HDGH+YTTMVGILGRA++FGAIN+LL++MV DGC
Sbjct: 359  LKQLQDHSVALSFFYWLKLQPGFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGC 416


>ref|XP_004513211.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Cicer arietinum]
          Length = 879

 Score =  248 bits (633), Expect = 5e-63
 Identities = 161/415 (38%), Positives = 228/415 (54%), Gaps = 43/415 (10%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR K IS+LS SARSFF+SGSR                  +SR ++  NE     K S+
Sbjct: 4    MLRAKPISSLSSSARSFFLSGSRCNAPDANSCTCNEDETC-VSRRQETKNENLLMQKPSS 62

Query: 940  LASKSSTIVVAQHSGDSV-----------GACVSIVSGDGRTSQEYSV------------ 830
            ++  +S +     SG+S            G+   + S    +S+  SV            
Sbjct: 63   VSKTTSLVERTLVSGNSASSHKVKGIDQSGSVQQVRSNSSPSSKSDSVTYACVADDIPNH 122

Query: 829  --HSSTL-VGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVVKSP 659
              HSS L    F +A IA   F+SD ++ K P SDG+G     +NCMV+ + T++ ++S 
Sbjct: 123  VTHSSPLDTDQFYRAGIAAVNFISDFVHCKLPVSDGMGILSYSKNCMVEPASTITSIRSS 182

Query: 658  HPKPHKVEKFSQVHGKPSVEIGGGSK--PKTQYHDTKGKAEKYGSVKGLDHTSAKATTNS 485
            + K  + E F  VH KP V    GS     + Y+ +KGK +K    KG  H ++ AT  S
Sbjct: 183  NVKQIRKEDFISVHPKPPVSNHPGSSNHASSSYNGSKGKGDKSKFGKGFKHIASSATEKS 242

Query: 484  ------AHSDTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRS 323
                  A ++  GR  P+ QR++ ++N+F   S SN QTS + ++G  +  E+F +  R 
Sbjct: 243  EVAPNIAFNNHDGRR-PLPQRTRTHTNQFVTNSGSNVQTSNSHMLGSFK--ESFHRHPRY 299

Query: 322  MRMPTETAP----FAK-----LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQ 170
            ++    ++     F K     +E V  +L+QLKWGP  E+ L NL   +DAYQ NQILKQ
Sbjct: 300  LKTSAGSSSTKTHFTKTGHRVVEVVIDILQQLKWGPATEEALYNLKSSIDAYQGNQILKQ 359

Query: 169  LQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            L+DH+VAL FFYWLKR+  F HDGH+YTTMVGILGRA++FGAIN+LL++MV DGC
Sbjct: 360  LEDHSVALSFFYWLKRQPNFWHDGHTYTTMVGILGRAREFGAINKLLEQMVKDGC 414


>ref|XP_003525037.2| PREDICTED: pentatricopeptide repeat-containing protein At1g74750-like
            [Glycine max]
          Length = 876

 Score =  247 bits (630), Expect = 1e-62
 Identities = 163/411 (39%), Positives = 223/411 (54%), Gaps = 39/411 (9%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR KQI+ LS SAR+F I GSR                  +SR +   N V    K S+
Sbjct: 4    MLRAKQITALSSSARTFLIGGSRCNAADGSSCSCTEDEIC-VSRRQHIKNHVLPAQKPSS 62

Query: 940  LASKSSTIV----VAQHS----------GDSVGACVSIVSGDGRTS-------------- 845
            LASK+++ V    V+++S          G    +C+  +      +              
Sbjct: 63   LASKATSEVDETLVSENSVNGPACCKAKGVDQSSCIQQIRSASSPACKSDSVTYACDIDG 122

Query: 844  -QEYSVHSSTLVGN-FVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSV 671
             QE+  H S L  + F +ASIA   FLSDL N+KFP S+G G     +NCMVD++RT   
Sbjct: 123  VQEHVEHLSPLNSDQFYRASIAAINFLSDLANYKFPLSNGKGILSYSKNCMVDTARTPPN 182

Query: 670  VKSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATT 491
            ++S + K  K E F+ VH +PSV     SK    +H  K K +K    KG  H  +    
Sbjct: 183  IRSSNVKQIKRENFTSVHPRPSVSTNSRSKRAGHHHSGKCKGDKSNLGKGFKHIPSSGME 242

Query: 490  NSAHSDT------QGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSV-ENFQKP 332
             S  S        + R  P  QR+   SN       S  + S T++V  + ++ E+F K 
Sbjct: 243  KSVVSPNIPLNNHEHRAFP--QRTTTKSNHIVTNFGSYMRASNTQMVEVVPTIKESFNKH 300

Query: 331  IRSMRMPTETAPFAK--LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDH 158
             R ++M   TAP  +  +E V  +LRQL+WGP AE+ L NLN  +DAYQANQILKQLQD 
Sbjct: 301  PRDLKMSARTAPMNRRIVEVVSDILRQLRWGPTAEKALYNLNFSMDAYQANQILKQLQDP 360

Query: 157  TVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            +VALGFF WL+R+ GF+HDGH+YTTMVGILGRA++F +I++LL++MV DGC
Sbjct: 361  SVALGFFDWLRRQPGFRHDGHTYTTMVGILGRARRFDSISKLLEQMVKDGC 411


>ref|XP_004138146.1| PREDICTED: pentatricopeptide repeat-containing protein At1g18900-like
            [Cucumis sativus]
          Length = 874

 Score =  246 bits (627), Expect = 3e-62
 Identities = 162/417 (38%), Positives = 217/417 (52%), Gaps = 45/417 (10%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR KQI +LS SARSFF+SGSR                  +S  +   NE     K S 
Sbjct: 1    MLRAKQIGSLSNSARSFFLSGSRCNADGASCTCPEDETC--VSERQNARNETLPSQKPST 58

Query: 940  LASKSS-----------TIVVAQHSGDSVGACVSI--VSGDGRTSQ-------------- 842
            L + SS             V+  H  D+V   VSI  V+  G   Q              
Sbjct: 59   LVANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNT 118

Query: 841  --EYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVV 668
              +    S  +    VKA I      SD +NFK P+SD  G   + +NCMVD +R+++ V
Sbjct: 119  VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSV 178

Query: 667  KSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQY-HDTKGKAEKYGSVKGLDHTSAKATT 491
            K    K  + E  S+VH +PSVEI   SKP++   H +  K  +   VKG     ++A T
Sbjct: 179  KPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEART 238

Query: 490  -------NSAHSDTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKP 332
                   N +      R LP  QR++++SN F +   S  QT+ +       S +NF+K 
Sbjct: 239  QKLVVFQNISSDKCDKRNLP--QRTRVHSNSFTSHFHSIAQTTGSDFTN---SSKNFKKF 293

Query: 331  IRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQIL 176
              +++ PT  AP           +E V  +L+QLKWGP AE+ +  LNC +DAYQANQIL
Sbjct: 294  PDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQIL 353

Query: 175  KQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            K++ DH VALGFFYWLKR   F+HDGH+YTTM+G+LGRAKQF AIN+LLD+M+ DGC
Sbjct: 354  KRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGC 410


>ref|XP_004154991.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At1g18900-like [Cucumis sativus]
          Length = 874

 Score =  244 bits (624), Expect = 6e-62
 Identities = 162/417 (38%), Positives = 216/417 (51%), Gaps = 45/417 (10%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR KQI +LS SARSFF SGSR                  +S  +   NE     K S 
Sbjct: 1    MLRAKQIGSLSNSARSFFXSGSRCNADGASCTCPEDETC--VSERQNARNETLPSQKPST 58

Query: 940  LASKSS-----------TIVVAQHSGDSVGACVSI--VSGDGRTSQ-------------- 842
            L + SS             V+  H  D+V   VSI  V+  G   Q              
Sbjct: 59   LVANSSPRVGPLIAEEAAKVIVSHKTDNVDLSVSIRQVANTGPNHQRGAECVRYASGLNT 118

Query: 841  --EYSVHSSTLVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQNCMVDSSRTVSVV 668
              +    S  +    VKA I      SD +NFK P+SD  G   + +NCMVD +R+++ V
Sbjct: 119  VLDGECTSPRIADQVVKAGIMAVNLFSDFVNFKIPSSDYGGTFSSSKNCMVDPARSITSV 178

Query: 667  KSPHPKPHKVEKFSQVHGKPSVEIGGGSKPKTQY-HDTKGKAEKYGSVKGLDHTSAKATT 491
            K    K  + E  S+VH +PSVEI   SKP++   H +  K  +   VKG     ++A T
Sbjct: 179  KPSKIKHLRRENISRVHSRPSVEIPVDSKPQSSSNHGSNCKPAQSSYVKGSRQEVSEART 238

Query: 490  -------NSAHSDTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKP 332
                   N +      R LP  QR++++SN F +   S  QT+ +       S +NF+K 
Sbjct: 239  QKLVVFQNISSDKCDKRNLP--QRTRVHSNSFTSHFHSIAQTTGSDFTN---SSKNFKKF 293

Query: 331  IRSMRMPTETAPFAK--------LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQIL 176
              +++ PT  AP           +E V  +L+QLKWGP AE+ +  LNC +DAYQANQIL
Sbjct: 294  PDNLKSPTGMAPITSSFLNAPNVVESVSCILQQLKWGPAAEEAIGKLNCSIDAYQANQIL 353

Query: 175  KQLQDHTVALGFFYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            K++ DH VALGFFYWLKR   F+HDGH+YTTM+G+LGRAKQF AIN+LLD+M+ DGC
Sbjct: 354  KRVDDHAVALGFFYWLKRLPRFRHDGHTYTTMIGLLGRAKQFAAINKLLDQMIKDGC 410


>gb|EYU35898.1| hypothetical protein MIMGU_mgv1a019674mg [Mimulus guttatus]
          Length = 846

 Score =  233 bits (594), Expect = 2e-58
 Identities = 159/405 (39%), Positives = 217/405 (53%), Gaps = 33/405 (8%)
 Frame = -2

Query: 1120 MLRTKQISTLSKSARSFFISGSRXXXXXXXXXXXXXXXXTRLSRNKQESNEVPNPHKASA 941
            MLR K + TLS+SARSFF+SGSR                   SR    +N V +   +S 
Sbjct: 1    MLRAKNLGTLSQSARSFFLSGSRCSAADGSSCTCCDDETCA-SRRPLGAN-VKHLQSSSK 58

Query: 940  LASKSSTIVVAQHSGDSVGACVS-----------------------IVSGDGRTSQEYSV 830
            L SK+S  V +  S DS+    S                       +   D    ++  V
Sbjct: 59   LLSKASVGVASLTSVDSIKEANSKKPENAPQPQVVPTPRVLNRANFVSYEDDFDEKDNVV 118

Query: 829  HSST-LVGNFVKASIATAGFLSDLLNFKFPTSDGIGASFTPQ-NCMVDSSRTVSVVKSPH 656
            +SS  +  +FVKA +A  G LSDL+N++ P +DG     T   N MVD ++T+S V+  +
Sbjct: 119  YSSPPIADHFVKAGMAAVGLLSDLVNYRIPMTDGSAMHNTSAPNSMVDRTKTISNVRPAN 178

Query: 655  PKPHKVEKFSQVHGKPSVEIGGGSKPKTQYHDTKGKAEKYGSVKGLDHTSAKATTNSAHS 476
             K  + +K    + KPS      S+P + Y DTK +A+  G  KG    +     +    
Sbjct: 179  VKTSRKDK---AYVKPS------SEPVSSY-DTKSRADSSGFGKGFVAANESDNASEFFV 228

Query: 475  DTQGRELPIKQRSKINSNRFHAKSKSNGQTSETKVVGGIRSVENFQKPIRSMRM----PT 308
            +T+ R+ P+ Q+S+  SN+F  + K  G  SE            F K IR  ++    P 
Sbjct: 229  ETRERKRPVAQKSRAYSNKF-VEGKLGGNKSEV-----------FAKDIRHTKVVITRPA 276

Query: 307  ETAPFAK----LERVHHVLRQLKWGPMAEQTLKNLNCPLDAYQANQILKQLQDHTVALGF 140
            +  PF+     +E V  +L  L+WGP  E+ L  LNC LDAYQANQILKQLQD+TVALGF
Sbjct: 277  QARPFSTSSPIVESVSRILHHLQWGPPTEEALCKLNCTLDAYQANQILKQLQDYTVALGF 336

Query: 139  FYWLKRRAGFKHDGHSYTTMVGILGRAKQFGAINRLLDEMVSDGC 5
            FYWLKR+ GFKHDGH+YTTMVGILGRA+QF AI++LLD+M+ DGC
Sbjct: 337  FYWLKRKPGFKHDGHTYTTMVGILGRARQFAAIDKLLDQMIKDGC 381


Top