BLASTX nr result

ID: Sinomenium22_contig00034788 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00034788
         (583 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citr...   119   5e-25
ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containi...   112   7e-23
emb|CBI29222.3| unnamed protein product [Vitis vinifera]              112   7e-23
ref|XP_002303480.2| pentatricopeptide repeat-containing family p...   103   3e-20
ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prun...   103   3e-20
ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containi...    96   9e-18
ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containi...    95   1e-17
ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containi...    92   1e-16
ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily pr...    90   4e-16
ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containi...    84   4e-14
gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus...    82   8e-14
ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutr...    80   4e-13
sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-c...    80   4e-13
emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|72687...    80   4e-13
ref|NP_567587.1| pentatricopeptide repeat-containing protein [Ar...    80   4e-13
ref|XP_003628993.1| Pentatricopeptide repeat-containing protein ...    79   1e-12
ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containi...    78   2e-12
ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phas...    77   3e-12
ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Caps...    77   3e-12
gb|AHB18408.1| pentatricopeptide repeat-containing protein [Goss...    76   6e-12

>ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citrus clementina]
           gi|568835123|ref|XP_006471629.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Citrus sinensis]
           gi|568835125|ref|XP_006471630.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Citrus sinensis]
           gi|557534991|gb|ESR46109.1| hypothetical protein
           CICLE_v10000274mg [Citrus clementina]
          Length = 833

 Score =  119 bits (299), Expect = 5e-25
 Identities = 69/165 (41%), Positives = 93/165 (56%), Gaps = 8/165 (4%)
 Frame = +2

Query: 113 LSRPKRLLILFSFNRSFSSVQHDQNQQQKEHEQRFDH--------DEIVVKKVLSILSNQ 268
           LS PK   +  + +R  + V     QQQ+ H +            ++ ++K V S+LS Q
Sbjct: 6   LSIPKPCSLSIAVSRPLTHVTSTAQQQQELHNRNQQQQPPPPQSSNQSLLKWVSSVLSKQ 65

Query: 269 SLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIR 448
           SLD SKC+  L +LS + FD +FF + S++NPKTAL FF FAS+S  FRFTV SYC LIR
Sbjct: 66  SLDPSKCKLFLPNLSPQEFDTLFFSIRSNVNPKTALKFFYFASQSCNFRFTVRSYCLLIR 125

Query: 449 LLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
           LL+  N             DGK+P L+ S    RH+EIA  + DL
Sbjct: 126 LLLFSNLLSPARLLLIRLIDGKMPVLYASNPSIRHIEIASQMVDL 170


>ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic [Vitis vinifera]
          Length = 1022

 Score =  112 bits (280), Expect = 7e-23
 Identities = 59/120 (49%), Positives = 78/120 (65%)
 Frame = +2

Query: 224 DEIVVKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASES 403
           D  ++K V SILSN SLD ++C++++  LS  +FD VFF V  ++NPKTALNFF FAS+S
Sbjct: 113 DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 172

Query: 404 LGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
            GFRFT+ SYC L+R L+                D KLP LF   ++ RH+EIA A+ADL
Sbjct: 173 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN-RHIEIASAMADL 231


>emb|CBI29222.3| unnamed protein product [Vitis vinifera]
          Length = 826

 Score =  112 bits (280), Expect = 7e-23
 Identities = 59/120 (49%), Positives = 78/120 (65%)
 Frame = +2

Query: 224 DEIVVKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASES 403
           D  ++K V SILSN SLD ++C++++  LS  +FD VFF V  ++NPKTALNFF FAS+S
Sbjct: 46  DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 105

Query: 404 LGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
            GFRFT+ SYC L+R L+                D KLP LF   ++ RH+EIA A+ADL
Sbjct: 106 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN-RHIEIASAMADL 164


>ref|XP_002303480.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550342907|gb|EEE78459.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 842

 Score =  103 bits (258), Expect = 3e-20
 Identities = 54/119 (45%), Positives = 74/119 (62%)
 Frame = +2

Query: 224 DEIVVKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASES 403
           ++ ++K+V  ILSN SLD +KC+E++  LS + FD  F  + S++NPKTALNFF F SE+
Sbjct: 60  NQSLLKRVSLILSNPSLDCAKCKELVPHLSPQEFDSCFLALKSNVNPKTALNFFHFVSET 119

Query: 404 LGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVAD 580
             FRFT  SYC LI LLV  +             DGK+PA +    + RH EIA+ +AD
Sbjct: 120 CKFRFTARSYCVLIHLLVGNDLLSPARLLLIRLIDGKVPAFYARNFESRHFEIAQIMAD 178


>ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica]
           gi|462413303|gb|EMJ18352.1| hypothetical protein
           PRUPE_ppa001463mg [Prunus persica]
          Length = 821

 Score =  103 bits (257), Expect = 3e-20
 Identities = 55/113 (48%), Positives = 74/113 (65%)
 Frame = +2

Query: 245 VLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTV 424
           V SILS  SLD SKC+ ++  LS+  FDRVF  + S++NPKTAL+FF FASES  F+FTV
Sbjct: 57  VSSILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTV 116

Query: 425 GSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
            S+C L+RLL+  N             DG +P L+ +  ++RH+EIA A+ DL
Sbjct: 117 RSFCVLVRLLILSNLVSPARLLLIRLIDGNVPVLY-ANHNQRHMEIAIAMLDL 168


>ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Cucumis sativus]
          Length = 822

 Score = 95.5 bits (236), Expect = 9e-18
 Identities = 56/149 (37%), Positives = 78/149 (52%)
 Frame = +2

Query: 137 ILFSFNRSFSSVQHDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLCDLST 316
           +LF F+R    V   Q  ++   +  +   + +   V S+LS+ SLD SKC  +L  LS 
Sbjct: 14  VLFPFSRRLVCVSSTQPHKEHHQDPPWQSQDQLHLWVSSVLSHSSLDSSKCSALLPHLSP 73

Query: 317 RRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXX 496
            +FD++FF +    NP T LNFF FAS S  FRFT+ SYC LI LL+             
Sbjct: 74  SQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTIHSYCTLILLLIRSKFIPPARLLLI 133

Query: 497 XXXDGKLPALFESERDRRHLEIARAVADL 583
              DG LP L   + ++ H+EIA A+  L
Sbjct: 134 RLIDGNLPVL-NLDSEKFHIEIANALFGL 161


>ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum tuberosum]
          Length = 928

 Score = 95.1 bits (235), Expect = 1e-17
 Identities = 58/167 (34%), Positives = 92/167 (55%), Gaps = 7/167 (4%)
 Frame = +2

Query: 104 IALLSRPKRLLILFSFNRSFSSV-------QHDQNQQQKEHEQRFDHDEIVVKKVLSILS 262
           IA+ S  KR L    +  S   +       Q ++     E +Q+   D  + K V+S+LS
Sbjct: 101 IAIFSHIKRPLTCVIYTASSDQISEPLQKAQSNKPNPSSEKKQKNGLDLNLRKWVVSVLS 160

Query: 263 NQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHL 442
           N  +D  K +++L  L+ ++FD +F  ++SS+ P   L FF  AS + GF F+V SYC L
Sbjct: 161 NPPVDSLKIKDLLTLLTPQQFDAIFLEIYSSLKPLNVLKFFHVASGTCGFSFSVRSYCTL 220

Query: 443 IRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
           +RLLV  N             DGKLPALF++ + ++H+E+A ++A+L
Sbjct: 221 LRLLVASNHDVPARLLLIRLIDGKLPALFDTSQ-QKHVEVAVSLAEL 266


>ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum lycopersicum]
          Length = 839

 Score = 92.0 bits (227), Expect = 1e-16
 Identities = 59/167 (35%), Positives = 89/167 (53%), Gaps = 7/167 (4%)
 Frame = +2

Query: 104 IALLSRPKRLLILFSFNRSFSSVQH-----DQNQQQ--KEHEQRFDHDEIVVKKVLSILS 262
           IA+ S  KR L    +  S   +       D N+     E +Q    D  + K V+S+LS
Sbjct: 12  IAIFSHIKRPLTCVIYTASSDQISEPLQKGDSNKPNPSSEKKQIKGLDLNLRKWVVSVLS 71

Query: 263 NQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHL 442
           +  +D  K +++L  L+ ++FD +F  +HSS+ P   L FF  AS +  F FTV SYC L
Sbjct: 72  DPPVDSLKIKDLLTLLNPQQFDAIFLEIHSSLKPLNVLKFFHVASGTCSFSFTVRSYCTL 131

Query: 443 IRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
           +RLL+  N             DGKLPALF+S   ++H+E+A ++A+L
Sbjct: 132 VRLLIASNHDAPARLLLIRLIDGKLPALFDS-LQQKHVEVAVSLAEL 177


>ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590680604|ref|XP_007040907.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680608|ref|XP_007040908.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680612|ref|XP_007040909.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680616|ref|XP_007040910.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680620|ref|XP_007040911.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778151|gb|EOY25407.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778152|gb|EOY25408.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778153|gb|EOY25409.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778154|gb|EOY25410.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778155|gb|EOY25411.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778156|gb|EOY25412.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 845

 Score = 90.1 bits (222), Expect = 4e-16
 Identities = 50/122 (40%), Positives = 71/122 (58%)
 Frame = +2

Query: 218 DHDEIVVKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFAS 397
           ++++ ++ ++  ILS  SLD SKC+++L  LS   FDR F  + S +NPKT L+FF  AS
Sbjct: 62  NNNQGLLGRLSCILSKSSLDSSKCKQLLPLLSPLDFDRFFSAISSHLNPKTTLHFFYLAS 121

Query: 398 ESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVA 577
           +S  FRFT+ SYC LI LL+  N             DGKLP    +     H++I  A+A
Sbjct: 122 QSFNFRFTLRSYCILILLLLLANHSSPARLLFIRLIDGKLPLSSPNNTTIDHIQITTALA 181

Query: 578 DL 583
           DL
Sbjct: 182 DL 183


>ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Cicer arietinum]
           gi|502153968|ref|XP_004509526.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Cicer arietinum]
           gi|502153970|ref|XP_004509527.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X3 [Cicer arietinum]
           gi|502153972|ref|XP_004509528.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X4 [Cicer arietinum]
           gi|502153974|ref|XP_004509529.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X5 [Cicer arietinum]
           gi|502153976|ref|XP_004509530.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X6 [Cicer arietinum]
           gi|502153978|ref|XP_004509531.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X7 [Cicer arietinum]
           gi|502153980|ref|XP_004509532.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X8 [Cicer arietinum]
           gi|502153982|ref|XP_004509533.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X9 [Cicer arietinum]
           gi|502153984|ref|XP_004509534.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X10 [Cicer arietinum]
          Length = 835

 Score = 83.6 bits (205), Expect = 4e-14
 Identities = 50/113 (44%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
 Frame = +2

Query: 251 SILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGS 430
           SILS++ LD SKC+ +L  L+  +FD +FF  HS++N KT L+FF FAS    F FTV S
Sbjct: 62  SILSHKILDSSKCKSILPHLTPHQFDTLFFTHHSTVNLKTTLDFFRFASNQFKFCFTVRS 121

Query: 431 YCHLIRLLVCKNXXXXXXXXXXXXXDGKL--PALFESERDRRHLEIARAVADL 583
           YC LIRLL+C N             DG +  P L    RD R  E+A +  +L
Sbjct: 122 YCLLIRLLLCSNHLPRARFFMKRLIDGNVSTPLL---NRDDRLSEMASSFLEL 171


>gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus guttatus]
          Length = 847

 Score = 82.4 bits (202), Expect = 8e-14
 Identities = 42/114 (36%), Positives = 61/114 (53%)
 Frame = +2

Query: 239 KKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRF 418
           K + S+LS  + + ++C+E++  +S R+FD +F+ +H++I P TAL  F FA +   F F
Sbjct: 74  KSLASVLSGSNFNSNQCKELISQISPRQFDSIFWEIHNNIEPSTALKLFYFAGDYCSFSF 133

Query: 419 TVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVAD 580
           T+ SYC L  LLV KN             D KLP          H EIA  +AD
Sbjct: 134 TLRSYCILFHLLVSKNLDSAARLLLIRLIDRKLPVSLRDNVVNLHNEIAIVLAD 187


>ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum]
           gi|557115148|gb|ESQ55431.1| hypothetical protein
           EUTSA_v10024401mg [Eutrema salsugineum]
          Length = 837

 Score = 80.1 bits (196), Expect = 4e-13
 Identities = 53/155 (34%), Positives = 83/155 (53%), Gaps = 2/155 (1%)
 Frame = +2

Query: 125 KRLLILFSFNRSFSSVQHDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLC 304
           +RL  +    +SF +  H   QQ ++ E+    D  + +++ + LS +SLD  +C++++ 
Sbjct: 39  RRLKSIAYPRKSFHTTLH--LQQLEKSEEASSSDRHLRERLSAALSRRSLDYEQCKQLIA 96

Query: 305 DLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXX 484
            LS   FDR+F    S +NPKTAL+FF  AS+S  F F++ SYC LI LL+  +      
Sbjct: 97  TLSPHEFDRLFPDFRSKVNPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDASLLSPAR 156

Query: 485 XXXXXXXDGKLPALFESERDRR--HLEIARAVADL 583
                  +G +P L  S  D R   + IA A+A L
Sbjct: 157 LVLIRLINGNVPVL-PSANDSRDGRVAIADAMASL 190


>sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-containing protein
           At4g19440, chloroplastic; Flags: Precursor
          Length = 838

 Score = 80.1 bits (196), Expect = 4e-13
 Identities = 51/147 (34%), Positives = 77/147 (52%)
 Frame = +2

Query: 143 FSFNRSFSSVQHDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLCDLSTRR 322
           F  +R      H  ++ ++    R  H+     ++ S+LS +SLD  +C++++  LS   
Sbjct: 51  FHTSRYLQQCVHRPDKSEETSSDRHLHE-----RLSSVLSKRSLDYEQCKQLITVLSPLE 105

Query: 323 FDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXX 502
           FDR+F    S +NPKTAL+FF  AS+S  F F++ SYC LI LL+  N            
Sbjct: 106 FDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRL 165

Query: 503 XDGKLPALFESERDRRHLEIARAVADL 583
            +G +P L    RD R + IA A+A L
Sbjct: 166 INGNVPVLPCGLRDSR-VAIADAMASL 191


>emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|7268739|emb|CAB78946.1|
           putative protein [Arabidopsis thaliana]
          Length = 814

 Score = 80.1 bits (196), Expect = 4e-13
 Identities = 51/147 (34%), Positives = 77/147 (52%)
 Frame = +2

Query: 143 FSFNRSFSSVQHDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLCDLSTRR 322
           F  +R      H  ++ ++    R  H+     ++ S+LS +SLD  +C++++  LS   
Sbjct: 27  FHTSRYLQQCVHRPDKSEETSSDRHLHE-----RLSSVLSKRSLDYEQCKQLITVLSPLE 81

Query: 323 FDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXX 502
           FDR+F    S +NPKTAL+FF  AS+S  F F++ SYC LI LL+  N            
Sbjct: 82  FDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRL 141

Query: 503 XDGKLPALFESERDRRHLEIARAVADL 583
            +G +P L    RD R + IA A+A L
Sbjct: 142 INGNVPVLPCGLRDSR-VAIADAMASL 167


>ref|NP_567587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|334186696|ref|NP_001190771.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|15810161|gb|AAL07224.1| unknown protein [Arabidopsis
           thaliana] gi|332658782|gb|AEE84182.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658783|gb|AEE84183.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 825

 Score = 80.1 bits (196), Expect = 4e-13
 Identities = 51/147 (34%), Positives = 77/147 (52%)
 Frame = +2

Query: 143 FSFNRSFSSVQHDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLCDLSTRR 322
           F  +R      H  ++ ++    R  H+     ++ S+LS +SLD  +C++++  LS   
Sbjct: 38  FHTSRYLQQCVHRPDKSEETSSDRHLHE-----RLSSVLSKRSLDYEQCKQLITVLSPLE 92

Query: 323 FDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXX 502
           FDR+F    S +NPKTAL+FF  AS+S  F F++ SYC LI LL+  N            
Sbjct: 93  FDRLFPEFRSKVNPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRL 152

Query: 503 XDGKLPALFESERDRRHLEIARAVADL 583
            +G +P L    RD R + IA A+A L
Sbjct: 153 INGNVPVLPCGLRDSR-VAIADAMASL 178


>ref|XP_003628993.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355523015|gb|AET03469.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 819

 Score = 78.6 bits (192), Expect = 1e-12
 Identities = 45/111 (40%), Positives = 65/111 (58%)
 Frame = +2

Query: 251 SILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGS 430
           SIL+++ LD SKC+ ++ +L+   F+  FF  H+++N KT L+FFSFAS++  FRFTV S
Sbjct: 54  SILAHKVLDSSKCKTLIPNLTPHEFEHSFFTHHTTVNLKTTLDFFSFASKNFKFRFTVRS 113

Query: 431 YCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
           YC LIRLL+  N             +G      + + D R  EIA A  +L
Sbjct: 114 YCILIRLLLASNHIPRAKFTLKRLIEGNANTPLK-KTDARLSEIASAFLEL 163


>ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Glycine max]
          Length = 840

 Score = 77.8 bits (190), Expect = 2e-12
 Identities = 37/95 (38%), Positives = 55/95 (57%)
 Frame = +2

Query: 236 VKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFR 415
           +  + SIL++++LD SKC+ +L  L+   FDR+F  +H ++NPKT   FF FA+    FR
Sbjct: 60  LSSIPSILTSKTLDSSKCKSILPHLTPHHFDRLFLSLHRTVNPKTTHEFFRFATRHCNFR 119

Query: 416 FTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLP 520
           FTV SYC L+R L+  +             DG +P
Sbjct: 120 FTVRSYCLLLRSLLADSFVPRARFLLARLIDGHVP 154


>ref|XP_007156329.1| hypothetical protein PHAVU_003G277400g [Phaseolus vulgaris]
           gi|561029683|gb|ESW28323.1| hypothetical protein
           PHAVU_003G277400g [Phaseolus vulgaris]
          Length = 837

 Score = 77.4 bits (189), Expect = 3e-12
 Identities = 42/111 (37%), Positives = 61/111 (54%)
 Frame = +2

Query: 251 SILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGS 430
           S+L+   LD SKC+ +L  LS   FDR+FF +H ++NP T L+FF  A+    F FT  S
Sbjct: 67  SLLTTGVLDSSKCKSILPHLSPLEFDRLFFPIHHTVNPITTLDFFRLATNRFKFPFTFRS 126

Query: 431 YCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDRRHLEIARAVADL 583
           YC L+R L+  +             DG +P  F  +R+ R  EIA ++ +L
Sbjct: 127 YCLLLRSLLASSLLPRARSLVTRLIDGHVPTSFH-DRENRLREIASSMLEL 176


>ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Capsella rubella]
           gi|482551263|gb|EOA15456.1| hypothetical protein
           CARUB_v10004123mg [Capsella rubella]
          Length = 838

 Score = 77.0 bits (188), Expect = 3e-12
 Identities = 43/117 (36%), Positives = 63/117 (53%)
 Frame = +2

Query: 176 HDQNQQQKEHEQRFDHDEIVVKKVLSILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSS 355
           H  ++ ++    R  HD     ++ S+LS +SLD   C++++  LS   FDR+F    S 
Sbjct: 59  HPPDKSEEASSDRHLHD-----RLSSVLSKRSLDYELCKQLITVLSPLEFDRLFPEFRSK 113

Query: 356 INPKTALNFFSFASESLGFRFTVGSYCHLIRLLVCKNXXXXXXXXXXXXXDGKLPAL 526
           +NPKTALNFF  AS+S  F F++ SYC LI LL+  N             +G +P L
Sbjct: 114 VNPKTALNFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSPARVTLIRLINGNVPVL 170


>gb|AHB18408.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 846

 Score = 76.3 bits (186), Expect = 6e-12
 Identities = 50/113 (44%), Positives = 60/113 (53%), Gaps = 2/113 (1%)
 Frame = +2

Query: 251 SILSNQSLDRSKCREVLCDLSTRRFDRVFFGVHSSINPKTALNFFSFASESLGFRFTVGS 430
           SILS  SLD SK +++L  LS   FDR F  +    +PKT LNFF  AS    FRFT+ S
Sbjct: 85  SILSKPSLDSSKSKQLLPLLSPSDFDRFFIALSPRADPKTTLNFFHLASRCFNFRFTLRS 144

Query: 431 YCHLIRLLVCKNXXXXXXXXXXXXXDGKLPALFESERDR--RHLEIARAVADL 583
           Y  LI LL+  N             DGKLP LF         H++IA A+ADL
Sbjct: 145 YYILILLLLLSNNSSAARLLLIRLIDGKLP-LFSPNNPPTVNHIQIAIALADL 196


Top