BLASTX nr result

ID: Rehmannia31_contig00004307 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00004307
         (670 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PIN17871.1| hypothetical protein CDL12_09472 [Handroanthus im...   319   e-103
ref|XP_011082312.1| pentatricopeptide repeat-containing protein ...   311   e-100
ref|XP_012854228.1| PREDICTED: pentatricopeptide repeat-containi...   301   7e-96
gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Erythra...   273   4e-85
ref|XP_022736384.1| pentatricopeptide repeat-containing protein ...   268   2e-83
ref|XP_022875458.1| pentatricopeptide repeat-containing protein ...   267   8e-83
gb|OMO93484.1| hypothetical protein COLO4_16917 [Corchorus olito...   265   5e-82
emb|CDL67990.1| putative pentatricopeptide repeat-containing pro...   251   7e-82
ref|XP_017973404.1| PREDICTED: pentatricopeptide repeat-containi...   263   3e-81
gb|AIF73144.1| tetratricopeptide repeat-like superfamily protein...   260   4e-80
ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containi...   259   6e-80
ref|XP_021276562.1| pentatricopeptide repeat-containing protein ...   259   8e-80
gb|EOY25237.1| Tetratricopeptide repeat-like superfamily protein...   259   5e-79
ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containi...   256   2e-78
ref|XP_023913662.1| pentatricopeptide repeat-containing protein ...   255   2e-78
ref|XP_015084956.1| PREDICTED: pentatricopeptide repeat-containi...   255   2e-78
ref|XP_021593507.1| pentatricopeptide repeat-containing protein ...   255   3e-78
ref|XP_012475633.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   255   6e-78
gb|PPD83735.1| hypothetical protein GOBAR_DD19314 [Gossypium bar...   253   1e-77
ref|XP_016717819.1| PREDICTED: pentatricopeptide repeat-containi...   253   1e-77

>gb|PIN17871.1| hypothetical protein CDL12_09472 [Handroanthus impetiginosus]
          Length = 606

 Score =  319 bits (818), Expect = e-103
 Identities = 154/194 (79%), Positives = 170/194 (87%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEK--KSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MVGASVLNQPQQFLI  E   KSP++DFN KE ECISL+K  KNM+EFKQVH QILKLG 
Sbjct: 1   MVGASVLNQPQQFLIPPEHNGKSPEVDFNQKEQECISLVKRSKNMEEFKQVHGQILKLGL 60

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
           FWSSFCASNLV+TCALSEWGSMDYACSIFEQIDDPSSFEFN MIRGYIKD NS+EA+FTY
Sbjct: 61  FWSSFCASNLVATCALSEWGSMDYACSIFEQIDDPSSFEFNAMIRGYIKDMNSEEALFTY 120

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           ++MLE+GV+PD+FTYPALLK CA LSA+ EG QIHGQI+K+GFVED FVQNSLIN YGKC
Sbjct: 121 LEMLEMGVKPDNFTYPALLKGCAFLSAIEEGIQIHGQILKLGFVEDAFVQNSLINKYGKC 180

Query: 626 GLLRHSCAVFEQMD 667
           G +  SC VFE MD
Sbjct: 181 GRISDSCTVFEHMD 194



 Score = 66.2 bits (160), Expect = 6e-09
 Identities = 46/161 (28%), Positives = 82/161 (50%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C     ++E  Q+H QILKLGF   +F  ++L++     + G +  +C++FE +D
Sbjct: 137 ALLKGCAFLSAIEEGIQIHGQILKLGFVEDAFVQNSLINK--YGKCGRISDSCTVFEHMD 194

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I  +       E +  +  M   G    ++ T  ++L AC  L  ++ G+
Sbjct: 195 QKTIASWSALIAAHANMGFWDECLKLFSQMNREGCWRAEESTLVSVLLACTHLGVLDSGR 254

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
            IHG +++     +V V+ SLI+MY  CG L     +F  M
Sbjct: 255 AIHGYLLRNLTGLNVAVETSLIDMYIHCGNLDKGICLFRGM 295


>ref|XP_011082312.1| pentatricopeptide repeat-containing protein At1g31920 [Sesamum
           indicum]
          Length = 606

 Score =  311 bits (797), Expect = e-100
 Identities = 148/194 (76%), Positives = 172/194 (88%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQE--KKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MVGASVLNQPQQFLI+ +   K P+ID N KE ECISL+K C N++EFKQ H QILKLGF
Sbjct: 1   MVGASVLNQPQQFLIAHDHHSKIPEIDINRKEQECISLIKRCTNIEEFKQAHAQILKLGF 60

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
           F SSFCASNL++TCALSEWGSMDYACSIF+QI DP SFEFN MIRG++K+F+S+ A+ TY
Sbjct: 61  FCSSFCASNLIATCALSEWGSMDYACSIFQQIYDPGSFEFNAMIRGHVKEFDSEAALCTY 120

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           +DMLE+GVEPD+FTYP+LLKACA LSAV EGKQIHGQ++K+GFVEDVFVQNSLINMYGKC
Sbjct: 121 LDMLEVGVEPDNFTYPSLLKACASLSAVEEGKQIHGQVLKLGFVEDVFVQNSLINMYGKC 180

Query: 626 GLLRHSCAVFEQMD 667
           G ++HS AVFEQMD
Sbjct: 181 GQIKHSRAVFEQMD 194



 Score = 74.3 bits (181), Expect = 1e-11
 Identities = 49/160 (30%), Positives = 89/160 (55%), Gaps = 4/160 (2%)
 Frame = +2

Query: 194 SLLKTCKNM---KEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           SLLK C ++   +E KQ+H Q+LKLGF    F  ++L++     + G + ++ ++FEQ+D
Sbjct: 137 SLLKACASLSAVEEGKQIHGQVLKLGFVEDVFVQNSLINM--YGKCGQIKHSRAVFEQMD 194

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I  Y       E +  + +M   G    ++ T  +++ ACA L A++ G+
Sbjct: 195 RKTVASWSAVIAAYANLGMWDECLSLFGEMNFEGCWRAEESTLVSVVSACAHLGALDLGR 254

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQ 661
             HG +++     +V VQ SLI+MY KCG L    ++F++
Sbjct: 255 STHGYLLRNLSGLNVAVQTSLIDMYIKCGSLDKGMSLFQR 294


>ref|XP_012854228.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           [Erythranthe guttata]
          Length = 607

 Score =  301 bits (770), Expect = 7e-96
 Identities = 142/194 (73%), Positives = 167/194 (86%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEK--KSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MV AS+ NQP QF I ++   K+P+IDF +KE ECISL+KTC++M EFK+VH +ILKLG 
Sbjct: 1   MVVASIPNQPHQFSIPKDNHGKNPEIDFGVKEQECISLVKTCRSMDEFKKVHGKILKLGL 60

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
           FWSSFCASNL++TCALSEWGSMDYACSIF Q+DDP SFEFNTMIRGY+KD NS+EA FTY
Sbjct: 61  FWSSFCASNLLATCALSEWGSMDYACSIFRQMDDPDSFEFNTMIRGYVKDMNSEEAFFTY 120

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           ++MLE GVEPD+FTYP LLKAC++LSA  EG QIHGQI KMGFVEDV VQNSLIN+YGKC
Sbjct: 121 LEMLEFGVEPDNFTYPPLLKACSILSAFAEGAQIHGQIYKMGFVEDVMVQNSLINVYGKC 180

Query: 626 GLLRHSCAVFEQMD 667
           G ++ SCAVF +MD
Sbjct: 181 GRVKRSCAVFRRMD 194


>gb|EYU23303.1| hypothetical protein MIMGU_mgv1a003335mg [Erythranthe guttata]
          Length = 592

 Score =  273 bits (697), Expect = 4e-85
 Identities = 130/176 (73%), Positives = 152/176 (86%), Gaps = 2/176 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEK--KSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MV AS+ NQP QF I ++   K+P+IDF +KE ECISL+KTC++M EFK+VH +ILKLG 
Sbjct: 1   MVVASIPNQPHQFSIPKDNHGKNPEIDFGVKEQECISLVKTCRSMDEFKKVHGKILKLGL 60

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
           FWSSFCASNL++TCALSEWGSMDYACSIF Q+DDP SFEFNTMIRGY+KD NS+EA FTY
Sbjct: 61  FWSSFCASNLLATCALSEWGSMDYACSIFRQMDDPDSFEFNTMIRGYVKDMNSEEAFFTY 120

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINM 613
           ++MLE GVEPD+FTYP LLKAC++LSA  EG QIHGQI KMGFVEDV VQNSLIN+
Sbjct: 121 LEMLEFGVEPDNFTYPPLLKACSILSAFAEGAQIHGQIYKMGFVEDVMVQNSLINV 176


>ref|XP_022736384.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
           [Durio zibethinus]
          Length = 605

 Score =  268 bits (686), Expect = 2e-83
 Identities = 125/194 (64%), Positives = 154/194 (79%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLI--SQEKKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           M G SVL +P +F    +   +SP++   LKE EC+SLLK CKN++EFKQ H QI+K GF
Sbjct: 1   MTGTSVL-KPTKFFSPPADPPQSPELSLRLKEQECLSLLKKCKNLEEFKQAHAQIVKWGF 59

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
           FW+SFCASNLV  CALS+WGSMDYACSIF+QID+P +F+FNTMIR ++KD N +EA+F Y
Sbjct: 60  FWNSFCASNLVVACALSDWGSMDYACSIFQQIDEPGTFDFNTMIRAHVKDMNFEEALFFY 119

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
            +MLE G+EPD+FTYP+L KACA L A  EG QIHG   K+GF  D++VQNSLINMYGKC
Sbjct: 120 YEMLERGIEPDNFTYPSLFKACAWLQAQEEGMQIHGHAFKLGFENDLYVQNSLINMYGKC 179

Query: 626 GLLRHSCAVFEQMD 667
           G ++HSCAVFEQMD
Sbjct: 180 GEIKHSCAVFEQMD 193



 Score = 76.3 bits (186), Expect = 2e-12
 Identities = 49/161 (30%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           SL K C   +  +E  Q+H    KLGF    +  ++L++     + G + ++C++FEQ+D
Sbjct: 136 SLFKACAWLQAQEEGMQIHGHAFKLGFENDLYVQNSLINM--YGKCGEIKHSCAVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I  +       E +  + +M   G   P++ T   +L ACA L A++ GK
Sbjct: 194 EKSVASWSAIIAAHASVGMWYECLMIFGNMSSEGCWRPEESTLVTVLSACAYLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
              G +++     +V VQ SLI+MY KCG L    +VF +M
Sbjct: 254 CTQGSLLRNISELNVIVQTSLIDMYVKCGCLEKGLSVFRKM 294


>ref|XP_022875458.1| pentatricopeptide repeat-containing protein At1g31920 [Olea
           europaea var. sylvestris]
 ref|XP_022875459.1| pentatricopeptide repeat-containing protein At1g31920 [Olea
           europaea var. sylvestris]
 ref|XP_022875460.1| pentatricopeptide repeat-containing protein At1g31920 [Olea
           europaea var. sylvestris]
          Length = 603

 Score =  267 bits (682), Expect = 8e-83
 Identities = 130/193 (67%), Positives = 158/193 (81%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEKKSPDIDFNLK-EHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           MVG+SVL+QP+  LI   +   +I+ + K E +CI LLK C +++E KQVH QILKLG F
Sbjct: 1   MVGSSVLHQPK-LLIPTHRNGLEIESDSKREQDCIFLLKKCTSVEELKQVHAQILKLGLF 59

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
             SFCASNLVSTCALSEWGSMDYACSIF+  DDP SF+FNTMIRG+IK  N ++A+FTY+
Sbjct: 60  CKSFCASNLVSTCALSEWGSMDYACSIFKHRDDPDSFDFNTMIRGHIKHMNLEQALFTYL 119

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +ML  G+EPD+FTYPALLKAC  LSAV++G QIHGQI K+GFV+DVFVQNSLINMYGKCG
Sbjct: 120 EMLHSGIEPDNFTYPALLKACTRLSAVDQGMQIHGQIFKLGFVDDVFVQNSLINMYGKCG 179

Query: 629 LLRHSCAVFEQMD 667
            ++ SC VFEQM+
Sbjct: 180 DIKRSCVVFEQME 192



 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 45/162 (27%), Positives = 82/162 (50%), Gaps = 5/162 (3%)
 Frame = +2

Query: 194 SLLKTCKNMK---EFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C  +    +  Q+H QI KLGF    F  ++L++     + G +  +C +FEQ++
Sbjct: 135 ALLKACTRLSAVDQGMQIHGQIFKLGFVDDVFVQNSLINM--YGKCGDIKRSCVVFEQME 192

Query: 365 DPSSF--EFNTMIRGYIKDFNSKEAIFTYIDMLEIGVEPDDFTYPALLKACALLSAVNEG 538
           D       ++++I  Y       E +  + +M EIG+  ++     +L AC  L A++  
Sbjct: 193 DSRKTIASWSSVISAYASTGMWSECLRLFGEMNEIGLRAEESILVNVLCACTHLGALDLA 252

Query: 539 KQIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
              HG +++     +V V+ +LI++Y KCG       +FE+M
Sbjct: 253 MCTHGYLLRNLTGLNVIVETTLIDVYMKCGFPDRGLLLFEKM 294


>gb|OMO93484.1| hypothetical protein COLO4_16917 [Corchorus olitorius]
          Length = 605

 Score =  265 bits (677), Expect = 5e-82
 Identities = 124/193 (64%), Positives = 149/193 (77%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEK-KSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           M G SVL Q + F    +   SPD+   LKE +C+SLLK CKN++EFKQ H QI+K GFF
Sbjct: 1   MTGTSVLQQIKFFSPPADPPSSPDLSLRLKEQDCLSLLKRCKNIEEFKQAHAQIVKWGFF 60

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
           W+SFCASNLV TCALS+WGSMDYACSIF+QID+P +FEFNTMIR ++K  N +EA++ Y 
Sbjct: 61  WNSFCASNLVVTCALSDWGSMDYACSIFQQIDEPGTFEFNTMIRAHVKSMNFEEALYFYF 120

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +M+E G+EPD+FTYP L KACA L A  EG QIHG   K GF  D++VQNSLINMYGKCG
Sbjct: 121 EMVERGIEPDNFTYPTLFKACAWLRAQEEGMQIHGHAFKFGFGSDLYVQNSLINMYGKCG 180

Query: 629 LLRHSCAVFEQMD 667
            + HSCAVFEQMD
Sbjct: 181 NIEHSCAVFEQMD 193



 Score = 78.6 bits (192), Expect = 4e-13
 Identities = 47/161 (29%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   +  +E  Q+H    K GF    +  ++L++     + G+++++C++FEQ+D
Sbjct: 136 TLFKACAWLRAQEEGMQIHGHAFKFGFGSDLYVQNSLINM--YGKCGNIEHSCAVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I  +       E + T+  M   G   P++ T   +L AC  L A++ GK
Sbjct: 194 EKSVASWSAIIAAHASLGRWSECLMTFGKMSSEGHWRPEESTLVTVLSACTHLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L    ++F +M
Sbjct: 254 STHGSLLRNISELNVIVQTSLIDMYVKCGCLEKGLSLFRKM 294



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 41/196 (20%), Positives = 92/196 (46%), Gaps = 4/196 (2%)
 Frame = +2

Query: 89  IMVGASVLNQPQQFLISQEKKSPDIDFNLKEHECISLLKTCKNMKEF---KQVHCQILKL 259
           I+   + L +  + L++  K S +  +  +E   +++L  C ++      K  H  +L+ 
Sbjct: 203 IIAAHASLGRWSECLMTFGKMSSEGHWRPEESTLVTVLSACTHLGALDLGKSTHGSLLRN 262

Query: 260 GFFWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIF 439
               +    ++L+      + G ++   S+F ++   +   ++ +I G     N +EA+ 
Sbjct: 263 ISELNVIVQTSLIDMYV--KCGCLEKGLSLFRKMAKRNQMSYSVIISGLAMHGNGEEALR 320

Query: 440 TYIDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQI-IKMGFVEDVFVQNSLINMY 616
            + +MLE G++PDD  Y  +L AC+    V+EG Q   ++  + G    V     ++++ 
Sbjct: 321 IFSEMLEEGLDPDDVVYVGVLSACSHAGLVDEGFQCFDRMKSEHGIKPTVLHYGCMVDLM 380

Query: 617 GKCGLLRHSCAVFEQM 664
           GK G+++ +    + M
Sbjct: 381 GKAGMIKEALEFIKSM 396


>emb|CDL67990.1| putative pentatricopeptide repeat-containing protein At1g31920,
           partial [Olea europaea]
          Length = 199

 Score =  251 bits (642), Expect = 7e-82
 Identities = 117/156 (75%), Positives = 136/156 (87%)
 Frame = +2

Query: 200 LKTCKNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSF 379
           LK CKNM+EFKQ+H QI+K GF WSSFC+SNL++TCALSEWGSMDYACSIF QI+DP SF
Sbjct: 3   LKKCKNMQEFKQIHGQIIKFGFLWSSFCSSNLLATCALSEWGSMDYACSIFHQIEDPGSF 62

Query: 380 EFNTMIRGYIKDFNSKEAIFTYIDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQI 559
           EFNTMIRG+ KD N  EA+F YI+MLE  VE D+FT+PA+LKAC+ LSA+ EG QIHGQI
Sbjct: 63  EFNTMIRGHNKDVNFAEALFIYIEMLEREVEQDNFTFPAILKACSGLSALLEGMQIHGQI 122

Query: 560 IKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQMD 667
            K+G+VED+FVQNSLINMYGKCG + HSCAVFEQMD
Sbjct: 123 FKLGYVEDLFVQNSLINMYGKCGKISHSCAVFEQMD 158


>ref|XP_017973404.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           isoform X1 [Theobroma cacao]
          Length = 605

 Score =  263 bits (672), Expect = 3e-81
 Identities = 124/193 (64%), Positives = 150/193 (77%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQF-LISQEKKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           M G SVL Q + F L +   +SP++   LKE EC S+LK CKNM+EF+Q H QI+K GFF
Sbjct: 1   MPGTSVLQQTKFFSLPADPPQSPELSLRLKEQECFSILKRCKNMEEFRQAHAQIVKWGFF 60

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
           W+SFCASNLV+ CALS+ GSMDYACSIF+QID+P +FEFNTMIR ++KD   +EA+  Y 
Sbjct: 61  WNSFCASNLVAACALSDGGSMDYACSIFQQIDEPGTFEFNTMIRAHVKDMTFEEALVFYY 120

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +MLE GVEPD+FTYPAL KACA L A  EGKQIHG   K+G   D++VQNSLINMYGKCG
Sbjct: 121 EMLEKGVEPDNFTYPALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMYGKCG 180

Query: 629 LLRHSCAVFEQMD 667
            + HSCA+FEQMD
Sbjct: 181 EIEHSCAIFEQMD 193



 Score = 75.5 bits (184), Expect = 4e-12
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   +  +E KQ+H    KLG     +  ++L++     + G ++++C+IFEQ+D
Sbjct: 136 ALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINM--YGKCGEIEHSCAIFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             S   ++ +I  +       E +  + +M   G   P++ T   +L AC  L A++ GK
Sbjct: 194 QKSVASWSAIIAAHASFGKWYECLMMFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SL++MY KCG L    ++F +M
Sbjct: 254 CTHGSLLRNISELNVIVQTSLMDMYVKCGCLEKGLSLFRKM 294


>gb|AIF73144.1| tetratricopeptide repeat-like superfamily protein [Camellia
           sinensis var. sinensis]
          Length = 605

 Score =  260 bits (664), Expect = 4e-80
 Identities = 122/182 (67%), Positives = 149/182 (81%), Gaps = 2/182 (1%)
 Frame = +2

Query: 128 FLISQEKK--SPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFFWSSFCASNLVS 301
           FLI QE +  SP+ +F L+E EC+SL+K CKN++EFKQ H QILK G FWSSFCA+NLV+
Sbjct: 12  FLIPQEDRPQSPESNFRLREQECVSLIKQCKNLEEFKQAHAQILKFGMFWSSFCANNLVA 71

Query: 302 TCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGVEPDD 481
           TCALS+WGSMDYA SIF+QI++P SF FN MIRG++KD N +EA+  Y +MLE+GVEPD+
Sbjct: 72  TCALSDWGSMDYASSIFQQINEPGSFAFNHMIRGHVKDMNLEEALLMYDEMLELGVEPDN 131

Query: 482 FTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQ 661
           FTYP LLKACA L A+ EG QIHG   K+GF +DVFVQNSLINMYGKCG +  SCAVFE+
Sbjct: 132 FTYPTLLKACANLPALEEGMQIHGHSFKLGFEDDVFVQNSLINMYGKCGEIGLSCAVFEK 191

Query: 662 MD 667
           M+
Sbjct: 192 ME 193


>ref|XP_004245945.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           [Solanum lycopersicum]
          Length = 605

 Score =  259 bits (663), Expect = 6e-80
 Identities = 128/194 (65%), Positives = 154/194 (79%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQE--KKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MV  SVL Q   FLI +E   K+ +++F+LKE E IS++K C NM+E KQVH QILKLGF
Sbjct: 1   MVRTSVLYQTP-FLIPKEYHAKAQELNFSLKEQEWISMIKKCNNMRELKQVHGQILKLGF 59

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
             SSFCA NL+STCALSEWGSMDYAC IF++IDDP SFE+NT+IRGY+KD N +EA+  Y
Sbjct: 60  ICSSFCAGNLLSTCALSEWGSMDYACLIFDEIDDPGSFEYNTVIRGYVKDMNLEEALLWY 119

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           + M+E  VEPD+F+YP LLK CA + A+ EGKQIHGQI+K G  +DVFVQNSLINMYGKC
Sbjct: 120 VHMIEDEVEPDNFSYPTLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINMYGKC 179

Query: 626 GLLRHSCAVFEQMD 667
           G +R SC VFEQMD
Sbjct: 180 GGVRQSCIVFEQMD 193



 Score = 73.6 bits (179), Expect = 2e-11
 Identities = 48/161 (29%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C   + +KE KQ+H QILK G     F  ++L++     + G +  +C +FEQ+D
Sbjct: 136 TLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINM--YGKCGGVRQSCIVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I          E +  + +M   G    ++ T  +++ AC  L+A++ GK
Sbjct: 194 QRTIASWSALIAANANLGLWSECLRVFAEMNSEGCWRAEESTLVSVISACTHLNALDFGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V V+ SLI+MY KCG L     +F++M
Sbjct: 254 ATHGYLLRNMTGLNVIVETSLIDMYVKCGCLEKGLFLFQRM 294


>ref|XP_021276562.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
           [Herrania umbratica]
          Length = 605

 Score =  259 bits (662), Expect = 8e-80
 Identities = 121/193 (62%), Positives = 150/193 (77%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEK-KSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           M G SVL Q + F +  +  +SP++   LKE EC+S+LK CKNM+EF+Q H QI+K GFF
Sbjct: 1   MPGTSVLQQTKFFSLPVDPPQSPELSLRLKEQECLSILKRCKNMEEFRQAHAQIVKWGFF 60

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
           W+SFCASNL++ CALS+ GSMDYACSIF+QID+P +FEFNTMIR ++KD   +EA+  Y 
Sbjct: 61  WNSFCASNLLAACALSDGGSMDYACSIFQQIDEPGTFEFNTMIRAHVKDMTFEEALVFYY 120

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +MLE GVEPD+FTYPAL KACA L A  EGKQIHG   K+G   D++VQNSLINMY KCG
Sbjct: 121 EMLEKGVEPDNFTYPALFKACAWLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMYSKCG 180

Query: 629 LLRHSCAVFEQMD 667
            + HSCA+FEQMD
Sbjct: 181 EIEHSCAIFEQMD 193



 Score = 80.5 bits (197), Expect = 8e-14
 Identities = 49/161 (30%), Positives = 87/161 (54%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   +  +E KQ+H    KLG     +  ++L++    S+ G ++++C+IFEQ+D
Sbjct: 136 ALFKACAWLQAQEEGKQIHGHAFKLGLESDLYVQNSLINM--YSKCGEIEHSCAIFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I  +       E +  + +M   G   P++ T   +L AC  L A++ GK
Sbjct: 194 EKSVASWSAIIAAHASFGKWDECLMMFGNMSSEGCWRPEESTLVTVLSACTYLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L+   ++F +M
Sbjct: 254 CAHGSLLRNISELNVIVQTSLIDMYVKCGCLQKGLSLFRKM 294


>gb|EOY25237.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao]
          Length = 703

 Score =  259 bits (663), Expect = 5e-79
 Identities = 123/197 (62%), Positives = 151/197 (76%), Gaps = 1/197 (0%)
 Frame = +2

Query: 80  INEIMVGASVLNQPQQF-LISQEKKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILK 256
           ++  M G SVL Q + F L +   +S ++   LKE EC S+LK CKNM+EF+Q H QI+K
Sbjct: 95  VDNRMPGTSVLQQTKFFSLPADPPQSLELSLRLKEQECFSILKRCKNMEEFRQAHAQIVK 154

Query: 257 LGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAI 436
            GFFW+SFCASNLV+ CALS+ GSMDYACSIF+QID+P +FEFNTMIR ++KD   +EA+
Sbjct: 155 WGFFWNSFCASNLVAACALSDGGSMDYACSIFQQIDEPGTFEFNTMIRAHVKDMTFEEAL 214

Query: 437 FTYIDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMY 616
             Y +MLE GVEPD+FTYPAL KACA L A  EGKQIHG   K+G   D++VQNSLINMY
Sbjct: 215 VFYYEMLEKGVEPDNFTYPALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINMY 274

Query: 617 GKCGLLRHSCAVFEQMD 667
           GKCG + HSCA+FEQMD
Sbjct: 275 GKCGEIEHSCAIFEQMD 291



 Score = 75.5 bits (184), Expect = 4e-12
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   +  +E KQ+H    KLG     +  ++L++     + G ++++C+IFEQ+D
Sbjct: 234 ALFKACACLQAQEEGKQIHGHAFKLGLESDLYVQNSLINM--YGKCGEIEHSCAIFEQMD 291

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             S   ++ +I  +       E +  + +M   G   P++ T   +L AC  L A++ GK
Sbjct: 292 QKSVASWSAIIAAHASFGKWYECLMMFGNMSSEGCWRPEESTLVTVLSACTHLGALDLGK 351

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SL++MY KCG L    ++F +M
Sbjct: 352 CTHGSLLRNISELNVIVQTSLMDMYVKCGCLEKGLSLFRKM 392


>ref|XP_006352928.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           [Solanum tuberosum]
          Length = 605

 Score =  256 bits (653), Expect = 2e-78
 Identities = 126/194 (64%), Positives = 153/194 (78%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQE--KKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MV  SVL Q   FLI +E   K+ + +F+LKE E IS++K C +M+E KQVH QILKLGF
Sbjct: 1   MVRTSVLYQTP-FLIPKEYHAKAQEFNFSLKEQEWISMIKKCNSMRELKQVHGQILKLGF 59

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
             SSFC+ NL+STCALSEWGSMDYAC IF++IDDP SFE+NT+IRGY+KD N +EA+  Y
Sbjct: 60  ICSSFCSGNLLSTCALSEWGSMDYACLIFDEIDDPRSFEYNTVIRGYVKDMNLEEALLWY 119

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           + M+E  VEPD+F+YP LLK CA + A+ EGKQIHGQI+K G  +DVFVQNSLINMYGKC
Sbjct: 120 VHMIEDEVEPDNFSYPTLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINMYGKC 179

Query: 626 GLLRHSCAVFEQMD 667
           G +R SC VFEQMD
Sbjct: 180 GEVRQSCIVFEQMD 193



 Score = 72.0 bits (175), Expect = 6e-11
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C   + +KE KQ+H QILK G     F  ++L++     + G +  +C +FEQ+D
Sbjct: 136 TLLKVCARIRALKEGKQIHGQILKFGHEDDVFVQNSLINM--YGKCGEVRQSCIVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I          E +  + +M   G    ++ T  +++ AC  L A++ GK
Sbjct: 194 QRTIASWSALIAANANLGLWSECLKVFGEMNSEGCWRAEESTLVSVISACTHLDALDFGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V V+ SLI+MY KCG L     +F++M
Sbjct: 254 ATHGYLLRNMTGLNVIVETSLIDMYVKCGCLEKGLFLFQRM 294


>ref|XP_023913662.1| pentatricopeptide repeat-containing protein At1g31920 [Quercus
           suber]
          Length = 604

 Score =  255 bits (652), Expect = 2e-78
 Identities = 119/192 (61%), Positives = 149/192 (77%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEKKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFFW 271
           M+   VLN+    L +++ K  + D  LKE EC+SLLK CK+M+E KQVH QILK G FW
Sbjct: 1   MIRTPVLNKTHLLLSTKDPKLTEFDLRLKEQECLSLLKRCKSMEELKQVHVQILKFGLFW 60

Query: 272 SSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYID 451
            SFCASNLV++CALS+WGSMDYACSIF Q+ +P +F FNTMIRG++KD + +EA+  Y +
Sbjct: 61  DSFCASNLVASCALSDWGSMDYACSIFRQLKEPGTFVFNTMIRGHVKDASFEEALLVYYE 120

Query: 452 MLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCGL 631
           M+E G+E D+FTYPALLKACA   A+ EG QIHG I K+G  +DVFVQNSLI+MYGK G 
Sbjct: 121 MIERGIEADNFTYPALLKACARSLALEEGMQIHGHIFKLGLEDDVFVQNSLISMYGKFGE 180

Query: 632 LRHSCAVFEQMD 667
           ++HSCAVFEQMD
Sbjct: 181 VKHSCAVFEQMD 192



 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 47/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTCKN---MKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C     ++E  Q+H  I KLG     F  ++L+S     ++G + ++C++FEQ+D
Sbjct: 135 ALLKACARSLALEEGMQIHGHIFKLGLEDDVFVQNSLISM--YGKFGEVKHSCAVFEQMD 192

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I  +       E +  + DM    +   ++ T  ++L AC  L A++ G+
Sbjct: 193 QKTVASWSAIIGAHASLGLWCECLMLFGDMSSEELWRAEESTLVSVLSACTHLGALDLGR 252

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L     +F+ M
Sbjct: 253 CTHGSLLRNISGLNVIVQTSLIDMYVKCGCLEKGWRLFQNM 293


>ref|XP_015084956.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31920
           [Solanum pennellii]
          Length = 605

 Score =  255 bits (652), Expect = 2e-78
 Identities = 126/194 (64%), Positives = 153/194 (78%), Gaps = 2/194 (1%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQE--KKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGF 265
           MV  SVL Q   FLI +E   K+ + +F+LKE E IS++K C +M+E KQVH QILKLGF
Sbjct: 1   MVRTSVLYQTP-FLIPKEYHAKAQEFNFSLKEQEWISMIKKCNSMRELKQVHGQILKLGF 59

Query: 266 FWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTY 445
             SSFCA NL+STCALSEWGSMDYAC IF++IDDP SFE+NT+IRGY+KD N +EA+  Y
Sbjct: 60  ICSSFCAGNLLSTCALSEWGSMDYACLIFDEIDDPGSFEYNTVIRGYVKDMNLEEALLWY 119

Query: 446 IDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKC 625
           + M+E  VEPD+F+YP LLK CA + A+ EGKQIHGQI+K G  ++VFVQNSLINMYGKC
Sbjct: 120 VRMIEDEVEPDNFSYPTLLKVCARIRALKEGKQIHGQILKFGHEDEVFVQNSLINMYGKC 179

Query: 626 GLLRHSCAVFEQMD 667
           G +R SC VFEQMD
Sbjct: 180 GEVRQSCIVFEQMD 193



 Score = 73.9 bits (180), Expect = 1e-11
 Identities = 48/161 (29%), Positives = 86/161 (53%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +LLK C   + +KE KQ+H QILK G     F  ++L++     + G +  +C +FEQ+D
Sbjct: 136 TLLKVCARIRALKEGKQIHGQILKFGHEDEVFVQNSLINM--YGKCGEVRQSCIVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
             +   ++ +I          E +  + +M   G    ++ T  +++ AC  L+A++ GK
Sbjct: 194 QRTIASWSALIAANANLGLWSECLRVFAEMNSEGCWRAEESTLVSVISACTHLNALDFGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V V++SLI+MY KCG L     +F++M
Sbjct: 254 ATHGYLLRNMTGLNVIVESSLIDMYVKCGCLEKGLFLFQRM 294


>ref|XP_021593507.1| pentatricopeptide repeat-containing protein At1g31920 isoform X1
           [Manihot esculenta]
          Length = 601

 Score =  255 bits (651), Expect = 3e-78
 Identities = 119/191 (62%), Positives = 155/191 (81%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEKKSPDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFFW 271
           M+G SVL+Q   FL+  E  SP ++  LKE EC++LL+ C N++EF+Q H QIL+ GFF 
Sbjct: 1   MIGTSVLHQTH-FLLPSE--SPQVNLRLKEQECLTLLRRCMNIEEFRQAHAQILRWGFFC 57

Query: 272 SSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYID 451
           SSFCASNL++TCAL +WGSMDYA SIF+QI++P +FEFNTMI+GY KDFN ++++F Y +
Sbjct: 58  SSFCASNLLATCALPDWGSMDYASSIFQQIEEPGTFEFNTMIKGYAKDFNMEKSLFVYCE 117

Query: 452 MLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCGL 631
           MLE GV+ D+FT+P+LLKAC  L+A+ EG QIHG IIK+GF  D++VQNSLINMYGKCG 
Sbjct: 118 MLEKGVQSDNFTFPSLLKACTWLNAIKEGMQIHGNIIKLGFESDLYVQNSLINMYGKCGE 177

Query: 632 LRHSCAVFEQM 664
           ++ SCAVFEQM
Sbjct: 178 IKLSCAVFEQM 188



 Score = 65.5 bits (158), Expect = 1e-08
 Identities = 40/160 (25%), Positives = 84/160 (52%), Gaps = 4/160 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           SLLK C     +KE  Q+H  I+KLGF    +  ++L++     + G +  +C++FEQ+ 
Sbjct: 132 SLLKACTWLNAIKEGMQIHGNIIKLGFESDLYVQNSLINM--YGKCGEIKLSCAVFEQMG 189

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLE-IGVEPDDFTYPALLKACALLSAVNEGK 541
              +  ++ ++  +       E++  + +M       P++  Y ++L AC+ L A++ G+
Sbjct: 190 QKDAASWSAIMAAHTSSGMWSESLQLFEEMGHGRSCRPEESLYVSMLSACSHLGALDFGR 249

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQ 661
            +HG +++     ++ V+ SL +MY  CG +  +  +F +
Sbjct: 250 FLHGVLLRNFSELNLTVKTSLTDMYINCGCVEKALCLFRR 289


>ref|XP_012475633.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At1g31920 [Gossypium raimondii]
          Length = 626

 Score =  255 bits (651), Expect = 6e-78
 Identities = 118/197 (59%), Positives = 150/197 (76%), Gaps = 1/197 (0%)
 Frame = +2

Query: 80  INEIMVGASVLNQPQQFLISQEKKS-PDIDFNLKEHECISLLKTCKNMKEFKQVHCQILK 256
           +++ M G SVL Q   F    +     +++  LKE +C+SLLK CKN+++FKQ H QI+K
Sbjct: 18  VDKTMAGTSVLQQTNFFSPPADPPQFSELNLRLKEQQCLSLLKRCKNLEDFKQAHAQIIK 77

Query: 257 LGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAI 436
            GFFW+SF ASNLV+ CALS+WGS+DYACSIF+Q  +P +FEFNTMIR ++KD N ++A+
Sbjct: 78  WGFFWNSFSASNLVAACALSDWGSLDYACSIFQQFHEPGTFEFNTMIRAHVKDMNFQDAL 137

Query: 437 FTYIDMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMY 616
             Y +MLE GVEPD+FTYPAL KACA L A  EG QIHG + K GF  D++VQNSLINMY
Sbjct: 138 VFYYEMLERGVEPDNFTYPALFKACAWLKAREEGMQIHGHVFKFGFESDLYVQNSLINMY 197

Query: 617 GKCGLLRHSCAVFEQMD 667
           GKCG ++HSCAVFEQMD
Sbjct: 198 GKCGEIQHSCAVFEQMD 214



 Score = 76.3 bits (186), Expect = 2e-12
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   K  +E  Q+H  + K GF    +  ++L++     + G + ++C++FEQ+D
Sbjct: 157 ALFKACAWLKAREEGMQIHGHVFKFGFESDLYVQNSLINM--YGKCGEIQHSCAVFEQMD 214

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I          E +  + +M   G   P++ T   LL AC  L A++ GK
Sbjct: 215 EKSVASWSAIIAANASLGMWYECLMVFGNMSSEGCWRPEESTLVTLLSACTHLGALDLGK 274

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L    ++F++M
Sbjct: 275 CTHGALLRNISELNVIVQTSLIDMYVKCGYLEKGLSLFKKM 315


>gb|PPD83735.1| hypothetical protein GOBAR_DD19314 [Gossypium barbadense]
          Length = 605

 Score =  253 bits (647), Expect = 1e-77
 Identities = 118/193 (61%), Positives = 147/193 (76%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEKKS-PDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           M G SVL Q   F    +     +++  LKE +C+SLLK CKN+++FKQ H QI+K GFF
Sbjct: 1   MAGTSVLQQTNFFSPPADPPQFSELNLRLKEQQCLSLLKRCKNLEDFKQAHAQIVKWGFF 60

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
           W+SF ASNLV+ CALS+WGS+DYACSIF+Q  +P +FEFNTMIR ++KD N ++A+  Y 
Sbjct: 61  WNSFSASNLVAACALSDWGSLDYACSIFQQFHEPGTFEFNTMIRAHVKDMNFQDALVFYY 120

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +MLE GVEPD+FTYPAL KACA L A  EG QIHG + K GF  D++VQNSLINMYGKCG
Sbjct: 121 EMLERGVEPDNFTYPALFKACAWLKAKEEGMQIHGHVFKFGFESDLYVQNSLINMYGKCG 180

Query: 629 LLRHSCAVFEQMD 667
            ++HSCAVFEQMD
Sbjct: 181 EIQHSCAVFEQMD 193



 Score = 76.3 bits (186), Expect = 2e-12
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   K  +E  Q+H  + K GF    +  ++L++     + G + ++C++FEQ+D
Sbjct: 136 ALFKACAWLKAKEEGMQIHGHVFKFGFESDLYVQNSLINM--YGKCGEIQHSCAVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I          E +  + +M   G   P++ T   LL AC  L A++ GK
Sbjct: 194 EKSVASWSAIIAANASLGMWYECLMVFGNMSSEGCWRPEESTLVTLLSACTHLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L    ++F++M
Sbjct: 254 CTHGALLRNISELNVIVQTSLIDMYVKCGYLEKGLSLFKKM 294


>ref|XP_016717819.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Gossypium hirsutum]
          Length = 605

 Score =  253 bits (647), Expect = 1e-77
 Identities = 118/193 (61%), Positives = 147/193 (76%), Gaps = 1/193 (0%)
 Frame = +2

Query: 92  MVGASVLNQPQQFLISQEKKS-PDIDFNLKEHECISLLKTCKNMKEFKQVHCQILKLGFF 268
           M G SVL Q   F    +     +++  LKE +C+SLLK CKN+++FKQ H QI+K GFF
Sbjct: 1   MAGTSVLQQTNFFSPPTDPPQFSELNLRLKEQQCLSLLKRCKNLEDFKQAHAQIVKWGFF 60

Query: 269 WSSFCASNLVSTCALSEWGSMDYACSIFEQIDDPSSFEFNTMIRGYIKDFNSKEAIFTYI 448
           W+SF ASNLV+ CALS+WGS+DYACSIF+Q  +P +FEFNTMIR ++KD N ++A+  Y 
Sbjct: 61  WNSFSASNLVAACALSDWGSLDYACSIFQQFHEPGTFEFNTMIRAHVKDMNFQDALVFYY 120

Query: 449 DMLEIGVEPDDFTYPALLKACALLSAVNEGKQIHGQIIKMGFVEDVFVQNSLINMYGKCG 628
           +MLE GVEPD+FTYPAL KACA L A  EG QIHG + K GF  D++VQNSLINMYGKCG
Sbjct: 121 EMLERGVEPDNFTYPALFKACAWLKAKEEGMQIHGHVFKFGFESDLYVQNSLINMYGKCG 180

Query: 629 LLRHSCAVFEQMD 667
            ++HSCAVFEQMD
Sbjct: 181 EIQHSCAVFEQMD 193



 Score = 76.3 bits (186), Expect = 2e-12
 Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 194 SLLKTC---KNMKEFKQVHCQILKLGFFWSSFCASNLVSTCALSEWGSMDYACSIFEQID 364
           +L K C   K  +E  Q+H  + K GF    +  ++L++     + G + ++C++FEQ+D
Sbjct: 136 ALFKACAWLKAKEEGMQIHGHVFKFGFESDLYVQNSLINM--YGKCGEIQHSCAVFEQMD 193

Query: 365 DPSSFEFNTMIRGYIKDFNSKEAIFTYIDMLEIGV-EPDDFTYPALLKACALLSAVNEGK 541
           + S   ++ +I          E +  + +M   G   P++ T   LL AC  L A++ GK
Sbjct: 194 EKSVASWSAIIAANASLGMWYECLMVFGNMSSEGCWRPEESTLVTLLSACTHLGALDLGK 253

Query: 542 QIHGQIIKMGFVEDVFVQNSLINMYGKCGLLRHSCAVFEQM 664
             HG +++     +V VQ SLI+MY KCG L    ++F++M
Sbjct: 254 CTHGALLRNISELNVIVQTSLIDMYVKCGYLEKGLSLFKKM 294

