BLASTX nr result

ID: Cocculus23_contig00019933 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00019933
         (892 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citr...   229   8e-58
ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containi...   228   2e-57
emb|CBI29222.3| unnamed protein product [Vitis vinifera]              228   2e-57
ref|XP_002303480.2| pentatricopeptide repeat-containing family p...   225   2e-56
ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prun...   224   5e-56
ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containi...   202   2e-49
ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containi...   201   2e-49
ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containi...   195   2e-47
ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containi...   184   2e-44
ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containi...   184   4e-44
ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutr...   183   7e-44
ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily pr...   182   1e-43
ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Caps...   177   6e-42
sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-c...   174   4e-41
emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|72687...   174   4e-41
ref|NP_567587.1| pentatricopeptide repeat-containing protein [Ar...   174   4e-41
ref|XP_002867936.1| hypothetical protein ARALYDRAFT_492917 [Arab...   174   5e-41
gb|AHB18408.1| pentatricopeptide repeat-containing protein [Goss...   172   2e-40
gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus...   171   3e-40
ref|XP_003628993.1| Pentatricopeptide repeat-containing protein ...   171   5e-40

>ref|XP_006432869.1| hypothetical protein CICLE_v10000274mg [Citrus clementina]
           gi|568835123|ref|XP_006471629.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Citrus sinensis]
           gi|568835125|ref|XP_006471630.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Citrus sinensis]
           gi|557534991|gb|ESR46109.1| hypothetical protein
           CICLE_v10000274mg [Citrus clementina]
          Length = 833

 Score =  229 bits (585), Expect = 8e-58
 Identities = 124/211 (58%), Positives = 152/211 (72%), Gaps = 2/211 (0%)
 Frame = +3

Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443
           K V ++LS  SLD SKC+  L +LSPQ FD +F +  S+VNP+TAL FFYFA     FRF
Sbjct: 56  KWVSSVLSKQSLDPSKCKLFLPNLSPQEFDTLFFSIRSNVNPKTALKFFYFASQSCNFRF 115

Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623
           TV SYC LIRLLL +NL++PARLLLIRLIDGK+  L+ +  +  RHI+IA  + DL+   
Sbjct: 116 TVRSYCLLIRLLLFSNLLSPARLLLIRLIDGKMPVLYASNPS-IRHIEIASQMVDLNVTS 174

Query: 624 GDLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797
              +GV   DL+VHVYCTQFK LG G   DVF  F+ + +FPSLKTCNFLL+SLVKANE+
Sbjct: 175 EPALGVQIADLLVHVYCTQFKNLGFGYAIDVFSIFSNKGIFPSLKTCNFLLNSLVKANEV 234

Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           ++  EVF  +CR GVSPDV  FST IN FCK
Sbjct: 235 QKGIEVFETMCR-GVSPDVFLFSTAINAFCK 264


>ref|XP_002270963.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic [Vitis vinifera]
          Length = 1022

 Score =  228 bits (581), Expect = 2e-57
 Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + K V +ILS+PSLD ++C+ L+  LSP +FD +F +   +VNP+TALNFFYFA D 
Sbjct: 113 DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 172

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
            GFRFT+ SYC L+R L+ +  V+PARLLLIRLID KL  LF + KN  RHI+IA  +AD
Sbjct: 173 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN--RHIEIASAMAD 230

Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           L+  G  G  V  +DL++HVYCTQF+ +G      VFRF A + +FP++KTC FLLSSLV
Sbjct: 231 LNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAIGVFRFLANKGVFPTVKTCTFLLSSLV 290

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           KANELE+S  VF  + R GVSPDV  FST IN FCK
Sbjct: 291 KANELEKSYWVFETM-RQGVSPDVYLFSTAINAFCK 325


>emb|CBI29222.3| unnamed protein product [Vitis vinifera]
          Length = 826

 Score =  228 bits (581), Expect = 2e-57
 Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + K V +ILS+PSLD ++C+ L+  LSP +FD +F +   +VNP+TALNFFYFA D 
Sbjct: 46  DHALLKSVTSILSNPSLDSTQCKQLIPHLSPHQFDSVFFSVRRNVNPKTALNFFYFASDS 105

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
            GFRFT+ SYC L+R L+ +  V+PARLLLIRLID KL  LF + KN  RHI+IA  +AD
Sbjct: 106 CGFRFTLRSYCVLMRSLIVSGFVSPARLLLIRLIDRKLPVLFGDPKN--RHIEIASAMAD 163

Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           L+  G  G  V  +DL++HVYCTQF+ +G      VFRF A + +FP++KTC FLLSSLV
Sbjct: 164 LNEVGESGVAVAAVDLLIHVYCTQFRNVGFRNAIGVFRFLANKGVFPTVKTCTFLLSSLV 223

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           KANELE+S  VF  + R GVSPDV  FST IN FCK
Sbjct: 224 KANELEKSYWVFETM-RQGVSPDVYLFSTAINAFCK 258


>ref|XP_002303480.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550342907|gb|EEE78459.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 842

 Score =  225 bits (574), Expect = 2e-56
 Identities = 122/211 (57%), Positives = 150/211 (71%), Gaps = 2/211 (0%)
 Frame = +3

Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443
           K+V  ILS+PSLD +KC+ L+  LSPQ FD  F    S+VNP+TALNFF+F  +   FRF
Sbjct: 65  KRVSLILSNPSLDCAKCKELVPHLSPQEFDSCFLALKSNVNPKTALNFFHFVSETCKFRF 124

Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623
           T  SYC LI LL+GN+L++PARLLLIRLIDGK+ A +     E RH +IA+ +AD +   
Sbjct: 125 TARSYCVLIHLLVGNDLLSPARLLLIRLIDGKVPAFYARNF-ESRHFEIAQIMADFNLVF 183

Query: 624 GDLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797
             ++GV   DL+VHVY TQFK LG G   DVF   A + LFPSLKTC FLLSSLVKANEL
Sbjct: 184 EPVIGVKIADLLVHVYSTQFKHLGFGFAADVFSLLAKKGLFPSLKTCTFLLSSLVKANEL 243

Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           ++S EV+  IC GG+ PDV  FST+IN FCK
Sbjct: 244 KKSYEVYDFICLGGIIPDVHLFSTMINAFCK 274


>ref|XP_007217153.1| hypothetical protein PRUPE_ppa001463mg [Prunus persica]
           gi|462413303|gb|EMJ18352.1| hypothetical protein
           PRUPE_ppa001463mg [Prunus persica]
          Length = 821

 Score =  224 bits (570), Expect = 5e-56
 Identities = 118/209 (56%), Positives = 153/209 (73%), Gaps = 2/209 (0%)
 Frame = +3

Query: 270 VIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTV 449
           V +ILS PSLD SKC+ L+  LS   FDR+F +  S+VNP+TAL+FFYFA +   F+FTV
Sbjct: 57  VSSILSKPSLDSSKCKALIPLLSSHEFDRVFCSISSNVNPKTALHFFYFASESFKFQFTV 116

Query: 450 GSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLS--GRE 623
            S+C L+RLL+ +NLV+PARLLLIRLIDG +  L+ N  +  RH++IA  + DL+    +
Sbjct: 117 RSFCVLVRLLILSNLVSPARLLLIRLIDGNVPVLYAN--HNQRHMEIAIAMLDLNTVSTQ 174

Query: 624 GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELER 803
           G  V  +DL++HVYCTQFK +G G   D F  F+ + +FPSLKTCNFLLSSLVKANEL +
Sbjct: 175 GLGVQALDLLIHVYCTQFKNMGFGYAIDAFVIFSKKGVFPSLKTCNFLLSSLVKANELHK 234

Query: 804 SCEVFSLICRGGVSPDVISFSTLINVFCK 890
           S +VF ++CR GVSPDV  F+T IN FCK
Sbjct: 235 SYDVFEVMCR-GVSPDVYLFTTAINAFCK 262


>ref|XP_006351033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum tuberosum]
          Length = 928

 Score =  202 bits (513), Expect = 2e-49
 Identities = 110/211 (52%), Positives = 144/211 (68%), Gaps = 2/211 (0%)
 Frame = +3

Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443
           K V+++LS+P +D  K ++LL  L+PQ+FD IF    SS+ P   L FF+ A    GF F
Sbjct: 153 KWVVSVLSNPPVDSLKIKDLLTLLTPQQFDAIFLEIYSSLKPLNVLKFFHVASGTCGFSF 212

Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623
           +V SYC L+RLL+ +N   PARLLLIRLIDGKL ALF+   ++ +H+++A  +A+LSG  
Sbjct: 213 SVRSYCTLLRLLVASNHDVPARLLLIRLIDGKLPALFDT--SQQKHVEVAVSLAELSGVS 270

Query: 624 --GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797
             G  V   DL++H+ CTQFK +G     DVFR  A R ++PSLKTCNFLLSSLVK NEL
Sbjct: 271 DFGVAVRTFDLLLHLCCTQFKNVGFDAALDVFRSLASRGVYPSLKTCNFLLSSLVKENEL 330

Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
            +S EVF ++ + GV PDV  FST IN FCK
Sbjct: 331 WKSYEVFGIL-KDGVEPDVYLFSTAINAFCK 360


>ref|XP_004249905.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Solanum lycopersicum]
          Length = 839

 Score =  201 bits (512), Expect = 2e-49
 Identities = 111/211 (52%), Positives = 143/211 (67%), Gaps = 2/211 (0%)
 Frame = +3

Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443
           K V+++LS P +D  K ++LL  L+PQ+FD IF    SS+ P   L FF+ A     F F
Sbjct: 64  KWVVSVLSDPPVDSLKIKDLLTLLNPQQFDAIFLEIHSSLKPLNVLKFFHVASGTCSFSF 123

Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGRE 623
           TV SYC L+RLL+ +N  APARLLLIRLIDGKL ALF++   + +H+++A  +A+LSG  
Sbjct: 124 TVRSYCTLVRLLIASNHDAPARLLLIRLIDGKLPALFDS--LQQKHVEVAVSLAELSGVS 181

Query: 624 --GDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANEL 797
             G  V   DL++H+ CTQFK +G     DVFR  A R ++PSLKTCNFLLSSLVK NEL
Sbjct: 182 DFGVAVRTFDLLLHLCCTQFKSVGFDAALDVFRSLASRGVYPSLKTCNFLLSSLVKENEL 241

Query: 798 ERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
            +S EVF ++ + GV PDV  FST IN FCK
Sbjct: 242 WKSYEVFEIL-KDGVKPDVYLFSTAINAFCK 271


>ref|XP_004149000.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Cucumis sativus]
          Length = 822

 Score =  195 bits (495), Expect = 2e-47
 Identities = 108/209 (51%), Positives = 133/209 (63%), Gaps = 2/209 (0%)
 Frame = +3

Query: 270 VIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTV 449
           V ++LS  SLD SKC  LL  LSP +FD++F +     NP T LNFFYFA +   FRFT+
Sbjct: 50  VSSVLSHSSLDSSKCSALLPHLSPSQFDQLFFSIGLKANPMTCLNFFYFASNSFKFRFTI 109

Query: 450 GSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREG- 626
            SYC LI LL+ +  + PARLLLIRLIDG L  L  N  +E  HI+IA  +  L+   G 
Sbjct: 110 HSYCTLILLLIRSKFIPPARLLLIRLIDGNLPVL--NLDSEKFHIEIANALFGLTSVVGR 167

Query: 627 -DLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELER 803
            +     DL++HVY TQF+ LG     DVF   A +  FPSLKTCNFLLSSLVKANE E+
Sbjct: 168 FEWTQAFDLLIHVYSTQFRNLGFSCAVDVFYLLARKGTFPSLKTCNFLLSSLVKANEFEK 227

Query: 804 SCEVFSLICRGGVSPDVISFSTLINVFCK 890
            CEVF ++   G  PDV SF+ +IN  CK
Sbjct: 228 CCEVFRVMSE-GACPDVFSFTNVINALCK 255


>ref|XP_003548529.2| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like [Glycine max]
          Length = 840

 Score =  184 bits (467), Expect(2) = 2e-44
 Identities = 102/208 (49%), Positives = 136/208 (65%), Gaps = 3/208 (1%)
 Frame = +3

Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455
           +IL+S +LD SKC+++L  L+P  FDR+F +   +VNP+T   FF FA     FRFTV S
Sbjct: 65  SILTSKTLDSSKCKSILPHLTPHHFDRLFLSLHRTVNPKTTHEFFRFATRHCNFRFTVRS 124

Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNE--DRHIDIARGIADLS-GREG 626
           YC L+R LL ++ V  AR LL RLIDG +            DR  +IA  + +L+ G + 
Sbjct: 125 YCLLLRSLLADSFVPRARFLLARLIDGHVPTWSSKTTTSFHDRLREIASSMLELNQGSDE 184

Query: 627 DLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERS 806
             +G +DL++H+ C+QFK LG    FD+F  F+ R +FP LKTCN LLSSLVKANEL +S
Sbjct: 185 QRLGELDLLLHILCSQFKCLGSRCAFDIFVMFSKRGVFPCLKTCNLLLSSLVKANELHKS 244

Query: 807 CEVFSLICRGGVSPDVISFSTLINVFCK 890
            EVF L C+ GV+PDV +F+T IN FCK
Sbjct: 245 YEVFDLACQ-GVAPDVFTFTTAINAFCK 271



 Score = 22.7 bits (47), Expect(2) = 2e-44
 Identities = 12/21 (57%), Positives = 13/21 (61%)
 Frame = +1

Query: 121 VPPESLYSPNPIASLSSQPSI 183
           +PP S   P P  SLSS PSI
Sbjct: 46  LPPPSPPPPPPHPSLSSIPSI 66


>ref|XP_004509525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X1 [Cicer arietinum]
           gi|502153968|ref|XP_004509526.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X2 [Cicer arietinum]
           gi|502153970|ref|XP_004509527.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X3 [Cicer arietinum]
           gi|502153972|ref|XP_004509528.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X4 [Cicer arietinum]
           gi|502153974|ref|XP_004509529.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X5 [Cicer arietinum]
           gi|502153976|ref|XP_004509530.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X6 [Cicer arietinum]
           gi|502153978|ref|XP_004509531.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X7 [Cicer arietinum]
           gi|502153980|ref|XP_004509532.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X8 [Cicer arietinum]
           gi|502153982|ref|XP_004509533.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X9 [Cicer arietinum]
           gi|502153984|ref|XP_004509534.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19440,
           chloroplastic-like isoform X10 [Cicer arietinum]
          Length = 835

 Score =  184 bits (467), Expect = 4e-44
 Identities = 101/205 (49%), Positives = 133/205 (64%)
 Frame = +3

Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455
           +ILS   LD SKC+++L  L+P +FD +F T  S+VN +T L+FF FA +   F FTV S
Sbjct: 62  SILSHKILDSSKCKSILPHLTPHQFDTLFFTHHSTVNLKTTLDFFRFASNQFKFCFTVRS 121

Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635
           YC LIRLLL +N +  AR  + RLIDG +     N   +DR  ++A    +LS       
Sbjct: 122 YCLLIRLLLCSNHLPRARFFMKRLIDGNVSTPLLNR--DDRLSEMASSFLELSRLTERSH 179

Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815
           G +DL++H+ C+QF+ LG    FD+F  F    +FPSLKTCNFLLSSLVK+NEL +S  V
Sbjct: 180 GELDLLLHILCSQFQHLGFHWAFDIFTLFTSNGVFPSLKTCNFLLSSLVKSNELHKSYRV 239

Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890
           F ++CRGGVS DV +FST IN F K
Sbjct: 240 FDVVCRGGVSLDVYTFSTAINAFSK 264


>ref|XP_006413978.1| hypothetical protein EUTSA_v10024401mg [Eutrema salsugineum]
           gi|557115148|gb|ESQ55431.1| hypothetical protein
           EUTSA_v10024401mg [Eutrema salsugineum]
          Length = 837

 Score =  183 bits (465), Expect = 7e-44
 Identities = 100/216 (46%), Positives = 136/216 (62%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + +++ A LS  SLD  +C+ L+  LSP  FDR+F    S VNP+TAL+FF  A D 
Sbjct: 70  DRHLRERLSAALSRRSLDYEQCKQLIATLSPHEFDRLFPDFRSKVNPKTALDFFRLASDS 129

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
             F F++ SYC LI LLL  +L++PARL+LIRLI+G +  L     + D  + IA  +A 
Sbjct: 130 FSFSFSLRSYCLLIGLLLDASLLSPARLVLIRLINGNVPVLPSANDSRDGRVAIADAMAS 189

Query: 609 LS--GREGDLVGVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           LS        + + DL++ VYCTQFK  G  L  D+F   A + LFPS  TCN LL+SLV
Sbjct: 190 LSLCFDPEIRMRISDLLIEVYCTQFKRAGCYLALDIFPLLANKGLFPSRTTCNILLTSLV 249

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           +ANE ++ CE F  +C+ GVSPDV  F+T+IN +CK
Sbjct: 250 RANEFQKCCEAFEAVCK-GVSPDVYLFTTVINAYCK 284


>ref|XP_007040906.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590680604|ref|XP_007040907.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680608|ref|XP_007040908.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680612|ref|XP_007040909.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590680616|ref|XP_007040910.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590680620|ref|XP_007040911.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778151|gb|EOY25407.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778152|gb|EOY25408.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778153|gb|EOY25409.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778154|gb|EOY25410.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778155|gb|EOY25411.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778156|gb|EOY25412.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 845

 Score =  182 bits (463), Expect = 1e-43
 Identities = 107/210 (50%), Positives = 132/210 (62%), Gaps = 2/210 (0%)
 Frame = +3

Query: 267 KVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFT 446
           ++  ILS  SLD SKC+ LL  LSP  FDR FS   S +NP+T L+FFY A     FRFT
Sbjct: 70  RLSCILSKSSLDSSKCKQLLPLLSPLDFDRFFSAISSHLNPKTTLHFFYLASQSFNFRFT 129

Query: 447 VGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREG 626
           + SYC LI LLL  N  +PARLL IRLIDGKL     N    D HI I   +ADL+    
Sbjct: 130 LRSYCILILLLLLANHSSPARLLFIRLIDGKLPLSSPNNTTID-HIQITTALADLNTLSK 188

Query: 627 DLVGV--IDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELE 800
            +  V  +D+++H+YCTQFK  G     DVF   A + +FPS KTCNF LSSLVKANEL+
Sbjct: 189 GVPRVMGVDMLLHLYCTQFKNAGFTSAIDVFFTLADKGMFPSSKTCNFFLSSLVKANELQ 248

Query: 801 RSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           ++ +VF  + R  VS DV   +T+IN FCK
Sbjct: 249 KTYQVFETLSR-FVSLDVYLCTTMINAFCK 277


>ref|XP_006282558.1| hypothetical protein CARUB_v10004123mg [Capsella rubella]
           gi|482551263|gb|EOA15456.1| hypothetical protein
           CARUB_v10004123mg [Capsella rubella]
          Length = 838

 Score =  177 bits (448), Expect = 6e-42
 Identities = 101/217 (46%), Positives = 135/217 (62%), Gaps = 3/217 (1%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  +  ++ ++LS  SLD   C+ L+  LSP  FDR+F    S VNP+TALNFF  A D 
Sbjct: 70  DRHLHDRLSSVLSKRSLDYELCKQLITVLSPLEFDRLFPEFRSKVNPKTALNFFRLASDS 129

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKN-EDRHIDIARGIA 605
             F F++ SYC LI LLL  NL++PAR+ LIRLI+G +  L   +    D  + IA  +A
Sbjct: 130 FSFSFSLRSYCLLIGLLLDANLLSPARVTLIRLINGNVPVLPCGDGGLRDSRVAIADAMA 189

Query: 606 DLSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSL 779
            LS    + +   + DL++ VYCTQFK  G  L  DVF F A + +FPS  TCN LL+SL
Sbjct: 190 RLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPFLANKGMFPSKTTCNILLTSL 249

Query: 780 VKANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           V+ANE ++ CE F ++C+ GV PDV  F+T IN FCK
Sbjct: 250 VRANEFQKCCEAFEVVCK-GVFPDVYLFTTAINAFCK 285


>sp|Q940A6.2|PP325_ARATH RecName: Full=Pentatricopeptide repeat-containing protein
           At4g19440, chloroplastic; Flags: Precursor
          Length = 838

 Score =  174 bits (441), Expect = 4e-41
 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + +++ ++LS  SLD  +C+ L+  LSP  FDR+F    S VNP+TAL+FF  A D 
Sbjct: 73  DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 132

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
             F F++ SYC LI LLL  NL++ AR++LIRLI+G +  L    +  D  + IA  +A 
Sbjct: 133 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 190

Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           LS    + +   + DL++ VYCTQFK  G  L  DVF   A + +FPS  TCN LL+SLV
Sbjct: 191 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 250

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           +ANE ++ CE F ++C+ GVSPDV  F+T IN FCK
Sbjct: 251 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 285


>emb|CAA18631.1| putative protein [Arabidopsis thaliana] gi|7268739|emb|CAB78946.1|
           putative protein [Arabidopsis thaliana]
          Length = 814

 Score =  174 bits (441), Expect = 4e-41
 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + +++ ++LS  SLD  +C+ L+  LSP  FDR+F    S VNP+TAL+FF  A D 
Sbjct: 49  DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 108

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
             F F++ SYC LI LLL  NL++ AR++LIRLI+G +  L    +  D  + IA  +A 
Sbjct: 109 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 166

Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           LS    + +   + DL++ VYCTQFK  G  L  DVF   A + +FPS  TCN LL+SLV
Sbjct: 167 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 226

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           +ANE ++ CE F ++C+ GVSPDV  F+T IN FCK
Sbjct: 227 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 261


>ref|NP_567587.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|334186696|ref|NP_001190771.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|15810161|gb|AAL07224.1| unknown protein [Arabidopsis
           thaliana] gi|332658782|gb|AEE84182.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332658783|gb|AEE84183.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 825

 Score =  174 bits (441), Expect = 4e-41
 Identities = 99/216 (45%), Positives = 137/216 (63%), Gaps = 2/216 (0%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + +++ ++LS  SLD  +C+ L+  LSP  FDR+F    S VNP+TAL+FF  A D 
Sbjct: 60  DRHLHERLSSVLSKRSLDYEQCKQLITVLSPLEFDRLFPEFRSKVNPKTALDFFRLASDS 119

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIAD 608
             F F++ SYC LI LLL  NL++ AR++LIRLI+G +  L    +  D  + IA  +A 
Sbjct: 120 FSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGLR--DSRVAIADAMAS 177

Query: 609 LSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLV 782
           LS    + +   + DL++ VYCTQFK  G  L  DVF   A + +FPS  TCN LL+SLV
Sbjct: 178 LSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSLV 237

Query: 783 KANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           +ANE ++ CE F ++C+ GVSPDV  F+T IN FCK
Sbjct: 238 RANEFQKCCEAFDVVCK-GVSPDVYLFTTAINAFCK 272


>ref|XP_002867936.1| hypothetical protein ARALYDRAFT_492917 [Arabidopsis lyrata subsp.
           lyrata] gi|297313772|gb|EFH44195.1| hypothetical protein
           ARALYDRAFT_492917 [Arabidopsis lyrata subsp. lyrata]
          Length = 817

 Score =  174 bits (440), Expect = 5e-41
 Identities = 99/217 (45%), Positives = 135/217 (62%), Gaps = 3/217 (1%)
 Frame = +3

Query: 249 DEIVSKKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDP 428
           D  + +++ ++LS  SLD  +C+ L+  LSP  FDR+F      VNP+TAL+FF  A D 
Sbjct: 49  DRHLHERLSSVLSKRSLDYEQCKQLITVLSPHEFDRLFPEFRFKVNPKTALDFFRLASDS 108

Query: 429 LGFRFTVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLRAL-FENEKNEDRHIDIARGIA 605
             F F++ SYC LI LLL  NL +PAR++LIRLI+G +  L   N    D  + IA  +A
Sbjct: 109 FSFSFSLRSYCLLIGLLLDANLSSPARVVLIRLINGNVPVLPCGNGGLRDSRVAIADAMA 168

Query: 606 DLSGREGDLV--GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSL 779
            LS    + +   + DL++ VYCTQFK  G  L  DVF   A + +FPS  TCN LL+SL
Sbjct: 169 SLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYLALDVFPVLANKGMFPSKTTCNILLTSL 228

Query: 780 VKANELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           V+A E ++ CE F ++C+ GVSPDV  F+T IN FCK
Sbjct: 229 VRATEFQKCCEAFHVVCK-GVSPDVYLFTTAINAFCK 264


>gb|AHB18408.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 846

 Score =  172 bits (436), Expect = 2e-40
 Identities = 102/205 (49%), Positives = 128/205 (62%)
 Frame = +3

Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455
           +ILS PSLD SK + LL  LSP  FDR F       +P+T LNFF+ A     FRFT+ S
Sbjct: 85  SILSKPSLDSSKSKQLLPLLSPSDFDRFFIALSPRADPKTTLNFFHLASRCFNFRFTLRS 144

Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635
           Y  LI LLL +N  + ARLLLIRLIDGKL     N      HI IA  +ADL+     + 
Sbjct: 145 YYILILLLLLSNNSSAARLLLIRLIDGKLPLFSPNNPPTVNHIQIAIALADLNTSFKGVA 204

Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815
           GV DL++H+YCTQFK +G     DVF   A + +FPS KTCNF L+SL+KANE+ ++ +V
Sbjct: 205 GV-DLLLHLYCTQFKNVGFTYAIDVFFTLAYKGIFPSTKTCNFFLNSLLKANEVRKTYQV 263

Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890
           F  + R  VS DV   +T+IN FCK
Sbjct: 264 FETLSR-SVSLDVYLCTTMINGFCK 287


>gb|EYU29134.1| hypothetical protein MIMGU_mgv1a001281mg [Mimulus guttatus]
          Length = 847

 Score =  171 bits (434), Expect = 3e-40
 Identities = 102/213 (47%), Positives = 136/213 (63%), Gaps = 4/213 (1%)
 Frame = +3

Query: 264 KKVIAILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRF 443
           K + ++LS  + + ++C+ L+  +SP++FD IF    +++ P TAL  FYFA D   F F
Sbjct: 74  KSLASVLSGSNFNSNQCKELISQISPRQFDSIFWEIHNNIEPSTALKLFYFAGDYCSFSF 133

Query: 444 TVGSYCNLIRLLLGNNLVAPARLLLIRLIDGKLR-ALFENEKNEDRHIDIARGIAD-LSG 617
           T+ SYC L  LL+  NL + ARLLLIRLID KL  +L +N  N   H +IA  +AD  SG
Sbjct: 134 TLRSYCILFHLLVSKNLDSAARLLLIRLIDRKLPVSLRDNVVN--LHNEIAIVLADTFSG 191

Query: 618 REGDLVG--VIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKAN 791
            E    G    D++VHVY T+FK LGL    DVFR  AGR L PS KTCNFL+S+LVKA+
Sbjct: 192 SEKFRSGNRGFDMLVHVYATEFKSLGLDAAMDVFRLLAGRRLVPSFKTCNFLMSTLVKAD 251

Query: 792 ELERSCEVFSLICRGGVSPDVISFSTLINVFCK 890
           E E+S E+F ++ R  + PDV  +ST IN  CK
Sbjct: 252 EHEKSYEIFLIVSRESL-PDVYLYSTAINALCK 283


>ref|XP_003628993.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355523015|gb|AET03469.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 819

 Score =  171 bits (432), Expect = 5e-40
 Identities = 92/205 (44%), Positives = 129/205 (62%)
 Frame = +3

Query: 276 AILSSPSLDRSKCRNLLFDLSPQRFDRIFSTGCSSVNPRTALNFFYFALDPLGFRFTVGS 455
           +IL+   LD SKC+ L+ +L+P  F+  F T  ++VN +T L+FF FA     FRFTV S
Sbjct: 54  SILAHKVLDSSKCKTLIPNLTPHEFEHSFFTHHTTVNLKTTLDFFSFASKNFKFRFTVRS 113

Query: 456 YCNLIRLLLGNNLVAPARLLLIRLIDGKLRALFENEKNEDRHIDIARGIADLSGREGDLV 635
           YC LIRLLL +N +  A+  L RLI+G      +  K + R  +IA    +L  R     
Sbjct: 114 YCILIRLLLASNHIPRAKFTLKRLIEGNANTPLK--KTDARLSEIASAFLELGERSH--- 168

Query: 636 GVIDLVVHVYCTQFKGLGLGLGFDVFRFFAGRNLFPSLKTCNFLLSSLVKANELERSCEV 815
           G +DL++++ C+QF+ LG    FD F  F  + +FPSLK+CNFL+SSLVK+NEL +S  V
Sbjct: 169 GELDLLIYILCSQFQHLGFHWAFDTFMLFTSKGVFPSLKSCNFLMSSLVKSNELHKSFRV 228

Query: 816 FSLICRGGVSPDVISFSTLINVFCK 890
           F  +CRGGV  DV +++T IN +CK
Sbjct: 229 FDAMCRGGVLIDVYTYATAINAYCK 253


Top