BLASTX nr result

ID: Paeonia23_contig00021346 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00021346
         (752 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containi...   283   5e-74
ref|XP_007199804.1| hypothetical protein PRUPE_ppa004794mg [Prun...   256   5e-66
ref|XP_004161634.1| PREDICTED: pentatricopeptide repeat-containi...   251   2e-64
ref|XP_004145397.1| PREDICTED: pentatricopeptide repeat-containi...   251   2e-64
ref|XP_007041384.1| Tetratricopeptide repeat (TPR)-like superfam...   240   3e-61
ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containi...   238   2e-60
ref|XP_004289840.1| PREDICTED: pentatricopeptide repeat-containi...   233   4e-59
ref|XP_006299009.1| hypothetical protein CARUB_v10015136mg [Caps...   221   3e-55
ref|XP_006423153.1| hypothetical protein CICLE_v10028281mg [Citr...   220   4e-55
ref|XP_006409479.1| hypothetical protein EUTSA_v10022658mg [Eutr...   220   4e-55
gb|EXB63632.1| hypothetical protein L484_026974 [Morus notabilis]     220   5e-55
ref|NP_179197.1| pentatricopeptide repeat-containing protein [Ar...   220   5e-55
ref|XP_007131879.1| hypothetical protein PHAVU_011G049000g [Phas...   219   6e-55
ref|XP_002306075.1| pentatricopeptide repeat-containing family p...   211   2e-52
ref|XP_006349623.1| PREDICTED: pentatricopeptide repeat-containi...   198   2e-48
ref|XP_002519113.1| pentatricopeptide repeat-containing protein,...   197   4e-48
ref|XP_004248897.1| PREDICTED: pentatricopeptide repeat-containi...   191   2e-46
gb|EYU42638.1| hypothetical protein MIMGU_mgv1a023952mg [Mimulus...   186   8e-45
gb|EPS61672.1| hypothetical protein M569_13122 [Genlisea aurea]       181   2e-43
ref|XP_006856168.1| hypothetical protein AMTR_s00059p00176060 [A...   165   1e-38

>ref|XP_002281132.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Vitis vinifera]
          Length = 492

 Score =  283 bits (724), Expect = 5e-74
 Identities = 146/213 (68%), Positives = 169/213 (79%), Gaps = 1/213 (0%)
 Frame = +1

Query: 106 SSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDLKNNPRL 285
           SS P+ QN  T+TL+   VSIL+H RSK+RWS+L SL+P GF P++ SQI L +KNNP L
Sbjct: 24  SSLPSDQN-PTKTLISTAVSILRHQRSKSRWSHLQSLFPKGFTPTEASQIVLQIKNNPHL 82

Query: 286 ALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRLSAPSPK 465
           ALSFFLW   KSLCNH LLSYSTIIHILARARLKSQA   IRTAIRVFD  D  S+  PK
Sbjct: 83  ALSFFLWCHHKSLCNHTLLSYSTIIHILARARLKSQALGLIRTAIRVFDDSDECSSQPPK 142

Query: 466 IFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKVSTCNLLIWN 645
           IFE+LVKTY  C SAPFVFDLLIKACL SKRI+ SI+IV+MLRSRGISP +STCN LIW 
Sbjct: 143 IFESLVKTYNSCGSAPFVFDLLIKACLNSKRIEQSISIVKMLRSRGISPTISTCNALIWQ 202

Query: 646 VSRCQGVDAGYAIYKEVFG-FDGELKDKPKLFV 741
           VSR +G DAGY IY+EVFG +D E+ +K ++ V
Sbjct: 203 VSRGRGCDAGYEIYREVFGSWDDEINEKVRVRV 235


>ref|XP_007199804.1| hypothetical protein PRUPE_ppa004794mg [Prunus persica]
           gi|462395204|gb|EMJ01003.1| hypothetical protein
           PRUPE_ppa004794mg [Prunus persica]
          Length = 491

 Score =  256 bits (655), Expect = 5e-66
 Identities = 137/209 (65%), Positives = 161/209 (77%), Gaps = 4/209 (1%)
 Frame = +1

Query: 103 FSSSPAAQNVSTET--LVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDLKNN 276
           FSSSP +    ++T  L+ DVVSI+ + RSK RWSYL SLYP GFD +  SQI L +KNN
Sbjct: 24  FSSSPPSDQTPSQTNPLISDVVSIITNLRSKTRWSYLRSLYPHGFDSNDFSQIALHIKNN 83

Query: 277 PRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRLSAP 456
           PRLAL FFLW+Q KSLCNHNL S+STIIHILAR RL+SQA D IRTAIRV ++    S  
Sbjct: 84  PRLALRFFLWTQHKSLCNHNLQSHSTIIHILARGRLRSQAYDLIRTAIRVSESESIGSHE 143

Query: 457 SP--KIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKVSTCN 630
           S   K+FE+LVKTYR CDSAPFVFDLLIKACLESK+ID +I IVRML SRGISP +STCN
Sbjct: 144 SKPLKVFESLVKTYRQCDSAPFVFDLLIKACLESKKIDPAIQIVRMLLSRGISPGLSTCN 203

Query: 631 LLIWNVSRCQGVDAGYAIYKEVFGFDGEL 717
            LI  +S+ +G  AGY IY+E+FG D E+
Sbjct: 204 ALIRLLSQRRGAYAGYEIYREIFGLDCEV 232


>ref|XP_004161634.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Cucumis sativus]
          Length = 499

 Score =  251 bits (640), Expect = 2e-64
 Identities = 128/216 (59%), Positives = 161/216 (74%), Gaps = 9/216 (4%)
 Frame = +1

Query: 106 SSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDLKNNPRL 285
           SS P   + ST+  +  VVS+L H RSK+RW +L+SL P GFDP + S I L +KNNP L
Sbjct: 29  SSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHL 88

Query: 286 ALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRV--------FDAPD 441
           AL FFLW+Q+KSLCNHNL+SYST+IHILAR RL++ A+D I+TAIR         +   +
Sbjct: 89  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTE 148

Query: 442 RLSAPSP-KIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKV 618
           R S   P K+FETLVKTY+ C SAPFVFDLLIKA L+SK++D SI IVRMLRSRGISP+V
Sbjct: 149 RFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQV 208

Query: 619 STCNLLIWNVSRCQGVDAGYAIYKEVFGFDGELKDK 726
           ST N LI  VS+CQG +  YAI++EVFG D E++++
Sbjct: 209 STLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEE 244


>ref|XP_004145397.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Cucumis sativus]
           gi|449472579|ref|XP_004153637.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g15980-like [Cucumis sativus]
          Length = 499

 Score =  251 bits (640), Expect = 2e-64
 Identities = 128/216 (59%), Positives = 161/216 (74%), Gaps = 9/216 (4%)
 Frame = +1

Query: 106 SSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDLKNNPRL 285
           SS P   + ST+  +  VVS+L H RSK+RW +L+SL P GFDP + S I L +KNNP L
Sbjct: 29  SSPPTDPSPSTKPSISTVVSVLTHQRSKSRWRFLNSLCPNGFDPGEFSDILLQIKNNPHL 88

Query: 286 ALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRV--------FDAPD 441
           AL FFLW+Q+KSLCNHNL+SYST+IHILAR RL++ A+D I+TAIR         +   +
Sbjct: 89  ALRFFLWTQNKSLCNHNLISYSTLIHILARGRLRTHAKDVIQTAIRAAQLEDSDNYSKTE 148

Query: 442 RLSAPSP-KIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKV 618
           R S   P K+FETLVKTY+ C SAPFVFDLLIKA L+SK++D SI IVRMLRSRGISP+V
Sbjct: 149 RFSPSRPLKLFETLVKTYKRCGSAPFVFDLLIKALLDSKKLDSSIEIVRMLRSRGISPQV 208

Query: 619 STCNLLIWNVSRCQGVDAGYAIYKEVFGFDGELKDK 726
           ST N LI  VS+CQG +  YAI++EVFG D E++++
Sbjct: 209 STLNSLILLVSKCQGANVAYAIFREVFGLDCEIEEE 244


>ref|XP_007041384.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           isoform 1 [Theobroma cacao]
           gi|590682590|ref|XP_007041385.1| Tetratricopeptide
           repeat (TPR)-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590682593|ref|XP_007041386.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508705319|gb|EOX97215.1| Tetratricopeptide repeat
           (TPR)-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508705320|gb|EOX97216.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508705321|gb|EOX97217.1| Tetratricopeptide repeat
           (TPR)-like superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 490

 Score =  240 bits (613), Expect = 3e-61
 Identities = 129/227 (56%), Positives = 162/227 (71%), Gaps = 5/227 (2%)
 Frame = +1

Query: 67  TSSMAISVIKRMFSSS---PAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDP 237
           T  + +S+    FSSS   P+  +      +  V SIL HHRSK+RWS + +L+P GF P
Sbjct: 11  TPRLLLSLASFSFSSSYSTPSPPSSDQPDPIATVTSILTHHRSKSRWSTILTLFPSGFTP 70

Query: 238 SQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTA 417
           SQ SQITL LKNNP LAL FFL+++ KSLCNHNL SYSTIIHIL+RARLK++A++ IR A
Sbjct: 71  SQFSQITLQLKNNPHLALRFFLFTEQKSLCNHNLSSYSTIIHILSRARLKTRARELIRVA 130

Query: 418 IRVFDAPDRLSAPS-PKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLR 594
           IR    P   + P+  K+FE LVKTY  C SAPFVFDL +K+CL+ K++DGSI IVRML 
Sbjct: 131 IR---TPGMENEPTYLKLFELLVKTYNECGSAPFVFDLFVKSCLQMKKLDGSIEIVRMLM 187

Query: 595 SRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGF-DGELKDKPK 732
           SRGISP++STCN LI  VS+C+G   GY +YKEVFG  +GE +   K
Sbjct: 188 SRGISPQLSTCNALIGEVSKCRGAKRGYEVYKEVFGVGNGERESNVK 234


>ref|XP_003537906.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Glycine max]
          Length = 487

 Score =  238 bits (606), Expect = 2e-60
 Identities = 130/227 (57%), Positives = 158/227 (69%), Gaps = 14/227 (6%)
 Frame = +1

Query: 76  MAISVIKRMFSS----------SPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPG 225
           MAI ++K+   +          S +  N ++++LV D VSIL HHRSK+RWS L S  P 
Sbjct: 1   MAIQILKQFSQTLRPKPWTLFFSFSCSNDASQSLVTDAVSILTHHRSKSRWSNLRSACPN 60

Query: 226 GFDPSQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDF 405
           G  P++ S+ITL +KN P+LAL FFLW++ KSLCNHNL SYS+IIH+LARARL S A D 
Sbjct: 61  GITPAEFSEITLHIKNKPQLALRFFLWTKSKSLCNHNLASYSSIIHLLARARLSSHAYDL 120

Query: 406 IRTAIRVFDAPD----RLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSI 573
           IRTAIR     D    R ++    +FETLVKTYR   SAPFVFDLLIKACL+SK++D SI
Sbjct: 121 IRTAIRASHQNDEENCRFNSRPLNLFETLVKTYRDSGSAPFVFDLLIKACLDSKKLDPSI 180

Query: 574 AIVRMLRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDGE 714
            IVRML SRGISPKVST N LI  V + +GVD GYAIY+E F  D E
Sbjct: 181 EIVRMLLSRGISPKVSTLNSLISRVCKSRGVDEGYAIYREFFRLDEE 227


>ref|XP_004289840.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Fragaria vesca subsp. vesca]
          Length = 493

 Score =  233 bits (595), Expect = 4e-59
 Identities = 128/210 (60%), Positives = 152/210 (72%), Gaps = 3/210 (1%)
 Frame = +1

Query: 103 FSSSPAAQNVSTET-LVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDLKNNP 279
           FSSSP+ ++ S    L+P VVSIL   RSK+RW YL +LYP GF P+  SQI+L +KNNP
Sbjct: 29  FSSSPSDESPSEPNPLIPSVVSILTQLRSKSRWGYLRTLYPSGFTPNDFSQISLQIKNNP 88

Query: 280 RLALSFFLWSQDK--SLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRLSA 453
            L L FF W+Q+K  SLC HNLLSYSTIIHILAR+RLKSQA   I  AI V++  +    
Sbjct: 89  HLVLRFFQWTQNKNNSLCAHNLLSYSTIIHILARSRLKSQAYSLIGDAIWVWEPLE---- 144

Query: 454 PSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKVSTCNL 633
               +FETLVKTYR C SAPFVF+ LIKACLESK+ID +I IVRM+ SRGISP +STCN 
Sbjct: 145 ----VFETLVKTYRQCGSAPFVFNYLIKACLESKKIDPAIQIVRMILSRGISPGLSTCNS 200

Query: 634 LIWNVSRCQGVDAGYAIYKEVFGFDGELKD 723
           LI  V + QG  AGY IY+EVFG DG + D
Sbjct: 201 LIRCVMQRQGAYAGYEIYREVFGLDGRVLD 230


>ref|XP_006299009.1| hypothetical protein CARUB_v10015136mg [Capsella rubella]
           gi|482567718|gb|EOA31907.1| hypothetical protein
           CARUB_v10015136mg [Capsella rubella]
          Length = 492

 Score =  221 bits (562), Expect = 3e-55
 Identities = 118/228 (51%), Positives = 159/228 (69%)
 Frame = +1

Query: 64  PTSSMAISVIKRMFSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQ 243
           P S ++IS+   +  SSP +     + L+ D VSIL HHRSK+RWS L SL+P GF P Q
Sbjct: 15  PDSILSISLFTTV--SSPPS-----DPLISDAVSILTHHRSKSRWSTLRSLHPYGFTPFQ 67

Query: 244 VSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIR 423
            S+ITL L+NNP L+L FFL+++  SLC+H++ S ST+IHILAR+RLKS A + IR A+R
Sbjct: 68  FSEITLRLRNNPHLSLRFFLFTRRFSLCSHDVGSCSTLIHILARSRLKSHASEVIRLALR 127

Query: 424 VFDAPDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRG 603
           + D  +       K+F +LVK+Y LC SAPFVFDLL+K+CL+SK IDG++ ++R LRSRG
Sbjct: 128 LADDNEEGENRVLKVFRSLVKSYNLCGSAPFVFDLLVKSCLDSKEIDGAVMVMRKLRSRG 187

Query: 604 ISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDGELKDKPKLFVTK 747
           IS ++STCN L+  VSR +G   GY +Y+EVFG D    D  K   +K
Sbjct: 188 ISLQISTCNALVSEVSRRRGAFNGYKMYREVFGLDDVKVDDGKKMASK 235


>ref|XP_006423153.1| hypothetical protein CICLE_v10028281mg [Citrus clementina]
           gi|567861000|ref|XP_006423154.1| hypothetical protein
           CICLE_v10028281mg [Citrus clementina]
           gi|567861002|ref|XP_006423155.1| hypothetical protein
           CICLE_v10028281mg [Citrus clementina]
           gi|568851351|ref|XP_006479357.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g15980-like isoform X1 [Citrus sinensis]
           gi|568851353|ref|XP_006479358.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g15980-like isoform X2 [Citrus sinensis]
           gi|557525087|gb|ESR36393.1| hypothetical protein
           CICLE_v10028281mg [Citrus clementina]
           gi|557525088|gb|ESR36394.1| hypothetical protein
           CICLE_v10028281mg [Citrus clementina]
           gi|557525089|gb|ESR36395.1| hypothetical protein
           CICLE_v10028281mg [Citrus clementina]
          Length = 494

 Score =  220 bits (561), Expect = 4e-55
 Identities = 119/212 (56%), Positives = 150/212 (70%), Gaps = 3/212 (1%)
 Frame = +1

Query: 88  VIKRMFSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITLDL 267
           ++ +  +SS    +  +  L+  VVS+L HHRSK+RW++L SL   G  P+Q SQI L L
Sbjct: 22  LLSQFSTSSSTPPSDQSHNLIATVVSLLTHHRSKSRWNHLLSLCRSGLTPTQFSQIALGL 81

Query: 268 KNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRL 447
           KNNP LAL FF ++Q KSLC H+L SY+TIIHIL+RARL   A+D IR A+R   +P+  
Sbjct: 82  KNNPHLALHFFSFTQHKSLCKHSLSSYATIIHILSRARLIGPARDVIRVALR---SPE-- 136

Query: 448 SAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESK---RIDGSIAIVRMLRSRGISPKV 618
           + P  K+FE LVKTYR C SAPFVFDLLIK CLE K   +I+  + IVRML SRG+S KV
Sbjct: 137 NDPKLKLFEVLVKTYRECGSAPFVFDLLIKCCLEVKNIEKIETCVDIVRMLMSRGLSVKV 196

Query: 619 STCNLLIWNVSRCQGVDAGYAIYKEVFGFDGE 714
           STCN LIW VSR +GV +GY IY+EVFG D +
Sbjct: 197 STCNALIWEVSRGKGVISGYEIYREVFGLDSD 228


>ref|XP_006409479.1| hypothetical protein EUTSA_v10022658mg [Eutrema salsugineum]
           gi|557110641|gb|ESQ50932.1| hypothetical protein
           EUTSA_v10022658mg [Eutrema salsugineum]
          Length = 495

 Score =  220 bits (561), Expect = 4e-55
 Identities = 119/229 (51%), Positives = 161/229 (70%), Gaps = 4/229 (1%)
 Frame = +1

Query: 61  DPTSSMAISVIKRMFSSSPAAQNVS-TETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDP 237
           D T S+A + +  + S  P   N + ++ L+ D VSIL HHRSK+RWS L SL P GF P
Sbjct: 19  DSTHSIASASLTTVSSPPPDHSNPTPSDPLISDAVSILTHHRSKSRWSTLRSLNPSGFTP 78

Query: 238 SQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTA 417
           SQ S+ITL L+NNP L+L FFL+++  SLC H++ S ST+IHILAR+RLK+ A+D IR A
Sbjct: 79  SQFSEITLRLRNNPHLSLRFFLFTRRHSLCPHDIGSCSTLIHILARSRLKTDARDVIRLA 138

Query: 418 IRVF---DAPDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRM 588
           +R+    +  DR+S    ++F +L+K+Y  C SAPFVFDLLIK+CL+SK IDG++ ++R 
Sbjct: 139 LRLAGGDEEEDRVS----RVFRSLIKSYNRCGSAPFVFDLLIKSCLDSKEIDGAVMVMRK 194

Query: 589 LRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDGELKDKPKL 735
           LRSRGI  ++STCN LI  VSR +    GY +Y+EVFG DG  K K K+
Sbjct: 195 LRSRGIDLQISTCNALISEVSRRRDASKGYKLYREVFGLDG-AKAKAKI 242


>gb|EXB63632.1| hypothetical protein L484_026974 [Morus notabilis]
          Length = 476

 Score =  220 bits (560), Expect = 5e-55
 Identities = 125/239 (52%), Positives = 159/239 (66%), Gaps = 16/239 (6%)
 Frame = +1

Query: 76  MAISVIKRMFSSSPAAQ-------NVST-------ETLVPDVVSILKHHRSKNRWSYLHS 213
           MAI  +K +   +P  +       N ST       ++++  VV++L H RSK+RW++L S
Sbjct: 1   MAIRALKTILFPTPMQKPSFLHLSNFSTYSTPDNPQSMISTVVAVLTHQRSKSRWAHLRS 60

Query: 214 LYPGGFDPSQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQ 393
           L P GF PS+ SQI L LKNNP LAL FFLW+   SLC+HNL SYST+IHILAR RLK Q
Sbjct: 61  LRPNGFAPSEFSQIALHLKNNPHLALRFFLWTHRNSLCDHNLSSYSTLIHILARGRLKRQ 120

Query: 394 AQDFIRTAIRVFDAPD-RLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGS 570
           A   +R AIRV    +  L +   K+FETLVKTYR C SAPFVFDLLI+ACL+ K+ID S
Sbjct: 121 ALIVLRDAIRVSRLENGELESKPLKVFETLVKTYRQCGSAPFVFDLLIEACLDLKKIDSS 180

Query: 571 IAIVRMLRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFD-GELKDKPKLFVT 744
           I IVRML SR ISP+ STC  LI  VS+  G + GY +YKE+FG + G ++   ++F T
Sbjct: 181 IEIVRMLISRRISPRFSTCCSLIQQVSQRHGPNEGYKMYKEIFGSNCGAVEPSVEIFNT 239


>ref|NP_179197.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75267579|sp|Q9XIM8.1|PP155_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g15980 gi|5306237|gb|AAD41970.1| hypothetical protein
           [Arabidopsis thaliana] gi|330251359|gb|AEC06453.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 498

 Score =  220 bits (560), Expect = 5e-55
 Identities = 117/228 (51%), Positives = 160/228 (70%)
 Frame = +1

Query: 64  PTSSMAISVIKRMFSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQ 243
           P + ++IS++  +  SSP +    ++ L+ D VSIL HHRSK+RWS L SL P GF PSQ
Sbjct: 18  PDAILSISLLTTV--SSPPSP--PSDPLISDAVSILTHHRSKSRWSTLRSLQPSGFTPSQ 73

Query: 244 VSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIR 423
            S+ITL L+NNP L+L FFL+++  SLC+H+  S ST+IHIL+R+RLKS A + IR A+R
Sbjct: 74  FSEITLCLRNNPHLSLRFFLFTRRYSLCSHDTHSCSTLIHILSRSRLKSHASEIIRLALR 133

Query: 424 VFDAPDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRG 603
           +  A D       K+F +L+K+Y  C SAPFVFDLLIK+CL+SK IDG++ ++R LRSRG
Sbjct: 134 L-AATDEDEDRVLKVFRSLIKSYNRCGSAPFVFDLLIKSCLDSKEIDGAVMVMRKLRSRG 192

Query: 604 ISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDGELKDKPKLFVTK 747
           I+ ++STCN LI  VSR +G   GY +Y+EVFG D    D+ K  + K
Sbjct: 193 INAQISTCNALITEVSRRRGASNGYKMYREVFGLDDVSVDEAKKMIGK 240


>ref|XP_007131879.1| hypothetical protein PHAVU_011G049000g [Phaseolus vulgaris]
           gi|561004879|gb|ESW03873.1| hypothetical protein
           PHAVU_011G049000g [Phaseolus vulgaris]
          Length = 439

 Score =  219 bits (559), Expect = 6e-55
 Identities = 124/216 (57%), Positives = 149/216 (68%), Gaps = 16/216 (7%)
 Frame = +1

Query: 76  MAISVIKR------------MFSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLY 219
           MAI V+K+            +FSSS    N  +++ V +VV+IL +HRSK+RWS L S  
Sbjct: 1   MAIQVLKQFCKTPRPKAWALLFSSS--CSNHDSQSFVTNVVTILINHRSKSRWSNLRSAC 58

Query: 220 PGGFDPSQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQ 399
           P G DP + SQITL LKN P+LAL FFLW++ KSLC+HNL SYS IIH+LAR RL S A 
Sbjct: 59  PNGIDPLEFSQITLHLKNKPQLALRFFLWTKSKSLCHHNLASYSAIIHLLARGRLSSDAS 118

Query: 400 DFIRTAIRVFDAPD----RLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDG 567
             IRTAIR  D  D    R ++P   +FETLVKTYR   SAPFVFDLLIKACL+S+++D 
Sbjct: 119 HVIRTAIRDSDQTDDQNCRFASPPLNLFETLVKTYRDFGSAPFVFDLLIKACLDSRKVDP 178

Query: 568 SIAIVRMLRSRGISPKVSTCNLLIWNVSRCQGVDAG 675
           S+ IVRML SRGISPKVST N LI  V R +GVD G
Sbjct: 179 SVEIVRMLLSRGISPKVSTLNSLITGVCRSRGVDEG 214


>ref|XP_002306075.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222849039|gb|EEE86586.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 498

 Score =  211 bits (538), Expect = 2e-52
 Identities = 126/244 (51%), Positives = 158/244 (64%), Gaps = 6/244 (2%)
 Frame = +1

Query: 13  HNPRLSPPIAFPAAWNDPTSSMAISVIKRMFSSSPAAQNVSTETLVPDVVSILKHHRSKN 192
           H+P L PP         PT S+  S      ++SP   + S  T    ++S+L HHRSK+
Sbjct: 10  HHPPLPPP--------PPTYSLPFST-----TTSPPPNHHSPLTTA--IISLLTHHRSKS 54

Query: 193 RWSYLHSLYPGGFD----PSQVSQITLDLKNNPRLALSFFLWS-QDKSLCNHNLLSYSTI 357
           RWS+L SL          P   S ITL LK+NP LALSFF ++  + SLC+HNL SY+TI
Sbjct: 55  RWSHLRSLLTTTTSTPLAPGHFSLITLKLKSNPHLALSFFHFTLHNSSLCSHNLRSYATI 114

Query: 358 IHILARARLKSQAQDFIRTAIRVFDAPDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIK 537
           IHIL+RARLK+ AQ+ IR  +R       L     + FE LVK+YR CDSAPFVFDLLIK
Sbjct: 115 IHILSRARLKAHAQEIIRAGLRSQILYHLLK--EVRFFEVLVKSYRECDSAPFVFDLLIK 172

Query: 538 ACLESKRIDGSIAIVRMLRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDG-E 714
           +CLE K+IDGSI IV+MLRS+GISP +STCN LI  VSRC+G   GY ++KEVFG +  E
Sbjct: 173 SCLELKKIDGSIEIVKMLRSKGISPSISTCNALISEVSRCKGSFVGYGVFKEVFGLESCE 232

Query: 715 LKDK 726
           L +K
Sbjct: 233 LGEK 236


>ref|XP_006349623.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like isoform X1 [Solanum tuberosum]
           gi|565365876|ref|XP_006349624.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At2g15980-like isoform X2 [Solanum tuberosum]
          Length = 493

 Score =  198 bits (503), Expect = 2e-48
 Identities = 112/217 (51%), Positives = 147/217 (67%), Gaps = 5/217 (2%)
 Frame = +1

Query: 64  PTSSMAISVIKRMFSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYP--GGFDP 237
           P +   +S      SS P +++   ETLV    +ILKHHRSK+RWS + SL P   GF P
Sbjct: 16  PLTKHPLSTFSASTSSPPQSED---ETLVSAATTILKHHRSKSRWSEILSLAPPTSGFTP 72

Query: 238 SQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTA 417
           SQVS+I L L+N P LAL FF ++  +S+C H++ SY+TIIHIL+R+RLKSQA + I+ A
Sbjct: 73  SQVSKIILQLRNTPHLALRFFNFTVHRSICCHSVSSYATIIHILSRSRLKSQALELIKCA 132

Query: 418 IRVFD---APDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRM 588
           IR F     PD  S+  P+IFE LVKTYR CDSAPFVFDLLIKA L+SK+ID S+ +VR 
Sbjct: 133 IRKFPDIHKPD--SSNPPRIFEILVKTYRSCDSAPFVFDLLIKAYLDSKKIDVSVQLVRT 190

Query: 589 LRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVF 699
           L S+ I P +  CN LI  +++ +G  A Y +Y E+F
Sbjct: 191 LASKNIFPHIVVCNSLIELIAKSRGPFAAYDMYVEIF 227


>ref|XP_002519113.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223541776|gb|EEF43324.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 486

 Score =  197 bits (500), Expect = 4e-48
 Identities = 113/222 (50%), Positives = 147/222 (66%), Gaps = 14/222 (6%)
 Frame = +1

Query: 76  MAISVIKRMFSSSPAA---QNVST-------ETLVPDVVSILKHHRSKNRWSYLHSLYPG 225
           MA  ++KR+   SP     Q++ST       + L+  + S+L HHRSK+RW++L SL   
Sbjct: 1   MASPILKRILLFSPKTINPQSLSTASPPSSDQQLITTITSLLIHHRSKSRWTHLRSLILT 60

Query: 226 G---FDPSQVSQITLDLKNNPRLALSFFLWS-QDKSLCNHNLLSYSTIIHILARARLKSQ 393
                 P+  SQI L LK+NPRLAL FF ++ ++ S C+H+L S STI HIL+RARLK Q
Sbjct: 61  SNKTLTPTHFSQIILLLKSNPRLALRFFHFTLRNPSFCSHDLRSISTITHILSRARLKPQ 120

Query: 394 AQDFIRTAIRVFDAPDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSI 573
           AQ  I  A       D  +  + K FE LVKTYR CDSAPFVFDLLIK+CLE K+ID  +
Sbjct: 121 AQSIIHLAFTSPVLVDDSNGQALKFFEILVKTYRECDSAPFVFDLLIKSCLELKKIDDGL 180

Query: 574 AIVRMLRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVF 699
            IVR+LRSRGISP +STCN L+  VS+C+G  AGY +++EVF
Sbjct: 181 KIVRLLRSRGISPLISTCNFLVSWVSKCKGCYAGYGVFREVF 222


>ref|XP_004248897.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g15980-like [Solanum lycopersicum]
          Length = 495

 Score =  191 bits (485), Expect = 2e-46
 Identities = 111/224 (49%), Positives = 150/224 (66%), Gaps = 11/224 (4%)
 Frame = +1

Query: 76  MAISVIKRMFSSSPAAQNVST------ETLVPDVVSILKHHRSKNRWSYLHSLYP--GGF 231
           ++I++ K   S+  A+ + S+      E LV    +ILKHHRSK+RWS + SL P   GF
Sbjct: 13  LSIALTKHSLSTHSASASASSPPLSEDERLVSAATTILKHHRSKSRWSEILSLAPPTSGF 72

Query: 232 DPSQVSQITLDLKNNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIR 411
            PSQVS+I L L+N P LAL FF ++  +S+C H+L SY+TIIHIL+R+RLK  A + I+
Sbjct: 73  TPSQVSKIILQLRNTPHLALRFFNFTVHRSICCHSLSSYATIIHILSRSRLKPHALELIK 132

Query: 412 TAIRVFD---APDRLSAPSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIV 582
            AIR F     PD LS P P+ FE LVKTYR CDSAPFVFDLL+KA L+SK+ID S+ +V
Sbjct: 133 CAIRKFPDTHQPD-LSNP-PRFFEILVKTYRSCDSAPFVFDLLMKAYLDSKKIDVSVQLV 190

Query: 583 RMLRSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFGFDGE 714
           R+L S+ I P +  CN LI  +++ +G  A Y +Y E+F  + E
Sbjct: 191 RILASKNIFPHIVVCNSLIELIAKSRGPFAAYDMYVEIFRCEKE 234


>gb|EYU42638.1| hypothetical protein MIMGU_mgv1a023952mg [Mimulus guttatus]
          Length = 415

 Score =  186 bits (472), Expect = 8e-45
 Identities = 101/208 (48%), Positives = 136/208 (65%), Gaps = 4/208 (1%)
 Frame = +1

Query: 103 FSSSPAAQNVSTETLVPDVVSILKHHRSKNRWSYLHSLYPGGFD----PSQVSQITLDLK 270
           FSSS      S   +    VSILKHHRSK+RWS+L SL   G D      Q SQI L L+
Sbjct: 3   FSSSSTTPPTSGVDVASAAVSILKHHRSKSRWSHLRSLIAAGEDNRLTADQFSQIALQLR 62

Query: 271 NNPRLALSFFLWSQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRLS 450
           N+P+L + FF ++   SL +H L SY+T+IHIL+R+RLKSQA   I++++  F    + +
Sbjct: 63  NSPQLVIRFFHFTLHHSLSSHTLFSYATVIHILSRSRLKSQALTLIKSSMCAFSESQQQT 122

Query: 451 APSPKIFETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKVSTCN 630
           + +  I + L+KTYR CDSAPFVFDLL+KACLESK+ D +I I   L+S+ +  K STCN
Sbjct: 123 S-TISILDALIKTYRACDSAPFVFDLLVKACLESKKTDLAIEIYAALKSKNVHLKTSTCN 181

Query: 631 LLIWNVSRCQGVDAGYAIYKEVFGFDGE 714
            LI  VS+ +G  AGY +Y+E+F  D E
Sbjct: 182 SLIEIVSKNRGCFAGYDLYREIFNLDAE 209


>gb|EPS61672.1| hypothetical protein M569_13122 [Genlisea aurea]
          Length = 464

 Score =  181 bits (459), Expect = 2e-43
 Identities = 98/196 (50%), Positives = 133/196 (67%), Gaps = 2/196 (1%)
 Frame = +1

Query: 133 STETLVPDVVSILKHHRSKNRWSYLHSLY--PGGFDPSQVSQITLDLKNNPRLALSFFLW 306
           S E+LV   VS+L+HHRSK+RWS L SL   P    PS  SQ+ L ++NNPRL L+FF +
Sbjct: 14  SGESLVSAAVSVLQHHRSKSRWSNLRSLLSGPANLTPSHFSQVALRIRNNPRLVLAFFHF 73

Query: 307 SQDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDAPDRLSAPSPKIFETLVK 486
           +   SL +H+L SY+TIIHILAR+R KSQA   I +A+R     D  +     I + L+K
Sbjct: 74  TLRYSLSSHSLSSYATIIHILARSRRKSQALGVIISAMR--SHKDNTNQTPIAILQALIK 131

Query: 487 TYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRMLRSRGISPKVSTCNLLIWNVSRCQGV 666
           +YR+CDSAPFVFDLL+KAC++SK++D ++ I  +LRS+ +  K STCN LI   S+ QG 
Sbjct: 132 SYRVCDSAPFVFDLLVKACVDSKKLDSALQIHTLLRSKNVFLKTSTCNSLIELASKNQGS 191

Query: 667 DAGYAIYKEVFGFDGE 714
            AGY +Y E+F   G+
Sbjct: 192 VAGYNLYSEMFSSAGK 207


>ref|XP_006856168.1| hypothetical protein AMTR_s00059p00176060 [Amborella trichopoda]
           gi|548860027|gb|ERN17635.1| hypothetical protein
           AMTR_s00059p00176060 [Amborella trichopoda]
          Length = 511

 Score =  165 bits (418), Expect = 1e-38
 Identities = 96/217 (44%), Positives = 132/217 (60%), Gaps = 17/217 (7%)
 Frame = +1

Query: 103 FSSSPAAQNVSTET-------LVPDVVSILKHHRSKNRWSYLHSLYPGGFDPSQVSQITL 261
           FSSS     +S E        L+    +ILK HRSK+RW+YL +  P GF+P QVSQI +
Sbjct: 37  FSSSTQKTLLSAEEGGKEENELIFSATTILKEHRSKSRWNYLKASCPQGFNPQQVSQIII 96

Query: 262 DLKNNPRLALSFFLWS--QDKSLCNHNLLSYSTIIHILARARLKSQAQDFIRTAIRVFDA 435
           +L+N P LAL+FF WS  Q ++   HNLLSY TIIHILAR+RLK+  +  I  A+ V + 
Sbjct: 97  NLRNKPHLALAFFYWSAKQKQNSYKHNLLSYCTIIHILARSRLKNHVRSLILKAM-VEEQ 155

Query: 436 PDRLSAPSPKI--------FETLVKTYRLCDSAPFVFDLLIKACLESKRIDGSIAIVRML 591
              LS   P +          TL++TYR CDS P VFDLLI+  L +K++D +  IVR+L
Sbjct: 156 SLSLSPEGPSLSISELGNLLGTLIRTYRSCDSCPLVFDLLIEGHLRAKKVDCAAEIVRLL 215

Query: 592 RSRGISPKVSTCNLLIWNVSRCQGVDAGYAIYKEVFG 702
             RG+ P +   N L+  VS+ +G + G + +KE+FG
Sbjct: 216 VPRGLHPSIGILNTLLRLVSQSKGSNEGLSFFKEIFG 252


Top