BLASTX nr result

ID: Catharanthus23_contig00034356

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00034356
         (648 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004233195.1| PREDICTED: pentatricopeptide repeat-containi...   217   3e-54
ref|XP_006353048.1| PREDICTED: pentatricopeptide repeat-containi...   216   4e-54
emb|CBI31086.3| unnamed protein product [Vitis vinifera]              186   4e-45
ref|XP_002268853.1| PREDICTED: pentatricopeptide repeat-containi...   186   4e-45
emb|CAN66615.1| hypothetical protein VITISV_022030 [Vitis vinifera]   186   5e-45
gb|EXB75175.1| hypothetical protein L484_025954 [Morus notabilis]     166   5e-39
ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containi...   159   5e-37
gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus...   156   6e-36
ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containi...   153   4e-35
ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containi...   153   4e-35
ref|XP_003630936.1| Pentatricopeptide repeat-containing protein ...   150   4e-34
ref|XP_006468372.1| PREDICTED: pentatricopeptide repeat-containi...   142   6e-32
ref|XP_006448816.1| hypothetical protein CICLE_v10014257mg [Citr...   139   9e-31
ref|XP_002317690.2| pentatricopeptide repeat-containing family p...   134   2e-29
gb|EPS70037.1| hypothetical protein M569_04719, partial [Genlise...   134   2e-29
gb|EOY25609.1| Tetratricopeptide repeat (TPR)-like superfamily p...   126   5e-27
gb|EMJ00152.1| hypothetical protein PRUPE_ppa018505mg [Prunus pe...   124   3e-26
ref|XP_004158080.1| PREDICTED: pentatricopeptide repeat-containi...   119   6e-25
ref|XP_004135750.1| PREDICTED: pentatricopeptide repeat-containi...   119   6e-25
ref|NP_193861.1| pentatricopeptide repeat-containing protein [Ar...   109   8e-22

>ref|XP_004233195.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Solanum lycopersicum]
          Length = 853

 Score =  217 bits (552), Expect = 3e-54
 Identities = 109/184 (59%), Positives = 134/184 (72%), Gaps = 8/184 (4%)
 Frame = +2

Query: 119 KNLSVFKFRSIQTSTARPFVANFPY--------TEEALASKIAPLLVSCSTPGLNGSSLH 274
           KN+     RSI  + A     N P+        TEE LASK+AP+L SC++   N   L 
Sbjct: 6   KNICSIYRRSISVAAAFSSKPNSPFIQDSVIHCTEEVLASKLAPILQSCNSSAEN---LG 62

Query: 275 SIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWII 454
           S++R+G+QVHAQ+TVNGI+NLGILGTRILGMY+LCN+  DA  LF+QL L YASPWNW+I
Sbjct: 63  SVIRKGEQVHAQVTVNGIDNLGILGTRILGMYVLCNRFIDAKKLFFQLRLCYASPWNWMI 122

Query: 455 RGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFE 634
           RG+ I G FD A+L +FKML FGT PDKYTFPYVIKACAG+ A+S GK +H L++ LGFE
Sbjct: 123 RGYTIMGRFDLAILLFFKMLVFGTYPDKYTFPYVIKACAGVNAVSFGKWLHRLVQSLGFE 182

Query: 635 TDVF 646
            DVF
Sbjct: 183 DDVF 186


>ref|XP_006353048.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Solanum tuberosum]
          Length = 852

 Score =  216 bits (551), Expect = 4e-54
 Identities = 108/184 (58%), Positives = 134/184 (72%), Gaps = 8/184 (4%)
 Frame = +2

Query: 119 KNLSVFKFRSIQTSTARPFVANFPY--------TEEALASKIAPLLVSCSTPGLNGSSLH 274
           KN+     RSI  + A     N P+        TE+ LASK+AP+L SC+    N   L 
Sbjct: 6   KNICSIFRRSISVAAAFSSKPNSPFFQDSAFHNTEQVLASKLAPILQSCTNSTEN---LG 62

Query: 275 SIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWII 454
           S++R+G+QVHAQ+TVNGI+NLGILGTRILGMY+LCN+  DA  LF+QL L YASPWNW+I
Sbjct: 63  SVLRKGEQVHAQVTVNGIDNLGILGTRILGMYVLCNRFIDAKKLFFQLQLCYASPWNWMI 122

Query: 455 RGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFE 634
           RG+ I G FD A+L +FKML FGTCPDKYTFP VIKACAG+ A++LGK +H L++ LGFE
Sbjct: 123 RGYTIMGRFDLAILLFFKMLVFGTCPDKYTFPCVIKACAGINAVNLGKWLHGLVQSLGFE 182

Query: 635 TDVF 646
            DVF
Sbjct: 183 DDVF 186


>emb|CBI31086.3| unnamed protein product [Vitis vinifera]
          Length = 766

 Score =  186 bits (473), Expect = 4e-45
 Identities = 90/176 (51%), Positives = 123/176 (69%), Gaps = 3/176 (1%)
 Frame = +2

Query: 128 SVFKFRSIQTSTA---RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 298
           + FK +S  T++    +P   +  + +++LA ++  +L +C+ P        S + +G+Q
Sbjct: 17  TTFKLKSFHTNSVNIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSQGRQ 67

Query: 299 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGS 478
            HAQ+ VNGI   GILGT++LGMY+LC    DA N+FYQL LW + PWNW+IRGF + G 
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRLWCSEPWNWMIRGFTMMGQ 127

Query: 479 FDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FDFALLFYFKMLG GT PDKYTFPYVIKAC GL +++LG++VH  I+ +GFE DVF
Sbjct: 128 FDFALLFYFKMLGCGTLPDKYTFPYVIKACGGLNSVALGRVVHDKIQFMGFELDVF 183


>ref|XP_002268853.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Vitis vinifera]
          Length = 853

 Score =  186 bits (473), Expect = 4e-45
 Identities = 90/176 (51%), Positives = 123/176 (69%), Gaps = 3/176 (1%)
 Frame = +2

Query: 128 SVFKFRSIQTSTA---RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 298
           + FK +S  T++    +P   +  + +++LA ++  +L +C+ P        S + +G+Q
Sbjct: 17  TTFKLKSFHTNSVNIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSQGRQ 67

Query: 299 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGS 478
            HAQ+ VNGI   GILGT++LGMY+LC    DA N+FYQL LW + PWNW+IRGF + G 
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRLWCSEPWNWMIRGFTMMGQ 127

Query: 479 FDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FDFALLFYFKMLG GT PDKYTFPYVIKAC GL +++LG++VH  I+ +GFE DVF
Sbjct: 128 FDFALLFYFKMLGCGTLPDKYTFPYVIKACGGLNSVALGRVVHDKIQFMGFELDVF 183


>emb|CAN66615.1| hypothetical protein VITISV_022030 [Vitis vinifera]
          Length = 818

 Score =  186 bits (472), Expect = 5e-45
 Identities = 90/176 (51%), Positives = 122/176 (69%), Gaps = 3/176 (1%)
 Frame = +2

Query: 128 SVFKFRSIQTST---ARPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQ 298
           + FK +S  T++    +P   +  + +++LA ++  +L +C+ P        S +  G+Q
Sbjct: 17  TTFKLKSFHTNSINIGKPLQFSI-HNDDSLAPQLVSILQTCTDP--------SGLSHGRQ 67

Query: 299 VHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGS 478
            HAQ+ VNGI   GILGT++LGMY+LC    DA N+FYQL LW + PWNW+IRGF + G 
Sbjct: 68  AHAQMLVNGIGYNGILGTKLLGMYVLCGAFLDAKNIFYQLRLWCSEPWNWMIRGFTMMGQ 127

Query: 479 FDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FDFALLFYFKMLG GT PDKYTFPYVIKAC GL +++LG++VH  I+ +GFE DVF
Sbjct: 128 FDFALLFYFKMLGCGTLPDKYTFPYVIKACGGLNSVALGRVVHDKIQFMGFELDVF 183



 Score = 56.6 bits (135), Expect = 6e-06
 Identities = 34/119 (28%), Positives = 50/119 (42%)
 Frame = +2

Query: 290 GQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFII 469
           G Q+H  +  +G+     +   +L MY  C    DA  LF  +       WN +I G++ 
Sbjct: 267 GSQLHGLVVSSGLEMDSPVANTLLAMYAKCGHLFDARRLFDMMPKTDLVTWNGMISGYVQ 326

Query: 470 KGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
            G  D A   + +M+  G  PD  TF   +   +    L  GK +H  I   G   DVF
Sbjct: 327 NGFMDEASCLFHEMISAGMKPDSITFSSFLPLLSEGATLRQGKEIHCYIIRNGVSLDVF 385


>gb|EXB75175.1| hypothetical protein L484_025954 [Morus notabilis]
          Length = 850

 Score =  166 bits (420), Expect = 5e-39
 Identities = 78/160 (48%), Positives = 110/160 (68%)
 Frame = +2

Query: 167 RPFVANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGIL 346
           +PF+ NFP TEEAL +    +L +C          H+++++G+Q+HAQ+  NGI+   ++
Sbjct: 36  KPFI-NFPRTEEALTNHFLSILQACCD--------HALLQQGRQIHAQVIANGISRKNLI 86

Query: 347 GTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGT 526
           GT+IL +Y+LC     A N+FY+L+L +ASPWNW+IR F + G FD A++ YFKML +GT
Sbjct: 87  GTKILAVYVLCGSFLYAKNVFYRLDLRFASPWNWMIRWFTMMGLFDVAIMLYFKMLCYGT 146

Query: 527 CPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
            PDKYTFP VIKAC GL  + L K VHS ++ +G E DVF
Sbjct: 147 SPDKYTFPPVIKACGGLNNVRLAKRVHSTVKLIGLEVDVF 186



 Score = 56.6 bits (135), Expect = 6e-06
 Identities = 36/123 (29%), Positives = 55/123 (44%)
 Frame = +2

Query: 278 IMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIR 457
           ++R G Q+H  +  +G+     +   +L MY  C   SDA+ LF  +       WN +I 
Sbjct: 266 LVRFGTQLHGLVVNSGLELDSPVANTLLAMYSKCQHLSDAHKLFDLMPKTDLVTWNGMIS 325

Query: 458 GFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFET 637
           G++  G    A   + +M+  G  PD  TF   I +     +L  GK +H  I   G   
Sbjct: 326 GYVQNGFMIEASNCFHEMISAGVKPDSITFASFIPSVTESASLHKGKEIHGYIIRHGVPL 385

Query: 638 DVF 646
           DVF
Sbjct: 386 DVF 388


>ref|XP_004503357.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Cicer arietinum]
          Length = 875

 Score =  159 bits (403), Expect = 5e-37
 Identities = 75/152 (49%), Positives = 107/152 (70%)
 Frame = +2

Query: 191 YTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMY 370
           + E++LA+++  +   CS          S+++R +Q+HA + V+G+++   LG+RILGMY
Sbjct: 65  FFEQSLAAQLECMFRDCSNFDA------SMVQRVRQIHAHVVVSGMSDSLTLGSRILGMY 118

Query: 371 ILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFP 550
           ILC + +DA NLF++L L Y+ PWNW+IRGF + G FDFAL+F+F+MLG    PDKYTFP
Sbjct: 119 ILCGRFNDAGNLFFRLQLCYSLPWNWLIRGFSMLGWFDFALMFFFRMLGCNVAPDKYTFP 178

Query: 551 YVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           YVIKAC GL  + L K+VH L R +GF  D+F
Sbjct: 179 YVIKACGGLNNVPLCKMVHDLARSMGFHMDLF 210


>gb|ESW10516.1| hypothetical protein PHAVU_009G216300g [Phaseolus vulgaris]
          Length = 848

 Score =  156 bits (394), Expect = 6e-36
 Identities = 82/192 (42%), Positives = 119/192 (61%), Gaps = 6/192 (3%)
 Frame = +2

Query: 89  MYSRSSIAACKNL--SVFKFRSIQTSTARPFVANF----PYTEEALASKIAPLLVSCSTP 250
           MY+ S++ +   L  S  KF    T+     ++      P T+++L   +  L  +CS  
Sbjct: 1   MYNTSNLCSIFRLAFSRSKFMHTATNICNNVISKSHLLPPETQDSLTPHLESLFRACSDA 60

Query: 251 GLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWY 430
                   S++++ +QVH Q+ V G++++  L +RILG+Y+LC +  DA NLF++L L Y
Sbjct: 61  --------SLLQQVRQVHTQVVVGGMSDVCSLSSRILGLYVLCGRIKDAENLFFRLELCY 112

Query: 431 ASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHS 610
           A PWNW+IRG  + G FDFALLFYFKMLG    PDKYTFPYVIKAC GL  + L  +VH+
Sbjct: 113 ALPWNWMIRGLYMLGWFDFALLFYFKMLGNKVSPDKYTFPYVIKACGGLNNVPLCMVVHN 172

Query: 611 LIRDLGFETDVF 646
           ++R +GF  D+F
Sbjct: 173 MVRLMGFHVDLF 184


>ref|XP_006587119.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like isoform X2 [Glycine max]
           gi|571476945|ref|XP_003535029.2| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21300-like isoform X1 [Glycine max]
          Length = 848

 Score =  153 bits (387), Expect = 4e-35
 Identities = 81/191 (42%), Positives = 117/191 (61%), Gaps = 5/191 (2%)
 Frame = +2

Query: 89  MYSRSS-IAACKNLSVFKFRSIQTSTA----RPFVANFPYTEEALASKIAPLLVSCSTPG 253
           MY+R++ + +   LS  + + + T+T        V   P T ++L +++  L  +CS   
Sbjct: 1   MYNRTTNLCSIFRLSFSRSKLMHTATTSICNNNNVMAKPETLDSLTTQLESLFRACSDA- 59

Query: 254 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYA 433
                  S++++ +QVH Q+ V G+ ++    +R+LG+Y+LC +  DA NLF++L L YA
Sbjct: 60  -------SMVQQARQVHTQVIVGGMGDVCAPSSRVLGLYVLCGRFRDAGNLFFELELRYA 112

Query: 434 SPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSL 613
            PWNW+IRG  + G FDFALLFYFKMLG    PDKYTFPYVIKAC GL  + L  +VH  
Sbjct: 113 LPWNWMIRGLYMLGWFDFALLFYFKMLGSNVSPDKYTFPYVIKACGGLNNVPLCMVVHDT 172

Query: 614 IRDLGFETDVF 646
            R LGF  D+F
Sbjct: 173 ARSLGFHVDLF 183


>ref|XP_003547574.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Glycine max]
          Length = 846

 Score =  153 bits (387), Expect = 4e-35
 Identities = 75/157 (47%), Positives = 103/157 (65%)
 Frame = +2

Query: 176 VANFPYTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTR 355
           V + P T++ L +++  L  +CS          S++++ +QVH QI V G++++  L +R
Sbjct: 33  VMSKPETQDYLTTQLESLFRACSDA--------SVVQQARQVHTQIIVGGMSDVCALSSR 84

Query: 356 ILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPD 535
           +LG+Y+LC + SD  NLF+ L L  A PWNW+IRG  + G FDFALLFYFKMLG    PD
Sbjct: 85  VLGLYVLCGRISDGGNLFFGLELCNALPWNWMIRGLYMLGWFDFALLFYFKMLGSNVSPD 144

Query: 536 KYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           KYTFPYVIKAC GL  + L  +VH+  R LGF  D+F
Sbjct: 145 KYTFPYVIKACGGLNNVPLCMVVHNTARSLGFHVDLF 181


>ref|XP_003630936.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355524958|gb|AET05412.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 959

 Score =  150 bits (378), Expect = 4e-34
 Identities = 71/150 (47%), Positives = 99/150 (66%)
 Frame = +2

Query: 197 EEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYIL 376
           EE+LA+++  +  +        S    + ++ +Q+HA++ V G+N    LG+R+LGMY+L
Sbjct: 69  EESLAAQLESMFRA-----FPNSDASLVKQQVRQIHAKVLVCGMNGSLTLGSRMLGMYVL 123

Query: 377 CNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYV 556
           C    D  NLF +L L Y+ PWNW+IRGF + G FDFAL+F+F+MLG    PDKYTFPYV
Sbjct: 124 CRSFKDVGNLFCRLQLCYSLPWNWLIRGFSMLGCFDFALMFFFRMLGSNVAPDKYTFPYV 183

Query: 557 IKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           IKAC GL  + L K+VH L R +GF  D+F
Sbjct: 184 IKACGGLNNVPLCKMVHELARSMGFHMDLF 213


>ref|XP_006468372.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Citrus sinensis]
          Length = 847

 Score =  142 bits (359), Expect = 6e-32
 Identities = 83/191 (43%), Positives = 114/191 (59%), Gaps = 5/191 (2%)
 Frame = +2

Query: 89  MYSR---SSIAACKNLSVFKFRSIQTSTAR--PFVANFPYTEEALASKIAPLLVSCSTPG 253
           MY R   SS       S FK +SI ++       + +   T+ ALAS +  +L +C+   
Sbjct: 1   MYQRLITSSHKCLSTFSAFKCKSIHSNCEHFTNQLVSSHKTDTALASHLGSILEACAD-- 58

Query: 254 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYA 433
                 HS++++G+QVH+Q  +NGI++   LG +ILGMY+LC    DA N+F +L+L  +
Sbjct: 59  ------HSVLQQGRQVHSQFILNGISDNAALGAKILGMYVLCGGFIDAGNMFPRLDLATS 112

Query: 434 SPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSL 613
            PWN +IR F   G F FALLFYFKML  G  PD +TFP V+KAC+ L  L  GKLVH +
Sbjct: 113 LPWNRMIRVFAKMGLFRFALLFYFKMLSCGIRPDNHTFPSVMKACSALGNLRFGKLVHDM 172

Query: 614 IRDLGFETDVF 646
           I  +G E DVF
Sbjct: 173 IWLMGCEIDVF 183


>ref|XP_006448816.1| hypothetical protein CICLE_v10014257mg [Citrus clementina]
           gi|557551427|gb|ESR62056.1| hypothetical protein
           CICLE_v10014257mg [Citrus clementina]
          Length = 848

 Score =  139 bits (349), Expect = 9e-31
 Identities = 81/191 (42%), Positives = 116/191 (60%), Gaps = 5/191 (2%)
 Frame = +2

Query: 89  MYSRSSIAACKNLSVF---KFRSIQTSTAR--PFVANFPYTEEALASKIAPLLVSCSTPG 253
           MY R   ++ K LS+F   K +SI ++       + +   T+ ALAS +  +L +C+   
Sbjct: 1   MYQRLITSSHKCLSIFSAFKCKSIHSNCEHFTNQLVSSHKTDTALASHLGSILEACAD-- 58

Query: 254 LNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYA 433
                 HS++++G+QVH+Q  +NGI++   LG +ILGMY+LC    DA N+F +L+L  +
Sbjct: 59  ------HSVLQQGRQVHSQFILNGISDNAALGAKILGMYVLCGGFIDAGNMFPRLDLATS 112

Query: 434 SPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSL 613
            PWN +IR F   G F FALLFYFKML  G  PD +TFP V+KAC+ L  +  GKLVH +
Sbjct: 113 LPWNRMIRVFAKMGLFRFALLFYFKMLSCGIRPDNHTFPSVMKACSALGNVRFGKLVHDM 172

Query: 614 IRDLGFETDVF 646
           I  +G   DVF
Sbjct: 173 IWLMGCGIDVF 183


>ref|XP_002317690.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550328506|gb|EEE98302.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 757

 Score =  134 bits (338), Expect = 2e-29
 Identities = 59/97 (60%), Positives = 72/97 (74%)
 Frame = +2

Query: 356 ILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPD 535
           +LGMY+LCN   DA  LFYQL  +YA PWNW+IRG +  G FDFALLFYFKMLG G  PD
Sbjct: 1   MLGMYVLCNSFVDAKKLFYQLEFYYAMPWNWMIRGLVKLGCFDFALLFYFKMLGCGVFPD 60

Query: 536 KYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           KYTFP VIK C GL  + LGK++  +I ++GF+ D+F
Sbjct: 61  KYTFPPVIKCCTGLNNVRLGKVIQDMILEMGFDLDMF 97



 Score = 58.2 bits (139), Expect = 2e-06
 Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 10/193 (5%)
 Frame = +2

Query: 98  RSSIAACK--NLSVFKFRSIQTSTARPFVANFPYTEEA------LASKIAP--LLVSCST 247
           R ++ ACK  NLS      I T+    +V N    +        L  K+ P  L  S   
Sbjct: 312 RDAVMACKMFNLSTKFDIVIYTAMISGYVLNGMNKDALEIFRWLLQKKMIPNALTFSSIL 371

Query: 248 PGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLW 427
           P   G +    ++ G+++H  I  N +     +G+ I+ MY  C +   A+ +F ++++ 
Sbjct: 372 PACAGLAA---IKLGRELHGYIIKNELEEKCPVGSAIMNMYAKCGRLDLAHLIFGRISIK 428

Query: 428 YASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVH 607
            A  WN II  F   G  + A+  + +M   G   D  T    + ACA + AL  GK +H
Sbjct: 429 DAICWNSIITSFSQDGKPEEAIYLFRQMGMEGVKYDCVTVSAALSACANIPALHYGKEIH 488

Query: 608 SLIRDLGFETDVF 646
             +    FE+D+F
Sbjct: 489 GFMIKGAFESDLF 501


>gb|EPS70037.1| hypothetical protein M569_04719, partial [Genlisea aurea]
          Length = 740

 Score =  134 bits (337), Expect = 2e-29
 Identities = 56/97 (57%), Positives = 78/97 (80%)
 Frame = +2

Query: 356 ILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPD 535
           +LG+Y++ N ++DA  LF+++ L YA+PWNW+IRGF + G  D+A+LFYFKML FGT PD
Sbjct: 1   VLGIYLMSNNYNDAKKLFFRMQLCYAAPWNWMIRGFTVMGHCDYAVLFYFKMLAFGTSPD 60

Query: 536 KYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           KYTFPYVIKAC G++A+ L K +HS+I+ L +E DV+
Sbjct: 61  KYTFPYVIKACGGMKAVGLLKHIHSMIKKLCYELDVY 97


>gb|EOY25609.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508778355|gb|EOY25611.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778356|gb|EOY25612.1| Tetratricopeptide repeat
           (TPR)-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508778357|gb|EOY25613.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508778358|gb|EOY25614.1| Tetratricopeptide repeat
           (TPR)-like superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 833

 Score =  126 bits (317), Expect = 5e-27
 Identities = 72/186 (38%), Positives = 102/186 (54%)
 Frame = +2

Query: 89  MYSRSSIAACKNLSVFKFRSIQTSTARPFVANFPYTEEALASKIAPLLVSCSTPGLNGSS 268
           MY R+  + CK ++        TS   P  + FP T+      +A  L S S P      
Sbjct: 1   MYQRNLRSICKLIAPLTNVHTTTSQHIPRPSEFPTTQ------LASFLQSTSFP------ 48

Query: 269 LHSIMRRGQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNW 448
             S +++G+QVHA++ +N I     L   +L MY+ C   +DA N+FY+++L     WN 
Sbjct: 49  --SNLQQGKQVHARLILNEITTTDPL---LLAMYLRCGSFNDAKNMFYRIDLGCVKRWNL 103

Query: 449 IIRGFIIKGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLG 628
           +IRG +  G F   LLFYFKMLG G  PD +TFP V+KAC+GL  +  G L+H  I  +G
Sbjct: 104 MIRGLVKMGWFHLGLLFYFKMLGCGVSPDNFTFPSVVKACSGLNNVRFGTLIHEAIMSMG 163

Query: 629 FETDVF 646
           FE +VF
Sbjct: 164 FEVNVF 169



 Score = 61.2 bits (147), Expect = 2e-07
 Identities = 36/119 (30%), Positives = 52/119 (43%)
 Frame = +2

Query: 290 GQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFII 469
           G Q+H  +   G+    ++   +L MY  C   SDA+ LF  +       WN +I G++ 
Sbjct: 253 GTQLHGLVVCCGLEFDSVVANALLSMYSKCGWLSDAHKLFGMMPQADLVSWNGMISGYVQ 312

Query: 470 KGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
            G    A   + +M+  G  PD  TF   + A  GL     GK +H  I   G   DVF
Sbjct: 313 NGFMQDASCLFNEMISSGLKPDAITFSSFLPAVTGLGCFRKGKEIHGYILRHGVSLDVF 371


>gb|EMJ00152.1| hypothetical protein PRUPE_ppa018505mg [Prunus persica]
          Length = 758

 Score =  124 bits (310), Expect = 3e-26
 Identities = 55/94 (58%), Positives = 68/94 (72%)
 Frame = +2

Query: 365 MYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYT 544
           MY LC    DA N+FY+L+L Y  PWNW+IRGF + G F+FALLFYFKMLG G  PDKYT
Sbjct: 1   MYFLCGSIVDAKNIFYKLDLQYTLPWNWMIRGFTMMGYFEFALLFYFKMLGSGISPDKYT 60

Query: 545 FPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FP VIKAC G+  + LGK ++  I+ +GF  D+F
Sbjct: 61  FPSVIKACGGVNNVRLGKAIYDTIQFMGFGVDIF 94


>ref|XP_004158080.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Cucumis sativus]
          Length = 762

 Score =  119 bits (299), Expect = 6e-25
 Identities = 52/94 (55%), Positives = 66/94 (70%)
 Frame = +2

Query: 365 MYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYT 544
           MY+      DA NLFY L L   S WNW+IRGF + G F++ALLFY KMLG G  PDKYT
Sbjct: 1   MYVRTGSLKDAKNLFYTLQLGCTSAWNWMIRGFTMMGQFNYALLFYLKMLGAGVSPDKYT 60

Query: 545 FPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FPYV+KAC GL+++ +GK+VH  +  +G + DVF
Sbjct: 61  FPYVVKACCGLKSVKMGKIVHETVNLMGLKEDVF 94


>ref|XP_004135750.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21300-like [Cucumis sativus]
          Length = 762

 Score =  119 bits (299), Expect = 6e-25
 Identities = 52/94 (55%), Positives = 66/94 (70%)
 Frame = +2

Query: 365 MYILCNKHSDANNLFYQLNLWYASPWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYT 544
           MY+      DA NLFY L L   S WNW+IRGF + G F++ALLFY KMLG G  PDKYT
Sbjct: 1   MYVRTGSLKDAKNLFYTLQLGCTSAWNWMIRGFTMMGQFNYALLFYLKMLGAGVSPDKYT 60

Query: 545 FPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FPYV+KAC GL+++ +GK+VH  +  +G + DVF
Sbjct: 61  FPYVVKACCGLKSVKMGKIVHETVNLMGLKEDVF 94


>ref|NP_193861.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75207660|sp|Q9STE1.1|PP333_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g21300 gi|3402749|emb|CAA20195.1| putative protein
           [Arabidopsis thaliana] gi|7268926|emb|CAB79129.1|
           putative protein [Arabidopsis thaliana]
           gi|332659037|gb|AEE84437.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 857

 Score =  109 bits (272), Expect = 8e-22
 Identities = 60/154 (38%), Positives = 85/154 (55%), Gaps = 2/154 (1%)
 Frame = +2

Query: 191 YTEEALASKIAPLLVSCSTPGLNGSSLHSIMRRGQQVHAQITVNGINNLGILGTRILGMY 370
           + EE +  +++ LL +CS P L        +R+G+QVHA + VN I+       RILGMY
Sbjct: 29  FLEETIPRRLSLLLQACSNPNL--------LRQGKQVHAFLIVNSISGDSYTDERILGMY 80

Query: 371 ILCNKHSDANNLFYQLNLWYAS--PWNWIIRGFIIKGSFDFALLFYFKMLGFGTCPDKYT 544
            +C   SD   +FY+L+L  +S  PWN II  F+  G  + AL FYFKML FG  PD  T
Sbjct: 81  AMCGSFSDCGKMFYRLDLRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVST 140

Query: 545 FPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
           FP ++KAC  L+       +   +  LG + + F
Sbjct: 141 FPCLVKACVALKNFKGIDFLSDTVSSLGMDCNEF 174



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 33/119 (27%), Positives = 59/119 (49%)
 Frame = +2

Query: 290 GQQVHAQITVNGINNLGILGTRILGMYILCNKHSDANNLFYQLNLWYASPWNWIIRGFII 469
           G Q+H  + V+G++  G +   +L MY  C +  DA+ LF  ++      WN +I G++ 
Sbjct: 258 GVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQ 317

Query: 470 KGSFDFALLFYFKMLGFGTCPDKYTFPYVIKACAGLRALSLGKLVHSLIRDLGFETDVF 646
            G  + +L F+++M+  G  PD  TF  ++ + +    L   K +H  I       D+F
Sbjct: 318 SGLMEESLTFFYEMISSGVLPDAITFSSLLPSVSKFENLEYCKQIHCYIMRHSISLDIF 376

