BLASTX nr result

ID: Catharanthus23_contig00037406 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00037406
         (322 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004229547.1| PREDICTED: pentatricopeptide repeat-containi...   110   2e-22
ref|XP_006359126.1| PREDICTED: pentatricopeptide repeat-containi...   107   2e-21
ref|XP_002264130.2| PREDICTED: pentatricopeptide repeat-containi...   107   2e-21
emb|CAN60118.1| hypothetical protein VITISV_016374 [Vitis vinifera]   107   2e-21
ref|XP_004294060.1| PREDICTED: pentatricopeptide repeat-containi...   104   1e-20
gb|EOY14017.1| Tetratricopeptide repeat (TPR)-like superfamily p...   103   2e-20
gb|EOY14016.1| Tetratricopeptide repeat-like superfamily protein...   103   2e-20
gb|EMJ14858.1| hypothetical protein PRUPE_ppa001106mg [Prunus pe...   102   4e-20
gb|EXB53343.1| hypothetical protein L484_016225 [Morus notabilis]     102   5e-20
ref|XP_002310674.2| hypothetical protein POPTR_0007s08080g [Popu...   100   2e-19
ref|XP_006422241.1| hypothetical protein CICLE_v10004260mg [Citr...   100   3e-19
ref|XP_002513375.1| pentatricopeptide repeat-containing protein,...    99   5e-19
ref|XP_004149965.1| PREDICTED: pentatricopeptide repeat-containi...    98   1e-18
ref|XP_003540076.2| PREDICTED: pentatricopeptide repeat-containi...    94   1e-17
ref|XP_004515321.1| PREDICTED: pentatricopeptide repeat-containi...    92   6e-17
ref|XP_003599152.1| Pentatricopeptide repeat-containing protein ...    92   7e-17
ref|XP_004983787.1| PREDICTED: pentatricopeptide repeat-containi...    88   1e-15
gb|AAD34705.1|AC006341_33 >F3O9.28 [Arabidopsis thaliana]              87   2e-15
ref|NP_001185013.1| PPR repeat domain-containing protein [Arabid...    87   2e-15
ref|NP_173097.2| PPR repeat domain-containing protein [Arabidops...    87   2e-15

>ref|XP_004229547.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Solanum lycopersicum]
          Length = 1038

 Score =  110 bits (276), Expect = 2e-22
 Identities = 55/90 (61%), Positives = 65/90 (72%)
 Frame = +1

Query: 52  GYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYVT 231
           G  EEALKLLL+MQR K++FDQF              LEEG+Q+H LA KLGFDS  +V 
Sbjct: 645 GLWEEALKLLLQMQREKLEFDQFSLSAALSAAANLASLEEGQQIHCLATKLGFDSNSFVG 704

Query: 232 NCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           N  MDMYGKCGE++DVLK+LPEPN+R RLS
Sbjct: 705 NATMDMYGKCGEMNDVLKILPEPNLRPRLS 734


>ref|XP_006359126.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Solanum tuberosum]
          Length = 1038

 Score =  107 bits (267), Expect = 2e-21
 Identities = 53/90 (58%), Positives = 64/90 (71%)
 Frame = +1

Query: 52  GYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYVT 231
           G  EEALKLLL+MQR K++FDQF              LEEG+Q+H LA KLGFDS  +V 
Sbjct: 645 GLWEEALKLLLQMQREKLEFDQFSLSAALSAAANLASLEEGQQIHCLATKLGFDSNSFVG 704

Query: 232 NCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           N  MDMYGKCGE+++VLK+ PEPN+R RLS
Sbjct: 705 NATMDMYGKCGEMNNVLKIFPEPNLRPRLS 734


>ref|XP_002264130.2| PREDICTED: pentatricopeptide repeat-containing protein At4g13650-like
            [Vitis vinifera]
          Length = 1724

 Score =  107 bits (266), Expect = 2e-21
 Identities = 54/92 (58%), Positives = 63/92 (68%)
 Frame = +1

Query: 46   HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
            HHG GEEALK+  EM+   V+ DQF              LEEG+QLH L IKLGF+S  +
Sbjct: 1329 HHGCGEEALKIFGEMRNVGVNLDQFSFSGGLAATANLAVLEEGQQLHGLVIKLGFESDLH 1388

Query: 226  VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
            VTN AMDMYGKCGE+ DVLKMLP+P  R+RLS
Sbjct: 1389 VTNAAMDMYGKCGEMHDVLKMLPQPINRSRLS 1420


>emb|CAN60118.1| hypothetical protein VITISV_016374 [Vitis vinifera]
          Length = 1166

 Score =  107 bits (266), Expect = 2e-21
 Identities = 54/92 (58%), Positives = 63/92 (68%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG GEEALK+  EM+   V+ DQF              LEEG+QLH L IKLGF+S  +
Sbjct: 679 HHGCGEEALKIFGEMRNVGVNLDQFSFSGGLAATANLAVLEEGQQLHGLVIKLGFESDLH 738

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           VTN AMDMYGKCGE+ DVLKMLP+P  R+RLS
Sbjct: 739 VTNAAMDMYGKCGEMHDVLKMLPQPINRSRLS 770


>ref|XP_004294060.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Fragaria vesca subsp. vesca]
          Length = 936

 Score =  104 bits (259), Expect = 1e-20
 Identities = 53/92 (57%), Positives = 63/92 (68%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           +HG  EEALKL+L M R  VDFDQF              L EG+QLH L +KLGFD+  Y
Sbjct: 542 NHGL-EEALKLVLMMGRAGVDFDQFSLSVALSVSADLAMLVEGQQLHGLVVKLGFDTDHY 600

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           +TN AMDMYGKCGE+ DVLK+LP P +R+RLS
Sbjct: 601 ITNAAMDMYGKCGEMEDVLKILPSPTIRSRLS 632


>gb|EOY14017.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 2
           [Theobroma cacao]
          Length = 787

 Score =  103 bits (258), Expect = 2e-20
 Identities = 50/91 (54%), Positives = 63/91 (69%)
 Frame = +1

Query: 49  HGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYV 228
           HG GEE LK +++M+   +D DQF              LEEG+QLH +A+KLGFDS  +V
Sbjct: 393 HGLGEEVLKHIVKMRTAGIDLDQFSFSEGLAATAKLAVLEEGQQLHCVAVKLGFDSDPFV 452

Query: 229 TNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           TN AMDMYGKCGE+ DVL+MLP+P  R+RLS
Sbjct: 453 TNAAMDMYGKCGEMDDVLRMLPQPVSRSRLS 483


>gb|EOY14016.1| Tetratricopeptide repeat-like superfamily protein isoform 1
            [Theobroma cacao]
          Length = 1196

 Score =  103 bits (258), Expect = 2e-20
 Identities = 50/91 (54%), Positives = 63/91 (69%)
 Frame = +1

Query: 49   HGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYV 228
            HG GEE LK +++M+   +D DQF              LEEG+QLH +A+KLGFDS  +V
Sbjct: 802  HGLGEEVLKHIVKMRTAGIDLDQFSFSEGLAATAKLAVLEEGQQLHCVAVKLGFDSDPFV 861

Query: 229  TNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
            TN AMDMYGKCGE+ DVL+MLP+P  R+RLS
Sbjct: 862  TNAAMDMYGKCGEMDDVLRMLPQPVSRSRLS 892


>gb|EMJ14858.1| hypothetical protein PRUPE_ppa001106mg [Prunus persica]
          Length = 908

 Score =  102 bits (255), Expect = 4e-20
 Identities = 52/92 (56%), Positives = 63/92 (68%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           +HG  E+ALKL++ M++  VD DQF              LEEG+QLH L +KLGFDS  Y
Sbjct: 514 NHGL-EKALKLVVMMKKAGVDLDQFSFSVALSVSADLAMLEEGQQLHGLVVKLGFDSDHY 572

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           VTN AMDMYGKCGE+ DVLK+LP P  R+RLS
Sbjct: 573 VTNAAMDMYGKCGEMEDVLKLLPSPTNRSRLS 604


>gb|EXB53343.1| hypothetical protein L484_016225 [Morus notabilis]
          Length = 920

 Score =  102 bits (254), Expect = 5e-20
 Identities = 51/92 (55%), Positives = 62/92 (67%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG GEEALKL+++M+   +  DQF              LEEG+QLH L IKLGF+   Y
Sbjct: 525 HHGCGEEALKLIMKMRNAGLLLDQFSLSVALSVSADLAILEEGQQLHGLVIKLGFELDHY 584

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           VTN AMDMYGKCGE+ DVL++L  P +RTRLS
Sbjct: 585 VTNTAMDMYGKCGEMDDVLRILSPPFIRTRLS 616


>ref|XP_002310674.2| hypothetical protein POPTR_0007s08080g [Populus trichocarpa]
           gi|550334392|gb|EEE91124.2| hypothetical protein
           POPTR_0007s08080g [Populus trichocarpa]
          Length = 999

 Score =  100 bits (250), Expect = 2e-19
 Identities = 49/92 (53%), Positives = 62/92 (67%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG+ EEALK LLEM+R  V+ D+F              LEEG+QLH LA+KLG DS  +
Sbjct: 604 HHGHMEEALKFLLEMRRAGVNVDEFSFSECLAAAAKLAILEEGQQLHGLAVKLGCDSNPF 663

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           V +  MDMYGKCGE+ DVL+++P P  R+RLS
Sbjct: 664 VASATMDMYGKCGEIDDVLRIIPRPINRSRLS 695


>ref|XP_006422241.1| hypothetical protein CICLE_v10004260mg [Citrus clementina]
           gi|557524114|gb|ESR35481.1| hypothetical protein
           CICLE_v10004260mg [Citrus clementina]
          Length = 936

 Score =  100 bits (248), Expect = 3e-19
 Identities = 51/91 (56%), Positives = 60/91 (65%)
 Frame = +1

Query: 49  HGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYV 228
           HG GEE LKLL++M+ T V FD+F              LEEG QLH LA KLGFD   +V
Sbjct: 543 HGQGEEVLKLLVKMRHTGVYFDRFSLSEGLAAAAKLAVLEEGHQLHGLATKLGFDLDPFV 602

Query: 229 TNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           TN AMDMYGKCGE+ DVL++ P+P  R RLS
Sbjct: 603 TNAAMDMYGKCGEIGDVLRIAPQPVDRPRLS 633


>ref|XP_002513375.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223547283|gb|EEF48778.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 922

 Score = 99.4 bits (246), Expect = 5e-19
 Identities = 49/92 (53%), Positives = 63/92 (68%)
 Frame = +1

Query: 46   HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
            +HG  EE+LKLL++M+   VD DQF              LEEG+QL +LA+KLGFDS  +
Sbjct: 797  YHGQMEESLKLLVKMRHAGVDLDQFSFSGCLSATATLAMLEEGQQLQSLAVKLGFDSDPF 856

Query: 226  VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
            VTN  MDMY KCGEL DVL+++P+P  R+RLS
Sbjct: 857  VTNALMDMYAKCGELDDVLRIIPQPLERSRLS 888


>ref|XP_004149965.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Cucumis sativus]
           gi|449497665|ref|XP_004160467.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g13650-like [Cucumis sativus]
          Length = 938

 Score = 98.2 bits (243), Expect = 1e-18
 Identities = 46/91 (50%), Positives = 61/91 (67%)
 Frame = +1

Query: 49  HGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYV 228
           +G+GEEALKL++ M+   ++FDQF              LEEG+QLH   IKLGF+   ++
Sbjct: 544 YGFGEEALKLVVRMRSAGIEFDQFNFSTALSVAADLAMLEEGQQLHGSTIKLGFELDHFI 603

Query: 229 TNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
            N AMDMYGKCGEL D L++LP+P  R+RLS
Sbjct: 604 INAAMDMYGKCGELDDALRILPQPTDRSRLS 634


>ref|XP_003540076.2| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Glycine max]
          Length = 1044

 Score = 94.4 bits (233), Expect = 1e-17
 Identities = 44/92 (47%), Positives = 60/92 (65%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           H+G GEEALKL+++M+   +  DQF              L+EG+QLH+L IK GF+S  Y
Sbjct: 649 HYGPGEEALKLIIKMRNDGIHLDQFSFSVAHAIIGNLTLLDEGQQLHSLIIKHGFESNDY 708

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           V N  MDMYGKCGE+ DV ++LP+P  R++ S
Sbjct: 709 VLNATMDMYGKCGEIDDVFRILPQPRSRSQRS 740


>ref|XP_004515321.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like isoform X1 [Cicer arietinum]
          Length = 934

 Score = 92.4 bits (228), Expect = 6e-17
 Identities = 45/92 (48%), Positives = 57/92 (61%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           H+G GEEALK +  M+   VD DQF              L+EG+QLH+  IKLGF S +Y
Sbjct: 539 HYGPGEEALKFIARMRNDGVDLDQFSFSVALATIGNLTVLDEGQQLHSWIIKLGFKSNEY 598

Query: 226 VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
           V N  MDMYGKCGE+ DV ++LP P  R++ S
Sbjct: 599 VLNATMDMYGKCGEIDDVFRILPLPKSRSQRS 630


>ref|XP_003599152.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355488200|gb|AES69403.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 1125

 Score = 92.0 bits (227), Expect = 7e-17
 Identities = 44/92 (47%), Positives = 58/92 (63%)
 Frame = +1

Query: 46   HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
            H+G GEEALK +  M+   VD DQF              L+EG+QLH+  IKLGF+  +Y
Sbjct: 730  HYGPGEEALKFIARMRNDGVDLDQFSFSVALATIGNLTVLDEGQQLHSWIIKLGFELDEY 789

Query: 226  VTNCAMDMYGKCGELSDVLKMLPEPNMRTRLS 321
            V N  MDMYGKCGE+ DV ++LP P +R++ S
Sbjct: 790  VLNATMDMYGKCGEIDDVFRILPIPKIRSKRS 821


>ref|XP_004983787.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Setaria italica]
          Length = 893

 Score = 88.2 bits (217), Expect = 1e-15
 Identities = 42/87 (48%), Positives = 54/87 (62%)
 Frame = +1

Query: 49  HGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQYV 228
           HG GEEALKL ++++      D+F              LEEG QLH L++K G DS  +V
Sbjct: 499 HGRGEEALKLFMDLRHAGNKLDRFCVAECLSSSACLASLEEGMQLHGLSVKCGLDSDSHV 558

Query: 229 TNCAMDMYGKCGELSDVLKMLPEPNMR 309
            N AMDMYGKCG++ D+LKMLP+P  R
Sbjct: 559 VNAAMDMYGKCGKMDDMLKMLPDPASR 585


>gb|AAD34705.1|AC006341_33 >F3O9.28 [Arabidopsis thaliana]
          Length = 1027

 Score = 87.4 bits (215), Expect = 2e-15
 Identities = 41/83 (49%), Positives = 53/83 (63%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG+GEE LKL+ +M+   V  DQF              LEEG+QLH LA+KLGF+   +
Sbjct: 632 HHGHGEEVLKLVSKMRSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSF 691

Query: 226 VTNCAMDMYGKCGELSDVLKMLP 294
           + N A DMY KCGE+ +V+KMLP
Sbjct: 692 IFNAAADMYSKCGEIGEVVKMLP 714


>ref|NP_001185013.1| PPR repeat domain-containing protein [Arabidopsis thaliana]
           gi|332191339|gb|AEE29460.1| PPR repeat domain-containing
           protein [Arabidopsis thaliana]
          Length = 928

 Score = 87.4 bits (215), Expect = 2e-15
 Identities = 41/83 (49%), Positives = 53/83 (63%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG+GEE LKL+ +M+   V  DQF              LEEG+QLH LA+KLGF+   +
Sbjct: 525 HHGHGEEVLKLVSKMRSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSF 584

Query: 226 VTNCAMDMYGKCGELSDVLKMLP 294
           + N A DMY KCGE+ +V+KMLP
Sbjct: 585 IFNAAADMYSKCGEIGEVVKMLP 607


>ref|NP_173097.2| PPR repeat domain-containing protein [Arabidopsis thaliana]
           gi|332191338|gb|AEE29459.1| PPR repeat domain-containing
           protein [Arabidopsis thaliana]
          Length = 937

 Score = 87.4 bits (215), Expect = 2e-15
 Identities = 41/83 (49%), Positives = 53/83 (63%)
 Frame = +1

Query: 46  HHGYGEEALKLLLEMQRTKVDFDQFXXXXXXXXXXXXXCLEEGKQLHNLAIKLGFDSYQY 225
           HHG+GEE LKL+ +M+   V  DQF              LEEG+QLH LA+KLGF+   +
Sbjct: 542 HHGHGEEVLKLVSKMRSFGVSLDQFSFSEGLSAAAKLAVLEEGQQLHGLAVKLGFEHDSF 601

Query: 226 VTNCAMDMYGKCGELSDVLKMLP 294
           + N A DMY KCGE+ +V+KMLP
Sbjct: 602 IFNAAADMYSKCGEIGEVVKMLP 624


Top