BLASTX nr result

ID: Rheum21_contig00012094 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00012094
         (504 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI37461.3| unnamed protein product [Vitis vinifera]              111   1e-22
ref|XP_002263778.1| PREDICTED: putative pentatricopeptide repeat...   111   1e-22
ref|XP_002322117.2| pentatricopeptide repeat-containing family p...    87   3e-15
gb|EOY12919.1| Tetratricopeptide repeat-like superfamily protein...    84   2e-14
gb|EMJ28842.1| hypothetical protein PRUPE_ppa018028mg [Prunus pe...    82   6e-14
ref|XP_006836820.1| hypothetical protein AMTR_s00099p00041040 [A...    81   1e-13
ref|XP_006418059.1| hypothetical protein EUTSA_v10009444mg [Eutr...    66   4e-09

>emb|CBI37461.3| unnamed protein product [Vitis vinifera]
          Length = 822

 Score =  111 bits (277), Expect = 1e-22
 Identities = 52/131 (39%), Positives = 79/131 (60%)
 Frame = -1

Query: 417 ISNLFQTENLHALDP*CLRGFQPIHIDQILFSLQSNPISAIRFVQWSKNVSRIVPSAQSF 238
           I+  F   N    +   L   QP H++ ++F L+SNP SA+RF +W++N   +    QSF
Sbjct: 33  IAKAFHHNNFSFFNSGSLPNLQPAHLEPVVFQLRSNPTSALRFFEWAENFLGLCHPVQSF 92

Query: 237 CSLAHILLSHHMFDHAHKVFDEMIHRF*NMDLFFALS*GFSAYGTNVSTVYSCLVAGFCR 58
           C +AH+LL H MFD A +VFD M+ +F N+++       F  YG+N STVYS L+  +CR
Sbjct: 93  CGIAHVLLRHRMFDPATRVFDRMVGQFGNLEVLGEFHGSFRNYGSNPSTVYSFLLHCYCR 152

Query: 57  CGMIDHDLELY 25
            GM+D  ++ +
Sbjct: 153 NGMVDRAVDTF 163


>ref|XP_002263778.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g31840-like [Vitis vinifera]
          Length = 1131

 Score =  111 bits (277), Expect = 1e-22
 Identities = 52/131 (39%), Positives = 79/131 (60%)
 Frame = -1

Query: 417 ISNLFQTENLHALDP*CLRGFQPIHIDQILFSLQSNPISAIRFVQWSKNVSRIVPSAQSF 238
           I+  F   N    +   L   QP H++ ++F L+SNP SA+RF +W++N   +    QSF
Sbjct: 33  IAKAFHHNNFSFFNSGSLPNLQPAHLEPVVFQLRSNPTSALRFFEWAENFLGLCHPVQSF 92

Query: 237 CSLAHILLSHHMFDHAHKVFDEMIHRF*NMDLFFALS*GFSAYGTNVSTVYSCLVAGFCR 58
           C +AH+LL H MFD A +VFD M+ +F N+++       F  YG+N STVYS L+  +CR
Sbjct: 93  CGIAHVLLRHRMFDPATRVFDRMVGQFGNLEVLGEFHGSFRNYGSNPSTVYSFLLHCYCR 152

Query: 57  CGMIDHDLELY 25
            GM+D  ++ +
Sbjct: 153 NGMVDRAVDTF 163


>ref|XP_002322117.2| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550321948|gb|EEF06244.2|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 854

 Score = 86.7 bits (213), Expect = 3e-15
 Identities = 50/133 (37%), Positives = 72/133 (54%), Gaps = 2/133 (1%)
 Frame = -1

Query: 396 ENLHALDP*CLRGFQPIHIDQILFSLQSNPISAIRFVQWSKNVSRIVPSAQSFCSLAHIL 217
           +N   L+P  L   Q  H+  ++ SLQ  P SAIRF +W+++      SA SFC+L H+L
Sbjct: 65  QNPKPLNPILLSKLQLYHVPDVIISLQPKPFSAIRFFEWAESFFISPLSAPSFCALLHVL 124

Query: 216 LSHHMFDHAHKVFDEMIHRF*N-MDLFFALS*GF-SAYGTNVSTVYSCLVAGFCRCGMID 43
           L + +F  A  VFD+ I +F N  D   A   GF     TN S VY  L+  +CR GM D
Sbjct: 125 LQNQLFSRAACVFDKFIMQFGNDYDTLDAFRDGFCDLDSTNHSVVYGFLIESYCRKGMFD 184

Query: 42  HDLELYFIMCGGG 4
             ++++  +C  G
Sbjct: 185 KSVDIFMHVCVKG 197


>gb|EOY12919.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|508721023|gb|EOY12920.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 808

 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 3/127 (2%)
 Frame = -1

Query: 381 LDP*CLRGFQPIHIDQILFSLQSNPISAIRFVQWSKNVSRIVPSAQSFCSLAHILLSHHM 202
           L+P  L   QP H+  IL +LQS P SA+ F +W++   ++  +  S+C+L  +LL H +
Sbjct: 36  LNPTLLSKLQPSHVKPILLTLQSKPSSALNFFRWTQRFLKLPHAVPSYCALISLLLRHRV 95

Query: 201 FDHAHKVFDEMIHRF-*NMDLFFALS*GFSAYGTNVSTVYSCLVAGFCRCGMIDHDLELY 25
           F  A +VFDEM+  F  N+D+F A + G   + +N + V+  L+  +C+ GM+D    ++
Sbjct: 96  FGAAAEVFDEMMVLFGTNIDVFEAFNEGIKDFDSNPNVVFGFLLESYCKKGMVDMSFCVF 155

Query: 24  FIM--CG 10
             M  CG
Sbjct: 156 VKMSRCG 162


>gb|EMJ28842.1| hypothetical protein PRUPE_ppa018028mg [Prunus persica]
          Length = 802

 Score = 82.4 bits (202), Expect = 6e-14
 Identities = 46/114 (40%), Positives = 62/114 (54%), Gaps = 1/114 (0%)
 Frame = -1

Query: 351 PIHIDQILFSLQSNPISAIRFVQWSKNVSRIVPSAQSFCSLAHILLSHHMFDHAHKVFDE 172
           P H+  I  SLQSN ISA   + WS N   +  S QSFC+L H+LL H     A  +F+ 
Sbjct: 15  PNHVHHIPLSLQSNSISAYHLLDWSCNSPGLHHSPQSFCALTHLLLRHRKLAPASHLFNT 74

Query: 171 MIHRF*NMDLFFALS*GFSA-YGTNVSTVYSCLVAGFCRCGMIDHDLELYFIMC 13
           M+ +F     FFA     S  Y ++ S +YS L+  FCR GM+D  +E +  MC
Sbjct: 75  MVRQFGTHFHFFAAFSEISPNYASDSSDLYSFLIENFCRNGMLDSSIETFIHMC 128


>ref|XP_006836820.1| hypothetical protein AMTR_s00099p00041040 [Amborella trichopoda]
           gi|548839384|gb|ERM99673.1| hypothetical protein
           AMTR_s00099p00041040 [Amborella trichopoda]
          Length = 942

 Score = 81.3 bits (199), Expect = 1e-13
 Identities = 46/119 (38%), Positives = 69/119 (57%), Gaps = 2/119 (1%)
 Frame = -1

Query: 375 P*CLRGFQPIHIDQILFSLQSNPISAIRFVQWSK-NVSRIVPSAQSFCSLAHILLSHHMF 199
           P  L   QP H++ +L  L S P SA++F +W+  N+     +  +FCS+ HIL+ + MF
Sbjct: 50  PIFLHKLQPHHVNLLLHKLGSKPSSALQFFKWAPLNLRGFSHTPATFCSICHILIRNQMF 109

Query: 198 DHAHKVFDEMI-HRF*NMDLFFALS*GFSAYGTNVSTVYSCLVAGFCRCGMIDHDLELY 25
           + +  +FD+MI +   N DL   L  GF  YG+N  TVYS L  G+CR GM +  +E +
Sbjct: 110 EASRDLFDDMIAYSGANYDLVRELQSGFPIYGSNRVTVYSFLNIGYCRAGMNELAVEAF 168


>ref|XP_006418059.1| hypothetical protein EUTSA_v10009444mg [Eutrema salsugineum]
           gi|557095830|gb|ESQ36412.1| hypothetical protein
           EUTSA_v10009444mg [Eutrema salsugineum]
          Length = 827

 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 42/102 (41%), Positives = 58/102 (56%)
 Frame = -1

Query: 333 ILFSLQSNPISAIRFVQWSKNVSRIVPSAQSFCSLAHILLSHHMFDHAHKVFDEMIHRF* 154
           IL SLQS+P SA+ + +W++ +S + PS   F +L H+L+ H  FD A KVFDEMI    
Sbjct: 61  ILLSLQSDPYSAVNYFRWAE-MSGLAPS---FFTLVHVLVRHGKFDVADKVFDEMIANRG 116

Query: 153 NMDLFFALS*GFSAYGTNVSTVYSCLVAGFCRCGMIDHDLEL 28
           N+ +    S  F     N S VY  L+   CR GM D  +E+
Sbjct: 117 NISVMLDKSMDFP---LNHSVVYGFLMECCCRYGMFDEAMEI 155


Top