BLASTX nr result

ID: Forsythia22_contig00013647 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00013647
         (440 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012830324.1| PREDICTED: uncharacterized protein At4g13200...    87   4e-15
gb|EYU46342.1| hypothetical protein MIMGU_mgv1a014561mg [Erythra...    84   4e-14
ref|XP_011083122.1| PREDICTED: uncharacterized protein At4g13200...    80   4e-13
ref|XP_011083123.1| PREDICTED: uncharacterized protein LOC105165...    77   3e-12
ref|XP_012828986.1| PREDICTED: uncharacterized protein At4g13200...    76   8e-12
gb|EYU17982.1| hypothetical protein MIMGU_mgv1a013846mg [Erythra...    76   8e-12
ref|XP_007049802.1| Uncharacterized protein isoform 2, partial [...    65   4e-11
ref|XP_012484856.1| PREDICTED: uncharacterized protein At4g13200...    74   5e-11
emb|CDP03458.1| unnamed protein product [Coffea canephora]             74   5e-11
ref|XP_011080178.1| PREDICTED: uncharacterized protein At4g13200...    71   2e-10
ref|XP_006443627.1| hypothetical protein CICLE_v10022209mg [Citr...    71   2e-10
gb|KDO65901.1| hypothetical protein CISIN_1g028159mg [Citrus sin...    71   3e-10
ref|XP_010069658.1| PREDICTED: uncharacterized protein At4g13200...    71   3e-10
ref|XP_009628665.1| PREDICTED: uncharacterized protein At4g13200...    70   4e-10
ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593...    70   4e-10
gb|KDO65902.1| hypothetical protein CISIN_1g028159mg [Citrus sin...    70   7e-10
ref|XP_009788645.1| PREDICTED: uncharacterized protein At4g13200...    69   9e-10
ref|XP_007049801.1| Uncharacterized protein isoform 1 [Theobroma...    69   9e-10
ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259...    69   9e-10
ref|XP_002521111.1| conserved hypothetical protein [Ricinus comm...    69   1e-09

>ref|XP_012830324.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like
           [Erythranthe guttatus] gi|604348188|gb|EYU46343.1|
           hypothetical protein MIMGU_mgv1a014561mg [Erythranthe
           guttata]
          Length = 186

 Score = 87.0 bits (214), Expect = 4e-15
 Identities = 58/121 (47%), Positives = 77/121 (63%), Gaps = 3/121 (2%)
 Frame = +3

Query: 3   SLCSPP-RIPSKIH-QLCWAGFPREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX-ENE 173
           SLCS P RIPS I+ Q C   +P       ++  LNS GLQKFP             ENE
Sbjct: 12  SLCSRPYRIPSTIYRQHCSVAYPL-----FSSRNLNS-GLQKFPRTVNCRRNGSADSENE 65

Query: 174 SRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQHEK 353
           SR +LDAFFLGKA+AEAL ER+ESSVGEFLS IG LQA +  + + ++++   K+++ ++
Sbjct: 66  SRAILDAFFLGKAVAEALNERIESSVGEFLSTIGRLQAEQQKQVQEFQDEVLEKARRAKE 125

Query: 354 Q 356
           Q
Sbjct: 126 Q 126


>gb|EYU46342.1| hypothetical protein MIMGU_mgv1a014561mg [Erythranthe guttata]
          Length = 180

 Score = 84.0 bits (206), Expect = 4e-14
 Identities = 60/129 (46%), Positives = 77/129 (59%), Gaps = 3/129 (2%)
 Frame = +3

Query: 3   SLCSPP-RIPSKIH-QLCWAGFPREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX-ENE 173
           SLCS P RIPS I+ Q C   +P       ++  LNS GLQKFP             ENE
Sbjct: 12  SLCSRPYRIPSTIYRQHCSVAYPL-----FSSRNLNS-GLQKFPRTVNCRRNGSADSENE 65

Query: 174 SRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQHEK 353
           SR +LDAFFLGKA+AEAL ER+ESSVGEFLS IG LQA +  + + + + ++ K +   +
Sbjct: 66  SRAILDAFFLGKAVAEALNERIESSVGEFLSTIGRLQAEQQKQVQEF-QARRAKEQAARE 124

Query: 354 QWKHKMLFP 380
             + K L P
Sbjct: 125 AMEAKGLIP 133


>ref|XP_011083122.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like
           isoform X1 [Sesamum indicum]
          Length = 180

 Score = 80.5 bits (197), Expect = 4e-13
 Identities = 53/119 (44%), Positives = 76/119 (63%), Gaps = 1/119 (0%)
 Frame = +3

Query: 3   SLCSPPRIPSKIHQLCWAGFPREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX-ENESR 179
           S+ S PRIPS+ +++    FP       +NLK  S+GL+K P             ENESR
Sbjct: 13  SVFSIPRIPSRDNRM----FP-------SNLKPVSSGLRKSPSISLRCNSTGDSGENESR 61

Query: 180 TVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQHEKQ 356
           +VLDAFFLGKA+ EAL ER+ES+VGE LSVIG+LQA +  +   ++E+   K+++ ++Q
Sbjct: 62  SVLDAFFLGKALGEALTERIESTVGEILSVIGSLQAEQQKQILEFQEEVLEKARRAKEQ 120


>ref|XP_011083123.1| PREDICTED: uncharacterized protein LOC105165718 isoform X2 [Sesamum
           indicum]
          Length = 148

 Score = 77.4 bits (189), Expect = 3e-12
 Identities = 54/114 (47%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
 Frame = +3

Query: 3   SLCSPPRIPSKIHQLCWAGFPREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX-ENESR 179
           S+ S PRIPS+ +++    FP       +NLK  S+GL+K P             ENESR
Sbjct: 13  SVFSIPRIPSRDNRM----FP-------SNLKPVSSGLRKSPSISLRCNSTGDSGENESR 61

Query: 180 TVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSK 341
           +VLDAFFLGKA+ EAL ER+ES+VGE LSVIG+LQA    +K+  + Q  P+ K
Sbjct: 62  SVLDAFFLGKALGEALTERIESTVGEILSVIGSLQA--EQQKQILEFQLHPELK 113


>ref|XP_012828986.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like
           [Erythranthe guttatus]
          Length = 172

 Score = 76.3 bits (186), Expect = 8e-12
 Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)
 Frame = +3

Query: 3   SLCSPPRIPSKIHQL--CWAGFP---REIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX- 164
           S+ S PRIPS I Q   C    P   R+ P   + +   S GL   P             
Sbjct: 10  SVSSIPRIPSIIGQRRRCSVVIPSATRQNPLFPSTINPVSIGLLNLPRINFLCKSTADSG 69

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+TVLDAFFLGKA+ E + ER+ES+VGEFLSVIG LQA +      ++E+   K+++
Sbjct: 70  ENESKTVLDAFFLGKALGEVINERIESTVGEFLSVIGRLQAEQQKHVSEFQEEVLEKARR 129

Query: 345 HEKQ 356
            ++Q
Sbjct: 130 AKEQ 133


>gb|EYU17982.1| hypothetical protein MIMGU_mgv1a013846mg [Erythranthe guttata]
          Length = 209

 Score = 76.3 bits (186), Expect = 8e-12
 Identities = 52/124 (41%), Positives = 70/124 (56%), Gaps = 6/124 (4%)
 Frame = +3

Query: 3   SLCSPPRIPSKIHQL--CWAGFP---REIPASCTNLKLNSTGLQKFPXXXXXXXXXXXX- 164
           S+ S PRIPS I Q   C    P   R+ P   + +   S GL   P             
Sbjct: 47  SVSSIPRIPSIIGQRRRCSVVIPSATRQNPLFPSTINPVSIGLLNLPRINFLCKSTADSG 106

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+TVLDAFFLGKA+ E + ER+ES+VGEFLSVIG LQA +      ++E+   K+++
Sbjct: 107 ENESKTVLDAFFLGKALGEVINERIESTVGEFLSVIGRLQAEQQKHVSEFQEEVLEKARR 166

Query: 345 HEKQ 356
            ++Q
Sbjct: 167 AKEQ 170


>ref|XP_007049802.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508702063|gb|EOX93959.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 149

 Score = 65.5 bits (158), Expect(2) = 4e-11
 Identities = 30/46 (65%), Positives = 38/46 (82%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSR 302
           +NESR VLDAFFLGKA+AEAL ER+ES++GEFL  +G LQA +  +
Sbjct: 76  DNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQKQ 121



 Score = 28.5 bits (62), Expect(2) = 4e-11
 Identities = 12/18 (66%), Positives = 15/18 (83%)
 Frame = +1

Query: 292 RIPGRGVGKSKKSQRASS 345
           ++ GRGVGK +KSQR SS
Sbjct: 132 KVVGRGVGKGQKSQRESS 149


>ref|XP_012484856.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic
           [Gossypium raimondii] gi|763742642|gb|KJB10141.1|
           hypothetical protein B456_001G186100 [Gossypium
           raimondii]
          Length = 214

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 35/64 (54%), Positives = 52/64 (81%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           +NESR VLDAFFLGKA+AEAL ER+ES++GEFLSV+G LQA +  + + ++E+   ++K+
Sbjct: 80  DNESRNVLDAFFLGKAVAEALNERIESTIGEFLSVVGRLQAEQQKQVQDFQEEVLERAKR 139

Query: 345 HEKQ 356
            ++Q
Sbjct: 140 AKEQ 143


>emb|CDP03458.1| unnamed protein product [Coffea canephora]
          Length = 268

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 46/129 (35%), Positives = 75/129 (58%), Gaps = 14/129 (10%)
 Frame = +3

Query: 12  SPPRIPSKIHQLC-----WAGFPREIPASCT-NLKLNSTGLQK--------FPXXXXXXX 149
           SPPRIPSK  Q C     W    + + +    +LKL+ +G +K                 
Sbjct: 73  SPPRIPSKPLQRCGTYQSWNYSSKSVHSLFFWDLKLSKSGSEKPSRISFLRCNSSTDPGG 132

Query: 150 XXXXXENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKK 329
                +N+S+T+LDAFFLGKA+AE++ ER+ES+VGEFLS +G LQA +  + + ++E+  
Sbjct: 133 PPGPGDNDSKTILDAFFLGKALAESVNERIESAVGEFLSAVGRLQAEQQKQVQDFQEEVL 192

Query: 330 PKSKQHEKQ 356
            ++K+ +++
Sbjct: 193 ERAKKAKER 201


>ref|XP_011080178.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like
           [Sesamum indicum]
          Length = 191

 Score = 71.2 bits (173), Expect = 2e-10
 Identities = 41/95 (43%), Positives = 61/95 (64%), Gaps = 1/95 (1%)
 Frame = +3

Query: 75  PASCTNLKLNSTGLQKFPXXXXXXXXXXXX-ENESRTVLDAFFLGKAMAEALGERLESSV 251
           P   +NL+L   GL +FP             +NES+ +LDAFFLGKA+AEA+ ER+ES+V
Sbjct: 43  PLFSSNLRL---GLHQFPRINLSCSCSGDSGDNESKAILDAFFLGKAVAEAVNERIESAV 99

Query: 252 GEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQHEKQ 356
           GEFLS IG LQA +  + + ++E    K+++ ++Q
Sbjct: 100 GEFLSTIGRLQAEQQKQVQEFQEDVLEKARRAKEQ 134


>ref|XP_006443627.1| hypothetical protein CICLE_v10022209mg [Citrus clementina]
           gi|568851255|ref|XP_006479309.1| PREDICTED:
           uncharacterized protein At4g13200, chloroplastic-like
           [Citrus sinensis] gi|557545889|gb|ESR56867.1|
           hypothetical protein CICLE_v10022209mg [Citrus
           clementina]
          Length = 214

 Score = 71.2 bits (173), Expect = 2e-10
 Identities = 48/111 (43%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
 Frame = +3

Query: 63  PREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXXENESRTVLDAFFLGKAMAEALGERLE 242
           PR IP  C     NST     P            + ESRTVLDAFFLGKA+AEAL ER+E
Sbjct: 61  PRRIPLQC-----NSTTKPGPPSGSG--------DGESRTVLDAFFLGKAVAEALNERIE 107

Query: 243 SSVGEFLSVIGTLQAXKNSRKRCWKEQ-----KKPKSKQHEKQWKHKMLFP 380
           S+VGEFLS +G LQA +  + + ++E      KK K K   +  + + L P
Sbjct: 108 SAVGEFLSTVGRLQAEQQKQVQEFQEDVLERAKKAKEKAAREAMEARGLVP 158


>gb|KDO65901.1| hypothetical protein CISIN_1g028159mg [Citrus sinensis]
          Length = 212

 Score = 70.9 bits (172), Expect = 3e-10
 Identities = 48/111 (43%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
 Frame = +3

Query: 63  PREIPASCTNLKLNSTGLQKFPXXXXXXXXXXXXENESRTVLDAFFLGKAMAEALGERLE 242
           PR IP  C     NST     P            + ESRTVLDAFFLGKA+AEAL ER+E
Sbjct: 61  PRRIPLQC-----NSTTKPGPPSGSG--------DGESRTVLDAFFLGKAVAEALNERIE 107

Query: 243 SSVGEFLSVIGTLQAXKNSRKRCWKEQ-----KKPKSKQHEKQWKHKMLFP 380
           S+VGEFLS +G LQA +  + + ++E      KK K K   +  + + L P
Sbjct: 108 SAVGEFLSTVGRLQAEQQKQVQEFQEDVLERAKKAKEKAAREAMEVRGLVP 158


>ref|XP_010069658.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic
           [Eucalyptus grandis] gi|702434471|ref|XP_010069659.1|
           PREDICTED: uncharacterized protein At4g13200,
           chloroplastic [Eucalyptus grandis]
           gi|629092079|gb|KCW58074.1| hypothetical protein
           EUGRSUZ_H00802 [Eucalyptus grandis]
           gi|629092080|gb|KCW58075.1| hypothetical protein
           EUGRSUZ_H00802 [Eucalyptus grandis]
          Length = 205

 Score = 70.9 bits (172), Expect = 3e-10
 Identities = 40/77 (51%), Positives = 52/77 (67%), Gaps = 5/77 (6%)
 Frame = +3

Query: 168 NESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQ-----KKP 332
           NESRTVLDAFFLGKA+AEAL ER+ES VGEFLS +G LQA +  + + ++E      KK 
Sbjct: 83  NESRTVLDAFFLGKALAEALNERVESVVGEFLSTVGRLQAEQQKQVQEFQEDVFERAKKA 142

Query: 333 KSKQHEKQWKHKMLFPS 383
           K K   +  + + L P+
Sbjct: 143 KEKAAREAMEAQGLIPT 159


>ref|XP_009628665.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic
           [Nicotiana tomentosiformis]
          Length = 200

 Score = 70.5 bits (171), Expect = 4e-10
 Identities = 33/64 (51%), Positives = 51/64 (79%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+ +LDAFFLGKA+AEA+ ER+ES+VGEFLS +G LQA +  + + ++E+   ++KQ
Sbjct: 76  ENESKNILDAFFLGKALAEAVTERIESTVGEFLSTVGRLQAEQQKQVQDFQEEVLERAKQ 135

Query: 345 HEKQ 356
            +++
Sbjct: 136 AKEK 139


>ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593653 [Solanum tuberosum]
          Length = 180

 Score = 70.5 bits (171), Expect = 4e-10
 Identities = 33/64 (51%), Positives = 51/64 (79%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+ +LDAFFLGKA+AEA+ ER+ES+VGEFLS +G LQA +  + + ++E+   ++KQ
Sbjct: 62  ENESKNILDAFFLGKALAEAVTERIESTVGEFLSTVGRLQAEQQKQVQDFQEEVLERAKQ 121

Query: 345 HEKQ 356
            +++
Sbjct: 122 AKEK 125


>gb|KDO65902.1| hypothetical protein CISIN_1g028159mg [Citrus sinensis]
          Length = 158

 Score = 69.7 bits (169), Expect = 7e-10
 Identities = 39/77 (50%), Positives = 52/77 (67%), Gaps = 5/77 (6%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQ-----KK 329
           + ESRTVLDAFFLGKA+AEAL ER+ES+VGEFLS +G LQA +  + + ++E      KK
Sbjct: 28  DGESRTVLDAFFLGKAVAEALNERIESAVGEFLSTVGRLQAEQQKQVQEFQEDVLERAKK 87

Query: 330 PKSKQHEKQWKHKMLFP 380
            K K   +  + + L P
Sbjct: 88  AKEKAAREAMEVRGLVP 104


>ref|XP_009788645.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic
           [Nicotiana sylvestris]
          Length = 193

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 33/64 (51%), Positives = 50/64 (78%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+ +LDAFFLGKA+AEA+ ER+ES VGEFLS +G LQA +  + + ++E+   ++KQ
Sbjct: 69  ENESKNILDAFFLGKALAEAVTERIESIVGEFLSTVGRLQAEQQKQVQDFQEEVLERAKQ 128

Query: 345 HEKQ 356
            +++
Sbjct: 129 AKEK 132


>ref|XP_007049801.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508702062|gb|EOX93958.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 202

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 32/64 (50%), Positives = 50/64 (78%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           +NESR VLDAFFLGKA+AEAL ER+ES++GEFL  +G LQA +  + + ++E+   ++K+
Sbjct: 76  DNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQKQVQDFQEEVLERAKR 135

Query: 345 HEKQ 356
            +++
Sbjct: 136 AKEK 139


>ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259284 [Solanum
           lycopersicum]
          Length = 223

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 33/64 (51%), Positives = 51/64 (79%)
 Frame = +3

Query: 165 ENESRTVLDAFFLGKAMAEALGERLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           ENES+ VLDAFFLGKA+AEA+ ER+ES+VGEFLS +G LQ+ +  + + ++E+   ++KQ
Sbjct: 104 ENESKNVLDAFFLGKALAEAVTERIESTVGEFLSTVGRLQSEQQKQVQDFQEEILERAKQ 163

Query: 345 HEKQ 356
            +++
Sbjct: 164 AKEK 167


>ref|XP_002521111.1| conserved hypothetical protein [Ricinus communis]
           gi|223539680|gb|EEF41262.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 215

 Score = 68.9 bits (167), Expect = 1e-09
 Identities = 44/97 (45%), Positives = 60/97 (61%), Gaps = 1/97 (1%)
 Frame = +3

Query: 57  GFPREIPASCT-NLKLNSTGLQKFPXXXXXXXXXXXXENESRTVLDAFFLGKAMAEALGE 233
           GF  E   S T NL+ NST     P            +NESR+VLDAFFLGKA+AEA+ E
Sbjct: 48  GFRNETTQSHTINLRCNSTTGPGGPGSG---------DNESRSVLDAFFLGKALAEAVNE 98

Query: 234 RLESSVGEFLSVIGTLQAXKNSRKRCWKEQKKPKSKQ 344
           R+ES+VGEFLS IG LQA +  + + ++E    ++++
Sbjct: 99  RVESAVGEFLSTIGRLQAEQQRQIQDFQEDVLERARK 135

