BLASTX nr result

ID: Perilla23_contig00030647 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00030647
         (322 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21545.1| hypothetical protein MIMGU_mgv1a016307mg [Erythra...   130   3e-28
emb|CDO98060.1| unnamed protein product [Coffea canephora]            122   8e-26
emb|CDP07765.1| unnamed protein product [Coffea canephora]            121   2e-25
ref|XP_010249774.1| PREDICTED: uncharacterized protein LOC104592...   102   1e-19
ref|XP_007024131.1| Uncharacterized protein TCM_028719 [Theobrom...    97   6e-18
gb|KDO51221.1| hypothetical protein CISIN_1g044132mg [Citrus sin...    96   8e-18
ref|XP_006427016.1| hypothetical protein CICLE_v10026796mg [Citr...    96   8e-18
ref|XP_012852634.1| PREDICTED: uncharacterized protein LOC105972...    95   2e-17
ref|XP_002516081.1| conserved hypothetical protein [Ricinus comm...    91   3e-16
ref|XP_006385473.1| hypothetical protein POPTR_0003s05390g [Popu...    90   6e-16
gb|EYU24692.1| hypothetical protein MIMGU_mgv1a018514mg, partial...    87   4e-15
gb|KJB56739.1| hypothetical protein B456_009G133500 [Gossypium r...    86   1e-14
ref|XP_007216358.1| hypothetical protein PRUPE_ppa020686mg [Prun...    86   1e-14
ref|XP_011032796.1| PREDICTED: uncharacterized protein LOC105131...    84   5e-14
ref|XP_002299643.1| hypothetical protein POPTR_0001s18080g [Popu...    83   9e-14
gb|KDP37209.1| hypothetical protein JCGZ_06265 [Jatropha curcas]       82   1e-13
ref|XP_009419297.1| PREDICTED: uncharacterized protein LOC103999...    82   2e-13
ref|XP_010098049.1| hypothetical protein L484_026180 [Morus nota...    82   2e-13
gb|KJB06232.1| hypothetical protein B456_001G029100 [Gossypium r...    79   2e-12
ref|XP_007013213.1| Uncharacterized protein TCM_037906 [Theobrom...    78   2e-12

>gb|EYU21545.1| hypothetical protein MIMGU_mgv1a016307mg [Erythranthe guttata]
          Length = 126

 Score =  130 bits (328), Expect = 3e-28
 Identities = 70/112 (62%), Positives = 81/112 (72%), Gaps = 12/112 (10%)
 Frame = -1

Query: 310 QWASHENACWPRSYSSRKYDRFVSCN---VPVKRRNS-PPIWRQIWNKLKKLEKKRIFHC 143
           +W+S EN CWPRSYSS +YDRFVS N    P+ R +S  PIW Q+W KLK+ EKKRIF C
Sbjct: 8   KWSSKENVCWPRSYSSLRYDRFVSYNNNMPPLTRSSSKAPIWMQLWRKLKR-EKKRIFQC 66

Query: 142 SN------PMSFTYDPHSYSQNFD--QGSSWTDHDDLSRSFSARFAVPSTIF 11
           +N       M FTYD +SYSQNFD  QGS W D D++SRSFSARFAVPS IF
Sbjct: 67  NNNNSNNSSMRFTYDAYSYSQNFDDNQGSVWADPDNVSRSFSARFAVPSRIF 118


>emb|CDO98060.1| unnamed protein product [Coffea canephora]
          Length = 117

 Score =  122 bits (307), Expect = 8e-26
 Identities = 58/92 (63%), Positives = 67/92 (72%)
 Frame = -1

Query: 286 CWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMSFTYDPHS 107
           CW RS+SSR Y RFVS  +     +  PIWRQ+W K+KK EKKR F+ S    F YDPH+
Sbjct: 19  CWSRSFSSRNYGRFVSHIIRPSSGSKAPIWRQLWTKMKK-EKKRSFYRSTSTRFAYDPHT 77

Query: 106 YSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           YSQNFDQG +W D DD+SRSFSARFAVPS IF
Sbjct: 78  YSQNFDQGLTWADPDDISRSFSARFAVPSRIF 109


>emb|CDP07765.1| unnamed protein product [Coffea canephora]
          Length = 116

 Score =  121 bits (304), Expect = 2e-25
 Identities = 58/92 (63%), Positives = 66/92 (71%)
 Frame = -1

Query: 286 CWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMSFTYDPHS 107
           CW RS SSR YDRFVS  +     +  PIWRQ+W K+K  EKKR F+ S    F YDPH+
Sbjct: 18  CWCRSLSSRNYDRFVSHIIRPSSGSKAPIWRQLWTKMKN-EKKRTFYRSTSTRFAYDPHT 76

Query: 106 YSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           YSQNFDQG +W D DD+SRSFSARFAVPS IF
Sbjct: 77  YSQNFDQGLTWADPDDISRSFSARFAVPSRIF 108


>ref|XP_010249774.1| PREDICTED: uncharacterized protein LOC104592237 [Nelumbo nucifera]
          Length = 123

 Score =  102 bits (254), Expect = 1e-19
 Identities = 50/90 (55%), Positives = 65/90 (72%), Gaps = 1/90 (1%)
 Frame = -1

Query: 277 RSYSSRKYDRFVSCNVPVK-RRNSPPIWRQIWNKLKKLEKKRIFHCSNPMSFTYDPHSYS 101
           R Y    YD F+S N+P    R++ P WR +W ++ K EKK+IF+ S+P+   YD +SYS
Sbjct: 28  RRYIDSDYDDFLSFNLPSSSHRSTTPRWRVLWRRIMK-EKKKIFYSSSPLQVPYDAYSYS 86

Query: 100 QNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           QNFDQGS+WT+ D+LSRSFSARFA PS IF
Sbjct: 87  QNFDQGSAWTEPDNLSRSFSARFADPSRIF 116


>ref|XP_007024131.1| Uncharacterized protein TCM_028719 [Theobroma cacao]
           gi|508779497|gb|EOY26753.1| Uncharacterized protein
           TCM_028719 [Theobroma cacao]
          Length = 138

 Score = 96.7 bits (239), Expect = 6e-18
 Identities = 52/107 (48%), Positives = 69/107 (64%), Gaps = 4/107 (3%)
 Frame = -1

Query: 319 EALQWASH--ENACWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFH 146
           E  +W S   ++  W + Y+  +YDR  S NV V R   P  WR +W +L + EKK+IF 
Sbjct: 26  ELSKWCSSGTQDLRWSQGYAKSQYDRMHSINVLVTRSKLPR-WRMLWRRLMR-EKKKIFD 83

Query: 145 CSNP--MSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           CS+   +  +YDP++Y+QNFDQG    D DDLSRSFSARFAVPS +F
Sbjct: 84  CSSSTRVHVSYDPYTYAQNFDQGLMSADPDDLSRSFSARFAVPSRVF 130


>gb|KDO51221.1| hypothetical protein CISIN_1g044132mg [Citrus sinensis]
          Length = 155

 Score = 96.3 bits (238), Expect = 8e-18
 Identities = 51/101 (50%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
 Frame = -1

Query: 304 ASHENACWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMS- 128
           + ++N C  +SY  R YDR +S N  +    +P  WR +W K+K+ EKKRIF CS+  + 
Sbjct: 50  SGNQNMCCCQSYVKRGYDRVLSFNGLITGSKTPR-WRLLWRKIKR-EKKRIFDCSSGSTN 107

Query: 127 --FTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
               YDP +YSQNFDQG  W D D+ SRSFSARFAVPS +F
Sbjct: 108 VHVPYDPCTYSQNFDQGYVWADPDNASRSFSARFAVPSRVF 148


>ref|XP_006427016.1| hypothetical protein CICLE_v10026796mg [Citrus clementina]
           gi|557529006|gb|ESR40256.1| hypothetical protein
           CICLE_v10026796mg [Citrus clementina]
          Length = 114

 Score = 96.3 bits (238), Expect = 8e-18
 Identities = 50/101 (49%), Positives = 67/101 (66%), Gaps = 3/101 (2%)
 Frame = -1

Query: 304 ASHENACWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMS- 128
           + ++N C  +SY  R YDR +S N  +    +P  WR +W K+K+ EK+RIF CS+  + 
Sbjct: 9   SGNQNMCCCQSYVRRGYDRVLSFNGLITGSKTPR-WRLLWRKIKR-EKERIFDCSSGSTN 66

Query: 127 --FTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
               YDP++YSQNFDQG  W D D+ SRSFSARFAVPS +F
Sbjct: 67  VHVPYDPYTYSQNFDQGYVWADPDNASRSFSARFAVPSRVF 107


>ref|XP_012852634.1| PREDICTED: uncharacterized protein LOC105972244 [Erythranthe
           guttatus]
          Length = 110

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 50/102 (49%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
 Frame = -1

Query: 307 WASHENACWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIF---HCSN 137
           W + E  C    YS RK D F   +     R+  PIW  +W ++KK +K+      +CS+
Sbjct: 6   WKNGELCC----YSGRKNDSFGPYS-RASTRSKAPIWIVLWRRIKKGKKRSSIMSQYCSS 60

Query: 136 PMSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
              FTYDP++YSQNFDQG  W D DDL+RSFSARFAVPS IF
Sbjct: 61  STRFTYDPYTYSQNFDQGLMWADSDDLNRSFSARFAVPSRIF 102


>ref|XP_002516081.1| conserved hypothetical protein [Ricinus communis]
           gi|223544986|gb|EEF46501.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 123

 Score = 90.9 bits (224), Expect = 3e-16
 Identities = 55/119 (46%), Positives = 75/119 (63%), Gaps = 16/119 (13%)
 Frame = -1

Query: 319 EALQWASHEN-----ACWPRSYSSRKYDRF-VSCNVPVKRRN------SPPIWRQIWNKL 176
           E   W +  N     AC+ +S   ++YDR  VS N+ V   +      + P WR +W K+
Sbjct: 2   EVRSWCNSGNSNVIDACYVKS---QRYDRIDVSFNMLVGHESKTANTAATPRWRLLWRKI 58

Query: 175 KKLEKKRIFHCSNP---MSFTYDPHSYSQNFDQGSS-WTDHDDLSRSFSARFAVPSTIF 11
            K EKK++F CS+    M F+YDP++Y+QNFDQGSS W+D D +SRSFSARFAVP+ IF
Sbjct: 59  MK-EKKKLFDCSSSSDRMHFSYDPYTYAQNFDQGSSMWSDPDSMSRSFSARFAVPARIF 116


>ref|XP_006385473.1| hypothetical protein POPTR_0003s05390g [Populus trichocarpa]
           gi|550342461|gb|ERP63270.1| hypothetical protein
           POPTR_0003s05390g [Populus trichocarpa]
          Length = 107

 Score = 90.1 bits (222), Expect = 6e-16
 Identities = 46/90 (51%), Positives = 59/90 (65%), Gaps = 2/90 (2%)
 Frame = -1

Query: 274 SYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPM--SFTYDPHSYS 101
           SY+ R YDR  S    V   +  P WR +W K+ K EK++IF+CS+    + TYDP++YS
Sbjct: 13  SYAKRGYDRIGSFKTLVSHESKSPRWRLLWKKIVK-EKRKIFYCSSSAQANITYDPYTYS 71

Query: 100 QNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           QNFD G   ++ DD SRSFSARF VPS IF
Sbjct: 72  QNFDHGLIMSNPDDSSRSFSARFPVPSRIF 101


>gb|EYU24692.1| hypothetical protein MIMGU_mgv1a018514mg, partial [Erythranthe
           guttata]
          Length = 79

 Score = 87.4 bits (215), Expect = 4e-15
 Identities = 40/68 (58%), Positives = 50/68 (73%), Gaps = 3/68 (4%)
 Frame = -1

Query: 205 PIWRQIWNKLKKLEKKRIF---HCSNPMSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSAR 35
           PIW  +W ++KK +K+      +CS+   FTYDP++YSQNFDQG  W D DDL+RSFSAR
Sbjct: 4   PIWIVLWRRIKKGKKRSSIMSQYCSSSTRFTYDPYTYSQNFDQGLMWADSDDLNRSFSAR 63

Query: 34  FAVPSTIF 11
           FAVPS IF
Sbjct: 64  FAVPSRIF 71


>gb|KJB56739.1| hypothetical protein B456_009G133500 [Gossypium raimondii]
          Length = 112

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 45/92 (48%), Positives = 60/92 (65%), Gaps = 5/92 (5%)
 Frame = -1

Query: 271 YSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHC-----SNPMSFTYDPHS 107
           + S +YDR +S +V +   +  P WR +W KL + EKK++F C     S   + +YDPH+
Sbjct: 16  WRSNRYDRMLSISV-LTTTSKVPRWRLLWRKLMR-EKKKVFACTSRTTSGVHNVSYDPHT 73

Query: 106 YSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           Y+QNFDQG    D DD SRSFSARFAVPS +F
Sbjct: 74  YAQNFDQGLISADPDDFSRSFSARFAVPSRVF 105


>ref|XP_007216358.1| hypothetical protein PRUPE_ppa020686mg [Prunus persica]
           gi|462412508|gb|EMJ17557.1| hypothetical protein
           PRUPE_ppa020686mg [Prunus persica]
          Length = 95

 Score = 85.5 bits (210), Expect = 1e-14
 Identities = 39/70 (55%), Positives = 49/70 (70%), Gaps = 3/70 (4%)
 Frame = -1

Query: 208 PPIWRQIWNKLKKLEKKRIFHCSNP---MSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSA 38
           P  WR +W K+KK +K+ +F CS     +   YDP++YS+NFDQG  W D D LSRSFSA
Sbjct: 19  PVRWRMLWRKIKKEKKRLLFDCSTSAQRVHVPYDPYTYSKNFDQGLMWADPDFLSRSFSA 78

Query: 37  RFAVPSTIFH 8
           RFAVPS +FH
Sbjct: 79  RFAVPSRVFH 88


>ref|XP_011032796.1| PREDICTED: uncharacterized protein LOC105131492 [Populus
           euphratica]
          Length = 108

 Score = 83.6 bits (205), Expect = 5e-14
 Identities = 44/91 (48%), Positives = 56/91 (61%), Gaps = 2/91 (2%)
 Frame = -1

Query: 277 RSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMS--FTYDPHSY 104
           +SY+  +YDR  S    V   +  P WR +W KL K  K+++F  S+      TYDP++Y
Sbjct: 12  KSYAESRYDRIGSFKALVSAESKTPRWRLLWRKLVKT-KRKVFDSSSSAQVYLTYDPYTY 70

Query: 103 SQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           SQNFD G   +  DD SRSFSARFAVPS IF
Sbjct: 71  SQNFDHGLVMSHPDDSSRSFSARFAVPSRIF 101


>ref|XP_002299643.1| hypothetical protein POPTR_0001s18080g [Populus trichocarpa]
           gi|222846901|gb|EEE84448.1| hypothetical protein
           POPTR_0001s18080g [Populus trichocarpa]
          Length = 117

 Score = 82.8 bits (203), Expect = 9e-14
 Identities = 43/91 (47%), Positives = 56/91 (61%), Gaps = 2/91 (2%)
 Frame = -1

Query: 277 RSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPMS--FTYDPHSY 104
           +SY+  +YDR  S    V   +  P WR +W K+ K  K+++F  S+      TYDP++Y
Sbjct: 12  KSYAESRYDRIGSFKALVSAESKTPRWRLLWRKMVKT-KRKVFDSSSSAQVYLTYDPYTY 70

Query: 103 SQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
           SQNFD G   +  DD SRSFSARFAVPS IF
Sbjct: 71  SQNFDDGLVMSHPDDSSRSFSARFAVPSRIF 101


>gb|KDP37209.1| hypothetical protein JCGZ_06265 [Jatropha curcas]
          Length = 116

 Score = 82.4 bits (202), Expect = 1e-13
 Identities = 53/115 (46%), Positives = 62/115 (53%), Gaps = 20/115 (17%)
 Frame = -1

Query: 295 ENACWPRSYSSRKYDRFVSC------------NVPVKRRNSPPIWRQIWNKLKKLEKKRI 152
           E   W  S   R YDR VS             N  +KR    P WR +W K+ K EKK+ 
Sbjct: 2   ELGTWCNSGGGR-YDRIVSTASFNMLESSSRSNGCLKR----PRWRLLWRKIMK-EKKKF 55

Query: 151 FHCSN--------PMSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTIF 11
             CS+         M F+YDP  Y+QNFDQGS W D D++SRSFSARFAVPS IF
Sbjct: 56  LDCSSLSSTSTISRMHFSYDPCGYAQNFDQGSMWCDPDNMSRSFSARFAVPSRIF 110


>ref|XP_009419297.1| PREDICTED: uncharacterized protein LOC103999300 [Musa acuminata
           subsp. malaccensis]
          Length = 125

 Score = 82.0 bits (201), Expect = 2e-13
 Identities = 37/68 (54%), Positives = 52/68 (76%)
 Frame = -1

Query: 217 RNSPPIWRQIWNKLKKLEKKRIFHCSNPMSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSA 38
           R+S    R +W ++ K EK+RIF+ ++P    YDPH+Y+QNFD+GS+W + +DLSRSFSA
Sbjct: 54  RSSATRLRGLWRRIMK-EKRRIFNPASPAPMAYDPHTYAQNFDEGSAWEEPEDLSRSFSA 112

Query: 37  RFAVPSTI 14
           RFAVPS +
Sbjct: 113 RFAVPSRV 120


>ref|XP_010098049.1| hypothetical protein L484_026180 [Morus notabilis]
           gi|587885629|gb|EXB74486.1| hypothetical protein
           L484_026180 [Morus notabilis]
          Length = 104

 Score = 81.6 bits (200), Expect = 2e-13
 Identities = 43/83 (51%), Positives = 58/83 (69%), Gaps = 1/83 (1%)
 Frame = -1

Query: 256 YDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSN-PMSFTYDPHSYSQNFDQGS 80
           Y++ V C   +K  +  P+W+ +W K+KK EK+R F  S+ P    YDP++YSQNFDQGS
Sbjct: 18  YEKIVLCTA-LKAGSKMPVWKVLWTKIKK-EKRRFFDSSSDPAHVPYDPYTYSQNFDQGS 75

Query: 79  SWTDHDDLSRSFSARFAVPSTIF 11
           +  D ++LSRSFSARFAVP  IF
Sbjct: 76  A-ADPENLSRSFSARFAVPLRIF 97


>gb|KJB06232.1| hypothetical protein B456_001G029100 [Gossypium raimondii]
          Length = 101

 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 42/99 (42%), Positives = 55/99 (55%)
 Frame = -1

Query: 310 QWASHENACWPRSYSSRKYDRFVSCNVPVKRRNSPPIWRQIWNKLKKLEKKRIFHCSNPM 131
           Q +  E     RSY+ R           +K   S P W   W KL+K E+K++F      
Sbjct: 9   QSSGRETITLGRSYTQRG----------IKDDRSKPKWTNFWRKLRK-ERKKLFSSGGTF 57

Query: 130 SFTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPSTI 14
             +Y+P +YSQNFDQG+ W + D+LSRSFSARFA PS I
Sbjct: 58  QASYEPDAYSQNFDQGTGWAEPDNLSRSFSARFADPSRI 96


>ref|XP_007013213.1| Uncharacterized protein TCM_037906 [Theobroma cacao]
           gi|508783576|gb|EOY30832.1| Uncharacterized protein
           TCM_037906 [Theobroma cacao]
          Length = 167

 Score = 78.2 bits (191), Expect = 2e-12
 Identities = 36/63 (57%), Positives = 47/63 (74%)
 Frame = -1

Query: 199 WRQIWNKLKKLEKKRIFHCSNPMSFTYDPHSYSQNFDQGSSWTDHDDLSRSFSARFAVPS 20
           W+ +W K KK EK++I  C +P    YDP++YSQNFDQG +W D D+LSRSFS RFA PS
Sbjct: 102 WKVLWMKFKK-EKRKI--CESPAQVPYDPYTYSQNFDQGFAWDDPDNLSRSFSMRFADPS 158

Query: 19  TIF 11
           ++F
Sbjct: 159 SVF 161

