BLASTX nr result

ID: Catharanthus23_contig00019738 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00019738
         (597 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266366.1| PREDICTED: uncharacterized protein LOC100255...    87   4e-15
ref|XP_004156275.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...    80   3e-13
ref|XP_004143275.1| PREDICTED: uncharacterized protein LOC101222...    80   3e-13
ref|XP_006354372.1| PREDICTED: uncharacterized protein LOC102578...    80   4e-13
gb|EXB75877.1| hypothetical protein L484_022554 [Morus notabilis]      79   7e-13
ref|XP_004246621.1| PREDICTED: uncharacterized protein LOC101260...    79   9e-13
ref|XP_006428444.1| hypothetical protein CICLE_v10012507mg [Citr...    78   2e-12
ref|XP_006428443.1| hypothetical protein CICLE_v10012507mg [Citr...    78   2e-12
emb|CBI30143.3| unnamed protein product [Vitis vinifera]               77   3e-12
ref|XP_002519479.1| conserved hypothetical protein [Ricinus comm...    77   5e-12
gb|EOY07933.1| GATA zinc finger domain-containing protein 10 [Th...    76   6e-12
ref|XP_003542523.1| PREDICTED: uncharacterized protein LOC100793...    70   3e-10
ref|XP_006294832.1| hypothetical protein CARUB_v10023884mg, part...    69   7e-10
ref|XP_006410861.1| hypothetical protein EUTSA_v10017130mg [Eutr...    69   1e-09
ref|XP_002323503.2| hypothetical protein POPTR_0016s10710g [Popu...    68   2e-09
ref|XP_006294833.1| hypothetical protein CARUB_v10023884mg, part...    68   2e-09
ref|XP_002881477.1| hypothetical protein ARALYDRAFT_482670 [Arab...    68   2e-09
gb|EMJ05917.1| hypothetical protein PRUPE_ppa010283mg [Prunus pe...    67   3e-09
ref|NP_565853.1| uncharacterized protein [Arabidopsis thaliana] ...    67   3e-09
ref|NP_973615.2| uncharacterized protein [Arabidopsis thaliana] ...    67   3e-09

>ref|XP_002266366.1| PREDICTED: uncharacterized protein LOC100255653 [Vitis vinifera]
          Length = 260

 Score = 86.7 bits (213), Expect = 4e-15
 Identities = 52/113 (46%), Positives = 61/113 (53%), Gaps = 2/113 (1%)
 Frame = -3

Query: 334 NSSRFPSKYVKYSLFGTSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEV--SGE 161
           NSS F  K   +SL  TS+   R  F P  +  +       A  ++++  DE E    G 
Sbjct: 26  NSSSF-FKIPTHSLVATSTTLTRPLFTPSFVLKD-DHCEVIATFDQDKVEDEEEDMREGR 83

Query: 160 EIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           E  LYSFT            A  V SLFGPFVELVKSWNLP+WLVHWGHPGNM
Sbjct: 84  ETLLYSFTPLPLLLVAALPGAGAVRSLFGPFVELVKSWNLPDWLVHWGHPGNM 136


>ref|XP_004156275.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101230564 [Cucumis sativus]
          Length = 255

 Score = 80.5 bits (197), Expect = 3e-13
 Identities = 45/95 (47%), Positives = 53/95 (55%)
 Frame = -3

Query: 286 TSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXX 107
           +SS S   F +P  L  +       A  +E    +E +    E+RLYS +          
Sbjct: 37  SSSSSSFSFASPLPLRTSVRFNPSFARNDEFGDFEETKEETSEMRLYSLSPFPLLFIAAL 96

Query: 106 XXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
             A TV SLFGPFVELVKSWNLPEWLVHWGHPGNM
Sbjct: 97  PGAGTVRSLFGPFVELVKSWNLPEWLVHWGHPGNM 131


>ref|XP_004143275.1| PREDICTED: uncharacterized protein LOC101222568 [Cucumis sativus]
          Length = 255

 Score = 80.5 bits (197), Expect = 3e-13
 Identities = 45/95 (47%), Positives = 53/95 (55%)
 Frame = -3

Query: 286 TSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXX 107
           +SS S   F +P  L  +       A  +E    +E +    E+RLYS +          
Sbjct: 37  SSSSSSFSFASPLPLRTSVRFNPSFARNDEFGDFEETKEETSEMRLYSLSPFPLLFIAAL 96

Query: 106 XXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
             A TV SLFGPFVELVKSWNLPEWLVHWGHPGNM
Sbjct: 97  PGAGTVRSLFGPFVELVKSWNLPEWLVHWGHPGNM 131


>ref|XP_006354372.1| PREDICTED: uncharacterized protein LOC102578558 [Solanum tuberosum]
          Length = 260

 Score = 80.1 bits (196), Expect = 4e-13
 Identities = 46/98 (46%), Positives = 55/98 (56%), Gaps = 8/98 (8%)
 Frame = -3

Query: 271 QRGFFAPPLLAANYSKAS------DAAVVEENRATDEMEVSG--EEIRLYSFTXXXXXXX 116
           +R F APPL    Y K +      D  +VE  +  +E+E      E  LYS+        
Sbjct: 42  RRIFLAPPLA---YKKLNCEDINVDLKIVELEKEENEVEEMSFTNETFLYSYNPLPLMFV 98

Query: 115 XXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
                A T+ SLFGPFVELVKSWNLP+WLVHWGHPGNM
Sbjct: 99  AALPGAGTIRSLFGPFVELVKSWNLPDWLVHWGHPGNM 136


>gb|EXB75877.1| hypothetical protein L484_022554 [Morus notabilis]
          Length = 261

 Score = 79.3 bits (194), Expect = 7e-13
 Identities = 44/94 (46%), Positives = 55/94 (58%)
 Frame = -3

Query: 283 SSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXX 104
           +S  +R   +   +   +S+A D  +V+E    D+ E S  E  LYSF+           
Sbjct: 48  TSRRRRPLLSSVFVKDTFSQAID--MVDEEEEEDKEESS--ETLLYSFSPLPLMLVAALP 103

Query: 103 XAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
            A  V SLFGPFVELVKSWNLP+WLVHWGHPGNM
Sbjct: 104 GAGAVRSLFGPFVELVKSWNLPDWLVHWGHPGNM 137


>ref|XP_004246621.1| PREDICTED: uncharacterized protein LOC101260668 [Solanum
           lycopersicum]
          Length = 252

 Score = 79.0 bits (193), Expect = 9e-13
 Identities = 36/61 (59%), Positives = 41/61 (67%)
 Frame = -3

Query: 184 DEMEVSGEEIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGN 5
           +EM  +  E  LYSF             A T++SLFGPFVELVKSWNLP+WLVHWGHPGN
Sbjct: 68  EEMSFTNNETFLYSFNPLPLMFLAALPGAGTISSLFGPFVELVKSWNLPDWLVHWGHPGN 127

Query: 4   M 2
           M
Sbjct: 128 M 128


>ref|XP_006428444.1| hypothetical protein CICLE_v10012507mg [Citrus clementina]
           gi|557530501|gb|ESR41684.1| hypothetical protein
           CICLE_v10012507mg [Citrus clementina]
          Length = 261

 Score = 77.8 bits (190), Expect = 2e-12
 Identities = 40/74 (54%), Positives = 43/74 (58%)
 Frame = -3

Query: 223 ASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWN 44
           A  A  VEE     EME   E + LYS                TV +LFGPFVELVKSWN
Sbjct: 67  ADGAQQVEEEEEKKEMEKPRETL-LYSIAPLPLLFDAALPGGGTVRALFGPFVELVKSWN 125

Query: 43  LPEWLVHWGHPGNM 2
           LP+WLVHWGHPGNM
Sbjct: 126 LPDWLVHWGHPGNM 139


>ref|XP_006428443.1| hypothetical protein CICLE_v10012507mg [Citrus clementina]
           gi|557530500|gb|ESR41683.1| hypothetical protein
           CICLE_v10012507mg [Citrus clementina]
          Length = 209

 Score = 77.8 bits (190), Expect = 2e-12
 Identities = 40/74 (54%), Positives = 43/74 (58%)
 Frame = -3

Query: 223 ASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWN 44
           A  A  VEE     EME   E + LYS                TV +LFGPFVELVKSWN
Sbjct: 67  ADGAQQVEEEEEKKEMEKPRETL-LYSIAPLPLLFDAALPGGGTVRALFGPFVELVKSWN 125

Query: 43  LPEWLVHWGHPGNM 2
           LP+WLVHWGHPGNM
Sbjct: 126 LPDWLVHWGHPGNM 139


>emb|CBI30143.3| unnamed protein product [Vitis vinifera]
          Length = 182

 Score = 77.0 bits (188), Expect = 3e-12
 Identities = 36/55 (65%), Positives = 37/55 (67%)
 Frame = -3

Query: 166 GEEIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           G E  LYSFT            A  V SLFGPFVELVKSWNLP+WLVHWGHPGNM
Sbjct: 4   GRETLLYSFTPLPLLLVAALPGAGAVRSLFGPFVELVKSWNLPDWLVHWGHPGNM 58


>ref|XP_002519479.1| conserved hypothetical protein [Ricinus communis]
           gi|223541342|gb|EEF42893.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 263

 Score = 76.6 bits (187), Expect = 5e-12
 Identities = 43/93 (46%), Positives = 51/93 (54%)
 Frame = -3

Query: 280 SHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXXX 101
           + S R  F  P +     +A   +  +E+   + +E    E  LYSFT            
Sbjct: 50  TRSSRCLFPSPAIKNKVPEADYRS--DEDEEVEHVEEL-TETHLYSFTPLPLLLVAALPG 106

Query: 100 AETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           A TV SL GPFVELVKSWNLPEWLVHWGHPGNM
Sbjct: 107 AGTVRSLIGPFVELVKSWNLPEWLVHWGHPGNM 139


>gb|EOY07933.1| GATA zinc finger domain-containing protein 10 [Theobroma cacao]
          Length = 256

 Score = 76.3 bits (186), Expect = 6e-12
 Identities = 39/68 (57%), Positives = 45/68 (66%), Gaps = 2/68 (2%)
 Frame = -3

Query: 199 ENRATDEMEVSGE--EIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLV 26
           ++ A  ++E  GE  E  LYSF+            A TV SLFGPFVELVKSWNLP+WLV
Sbjct: 65  DDGADAKVEELGEPTETLLYSFSPLPLLVVAALPGAGTVRSLFGPFVELVKSWNLPDWLV 124

Query: 25  HWGHPGNM 2
           HWGHPGNM
Sbjct: 125 HWGHPGNM 132


>ref|XP_003542523.1| PREDICTED: uncharacterized protein LOC100793792 [Glycine max]
          Length = 243

 Score = 70.5 bits (171), Expect = 3e-10
 Identities = 28/32 (87%), Positives = 30/32 (93%)
 Frame = -3

Query: 97  ETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           E V S+FGPFVELVKSWNLP+WLVHWGHPGNM
Sbjct: 88  EAVTSVFGPFVELVKSWNLPDWLVHWGHPGNM 119


>ref|XP_006294832.1| hypothetical protein CARUB_v10023884mg, partial [Capsella rubella]
           gi|482563540|gb|EOA27730.1| hypothetical protein
           CARUB_v10023884mg, partial [Capsella rubella]
          Length = 203

 Score = 69.3 bits (168), Expect = 7e-10
 Identities = 49/144 (34%), Positives = 69/144 (47%)
 Frame = -3

Query: 433 MANASSSIAFITHYXXXXXXXXXXXXXXXXXLINSSRFPSKYVKYSLFGTSSHSQRGFFA 254
           +++ SS +A IT +                 L+NS+R  S++   S F   S+S R    
Sbjct: 18  LSSLSSPMAAITGFSALSIPISSPPSLPASRLLNSTRCFSRFSNLSPFPAFSNSPRRKIR 77

Query: 253 PPLLAANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXXXAETVNSLFG 74
              L    S   D     E+RA +E++    E  + S +            A TV+S+ G
Sbjct: 78  ---LITACSSTGDRGEEVESRAENEIK----ETLMLSVSPLPLLLVATLPGAATVSSVIG 130

Query: 73  PFVELVKSWNLPEWLVHWGHPGNM 2
           PF E+VKS NLP+WLVHWGHPGNM
Sbjct: 131 PFAEIVKSLNLPDWLVHWGHPGNM 154


>ref|XP_006410861.1| hypothetical protein EUTSA_v10017130mg [Eutrema salsugineum]
           gi|557112030|gb|ESQ52314.1| hypothetical protein
           EUTSA_v10017130mg [Eutrema salsugineum]
          Length = 255

 Score = 68.6 bits (166), Expect = 1e-09
 Identities = 47/112 (41%), Positives = 55/112 (49%)
 Frame = -3

Query: 337 INSSRFPSKYVKYSLFGTSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEE 158
           +NS    S++   S F   S S R     PL  A  S       VEE R  DE+     E
Sbjct: 26  LNSPPCLSRFPNVSPFPALSTSLRRRI--PLTPACSSIGDGDESVEEARGDDEIR----E 79

Query: 157 IRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
             + S +            A TV S+ GPFVE+VKS NLPEWLVHWGHPGNM
Sbjct: 80  TLMLSVSPLPLLLVASLPGAATVRSVIGPFVEIVKSLNLPEWLVHWGHPGNM 131


>ref|XP_002323503.2| hypothetical protein POPTR_0016s10710g [Populus trichocarpa]
           gi|550321227|gb|EEF05264.2| hypothetical protein
           POPTR_0016s10710g [Populus trichocarpa]
          Length = 334

 Score = 68.2 bits (165), Expect = 2e-09
 Identities = 32/50 (64%), Positives = 34/50 (68%)
 Frame = -3

Query: 151 LYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           LYSF             A TV SLFGPFVE+VKS NLP+WLVHWGHPGNM
Sbjct: 161 LYSFIPLPLLFTAALPGAATVRSLFGPFVEIVKSLNLPDWLVHWGHPGNM 210


>ref|XP_006294833.1| hypothetical protein CARUB_v10023884mg, partial [Capsella rubella]
           gi|482563541|gb|EOA27731.1| hypothetical protein
           CARUB_v10023884mg, partial [Capsella rubella]
          Length = 259

 Score = 68.2 bits (165), Expect = 2e-09
 Identities = 49/140 (35%), Positives = 66/140 (47%)
 Frame = -3

Query: 421 SSSIAFITHYXXXXXXXXXXXXXXXXXLINSSRFPSKYVKYSLFGTSSHSQRGFFAPPLL 242
           SS +A IT +                 L+NS+R  S++   S F   S+S R       L
Sbjct: 3   SSPMAAITGFSALSIPISSPPSLPASRLLNSTRCFSRFSNLSPFPAFSNSPRRKIR---L 59

Query: 241 AANYSKASDAAVVEENRATDEMEVSGEEIRLYSFTXXXXXXXXXXXXAETVNSLFGPFVE 62
               S   D     E+RA +E++    E  + S +            A TV+S+ GPF E
Sbjct: 60  ITACSSTGDRGEEVESRAENEIK----ETLMLSVSPLPLLLVATLPGAATVSSVIGPFAE 115

Query: 61  LVKSWNLPEWLVHWGHPGNM 2
           +VKS NLP+WLVHWGHPGNM
Sbjct: 116 IVKSLNLPDWLVHWGHPGNM 135


>ref|XP_002881477.1| hypothetical protein ARALYDRAFT_482670 [Arabidopsis lyrata subsp.
           lyrata] gi|297327316|gb|EFH57736.1| hypothetical protein
           ARALYDRAFT_482670 [Arabidopsis lyrata subsp. lyrata]
          Length = 258

 Score = 68.2 bits (165), Expect = 2e-09
 Identities = 44/112 (39%), Positives = 58/112 (51%)
 Frame = -3

Query: 337 INSSRFPSKYVKYSLFGTSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEE 158
           +NS++  S++   S F   S S+R    P   A +  +  D +V  E R  DE E+    
Sbjct: 26  LNSTQCLSRFSNVSPFPALSTSRRRKI-PLTPACSSIRNGDESV--EARGDDENEIKETL 82

Query: 157 IRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
           +   S               ETV S+FGP VE+VKS NLP+WLVHWGHPGNM
Sbjct: 83  MLSVSPLPLLLVASLPGGNNETVTSVFGPVVEIVKSLNLPDWLVHWGHPGNM 134


>gb|EMJ05917.1| hypothetical protein PRUPE_ppa010283mg [Prunus persica]
          Length = 256

 Score = 67.4 bits (163), Expect = 3e-09
 Identities = 43/96 (44%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
 Frame = -3

Query: 283 SSHSQRGFFAPPLLAANYSKASDAAVVEENRATDE--MEVSGEEIRLYSFTXXXXXXXXX 110
           SS ++R     PL+  N ++ S   VV+E     E  ME    E  LYS +         
Sbjct: 38  SSANKRRALPCPLVIKN-TRISSVDVVDEADQQKELGMEKKPNETLLYSLSPLPLLLVAA 96

Query: 109 XXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
              A  V SLF PFVELVKS+ LP WLVHWGHPGNM
Sbjct: 97  LPGAGAVTSLFEPFVELVKSFGLPGWLVHWGHPGNM 132


>ref|NP_565853.1| uncharacterized protein [Arabidopsis thaliana]
           gi|20197943|gb|AAM15322.1| Expressed protein
           [Arabidopsis thaliana] gi|21555809|gb|AAM63938.1|
           unknown [Arabidopsis thaliana]
           gi|26983860|gb|AAN86182.1| unknown protein [Arabidopsis
           thaliana] gi|330254219|gb|AEC09313.1| uncharacterized
           protein AT2G36885 [Arabidopsis thaliana]
          Length = 256

 Score = 67.4 bits (163), Expect = 3e-09
 Identities = 46/112 (41%), Positives = 58/112 (51%)
 Frame = -3

Query: 337 INSSRFPSKYVKYSLFGTSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEE 158
           +NS++  S++   S F   S  +R     PL  A  S   D     E R  DE E+   E
Sbjct: 26  LNSTQCLSRFSNVSSFPALSTFRRRKI--PLTPA-CSSIVDGDEEIEARGDDENEI--RE 80

Query: 157 IRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
             + S +            AETV S+FGP VE+VKS NLP+WLVHWGHPGNM
Sbjct: 81  TLMLSVSPLPLLLVASLPGAETVRSVFGPVVEIVKSLNLPDWLVHWGHPGNM 132


>ref|NP_973615.2| uncharacterized protein [Arabidopsis thaliana]
           gi|330254220|gb|AEC09314.1| uncharacterized protein
           AT2G36885 [Arabidopsis thaliana]
          Length = 255

 Score = 67.4 bits (163), Expect = 3e-09
 Identities = 46/112 (41%), Positives = 58/112 (51%)
 Frame = -3

Query: 337 INSSRFPSKYVKYSLFGTSSHSQRGFFAPPLLAANYSKASDAAVVEENRATDEMEVSGEE 158
           +NS++  S++   S F   S  +R     PL  A  S   D     E R  DE E+   E
Sbjct: 26  LNSTQCLSRFSNVSSFPALSTFRRRKI--PLTPA-CSSIVDGDEEIEARGDDENEI--RE 80

Query: 157 IRLYSFTXXXXXXXXXXXXAETVNSLFGPFVELVKSWNLPEWLVHWGHPGNM 2
             + S +            AETV S+FGP VE+VKS NLP+WLVHWGHPGNM
Sbjct: 81  TLMLSVSPLPLLLVASLPGAETVRSVFGPVVEIVKSLNLPDWLVHWGHPGNM 132


Top