BLASTX nr result

ID: Catharanthus22_contig00038103 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00038103
         (626 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006345200.1| PREDICTED: uncharacterized protein LOC102586...    87   3e-15
ref|XP_004488365.1| PREDICTED: uncharacterized protein LOC101512...    79   1e-12
ref|XP_006440878.1| hypothetical protein CICLE_v10024471mg [Citr...    77   3e-12
ref|XP_006494270.1| PREDICTED: oligopeptide transporter 5-like [...    74   3e-11
gb|EOY22249.1| Uncharacterized protein TCM_014473 [Theobroma cacao]    73   6e-11
ref|XP_002512002.1| conserved hypothetical protein [Ricinus comm...    72   1e-10
ref|XP_006600793.1| PREDICTED: uncharacterized protein LOC102668...    69   8e-10
gb|ESW27168.1| hypothetical protein PHAVU_003G179700g [Phaseolus...    69   1e-09
ref|XP_002887523.1| hypothetical protein ARALYDRAFT_476547 [Arab...    67   5e-09
ref|XP_006301400.1| hypothetical protein CARUB_v10021814mg [Caps...    65   2e-08
gb|EMJ11821.1| hypothetical protein PRUPE_ppa018620mg, partial [...    65   2e-08
gb|AAM63003.1| unknown [Arabidopsis thaliana]                          63   6e-08
ref|XP_006390472.1| hypothetical protein EUTSA_v10019432mg [Eutr...    63   8e-08
ref|NP_565078.1| uncharacterized protein [Arabidopsis thaliana] ...    62   2e-07
gb|ABK25855.1| unknown [Picea sitchensis] gi|148909738|gb|ABR179...    59   1e-06
gb|EXC28018.1| hypothetical protein L484_022251 [Morus notabilis]      58   3e-06
gb|AFG61998.1| hypothetical protein 0_10044_01, partial [Pinus t...    58   3e-06

>ref|XP_006345200.1| PREDICTED: uncharacterized protein LOC102586915 [Solanum tuberosum]
          Length = 107

 Score = 87.4 bits (215), Expect = 3e-15
 Identities = 54/105 (51%), Positives = 65/105 (61%), Gaps = 1/105 (0%)
 Frame = -1

Query: 479 VPVPGAAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGR-RLTPLESIKYRGC 303
           VP P  AV N+ WHSSGSIGPFFAVISVLT+L+I+SC++GR CR + R TPL SIK R C
Sbjct: 19  VPYP-EAVPNSAWHSSGSIGPFFAVISVLTILSILSCLVGRYCRNQERATPLHSIKQRDC 77

Query: 302 SSCGFGPFGWFRRKSCPGQVLGNNKXXXXVHDGQKENNIDVKAQD 168
           S      FG  R K C G            ++G+  NN + K QD
Sbjct: 78  S------FGGLRGKLCWG----------CTNNGESTNN-EGKGQD 105


>ref|XP_004488365.1| PREDICTED: uncharacterized protein LOC101512774 [Cicer arietinum]
          Length = 127

 Score = 79.0 bits (193), Expect = 1e-12
 Identities = 38/71 (53%), Positives = 49/71 (69%)
 Frame = -1

Query: 473 VPGAAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRRLTPLESIKYRGCSSC 294
           V  ++ S++ W SSGSI PFFAVI +LT+LA++SC L R+C  R LTPLESIK RGC   
Sbjct: 23  VSTSSSSSSAWQSSGSIAPFFAVIIILTILAVLSCYLSRMCNRRELTPLESIKGRGC--- 79

Query: 293 GFGPFGWFRRK 261
                GW +R+
Sbjct: 80  ----LGWLKRR 86


>ref|XP_006440878.1| hypothetical protein CICLE_v10024471mg [Citrus clementina]
           gi|557543140|gb|ESR54118.1| hypothetical protein
           CICLE_v10024471mg [Citrus clementina]
          Length = 129

 Score = 77.4 bits (189), Expect = 3e-12
 Identities = 47/99 (47%), Positives = 59/99 (59%), Gaps = 8/99 (8%)
 Frame = -1

Query: 449 NRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR-----LTPLESIKYRGCSSCGFG 285
           +R  SSGS+GPFFAV+S+L VLAIISC+LGRIC  RR      +PL+SIKYRGC      
Sbjct: 27  SRSASSGSVGPFFAVMSILLVLAIISCVLGRICSRRRGRIIVESPLDSIKYRGC------ 80

Query: 284 PFGWFRRKSCP---GQVLGNNKXXXXVHDGQKENNIDVK 177
             GW +R+  P   G+V    K    V D   ++N   K
Sbjct: 81  -LGWLKRRCRPCMDGEVEAGAKVMACVDDRNNDDNNKAK 118


>ref|XP_006494270.1| PREDICTED: oligopeptide transporter 5-like [Citrus sinensis]
          Length = 834

 Score = 73.9 bits (180), Expect = 3e-11
 Identities = 39/68 (57%), Positives = 48/68 (70%), Gaps = 5/68 (7%)
 Frame = -1

Query: 449 NRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR-----LTPLESIKYRGCSSCGFG 285
           +R  SSGS+GPFFAV+S+L VLAIISC+LGRIC  RR      +PL+SIKYRGC      
Sbjct: 732 SRSASSGSVGPFFAVMSILLVLAIISCVLGRICSRRRGRIIVESPLDSIKYRGC------ 785

Query: 284 PFGWFRRK 261
             GW +R+
Sbjct: 786 -LGWLKRR 792


>gb|EOY22249.1| Uncharacterized protein TCM_014473 [Theobroma cacao]
          Length = 120

 Score = 73.2 bits (178), Expect = 6e-11
 Identities = 38/73 (52%), Positives = 51/73 (69%), Gaps = 4/73 (5%)
 Frame = -1

Query: 467 GAAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR----LTPLESIKYRGCS 300
           GAA +++   S+GSIGPFFAVIS+LT LAI+SC++GRIC  RR    +TPL++IK+  C 
Sbjct: 24  GAAAASSSSSSAGSIGPFFAVISILTFLAIVSCVVGRICVRRRTAAPVTPLDTIKHGSC- 82

Query: 299 SCGFGPFGWFRRK 261
                  GW +RK
Sbjct: 83  ------LGWLKRK 89


>ref|XP_002512002.1| conserved hypothetical protein [Ricinus communis]
           gi|223549182|gb|EEF50671.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 125

 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 45/83 (54%), Positives = 52/83 (62%), Gaps = 11/83 (13%)
 Frame = -1

Query: 473 VPGAAVS----NNRWHSS-GSIGPFFAVISVLTVLAIISCILGRICRGRRLT------PL 327
           +P AAVS    N  WHSS GSIGPFF VISVLTVLAI+SCILGR+C  R         P+
Sbjct: 22  IPQAAVSSTGSNANWHSSSGSIGPFFGVISVLTVLAILSCILGRVCSRRAEAAVGGGGPV 81

Query: 326 ESIKYRGCSSCGFGPFGWFRRKS 258
            +IK+R         FGW +RKS
Sbjct: 82  GAIKHRDY-------FGWMKRKS 97


>ref|XP_006600793.1| PREDICTED: uncharacterized protein LOC102668422 [Glycine max]
          Length = 128

 Score = 69.3 bits (168), Expect = 8e-10
 Identities = 44/97 (45%), Positives = 54/97 (55%)
 Frame = -1

Query: 554 MATTAISSPFPXXXXXXXXXXXQEVVPVPGAAVSNNRWHSSGSIGPFFAVISVLTVLAII 375
           MAT A+SS               E V VP    S + W SSGS+GPFFAVI+VL +L+++
Sbjct: 1   MATPALSS----IVEPEQQHAEPEAVAVP----STSAWKSSGSVGPFFAVITVLIILSVL 52

Query: 374 SCILGRICRGRRLTPLESIKYRGCSSCGFGPFGWFRR 264
           SC LGR    R  TPLESI+ RG        FGW +R
Sbjct: 53  SCYLGRKWNRRPKTPLESIRGRGF-------FGWLKR 82


>gb|ESW27168.1| hypothetical protein PHAVU_003G179700g [Phaseolus vulgaris]
          Length = 126

 Score = 68.6 bits (166), Expect = 1e-09
 Identities = 34/67 (50%), Positives = 44/67 (65%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRRLTPLESIKYRGCSSCGFG 285
           +   ++ W SSGS+GPFFAV+SVL +LA++SC LGR    R  TPLESI+ RG       
Sbjct: 24  STTGSSTWKSSGSVGPFFAVMSVLVILAVVSCYLGRKWSRRPKTPLESIRGRGL------ 77

Query: 284 PFGWFRR 264
            FGW +R
Sbjct: 78  -FGWLKR 83


>ref|XP_002887523.1| hypothetical protein ARALYDRAFT_476547 [Arabidopsis lyrata subsp.
           lyrata] gi|297333364|gb|EFH63782.1| hypothetical protein
           ARALYDRAFT_476547 [Arabidopsis lyrata subsp. lyrata]
          Length = 144

 Score = 66.6 bits (161), Expect = 5e-09
 Identities = 42/76 (55%), Positives = 47/76 (61%), Gaps = 8/76 (10%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR--------LTPLESIKYR 309
           AA +    +SSGSIGPFFAVISVL VLA++SC LGRIC  RR        + PLE IK  
Sbjct: 39  AAPNAPNHYSSGSIGPFFAVISVLVVLAVLSCFLGRICARRRQRTVLVAEVNPLEMIK-- 96

Query: 308 GCSSCGFGPFGWFRRK 261
              S GF   GW RRK
Sbjct: 97  ---SGGF--LGWLRRK 107


>ref|XP_006301400.1| hypothetical protein CARUB_v10021814mg [Capsella rubella]
           gi|482570110|gb|EOA34298.1| hypothetical protein
           CARUB_v10021814mg [Capsella rubella]
          Length = 142

 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 39/76 (51%), Positives = 45/76 (59%), Gaps = 8/76 (10%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR--------LTPLESIKYR 309
           AA +    +SSGSIGPFFAVISVL VLA++SC LGRIC  RR        + PLE IK  
Sbjct: 37  AAPNAPNHYSSGSIGPFFAVISVLVVLAVLSCFLGRICARRRQSTVSVAEVNPLEMIKSG 96

Query: 308 GCSSCGFGPFGWFRRK 261
           G         GW RR+
Sbjct: 97  GI-------LGWLRRR 105


>gb|EMJ11821.1| hypothetical protein PRUPE_ppa018620mg, partial [Prunus persica]
          Length = 105

 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 39/73 (53%), Positives = 50/73 (68%), Gaps = 2/73 (2%)
 Frame = -1

Query: 473 VPGAAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGR-ICRGRR-LTPLESIKYRGCS 300
           VP ++ S++    SGSIGPFFAVISVLTVLA +SC+LGR + R +  L PLESI +   +
Sbjct: 30  VPLSSSSSSSTSGSGSIGPFFAVISVLTVLAFLSCVLGRKLSRDQTVLRPLESINHGHRT 89

Query: 299 SCGFGPFGWFRRK 261
           SC     GW +RK
Sbjct: 90  SCA----GWLKRK 98


>gb|AAM63003.1| unknown [Arabidopsis thaliana]
          Length = 144

 Score = 63.2 bits (152), Expect = 6e-08
 Identities = 47/112 (41%), Positives = 58/112 (51%), Gaps = 14/112 (12%)
 Frame = -1

Query: 554 MATTAISSP-FPXXXXXXXXXXXQEVVPVPG-----AAVSNNRWHSSGSIGPFFAVISVL 393
           MA+++ISS  FP            E+ P+       AA +    +SS SIGPFFAVISVL
Sbjct: 3   MASSSISSSLFPIQQQPQQQLGGNEITPMATNANLIAAPNAPNHYSSSSIGPFFAVISVL 62

Query: 392 TVLAIISCILGRIC-RGRRLT-------PLESIKYRGCSSCGFGPFGWFRRK 261
            +LA++SC LGR C R R+ T       PLE IK  G         GW RRK
Sbjct: 63  IILAVLSCFLGRFCARSRQRTGLVAEVKPLEMIKSGGL-------LGWLRRK 107


>ref|XP_006390472.1| hypothetical protein EUTSA_v10019432mg [Eutrema salsugineum]
           gi|557086906|gb|ESQ27758.1| hypothetical protein
           EUTSA_v10019432mg [Eutrema salsugineum]
          Length = 139

 Score = 62.8 bits (151), Expect = 8e-08
 Identities = 39/76 (51%), Positives = 44/76 (57%), Gaps = 8/76 (10%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRR--------LTPLESIKYR 309
           AA +    +SSGSIGPFFAVISVL VLA++SC LGRIC  RR          PLE IK  
Sbjct: 39  AAPNGPNHYSSGSIGPFFAVISVLVVLAVLSCFLGRICARRRQSTVLVAEANPLEMIKSG 98

Query: 308 GCSSCGFGPFGWFRRK 261
           G         G+ RRK
Sbjct: 99  GL-------LGYLRRK 107


>ref|NP_565078.1| uncharacterized protein [Arabidopsis thaliana]
           gi|62319486|dbj|BAD94874.1| hypothetical protein
           [Arabidopsis thaliana] gi|89001015|gb|ABD59097.1|
           At1g74055 [Arabidopsis thaliana]
           gi|332197422|gb|AEE35543.1| uncharacterized protein
           AT1G74055 [Arabidopsis thaliana]
          Length = 144

 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 44/110 (40%), Positives = 56/110 (50%), Gaps = 13/110 (11%)
 Frame = -1

Query: 551 ATTAISSPFPXXXXXXXXXXXQEVVPVPG-----AAVSNNRWHSSGSIGPFFAVISVLTV 387
           ++++ SS FP            E+ P+       AA +    +SS SIGPFFAVISVL +
Sbjct: 5   SSSSSSSLFPIQQQPQQQLGGNEITPMATNANLIAAPNAPNHYSSSSIGPFFAVISVLII 64

Query: 386 LAIISCILGRIC-RGRRLT-------PLESIKYRGCSSCGFGPFGWFRRK 261
           LA++SC LGR C R R+ T       PLE IK  G         GW RRK
Sbjct: 65  LAVLSCFLGRFCARSRQRTGLVAEVNPLEMIKSGGL-------LGWLRRK 107


>gb|ABK25855.1| unknown [Picea sitchensis] gi|148909738|gb|ABR17960.1| unknown
           [Picea sitchensis]
          Length = 119

 Score = 58.5 bits (140), Expect = 1e-06
 Identities = 30/64 (46%), Positives = 41/64 (64%), Gaps = 7/64 (10%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRRLTPLESIKY-------RG 306
           A  +N   HSSGS+GP  AV+SV+T+L +I+C+LGRIC GR  +   + KY       R 
Sbjct: 24  AVANNGVSHSSGSVGPVLAVLSVITILGVIACVLGRICAGRLFS--ANSKYDCVGWMERR 81

Query: 305 CSSC 294
           C+SC
Sbjct: 82  CASC 85


>gb|EXC28018.1| hypothetical protein L484_022251 [Morus notabilis]
          Length = 129

 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 37/73 (50%), Positives = 44/73 (60%), Gaps = 2/73 (2%)
 Frame = -1

Query: 473 VPGAAVSNNRWH-SSGSIGPFFAVISVLTVLAIISCILGRICRGRRL-TPLESIKYRGCS 300
           VPG+A   +    SSGSIGPFFAVISVLTVLA++SC  G     R+   PLE I+  G  
Sbjct: 25  VPGSASGGDSASASSGSIGPFFAVISVLTVLAVLSCYFGHKYNRRKAGAPLERIEIGG-- 82

Query: 299 SCGFGPFGWFRRK 261
               G  GW +RK
Sbjct: 83  ----GFLGWVKRK 91


>gb|AFG61998.1| hypothetical protein 0_10044_01, partial [Pinus taeda]
           gi|383159149|gb|AFG61999.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159151|gb|AFG62000.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159153|gb|AFG62001.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159155|gb|AFG62002.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159157|gb|AFG62003.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159159|gb|AFG62004.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159161|gb|AFG62005.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159163|gb|AFG62006.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159165|gb|AFG62007.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
           gi|383159167|gb|AFG62008.1| hypothetical protein
           0_10044_01, partial [Pinus taeda]
          Length = 107

 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 29/64 (45%), Positives = 41/64 (64%), Gaps = 7/64 (10%)
 Frame = -1

Query: 464 AAVSNNRWHSSGSIGPFFAVISVLTVLAIISCILGRICRGRRLTPLESIKY-------RG 306
           A  +N   HS+GS+GP  AV+SV+T+L +I+C+LGRIC GR  +   + KY       R 
Sbjct: 17  AVANNGGSHSNGSVGPVLAVLSVITILGVIACVLGRICAGRLFS--ANSKYDCVGWMERR 74

Query: 305 CSSC 294
           C+SC
Sbjct: 75  CASC 78


Top