BLASTX nr result

ID: Salvia21_contig00001924

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00001924
         (1615 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275132.2| PREDICTED: uncharacterized protein LOC100244...   299   1e-78
ref|NP_001239834.1| uncharacterized protein LOC100810535 [Glycin...   260   8e-67
ref|XP_003534557.1| PREDICTED: uncharacterized protein LOC100808...   256   9e-66
emb|CBI40392.3| unnamed protein product [Vitis vinifera]              253   1e-64
ref|XP_002511877.1| hypothetical protein RCOM_1615820 [Ricinus c...   253   1e-64

>ref|XP_002275132.2| PREDICTED: uncharacterized protein LOC100244782 [Vitis vinifera]
          Length = 319

 Score =  299 bits (766), Expect = 1e-78
 Identities = 159/316 (50%), Positives = 200/316 (63%), Gaps = 32/316 (10%)
 Frame = +2

Query: 266  MADNEEAKETGPRGNEWEVVSLTASAYAAAPGPKPVDSSQDSVTKLDKNHEGETSNAMFM 445
            MADNEE +ET  RGNEWEVVSLTASAYAAAPGPK ++ S D  +   K +E ETS+AMF+
Sbjct: 1    MADNEEGEETTSRGNEWEVVSLTASAYAAAPGPKGIEMSDDGKSNTFKGNEAETSHAMFL 60

Query: 446  SSHFVFPPSQHENLPIEPEFNESDSEKGREDDVSQLGKDEGVKSDAKDEDNATIEGLMT- 622
            S HFVFPPSQHENLP+EP+  E  +E G ED + +   ++G  SD K+EDN +++ L   
Sbjct: 61   SGHFVFPPSQHENLPLEPDDTEIHNEHGSEDVIPESNVEKGGHSDGKNEDNWSVKDLTKG 120

Query: 623  EEFPGIQVLNE------------------KGXXXXXXXXXXXXXXXXXXXXXXXXXGTD- 745
            +EFPGIQ+ +E                  KG                         G   
Sbjct: 121  DEFPGIQLFDEGGKSLSDRGKGFEEGTALKGLNLVDKEPDIYSAAKFSSLHSEPTIGGST 180

Query: 746  ---------DSIEPLDGYVDPNLLNFQKPI---EGDKYDDADLPCEAWWKRRAFSVYAHA 889
                     D +EP +   D +    Q P    + D+YD ++LPCEAWWKRRA S Y HA
Sbjct: 181  TYDGNTVIPDLVEPPELGADLHADISQSPKSTKDDDRYDGSNLPCEAWWKRRAASFYGHA 240

Query: 890  KEANTVWSIFLAAAVMGLVIIGHQWQHERWQVLRLRWQFGINDERMSRLLAPIFRLKDVV 1069
            KEAN  WSIF+AAAVMGLVI+G +WQHERWQVL+L+WQFG+NDE+M R+L PI RLKDV+
Sbjct: 241  KEANAFWSIFIAAAVMGLVILGQRWQHERWQVLQLKWQFGVNDEKMGRMLGPIIRLKDVI 300

Query: 1070 VGGHRRGSLVRGSTSS 1117
            VGG+RRGS +RGS++S
Sbjct: 301  VGGNRRGSFIRGSSTS 316


>ref|NP_001239834.1| uncharacterized protein LOC100810535 [Glycine max]
            gi|255639957|gb|ACU20271.1| unknown [Glycine max]
          Length = 315

 Score =  260 bits (664), Expect = 8e-67
 Identities = 146/317 (46%), Positives = 187/317 (58%), Gaps = 32/317 (10%)
 Frame = +2

Query: 266  MADNEEAKETGPRGNEWEVVSLTASAYAAAPGPKPVDSSQDSVTKLDKNHEGETSNAMFM 445
            MA+NE+ ++   RGNEWEVVSLTAS YAAAPGP  V+   D    +    EGETSNA+FM
Sbjct: 1    MANNEDGRDKTTRGNEWEVVSLTASTYAAAPGPDEVEMKDDGNEDVYGQDEGETSNALFM 60

Query: 446  SSHFVFPPSQHENLPIEPEFNESDSEKGREDDVSQLGKDEGVKSDAKDEDNATIEGL-MT 622
            S HFVFPPSQHENLP+EP++ E   + G +D  S+   +E      KDE+N T+ GL ++
Sbjct: 61   SRHFVFPPSQHENLPVEPDYGEIHDDSGDKDVASEETPEEVTIPSGKDEENLTLPGLEVS 120

Query: 623  EEFPGIQV-------------------------LNEKGXXXXXXXXXXXXXXXXXXXXXX 727
            EEF G++                          L EKG                      
Sbjct: 121  EEFEGMRYFDEKINRLSVRGKQFEESTTLPAFGLTEKGESMYDPAKYTSFDSETAIGGIT 180

Query: 728  XXXG------TDDSIEPLDGYVDPNLLNFQKPIEGDKYDDADLPCEAWWKRRAFSVYAHA 889
                      T +S E     V P+L       + ++Y+ +DLPC AWWKRRA S+YAHA
Sbjct: 181  AYGESIVDPETTESAEQ-GSNVSPDLSLSNYSSKDNEYNSSDLPCGAWWKRRAASLYAHA 239

Query: 890  KEANTVWSIFLAAAVMGLVIIGHQWQHERWQVLRLRWQFGINDERMSRLLAPIFRLKDVV 1069
            KEAN  WS+F+AAAVMGLV++G +WQ ER   L+L+WQ  INDE  SR+LAPI+RLKDV+
Sbjct: 240  KEANAFWSVFIAAAVMGLVMLGQRWQQER--ALQLKWQISINDEARSRVLAPIYRLKDVI 297

Query: 1070 VGGHRRGSLVRGSTSSE 1120
            VGG+RRGSL+RGS+S E
Sbjct: 298  VGGNRRGSLIRGSSSGE 314


>ref|XP_003534557.1| PREDICTED: uncharacterized protein LOC100808551 isoform 1 [Glycine
            max] gi|356531991|ref|XP_003534558.1| PREDICTED:
            uncharacterized protein LOC100808551 isoform 2 [Glycine
            max]
          Length = 315

 Score =  256 bits (655), Expect = 9e-66
 Identities = 146/317 (46%), Positives = 188/317 (59%), Gaps = 32/317 (10%)
 Frame = +2

Query: 266  MADNEEAKETGPRGNEWEVVSLTASAYAAAPGPKPVDSSQDSVTKLDKNHEGETSNAMFM 445
            MADNE+  +   RGNEWEVVSLTAS YAAAPGP  V+   D    +    EGETS+A+FM
Sbjct: 1    MADNEDGGDKTSRGNEWEVVSLTASTYAAAPGPDEVEMKDDGKEDVYGQDEGETSHALFM 60

Query: 446  SSHFVFPPSQHENLPIEPEFNESDSEKGREDDVSQLGKDEGVKSDAKDEDNATIEGL-MT 622
            S HFVFPPSQHENLP+EP++ E   + G +D  S+   +E      KDE+N T+ GL + 
Sbjct: 61   SRHFVFPPSQHENLPVEPDYGEIHDDFGDKDVASEETPEEVTIPSGKDEENLTLPGLEVA 120

Query: 623  EEFPGIQVLN-------------EKGXXXXXXXXXXXXXXXXXXXXXXXXXGTDDSIEPL 763
            EEF G++  +             E+G                         G + +I  +
Sbjct: 121  EEFEGMRYFDEKINRLSVRGKQFEEGTTLPAFGLTEKGESMYDPAKYTSFEG-ETAIGGV 179

Query: 764  DGY----VDPNL-------------LNFQKPIEGD-KYDDADLPCEAWWKRRAFSVYAHA 889
              Y    VDP               L+  K +  D +Y+ +DLPC AWWKRRA S+YAHA
Sbjct: 180  TAYGESIVDPEATEMEYQGSNVSPDLSLSKNLSKDNEYNTSDLPCGAWWKRRAASLYAHA 239

Query: 890  KEANTVWSIFLAAAVMGLVIIGHQWQHERWQVLRLRWQFGINDERMSRLLAPIFRLKDVV 1069
            KEAN  WS+F+AA VMGLV++G +WQHER   L+L+WQ  INDE  SR+LAPI+RLKDV+
Sbjct: 240  KEANAFWSVFIAATVMGLVMLGQRWQHER--ALQLKWQISINDEARSRVLAPIYRLKDVI 297

Query: 1070 VGGHRRGSLVRGSTSSE 1120
            VGG+RRGSL+R S+S E
Sbjct: 298  VGGNRRGSLIRRSSSGE 314


>emb|CBI40392.3| unnamed protein product [Vitis vinifera]
          Length = 311

 Score =  253 bits (646), Expect = 1e-64
 Identities = 140/289 (48%), Positives = 171/289 (59%), Gaps = 2/289 (0%)
 Frame = +2

Query: 257  LL*MADNEEAKETGPRGNEWEVVSLTASAYAAAPGPKPVDSSQDSVTKLDKNHEGETSNA 436
            LL MADNEE +ET  RGNEWEVVSLTASAYAAAPGPK ++ S D  +   K +E ETS+A
Sbjct: 63   LLKMADNEEGEETTSRGNEWEVVSLTASAYAAAPGPKGIEMSDDGKSNTFKGNEAETSHA 122

Query: 437  MFMSSHFVFPPSQHENLPIEPEFNESDSEKGREDDVSQLGKDEGVKSDAKDEDNATIEGL 616
            MF+S HFVFPPSQHENLP+EP+  E  +E G E              D   E N   EG 
Sbjct: 123  MFLSGHFVFPPSQHENLPLEPDDTEIHNEHGSE--------------DVIPESNVEKEGT 168

Query: 617  MTEEFPGIQVLNEKGXXXXXXXXXXXXXXXXXXXXXXXXXGT--DDSIEPLDGYVDPNLL 790
              +   G+ +++++                           T   D +EP          
Sbjct: 169  ALK---GLNLVDKEPDIYSAAKFSSLHSEPTIGGSTTYDGNTVIPDLVEP---------- 215

Query: 791  NFQKPIEGDKYDDADLPCEAWWKRRAFSVYAHAKEANTVWSIFLAAAVMGLVIIGHQWQH 970
                            P  AWWKRRA S Y HAKEAN  WSIF+AAAVMGLVI+G +WQH
Sbjct: 216  ----------------PELAWWKRRAASFYGHAKEANAFWSIFIAAAVMGLVILGQRWQH 259

Query: 971  ERWQVLRLRWQFGINDERMSRLLAPIFRLKDVVVGGHRRGSLVRGSTSS 1117
            ERWQVL+L+WQFG+NDE+M R+L PI RLKDV+VGG+RRGS +RGS++S
Sbjct: 260  ERWQVLQLKWQFGVNDEKMGRMLGPIIRLKDVIVGGNRRGSFIRGSSTS 308


>ref|XP_002511877.1| hypothetical protein RCOM_1615820 [Ricinus communis]
            gi|223549057|gb|EEF50546.1| hypothetical protein
            RCOM_1615820 [Ricinus communis]
          Length = 312

 Score =  253 bits (646), Expect = 1e-64
 Identities = 145/311 (46%), Positives = 185/311 (59%), Gaps = 26/311 (8%)
 Frame = +2

Query: 266  MADNEEA-KETGPRGNEWEVVSLTASAYAAAPGPKPVDSSQDSVTKLDKNHEGETSNA-M 439
            MADNEE  +E   RGNEWEVVSLTAS Y AAPGPK V+   +         E E+S A +
Sbjct: 2    MADNEEGVEENTSRGNEWEVVSLTASTYDAAPGPKEVELKDEENKDKVYGDEAESSRASL 61

Query: 440  FMSSHFVFPPSQHENLPIEPEFNESDSEKGREDDVSQLGKDEGVKSDAKDEDNATIEGL- 616
            F S HFVFPPSQHENLP+EP+ +E  +E+  ++ VS+LG +EG K   KDE+N   +GL 
Sbjct: 62   FFSRHFVFPPSQHENLPLEPDNSEILNEEVGKNVVSELGVEEGDKFGRKDEENQPFKGLH 121

Query: 617  MTEEFPGIQVLNEKGXXXXXXXXXXXXXXXXXXXXXXXXXGTD-----------DSIEPL 763
            ++EE PG+Q  + K                           TD           D  +  
Sbjct: 122  VSEEIPGLQFSDGKAISGSEFEESTTLQELGLIEKEQSIYNTDAFNPFHSETEHDGSDTY 181

Query: 764  DGYVDPNLLNFQKPIEGD------------KYDDADLPCEAWWKRRAFSVYAHAKEANTV 907
               +  ++ N Q     D            KYD ++LPCEAWWKRRA S+Y+HAKE N +
Sbjct: 182  GESLGISIANEQSEQGSDFSTDISHSPKAVKYDGSNLPCEAWWKRRAASLYSHAKETNAL 241

Query: 908  WSIFLAAAVMGLVIIGHQWQHERWQVLRLRWQFGINDERMSRLLAPIFRLKDVVVGGHRR 1087
            WSIF+AAAVMGLVIIG +WQ ERW+ L+L+WQ  IN E+  R+L PI RLKDV+VGGHRR
Sbjct: 242  WSIFVAAAVMGLVIIGQRWQQERWRALQLKWQANIN-EKTGRILGPISRLKDVIVGGHRR 300

Query: 1088 GSLVRGSTSSE 1120
            G+ +RG +SSE
Sbjct: 301  GTFIRGGSSSE 311

