BLASTX nr result

ID: Dioscorea21_contig00006442 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00006442
         (1107 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002275132.2| PREDICTED: uncharacterized protein LOC100244...   263   4e-68
ref|XP_002511877.1| hypothetical protein RCOM_1615820 [Ricinus c...   254   2e-65
ref|XP_002320804.1| predicted protein [Populus trichocarpa] gi|2...   238   2e-60
gb|AFK46799.1| unknown [Lotus japonicus]                              236   8e-60
ref|NP_001239834.1| uncharacterized protein LOC100810535 [Glycin...   226   6e-57

>ref|XP_002275132.2| PREDICTED: uncharacterized protein LOC100244782 [Vitis vinifera]
          Length = 319

 Score =  263 bits (673), Expect = 4e-68
 Identities = 147/315 (46%), Positives = 198/315 (62%), Gaps = 15/315 (4%)
 Frame = +3

Query: 150  MADDEKGGEGNASRGADWEVVSLTASTYAAAPGPEGFQSPDESRDLDFDKNEHQRADPLF 329
            MAD+E+G E   SRG +WEVVSLTAS YAAAPGP+G +  D+ +   F  NE + +  +F
Sbjct: 1    MADNEEGEE-TTSRGNEWEVVSLTASAYAAAPGPKGIEMSDDGKSNTFKGNEAETSHAMF 59

Query: 330  MSGHFVFPPSEHENLPIENYGDEIQDEPSGEDI---ASVLEEGENHGKGADE-NLKFKTD 497
            +SGHFVFPPS+HENLP+E    EI +E   ED+   ++V + G + GK  D  ++K  T 
Sbjct: 60   LSGHFVFPPSQHENLPLEPDDTEIHNEHGSEDVIPESNVEKGGHSDGKNEDNWSVKDLTK 119

Query: 498  -ESLQGIQMFDSG-EQVSVHHMEFGDSKSLRGLSFVGKEDIMYSSSAFGALHSETEISMS 671
             +   GIQ+FD G + +S     F +  +L+GL+ V KE  +YS++ F +LHSE  I  S
Sbjct: 120  GDEFPGIQLFDEGGKSLSDRGKGFEEGTALKGLNLVDKEPDIYSAAKFSSLHSEPTIGGS 179

Query: 672  DPCDESA---DTVESNDPSLD--ADALKLDKQNKNN----GSGLPCEAWWKRHAASWYNH 824
               D +    D VE  +   D  AD  +  K  K++    GS LPCEAWWKR AAS+Y H
Sbjct: 180  TTYDGNTVIPDLVEPPELGADLHADISQSPKSTKDDDRYDGSNLPCEAWWKRRAASFYGH 239

Query: 825  AKEGNTFWSVFVAAALMGLVILGKRWHRENLQLQQLKWQFNINAERMKWMTAPINRFKDI 1004
            AKE N FWS+F+AAA+MGLVILG+RW  E  Q+ QLKWQF +N E+M  M  PI R KD+
Sbjct: 240  AKEANAFWSIFIAAAVMGLVILGQRWQHERWQVLQLKWQFGVNDEKMGRMLGPIIRLKDV 299

Query: 1005 LVGSHHRSLLVRAEA 1049
            +VG + R   +R  +
Sbjct: 300  IVGGNRRGSFIRGSS 314


>ref|XP_002511877.1| hypothetical protein RCOM_1615820 [Ricinus communis]
            gi|223549057|gb|EEF50546.1| hypothetical protein
            RCOM_1615820 [Ricinus communis]
          Length = 312

 Score =  254 bits (650), Expect = 2e-65
 Identities = 148/309 (47%), Positives = 193/309 (62%), Gaps = 12/309 (3%)
 Frame = +3

Query: 150  MADDEKGGEGNASRGADWEVVSLTASTYAAAPGPEGFQSPDE-SRDLDF-DKNEHQRADP 323
            MAD+E+G E N SRG +WEVVSLTASTY AAPGP+  +  DE ++D  + D+ E  RA  
Sbjct: 2    MADNEEGVEENTSRGNEWEVVSLTASTYDAAPGPKEVELKDEENKDKVYGDEAESSRAS- 60

Query: 324  LFMSGHFVFPPSEHENLPIENYGDEIQDEPSGEDIASVL--EEGENHGKGADENLKFK-- 491
            LF S HFVFPPS+HENLP+E    EI +E  G+++ S L  EEG+  G+  +EN  FK  
Sbjct: 61   LFFSRHFVFPPSQHENLPLEPDNSEILNEEVGKNVVSELGVEEGDKFGRKDEENQPFKGL 120

Query: 492  -TDESLQGIQMFDSGEQVSVHHMEFGDSKSLRGLSFVGKEDIMYSSSAFGALHSETEISM 668
               E + G+Q F  G+ +S    EF +S +L+ L  + KE  +Y++ AF   HSETE   
Sbjct: 121  HVSEEIPGLQ-FSDGKAIS--GSEFEESTTLQELGLIEKEQSIYNTDAFNPFHSETEHDG 177

Query: 669  SDPCDESADTVESNDPSLDADALKLD-----KQNKNNGSGLPCEAWWKRHAASWYNHAKE 833
            SD   ES     +N+ S        D     K  K +GS LPCEAWWKR AAS Y+HAKE
Sbjct: 178  SDTYGESLGISIANEQSEQGSDFSTDISHSPKAVKYDGSNLPCEAWWKRRAASLYSHAKE 237

Query: 834  GNTFWSVFVAAALMGLVILGKRWHRENLQLQQLKWQFNINAERMKWMTAPINRFKDILVG 1013
             N  WS+FVAAA+MGLVI+G+RW +E  +  QLKWQ NIN E+   +  PI+R KD++VG
Sbjct: 238  TNALWSIFVAAAVMGLVIIGQRWQQERWRALQLKWQANIN-EKTGRILGPISRLKDVIVG 296

Query: 1014 SHHRSLLVR 1040
             H R   +R
Sbjct: 297  GHRRGTFIR 305


>ref|XP_002320804.1| predicted protein [Populus trichocarpa] gi|222861577|gb|EEE99119.1|
            predicted protein [Populus trichocarpa]
          Length = 306

 Score =  238 bits (607), Expect = 2e-60
 Identities = 137/302 (45%), Positives = 180/302 (59%), Gaps = 9/302 (2%)
 Frame = +3

Query: 171  GEGNASRGADWEVVSLTASTYAAAPGPEGFQSPDESRDLDFDKNEHQRADPLFMSGHFVF 350
            GE N  RG DWEVVSLTASTYAAAPGP+ F   D+     ++++E + +  LFMS HFVF
Sbjct: 7    GEENPPRGNDWEVVSLTASTYAAAPGPKEFDQKDDDNSKVYEEDEAESSHALFMSRHFVF 66

Query: 351  PPSEHENLPIENYGDEIQDEPSGEDIASVL---EEGENHGKGADENLKFK---TDESLQG 512
            PPS+HENLP+E+   EI D   G+++A  L   E G + GK  +E   FK     E   G
Sbjct: 67   PPSQHENLPLEHVNSEILDSHVGKNVALELGPEEGGRSSGKN-EEIWPFKGLEESEEYPG 125

Query: 513  IQMFDSGEQVSVHHMEFGDSKSLRGLSFVGKEDIMYSSSAFGALHSETEISMSDPCDESA 692
            IQ+FD   +      EF +S +L+   F  KE  +YS++A  + H+ETE+S S    E+ 
Sbjct: 126  IQLFDEKGKKG---QEFEESTTLQ--DFSDKEQSIYSTAALTSFHNETELSGSTTYGENL 180

Query: 693  DTVESNDPS---LDADALKLDKQNKNNGSGLPCEAWWKRHAASWYNHAKEGNTFWSVFVA 863
               E N+ S   LD  A+          + LP  AWWKR AAS Y HAKE NTFWS+FV 
Sbjct: 181  GIPEVNESSERGLDFPAVVPFSPKAAKDADLPSNAWWKRRAASLYAHAKEANTFWSIFVT 240

Query: 864  AALMGLVILGKRWHRENLQLQQLKWQFNINAERMKWMTAPINRFKDILVGSHHRSLLVRA 1043
            AA+MG+VILG+RW +E  Q  QLKWQ +IN ER   +  PI R KD++VG + R   +R 
Sbjct: 241  AAVMGIVILGQRWQQERWQALQLKWQASINNERSGSVLRPITRLKDVIVGGNRRGSFIRG 300

Query: 1044 EA 1049
             +
Sbjct: 301  SS 302


>gb|AFK46799.1| unknown [Lotus japonicus]
          Length = 309

 Score =  236 bits (602), Expect = 8e-60
 Identities = 133/308 (43%), Positives = 180/308 (58%), Gaps = 8/308 (2%)
 Frame = +3

Query: 150  MADDEKGGEGNASRGADWEVVSLTASTYAAAPGPEGFQSPDESRDLDFDKNEHQRADPLF 329
            MAD+E+GG G A RG DWEVVSLTASTYAAAPGP   +  D  ++  + ++E + +  LF
Sbjct: 1    MADNEEGG-GKAPRGNDWEVVSLTASTYAAAPGPTEVELKDGDKEDVYAQDEAETSRALF 59

Query: 330  MSGHFVFPPSEHENLPIENYGDEIQDEPSGEDIAS--VLEEGENHGKGADENLK---FKT 494
            MSGHFVFPPS+HENLP++    EI DE   EDI+S    E+        +ENL       
Sbjct: 60   MSGHFVFPPSQHENLPVQPDCSEIHDESRDEDISSEETREKATRRSGKDEENLTLAGLNV 119

Query: 495  DESLQGIQMFDSG-EQVSVHHMEFGDSKSLRGLSFVGKE--DIMYSSSAFGALHSETEIS 665
             E  +GIQ FD    ++SVH  +F +  +L G   VGK   + +Y  + +   HSET I 
Sbjct: 120  SEDFEGIQYFDEKINRLSVHGKQFEEGTTLPGYVLVGKRKGESIYDPAKYTNFHSETAIG 179

Query: 666  MSDPCDESADTVESNDPSLDADALKLDKQNKNNGSGLPCEAWWKRHAASWYNHAKEGNTF 845
                 D+S    E+ +             +K + S LPC AWWKR AAS Y HAKE NT 
Sbjct: 180  DVTSYDQSIVESETTESEEQLPVNPFRDDDKYDTSDLPCGAWWKRTAASLYTHAKEANTV 239

Query: 846  WSVFVAAALMGLVILGKRWHRENLQLQQLKWQFNINAERMKWMTAPINRFKDILVGSHHR 1025
            WS+F+AA +MG+V+LG RW ++  +  QLKWQ ++N E    +  PI+R KD++VG H R
Sbjct: 240  WSIFIAATVMGIVMLGHRWQQQ--RALQLKWQVSVNDEVRSKVLGPISRLKDVIVGGHRR 297

Query: 1026 SLLVRAEA 1049
              L+R  +
Sbjct: 298  GSLIRGSS 305


>ref|NP_001239834.1| uncharacterized protein LOC100810535 [Glycine max]
            gi|255639957|gb|ACU20271.1| unknown [Glycine max]
          Length = 315

 Score =  226 bits (577), Expect = 6e-57
 Identities = 134/315 (42%), Positives = 185/315 (58%), Gaps = 15/315 (4%)
 Frame = +3

Query: 150  MADDEKGGEGNASRGADWEVVSLTASTYAAAPGPEGFQSPDESRDLDFDKNEHQRADPLF 329
            MA++E G +   +RG +WEVVSLTASTYAAAPGP+  +  D+  +  + ++E + ++ LF
Sbjct: 1    MANNEDGRD-KTTRGNEWEVVSLTASTYAAAPGPDEVEMKDDGNEDVYGQDEGETSNALF 59

Query: 330  MSGHFVFPPSEHENLPIENYGDEIQDEPSGEDIAS--VLEEGENHGKGADENLK---FKT 494
            MS HFVFPPS+HENLP+E    EI D+   +D+AS    EE        +ENL     + 
Sbjct: 60   MSRHFVFPPSQHENLPVEPDYGEIHDDSGDKDVASEETPEEVTIPSGKDEENLTLPGLEV 119

Query: 495  DESLQGIQMFDSG-EQVSVHHMEFGDSKSLRGLSFVGKEDIMYSSSAFGALHSETEI--- 662
             E  +G++ FD    ++SV   +F +S +L       K + MY  + + +  SET I   
Sbjct: 120  SEEFEGMRYFDEKINRLSVRGKQFEESTTLPAFGLTEKGESMYDPAKYTSFDSETAIGGI 179

Query: 663  -----SMSDP-CDESADTVESNDPSLDADALKLDKQNKNNGSGLPCEAWWKRHAASWYNH 824
                 S+ DP   ESA+   +  P L        K N+ N S LPC AWWKR AAS Y H
Sbjct: 180  TAYGESIVDPETTESAEQGSNVSPDLSLSNYS-SKDNEYNSSDLPCGAWWKRRAASLYAH 238

Query: 825  AKEGNTFWSVFVAAALMGLVILGKRWHRENLQLQQLKWQFNINAERMKWMTAPINRFKDI 1004
            AKE N FWSVF+AAA+MGLV+LG+RW +E  +  QLKWQ +IN E    + API R KD+
Sbjct: 239  AKEANAFWSVFIAAAVMGLVMLGQRWQQE--RALQLKWQISINDEARSRVLAPIYRLKDV 296

Query: 1005 LVGSHHRSLLVRAEA 1049
            +VG + R  L+R  +
Sbjct: 297  IVGGNRRGSLIRGSS 311


Top