BLASTX nr result

ID: Dioscorea21_contig00005575 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005575
         (1145 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN80099.1| hypothetical protein VITISV_002699 [Vitis vinifera]   161   3e-37
ref|XP_002330533.1| predicted protein [Populus trichocarpa] gi|2...   142   2e-31
ref|XP_002300593.1| predicted protein [Populus trichocarpa] gi|2...   135   2e-29
ref|XP_002525604.1| conserved hypothetical protein [Ricinus comm...   132   2e-28
ref|XP_003523853.1| PREDICTED: uncharacterized protein At4g26450...   109   1e-21

>emb|CAN80099.1| hypothetical protein VITISV_002699 [Vitis vinifera]
          Length = 718

 Score =  161 bits (407), Expect = 3e-37
 Identities = 139/414 (33%), Positives = 191/414 (46%), Gaps = 68/414 (16%)
 Frame = +1

Query: 28   RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRS-------FQEPKDREXXXXXXXSA 186
            RK DI +EAGRLA EYL+  GLLPP++LP +WQ+ S       FQ+ ++ +       SA
Sbjct: 64   RKGDIFMEAGRLATEYLISTGLLPPSALPVKWQNGSLKKQVGDFQDGENLQLPAEGRTSA 123

Query: 187  FSRLGSLQPDH-HGRRRFDDEYNXXXXXXXXXXXXYN---RSYSSDWGRENGRNGQWTER 354
             +RLG+   D   GRRRF DEYN                 RSY SDW   NGR+G W + 
Sbjct: 124  LARLGNAVSDSGSGRRRFSDEYNPTGSRNHTRGRRRMGSFRSYGSDW---NGRSGSWDK- 179

Query: 355  SGRHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSE---SGSEIENHEVV 525
              R     EGD+D   GY  E+  G  DVGS V ++   E    S+     SE E     
Sbjct: 180  -ARASPDTEGDEDSTCGYPEEQLVG-KDVGSGVQKSRPSELPPISDDVGDDSEAEPENTR 237

Query: 526  DDTGSKVSSSSTRKEQLAEVDVEMSKNKELDD----DVKADNLENGDVN--IDAEKKIQQ 687
            DD  SK  SS   +E   E D E+  NK  DD    DV    +++G  N   D  +K   
Sbjct: 238  DDMSSKAGSSRVGREIPLETDGEL--NKRPDDSRVLDVAPGEMKDGTSNDSDDETEKQTA 295

Query: 688  EEDVNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSL------------TQRNDKH 831
             ED+ +  D+A ++D A K  ++LL+LC+FAKVPT+ RSSL            T++ +  
Sbjct: 296  SEDLTIQ-DSAVDDDIAGKSGTDLLRLCNFAKVPTKTRSSLMYKALKIDLTPTTEKGNTC 354

Query: 832  DDKPMEGSSSNSHADQ-DENLEGEARNATS-----------VLTDQPMEEAVDV------ 957
            D  P+ GSS +S  +  +E+L G   + T             LT + +E+A ++      
Sbjct: 355  DIGPLRGSSYSSEDNPVEESLGGALSDQTQKSQCLNLDVSRALTVESLEDAEELDSKHVV 414

Query: 958  ------------------HNEANDDPPGFETFSSAIVEEEDASFEQHIQKNDGI 1065
                                EA+  PPGF   SS I E  +    Q     +GI
Sbjct: 415  EQGKCVRSQSFPERAFMYEQEASQGPPGFGRCSSMIKERGEKRAGQQSDTMEGI 468


>ref|XP_002330533.1| predicted protein [Populus trichocarpa] gi|222872091|gb|EEF09222.1|
            predicted protein [Populus trichocarpa]
          Length = 725

 Score =  142 bits (358), Expect = 2e-31
 Identities = 117/356 (32%), Positives = 179/356 (50%), Gaps = 15/356 (4%)
 Frame = +1

Query: 28   RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRSFQEP--------KDREXXXXXXXS 183
            +K  IL+EAGRLAAEYLV KGLLP ++L  +WQ+ SF+          +  +       S
Sbjct: 66   QKGYILMEAGRLAAEYLVSKGLLPQSALSGKWQNGSFKRQAGDYQDFRQQEDLMQEGRTS 125

Query: 184  AFSRLGSLQPDHH-GRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTERSG 360
            A SRLGS   D   GRRR+ D++N            + R YSS+WGRE GR+G  ++R+ 
Sbjct: 126  AHSRLGSGASDAGLGRRRYPDDFNLRNHVKGRRRGEHYRGYSSEWGREYGRSGSLSDRNR 185

Query: 361  RHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIEN----HEVVD 528
               D  E  +D   G+  E++   +DVG  + ++ +      SE  ++IE+    +   +
Sbjct: 186  MSPDTEE--NDTVSGHCEEQQVS-NDVGDGMEKSGQSGVAPESEETADIESGLSKYNYPN 242

Query: 529  DTGSKVSSSSTRKEQLAEVDVEMSKNKELDDDVKADNLENGDVNIDAE-KKIQQEEDVNL 705
            +TGSK SSSS  KE   E D E SK      +V   N +  D N D E +K    ED+ +
Sbjct: 243  ETGSKASSSSVLKE---ETDGEPSKGSGDPANVNLGNKDMKDGNYDYEIEKQIVPEDLPI 299

Query: 706  HADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHADQDE 885
                  ++D + K  S+LL L  FA VPT+ RS+L+ R+ + D  P     +N   D  +
Sbjct: 300  Q-----QSDLSGKDESDLLTLSKFANVPTKMRSALSCRSSRVDQVP-----NNEEEDTSD 349

Query: 886  NLEGEARNATSVLTD-QPMEEAVDVHNEANDDPPGFETFSSAIVEEEDASFEQHIQ 1050
            N  G  + +  V+ D      A DV+   + + P  E    A+V+  + + E+ ++
Sbjct: 350  N--GLNKGSEDVVQDGVDNVSATDVNATHDSNCPNSEIIKVAVVQPAEDADEEGLE 403


>ref|XP_002300593.1| predicted protein [Populus trichocarpa] gi|222847851|gb|EEE85398.1|
            predicted protein [Populus trichocarpa]
          Length = 732

 Score =  135 bits (340), Expect = 2e-29
 Identities = 109/327 (33%), Positives = 158/327 (48%), Gaps = 22/327 (6%)
 Frame = +1

Query: 28   RKTDILLEAGRLAAEYLVFKGLLPPNSLPARWQDRSFQEP--------KDREXXXXXXXS 183
            +K D+L+EAGRLAAEYLV KGLLP ++L  +WQ+  F+          +  +       S
Sbjct: 69   QKGDVLMEAGRLAAEYLVSKGLLPQSALSGKWQNGGFKMQAGDYQDFRQQEDLMHEGRTS 128

Query: 184  AFSRLGSLQPDHH-GRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTERSG 360
            A SRLGS   D    RRR+ D++N            + R YS++WGRE GR+G  ++R+ 
Sbjct: 129  AHSRLGSGASDTGLSRRRYSDDFNSRNHVKGRRRGEHYRGYSAEWGREYGRSGPLSDRN- 187

Query: 361  RHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTS------RDERNSRSESGSEIENHEV 522
            R     EG  D    +  E++   +DVG  + ++         E  +  ESG    NH  
Sbjct: 188  RVSPDMEGQSDTVSEHYEEQQVS-NDVGDGMEKSGLSGVAPESEETADIESGLSKYNHP- 245

Query: 523  VDDTGSKVSSSSTRKEQLAEVDVEMSKNKELDDDVKADNLENGDVNIDAEKKIQ-QEEDV 699
             D+TGSK SSSS  KE   E   E SK      ++   N E  D N D E + Q   ED+
Sbjct: 246  -DETGSKASSSSVPKE---ETGGEPSKGSGDPANLNLGNGEVKDSNYDYETEKQIVPEDL 301

Query: 700  NLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHADQ 879
             +   +A E D + +  S+LL L  FA VPT+ RS+L+ R+ + D  P       S    
Sbjct: 302  PIQ-QSAVEGDISGRNGSDLLTLSKFANVPTKTRSALSCRSSRVDQVPNNEDDGTSGIGL 360

Query: 880  DENLEGEAR------NATSVLTDQPME 942
            ++  E   +      +A  VL + P +
Sbjct: 361  NKGSEDSVQDGMYNVSAADVLANAPRD 387


>ref|XP_002525604.1| conserved hypothetical protein [Ricinus communis]
            gi|223535040|gb|EEF36722.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 724

 Score =  132 bits (331), Expect = 2e-28
 Identities = 111/357 (31%), Positives = 175/357 (49%), Gaps = 18/357 (5%)
 Frame = +1

Query: 28   RKTDILLEAGRLAAEYLVFKGLLPPNSLPA--RWQDRSFQEP-----KDREXXXXXXXSA 186
            RK DIL+EAGRLAAEYLV KGLLP ++L A  +WQ+ + ++         E       SA
Sbjct: 69   RKGDILMEAGRLAAEYLVSKGLLPQSALSASGKWQNGNSKKQVGDDYLQEELNQDGRTSA 128

Query: 187  FSRLGSLQPDHH-GRRRFDDEYN-XXXXXXXXXXXXYNRSYSSDW-GRENGRNGQWTERS 357
             SRLGS   D   G+RR+ D++N             YNRSY+S+W GRE GR+G W +R+
Sbjct: 129  HSRLGSGASDSGIGKRRYSDDFNLRNHVKGRRRGEYYNRSYNSEWGGREYGRSGSWLDRN 188

Query: 358  GRHWDGPEGDDDFAPGYQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIEN---HEVVD 528
             R     EG+ D   G+  E++ G +DV   + ++        +E  +++E+   +   D
Sbjct: 189  -RVSPDMEGEGDTISGHYDEQQAG-EDVSEGLQKSGLSGSVPDNEEAADMESFAEYTNSD 246

Query: 529  DTGSKVSSSSTRKEQL----AEVDVEMSKNKELDDDVKADNLENGDVNIDAEKKIQQEED 696
            + GSK SS  T K++      +V  +++      +++K +N ++     + EK+I  E+ 
Sbjct: 247  EMGSKASSLHTGKDEAVGEPGQVSDDLTNLNSGSEEMKDNNFKH-----ETEKQIAPEDL 301

Query: 697  VNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDKHDDKPMEGSSSNSHAD 876
                   + E D   K  S+LL  C+FAKVPT+ RS+LT +  K D  P        +A+
Sbjct: 302  ATQQC--SVEGDILDKHESDLLTFCNFAKVPTKIRSALTYKVPKVDQVP--------NAE 351

Query: 877  QDENLEGEARNATSVLTDQPMEEA-VDVHNEANDDPPGFETFSSAIVEEEDASFEQH 1044
            +    +G  + +   + D  ++ A  D     ND        S ++   ED     H
Sbjct: 352  ERNVSDGAQKGSEITVQDGTLDFATADQLPNTNDLKSADPEISISVQSAEDVGESGH 408


>ref|XP_003523853.1| PREDICTED: uncharacterized protein At4g26450-like [Glycine max]
          Length = 699

 Score =  109 bits (273), Expect = 1e-21
 Identities = 91/287 (31%), Positives = 142/287 (49%), Gaps = 22/287 (7%)
 Frame = +1

Query: 34  TDILLEAGRLAAEYLVFKGLLPPNSL-PARWQDRSFQEPKDREXXXXXXXSAFSRLGSLQ 210
           +DI +EAGRLAAEYLV +G LPPN+L P +WQ+        R+       SA +RLGS  
Sbjct: 36  SDIFVEAGRLAAEYLVSQGQLPPNALPPPKWQNHKTPAEGGRQ-------SALARLGSAD 88

Query: 211 PDHHGRRRFDDEYNXXXXXXXXXXXXYNRSYSSDWGRENGRNGQWTER-SGRHWDGPEG- 384
               GRR+    ++            ++RS   DWGRE  RNG W++R  G   D  +G 
Sbjct: 89  ----GRRKLGG-FDEFGQKGGRRRGSFSRSNGMDWGREYRRNGSWSDRFRGGAVDVRDGE 143

Query: 385 DDDFAPG--------------YQRERRTGFDDVGSSVSRTSRDERNSRSESGSEIE---- 510
           DDD+  G              YQ+++     D  S  S ++ +E   RSE G ++     
Sbjct: 144 DDDYESGGFSVRHQDEEDQHQYQQQQHQNSVDDASMKSNSNLNEFAPRSEDGGDLNDKDG 203

Query: 511 NHEVVDDTGSKVSSSSTRKEQLAEVDVEMSKNKELDD-DVKADNLENGDVNIDAEKKIQQ 687
           + E V+     V  S   K+ +++VD+E+    +L+   V    +++G    D  ++++ 
Sbjct: 204 DKERVNAELMGVKQSGIGKD-VSDVDMEVGVGNDLESVSVGVKEVKDGSGGDDDSERLRN 262

Query: 688 EEDVNLHADNATENDSAVKLSSNLLKLCSFAKVPTRPRSSLTQRNDK 828
             D      +  EN S+  + ++L+ LC   KVPTR RSS+T++N K
Sbjct: 263 VSD----QWSDQENSSSGGVVADLVSLCKSVKVPTRTRSSVTRKNLK 305