BLASTX nr result

ID: Dioscorea21_contig00020934

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00020934
         (1455 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002330887.1| predicted protein [Populus trichocarpa] gi|2...   370   e-100
ref|XP_002269272.1| PREDICTED: box C/D snoRNA protein 1 [Vitis v...   352   1e-94
emb|CBI32194.3| unnamed protein product [Vitis vinifera]              350   5e-94
ref|XP_002512150.1| conserved hypothetical protein [Ricinus comm...   346   8e-93
ref|NP_001184910.1| HIT-type Zinc finger family protein [Arabido...   338   3e-90

>ref|XP_002330887.1| predicted protein [Populus trichocarpa] gi|222872709|gb|EEF09840.1|
            predicted protein [Populus trichocarpa]
          Length = 424

 Score =  370 bits (951), Expect = e-100
 Identities = 205/423 (48%), Positives = 279/423 (65%), Gaps = 3/423 (0%)
 Frame = +2

Query: 5    LCEECGLNPWKYRCPGCSIRTCALPCVKAHKQRTSCTGKRSRAEPIPISQFNDDLLLSDY 184
            LCEEC   P KY+CPGCS+R+C+L CVKAHKQR SC+GKRS+   +P+SQF+D+LLLSDY
Sbjct: 24   LCEECKEKPSKYKCPGCSVRSCSLDCVKAHKQRASCSGKRSQTHFVPLSQFDDNLLLSDY 83

Query: 185  KFLEEGKRVADSARRTISGLVGNIGF-QLPTRLKILRNAARRRRTQVLFLSQKMAKSERN 361
              LEE KRVADSARRT + L  +  + + P   + L+ AA RRRT++LFL   M+K E+N
Sbjct: 84   NLLEEIKRVADSARRTRTKLHPHPPYSRFPPHRQDLKRAAARRRTKLLFLPSGMSKREKN 143

Query: 362  RSRYDIRKNTIFWTIEWRFNGTGVSLIDHGADEYTNLHSLLEKHLKPSPWNHPLKPYCDV 541
            +++YD RK +I WTIEWRF+ T V L DHG  E T L S++EKHLKP PWNHPL+ +CD 
Sbjct: 144  QTQYDPRKKSISWTIEWRFHSTDVVLHDHGVHEDTELFSVIEKHLKPGPWNHPLRQFCDQ 203

Query: 542  PPEDLKVFIQKTPKGSKSPFRMLSIKAPFGQQMANIVLVEHPIIHVYLPSHNYDFDIDND 721
            P + LK FI+K PKG KS F  ++ KA   QQ+AN+V++EHP+IHV+LPSH YDFD+  D
Sbjct: 204  PLDSLKFFIRKYPKGPKSTFCEINTKASLRQQLANVVIMEHPVIHVFLPSHKYDFDVVKD 263

Query: 722  IELLSHSKTDDPPGSSDGIPNSKSLYFREEQIE-ESELSSF-TKVTDLMDHSRPRQSDKF 895
            + L++H + D    +S+  P+ K + FREEQIE E+ + SF  +V DLM +     S++ 
Sbjct: 264  VRLVNH-RLDAKNSASNDCPSPKGIVFREEQIEDEANICSFDPQVYDLMKNEILSPSNQI 322

Query: 896  HLKKNAAVNEKTSDQFYALMSKPDSIDLKSASGEGCKASVIKDRNTDSNSGANHTRLSND 1075
                NA+V  K  +  +      DS+ ++ A+  G  +S  K   T  N           
Sbjct: 323  -CHHNASV--KALENIF-----DDSLAVREAAANGVHSS-SKSEGTFEN----------- 362

Query: 1076 VKFEFEQELKDAYSDLVGEINPDDFLCFDGVYSDEYELEEQRANLLILDGRFLGDDQLEE 1255
            ++F+F+Q L DAYSDL+ +INPDDFL  +GV+S E   EE R +     G F  +++LEE
Sbjct: 363  MEFDFDQGLMDAYSDLIAQINPDDFLDLEGVFSKE---EEDRNDHFGSMGTFSVEEELEE 419

Query: 1256 GEI 1264
            GEI
Sbjct: 420  GEI 422


>ref|XP_002269272.1| PREDICTED: box C/D snoRNA protein 1 [Vitis vinifera]
            gi|147778999|emb|CAN60314.1| hypothetical protein
            VITISV_036305 [Vitis vinifera]
          Length = 385

 Score =  352 bits (904), Expect = 1e-94
 Identities = 193/422 (45%), Positives = 256/422 (60%)
 Frame = +2

Query: 5    LCEECGLNPWKYRCPGCSIRTCALPCVKAHKQRTSCTGKRSRAEPIPISQFNDDLLLSDY 184
            LC+EC LNP KY CPGCS+R+C+LPCVKAHKQ+T CTGKR + + +P+SQF+D+LLLSDY
Sbjct: 14   LCQECKLNPSKYTCPGCSVRSCSLPCVKAHKQQTGCTGKRQQTQFVPLSQFDDNLLLSDY 73

Query: 185  KFLEEGKRVADSARRTISGLVGNIGFQLPTRLKILRNAARRRRTQVLFLSQKMAKSERNR 364
              LEE K VA+SA+R    L G    + P  L+ LRNAA  RRT++LFL   M+K E+N+
Sbjct: 74   NLLEEVKSVAESAQRRRVKLCGYSQLKFPYHLRGLRNAAGSRRTKLLFLPSGMSKREKNK 133

Query: 365  SRYDIRKNTIFWTIEWRFNGTGVSLIDHGADEYTNLHSLLEKHLKPSPWNHPLKPYCDVP 544
            S+Y+ R   I WTIEWRF+ T V L+DHG +E + L S++EKHLKP PWNH LKP+C   
Sbjct: 134  SQYNQRSKCITWTIEWRFHSTDVVLLDHGINENSTLSSVIEKHLKPGPWNHKLKPFCAEQ 193

Query: 545  PEDLKVFIQKTPKGSKSPFRMLSIKAPFGQQMANIVLVEHPIIHVYLPSHNYDFDIDNDI 724
             + LK FI+K PKG +SPF  L I+AP  QQ AN+ ++E+P IHV+LPSH+YDFD+  D 
Sbjct: 194  LDCLKFFIRKYPKGPRSPFHELDIRAPIRQQFANLAILEYPQIHVFLPSHSYDFDVIKDA 253

Query: 725  ELLSHSKTDDPPGSSDGIPNSKSLYFREEQIEESELSSFTKVTDLMDHSRPRQSDKFHLK 904
                H + +     +   P+ K + FREE+IEE   S  +KV DLM H +  Q++K    
Sbjct: 254  NP-RHRQAELKESLTIDQPSPKGISFREEEIEEHGGSLDSKVLDLMQHVKSTQANK---- 308

Query: 905  KNAAVNEKTSDQFYALMSKPDSIDLKSASGEGCKASVIKDRNTDSNSGANHTRLSNDVKF 1084
                               P S   K   GE                         ++ F
Sbjct: 309  -------------------PQSSPQKKEPGE-------------------------NIDF 324

Query: 1085 EFEQELKDAYSDLVGEINPDDFLCFDGVYSDEYELEEQRANLLILDGRFLGDDQLEEGEI 1264
            +FEQ L D YSD++ +INPDDFL ++G +S E + EE R +     G F   ++LEEGEI
Sbjct: 325  DFEQGLLDVYSDIIAQINPDDFLDWEGDFSKEVKSEE-RIDCSDFGGVF-PVEELEEGEI 382

Query: 1265 PG 1270
            PG
Sbjct: 383  PG 384


>emb|CBI32194.3| unnamed protein product [Vitis vinifera]
          Length = 383

 Score =  350 bits (898), Expect = 5e-94
 Identities = 192/421 (45%), Positives = 255/421 (60%)
 Frame = +2

Query: 5    LCEECGLNPWKYRCPGCSIRTCALPCVKAHKQRTSCTGKRSRAEPIPISQFNDDLLLSDY 184
            LC+EC LNP KY CPGCS+R+C+LPCVKAHKQ+T CTGKR + + +P+SQF+D+LLLSDY
Sbjct: 14   LCQECKLNPSKYTCPGCSVRSCSLPCVKAHKQQTGCTGKRQQTQFVPLSQFDDNLLLSDY 73

Query: 185  KFLEEGKRVADSARRTISGLVGNIGFQLPTRLKILRNAARRRRTQVLFLSQKMAKSERNR 364
              LEE K VA+SA+R    L G    + P  L+ LRNAA  RRT++LFL   M+K E+N+
Sbjct: 74   NLLEEVKSVAESAQRRRVKLCGYSQLKFPYHLRGLRNAAGSRRTKLLFLPSGMSKREKNK 133

Query: 365  SRYDIRKNTIFWTIEWRFNGTGVSLIDHGADEYTNLHSLLEKHLKPSPWNHPLKPYCDVP 544
            S+Y+ R   I WTIEWRF+ T V L+DHG +E + L S++EKHLKP PWNH LKP+C   
Sbjct: 134  SQYNQRSKCITWTIEWRFHSTDVVLLDHGINENSTLSSVIEKHLKPGPWNHKLKPFCAEQ 193

Query: 545  PEDLKVFIQKTPKGSKSPFRMLSIKAPFGQQMANIVLVEHPIIHVYLPSHNYDFDIDNDI 724
             + LK FI+K PKG +SPF  L I+AP  QQ AN+ ++E+P IHV+LPSH+YDFD+  D 
Sbjct: 194  LDCLKFFIRKYPKGPRSPFHELDIRAPIRQQFANLAILEYPQIHVFLPSHSYDFDVIKDA 253

Query: 725  ELLSHSKTDDPPGSSDGIPNSKSLYFREEQIEESELSSFTKVTDLMDHSRPRQSDKFHLK 904
                H + +     +   P+ K + FREE+IEE   S  +KV DLM H +  Q++K    
Sbjct: 254  NP-RHRQAELKESLTIDQPSPKGISFREEEIEEHGGSLDSKVLDLMQHVKSTQANK---- 308

Query: 905  KNAAVNEKTSDQFYALMSKPDSIDLKSASGEGCKASVIKDRNTDSNSGANHTRLSNDVKF 1084
                               P S   K   GE                         ++ F
Sbjct: 309  -------------------PQSSPQKKEPGE-------------------------NIDF 324

Query: 1085 EFEQELKDAYSDLVGEINPDDFLCFDGVYSDEYELEEQRANLLILDGRFLGDDQLEEGEI 1264
            +FEQ L D YSD++ +INPDDFL ++G +S E + EE R +     G F   ++LEEGEI
Sbjct: 325  DFEQGLLDVYSDIIAQINPDDFLDWEGDFSKEVKSEE-RIDCSDFGGVF-PVEELEEGEI 382

Query: 1265 P 1267
            P
Sbjct: 383  P 383


>ref|XP_002512150.1| conserved hypothetical protein [Ricinus communis]
            gi|223548694|gb|EEF50184.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 411

 Score =  346 bits (888), Expect = 8e-93
 Identities = 191/420 (45%), Positives = 256/420 (60%)
 Frame = +2

Query: 5    LCEECGLNPWKYRCPGCSIRTCALPCVKAHKQRTSCTGKRSRAEPIPISQFNDDLLLSDY 184
            +CEEC  NP KY+CPGCS+R+C+LPCVKAHK RT C+GKR++   +P+SQFND L+LSDY
Sbjct: 16   ICEECKENPSKYKCPGCSLRSCSLPCVKAHKHRTGCSGKRNQTHFVPLSQFNDSLILSDY 75

Query: 185  KFLEEGKRVADSARRTISGLVGNIGFQLPTRLKILRNAARRRRTQVLFLSQKMAKSERNR 364
              LEE KRVA+SA+R  + L     F+LP  L+ LR AA  RRT+++FL   M+K E+N+
Sbjct: 76   NLLEETKRVAESAQRMRTKLCAYPQFRLPAYLQSLRRAAASRRTKLIFLPSGMSKREKNQ 135

Query: 365  SRYDIRKNTIFWTIEWRFNGTGVSLIDHGADEYTNLHSLLEKHLKPSPWNHPLKPYCDVP 544
            SRY+ RK  I WT+EWRF+ T V L+DHG  E  NL S++E+HL P PWNH L+ +C+  
Sbjct: 136  SRYNQRKKFIAWTVEWRFHTTDVVLLDHGVHEDRNLFSVIEQHLNPGPWNHQLRQFCEEQ 195

Query: 545  PEDLKVFIQKTPKGSKSPFRMLSIKAPFGQQMANIVLVEHPIIHVYLPSHNYDFDIDNDI 724
             + LK FI+K PKG KSPF  L IKAP  QQ+ANIV++E+PIIHV+LPSH  DF++   +
Sbjct: 196  LDSLKFFIRKYPKGPKSPFCELDIKAPLRQQLANIVILENPIIHVFLPSHGCDFEVVKCM 255

Query: 725  ELLSHSKTDDPPGSSDGIPNSKSLYFREEQIEESELSSFTKVTDLMDHSRPRQSDKFHLK 904
            + ++H +  +   S  G+       FREE+IEES  SS  ++ D                
Sbjct: 256  QSVTHRQETNNSASPKGVS------FREEEIEESNGSSDPQIYD--------------FT 295

Query: 905  KNAAVNEKTSDQFYALMSKPDSIDLKSASGEGCKASVIKDRNTDSNSGANHTRLSNDVKF 1084
            KN  ++       + +  K     L  +S     AS+  D N+  NS      L  D+ F
Sbjct: 296  KNVMLSPLHEIPCHNMSEK----SLDGSSDGSFLASMAADSNSQINSMGIEPLLFGDLDF 351

Query: 1085 EFEQELKDAYSDLVGEINPDDFLCFDGVYSDEYELEEQRANLLILDGRFLGDDQLEEGEI 1264
             F+Q L DAYS L+G+ NPDDF   +G    E E E  R +L       L  D+LEEGEI
Sbjct: 352  GFDQALIDAYSGLIGQGNPDDFFDLEGELPKEEESE--RKHLSNSREVLLVQDELEEGEI 409


>ref|NP_001184910.1| HIT-type Zinc finger family protein [Arabidopsis thaliana]
            gi|332189643|gb|AEE27764.1| HIT-type Zinc finger family
            protein [Arabidopsis thaliana]
          Length = 644

 Score =  338 bits (866), Expect = 3e-90
 Identities = 189/425 (44%), Positives = 257/425 (60%), Gaps = 4/425 (0%)
 Frame = +2

Query: 2    SLCEECGLNPWKYRCPGCSIRTCALPCVKAHKQRTSCTGKRSRAEPIPISQFNDDLLLSD 181
            S+CEEC  NPWKY+CPGCSIR+CALPCVKAHKQRT CTGKR   + +P+S+F+D+LLLSD
Sbjct: 256  SVCEECKQNPWKYKCPGCSIRSCALPCVKAHKQRTGCTGKRKFTDVVPLSKFDDNLLLSD 315

Query: 182  YKFLEEGKRVADSARRTISGLVGN-IGFQLPTRLKILRNAARRRRTQVLFLSQKMAKSER 358
            Y  LEE KRVA+SA R  S L  N   ++LP  LK L++AA  RRT++ +L   M K E 
Sbjct: 316  YNMLEETKRVAESALRRRSQLCKNHYSYKLPYLLKSLQSAAYSRRTKLWYLPSGMLKREN 375

Query: 359  NRSRYDIRKNTIFWTIEWRFNGTGVSLIDHGADEYTNLHSLLEKHLKPSPWNHPLKPYCD 538
            N+SRYD R   I WTIEWRF+ T V L+DHG  E  NL S+++ HLKP PW H LKP+CD
Sbjct: 376  NQSRYDNRSKCISWTIEWRFHSTDVILVDHGVGEDRNLCSVIKNHLKPGPWIHKLKPFCD 435

Query: 539  VPPEDLKVFIQKTPKGSKSPFRMLSIKAPFGQQMANIVLVEHPIIHVYLPSHNYDFDIDN 718
            V  + LK+FI++ PKG+K+PF+ L IKAP  +Q+A +V++E+P+IHVYLPS +Y+F +  
Sbjct: 436  VDLDSLKLFIRQYPKGAKAPFKELDIKAPLRKQLAKVVILEYPVIHVYLPSQSYEFKVIK 495

Query: 719  DIELLSHSKTDDPPGS-SDGIPNSKSLYFREEQIEESELSSF-TKVTDLMDHSRPRQSDK 892
            D      + T +P  S  DG   +  + FREE+IEE ++ SF  +V  LM         +
Sbjct: 496  DF-----NTTPNPNDSLYDGHGCTNGITFREEEIEEDDIDSFEPEVLGLM--------KQ 542

Query: 893  FHLKKNAAVNEKTSDQFYALMSKPDSIDLKSASGEGCKASVIKDRNTDSNSGANHTRLSN 1072
             +      V+EK+                        KA  +   N++          + 
Sbjct: 543  MNYNPCLRVSEKS------------------------KAEGVGTNNSNPQVDTTEQEDAG 578

Query: 1073 DVKFEFEQELKDAYSDLVGEINPDDFLCFDGVYSDEYELEEQRANLLILDGRFLGDD-QL 1249
            +++ EFEQ L D YSDL  E+NP D+  F+  ++   +  +   NL  LD  F+ D   L
Sbjct: 579  NMELEFEQGLIDTYSDLFAEMNPGDYFNFECEFAKGLD-SDDNCNLQNLDTDFIADGLDL 637

Query: 1250 EEGEI 1264
            EEGEI
Sbjct: 638  EEGEI 642

