BLASTX nr result

ID: Dioscorea21_contig00010870

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00010870
         (2538 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002312573.1| predicted protein [Populus trichocarpa] gi|2...   212   5e-52
ref|XP_002517012.1| hypothetical protein RCOM_0908960 [Ricinus c...   182   5e-43
ref|XP_002312571.1| predicted protein [Populus trichocarpa] gi|2...   163   2e-37
ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Ara...   130   2e-27
ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   128   7e-27

>ref|XP_002312573.1| predicted protein [Populus trichocarpa] gi|222852393|gb|EEE89940.1|
            predicted protein [Populus trichocarpa]
          Length = 970

 Score =  212 bits (539), Expect = 5e-52
 Identities = 224/883 (25%), Positives = 365/883 (41%), Gaps = 59/883 (6%)
 Frame = -1

Query: 2520 GISSCPVSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKA 2341
            G S+  +   LK +SGAGANA S  DM  + TN LSELVWSP+KGLSLK A+ + S +K 
Sbjct: 19   GYSNQCIQRRLKNDSGAGANAASSVDMTFVATNALSELVWSPKKGLSLKCADGTFSNQKP 78

Query: 2340 SLLWNAESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKNSNREAPPSPPQS 2161
            SLL  A        P    +   + KA+        P + ++ SE   + R+ P     S
Sbjct: 79   SLLRGAG-------PSDMVSGSNADKAIGKKVFMTPPEESDVRSEV--AGRDNPTKFVTS 129

Query: 2160 VAGMQPISLTLIHEQHSRSYGHMGQFGS------TSVNLDNPEKDKNEEILHSKSISRGD 1999
              G+ P+    +H+    +Y  +           T+V L  P   K E+  ++K+    +
Sbjct: 130  DTGLFPLLSESMHKVKIGNYEFLAATDDHKEEMKTAVGL--PFLQKMEDARNNKA----E 183

Query: 1998 EVWKNVKSAVDVMPEAFNLDNKKGPGDLKLNSAQVECEPISNFIQHFRGSIG----TRKD 1831
            +++  +   VD +   +         + KL+ AQ    P S       G +G    T + 
Sbjct: 184  DIYDPINLQVDEISRTWETKFPSLSDETKLDVAQNG--PTSKEPNVRIGGVGDASHTLQT 241

Query: 1830 NLLGLEGKAEDYSSEKHDF-VTKLPSSPRTVANEGLNSENVRSNISTRVIECA------- 1675
             ++           E +D  + K P       +     +   +N+ T    C        
Sbjct: 242  EIVSASQVCSVEECESYDTNMQKAPLGREHFESPSCMEKERENNMGTGPYICPLEKLEST 301

Query: 1674 --DNFQSLSKQGMFGREAVDLQDRNEVHLTAPAQASDEHVAELRKASLLGKSAPTEGLLN 1501
              ++F++   + +       +  +N   + + +Q  DE + +    ++  K +PT     
Sbjct: 302  AENDFKTPHSENVCAVATEIVGSQNAKEVRSSSQQDDEILPKDNDCAI--KQSPTY---- 355

Query: 1500 KSESLRSHSEGNGHINSHKNGRDNVTKDXXXXXXXXXXXXXXXXXSTRKRELAFEPESSS 1321
             S + R   +G     S  N  + +                    ST KR+  F+P S  
Sbjct: 356  -SRTRRYQMKGKAKALSDGNLNERMLDMDDDSHESVESCNSVGLFSTGKRQRNFDPHSYV 414

Query: 1320 ENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLNQPLPIAH------D 1159
             +K +KT+ Q+   S SF K + SFMNWIS M  GF +S +++  +  L +A+      D
Sbjct: 415  GSKSIKTKIQESPGSSSFVKHDGSFMNWISNMMKGFLKSNEDEAPSLALTLANHKHGHED 474

Query: 1158 TNKG----------SCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQ 1009
             +K            C  +GF S+F SL+ P+   Q+    + ++Q         D++  
Sbjct: 475  RDKNLISCNRNQDQGCKTMGFHSLFQSLYCPKTKAQETVALNANTQTEGSKELGLDNKIC 534

Query: 1008 ASTGAGLVGSDGLDSNLQNAIGTSSKATNS-SLKGIVCHEHVKLPTGALHSNENLKQATC 832
             S    +      D+  +  +  + K   S S  G       KL +  + S + +  +  
Sbjct: 535  DSNATPIPCRMVTDNVYKRFLQPNEKLNESTSGNGTASPALTKLLSTNIASGQEISGSNS 594

Query: 831  VDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSVLN 652
             ++    N+  +++ K      G   N            Q +     EGKA  T      
Sbjct: 595  AEKK---NSCNMATDKEKN---GTSSNSSRGKRKMNDAEQPS-----EGKATNT------ 637

Query: 651  GSSNLVSKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEKHT-----KITEKSC 487
                  S  R     SLWI+RL PK S  +     C      + +  T     K   ++ 
Sbjct: 638  ------SGYRSDPLTSLWITRLSPKTSGPLSNRDLCHRRTGEALDGFTDFIRLKAQWQNH 691

Query: 486  PSLFGQKSFARGTIKAQGHSDSDG------SNGTNA-----------DGSSKSKLNCKLP 358
            PS +  K+      + + H   D       +N T             D  S  K+N  LP
Sbjct: 692  PSSYQDKNIVGA--REEEHFTEDPVCMHNCANSTEVSFSINKVNGHHDEKSMCKMNSTLP 749

Query: 357  SQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHSLKECPQLTESE 178
              +   SE MASVFARRLDA+ HI P+   +D +    TCFFCG   H +++CP++ +SE
Sbjct: 750  FSRFRNSEAMASVFARRLDALMHIMPSYGTDDSSHGNLTCFFCGIKCHHVRDCPEIIDSE 809

Query: 177  LQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49
            L DILR+ NS++  + F  +CIRCF  NHWA++CP  SS+ ++
Sbjct: 810  LADILRNANSFNGANEFPCVCIRCFQSNHWAVACPSASSRTRH 852


>ref|XP_002517012.1| hypothetical protein RCOM_0908960 [Ricinus communis]
            gi|223543647|gb|EEF45175.1| hypothetical protein
            RCOM_0908960 [Ricinus communis]
          Length = 1067

 Score =  182 bits (461), Expect = 5e-43
 Identities = 218/882 (24%), Positives = 353/882 (40%), Gaps = 58/882 (6%)
 Frame = -1

Query: 2520 GISSCPVSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKA 2341
            G S+  +   L  + GAGANA S  D+  + T+PLSELVWSP KGLSL+ A+ S  +KK 
Sbjct: 19   GYSNQCIQRNLSNDPGAGANAASTADITFVATDPLSELVWSPHKGLSLRCADGSFIDKKP 78

Query: 2340 SLLWNAESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKNSNREAPPSPPQS 2161
            SLL               P VG +  A  +            +S+   SN          
Sbjct: 79   SLL---------------PGVGPTYMASGS------------SSDKPISNTGKLFDNEIC 111

Query: 2160 VAGMQPISLTL-IHEQHSRSY--GHMGQFGSTSVNLDNPEKDKNEEILHSKSISRGDEVW 1990
            +A +    L   I   +S ++   ++G    +   LD                + GD+V 
Sbjct: 112  IASLPACKLASEISGDNSTTFLTSNVGIMPLSGTGLDKT--------------ATGDQVV 157

Query: 1989 KNVKSAVDVMPEAFNLDNKKGPGDLKLNSAQVECEPISNFIQHFRGSIGTRKDNLLGLE- 1813
            + +K+AV+   +  +L N K   + KL+ AQ        F +    +     D+ LG+E 
Sbjct: 158  E-MKNAVNYFLQKEDLRNDKAEDETKLDVAQ----NYRTFEEPIVRATDVNDDHELGMEI 212

Query: 1812 GKAEDYSSEK--HDFVTKLPSSPRTVANEGLNSENVRSN-------ISTRVIECADNFQS 1660
                D+ + K   D+  K+ ++  +   E     +VR         I    I   D  +S
Sbjct: 213  VLVSDFHTVKGREDYGIKIQNAACS-GKENEEPPSVREKERKNKMVIGRPGIFSLDKLES 271

Query: 1659 LSKQGM---FGREAVDLQDRNEVHLTAP-AQASDEHVAELRKASLLGKSAPTEGLLNKSE 1492
             ++  +   FG  +  ++++N    +A   + + +H     + +L    +PT   L   +
Sbjct: 272  TAENDLETPFGENSCSMRNKNLASESADRVENNTQHELIPIEYALGYNQSPTSSRLQNIQ 331

Query: 1491 SLRSHSEGNGHINSHKNGRDNVTKDXXXXXXXXXXXXXXXXXSTRKRELAFEPESSSENK 1312
                  +G     S  + ++ +  +                 ST K+   F+ +    +K
Sbjct: 332  R-----QGQSKALSDGDAKERMLNEEDGSHESVESCNSTELFSTGKQRWNFDQQLIVGSK 386

Query: 1311 RLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRS----------------YQEKPLNQ 1180
            R+K Q QD   S S  KQ+SSF+NWIS M  GF +S                Y  +  +Q
Sbjct: 387  RVKRQIQDSPGSSSLGKQDSSFVNWISNMMKGFLKSSEGEAPFLSSALSNPNYGHENPSQ 446

Query: 1179 PLPIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQAST 1000
             +   +     +C   GF S+F SL+  +   Q+    +++ Q       +QD++     
Sbjct: 447  DVFTCNRKEDPACDTRGFQSVFQSLYCRKTKGQETVTLNVNHQTEGSKECDQDNKI-CDL 505

Query: 999  GAGLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVKLPTGALHSNENLKQATCVDEA 820
             A  +    +  N+      S++  N    G     H  +    +HS +       + E+
Sbjct: 506  NAAPIACRMVTGNVYKRFLPSNEKHNEPTSGY----HAGM---TVHSRDISMSFPVIPES 558

Query: 819  LPL------NTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSV 658
                     N+  ++ GK  +   G   NF                T+  GK    +PS 
Sbjct: 559  NGSVSTENKNSCNLAIGKEKD---GTDSNFSHGKHK----------TSSAGKIDPELPSE 605

Query: 657  LNGSSNLVSKKRGAFRESLWISRLLPKVS----------VSIPEPANCSHGV-------- 532
               +     K  G    SLWI+R  PK S           S  E  NCS           
Sbjct: 606  DKTAHGFGYK--GDPLGSLWIARFSPKTSGAPFNHYPSNKSTGEAFNCSADSMGLIPQVQ 663

Query: 531  -ELSNEKHTKITEKSCPSLFGQKSFARGTIKAQGHSDSDGSNGTNADGSSKSKLNCKLPS 355
              L +    +I E    +          +   +   D     G N D  S +KLN  L S
Sbjct: 664  NPLGSSSEHEIVEVRNKNFQEPLPIQNYSTANRAPFDFYNVKG-NIDNDSGNKLNPILSS 722

Query: 354  QKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHSLKECPQLTESEL 175
             ++  SE MASV  RRLDA K+ITP+   ++      TCFFCG  GH L+EC ++T++EL
Sbjct: 723  ARVKTSEAMASVSPRRLDAPKYITPSDDADNSDRASMTCFFCGIKGHDLRECSEVTDTEL 782

Query: 174  QDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49
            +D+LR++N Y        +CIRCF  NHWA++CP    ++++
Sbjct: 783  EDLLRNINIYGGIKELPCVCIRCFQLNHWAVACPSTCPRVRS 824


>ref|XP_002312571.1| predicted protein [Populus trichocarpa] gi|222852391|gb|EEE89938.1|
            predicted protein [Populus trichocarpa]
          Length = 779

 Score =  163 bits (412), Expect = 2e-37
 Identities = 135/477 (28%), Positives = 211/477 (44%), Gaps = 39/477 (8%)
 Frame = -1

Query: 1362 TRKRELAFEPESSSENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLN 1183
            T KR+  F+P S   +K +KT+ Q+   S SF K + SFMNWIS M  GF +S +++  +
Sbjct: 210  TGKRQRNFDPHSYVGSKSIKTKIQESPGSSSFVKHDGSFMNWISNMMKGFLKSNEDEAPS 269

Query: 1182 QPLPIAH------DTNKG----------SCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQ 1051
              L +A+      D +K            C  +GF S+F SL+ P+   Q+    + ++Q
Sbjct: 270  LALTLANHKHGHEDRDKNLISCNRNQDQGCKTMGFHSLFQSLYCPKTKAQETVALNANTQ 329

Query: 1050 RVADVLSEQDDREQASTGAGLVGSDGLDSNLQNAIGTSSKATNS-SLKGIVCHEHVKLPT 874
                     D++   S    +      D+  +  +  + K   S S  G       KL +
Sbjct: 330  TEGSKELGLDNKICDSNATPITCPMVTDNVYKRFLQPNEKLNESTSGNGAASPALTKLLS 389

Query: 873  GALHSNENLKQATCVDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTT 694
              + S++ +  +   ++    N+  +++ K      G   N            Q +    
Sbjct: 390  TNIASSQEISGSNSAEKK---NSCNMATDKEKN---GTSSNSSPGKRKMNDAEQPS---- 439

Query: 693  LEGKAIGTIPSVLNGSSNLVSKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEK 514
             EGKA  T            S  R     SLWI+RL PK S  +     C      + + 
Sbjct: 440  -EGKATNT------------SGYRSDPLTSLWITRLSPKTSGPLSNRDLCHRRTGEALDG 486

Query: 513  HT-----KITEKSCPSLFGQKSFARGTIKAQGHSDSDG------SNGTNA---------- 397
             T     K   ++ PS +  K+      + + H   D       +N T            
Sbjct: 487  FTDFIRLKAQWQNHPSSYQDKNIVGA--REEEHFTEDPVCMHNCANSTEVSFSINKVNGH 544

Query: 396  -DGSSKSKLNCKLPSQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKV 220
             D  S  K+N  LP  +   SE MASVFARRLDA+ HI P+   +D +    TCFFCG  
Sbjct: 545  HDEKSMCKMNSTLPFSRFRNSEAMASVFARRLDALMHIMPSYGTDDSSHGNLTCFFCGIK 604

Query: 219  GHSLKECPQLTESELQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSKIKN 49
             H +++CP++ +SEL DILR+ NS++  + F  +CIRCF  NHWA++CP  SS+ ++
Sbjct: 605  CHHVRDCPEIIDSELADILRNANSFNGANEFPCVCIRCFQSNHWAVACPSASSRTRH 661


>ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Arabidopsis lyrata subsp.
            lyrata] gi|297311245|gb|EFH41669.1| zinc knuckle
            (CCHC-type) family protein [Arabidopsis lyrata subsp.
            lyrata]
          Length = 759

 Score =  130 bits (327), Expect = 2e-27
 Identities = 121/433 (27%), Positives = 184/433 (42%), Gaps = 31/433 (7%)
 Frame = -1

Query: 1278 SGSFH--KQESSFMNWISTMTNGFSRSYQEKPL-----------------------NQPL 1174
            SGS+   KQ+SSFMNWIS MT G  +  +E                           Q  
Sbjct: 222  SGSYRRPKQDSSFMNWISNMTKGIWKGNEEDDSPFAALTTTSDANGHGQVNAIVDQQQLS 281

Query: 1173 PIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRAQKDLDSQRVADVLSEQDDREQASTGA 994
            P     N G C N GF S+F S++ P+   QD  + D  +   A  L E     +     
Sbjct: 282  PCCVKENSG-CRNTGFQSLFQSIYCPKKRSQDAVEMDFPNDANATSLQELPWIPEQ---C 337

Query: 993  GLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVKLPTGALHSNENLKQATCVDEALP 814
            G+   D L S+  N IG  ++   SS K         L      S+EN ++    D+   
Sbjct: 338  GIAKGDDLSSS-DNDIGPVAEPNISSGKVGFNQRSETL------SSENKRE----DKEPN 386

Query: 813  LNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKALTTTLEGKAIGTIPSVLNGSSNLV 634
            ++ + +S  K +E+                          + G+A G +   LN      
Sbjct: 387  ISLMSLSKSKPNEEP------------------------KICGEAGGKVSPCLNN----- 417

Query: 633  SKKRGAFRESLWISRLLPKVSVSIPEPANCSHGVELSNEKHTKITEKSCPSLFGQKSFAR 454
               R +  +SLWISR   K      + +  +  V  S     K  +        QK    
Sbjct: 418  ---RNSGLQSLWISRFSSKSPFPQKKTSETAKEVNASASDTAKTHDS-------QKMLVN 467

Query: 453  GTIKAQGHSDSDGSNGTNADGSSKSKLNCKLP---SQKLIKSEPMASVFARRLDAIKHIT 283
              +     S  DG +          KLN  LP   S ++  SE MAS+FARRL+A+KHI 
Sbjct: 468  NNVVIPSISSVDGLD----------KLNTVLPIVSSMRIESSEAMASLFARRLEAMKHII 517

Query: 282  PAKTMNDKTSMLGT---CFFCGKVGHSLKECPQLTESELQDILRDLNSYDNTDGFLSICI 112
            PA ++ +          CF+CGK GH L++C ++T++EL+D++++++S +  +   S+CI
Sbjct: 518  PAGSLAENAEEEQPNLICFYCGKKGHCLQDCLEVTDTELRDLVQNISSRNGREEASSLCI 577

Query: 111  RCFGFNHWAISCP 73
            RCF  +HWA +CP
Sbjct: 578  RCFQLSHWAATCP 590


>ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224596
            [Cucumis sativus]
          Length = 1004

 Score =  128 bits (322), Expect = 7e-27
 Identities = 127/471 (26%), Positives = 190/471 (40%), Gaps = 36/471 (7%)
 Frame = -1

Query: 1362 TRKRELAFEPESSSENKRLKTQAQDKFCSGSFHKQESSFMNWISTMTNGFSRSYQEKPLN 1183
            T KR  +FE      NKR K Q  +     S   Q+SSFM WIS M  GFS S Q++   
Sbjct: 376  TSKRRWSFEQRLIVGNKRAKKQDGNASGPTSNLGQDSSFMIWISNMMKGFSESIQDEAPT 435

Query: 1182 QPL---------------PIAHDTNKGSCTNIGFGSIFHSLHSPRLLIQDRA-----QKD 1063
              L               PI    N    + IGF SIF SL++P +  ++ A     Q  
Sbjct: 436  LDLTLAKCDVEQGGPNEEPIYKKINAPGFSGIGFQSIFRSLYNPTMRGEEGAPSATCQAK 495

Query: 1062 LDSQRVADVLSEQDDREQASTGAGLVGSDGLDSNLQNAIGTSSKATNSSLKGIVCHEHVK 883
             +++ +  + +  D         G     G    L N   T   + N     I      +
Sbjct: 496  QEAKGIEIIKNSCDLNATPIACFGESDHFGKQLLLNNENATDLISGNGPTLLIQLKNSPE 555

Query: 882  LPTGALHSNENLKQATCVDEALPLNTIYISSGKSHEKAVGNIGNFXXXXXXXXXXSQKAL 703
            +  G+  S++   Q       L      +S+  + E     +G              K  
Sbjct: 556  ISCGSHQSHKTRSQGNQNSSNL------VSAAGTGEVMHSALG--------------KCK 595

Query: 702  TTTLEGKAIGTIPSVLNGSSNLVSKKRGAFRESLWISRLLPKVS--VSIPEPANCSHGVE 529
            +   E      +   +N ++  VS       +SLWISR   K S   S PE +N +   +
Sbjct: 596  SNGTENVDCDQLCGKINHTTGNVSDPL----KSLWISRFAAKASGFTSNPETSNLNTKDD 651

Query: 528  LSNEKHTKITEKSCP--------------SLFGQKSFARGTIKAQGHSDSDGSNGTNADG 391
                 H+      CP              ++  ++     T  + GH +       +++ 
Sbjct: 652  SQCSMHSP-RHMPCPQNHIDHHSMDDLDTAVSKEQHNIANTETSPGHKEFKD----HSEQ 706

Query: 390  SSKSKLNCKLPSQKLIKSEPMASVFARRLDAIKHITPAKTMNDKTSMLGTCFFCGKVGHS 211
             S SK    L S K+   E MASVFARRL A+KHI P+    +  +   TCFFCG  GH+
Sbjct: 707  KSISKFKSALRSPKIRSPEAMASVFARRLGALKHIIPSDLTINVGNETVTCFFCGTKGHN 766

Query: 210  LKECPQLTESELQDILRDLNSYDNTDGFLSICIRCFGFNHWAISCPFESSK 58
            L  C ++TE E++D+ R++   + T      CIRCF  NHWAI+CP   ++
Sbjct: 767  LHNCSEITEREIEDLSRNIRFCNETVDPPCSCIRCFQLNHWAIACPLAPAR 817



 Score = 71.6 bits (174), Expect = 1e-09
 Identities = 41/102 (40%), Positives = 58/102 (56%)
 Frame = -1

Query: 2502 VSNYLKKNSGAGANANSRTDMVLITTNPLSELVWSPQKGLSLKYANSSLSEKKASLLWNA 2323
            +   L   SG GANA S  D+  +TT+ LSELVWSP KGLSL+ A+SS + +K S+LW+A
Sbjct: 25   IQGRLTNRSGVGANAGSMVDVKYVTTDSLSELVWSPHKGLSLRCADSSFNNRKTSILWDA 84

Query: 2322 ESFNIMILPPQCPNVGESSKAMDTIDRNLNPVQLEINSESKN 2197
             +       PQ   + E S + + +D N   +  +  S  KN
Sbjct: 85   AANKANFALPQSV-IAEKSTSNNLLD-NRTIILSQAESHLKN 124

