BLASTX nr result

ID: Dioscorea21_contig00011032 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00011032
         (2031 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002305691.1| predicted protein [Populus trichocarpa] gi|2...   343   1e-91
ref|XP_002331921.1| predicted protein [Populus trichocarpa] gi|2...   330   7e-88
ref|XP_003549171.1| PREDICTED: uncharacterized protein LOC100817...   321   5e-85
ref|XP_003620160.1| hypothetical protein MTR_6g077930 [Medicago ...   318   4e-84
emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]   313   8e-83

>ref|XP_002305691.1| predicted protein [Populus trichocarpa] gi|222848655|gb|EEE86202.1|
            predicted protein [Populus trichocarpa]
          Length = 1132

 Score =  343 bits (880), Expect = 1e-91
 Identities = 210/520 (40%), Positives = 273/520 (52%), Gaps = 30/520 (5%)
 Frame = +3

Query: 306  EYIKKNSSIGPLSQKWVPLGSKHP-VVFKDNHGNSLV-------------------SSFT 425
            E+ K+N S G + QKW+P+G +   +      GNSL                    +SF 
Sbjct: 636  EHCKQNHSSGSVMQKWIPIGVRESELATSARFGNSLPDPSDRPAREDFTLRNVQENASFD 695

Query: 426  DSTLAANGLQGLNQIVEDTDCTLTEADDCTSNFRCLARVEGEFAGVSAVTDQHV-----T 590
               L ++ L G  Q   +  C+  E D          ++      +  +  +HV     T
Sbjct: 696  SQDLVSSSLLGTCQGSGNASCSPKEDDHSQ-------KLNNSTGWMFELNKKHVEADSST 748

Query: 591  HEVMDKCVSEFHTDLNMIVKAVNDSYKLQIVSENVQLVYGSPIAEFESFLHFASPVLTQT 770
             E  D+  S F      I++AV D+ ++Q+  E +Q+  GSP+AEFE FLHF+SPV++Q 
Sbjct: 749  SEYSDQQFSAFEDKSIKIIQAVKDACRVQMECEAIQMSTGSPVAEFERFLHFSSPVISQL 808

Query: 771  DYIRSCKVCPQGQLPCEVLCPPQTPCISLGSLWQWYEKPGSYGLEVKVGAYNNLKR---- 938
              +  C+ C   +L     C  + P I LG LW+WYE+ G+YGLEV+   + N K     
Sbjct: 809  PGLSCCQTCLCDRLVGARPCRHEIPYIPLGCLWKWYEEHGNYGLEVRAEDFENSKSLGLD 868

Query: 939  CQSHYGFRAYFVPYLSAVQLFGRRSSSLCRTNESLKMMETSGMHESTSPS-LGSLPIFSK 1115
            C S   FR YFVP+LSA+QLF   +S          +  T    ES+  S  G LPIFS 
Sbjct: 869  CVS---FRGYFVPFLSAIQLFKNHTSQPINKAPDHGIFGTHEASESSEDSKAGRLPIFSV 925

Query: 1116 LCPRPRKVADTFMSESASSCKEEFCNQTAKSTHFSDGELIFEYFESGQPQQRRPLNEKIK 1295
            L P+PR  A     + A S                D EL+FEYFE  QPQQR+P  EKI+
Sbjct: 926  LIPKPRTTAAAQSVDVACS---------------DDAELLFEYFEPEQPQQRQPFYEKIQ 970

Query: 1296 ELVEGNKSSNCQIFGDPAXXXXXXXXXXXPASWYSVAWYPIYRIPDGNLRAAFLTYHSLG 1475
            ELV GN SS C+++GDP            P SWYSVAWYPIYRIPDGN R AFLTYHSLG
Sbjct: 971  ELVRGNASSRCKMYGDPTNLASLNLHDLHPRSWYSVAWYPIYRIPDGNFRTAFLTYHSLG 1030

Query: 1476 HFTRRRTSLNTSCGATSLVSPVVGLQSYNTKGECWXXXXXXXXXXXXTEGVDISHPSGIL 1655
            H   R    ++      +VSPVVGLQSYN +GECW            T  +D   PS I+
Sbjct: 1031 HLVHRSAKFDSPSKNECVVSPVVGLQSYNAQGECWFQPRHSVNQTTGTPSLD---PSVIM 1087

Query: 1656 KERLRTLEQTASVMARASVQKDGRRSANRQPDYEFFVSRK 1775
            KERLRTL +TAS+MARA V K  + S NR PDYEFF+SR+
Sbjct: 1088 KERLRTLAETASLMARAVVNKGNQTSVNRHPDYEFFLSRR 1127


>ref|XP_002331921.1| predicted protein [Populus trichocarpa] gi|222874593|gb|EEF11724.1|
            predicted protein [Populus trichocarpa]
          Length = 1150

 Score =  330 bits (847), Expect = 7e-88
 Identities = 205/519 (39%), Positives = 272/519 (52%), Gaps = 30/519 (5%)
 Frame = +3

Query: 306  EYIKKNSSIGPLSQKWVPLGSKHP-VVFKDNHGNSLV-------------------SSFT 425
            EY K+N S   + QKW+P+G K P +      GNS                     ++F 
Sbjct: 652  EYCKQNHSSVTVMQKWIPIGVKDPELTTSARFGNSSPDPSDGPAGEDLTLRNVQDKANFD 711

Query: 426  DSTLAANGLQGLNQIVEDTDCTLTEADDCTSNFRCLARVEGEFAGVSAVTDQHV-----T 590
               L ++ + G  Q   +  C   E D        + +++     +  +  +HV     T
Sbjct: 712  SQDLVSSLMLGTCQDSGNAVCFPQEDDR-------IQKLKNSTLWMDELNKKHVAADALT 764

Query: 591  HEVMDKCVSEFHTDLNMIVKAVNDSYKLQIVSENVQLVYGSPIAEFESFLHFASPVLTQT 770
             E   +  S F  +   I++AV D+ ++Q+ SE +Q+  G PIAEFE FLH +SPV+   
Sbjct: 765  SESSYQQFSAFEDESIKIIQAVKDTCRVQMESEAIQMAAGGPIAEFERFLHLSSPVINFP 824

Query: 771  DYIRSCKVCPQGQLPCEVLCPPQTPCISLGSLWQWYEKPGSYGLEVKVGAYNNLKRCQ-S 947
              +  C+ C   +L    LC  + P I LG +W+WYE+ G+YGLEV+     N       
Sbjct: 825  S-LSCCQTCLDDRLVGASLCRHEIPNIPLGCIWKWYEEHGNYGLEVRAEECENSNSGSFD 883

Query: 948  HYGFRAYFVPYLSAVQLFGRRSSSLCRTNESLKMMETSGMHESTSPS----LGSLPIFSK 1115
            H+ F  YFVP+LSAVQLF   SS       S    E S  ++++  S    +G LPIFS 
Sbjct: 884  HFSFHGYFVPFLSAVQLFKNHSSQPINNKNSAPDHEISDTYKASESSENSNVGRLPIFSL 943

Query: 1116 LCPRPRKVADTFMSESASSCKEEFCNQTAKSTHFSDGELIFEYFESGQPQQRRPLNEKIK 1295
            L P+PR  A                 Q+   T     EL+FEYFES QPQQRRPL EKI+
Sbjct: 944  LIPQPRTTAVA---------------QSVNLTCSDGAELLFEYFESEQPQQRRPLYEKIQ 988

Query: 1296 ELVEGNKSSNCQIFGDPAXXXXXXXXXXXPASWYSVAWYPIYRIPDGNLRAAFLTYHSLG 1475
            EL  G+ SS  +++GDP            P SWYSVAWYPIYRIPDG+ RAAFLTYHSLG
Sbjct: 989  ELARGDASSRYKMYGDPTNLASLNLHDLHPRSWYSVAWYPIYRIPDGHFRAAFLTYHSLG 1048

Query: 1476 HFTRRRTSLNTSCGATSLVSPVVGLQSYNTKGECWXXXXXXXXXXXXTEGVDISHPSGIL 1655
            H   +   ++ +     +VSPVVGLQSYN +GECW              G  IS+PS IL
Sbjct: 1049 HLVHKSAEVDYASKDACIVSPVVGLQSYNAQGECW---FQLRHSVNQAAGTPISNPSVIL 1105

Query: 1656 KERLRTLEQTASVMARASVQKDGRRSANRQPDYEFFVSR 1772
            KERLRTL +TAS++ARA V K  + S NR PDYEFF+SR
Sbjct: 1106 KERLRTLGETASLIARAVVNKGNQTSINRHPDYEFFLSR 1144


>ref|XP_003549171.1| PREDICTED: uncharacterized protein LOC100817088 [Glycine max]
          Length = 1181

 Score =  321 bits (822), Expect = 5e-85
 Identities = 216/559 (38%), Positives = 283/559 (50%), Gaps = 33/559 (5%)
 Frame = +3

Query: 198  DSSFAKMTGDVPDVSSVISNSKSEKC--AKSEMGNCHEE-----YIKKNSSIGPLSQKWV 356
            DSSF  M G+  + S++      + C     E+G   +E     Y  +N S G    KW+
Sbjct: 630  DSSFI-MPGEYINQSNMSEELSPDSCNLEGDEVGQNEKEVSSADYNAQNHSSGTTLWKWI 688

Query: 357  PLGSKHPVVFKDNHGNSLVSSFTDSTLAANGLQGLNQIVEDTDCTLTEADDCTSNFRCLA 536
            P+G K   + K +  NS     +D++   N        VE    +    D   ++  C  
Sbjct: 689  PVGKKDRGLEK-SESNSAPPENSDASSRNNS--NSESSVEPEVASSENPDSLNASRACNG 745

Query: 537  RV--------EGEFAGVSA-----VTDQHVTHEVMDKCVSEFHTDLNM------IVKAVN 659
            ++        EGE   + +     +T+    HE  +    E      +      I +AVN
Sbjct: 746  QIYDKVSCLDEGENHKMGSQVARTLTEHRDKHEAANHMFYECENQDMLENYSYRIAQAVN 805

Query: 660  DSYKLQIVSENVQLVYGSPIAEFESFLHFASPVLTQTDYIRSCKVCPQGQLPCEVLCPPQ 839
            D+ K Q+  E V +  G P+AEFE  LHF SPV+ ++    SC  C         LC  +
Sbjct: 806  DACKAQLACEAVHMATGGPVAEFERLLHFCSPVICKSLSSHSCSACSHNHGGGASLCRHE 865

Query: 840  TPCISLGSLWQWYEKPGSYGLEVKVGAYNNLKRCQ--SHYGFRAYFVPYLSAVQLFGRRS 1013
             P +SLG LWQWYEK GSYGLE++   + N KR    + + FRAYFVP LSAVQLF    
Sbjct: 866  IPDLSLGCLWQWYEKHGSYGLEIRAQGHENPKRQGGVADFPFRAYFVPSLSAVQLFKNHE 925

Query: 1014 SSLCRTNESLKMMETSGMHESTSPSLGSLP-----IFSKLCPRPRKVADTFMSESASSCK 1178
            +      + L   E S   E    S  S       IFS L P+PR    +  +   ++  
Sbjct: 926  NLCVNNGDRLPNSEVSEACEMVDISANSSTASQHSIFSVLFPQPRNQDKSSQTPKETASI 985

Query: 1179 EEFCNQTAKSTHFSDGELIFEYFESGQPQQRRPLNEKIKELVEGNKSSNCQIFGDPAXXX 1358
                  +  ST   D EL+FEYFE  QPQQR+PL EKI+ELV G+       +GDP    
Sbjct: 986  NNASIPSINSTCSGDLELLFEYFEFEQPQQRQPLYEKIQELVRGHIPIESSTYGDPTKLD 1045

Query: 1359 XXXXXXXXPASWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFTRRRTSLNTSCGATSLVSP 1538
                    P SW+SVAWYPIYRIPDGN RA+FLTYHSLGH  RRRTS + S   + +VSP
Sbjct: 1046 SINLRDLHPRSWFSVAWYPIYRIPDGNFRASFLTYHSLGHLVRRRTS-DLSTVGSCIVSP 1104

Query: 1539 VVGLQSYNTKGECWXXXXXXXXXXXXTEGVDISHPSGILKERLRTLEQTASVMARASVQK 1718
             VGLQSYN +GECW             E V++  PS +LKERLRTLE+TAS+MARA V K
Sbjct: 1105 TVGLQSYNAQGECW---FQLKHSAPAAEMVNL-EPSLLLKERLRTLEETASLMARAVVNK 1160

Query: 1719 DGRRSANRQPDYEFFVSRK 1775
                  NR PDYEFF+SR+
Sbjct: 1161 GNLTCTNRHPDYEFFLSRR 1179


>ref|XP_003620160.1| hypothetical protein MTR_6g077930 [Medicago truncatula]
            gi|355495175|gb|AES76378.1| hypothetical protein
            MTR_6g077930 [Medicago truncatula]
          Length = 1107

 Score =  318 bits (814), Expect = 4e-84
 Identities = 212/577 (36%), Positives = 295/577 (51%), Gaps = 39/577 (6%)
 Frame = +3

Query: 162  NASHVPQSEVHHDSSFAKMTGDVP-----------DVSSVISNSKSE-----KCAKSEMG 293
            N +    SE+ H   F     D+            D+ S +S S  +     K   +++G
Sbjct: 560  NGAEQETSEIAHSEKFHADESDILKSSQETENGSIDIQSQVSCSDEQSQVSCKLLDNQVG 619

Query: 294  NCHEE-----YIKKNSSIGPLSQ-KWVPLGSKHPVVFKDNHGNSLVSSFTDSTLAANGLQ 455
               +E     Y  +N S G  +  KW+P+G K   + K +  NS  S ++D   +   + 
Sbjct: 620  QTVKEVSSADYNGQNHSSGSTALWKWIPVGKKDAGMAK-SESNSSSSQYSDEPTSK--II 676

Query: 456  GLNQIVEDTDCTLTEADDCTSNFRC--LARVEGE-------FAGVSAV------TDQHVT 590
             +   +E    +L++  D + + R   + R+EGE        AG           D H+ 
Sbjct: 677  DMENGLEPKSDSLSQNQDSSPDTRTTSIGRIEGENHKLGEEIAGSLTERMDKHQVDNHII 736

Query: 591  HEVMDKCVSEFHTDLNMIVKAVNDSYKLQIVSENVQLVYGSPIAEFESFLHFASPVLTQT 770
            +E   +C+ E   D   I +AVND+ ++Q+  + V  V G+P+AEFE  LHF SPV+ ++
Sbjct: 737  YECESQCLLE--NDSYRIAQAVNDACRVQLACDVVHKVTGAPVAEFEKLLHFCSPVICRS 794

Query: 771  DYIRSCKVCPQGQLPCEVLCPPQTPCISLGSLWQWYEKPGSYGLEVKVGAYNNLKRCQS- 947
                 C  C +  L    LC  + P +SLG LW+WYEK GSYGLE++   Y + K     
Sbjct: 795  PDSLGCFTCAKNHLIGVPLCRHEIPEVSLGCLWEWYEKHGSYGLEIRAWDYEDPKTLGGV 854

Query: 948  -HYGFRAYFVPYLSAVQLFGRRSSSLCRTNESLKMMETSGMHESTSPSLGSLPIFSKLCP 1124
             H+ FRAYFVP LSAVQLF  R S     N S+  +                   S+ C 
Sbjct: 855  GHFPFRAYFVPSLSAVQLFKNRESRC--VNNSVSFLNCK---------------VSEACE 897

Query: 1125 RPRKVADTFMSESASSCKEEFCNQTAKSTHFSDGELIFEYFESGQPQQRRPLNEKIKELV 1304
                  D+F+   +++      N +  ST   D EL+FEYFE  QPQQRRPL E+I+ELV
Sbjct: 898  MIDNSEDSFIGRFSNAS-----NPSTDSTCSGDSELLFEYFECEQPQQRRPLYERIQELV 952

Query: 1305 EGNKSSNCQIFGDPAXXXXXXXXXXXPASWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFT 1484
             G+     + +GD             P SWYSVAWYPIYRIPDGN RA+FLTYHSLGH  
Sbjct: 953  RGDVQIQSKTYGDATKLESINLRDLHPRSWYSVAWYPIYRIPDGNFRASFLTYHSLGHLV 1012

Query: 1485 RRRTSLNTSCGATSLVSPVVGLQSYNTKGECWXXXXXXXXXXXXTEGVDISHPSGILKER 1664
             R ++ ++    + +VSP VGLQSYN +GECW            TE + I +PS  L+ER
Sbjct: 1013 CRSSNSDSPTLDSCVVSPAVGLQSYNAQGECW---FQLNQSTRRTEMLGI-NPSVFLQER 1068

Query: 1665 LRTLEQTASVMARASVQKDGRRSANRQPDYEFFVSRK 1775
            LRTLE+TAS+MARA V K  +   NR PDYEFF+SR+
Sbjct: 1069 LRTLEETASLMARADVNKGNQTCTNRHPDYEFFLSRR 1105


>emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]
          Length = 1239

 Score =  313 bits (803), Expect = 8e-83
 Identities = 207/553 (37%), Positives = 285/553 (51%), Gaps = 18/553 (3%)
 Frame = +3

Query: 171  HVPQSEVHHDSSFAKMTGDVPDVSSVISNSKSEKCAKSEMGNCHEEYIKKNS---SIG-- 335
            H  QS VH      +   +V     +  NSK E  + S M    +   KKNS   S+G  
Sbjct: 711  HEGQSAVHLHPLIGEEVAEVDKEVYLSENSKQEHSSASVMKKW-KPVAKKNSGFASLGRS 769

Query: 336  ----------PLSQKWVPLGSKHPVVFKDNHGNSLVSSFTDSTLAANGLQGLNQIVEDTD 485
                      P ++ W P  S       ++H    +SS     +  +   G      + +
Sbjct: 770  DISLLAHADEPAAEGWTPKNSVEEKASSNSH--KPISSNDSEIMCVDHSFG------NAN 821

Query: 486  CTLTEADDCTSNFRCLARVEGEFAGVSAVTDQHVTHEVMDKCVSEFHTDLNMIVKAVNDS 665
            C+  E      N     ++  +   V+  T     H   +K +  F  D + I  A++D+
Sbjct: 822  CSSPEDKSPIQNTCTPKQLXNKHPAVNCFT-----HSCKEKHIYAFGADSSKISGALHDA 876

Query: 666  YKLQIVSENVQLVYGSPIAEFESFLHFASPVLTQTDYIRSCKVCPQGQLPCEVLCPPQTP 845
            Y++Q +SE+VQL  G PIA+FE  LH ASP++ +++ ++ C+ C + ++    LC  + P
Sbjct: 877  YRVQQLSESVQLATGCPIADFERLLHAASPIICRSNSVKICQTCVRDEVG-RPLCRHEAP 935

Query: 846  CISLGSLWQWYEKPGSYGLEVKVGAYNNLKRCQS-HYGFRAYFVPYLSAVQLFGRRSSSL 1022
             I+L SLW+WYEK GSYGLEV++      KR    H  FRAYFVP LSAVQLF +  S  
Sbjct: 936  NITLRSLWKWYEKHGSYGLEVRLEDCEYSKRLGFYHSAFRAYFVPSLSAVQLFKKPRSHH 995

Query: 1023 CRTNESLKMMETSGMHESTSPSLGSLPIFSKLCPRPRKVADTF--MSESASSCKEEFCNQ 1196
                  +           +S ++G LPIFS L PRP     +F  +     S +    +Q
Sbjct: 996  MDNGPVVSRACEMSKTSQSSFNIGQLPIFSILFPRPCTEETSFSPLENQMHSSQVSSMSQ 1055

Query: 1197 TAKSTHFSDGELIFEYFESGQPQQRRPLNEKIKELVEGNKSSNCQIFGDPAXXXXXXXXX 1376
            +  +T   D EL+FEYFES QPQ R+PL EKIKELV G+  S  +++GDP          
Sbjct: 1056 SVDTTITDDSELLFEYFESDQPQLRKPLFEKIKELVSGDGPSWNKVYGDPTKLDSMNLDE 1115

Query: 1377 XXPASWYSVAWYPIYRIPDGNLRAAFLTYHSLGHFTRRRTSLNTSCGATSLVSPVVGLQS 1556
               +SWYSVAWYPIYRIPDG  RAAFLTYHS GH   R ++ ++      +VSPVVGLQS
Sbjct: 1116 LHHSSWYSVAWYPIYRIPDGEFRAAFLTYHSFGHLVHRSSTFDSHRKDACIVSPVVGLQS 1175

Query: 1557 YNTKGECWXXXXXXXXXXXXTEGVDISHPSGILKERLRTLEQTASVMARASVQKDGRRSA 1736
            YN +                TE      PS IL++RL+TLE TAS+MARA V K   +S 
Sbjct: 1176 YNAQ-----------PILSQTEETXNLKPSEILRKRLKTLEXTASLMARAEVSKGNLKSV 1224

Query: 1737 NRQPDYEFFVSRK 1775
            NR PDYEFF+SR+
Sbjct: 1225 NRHPDYEFFLSRQ 1237