BLASTX nr result

ID: Dioscorea21_contig00002312 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00002312
         (3147 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003633429.1| PREDICTED: uncharacterized protein LOC100852...   150   2e-33
ref|XP_003555628.1| PREDICTED: uncharacterized protein LOC100803...   145   7e-32
ref|XP_003535782.1| PREDICTED: uncharacterized protein LOC100817...   135   5e-29
ref|XP_002525074.1| hypothetical protein RCOM_0745050 [Ricinus c...   134   2e-28
ref|XP_002311130.1| predicted protein [Populus trichocarpa] gi|2...   131   1e-27

>ref|XP_003633429.1| PREDICTED: uncharacterized protein LOC100852618 [Vitis vinifera]
          Length = 324

 Score =  150 bits (380), Expect = 2e-33
 Identities = 118/321 (36%), Positives = 154/321 (47%), Gaps = 18/321 (5%)
 Frame = +2

Query: 1664 ISPGRCIARDISDSLPLNADDKRLRGLPDVMGDLPLRAQHRLQFEREENAYXXXXXXXXX 1843
            ISP RCI  D SD + L   +K +RGL D + + P+  + +  FE  E  +         
Sbjct: 9    ISPNRCIGEDASDLVGLRHSEKFIRGLRDDIVE-PVFTRQQPPFEGVEGHFVQGNRNFSS 67

Query: 1844 XXXXXXXXXXXXX--NECSPHQWSPPRR-SPNIYEGHPELGHIRSPPILTHDRMRSPRQR 2014
                              SP  WS PRR SP+ + GHPEL H RSP +   DRMRSP  R
Sbjct: 68   IQRRGPPRIHSKSPMRSGSPGPWSSPRRRSPDGFNGHPELTHRRSPAVYRMDRMRSP-DR 126

Query: 2015 PCFPEDVIVRRPGSPQSFMTRMSDDGREIHLRR----------NDCMRPDRVLERNMQRF 2164
            PCFPE+++ RR GSP  F+ R S+D R++   R          N      R+L RN +RF
Sbjct: 127  PCFPEEIVARRHGSP-PFLPRPSNDLRDMDSARDHGPPRSVIPNRRSPSGRILLRNSRRF 185

Query: 2165 DMIDPRXXXXXXXXXXXXXN--QFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASED 2338
            D+I+PR             +  +F EL  GGD   + RR   ERR  VR  R  +N +  
Sbjct: 186  DIIEPRERTDSDEFFGPPMHSGRFHEL--GGDGSGEERRRIGERRGPVRSFRPPYNGAGA 243

Query: 2339 EGSLHNHAGDGPPTFRFRPEAMEGFSERGG--SRDFNGHIQSRLGNVHDRLHGIDEREQY 2512
            EG   N   DGP  +RF PEA   F ERG    R+F+  +++R GN   R     E    
Sbjct: 244  EGFRFN-IEDGPRPYRFCPEADSEFLERGNLREREFDRRVKNRPGNAPRRSIEDQEGNYR 302

Query: 2513 HGRQGWSEADFAGV-RPKRRR 2572
            HG Q W +  F  + R KRRR
Sbjct: 303  HGEQVWHDQGFDDISRLKRRR 323


>ref|XP_003555628.1| PREDICTED: uncharacterized protein LOC100803295 [Glycine max]
          Length = 1378

 Score =  145 bits (366), Expect = 7e-32
 Identities = 194/664 (29%), Positives = 282/664 (42%), Gaps = 55/664 (8%)
 Frame = +2

Query: 746  LKMPLDEGVSEIKVENPEES-KIVDDSLCADERDSCISTADNQQEVSAVGCLXXXXXXXX 922
            ++  ++  V ++ +E  E S K++D S+C  E     S  D +  ++A G          
Sbjct: 757  IQSEINNEVVDMDIEMHERSGKVIDKSVCVQE-----SLDDEKSNIAAHGANVLQMKALD 811

Query: 923  XXXXXXXXXXXXXXXEKGSETKSNPKRDALPRTDKGGNDKLSSTPENSKGLDM---IPEK 1093
                              +E+ SN   +     D    D++  T +  K  D+     E 
Sbjct: 812  LLDGKNVCEALV------AESPSNQATNGSHGVDFQCADEVVKTADIVKQTDLDFETMEV 865

Query: 1094 HLPIKDASLNVNLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNKSPLMQSEEEKMTD 1273
                 DA+ +VN G          RII L+ AT  SS GK +    +S   ++  + ++D
Sbjct: 866  SANADDAAKDVNNGGNPG------RIIVLSRATSSSSPGKTRPISGRSLSSRAGRDVLSD 919

Query: 1274 RPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGKHRSDLIRMTERGNTHPDG----- 1438
                         RD    D   KF+ +R+QD      R + +R   R N+  D      
Sbjct: 920  S---LDGDKLHRGRDEVFIDGPHKFSRERHQDISPRNSRFNFVRGRGRLNSRLDSVRSEW 976

Query: 1439 ------SHVFH----RFK----------KSTGMRYPRHNNSSEIAYIPPGCRLVRKHGED 1558
                  S  F+    +F+            T M Y  +N + + +Y+  G RL RK   D
Sbjct: 977  ESDREFSGEFYNGPSQFRGPRPKYAPAFADTDMEY--NNVAPDGSYVGNG-RLGRKPLND 1033

Query: 1559 ESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRS--FISPGRCIARDISDSLPLNADDKR 1732
             S     +  RRRSPG  D       G+Q+  R+   ISP RCI  D SD + +  +DK 
Sbjct: 1034 GSY----IAPRRRSPGGRD-------GIQIGHRNPRNISPNRCIG-DGSDLVGVRHNDKF 1081

Query: 1733 LRGLPDVMGDLPLRAQHRLQ-----FEREENAYXXXXXXXXXXXXXXXXXXXXXXNE--- 1888
            +RGLP+   D         +     F R    +                      +    
Sbjct: 1082 MRGLPEDNMDAMFTRSQTFEGMDGRFTRGSRNFSSMQRRGPPRIRSKSPIRSRSRSPGPW 1141

Query: 1889 CSPHQWSPPRRSPNIYEGHPELGHIRSPPILTHDRMRSPRQRPCFPEDVIVRRPGSPQSF 2068
             SP + SP RRSP+ + GHPEL H RS P    DRMRSP  RP FP + +VRR GSP SF
Sbjct: 1142 SSPRRRSPRRRSPDGFGGHPELSHRRS-PFYRVDRMRSP-DRPVFPAERVVRRHGSP-SF 1198

Query: 2069 MTRMSDDGREI-HLRRNDCMRPDRVLERNMQRFDMIDPR---XXXXXXXXXXXXXNQFCE 2236
            M+R S+D R+I   R +   R  R+L RN +RFD++DPR                 +  E
Sbjct: 1199 MSRPSNDMRDIDSARDHGHPRSGRILIRN-RRFDVVDPRDRAENDDEYFGGPMHSGRLLE 1257

Query: 2237 LPGGGDERVDVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGPPTFRFRPEAMEGFS 2416
            L G G+   + RR   ERR  VR  R  +N +  E + H +A DGP  +RF  +  + F 
Sbjct: 1258 LSGEGNG--EDRRRFGERRGPVRSFRPPYNNNVGE-NFHLNAEDGPRHYRFCSDDSD-FH 1313

Query: 2417 ERGGS----RDFNGHIQSRLGNVHD-RLHGIDEREQYH------GRQGWSEADFAGV-RP 2560
            ERGG+    RDF+  I+ R  NV   R   +DE+E+        G Q WS+  F  + R 
Sbjct: 1314 ERGGNNIRERDFDRRIKGRPANVPPRRTRNMDEQEENFRHGGGGGGQVWSDDSFDDISRV 1373

Query: 2561 KRRR 2572
            KR+R
Sbjct: 1374 KRKR 1377


>ref|XP_003535782.1| PREDICTED: uncharacterized protein LOC100817471 [Glycine max]
          Length = 1396

 Score =  135 bits (341), Expect = 5e-29
 Identities = 202/781 (25%), Positives = 313/781 (40%), Gaps = 53/781 (6%)
 Frame = +2

Query: 326  TSSLPVEHVGSDDKGSITTNDLSSVIGTTGSNDEKLATHETSV-IENNEKKNSSYQFDKS 502
            TSS+  E   + D+ +     ++     +  N E  A+ E  + +  +  ++ SY  D  
Sbjct: 615  TSSVSTEEENAADRDACRLKLMNEPPPASRGNGEGCASDEEKITLSTDMLEDDSYDSDSE 674

Query: 503  S--HHAITETDEHEHKTHKSGDVDG-IKKPA-PKXXXXXXXXXXXXXXXPNLRAEGGDRV 670
            S  +HA+T   + E         DG +++P  P                 N   +  ++ 
Sbjct: 675  SDENHAVTIAVDTECYVEDDDYEDGEVREPLDPSTAEDVCEVREVEHPDSNFVNKQMEKG 734

Query: 671  LPGEDSSEKREANQLDGSTAVLDVPLKMPLDEGVSEIKVENPEES-KIVDDSLCADER-- 841
            +   D     +  + +  TA+     +  ++  V ++ +E  E S K+VD ++C  E   
Sbjct: 735  MVSGDCPTSYQVVEKNNMTAI-----QSEINNEVVDMDIEMHERSGKVVDKNVCVQESLD 789

Query: 842  -DSCISTADNQQEVSAVGCLXXXXXXXXXXXXXXXXXXXXXXXEKGSETKSNPKRDALPR 1018
             + C       + V+ +                            GS        D + +
Sbjct: 790  DEKCNIATHGNKPVNVLQMKALDLLEGKNVCEALVTESPSNQATNGSHGVDVQCADEVVK 849

Query: 1019 TDKGGNDKLSSTPENSKGLDMIPEKHLPIKDASLNVNLGKQESGKEKQSRIIKLTSATDR 1198
            T     D +  T  + + +++        KD +   NLG          RII L+ AT  
Sbjct: 850  T----TDIVKQTDLDFETMEVSANADDAAKDVNNGGNLG----------RIIDLSRATSS 895

Query: 1199 SSYGKVKSNDNKSPLMQSEEEKMTDRPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYS 1378
            SS GK +    +S   ++  + ++D             RD    D   KF+ +R+QD   
Sbjct: 896  SSPGKTRPMSGRSLSSRAGRDVLSDT---LDGDKLHRGRDEVYIDGPHKFSRERHQDISP 952

Query: 1379 GKHRSDLIRMTERGNTHPDG-----------SHVFH----RFK----------KSTGMRY 1483
             K R + +R   R N   D            S  F+    +F+            T M Y
Sbjct: 953  RKTRMNFVRGRGRLNNRLDSVRNDWESDREFSGEFYNGPSQFRGPRPKYASAFADTDMEY 1012

Query: 1484 PRHNNSSEIAYIPPGCRLVRKHGEDESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRS- 1660
              +N + + +Y+  G RL RK   D S     +  RRRS G  D       G+Q+  R+ 
Sbjct: 1013 --NNVAPDGSYVGNG-RLGRKPLNDGSY----IAPRRRSSGGRD-------GIQIGHRNP 1058

Query: 1661 -FISPGRCIARDISDSLPLNADDKRLRGLPDVMGDLPLRAQHRLQ-----FEREENAYXX 1822
              ISP RCI  D SD + +  ++K +R LP+   D         +     F R    +  
Sbjct: 1059 RNISPNRCIG-DGSDLVGVRHNEKFMRSLPEDNMDAMFTRPQTFEGMDGRFTRGSRNFSS 1117

Query: 1823 XXXXXXXXXXXXXXXXXXXXNE---CSPHQWSPPRRSPNIYEGHPELGHIRSPPILTHDR 1993
                                +     SP + SP RRSP+ + GHPEL H RS P    DR
Sbjct: 1118 MQRRGPPQIRSKSPIRSRSRSPGPWSSPRRRSPRRRSPDGFGGHPELTHRRS-PFYRVDR 1176

Query: 1994 MRSPRQRPCFPEDVIVRRPGSPQSFMTRMSDDGREI-HLRRNDCMRPDRVLERNMQRFDM 2170
            MRSP  RP FP + +VRR GSP SFM+R S+D R++   R +   R  R+L RN +RFD+
Sbjct: 1177 MRSP-DRPVFPAERVVRRHGSP-SFMSRPSNDMRDMDSARDHGHPRSGRILIRN-RRFDV 1233

Query: 2171 IDPR---XXXXXXXXXXXXXNQFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASEDE 2341
            +DPR                 +  EL G G+   + RR   ERR  VR  R  +N +   
Sbjct: 1234 VDPRDRVDNDDEYFGGPMHSGRLLELSGEGNG--EDRRRFGERRGPVRSFRPPYNNNNVG 1291

Query: 2342 GSLHNHAGDGPPTFRFRPEAMEGFSERGGS----RDFNGHIQSRLGNVHD-RLHGIDERE 2506
             S H +A DGP  +RF  +  + F ERGG+    RDF   I+ R  NV   R   +DE+E
Sbjct: 1292 ESFHLNAEDGPRHYRFCSDDSD-FHERGGNNLRERDFERRIKGRPANVPPRRTRNMDEQE 1350

Query: 2507 Q 2509
            +
Sbjct: 1351 E 1351


>ref|XP_002525074.1| hypothetical protein RCOM_0745050 [Ricinus communis]
            gi|223535655|gb|EEF37321.1| hypothetical protein
            RCOM_0745050 [Ricinus communis]
          Length = 1517

 Score =  134 bits (337), Expect = 2e-28
 Identities = 156/525 (29%), Positives = 229/525 (43%), Gaps = 30/525 (5%)
 Frame = +2

Query: 1088 EKHLPIKDASLNV-NLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNKSPLMQSEEEK 1264
            E  LP  +  +N  N  K  +    QSRII L+ A++ SS+GK +S  +K   ++S  E+
Sbjct: 961  ESALPKMETLINGDNAPKDANSGGNQSRIINLSIASNMSSFGKTRSISSKPLSLRSGRER 1020

Query: 1265 MTDRPFM--RIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGK-----------HRSDLIR 1405
            + D P    R+ P     RD    D  QKFT +R Q+  + +            R D +R
Sbjct: 1021 L-DVPLEGDRLHP---RGRDEAYNDGSQKFTRERYQESRNSRWNFIHGRGRLASRIDSLR 1076

Query: 1406 MTERGNTHPDGSHVFHRFKKSTGMRYPRHNNSSEIAYIPPGCRLVRKHGEDESSNLTRLP 1585
                        H +      +   +  +N  S+  +   G R  RK  +D++       
Sbjct: 1077 NDRDSERDCIPRHKYATAVAGSDTEFVNYNMGSDGVFAG-GVRGGRKLVDDDTPIFRHFS 1135

Query: 1586 SRRRSPGPVDREGHPVMGVQVAGRSFISPGRCIARDISDSLPLNADDKRLRGLPDVMGDL 1765
            SRRRSPG   R+G    G+Q+  R      R I  D S+ + L   +K +RG PD  G+ 
Sbjct: 1136 SRRRSPGR--RDGPASRGLQMVRRV----PRSIDEDNSEVVGLRHTEKIMRGFPDD-GEE 1188

Query: 1766 PLRAQHRLQFEREENAYXXXXXXXXXXXXXXXXXXXXXX-NECSPHQWSPPRRSPNIYEG 1942
               +  +  +E  +  +                          SP  WS  RRSP+ + G
Sbjct: 1189 HSYSHTQPPYEGLDGPFVQGTRSFSVQRRGLPQMHSKSPIRSRSPGPWSSRRRSPDGFVG 1248

Query: 1943 HPELGHIRSPPILTHDRMRSPRQRPCFPEDVIVRRPGSPQSFMTRMSDDGREIHLRRNDC 2122
             PEL H RS P+   +RMRSP   P FP D + RR  SP S+++R  +D RE+   R D 
Sbjct: 1249 PPELPHRRS-PLYRMERMRSP-DNPGFPADRVGRRHSSP-SYLSR-PNDLREMDPSR-DH 1303

Query: 2123 MRPDRVLE-----------RNMQRFDMIDPRXXXXXXXXXXXXXN--QFCELPGGGDERV 2263
              P  ++            R  +RF + DPR             +  +F EL G G+E  
Sbjct: 1304 GHPRSIISNRSPTGRGGLLRGSRRFGIGDPRERPENEEFFAGPVHSGRFHELGGDGNEE- 1362

Query: 2264 DVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGPPTFRFRPEAMEGFSERGG--SRD 2437
              RR   ERR+ VR  R  FN ++ E + + +  DGP +FRF PE    F ER     R+
Sbjct: 1363 --RRRFGERRAPVRSFRPPFNGTDGE-NFNFNTEDGPRSFRFYPEVDPDFHERPNLRERE 1419

Query: 2438 FNGHIQSRLGNVHDRLHGIDEREQYHGRQGWSEADFAGVRPKRRR 2572
            F+  I++R GN   R   I+E+E  + R G   A+   V P   R
Sbjct: 1420 FDRRIKNRPGNAPRRPRSIEEQEGNY-RHGGQMAELVVVIPAEYR 1463


>ref|XP_002311130.1| predicted protein [Populus trichocarpa] gi|222850950|gb|EEE88497.1|
            predicted protein [Populus trichocarpa]
          Length = 1370

 Score =  131 bits (330), Expect = 1e-27
 Identities = 161/545 (29%), Positives = 235/545 (43%), Gaps = 45/545 (8%)
 Frame = +2

Query: 1058 ENSKGLDMIPEKHLPIKDASLNV-NLGKQESGKEKQSRIIKLTSATDRSSYGKVKSNDNK 1234
            EN K  + + +  LP  +ASLN  ++ K  S    +SRII L  A++ SS GK +S   +
Sbjct: 796  ENIK-TNYMEKNELPELEASLNGGDMAKDVSSS--RSRIINLPRASNSSSPGKTRSISGR 852

Query: 1235 SPLMQSEEEKMTDRPFMRIKPYFRETRDGFCKDRYQKFTSDRNQDQYSGKHRSDLIRM-- 1408
                 S +E++ D P    K +  + RD    D  ++F+ DR+Q+ +    R + +R   
Sbjct: 853  P--FSSYQERLPDGPLEGGKLH-PQGRDEIYIDGPRRFSRDRHQEHFPRNSRMNFVRGRG 909

Query: 1409 -------TERGNTHPDGSHVFHRFKKSTGMRYPRHNNSSEIA----------YIPPG--- 1528
                   T RG+   + ++    +  S+     RH  +S  A            P G   
Sbjct: 910  RISSRIDTLRGDRDSERNYASEFYNGSSDFAVRRHKYASAAAEADSESINYNIAPDGSFV 969

Query: 1529 --CRLVRKHGEDESSNLTRLPSRRRSPGPVDREGHPVMGVQVAGRSFISPGRCIARDISD 1702
               R  RK  +DE+     +PSRRRSP    R+     G+Q+  R      R I  + S+
Sbjct: 970  GTARGGRKLLDDETPVFRNVPSRRRSPE--GRDVPAARGIQMVHRV----PRNIGEEGSE 1023

Query: 1703 SLPLNADDKRLRGLPDVMGDLPLRAQ---------HRLQFEREENAYXXXXXXXXXXXXX 1855
             +     +  +RG PD   +   R           H +Q  R  ++              
Sbjct: 1024 VIGARHTEN-MRGFPDDGTEQAFRRPQPSYEGLDGHFVQGTRNYSSVHRRALPQFRSKSP 1082

Query: 1856 XXXXXXXXXNECSPHQWSPPRR-SPNIYEGHPELGHIRSPPILTHDRMRSPRQRPCFPED 2032
                        SP  WS  RR SP+ + G  EL + RSP I +  R+RSP   P FP +
Sbjct: 1083 IRSR--------SPGPWSSARRRSPDGFGGTSELSNRRSP-IYSMGRIRSP-DHPGFPRE 1132

Query: 2033 VIVRRPGSPQSFM----TRMSDDGREIHLRRNDCMRPDRVLERNMQRFDMIDPRXXXXXX 2200
            ++VRR GSP        TR +D G    +  N   +  RV  RN +RF + DPR      
Sbjct: 1133 MVVRRHGSPPFLSRPPDTRETDPGHSRSIISNRG-QTGRVFLRNSRRFGITDPRERADSD 1191

Query: 2201 XXXXXXXN--QFCELPGGGDERVDVRRICEERRSFVRPIRQRFNASEDEGSLHNHAGDGP 2374
                   +  +F +L  GGD  V+ RR   ERR  VR  +  FN +  E + H +  DGP
Sbjct: 1192 EFFGGPIHSGRFHDL--GGDGNVEDRRRFSERRGPVRSFKPPFNGAGSE-NFHLNPEDGP 1248

Query: 2375 PTFRFRPEAMEGFSERGG--SRDFNGHIQSRLGNVHDRLHGIDERE--QYHGRQGWSEAD 2542
              FRF PE    F ER     R+F+G I++R GN   R  GI+E+E    HGRQ   +  
Sbjct: 1249 RPFRFFPEDNPEFHERTNLREREFDGRIRNRPGNAPRRPRGIEEQEGNYRHGRQATYDCC 1308

Query: 2543 FAGVR 2557
              G R
Sbjct: 1309 CVGWR 1313

