BLASTX nr result

ID: Dioscorea21_contig00012020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00012020
         (1515 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255...   682   0.0  
ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809...   668   0.0  
ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|2...   666   0.0  
ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arab...   660   0.0  
ref|NP_566005.1| transcription termination factor domain-contain...   657   0.0  

>ref|XP_002279655.1| PREDICTED: uncharacterized protein LOC100255927 [Vitis vinifera]
          Length = 481

 Score =  682 bits (1759), Expect = 0.0
 Identities = 341/474 (71%), Positives = 395/474 (83%)
 Frame = +2

Query: 92   MLHLLRRNSRFRIPICISQTLTPIPSHLNASRFSSQSPAKPPEYEMPSVTWGVVQGRKER 271
            M+ LLRR         ++ TLT  P  L        S +K PEYEMPSVTWGVV GRKER
Sbjct: 1    MISLLRRTK------LLTLTLTSNPRTLRPFLRFLSSSSKFPEYEMPSVTWGVVLGRKER 54

Query: 272  LVSRVIISDYLKSIGIVPDEIEPLELPSTVDVMRERVEFLHRLGLTVDDLNAYPLVLACS 451
            LVSRVIISDYLK++GI+PDE+E +ELPSTV+VMRERVEFL +LG+T+D LN YPL+L CS
Sbjct: 55   LVSRVIISDYLKTLGIIPDELEQVELPSTVEVMRERVEFLQKLGVTIDHLNEYPLMLGCS 114

Query: 452  VRKNIIPVLGYLEKLGIPRSKLGEFVRNYPQXXXXXXXXXXXXXXKFLRGLDVERHDIPY 631
            VRKN+IPVLGYLEK+GIPRSKLGEFV NYPQ              KFLRGLDV++ DI Y
Sbjct: 115  VRKNMIPVLGYLEKIGIPRSKLGEFVVNYPQVLHASVVVELAPVVKFLRGLDVDKQDIGY 174

Query: 632  VLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQFPYLLGMRVGTKIKPLVDFL 811
            VL KYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQ+PY LGMRVGT IKP+VD+L
Sbjct: 175  VLMKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYPYFLGMRVGTVIKPIVDYL 234

Query: 812  LSLGIPKKILAKVLEKRTYILGYDLQETVKPNVEALLSFGVRSERLPSVIAQYPQILGLP 991
            +SLG+PKK+LA++ EKR Y+LGYDL+E +KPNV+ L+SFG+R E L SVIAQ+PQILGLP
Sbjct: 235  VSLGLPKKVLARMFEKRAYVLGYDLEECIKPNVDCLVSFGIRREALASVIAQFPQILGLP 294

Query: 992  LKAKLSSQQYFFNLKLRIDPDGFARALERMPQIVSLNQSIIMKPIEFLWGRGFSTEDVAK 1171
            LKAKLSSQQYFFNLKL+IDPDGFAR +ERMPQIVSLNQ++IMKP+EFL GRG    DVAK
Sbjct: 295  LKAKLSSQQYFFNLKLKIDPDGFARVIERMPQIVSLNQNVIMKPVEFLLGRGIPAVDVAK 354

Query: 1172 MFVKCPQLAAVRVELMKNSLYFCKSEMKRPMEELVEFPEYFTYSLESRIKPRYLVLASKG 1351
            M VKCPQL A+RVELMKN  YF KSEM R ++ELVEFPEYFTYSLESRIKPRY  L SKG
Sbjct: 355  MVVKCPQLVALRVELMKNGYYFFKSEMGRQVKELVEFPEYFTYSLESRIKPRYQRLQSKG 414

Query: 1352 IKCSLGWFLNCSDQRFEERIKAEYIDADTPGPSFAMGGKLEMPGSELVSEDEDE 1513
            ++ SL WFLNCSDQRFEER++A+YI+ +T GPSF MGGKL++PG+E+VS++EDE
Sbjct: 415  VRSSLDWFLNCSDQRFEERLQADYIEMETIGPSFCMGGKLQLPGNEVVSDEEDE 468


>ref|XP_003530919.1| PREDICTED: uncharacterized protein LOC100809590 [Glycine max]
          Length = 499

 Score =  668 bits (1723), Expect = 0.0
 Identities = 329/454 (72%), Positives = 383/454 (84%), Gaps = 1/454 (0%)
 Frame = +2

Query: 155  TPIPSHLNASRFSSQSPA-KPPEYEMPSVTWGVVQGRKERLVSRVIISDYLKSIGIVPDE 331
            T IP  L    + +QS A K PEYEMPSVTWGV+QGRKE+LVSRVII DYLK +GI+PDE
Sbjct: 33   TKIPKTLFRVYYGTQSSASKLPEYEMPSVTWGVIQGRKEKLVSRVIIFDYLKGLGIIPDE 92

Query: 332  IEPLELPSTVDVMRERVEFLHRLGLTVDDLNAYPLVLACSVRKNIIPVLGYLEKLGIPRS 511
            +  LELPSTVDVMRERVEFL +LGLTVDD+N YPL+L CSVRKN+IPVLGYLEK+GI R 
Sbjct: 93   LHDLELPSTVDVMRERVEFLQKLGLTVDDINNYPLMLGCSVRKNMIPVLGYLEKIGIARP 152

Query: 512  KLGEFVRNYPQXXXXXXXXXXXXXXKFLRGLDVERHDIPYVLQKYPELLGFKLEGTMSTS 691
            KLG FV+NYPQ              KFLRGLDVE+ DI YVLQKYPELLGFKLEGTMSTS
Sbjct: 153  KLGGFVKNYPQVLHASVIVELAPVVKFLRGLDVEKDDIGYVLQKYPELLGFKLEGTMSTS 212

Query: 692  VAYLVSIGVSPRDIGPMVTQFPYLLGMRVGTKIKPLVDFLLSLGIPKKILAKVLEKRTYI 871
            VAYLVSIGV+PRDIGPMVTQ+PYLLGMRVGT IKP++D+L+ LG+PKK+LA++LEKR Y+
Sbjct: 213  VAYLVSIGVNPRDIGPMVTQYPYLLGMRVGTVIKPMIDYLVDLGLPKKVLARMLEKRAYV 272

Query: 872  LGYDLQETVKPNVEALLSFGVRSERLPSVIAQYPQILGLPLKAKLSSQQYFFNLKLRIDP 1051
            LGYDL+ETVKPNVE L+SFGV  + L S+IAQYPQILGLPLKAKLS+QQYFF+LKL++DP
Sbjct: 273  LGYDLEETVKPNVECLISFGVGRDCLASIIAQYPQILGLPLKAKLSTQQYFFSLKLKVDP 332

Query: 1052 DGFARALERMPQIVSLNQSIIMKPIEFLWGRGFSTEDVAKMFVKCPQLAAVRVELMKNSL 1231
            +GFAR +E MPQ+VSL+Q +IMKP+EFL GR    +DVA M VKCPQL A+RVELMKNS 
Sbjct: 333  EGFARVVENMPQVVSLHQHVIMKPVEFLLGRTIPAQDVASMVVKCPQLVALRVELMKNSY 392

Query: 1232 YFCKSEMKRPMEELVEFPEYFTYSLESRIKPRYLVLASKGIKCSLGWFLNCSDQRFEERI 1411
            YF KSEM RP++ELVEFPEYFTYSLESRIKPRY  L SKGI+CSL W LNCSDQRFEER+
Sbjct: 393  YFFKSEMGRPLQELVEFPEYFTYSLESRIKPRYQRLKSKGIRCSLNWMLNCSDQRFEERL 452

Query: 1412 KAEYIDADTPGPSFAMGGKLEMPGSELVSEDEDE 1513
            +  YI+ ++ GP F MGGKLE+PG+ LVS++E+E
Sbjct: 453  QGHYIETESVGPRFCMGGKLELPGNGLVSDEEEE 486


>ref|XP_002327325.1| predicted protein [Populus trichocarpa] gi|222835695|gb|EEE74130.1|
            predicted protein [Populus trichocarpa]
          Length = 514

 Score =  666 bits (1719), Expect = 0.0
 Identities = 326/442 (73%), Positives = 387/442 (87%)
 Frame = +2

Query: 188  FSSQSPAKPPEYEMPSVTWGVVQGRKERLVSRVIISDYLKSIGIVPDEIEPLELPSTVDV 367
            FS+Q+ +K  EYEMPSVTWGVVQG+KE+LV+RVII DYLK +GI+PDE+E LELPSTV+V
Sbjct: 58   FSTQA-SKFHEYEMPSVTWGVVQGKKEKLVNRVIICDYLKGLGIIPDELESLELPSTVEV 116

Query: 368  MRERVEFLHRLGLTVDDLNAYPLVLACSVRKNIIPVLGYLEKLGIPRSKLGEFVRNYPQX 547
            M+ERVEFL R+GLT+DD+N YPL+L CSVRKNIIPVLGYLEK+GI RSKLGEFV++YPQ 
Sbjct: 117  MKERVEFLQRMGLTIDDINEYPLMLGCSVRKNIIPVLGYLEKIGISRSKLGEFVKSYPQV 176

Query: 548  XXXXXXXXXXXXXKFLRGLDVERHDIPYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPR 727
                         KFLRGLDV++ DI YVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPR
Sbjct: 177  LHASVVVELQPVIKFLRGLDVDKLDIGYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPR 236

Query: 728  DIGPMVTQFPYLLGMRVGTKIKPLVDFLLSLGIPKKILAKVLEKRTYILGYDLQETVKPN 907
            DIGPMVTQ+PYLLGMRVGT IKPLVD+L+SLG+PKKI+A++LEKR Y+LGYDLQETVKPN
Sbjct: 237  DIGPMVTQYPYLLGMRVGTMIKPLVDYLVSLGLPKKIVARMLEKRPYVLGYDLQETVKPN 296

Query: 908  VEALLSFGVRSERLPSVIAQYPQILGLPLKAKLSSQQYFFNLKLRIDPDGFARALERMPQ 1087
            V+ L+SFG+R E L S++AQYP ILGLPLKAKLSSQQYFFNLKL+IDP+ FAR +E+MPQ
Sbjct: 297  VDCLISFGIRREVLASIVAQYPPILGLPLKAKLSSQQYFFNLKLKIDPERFARVIEKMPQ 356

Query: 1088 IVSLNQSIIMKPIEFLWGRGFSTEDVAKMFVKCPQLAAVRVELMKNSLYFCKSEMKRPME 1267
            IVSLNQ++IMKP++FL  R   +EDVA M +KCPQL A+RV LMKNS YF KSEM RP++
Sbjct: 357  IVSLNQNVIMKPVQFLLERAIPSEDVATMVIKCPQLLALRVPLMKNSYYFFKSEMGRPLK 416

Query: 1268 ELVEFPEYFTYSLESRIKPRYLVLASKGIKCSLGWFLNCSDQRFEERIKAEYIDADTPGP 1447
            ELVEFPEYFTYSLESRIKPRY +L SKGI+ SL WFLNCSD+RFEER++ +YI++++ GP
Sbjct: 417  ELVEFPEYFTYSLESRIKPRYEMLKSKGIRSSLNWFLNCSDKRFEERLEGDYIESESLGP 476

Query: 1448 SFAMGGKLEMPGSELVSEDEDE 1513
            SF MGGKLE+PG E++S++EDE
Sbjct: 477  SFCMGGKLELPGCEILSDEEDE 498


>ref|XP_002881945.1| hypothetical protein ARALYDRAFT_903808 [Arabidopsis lyrata subsp.
            lyrata] gi|297327784|gb|EFH58204.1| hypothetical protein
            ARALYDRAFT_903808 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  660 bits (1703), Expect = 0.0
 Identities = 327/487 (67%), Positives = 395/487 (81%), Gaps = 11/487 (2%)
 Frame = +2

Query: 86   SQMLHLLRRNSRFRIPICISQTLTPIPSHLNASRFSSQ-----------SPAKPPEYEMP 232
            ++ L L+RR S    PI  ++T T I    N  +F +              +K PEYEMP
Sbjct: 9    NKFLALIRRQSLI-FPITSTETKTLINPDPNIPQFQNPCSIFRIAHYATQSSKFPEYEMP 67

Query: 233  SVTWGVVQGRKERLVSRVIISDYLKSIGIVPDEIEPLELPSTVDVMRERVEFLHRLGLTV 412
            +VTWGV+QG+KE+LV+RV I DYLK +GI+ DE+E +ELPST++VM ERVEFL +LGLT+
Sbjct: 68   TVTWGVIQGKKEKLVNRVKICDYLKGLGIITDELESIELPSTIEVMCERVEFLQKLGLTI 127

Query: 413  DDLNAYPLVLACSVRKNIIPVLGYLEKLGIPRSKLGEFVRNYPQXXXXXXXXXXXXXXKF 592
            DD+N YPL+L CSVRKN+IPVL YLEK+GI RSKLGEFV+NYPQ              KF
Sbjct: 128  DDINEYPLMLGCSVRKNLIPVLAYLEKIGISRSKLGEFVKNYPQVLHASVVVELAPVVKF 187

Query: 593  LRGLDVERHDIPYVLQKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQFPYLLGM 772
            LRGLDVE+ D+ YVL KYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQ+PYLLGM
Sbjct: 188  LRGLDVEKQDLGYVLMKYPELLGFKLEGTMSTSVAYLVSIGVSPRDIGPMVTQYPYLLGM 247

Query: 773  RVGTKIKPLVDFLLSLGIPKKILAKVLEKRTYILGYDLQETVKPNVEALLSFGVRSERLP 952
            RVGT IKPLVD+L+S+G+PKKI+A++LEKR YI+GY+L+ETVKPNV+ L+SFGV+ E LP
Sbjct: 248  RVGTMIKPLVDYLISIGLPKKIVARMLEKRAYIVGYNLEETVKPNVDCLISFGVKKELLP 307

Query: 953  SVIAQYPQILGLPLKAKLSSQQYFFNLKLRIDPDGFARALERMPQIVSLNQSIIMKPIEF 1132
             +IAQYPQILGLP+KAK+S+QQYFF+LKL+IDP+GFAR +E+MPQIVSL Q++IMKPIEF
Sbjct: 308  LLIAQYPQILGLPVKAKMSTQQYFFSLKLKIDPEGFARVVEKMPQIVSLKQNVIMKPIEF 367

Query: 1133 LWGRGFSTEDVAKMFVKCPQLAAVRVELMKNSLYFCKSEMKRPMEELVEFPEYFTYSLES 1312
            L GR F  ED+AKM V+CPQ+   RVELMKNS YF K+EM RPM+ELVE+PEYFTYSLES
Sbjct: 368  LLGRAFQVEDIAKMVVRCPQILCSRVELMKNSYYFYKTEMGRPMKELVEYPEYFTYSLES 427

Query: 1313 RIKPRYLVLASKGIKCSLGWFLNCSDQRFEERIKAEYIDADTPGPSFAMGGKLEMPGSEL 1492
            RIKPRY  L SKGI+ SL WFLNCSDQRFEER++  +ID DT GP F MGGKLEMPG E+
Sbjct: 428  RIKPRYQKLQSKGIRSSLNWFLNCSDQRFEERLQGNFIDPDTEGPMFDMGGKLEMPGGEI 487

Query: 1493 VSEDEDE 1513
            VS++E++
Sbjct: 488  VSDEEED 494


>ref|NP_566005.1| transcription termination factor domain-containing protein
            [Arabidopsis thaliana] gi|3212859|gb|AAC23410.1|
            expressed protein [Arabidopsis thaliana]
            gi|14532592|gb|AAK64024.1| unknown protein [Arabidopsis
            thaliana] gi|19310761|gb|AAL85111.1| unknown protein
            [Arabidopsis thaliana] gi|330255268|gb|AEC10362.1|
            transcription termination factor domain-containing
            protein [Arabidopsis thaliana]
          Length = 507

 Score =  657 bits (1695), Expect = 0.0
 Identities = 316/444 (71%), Positives = 384/444 (86%)
 Frame = +2

Query: 182  SRFSSQSPAKPPEYEMPSVTWGVVQGRKERLVSRVIISDYLKSIGIVPDEIEPLELPSTV 361
            + +++QS +K PEYEMP+VTWGV+QG+KE+LV+RV I DYLK +GI+ DE+E +ELPST+
Sbjct: 51   AHYATQS-SKFPEYEMPTVTWGVIQGKKEKLVNRVKICDYLKGLGIITDELESIELPSTI 109

Query: 362  DVMRERVEFLHRLGLTVDDLNAYPLVLACSVRKNIIPVLGYLEKLGIPRSKLGEFVRNYP 541
            +VM ERVEFL +LGLT+DD+N YPL+L CSVRKN+IPVL YLEK+GI RSKLGEFV+NYP
Sbjct: 110  EVMCERVEFLQKLGLTIDDINEYPLMLGCSVRKNLIPVLAYLEKIGISRSKLGEFVKNYP 169

Query: 542  QXXXXXXXXXXXXXXKFLRGLDVERHDIPYVLQKYPELLGFKLEGTMSTSVAYLVSIGVS 721
            Q              KFLRGLDVE+ D+ YVL KYPELLGFKLEGTMSTSVAYLVSIGVS
Sbjct: 170  QVLHASVVVELAPVVKFLRGLDVEKQDLGYVLMKYPELLGFKLEGTMSTSVAYLVSIGVS 229

Query: 722  PRDIGPMVTQFPYLLGMRVGTKIKPLVDFLLSLGIPKKILAKVLEKRTYILGYDLQETVK 901
            PRDIGPMVTQ+PYLLGMRVGT IKPLVD+L+S+G+PKKI+A++LEKR+YI+GY+L+ETVK
Sbjct: 230  PRDIGPMVTQYPYLLGMRVGTMIKPLVDYLISIGLPKKIVARMLEKRSYIVGYNLEETVK 289

Query: 902  PNVEALLSFGVRSERLPSVIAQYPQILGLPLKAKLSSQQYFFNLKLRIDPDGFARALERM 1081
            PNV+ L+SFGV+ E LP +IAQYPQILGLP+KAK+S+QQYFF+LKL+IDP+GFAR +E+M
Sbjct: 290  PNVDCLISFGVKKELLPLLIAQYPQILGLPVKAKMSTQQYFFSLKLKIDPEGFARVVEKM 349

Query: 1082 PQIVSLNQSIIMKPIEFLWGRGFSTEDVAKMFVKCPQLAAVRVELMKNSLYFCKSEMKRP 1261
            PQIVSL Q++IMKPIEFL GR F  ED+AKM V+CPQ+   RVELMKNS YF K+EM RP
Sbjct: 350  PQIVSLKQNVIMKPIEFLLGRAFQVEDIAKMVVRCPQILCSRVELMKNSYYFYKTEMGRP 409

Query: 1262 MEELVEFPEYFTYSLESRIKPRYLVLASKGIKCSLGWFLNCSDQRFEERIKAEYIDADTP 1441
            M+ELVE+PEYFTYSLESRIKPRY  L SKGI+ SL WFLNCSDQRFEER++  +ID DT 
Sbjct: 410  MKELVEYPEYFTYSLESRIKPRYQKLQSKGIRSSLNWFLNCSDQRFEERLQGNFIDPDTE 469

Query: 1442 GPSFAMGGKLEMPGSELVSEDEDE 1513
            GP+F MGGKLEMPG E+V+++E++
Sbjct: 470  GPTFDMGGKLEMPGGEIVTDEEED 493


Top