BLASTX nr result

ID: Dioscorea21_contig00005613 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005613
         (1727 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABA93957.1| NLI interacting factor-like phosphatase family pr...   261   3e-67
gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japo...   244   4e-62
ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [S...   232   2e-58
ref|XP_002297869.1| predicted protein [Populus trichocarpa] gi|2...   221   4e-55
ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal doma...   219   2e-54

>gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score =  261 bits (668), Expect = 3e-67
 Identities = 191/549 (34%), Positives = 273/549 (49%), Gaps = 31/549 (5%)
 Frame = -1

Query: 1577 FDKSVSLILEELDAITVEEVEKSFEASYLRLLKCFESLKQM--SSDEHTPVVDALVQQAF 1404
            FD+ V  ILEEL+ +++EE EKSFE +  RL  CFE+LK +   S    P++DALVQQAF
Sbjct: 202  FDQRVGSILEELEMVSIEEAEKSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAF 261

Query: 1403 ISIQTLHSVYSSGNLKNMDQAEKFLVRLLMRIKNQYHVLLTPEQMKEIDVLVQTLVFE-- 1230
            + I T+ +V +S ++   +Q +  L++LL  IKN+Y  +LTP+Q  E+D  V+ LVFE  
Sbjct: 262  VGIDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVFEDG 321

Query: 1229 --NSNKEREMKMNVDRSGSAKLAEKPVFDQ-RKPSFPNLELPVTPRNRV---LVDLHAKY 1068
              N+N       N        L+E+  F+     SF  +E+P   +NR+   L+DLHA Y
Sbjct: 322  KDNANGPNATSTNAAAPSGQVLSERLPFESGAGNSFSKVEIPA--KNRMVSPLLDLHADY 379

Query: 1067 DEGSLPSPTRDNPQPLPMLQPIGVGVSIGATTPSQPIASKNVNADDTSVHPYLTDALKAV 888
            DE SLPSPTRD+  P  + +PIG G       P +P   + V     S +    DALKAV
Sbjct: 380  DENSLPSPTRDSKPPFDVPKPIGYGAL--PMAPDRPSVLERVEPAKNSSYQSFNDALKAV 437

Query: 887  SSYQQKYSKTTFLQSNRLPSPTPSEDGKNNDD---DTRGEVXXXXXXXXXXXXXXXXXXX 717
              YQQK+ + +   S+ LPSPTPS DG  + D   D  GEV                   
Sbjct: 438  CYYQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNKIALPIVNQMPS 497

Query: 716  XXXXXXXXXSMMNLTVQPVAPTKTVVQMSCASTSVMKSAAKSRDPRLKFVNAEVGDSPDQ 537
                        +    P    K +      S  ++K+ AKSRDPRLKF+N + G   D 
Sbjct: 498  RPSTVSSNSD--SFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNRDTGGVADA 555

Query: 536  SKHTASLESTVLKNGPV--GALVNTRKHKVLDEPLLDDHHMKRRRNGVSDPRDVLMT--- 372
            ++     E    K+  +  G  +N+RK+K +DEP++D++ +KR R  + + RD+  T   
Sbjct: 556  NRRVNFAEPNPSKDRTMGGGVSINSRKNKAVDEPMVDENALKRSRGVIGNLRDMQPTGRG 615

Query: 371  --XXXXXXXXXXXXXSAQPNSRIYGGENVDVQRKPGNGEIVSDRRPDACMITKASNN--- 207
                             QPN     G N       GN  I +D    A  +   +NN   
Sbjct: 616  GWAKDGGNISSYSSDGFQPNQNTRLGNNT-----TGNHNIRTDSTL-ASNLNNTTNNSGT 669

Query: 206  ------AIELNSI--LSTAAAVSLPNLLKGIAVNPTMLVELLKMEQQRLAGGAQQKPADA 51
                  A + NS    S+A AVSLP +LK IAVNPTML++ ++MEQQ+++    Q+   A
Sbjct: 670  SPGIVQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTA 729

Query: 50   IQNSAINRT 24
                  N T
Sbjct: 730  SVGMTSNVT 738


>gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score =  244 bits (624), Expect = 4e-62
 Identities = 191/582 (32%), Positives = 273/582 (46%), Gaps = 64/582 (10%)
 Frame = -1

Query: 1577 FDKSVSLILEELDAITVEEVEK---------------------------------SFEAS 1497
            FD+ V  ILEEL+ +++EE EK                                 SFE +
Sbjct: 164  FDQRVGSILEELEMVSIEEAEKYGLMILLYGKVHVLDVFWCMIQLLRDPILMFCRSFEGA 223

Query: 1496 YLRLLKCFESLKQM--SSDEHTPVVDALVQQAFISIQTLHSVYSSGNLKNMDQAEKFLVR 1323
              RL  CFE+LK +   S    P++DALVQQAF+ I T+ +V +S ++   +Q +  L++
Sbjct: 224  CTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDMPKREQTKNMLLK 283

Query: 1322 LLMRIKNQYHVLLTPEQMKEIDVLVQTLVFE----NSNKEREMKMNVDRSGSAKLAEKPV 1155
            LL  IKN+Y  +LTP+Q  E+D  V+ LVFE    N+N       N        L+E+  
Sbjct: 284  LLFHIKNRYSDMLTPDQRDELDSRVRQLVFEDGKDNANGPNATSTNAAAPSGQVLSERLP 343

Query: 1154 FDQ-RKPSFPNLELPVTPRNRV---LVDLHAKYDEGSLPSPTRDNPQPLPMLQPIGVGVS 987
            F+     SF  +E+P   +NR+   L+DLHA YDE SLPSPTRD+  P  + +PIG G  
Sbjct: 344  FESGAGNSFSKVEIPA--KNRMVSPLLDLHADYDENSLPSPTRDSKPPFDVPKPIGYGAL 401

Query: 986  IGATTPSQPIASKNVNADDTSVHPYLTDALKAVSSYQQKYSKTTFLQSNRLPSPTPSEDG 807
                 P +P   + V     S +    DALKAV  YQQK+ + +   S+ LPSPTPS DG
Sbjct: 402  --PMAPDRPSVLERVEPAKNSSYQSFNDALKAVCYYQQKHGQKSNFASDDLPSPTPSGDG 459

Query: 806  KNNDD---DTRGEVXXXXXXXXXXXXXXXXXXXXXXXXXXXXSMMNLTVQPVAPTKTVVQ 636
              + D   D  GEV                               +    P    K +  
Sbjct: 460  DKSGDKGGDVFGEVSSFSASNKIALPIVNQMPSRPSTVSSNSD--SFAGGPPGYAKQIEN 517

Query: 635  MSCASTSVMKSAAKSRDPRLKFVNAEVGDSPDQSKHTASLESTVLKNGPV--GALVNTRK 462
                S  ++K+ AKSRDPRLKF+N + G   D ++     E    K+  +  G  +N+RK
Sbjct: 518  SVSGSNHLLKATAKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSKDRTMGGGVSINSRK 577

Query: 461  HKVLDEPLLDDHHMKRRRNGVSDPRDVLMT-----XXXXXXXXXXXXXSAQPNSRIYGGE 297
            +K +DEP++D++ +KR R  + + RD+  T                    QPN     G 
Sbjct: 578  NKAVDEPMVDENALKRSRGVIGNLRDMQPTGRGGWAKDGGNISSYSSDGFQPNQNTRLGN 637

Query: 296  NVDVQRKPGNGEIVSDRRPDACMITKASNN---------AIELNSI--LSTAAAVSLPNL 150
            N       GN  I +D    A  +   +NN         A + NS    S+A AVSLP +
Sbjct: 638  NT-----TGNHNIRTDSTL-ASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSAPAVSLPAM 691

Query: 149  LKGIAVNPTMLVELLKMEQQRLAGGAQQKPADAIQNSAINRT 24
            LK IAVNPTML++ ++MEQQ+++    Q+   A      N T
Sbjct: 692  LKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTASVGMTSNVT 733


>ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
            gi|241935397|gb|EES08542.1| hypothetical protein
            SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score =  232 bits (592), Expect = 2e-58
 Identities = 180/524 (34%), Positives = 258/524 (49%), Gaps = 21/524 (4%)
 Frame = -1

Query: 1574 DKSVSLILEELDAITVEEVEKSFEASYLRLLKCFESLK----QMSSDEHTPVVDALVQQA 1407
            D+ V  ILEEL+ +++EE EKSFE +  RL  CFE+LK    ++ +     +++ L+QQA
Sbjct: 179  DQRVGSILEELEMVSIEEAEKSFEGACGRLHTCFENLKPLFQELENGSPMAILEPLMQQA 238

Query: 1406 FISIQTLHSVYSSGNLKNMDQAEKFLVRLLMRIKNQYHVLLTPEQMKEIDVLVQTLVF-- 1233
            FI I TL +V  S NL   +Q +  L++ L  IKN+Y  +LTPEQ  E+D  V+ LVF  
Sbjct: 239  FIGIDTLTTVAISYNLPRSEQNKTTLLKSLFHIKNRYSDMLTPEQRDELDSRVRKLVFGE 298

Query: 1232 ----------ENSNKEREMKMNVDRSGSAKLAEKPVFDQRKPSFPNLELPVTPRNRVLVD 1083
                        +N    +  +   S S  L  +        S P LE+P   R   L+D
Sbjct: 299  KDNVSDPSTSSGTNAINVLAPSGQVSSSGGLPFESGAANPFSSLPRLEVP-AKRISPLLD 357

Query: 1082 LHAKYDEGSLPSPTRDNPQPLPMLQPIGVGVSIGATTPSQPIASKNVNADDTSVHPYLTD 903
            LHA YDE SLPSPTRDN  P P+ +PIG G       P +    + V     S++P L D
Sbjct: 358  LHADYDENSLPSPTRDNAPPFPVPKPIGFGAF--PMVPEKLSFPERVEPAKNSLYPSLND 415

Query: 902  ALKAVSSYQQKYSKTTFLQSNRLPSPTPS-EDGKNND--DDTRGEVXXXXXXXXXXXXXX 732
             LKAVSSYQQKY + +   S+ LPSPTPS ++GK+ D   D   EV              
Sbjct: 416  PLKAVSSYQQKYGQKSVFPSDDLPSPTPSGDEGKSADKGGDIFSEVSSFPVPKSIALPST 475

Query: 731  XXXXXXXXXXXXXXSMMNLTVQPVAPTKTVVQMSCASTSVMKSAAKSRDPRLKFVNAEVG 552
                            ++    P    K + Q        +K+A+KSRDPRL+F+N +  
Sbjct: 476  SQMPASQPSTVSSSG-ISYASGPPGFAKQIEQPVAGPNHAIKAASKSRDPRLRFLNRDSA 534

Query: 551  DSPDQSKHTASLESTVLKNGPV-GALVNTRKHKVLDEPLLDDHHMKRRRNGVSDPRDVLM 375
             + D ++     E   LK+G + GA V  RKHK +D+P +D++ +KR R G ++PRD+  
Sbjct: 535  GATDVNRRANFSE---LKDGNLGGASVGNRKHKAIDDPQVDENVLKRFRGGTANPRDLQP 591

Query: 374  TXXXXXXXXXXXXXSAQPNSRIYGGENVDVQRKPGNGEIVSDRRPDACMITKASNNAIEL 195
            T                PN  +    N+   R P N   ++ +       T         
Sbjct: 592  T--------------GNPNQLM----NI---RAPTNSSGINMKTLQPPQTTAPH------ 624

Query: 194  NSILSTAAAVSLPN-LLKGIAVNPTMLVELLKMEQQRLAGGAQQ 66
               +S A AV +P+ LLK IAVNPT+L+ L++ME Q+ +    Q
Sbjct: 625  ---VSAAPAVPVPSMLLKDIAVNPTLLMHLIQMEHQKKSASETQ 665


>ref|XP_002297869.1| predicted protein [Populus trichocarpa] gi|222845127|gb|EEE82674.1|
            predicted protein [Populus trichocarpa]
          Length = 1117

 Score =  221 bits (564), Expect = 4e-55
 Identities = 178/551 (32%), Positives = 274/551 (49%), Gaps = 27/551 (4%)
 Frame = -1

Query: 1574 DKSVSLILEELDAITVEEVEKSFEASYLRLLKCFESLKQM--SSDEHTPVVDALVQQAFI 1401
            +  V  I ++L++++V E EKSFEA  L+L K  ESLK++   +D   P  D LVQ  F+
Sbjct: 72   ENRVKSIRKDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLFM 131

Query: 1400 SIQTLHSVYSSGNLKNMDQAEKFLVRLLMRIKNQYHVLLTPEQMKEIDVLVQTLVFENSN 1221
            +I+ ++SV+ S N K  +Q +    R    + + Y    +P Q KE+       + EN N
Sbjct: 132  AIRVVNSVFCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEV-------LNENHN 184

Query: 1220 KEREMKMNVDRSG-SAKLAEKPVFDQRKPSFPNLELPVTP-----RNR----VLVDLHAK 1071
                     D +  S KL     F Q KP+  ++E P  P     ++R     L+DL   
Sbjct: 185  DSLAKTAGYDLTTMSEKLPAAETFVQNKPN-KSIEAPKPPGVPSFKSRGVLLPLLDLKKY 243

Query: 1070 YDEGSLPSPTRDNPQPLPMLQPIGVGVSIGATTPSQPIASKNVNADDTSVHPYLTDALKA 891
            +DE SLPSPT++   P P+ + + +G   G  +   P+      A++  +HPY TDALKA
Sbjct: 244  HDEDSLPSPTQETT-PFPVQRLLAIGD--GMVSSGLPVPKVTPVAEEPRMHPYETDALKA 300

Query: 890  VSSYQQKYSKTTFLQSNRLPSPTPSEDGKNNDDDTRGEVXXXXXXXXXXXXXXXXXXXXX 711
            VSSYQQK+++ +F  +N LPSPTPSE+  N D DT GEV                     
Sbjct: 301  VSSYQQKFNRNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPPVSDQKN 359

Query: 710  XXXXXXXSMM------NLTVQPVAPTKTVVQMSCASTSVMKSAAKSRDPRLKFVNAEVGD 549
                            +  ++ V PT+    +S   +S +K++AKSRDPRL++VN +   
Sbjct: 360  APPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSGPSSTIKASAKSRDPRLRYVNID-AC 418

Query: 548  SPDQSKHTASLESTVLKNGPVGALVNTRKHKVLDEPLLDDHHMKRRRNGVSD---PRDVL 378
            + D ++    + + + +  P GA+V ++KHK+ +E +LDD  +KR+RN   +    RD+ 
Sbjct: 419  ALDHNQRALPMVNNLPRVEPAGAIVGSKKHKI-EEDVLDDPSLKRQRNSFDNYGAVRDIE 477

Query: 377  MTXXXXXXXXXXXXXSAQPNSRIYGGENVDVQRKPGNGEIVSDRRPDACMITKASNNAIE 198
                             Q  ++    EN +V    G+G   S            SN    
Sbjct: 478  SMTGTGGWLEDTDMAEPQTVNKNQWAENSNVN---GSGNAQSP-------FMGISNITGS 527

Query: 197  LNSILSTAAAVSLPNLLKGIAVNPTMLVELLKM-EQQRLAGGAQQ---KPADAIQNSAIN 30
              + +++ A  SLP+LLK IAVNPTML+ +LKM +QQRLA   QQ    PA +  +  I+
Sbjct: 528  EQAQVTSTATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAKSTSHPPIS 587

Query: 29   RTT--AVPSLN 3
             T   A+P++N
Sbjct: 588  NTVLGAIPTVN 598


>ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Brachypodium distachyon]
          Length = 1259

 Score =  219 bits (558), Expect = 2e-54
 Identities = 176/556 (31%), Positives = 271/556 (48%), Gaps = 36/556 (6%)
 Frame = -1

Query: 1577 FDKSVSLILEELDAITVEEVEKSFEASYLRLLKCFESLKQM--SSDEHTPVVDALVQQAF 1404
            FD+ V  ILEEL+ +++EE EKSFE +  RL  CFE+LK +   S    P++DALVQQ F
Sbjct: 188  FDQRVGSILEELEMVSIEEAEKSFEGACERLRTCFENLKPLFLESGSPMPMLDALVQQGF 247

Query: 1403 ISIQTLHSVYSSGNLKNMDQAEKFLVRLLMRIKNQYHVLLTPEQMKEIDVLVQTLVF--- 1233
            + I T+ +V +S  +    Q ++ L++LL  ++N+Y  +LTP+Q  E+D  V+ L F   
Sbjct: 248  VGIDTITTVANSYAMPKRVQNKEMLLKLLFHLRNRYSDMLTPDQRVELDSRVRQLAFVDG 307

Query: 1232 ------ENSNKEREMKMNVDRSGSAKLAEKPVFDQRKPSFPNLELP---VTPRNRV---L 1089
                   N++        V  +G       P        F    LP      +NR+   L
Sbjct: 308  EENTDGPNASCSTNSTNVVVPTGQVPSERLPFESGATNPFSGSSLPWLETQTKNRMVSPL 367

Query: 1088 VDLHAKYDEGSLPSPTRDNPQPLPMLQPIGVGVSIGATTPSQPIASKNVNADDT--SVHP 915
            +DLHA +DE SLPSPTRDN     + +PIG G       P  P  S    A+ +  +++P
Sbjct: 368  LDLHADHDENSLPSPTRDNAPQFSVPKPIGFG-----AFPMGPDRSLTERAEPSKKNLYP 422

Query: 914  YLTDALKAVSSYQQKYSKTTFLQSNRLPSPTPSEDGKNNDD---DTRGEVXXXXXXXXXX 744
             + D+L  VSSY+QKYS+ +   ++ LPSPTPS DG  ++D   D  GE+          
Sbjct: 423  SVNDSLD-VSSYKQKYSQKSNFANDDLPSPTPSGDGDKSEDKDGDMFGEISSFSSSNKTA 481

Query: 743  XXXXXXXXXXXXXXXXXXSMMNLTVQPVAPTKTVVQMSCASTSVMKSAAKSRDPRLKFVN 564
                              +       P    K + Q        +K +AKSRDPRL+++N
Sbjct: 482  LPSVSQIPASRPSTVSSSN--GSFSGPPGYAKKIEQSVSGPNLALKPSAKSRDPRLRYLN 539

Query: 563  AEVGDSPDQSKHTASLESTVLKNGPVGALVNTRKHKVLDEPLLDDHHMKRRRNGVSDPRD 384
             + GD+   ++     E      G +G      KHK + +PL+D++ +KR R  + +PRD
Sbjct: 540  RDPGDA---NRCMNFAEPNASLGGTLG------KHKAVGQPLMDENMVKRARGSIGNPRD 590

Query: 383  VLMTXXXXXXXXXXXXXSAQPNSRIYGGENVDVQRK-PGNGEIVSDRR--PDACMITKAS 213
            + +                 P+ R+   +N  +  K  GN  + +D +   +   IT +S
Sbjct: 591  LQVPPGRDGSNISF-----YPSDRVQSNQNTRLDTKTTGNPNLRADSQLLSNVSSITNSS 645

Query: 212  ------NNAIELNSILSTAAA--VSLPNLLKGIAVNPTMLVELLKMEQQRLAGGAQQKPA 57
                   NA + +S+  T+AA  VSLP +LK IAVNPT+L+  ++MEQQ+ +    Q+  
Sbjct: 646  VTSTKTLNAGQPDSVPQTSAAPSVSLPAVLKDIAVNPTVLMHWIQMEQQKRSASEPQQTV 705

Query: 56   D---AIQNSAINRTTA 18
            +    I +  IN  TA
Sbjct: 706  NTLGGISSGMINNDTA 721

