BLASTX nr result

ID: Dioscorea21_contig00004738

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00004738
         (1879 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EAZ32303.1| hypothetical protein OsJ_16511 [Oryza sativa Japo...   514   e-143
gb|EAY95924.1| hypothetical protein OsI_17791 [Oryza sativa Indi...   514   e-143
ref|XP_002320619.1| predicted protein [Populus trichocarpa] gi|2...   505   e-140
ref|XP_003530850.1| PREDICTED: uncharacterized protein LOC100816...   499   e-138
ref|XP_002531084.1| GTP cyclohydrolase I, putative [Ricinus comm...   498   e-138

>gb|EAZ32303.1| hypothetical protein OsJ_16511 [Oryza sativa Japonica Group]
          Length = 483

 Score =  514 bits (1323), Expect = e-143
 Identities = 275/491 (56%), Positives = 339/491 (69%), Gaps = 16/491 (3%)
 Frame = -3

Query: 1766 MGALDDAHLDEELHCAV----------GLVPGSGSEMLSTREIEDAVKVLLHGLGEDSGR 1617
            MGAL++AHL   +               LV G G E  +   +E AV+ LL GLGED+ R
Sbjct: 1    MGALEEAHLAAAISACECECYEEEEEDDLVEGDG-EAAAADAMEPAVRALLLGLGEDARR 59

Query: 1616 EGLKKTPLRVAKAFLDGTRGYKLKVKDIVQGALFPEAGLENATGCAXXXXXXXXVRDIDM 1437
            EGL++TP RVAKAF DGTRGYK KVKDIVQGALFPE G++  TG A        VRDID+
Sbjct: 60   EGLRRTPKRVAKAFRDGTRGYKQKVKDIVQGALFPEVGVDKRTGSAGGTGGQVVVRDIDL 119

Query: 1436 FSYCESCLLPFSIRCHVGYIPSGQRVVGLSKLSRVADVFARRLQEPQRLADEISSALQSS 1257
            FSYCESCLLPFSI+ HVGY+PSG RVVGLSKLSRVADVFA+RLQ PQRLA E+  AL +S
Sbjct: 120  FSYCESCLLPFSIQFHVGYVPSGGRVVGLSKLSRVADVFAKRLQNPQRLASEVCGALHAS 179

Query: 1256 INPAGVAVALQCSHIQLPETLNCKANFKISSKLDMQGWVNASVFSSSGVFKNEDHPFWDD 1077
            I PAGVAVALQC HI LPE L CK          +QGW++ S  S SGVF+ E   FW+D
Sbjct: 180  IQPAGVAVALQCWHIPLPENLKCKT---------LQGWISTSHSSRSGVFEGESSSFWND 230

Query: 1076 FLALLKFKGIHMEETDPYHSLAQSWCPLRSLDILPCNG---RNLTNVKFSPKFGVTQTSM 906
            F ALLK +GI ME     HS + +WCPLRS D+  CNG   +  TN   SPK     ++M
Sbjct: 231  FSALLKLRGIDMERDS--HSASIAWCPLRSHDVPVCNGHCKKATTNGAISPKSVPAPSNM 288

Query: 905  IAAVTSIIEALGEDPSRKELMGTPSRFIHWLTNFKKSSFEMKL---SRNSLHMKTTNGVA 735
            ++AV+S++ +LGEDP RKEL+GTP R++ WL  F+  + ++KL   + N+L +  +    
Sbjct: 289  VSAVSSMLLSLGEDPFRKELVGTPQRYVQWLMKFRACNLDVKLNGFTLNNLSVYQSPAGD 348

Query: 734  GAEQNEMHTELSLPFCSQCEHHLLPFHGVVHVGYFGSNKGEVIQRCILQSLVHFYSVKLQ 555
             A+   +H+EL LPFC+QCEHHLLPF+GVVH+GY     GEVI R   Q+LVHFY  KLQ
Sbjct: 349  AADHRAIHSELHLPFCAQCEHHLLPFYGVVHIGYLDGGDGEVIDRSHFQALVHFYGCKLQ 408

Query: 554  VQERLTRQIAETVYSVLGTGVMVVVEASHTCMISRGIEKVGCTTATMALLGRFSTEPKAK 375
            VQER+TRQIAE VYSV   G +VVVEA+H CMISRGIEK+  +TAT+A+LG+F T+P AK
Sbjct: 409  VQERMTRQIAEAVYSVSHCGAIVVVEANHICMISRGIEKIRSSTATIAVLGQFLTDPSAK 468

Query: 374  TMFLQAISNHT 342
              FLQ + + T
Sbjct: 469  ARFLQNVVDTT 479


>gb|EAY95924.1| hypothetical protein OsI_17791 [Oryza sativa Indica Group]
          Length = 483

 Score =  514 bits (1323), Expect = e-143
 Identities = 275/491 (56%), Positives = 339/491 (69%), Gaps = 16/491 (3%)
 Frame = -3

Query: 1766 MGALDDAHLDEELHCAV----------GLVPGSGSEMLSTREIEDAVKVLLHGLGEDSGR 1617
            MGAL++AHL   +               LV G G E  +   +E AV+ LL GLGED+ R
Sbjct: 1    MGALEEAHLAAAISACECECYEEEEEDDLVEGDG-EAAAADAMEPAVRALLLGLGEDARR 59

Query: 1616 EGLKKTPLRVAKAFLDGTRGYKLKVKDIVQGALFPEAGLENATGCAXXXXXXXXVRDIDM 1437
            EGL++TP RVAKAF DGTRGYK KVKDIVQGALFPE G++  TG A        VRDID+
Sbjct: 60   EGLRRTPKRVAKAFRDGTRGYKQKVKDIVQGALFPEVGVDKRTGSAGGTGGQVVVRDIDL 119

Query: 1436 FSYCESCLLPFSIRCHVGYIPSGQRVVGLSKLSRVADVFARRLQEPQRLADEISSALQSS 1257
            FSYCESCLLPFSI+ HVGY+PSG RVVGLSKLSRVADVFA+RLQ PQRLA E+  AL +S
Sbjct: 120  FSYCESCLLPFSIQFHVGYVPSGGRVVGLSKLSRVADVFAKRLQNPQRLASEVCGALHAS 179

Query: 1256 INPAGVAVALQCSHIQLPETLNCKANFKISSKLDMQGWVNASVFSSSGVFKNEDHPFWDD 1077
            I PAGVAVALQC HI LPE L CK          +QGW++ S  S SGVF+ E   FW+D
Sbjct: 180  IEPAGVAVALQCWHIPLPENLKCKT---------LQGWISTSHSSRSGVFEGESSSFWND 230

Query: 1076 FLALLKFKGIHMEETDPYHSLAQSWCPLRSLDILPCNG---RNLTNVKFSPKFGVTQTSM 906
            F ALLK +GI ME     HS + +WCPLRS D+  CNG   +  TN   SPK     ++M
Sbjct: 231  FSALLKLRGIDMERDS--HSASIAWCPLRSHDVPVCNGHCKKATTNGAISPKSVPAPSNM 288

Query: 905  IAAVTSIIEALGEDPSRKELMGTPSRFIHWLTNFKKSSFEMKL---SRNSLHMKTTNGVA 735
            ++AV+S++ +LGEDP RKEL+GTP R++ WL  F+  + ++KL   + N+L +  +    
Sbjct: 289  VSAVSSMLLSLGEDPFRKELVGTPQRYVQWLMKFRACNLDVKLNGFTLNNLSVYQSPAGD 348

Query: 734  GAEQNEMHTELSLPFCSQCEHHLLPFHGVVHVGYFGSNKGEVIQRCILQSLVHFYSVKLQ 555
             A+   +H+EL LPFC+QCEHHLLPF+GVVH+GY     GEVI R   Q+LVHFY  KLQ
Sbjct: 349  AADHRAIHSELHLPFCAQCEHHLLPFYGVVHIGYLDGGDGEVIDRSHFQALVHFYGCKLQ 408

Query: 554  VQERLTRQIAETVYSVLGTGVMVVVEASHTCMISRGIEKVGCTTATMALLGRFSTEPKAK 375
            VQER+TRQIAE VYSV   G +VVVEA+H CMISRGIEK+  +TAT+A+LG+F T+P AK
Sbjct: 409  VQERMTRQIAEAVYSVSHCGAIVVVEANHICMISRGIEKIRSSTATIAVLGQFLTDPSAK 468

Query: 374  TMFLQAISNHT 342
              FLQ + + T
Sbjct: 469  ARFLQNVVDTT 479


>ref|XP_002320619.1| predicted protein [Populus trichocarpa] gi|222861392|gb|EEE98934.1|
            predicted protein [Populus trichocarpa]
          Length = 465

 Score =  505 bits (1301), Expect = e-140
 Identities = 275/483 (56%), Positives = 337/483 (69%), Gaps = 5/483 (1%)
 Frame = -3

Query: 1766 MGALDDAHLDEELHCAVGL-VPGSG-SEMLSTREIEDAVKVLLHGLGEDSGREGLKKTPL 1593
            M ALD+ H + EL   V L   G G  +   T  IEDAVKVLL GLGED  REGLKKTPL
Sbjct: 1    MSALDEGHFNAELENGVKLNCLGLGIQDQPETVAIEDAVKVLLQGLGEDINREGLKKTPL 60

Query: 1592 RVAKAFLDGTRGYKLKVKDIVQGALFPEAGLENATGCAXXXXXXXXVRDIDMFSYCESCL 1413
            RVAKA  +GT+GYK +VK+IVQGALFPE GL++  G A        VRD+D+FSYCESCL
Sbjct: 61   RVAKALREGTKGYKQRVKEIVQGALFPEVGLDDEVGQAGGAGGLVIVRDLDLFSYCESCL 120

Query: 1412 LPFSIRCHVGYIPSGQRVVGLSKLSRVADVFARRLQEPQRLADEISSALQSSINPAGVAV 1233
            LPF ++C +GY+PSGQRVVGLSKLSRVADVFA+RLQ+PQRLADEI SAL   + PAGVAV
Sbjct: 121  LPFQVKCQIGYVPSGQRVVGLSKLSRVADVFAKRLQDPQRLADEICSALHHGVMPAGVAV 180

Query: 1232 ALQCSHIQLPETLNCKANFKISSKLDMQGWVNASVFSSSGVFKNEDHPFWDDFLALLKFK 1053
             LQC HIQ P   N ++ F  S+    QGWV A V S SGVF+NE    W DFL+LLKF+
Sbjct: 181  VLQCLHIQFP---NIESLFLDSNH---QGWVKAVVHSGSGVFENELADVWGDFLSLLKFR 234

Query: 1052 GIHMEETDPYHSLAQSWCPLRSLDILPCNGRNLTNVKFSPKFGVTQTSMIAAVTSIIEAL 873
            GI++++T    S+ Q WCP           R  ++ K     G     M+ AVTSI+ +L
Sbjct: 235  GINLDKTQMKDSVQQCWCP----------SRYSSSAKV---IGPPNRGMVTAVTSILSSL 281

Query: 872  GEDPSRKELMGTPSRFIHWLTNFKKSSFEMKLSR---NSLHMKTTNGVAGAEQNEMHTEL 702
            GEDP RKEL+GTPSRF+ WL NF+  + EMKL+      +     NG     + +++TEL
Sbjct: 282  GEDPLRKELVGTPSRFVKWLMNFQSPNLEMKLNGVACGRMDPLKQNGEVSHNKQQIYTEL 341

Query: 701  SLPFCSQCEHHLLPFHGVVHVGYFGSNKGEVIQRCILQSLVHFYSVKLQVQERLTRQIAE 522
             L F SQCEHHLLPF+GVVH+GY+ + +   + + +LQS+VHFY  KLQVQERLTRQIAE
Sbjct: 342  CLSFWSQCEHHLLPFYGVVHIGYYCAEETTPLSKSLLQSIVHFYGFKLQVQERLTRQIAE 401

Query: 521  TVYSVLGTGVMVVVEASHTCMISRGIEKVGCTTATMALLGRFSTEPKAKTMFLQAISNHT 342
            TV S+LG  VMVVVEA+HTCMISRGIEK G +TAT+A+LGRFST+P A+ MFL+ I N  
Sbjct: 402  TVSSLLGGDVMVVVEANHTCMISRGIEKFGSSTATIAVLGRFSTDPAARAMFLKNIPNPA 461

Query: 341  ATG 333
            + G
Sbjct: 462  SGG 464


>ref|XP_003530850.1| PREDICTED: uncharacterized protein LOC100816351 [Glycine max]
          Length = 457

 Score =  499 bits (1285), Expect = e-138
 Identities = 269/481 (55%), Positives = 330/481 (68%), Gaps = 3/481 (0%)
 Frame = -3

Query: 1766 MGALDDAHLDEELHCAVGLVPGSGSEMLSTREIEDAVKVLLHGLGEDSGREGLKKTPLRV 1587
            MG L D     E+    G+  G G       E+EDAVKVLL GLGED  REGL+KTPLRV
Sbjct: 1    MGCLGDGRFAVEIRN--GVSNGCG-------EVEDAVKVLLEGLGEDVNREGLRKTPLRV 51

Query: 1586 AKAFLDGTRGYKLKVKDIVQGALFPEAGLENATGCAXXXXXXXXVRDIDMFSYCESCLLP 1407
            AKA  +GTRGY+ KVKDIVQGALFPEAGL+N  G A        VRD+D+FSYCESCLLP
Sbjct: 52   AKALREGTRGYRQKVKDIVQGALFPEAGLDNRVGHAGGAGGLVIVRDLDLFSYCESCLLP 111

Query: 1406 FSIRCHVGYIPSGQRVVGLSKLSRVADVFARRLQEPQRLADEISSALQSSINPAGVAVAL 1227
            F ++CHVGY+PSG+RVVGLSKLSRVADVFA+RLQEPQRLADE+ SAL   I PAGVA+ L
Sbjct: 112  FPVKCHVGYVPSGERVVGLSKLSRVADVFAKRLQEPQRLADEVCSALHRGIKPAGVAIIL 171

Query: 1226 QCSHIQLPETLNCKANFKISSKLDMQGWVNASVFSSSGVFKNEDHPFWDDFLALLKFKGI 1047
            QC+HI  P+         +    + QGWV   V S SGVF+N++   WDDF  LLKF+GI
Sbjct: 172  QCTHIHFPDI------EPVFLDSNHQGWVKILVSSGSGVFENKNADVWDDFFGLLKFRGI 225

Query: 1046 HMEETDPYHSLAQSWCPLRSLDILPCNGRNLTNVKFSPKFGVTQTSMIAAVTSIIEALGE 867
            +M++     S    WCP +S            + K S K G     M+ AV SIIE+LGE
Sbjct: 226  NMDKIHLRGSSDPCWCPSQS----------SLSAKVSSKIGPVNPVMVTAVASIIESLGE 275

Query: 866  DPSRKELMGTPSRFIHWLTNFKKSSFEMKLSR---NSLHMKTTNGVAGAEQNEMHTELSL 696
            DP RKEL+GTPSRF+ WL NF+ S+F+MKL+    + +     N      Q ++ +EL++
Sbjct: 276  DPLRKELIGTPSRFVKWLMNFQNSNFDMKLNGFLCDGIDSLNANEEVNVNQ-KITSELNI 334

Query: 695  PFCSQCEHHLLPFHGVVHVGYFGSNKGEVIQRCILQSLVHFYSVKLQVQERLTRQIAETV 516
            PF SQCEHHLLPFHGVVH+GY  S+    + + +LQS+VHFY  KLQVQERLTRQIAET+
Sbjct: 335  PFWSQCEHHLLPFHGVVHIGYLMSDGFNPMGKLLLQSIVHFYGFKLQVQERLTRQIAETI 394

Query: 515  YSVLGTGVMVVVEASHTCMISRGIEKVGCTTATMALLGRFSTEPKAKTMFLQAISNHTAT 336
              +LG  V+VVVEASHTCMISRGIEK G +TAT+A+LG FST P A+  FL++I   T++
Sbjct: 395  APLLGGDVIVVVEASHTCMISRGIEKFGSSTATIAVLGHFSTNPTARASFLESIPRPTSS 454

Query: 335  G 333
            G
Sbjct: 455  G 455


>ref|XP_002531084.1| GTP cyclohydrolase I, putative [Ricinus communis]
            gi|223529330|gb|EEF31298.1| GTP cyclohydrolase I,
            putative [Ricinus communis]
          Length = 469

 Score =  498 bits (1283), Expect = e-138
 Identities = 270/483 (55%), Positives = 330/483 (68%), Gaps = 5/483 (1%)
 Frame = -3

Query: 1766 MGALDDAHLDEELHCAVGL--VPGSGSEMLSTREIEDAVKVLLHGLGEDSGREGLKKTPL 1593
            MGALD+ H + EL   V L  +     E   T  IE+AV VLL GLGED  REGLKKTPL
Sbjct: 1    MGALDEGHFNLELENGVKLDCLELGFQEQTETLAIENAVSVLLQGLGEDINREGLKKTPL 60

Query: 1592 RVAKAFLDGTRGYKLKVKDIVQGALFPEAGLENATGCAXXXXXXXXVRDIDMFSYCESCL 1413
            RVAKA L G RGYK    DIV  ALFPE+GL+NA G A        VRD+D+FSYCESCL
Sbjct: 61   RVAKALLYGNRGYKQNANDIVHSALFPESGLDNAVGHAGGAGGLVIVRDLDLFSYCESCL 120

Query: 1412 LPFSIRCHVGYIPSGQRVVGLSKLSRVADVFARRLQEPQRLADEISSALQSSINPAGVAV 1233
            LPF ++CH+GY+PSGQRVVGLSKLSRVADVFA+RLQ PQRLA+EI SAL   I PAGVAV
Sbjct: 121  LPFQVKCHIGYVPSGQRVVGLSKLSRVADVFAKRLQGPQRLANEICSALHHGIKPAGVAV 180

Query: 1232 ALQCSHIQLPETLNCKANFKISSKLDMQGWVNASVFSSSGVFKNEDHPFWDDFLALLKFK 1053
             LQC HI  P       +  + S  + QG+V A V S SGVF+ E    W DFL+LLKF+
Sbjct: 181  ILQCLHIHFPSF----GSLLLDS--NHQGFVKALVHSGSGVFETETADTWCDFLSLLKFR 234

Query: 1052 GIHMEETDPYHSLAQSWCPLRSLDILPCNGRNLTNVKFSPKFGVTQTSMIAAVTSIIEAL 873
            GI++++     S+ Q WCP +S           ++ K   K G+    M++AVTSI+ ++
Sbjct: 235  GINVDKDHLKGSMEQCWCPSQS----------SSSSKILTKIGLPNPEMVSAVTSILTSI 284

Query: 872  GEDPSRKELMGTPSRFIHWLTNFKKSSFEMKLSR---NSLHMKTTNGVAGAEQNEMHTEL 702
            GEDP RKEL+GTPSRF+ WL NF  ++ EMKL+    N +     NG     + ++ +EL
Sbjct: 285  GEDPLRKELVGTPSRFVKWLMNFHNTNLEMKLNGFGCNRMDPLKANGGVSHNKEQLQSEL 344

Query: 701  SLPFCSQCEHHLLPFHGVVHVGYFGSNKGEVIQRCILQSLVHFYSVKLQVQERLTRQIAE 522
            +L F SQCEHHLLPF+GVVH+GYF +     I + +LQS+VHFY  KLQVQERLTRQIAE
Sbjct: 345  NLSFWSQCEHHLLPFYGVVHIGYFQAEGFNPIGKSLLQSIVHFYGFKLQVQERLTRQIAE 404

Query: 521  TVYSVLGTGVMVVVEASHTCMISRGIEKVGCTTATMALLGRFSTEPKAKTMFLQAISNHT 342
            T  S+LG  VMVVVEA+HTCMISRGIEK G  TAT+A+LGRFST+P ++ MFLQ+I N  
Sbjct: 405  TASSILGGNVMVVVEANHTCMISRGIEKFGSNTATIAVLGRFSTDPSSRAMFLQSIPNSA 464

Query: 341  ATG 333
            A G
Sbjct: 465  ACG 467

