BLASTX nr result

ID: Dioscorea21_contig00027992 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00027992
         (1347 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAK98741.1|AC090485_20 Hypothetical protein similar to putati...    78   6e-12
emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabid...    76   2e-11
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...    76   2e-11
emb|CAN64680.1| hypothetical protein VITISV_016601 [Vitis vinifera]    74   7e-11
ref|XP_002461200.1| hypothetical protein SORBIDRAFT_02g042745 [S...    74   1e-10

>gb|AAK98741.1|AC090485_20 Hypothetical protein similar to putative retroelements [Oryza sativa
            Japonica Group] gi|27497213|gb|AAO17357.1| Putative
            retroelement [Oryza sativa Japonica Group]
            gi|108706160|gb|ABF93955.1| retrotransposon protein,
            putative, unclassified [Oryza sativa Japonica Group]
            gi|125584905|gb|EAZ25569.1| hypothetical protein
            OsJ_09395 [Oryza sativa Japonica Group]
          Length = 387

 Score = 77.8 bits (190), Expect = 6e-12
 Identities = 43/143 (30%), Positives = 66/143 (46%)
 Frame = -2

Query: 1076 TWNGWNYLWKLNTSPRAKFFAWLVLHGRLQTLEFLRSIRIISNSTCALCGLHKENIEHLF 897
            TW  W+ +W+    PR KFFAWL+   RL T   L    I+   TC LC  + E+  H+F
Sbjct: 222  TWPPWSAIWRSAAPPRVKFFAWLMSKNRLPTRVNLHKKTILPTPTCELCNANLEDTYHIF 281

Query: 896  NSCSKTLCLWRIIEGITYSSFFTDLNVLNGHWLDNPCSPNGFWKASVIVNVIWNLWKARC 717
              C      W +I+ I   S  +DL+ L    L  P     F  ++  +   W LW  R 
Sbjct: 282  LRCPMAAAFWNMIQIIPEISSLSDLHNLE---LSGPLPT--FLSSTFFLLCCWRLWNHRN 336

Query: 716  ALIFKHQPLDLLWIAHASLNQVK 648
             ++F++ P  +  + ++ L   K
Sbjct: 337  EVVFQNLPPSISRLINSCLQDAK 359


>emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
            gi|7267666|emb|CAB78094.1| RNA-directed DNA
            polymerase-like protein [Arabidopsis thaliana]
          Length = 1274

 Score = 76.3 bits (186), Expect = 2e-11
 Identities = 92/376 (24%), Positives = 139/376 (36%), Gaps = 14/376 (3%)
 Frame = -2

Query: 1295 NEELSLHDFIVEG--NWNFSSIHNFLSVHFSPPKPVLSSIDCNSV---NSWVWSQGYMRQ 1131
            N++LS+HD I     +WN  +I      H    +  +  I  N++   +S VW      +
Sbjct: 891  NKDLSVHDLICHDVKSWNVEAIRK----HLPQYEDQIRKITINALPLQDSLVWLPVKSGE 946

Query: 1130 NIVKEVYS--HFNSDHTKDFTWNGWNYLWKLNTSPRAKFFAWLVLHGRLQTLEFLRSIRI 957
               K  Y+    NS       +N    +WK++TSP+ K F W  + G L   E L    I
Sbjct: 947  YTTKTGYALAKLNSFPASQLDFNWQKNIWKIHTSPKVKHFLWKAMKGALPVGEALSRRNI 1006

Query: 956  ISNSTCALCGLHKENIEHLFNSCSKTLCLWRIIEGITYSSFFTDLNV---LNGHWLDNPC 786
             +  TC  CG   E+  HL   C     +W +   +   S  T  +V   L         
Sbjct: 1007 EAEVTCKRCG-QTESSLHLMLLCPYAKKVWELAPVLFNPSEATHSSVALLLVDAKRMVAL 1065

Query: 785  SPNGFWKASVIVNVIWNLWKARCALIFKHQPLD---LLWIAHASLNQVKEFSLKSDHLRD 615
             P G   A +   ++W+LWKAR  LIF +       L+  A        E  L   H   
Sbjct: 1066 PPTGLGSAPLYPWLLWHLWKARNRLIFDNHSCSEEGLVLKAILDARAWMEAQLLIHHPSP 1125

Query: 614  VFFLNACTMSSPNYCLFTDASWLMDSLSGGCGFFISKSY-LSIAVAGCSNCISNSXXXXX 438
            +    + T +      F DA+W       G G+F+   Y + I     S+    S     
Sbjct: 1126 ISDYPSPTPNLKVTSCFVDAAWTTSGYC-GMGWFLQDPYKVKIKENQSSSSFVGSALMAE 1184

Query: 437  XXXXXXXLKTAVASGFHINTIFSDCAGLHLAITKPNTVTAWXXXXXXXXXXXXXXIDLVQ 258
                   L  A+++G     +FSDC  L   +    ++                   L  
Sbjct: 1185 TLAVHLALVDALSTGVRQLNVFSDCKELISLLNSGKSIVELRGLLHDIRELSVSFTHLC- 1243

Query: 257  IIVIPRTWNLLADKLA 210
               IPR  N++AD LA
Sbjct: 1244 FFFIPRLSNVVADSLA 1259


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score = 75.9 bits (185), Expect = 2e-11
 Identities = 95/410 (23%), Positives = 159/410 (38%), Gaps = 33/410 (8%)
 Frame = -2

Query: 1340 IPIQY-KPTFVNMNILNEELSLHDFIVEGNWNFSSIHNFLSVHFSPPK-PVLSSIDCNSV 1167
            IP Q+ +P     +I++  L +   +++   NF +I + L   F P   P++S++   + 
Sbjct: 724  IPAQFPRPAKYGGSIVDPSLKVKS-LIDSRSNFWNI-DLLKELFDPEDVPLISALPIGNP 781

Query: 1166 N---SWVWSQGYMRQNIVKEVYSHFNSDHTKDFTWNGWN------YLWKLNTSPRAKFFA 1014
            N   +  W         VK  Y     D  +  T  G +      Y+WK+   P+ + F 
Sbjct: 782  NMEDTLGWHFTKAGNYTVKSGYHTARLDLNEGTTLIGPDLTTLKAYIWKVQCPPKLRHFL 841

Query: 1013 WLVLHGRLQTLEFLRSIRIISNSTCALCGLHKENIEHLFNSCSKTLCLWRIIEGITYSSF 834
            W +L G +   E LR   I+ +  C  CG  +E+I H    C     +W + +  T    
Sbjct: 842  WQILSGCVPVSENLRKRGILCDKGCVSCGASEESINHTLFQCHPARQIWALSQIPTAPGI 901

Query: 833  FTDLNVLNGHWLDNPCSPNGFWKASVIVN------VIWNLWKARCALIFKH---QPLDLL 681
            F   ++     LD+      FW+    V+      +IW +WKAR   +F++    P+++L
Sbjct: 902  FPSNSIFTN--LDHL-----FWRIPSGVDSAPYPWIIWYIWKARNEKVFENVDKDPMEIL 954

Query: 680  WI----------AHASLNQVKEFSLKSDHLRDVFFLNACTMSSPNYCLFTDASWLMDSLS 531
             +          A   L+  +  SL  D    V  ++  T  S   C F D SW      
Sbjct: 955  LLAVKEAQSWQEAQVELHSERHGSLSIDSRIRVRDVSQDTTFSGFRC-FIDGSWKASDQF 1013

Query: 530  GGCGFFISKSYLSIAVAGCSNC-ISNSXXXXXXXXXXXXLKTAVASGFHINTIFSDCAGL 354
             G G+F   S       G +N   S S            +K  + +       F+DC+ L
Sbjct: 1014 SGTGWFCLSSLGESPTMGAANVRRSLSPLHTEMEALLWAMKCMIGADNQNVAFFTDCSDL 1073

Query: 353  HLAITKPNTVTAWXXXXXXXXXXXXXXIDLV--QIIVIPRTWNLLADKLA 210
               ++ P   T W               +     + +I R+ N+ ADKLA
Sbjct: 1074 VKMVSSP---TEWPAFSVYLEELQSDREEFTNFSLSLISRSANVKADKLA 1120


>emb|CAN64680.1| hypothetical protein VITISV_016601 [Vitis vinifera]
          Length = 971

 Score = 74.3 bits (181), Expect = 7e-11
 Identities = 50/209 (23%), Positives = 91/209 (43%), Gaps = 11/209 (5%)
 Frame = -2

Query: 1259 GNWNFSSIHNFLSVHFSPPKPVLSSIDCNSVNSWV-----WSQGYMRQNIVKEVYSHFNS 1095
            G WN   +  F        K  +  I    V   V     W++    + +VK +Y    +
Sbjct: 739  GGWNPCFLRAFNDWEIEEAKRFMERIQSKRVYEDVEDTVSWTETKSGKFLVKSLYIALEA 798

Query: 1094 DHTKDFTWNGWNYLWKLNTSPRAKFFAWLVLHGRLQTLEFLRSIRIISNSTCALCGLHKE 915
              +  F  +   ++W +N  P+  FFAW    G+  TL+ ++    +  + C +C   +E
Sbjct: 799  GGSSLFPSS---FIWNVNVQPKMSFFAWEATWGKALTLDLVQKRGWVLANRCFMCLEKEE 855

Query: 914  NIEHLFNSCSKTLCLWRIIEGITYSSFFTDLNV------LNGHWLDNPCSPNGFWKASVI 753
            NI HL   CS+T  LW ++  +   S+    +V       NG++L    +    W+ + +
Sbjct: 856  NINHLLLHCSRTRALWELLFALFGVSWVLPFSVRETLLSWNGYFLGK--NRKKVWRVAPL 913

Query: 752  VNVIWNLWKARCALIFKHQPLDLLWIAHA 666
             ++ W +WK R  L FK + L +  + H+
Sbjct: 914  -HIFWTVWKERNRLAFKDESLSIQRLKHS 941


>ref|XP_002461200.1| hypothetical protein SORBIDRAFT_02g042745 [Sorghum bicolor]
            gi|241924577|gb|EER97721.1| hypothetical protein
            SORBIDRAFT_02g042745 [Sorghum bicolor]
          Length = 230

 Score = 73.6 bits (179), Expect = 1e-10
 Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 5/146 (3%)
 Frame = -2

Query: 1067 GWNYLWKLNTSPRAKFFAWLVLHGRLQTLEFLRSIRIISNSTCALCGLHKENIEHLFNSC 888
            G  + WK     + KFF WL LHGRL T E  R   + + + CALC  H E ++HL  +C
Sbjct: 66   GVMFAWKAMVPSKMKFFFWLALHGRLWTAERRRRHGLRAEAACALCSQHDETVDHLLIAC 125

Query: 887  SKTLCLWR--IIEGITYSSFFTDLNVLNGHWLDNPCS-PNGFWKA--SVIVNVIWNLWKA 723
              +  +W   ++         TD + L   W+ +    P+ F  A  S+++ V WN+WK 
Sbjct: 126  VFSRDVWSRLLMRAGLLRLTPTDGSALQDWWISSRQQIPHSFCCAFDSMVIIVSWNVWKE 185

Query: 722  RCALIFKHQPLDLLWIAHASLNQVKE 645
            R    F  +   ++ +  A L++++E
Sbjct: 186  RNDRTFNGRDRTIVQVCGAILDEIRE 211


Top