BLASTX nr result

ID: Dioscorea21_contig00024757

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00024757
         (826 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabi...   187   2e-45
gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...   185   1e-44
gb|AAB84340.1| putative non-LTR retroelement reverse transcripta...   184   2e-44
ref|XP_002449295.1| hypothetical protein SORBIDRAFT_05g007323 [S...   182   6e-44
ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778...   182   8e-44

>pir||T00833 RNA-directed DNA polymerase homolog T13L16.7 - Arabidopsis thaliana
            (fragment)
          Length = 1365

 Score =  187 bits (476), Expect = 2e-45
 Identities = 107/264 (40%), Positives = 158/264 (59%), Gaps = 4/264 (1%)
 Frame = +1

Query: 1    PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
            PK  +P  ++D RPISLC+V YK+I+K+L  RLK  +  +V + QS F+P R   DNI+ 
Sbjct: 492  PKITSPQRMSDLRPISLCSVLYKIISKILTQRLKKHLPAIVSTTQSAFVPQRLISDNILV 551

Query: 181  AQEVAHSLET-DSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
            A E+ HSL T D  +   M  K D+ KA+D +EWP +   +  + F + WI+WI +C+ S
Sbjct: 552  AHEMIHSLRTNDRISKEHMAFKTDMSKAYDRVEWPFLETMMTALGFNNKWISWIMNCVTS 611

Query: 358  ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGF---NR 528
             S+S++ING+       +RG+RQGDPLSP LF+L ++ L  ILNKA     I+G    ++
Sbjct: 612  VSYSVLINGQPYGHIIPTRGIRQGDPLSPALFVLCTEALIHILNKAEQAGKITGIQFQDK 671

Query: 529  SLSNNFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKR 708
             +S N  HL+FADD LL+ KA+++     + CL+ Y  L+GQ  NL KSA       + +
Sbjct: 672  KVSVN--HLLFADDTLLMCKATKQECEELMQCLSQYGQLSGQMINLNKSAITFGKNVDIQ 729

Query: 709  LTTSISNILGINPGVFPFLYLGVP 780
            +   I +  GI+       YLG+P
Sbjct: 730  IKDWIKSRSGISLEGGTGKYLGLP 753


>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
          Length = 1142

 Score =  185 bits (469), Expect = 1e-44
 Identities = 107/264 (40%), Positives = 150/264 (56%), Gaps = 2/264 (0%)
 Frame = +1

Query: 1    PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
            PK   P  + + RPISLCNV YK+I+K+L  RLK V+  L+   QS F+ GR   DNI+ 
Sbjct: 277  PKTERPTRMTELRPISLCNVGYKVISKILCQRLKTVLPNLISETQSAFVDGRLISDNILI 336

Query: 181  AQEVAHSLETDSS-NPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
            AQE+ H L T+SS     M  K D+ KA+D +EW  I A LR+M F + WI+WI  C+ +
Sbjct: 337  AQEMFHGLRTNSSCKDKFMAIKTDMSKAYDQVEWNFIEALLRKMGFCEKWISWIMWCITT 396

Query: 358  ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLS 537
              + ++ING+        RG+RQGDPLSP LFIL ++ L A + KA   +LI+G   +  
Sbjct: 397  VQYKVLINGQPKGLIIPERGLRQGDPLSPYLFILCTEVLIANIRKAERQNLITGIKVATP 456

Query: 538  NNFI-HLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLT 714
            +  + HL+FADD L   KA++      L  L  Y +++GQ+ N  KS+          + 
Sbjct: 457  SPAVSHLLFADDSLFFCKANKEQCGIILEILKQYESVSGQQINFSKSSIQFGHKVEDSIK 516

Query: 715  TSISNILGINPGVFPFLYLGVPIS 786
              I  ILGI+       YLG+P S
Sbjct: 517  ADIKLILGIHNLGGMGSYLGLPES 540


>gb|AAB84340.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1094

 Score =  184 bits (468), Expect = 2e-44
 Identities = 105/262 (40%), Positives = 149/262 (56%), Gaps = 2/262 (0%)
 Frame = +1

Query: 1    PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
            PK   P  + D RPISLC+V YK+I+K+L+ RLK  +  +V   QS F+  R   DNII 
Sbjct: 222  PKITKPARMADIRPISLCSVMYKIISKILSARLKKYLPVIVSPTQSAFVAERLVSDNIIL 281

Query: 181  AQEVAHSLETDSS-NPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNS 357
            A E+ H+L T+   +   M+ K D+ KA+D +EWP +   L  + F  +WI W+ +C++S
Sbjct: 282  AHEIVHNLRTNEKISKDFMVFKTDMSKAYDRVEWPFLKGILLALGFNSTWINWMMACVSS 341

Query: 358  ASFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFN-RSL 534
             S+S++ING+        RG+RQGDPLSP LF+L ++ L  ILN+A     ISG      
Sbjct: 342  VSYSVLINGQPFGHITPHRGLRQGDPLSPFLFVLCTEALIHILNQAEKIGKISGIQFNGT 401

Query: 535  SNNFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLT 714
              +  HL+FADD LLI KAS+      + CL+ Y +++GQ  N  KSA    +  N+   
Sbjct: 402  GPSVNHLLFADDTLLICKASQLECAEIMHCLSQYGHISGQMINSEKSAITFGAKVNEETK 461

Query: 715  TSISNILGINPGVFPFLYLGVP 780
              I N  GI        YLG+P
Sbjct: 462  QWIMNRSGIQTEGGTGKYLGLP 483


>ref|XP_002449295.1| hypothetical protein SORBIDRAFT_05g007323 [Sorghum bicolor]
           gi|241935138|gb|EES08283.1| hypothetical protein
           SORBIDRAFT_05g007323 [Sorghum bicolor]
          Length = 531

 Score =  182 bits (463), Expect = 6e-44
 Identities = 105/269 (39%), Positives = 153/269 (56%)
 Frame = +1

Query: 1   PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
           PKK NP +VNDFRPI+L N+  KL+TKLLA+RL++VI KLV + Q GFI  R   D +  
Sbjct: 177 PKKENPETVNDFRPIALMNISLKLLTKLLADRLQVVILKLVHTNQYGFIRSRAIQDCLAW 236

Query: 181 AQEVAHSLETDSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNSA 360
           + E  H  +   S    ++ K+D EKAFD +E   ++  +  +  PD WI W+++ L+SA
Sbjct: 237 SYEYIHQCQ--QSRRETIILKLDFEKAFDMVEHSTMIQVMSHLGMPDRWIQWVSTILSSA 294

Query: 361 SFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLSN 540
           S ++++NG    +F   RGVRQGDPLSPLLF+L ++ L  I+N+A+   LI         
Sbjct: 295 STAVLLNGTAGKFFKCKRGVRQGDPLSPLLFVLAAELLQIIINRAMIMGLIHKPLPQDGE 354

Query: 541 NFIHLMFADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFVPSWCNKRLTTS 720
           ++  + +ADD LL  +A  R        LN +   TG K N  KS  + P   +      
Sbjct: 355 DYPIVQYADDTLLFMQADARQLVFLKAILNSFSESTGLKINYSKSHMY-PINVSANKMNI 413

Query: 721 ISNILGINPGVFPFLYLGVPISPKKLKIN 807
           ++   G + G  PF YLG+P+   K KI+
Sbjct: 414 LAGTFGCDIGSMPFTYLGLPMGTTKPKID 442


>ref|XP_003528143.1| PREDICTED: uncharacterized protein LOC100778359 [Glycine max]
          Length = 2621

 Score =  182 bits (462), Expect = 8e-44
 Identities = 100/266 (37%), Positives = 150/266 (56%), Gaps = 5/266 (1%)
 Frame = +1

Query: 1    PKKPNPVSVNDFRPISLCNVCYKLITKLLANRLKMVIHKLVGSEQSGFIPGRGAFDNIIA 180
            PKK +P  +ND+RPISL    YK++ K+LA R+K V+  ++   QS FI GR    +++ 
Sbjct: 1164 PKKVDPQVLNDYRPISLIGCMYKIVAKILAKRIKTVLPTIINEAQSAFIEGRHLLQSVLI 1223

Query: 181  AQEVAHSLETDSSNPPMMMCKIDIEKAFDSIEWPAILATLRRMLFPDSWITWINSCLNSA 360
            A EV    E   S+ P ++ K+D EKA+DS+ W  +L  L+R  F   WI+W+  CL SA
Sbjct: 1224 ANEVID--EAKRSHKPCLIFKVDYEKAYDSVSWNFLLYMLKRTGFCPKWISWMEGCLKSA 1281

Query: 361  SFSLIINGKTSPWFNSSRGVRQGDPLSPLLFILVSQNLTAILNKALSTDLISGFNRSLSN 540
            S S+++NG  +  F   RG+RQGDPL+P LF +V++ L  ++  AL+ +L  GFN + S 
Sbjct: 1282 SISVLVNGSPTKEFKPQRGLRQGDPLAPFLFNIVAEALNGLMRTALAANLYKGFNIASSE 1341

Query: 541  NFIHLM-FADDLLLITKASRRNARNCLLCLNVYHNLTGQKPNLLKSAFFV----PSWCNK 705
              I L+ +ADD +   +AS +N +     L  +  ++G K N  KS+F        W   
Sbjct: 1342 ISISLLQYADDTIFFGEASMKNVKVLKAILRTFEVVSGLKINFAKSSFGAFGRDDQWRQM 1401

Query: 706  RLTTSISNILGINPGVFPFLYLGVPI 783
              T      L  +    PF+YLG+PI
Sbjct: 1402 AAT-----YLNCSQLALPFVYLGIPI 1422

