BLASTX nr result

ID: Dioscorea21_contig00003614 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00003614
         (1197 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CCA65996.1| hypothetical protein [Beta vulgaris subsp. vulga...   113   8e-23
emb|CAB10338.1| hypothetical protein [Arabidopsis thaliana] gi|7...   113   1e-22
gb|AAD37019.2| putative non-LTR retrolelement reverse transcript...   110   9e-22
ref|XP_002299084.1| predicted protein [Populus trichocarpa] gi|2...   107   6e-21
emb|CCA66008.1| hypothetical protein [Beta vulgaris subsp. vulga...   107   8e-21

>emb|CCA65996.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 744

 Score =  113 bits (283), Expect = 8e-23
 Identities = 60/193 (31%), Positives = 99/193 (51%)
 Frame = +3

Query: 303 VRLDRETIARARMRFQTALYGKFFGKAPPFDQTKEILSEKWNDIGTFQISDLPNGYLLIR 482
           V   +ET+ + R  ++  L GK  G +   +   E ++  W      ++ DL +   L +
Sbjct: 108 VSFTKETLWKLREPWRNTLMGKVLGMSISRNFLVERVNRIWKTNDKVEVIDLGHDVFLFK 167

Query: 483 CSTHESMQRLMFEGPWAVNGLVLQLFPWRPYFEPAFTKLSSAAIWIQLHNLPVELWEGEA 662
            +    M++ +F GPW +    L L  W+P F P+ +      +WI+   LP+E ++ EA
Sbjct: 168 FNNGNDMEKALFGGPWFILNHYLMLTRWKPDFRPSQSVFDKIMVWIRFPELPLEYYDKEA 227

Query: 663 LETIASIFGRLLKIDDVTLVLSRSKYARICVEVDLSKPLKQGFWVGDEDNRVFVVVLYER 842
           L  IA   G+ +K+D  T  ++R +YAR+C+E+DL+K L    WV     R +  V YE 
Sbjct: 228 LFAIAGKVGKPIKVDYATDHMARGRYARVCIELDLAKALVSKVWVA----RAWQNVEYEN 283

Query: 843 LPTFCYHCGLVGH 881
           L   C+ CG +GH
Sbjct: 284 LSLVCFLCGKIGH 296


>emb|CAB10338.1| hypothetical protein [Arabidopsis thaliana]
           gi|7268308|emb|CAB78602.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 655

 Score =  113 bits (282), Expect = 1e-22
 Identities = 60/193 (31%), Positives = 94/193 (48%)
 Frame = +3

Query: 303 VRLDRETIARARMRFQTALYGKFFGKAPPFDQTKEILSEKWNDIGTFQISDLPNGYLLIR 482
           + +++E +      ++  +  K  G+    +     L + W   G   + DLP  + ++R
Sbjct: 78  ITIEKEVLDAMNGLYKQCMLVKILGRHTTIEVLSRKLRDLWRPTGGMSVLDLPRQFFMVR 137

Query: 483 CSTHESMQRLMFEGPWAVNGLVLQLFPWRPYFEPAFTKLSSAAIWIQLHNLPVELWEGEA 662
               E     +  GPW V G +L +  W P F P    + +  +W+++ NLPV  +  E 
Sbjct: 138 FEVEEDYMMALTGGPWRVLGSILMVQAWSPEFNPLRDVIETTPVWVRVANLPVTFYHNEI 197

Query: 663 LETIASIFGRLLKIDDVTLVLSRSKYARICVEVDLSKPLKQGFWVGDEDNRVFVVVLYER 842
           L  IA+  G+ +K+D  TL   R ++AR+CVEV+L  PLK    V  E  R F  V YE 
Sbjct: 198 LLGIAAGLGKPIKVDLTTLRKERGRFARVCVEVNLKNPLKGTLVVNGE--RYF--VSYEG 253

Query: 843 LPTFCYHCGLVGH 881
           L T C  CG+ GH
Sbjct: 254 LQTICSLCGIYGH 266


>gb|AAD37019.2| putative non-LTR retrolelement reverse transcriptase [Arabidopsis
           thaliana]
          Length = 855

 Score =  110 bits (274), Expect = 9e-22
 Identities = 61/199 (30%), Positives = 97/199 (48%)
 Frame = +3

Query: 285 AGTDDFVRLDRETIARARMRFQTALYGKFFGKAPPFDQTKEILSEKWNDIGTFQISDLPN 464
           A  +  + + RE +      ++  +  K  G+          L E W   G   + DLP 
Sbjct: 32  ADGESVITIGREVLDVMNSMWKQCMVVKVLGRNISIANLNRRLREMWKPQGAMFVVDLPC 91

Query: 465 GYLLIRCSTHESMQRLMFEGPWAVNGLVLQLFPWRPYFEPAFTKLSSAAIWIQLHNLPVE 644
            + +IR    +     +  GPW   G  L +  W P F+P   ++++  IW++L N+P+ 
Sbjct: 92  QFFMIRFEREDEYLSALTGGPWRAFGSYLLVQAWSPEFDPLRDEITTTPIWVRLMNIPLS 151

Query: 645 LWEGEALETIASIFGRLLKIDDVTLVLSRSKYARICVEVDLSKPLKQGFWVGDEDNRVFV 824
           L+    L  IA   G+ +K+D  TL + R+++AR+C+EVDL+KPLK    +  E  R F 
Sbjct: 152 LYHTSILMGIAGSLGKPVKVDMTTLHVERARFARMCIEVDLAKPLKGTLLLNGE--RYF- 208

Query: 825 VVLYERLPTFCYHCGLVGH 881
            V YE L   C  CG+ GH
Sbjct: 209 -VSYEGLANICSRCGMYGH 226


>ref|XP_002299084.1| predicted protein [Populus trichocarpa] gi|222846342|gb|EEE83889.1|
            predicted protein [Populus trichocarpa]
          Length = 949

 Score =  107 bits (267), Expect = 6e-21
 Identities = 69/239 (28%), Positives = 106/239 (44%), Gaps = 2/239 (0%)
 Frame = +3

Query: 171  RSAPSTSVAKTWAEVAARAQQQAFSSPLEDGPRLEKLKAGTDDFVRLDRETIARARMRFQ 350
            R+A   S + +W E   R    +    L+  PR +++ A     +++  E +     ++ 
Sbjct: 344  RAAAHPSSSPSWVE-RVRVTDTSTRFSLDPIPR-QQIGAS----LKIPEEILTETTEKWT 397

Query: 351  TALYGKFFGKAPPFDQTKEILSEKWNDIGTFQISDLPNGYLLIRCSTHESMQRLMFEGPW 530
              + G F G   PF     I S  W   G   +    NG+++ R  T   M  ++ +GPW
Sbjct: 398  RCMIGFFPGFKMPFHVVNTIASRVWASYGLENVMTTANGFMVFRFKTEAEMHVVLEKGPW 457

Query: 531  AVNGLVLQLFPWRPYFEPAFTKLSSAAIWIQLHNLPVELWEGEALETIASIFGRLLKIDD 710
               G  + L  W P+F     K+S   +WI+LH LP  LW    L   AS+ G+ L  D+
Sbjct: 458  MFGGKAIILQQWHPHFVFDKNKISKLPVWIRLHGLPFPLWSKSGLSLAASMAGKPLSCDE 517

Query: 711  VTLVLSRSKYARICVEVDLSKPLKQGFWVGDE--DNRVFVVVLYERLPTFCYHCGLVGH 881
             T   +   YA +CVE+D S P    F +  +  D  V + V YE  P  C  C + GH
Sbjct: 518  QTYNCTHLDYAIVCVEIDASLPFIHQFDMESKLSDELVLIRVEYEWRPPRCEKCCVFGH 576



 Score = 77.8 bits (190), Expect = 5e-12
 Identities = 42/119 (35%), Positives = 58/119 (48%)
 Frame = +3

Query: 525 PWAVNGLVLQLFPWRPYFEPAFTKLSSAAIWIQLHNLPVELWEGEALETIASIFGRLLKI 704
           PW   G  + L  W   F      ++   +WI++++LP  LW  E L  +AS+ G+ L  
Sbjct: 7   PWMFGGKAIILQKWHSGFVFDMNMITKIPVWIRIYDLPFPLWTKEGLSEVASMVGQPLSC 66

Query: 705 DDVTLVLSRSKYARICVEVDLSKPLKQGFWVGDEDNRVFVVVLYERLPTFCYHCGLVGH 881
           D++TL   R  YAR+CVEVD S P    F +        V V YE  P  C  C + GH
Sbjct: 67  DELTLGCKRLDYARLCVEVDASLPFVHKFKLEFSTTIREVHVNYEWKPKRCERCPVFGH 125


>emb|CCA66008.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 678

 Score =  107 bits (266), Expect = 8e-21
 Identities = 76/281 (27%), Positives = 123/281 (43%), Gaps = 31/281 (11%)
 Frame = +3

Query: 132 ERFSSPLAMASGGRSAP---STSVAKTWAEVAARAQQQA---------------FSSPLE 257
           +R      MA G RS+P     SVA ++ E       ++                S+PL 
Sbjct: 34  KRMEEDSFMAEGDRSSPPMREDSVAASFVEETLGTPNKSPKKVSYSQAARGLGLQSNPLY 93

Query: 258 DGPRLEKLKAGTDDF-------------VRLDRETIARARMRFQTALYGKFFGKAPPFDQ 398
               +E  +   DD              + L +E   R R  ++ +L  K F +   +D 
Sbjct: 94  VNDSIEDEEISDDDIPLECDSEDESCPTIYLTKEEKRRIRHPWRNSLIIKLFDRRLSYDI 153

Query: 399 TKEILSEKWNDIGTFQISDLPNGYLLIRCSTHESMQRLMFEGPWAVNGLVLQLFPWRPYF 578
               L  KWN  G   ++D+ + Y ++R +  E    ++ +GPW +    L +  W P F
Sbjct: 154 LVRRLKYKWNLKGDIALTDVGHAYYVVRFNNMEDYDFVLTQGPWLIGDSYLTIRKWVPNF 213

Query: 579 EPAFTKLSSAAIWIQLHNLPVELWEGEALETIASIFGRLLKIDDVTLVLSRSKYARICVE 758
                 +     W+++ +L VE ++ + L +I S  G+++KID  T  + R +Y R C+E
Sbjct: 214 VSDQEPIKKLTAWVRIPHLSVEYFDKQFLHSIGSKIGKVIKIDRNTESMDRGQYVRFCIE 273

Query: 759 VDLSKPLKQGFWVGDEDNRVFVVVLYERLPTFCYHCGLVGH 881
           VDLSKPL   F +    N    +V YE L   C+ CG +GH
Sbjct: 274 VDLSKPLLSKFRL----NGKVWIVQYEGLRLICFKCGHLGH 310

