BLASTX nr result

ID: Lithospermum22_contig00021144 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00021144
         (1239 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002300602.1| predicted protein [Populus trichocarpa] gi|2...   125   2e-26
ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [S...   117   6e-24
ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [S...   116   1e-23
gb|AAD21778.1| putative non-LTR retroelement reverse transcripta...   112   2e-22
emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabid...   111   4e-22

>ref|XP_002300602.1| predicted protein [Populus trichocarpa] gi|222842328|gb|EEE79875.1|
            predicted protein [Populus trichocarpa]
          Length = 366

 Score =  125 bits (314), Expect = 2e-26
 Identities = 94/346 (27%), Positives = 150/346 (43%), Gaps = 7/346 (2%)
 Frame = +2

Query: 134  FIIDLVGISGGNTIM*KEEKMVTIERTSSWYVETEISEGGRRDWKGCFNYASCVDDVRKQ 313
            F +D +G  GG  ++ +    V++   S+ +++  + E     W+    Y     + + +
Sbjct: 42   FAVDRLGRGGGLAVLWRSSASVSLLGYSNNHIDLLVVEANNFIWRFTGYYGLPDRNRKME 101

Query: 314  KLEELRALAPSDDQGWCSMGDFNDLLSSEEKLGGIERTESSMQVFREFVKHCQLLDIGYV 493
                LRAL+      W  MGD+ND+L +EEK G +    SS Q FRE V+ C+L D+  +
Sbjct: 102  SWNLLRALSRQSALPWVCMGDYNDMLCAEEKRGRVLHPNSSFQGFREAVEDCKLTDMPLI 161

Query: 494  GHPYTW*NRLEGM------*LDHTLATASWCTEFPRVNCTHLAMLGVDHYPLLLDTEAQL 655
            G+P+TW  R  G        +D  + + SW   F R     L     DH PLLL T    
Sbjct: 162  GYPFTW-ERGRGTAEWIQERIDRAMCSDSWFDYFDRAELHILTCSSSDHSPLLLRTRVVS 220

Query: 656  EKTKKWFVFLQTVGWKRGV*KYY*ESLGLSC*RVSLVYC*RED*IC*DGAIEWCKNNNFN 835
                      Q +GW RG        +GL            E  IC     +W ++   N
Sbjct: 221  SHD-------QLMGWFRG--------MGLL----------DEFCICTGRVSQWGRSLGRN 255

Query: 836  ARVQINDLQARIRT-TYESSNNDRXXXXXXXXXXXXP*REEECYWKVQAKERHLKEGDKN 1012
             +V+I D   ++    +    +                 +EE +W+ +AK   L EG  N
Sbjct: 256  FKVEIRDCHNKLDVLRHLDDEHSAIEFKNCTDRLARLLAQEEDFWRQRAKIYWLTEGGLN 315

Query: 1013 TSFFHASAMIRRRQNLLLGIEDDTDVWQEGASKVEGIVLDYFTDMF 1150
            T +FH+ A  RRR+N++  + D      E    ++G+  DYFT++F
Sbjct: 316  TKYFHSVATARRRRNVISALVDGAGTVVEDTEGLQGVAKDYFTNLF 361


>ref|XP_002452516.1| hypothetical protein SORBIDRAFT_04g027285 [Sorghum bicolor]
           gi|241932347|gb|EES05492.1| hypothetical protein
           SORBIDRAFT_04g027285 [Sorghum bicolor]
          Length = 689

 Score =  117 bits (293), Expect = 6e-24
 Identities = 64/180 (35%), Positives = 101/180 (56%), Gaps = 5/180 (2%)
 Frame = +2

Query: 113 TLDFAFSFIIDLVGISGGNTIM*KEEKMVTIERTSSWYVETEISEGGRRDWKGCFNYASC 292
           TL F  SF ++  G SGG  +    + +++I++ S+++++T ISE G+  W+  F Y   
Sbjct: 383 TLSFDNSFAVNSSGRSGGLGLFWNNDVLLSIQKYSNYHIDTIISEHGKEPWRMSFIYGEP 442

Query: 293 VDDVRKQKLEELRALAPSDDQGWCSMGDFNDLLSSEEKLGGIERTESSMQVFREFVKHCQ 472
              +R +  + ++ +    D  W  MGDFN++L  EE+LG  ER E  M+ FR+ V  CQ
Sbjct: 443 NRSLRFRTWDIMKQMRSDTDLPWVCMGDFNEILRREEQLGPNEREEYLMEGFRDAVDVCQ 502

Query: 473 LLDIGYVGHPYTW*NRLEG-----M*LDHTLATASWCTEFPRVNCTHLAMLGVDHYPLLL 637
           L DIGY+G  +T+  ++ G     + LD  LA+ +WC  FP     HL  +  DH P+LL
Sbjct: 503 LRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAGQHLTTVKSDHCPILL 562


>ref|XP_002446678.1| hypothetical protein SORBIDRAFT_06g020403 [Sorghum bicolor]
           gi|241937861|gb|EES11006.1| hypothetical protein
           SORBIDRAFT_06g020403 [Sorghum bicolor]
          Length = 633

 Score =  116 bits (291), Expect = 1e-23
 Identities = 64/180 (35%), Positives = 100/180 (55%), Gaps = 5/180 (2%)
 Frame = +2

Query: 113 TLDFAFSFIIDLVGISGGNTIM*KEEKMVTIERTSSWYVETEISEGGRRDWKGCFNYASC 292
           TL F  SF ++  G SGG  +    + +++I++ S+++++T ISE G+  W+  F Y   
Sbjct: 383 TLSFDNSFAVNSSGRSGGLGLFWNNDVLLSIQKYSNYHIDTIISEHGKEPWRMSFIYGEP 442

Query: 293 VDDVRKQKLEELRALAPSDDQGWCSMGDFNDLLSSEEKLGGIERTESSMQVFREFVKHCQ 472
              +R +  + ++ +    D  W  MGDFN++L  EE+LG  ER E  M+ FR  V  CQ
Sbjct: 443 NRSLRFRTWDIMKQMRSDTDLPWVCMGDFNEILRREEQLGPNEREEYLMEGFRGAVDVCQ 502

Query: 473 LLDIGYVGHPYTW*NRLEG-----M*LDHTLATASWCTEFPRVNCTHLAMLGVDHYPLLL 637
           L DIGY+G  +T+  ++ G     + LD  LA+ +WC  FP     HL  +  DH P+LL
Sbjct: 503 LRDIGYIGLGWTFEKKVAGGHYVRVRLDRALASVNWCARFPLAAVQHLTTVKSDHCPILL 562


>gb|AAD21778.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1715

 Score =  112 bits (280), Expect = 2e-22
 Identities = 87/380 (22%), Positives = 152/380 (40%), Gaps = 20/380 (5%)
 Frame = +2

Query: 137  IIDLVGISGGNTIM*KEEKMVTIERTSSWYVETEISEGGRRDWKGCFNYASCVDDVRKQK 316
            II   G+SGG  +  K+   + +       V+  +       +  C  Y   +   R   
Sbjct: 420  IISPRGLSGGLVVYWKKHLSIQVISHDVRLVDLYVEYKNFNFYLSCI-YGHPIPSERHHL 478

Query: 317  LEELRALAPSDDQGWCSMGDFNDLLSSEEKLGGIERTESSMQVFREFVKHCQLLDIGYVG 496
             E+L+ ++      W   GDFN++L+  EK GG  R+  S+Q F   +  C + D+   G
Sbjct: 479  WEKLQRVSAHRSGPWMMCGDFNEILNLNEKKGGRRRSIGSLQNFTNMINCCNMKDLKSKG 538

Query: 497  HPYTW*NRLEG----M*LDHTLATASWCTEFPRVNCTHLAMLGVDHYPLLLDT------- 643
            +PY+W  + +       LD     + W   FP      L + G DH P+++D        
Sbjct: 539  NPYSWVGKRQNETIESCLDRVFINSDWQASFPAFETEFLPIAGSDHAPVIIDIAEEVCTK 598

Query: 644  EAQLEKTKKWFVFLQTV-----GWKRGV*K----YY*ESLGLSC*RVSLVYC*RED*IC* 796
              Q    ++ F F   V     GW RG       YY             ++C R++    
Sbjct: 599  RGQFRYDRRHFQFEDFVDSVQRGWNRGRSDSHGGYY-----------EKLHCCRQE---- 643

Query: 797  DGAIEWCKNNNFNARVQINDLQARIRTTYESSNNDRXXXXXXXXXXXXP*REEECYWKVQ 976
                +W +    N   +I  L+ R+                         R+EE YW ++
Sbjct: 644  --LAKWKRRTKTNTAEKIETLKYRVDAAERDHTLPHQTILRLRQDLNQAYRDEELYWHLK 701

Query: 977  AKERHLKEGDKNTSFFHASAMIRRRQNLLLGIEDDTDVWQEGASKVEGIVLDYFTDMFQT 1156
            ++ R +  GD+NT FF+AS  +R+ +N +  I D   +       +  +  +YF D+F T
Sbjct: 702  SRNRWMLLGDRNTMFFYASTKLRKSRNRIKAITDAQGIENFRDDTIGKVAENYFADLFTT 761

Query: 1157 NEASKPKLAMHTVDERVTDR 1216
             + S  +  +  +  +VT++
Sbjct: 762  TQTSDWEEIISGIAPKVTEQ 781


>emb|CAB39638.1| RNA-directed DNA polymerase-like protein [Arabidopsis thaliana]
            gi|7267666|emb|CAB78094.1| RNA-directed DNA
            polymerase-like protein [Arabidopsis thaliana]
          Length = 1274

 Score =  111 bits (277), Expect = 4e-22
 Identities = 101/377 (26%), Positives = 148/377 (39%), Gaps = 13/377 (3%)
 Frame = +2

Query: 116  LDFAFSFIIDLVGISGGNTIM*KEEKMVTIERTSSWYVETEISEGGRRDWKGCFNYASCV 295
            + +A  F I   G+SGG  +  KE             VE EI E            A   
Sbjct: 17   MGYAHRFTIPPEGLSGGLALYWKEN------------VEVEILEA-----------APNF 53

Query: 296  DDVRKQKLEELRALAPSDDQGWCSMGDFNDLLSSEEKLGGIERTESSMQVFREFVKHCQL 475
             D R    +++ +L       W   GDFND+L + EK GG  R E     FR FV    L
Sbjct: 54   IDNRSVFWDKISSLGAQRSSAWLLTGDFNDILDNSEKQGGPLRWEGFFLAFRSFVSQNGL 113

Query: 476  LDIGYVGHPYTW----*NRLEGM*LDHTLATASWCTEFPRVNCTHLAMLGVDHYPLLLDT 643
             DI + G+  +W     +      LD  L   SW   FP   C +L   G DH PL+   
Sbjct: 114  WDINHTGNSLSWRGTRYSHFIKSRLDRALGNCSWSELFPMSKCEYLRFEGSDHRPLVTYF 173

Query: 644  EAQLEKTKKWFVFLQTVGWKRGV*KYY*ESLGLSC*RVSLVYC*RED*IC*DGAIEWCKN 823
             A   K  K F F + +  K  +     E   L+  + S++Y       C    I+W K 
Sbjct: 174  GAPPLKRSKPFRFDRRLREKEEIRALVKEVWELAR-QDSVLYKISR---CRQSIIKWTKE 229

Query: 824  NNFNARVQINDLQARIRTTYESSNNDRXXXXXXXXXXXXP*REEECYWKVQAKERHLKEG 1003
             N N+   I   Q  + +   +   D               R+EE +WK  ++ + L  G
Sbjct: 230  QNSNSAKAIKKAQQALESALSADIPDPSLIGSITQELEAAYRQEELFWKQWSRVQWLNSG 289

Query: 1004 DKNTSFFHASAMIRRRQNLLLGIEDDTDVWQEGASKVEGIVLDYFTDMFQTN-------- 1159
            D+N  +FHA+   RR  N L  IED +        ++   +  YF ++F T+        
Sbjct: 290  DRNKGYFHATTRTRRMLNNLSVIEDGSGQEFHEEEQIASTISSYFQNIFTTSNNSDLQVV 349

Query: 1160 -EASKPKLAMHTVDERV 1207
             EA  P ++ H  +E +
Sbjct: 350  QEALSPIISSHCNEELI 366


Top