BLASTX nr result

ID: Angelica22_contig00020037 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00020037
         (960 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       104   2e-36
ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|2...    96   1e-33
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...    99   2e-33
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   111   2e-33
ref|XP_002331746.1| predicted protein [Populus trichocarpa] gi|2...    89   1e-32

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  104 bits (259), Expect(2) = 2e-36
 Identities = 54/164 (32%), Positives = 89/164 (54%), Gaps = 3/164 (1%)
 Frame = -1

Query: 957 IMFIYGSITVVERRALWEDL--SAFSSTILDSDWTIYGDFNTCLSIDEKQGGNVLWT-LG 787
           +  +Y +  V  R+ LW ++     S  I D  W + GDFN  L+  E      L   + 
Sbjct: 106 VSVVYAANEVASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDIN 165

Query: 786 MMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVFMA 607
           M +F+D  L   LSDLR   +  TWW+ +   P  KK+DR+L+N SW+++FP++  +F +
Sbjct: 166 MRDFRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGS 225

Query: 606 RGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 475
              SDH      L   +    +PF+FF++L+++LDF++ V + W
Sbjct: 226 LDFSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNW 269



 Score = 75.1 bits (183), Expect(2) = 2e-36
 Identities = 42/143 (29%), Positives = 71/143 (49%), Gaps = 1/143 (0%)
 Frame = -3

Query: 460 GDPWYLLTRKLKKVKEALKGLNS-RVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 284
           G   + +++KLK +K+ +K  +      L     E+ + L+  Q+   + P+P   + E 
Sbjct: 276 GSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFEL 335

Query: 283 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 104
                +      EE F +QKSR++W   GDGN KYF      R +SN I++L D NG   
Sbjct: 336 EAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLV 395

Query: 103 STHEDMARVATSYYQSILGNSAE 35
            + E +  +  SY+ S+LG+  +
Sbjct: 396 DSQEGILDLCASYFGSLLGDEVD 418


>ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|222873371|gb|EEF10502.1|
           predicted protein [Populus trichocarpa]
          Length = 819

 Score = 95.9 bits (237), Expect(2) = 1e-33
 Identities = 53/159 (33%), Positives = 81/159 (50%), Gaps = 1/159 (0%)
 Frame = -1

Query: 948 IYGSITVVERRALWEDLSAFSSTILDSDWTIYGDFNTCLSIDEKQGGNVLWTLGMMEFKD 769
           IYG      R ALW D+ + S     + W + GDFN   +  ++ GG+  W  G M+  D
Sbjct: 485 IYGDNNASLREALWSDIVSRSDGWESTLWILIGDFNAIRNQSDRLGGSTTWA-GTMDRLD 543

Query: 768 VCLKLG-LSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVFMARGLSD 592
            C++   + DLR +    TW +   EN   +KLDRVL+N  W+  FP +   F+  G+SD
Sbjct: 544 TCIREAKVDDLRYSGMHYTWSNQCPENLIMRKLDRVLVNEKWNLKFPLSEARFLPSGMSD 603

Query: 591 HCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 475
           H P    +   +Q + KPF+FF   +   +F+  V + W
Sbjct: 604 HSPMVVKVIGNDQNKKKPFRFFDMWMDHDEFMPLVKKVW 642



 Score = 74.3 bits (181), Expect(2) = 1e-33
 Identities = 45/140 (32%), Positives = 68/140 (48%), Gaps = 2/140 (1%)
 Frame = -3

Query: 460  GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSS-PSPDQFNEE 287
            G P Y L  KL+K+K+ LK  N +   N+   V +++  +   Q  L ++  +P     E
Sbjct: 648  GCPMYQLCCKLRKLKQELKLFNMAHFSNISDRVRDAKNKMDKAQQALHTAHENPILCMRE 707

Query: 286  KRLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDT 107
            + ++  Y      EE F KQK+R+ WL  GD N  YF  S   R N NK+ SL   +G+ 
Sbjct: 708  RDVVHKYASTVRAEESFFKQKARIQWLSLGDQNTSYFHKSVNGRQNRNKLLSLTREDGEV 767

Query: 106  QSTHEDMARVATSYYQSILG 47
                E +     SY+  +LG
Sbjct: 768  VERQEAVKSEVISYFHRVLG 787


>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score = 99.4 bits (246), Expect(2) = 2e-33
 Identities = 62/169 (36%), Positives = 86/169 (50%), Gaps = 7/169 (4%)
 Frame = -1

Query: 960  FIMFIYGSITVVERRALWEDLSAF--SSTILDSDWTIYGDFNTCLSIDE--KQGGNVLWT 793
            F  F+Y S    ER+ LW DL     S  I D  W I+GDFN  L +DE  +   +   T
Sbjct: 518  FYSFVYASNFAEERKILWNDLRDHMDSPIIRDKPWIIFGDFNEILDMDEHSRMEDHPAVT 577

Query: 792  LGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVF 613
             GM +F+ +      SDL S     TW +    +P +KKLDRV++N +W  V+P +  VF
Sbjct: 578  SGMRDFQSLVNYCSFSDLASHGPLFTWCNKRDNDPIWKKLDRVMVNEAWKMVYPQSYNVF 637

Query: 612  MARGLSDHCPTATSLGMLN--QIR-NKPFQFFSHLIQDLDFISKVMEAW 475
             A G SDH     +L M +  Q+R NKPF+F + +    +F   V   W
Sbjct: 638  EAGGCSDHLRCRINLNMNSGAQVRGNKPFKFVNAVADMEEFKPLVENFW 686



 Score = 70.1 bits (170), Expect(2) = 2e-33
 Identities = 47/141 (33%), Positives = 75/141 (53%), Gaps = 8/141 (5%)
 Frame = -3

Query: 448  YLLTRKLKKVKEALKGL-NSRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEKRLIE 272
            +  T+KLK +K  L+GL   ++GNL   V  +REA   Y +L  +  S  Q N  +R +E
Sbjct: 700  FRFTKKLKALKPKLRGLAKEKMGNL---VKRTREA---YLSLCQAQQSNSQ-NPSQRAME 752

Query: 271  VYKDAF-------NMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANG 113
            +  +A+       ++EE +LKQ S+++WLK GD NNK F  +   R   N I  +   +G
Sbjct: 753  IESEAYVRWDRIASIEEKYLKQVSKLHWLKVGDKNNKTFHRAATARAAQNSIREIQKEDG 812

Query: 112  DTQSTHEDMARVATSYYQSIL 50
             T +T +D+      ++Q  L
Sbjct: 813  STATTKDDIKNETERFFQEFL 833


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  111 bits (277), Expect(2) = 2e-33
 Identities = 61/164 (37%), Positives = 86/164 (52%), Gaps = 3/164 (1%)
 Frame = -1

Query: 957 IMFIYGSITVVERRALWEDLS--AFSSTILDSDWTIYGDFNTCLS-IDEKQGGNVLWTLG 787
           + F+Y       RR LW +L   A + T  D  W I GDFN  L  +D   GG+ + T G
Sbjct: 105 VTFVYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRI-TRG 163

Query: 786 MMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVFMA 607
           M EF++  L   +SDL    +  TWW+N   NP  KK+DR+L+N SW    P +   F A
Sbjct: 164 MEEFRECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCA 223

Query: 606 RGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 475
              SDHCP+  ++   +  RNKPF+  + L+   +FI K+   W
Sbjct: 224 MEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTW 267



 Score = 58.2 bits (139), Expect(2) = 2e-33
 Identities = 40/144 (27%), Positives = 66/144 (45%), Gaps = 1/144 (0%)
 Frame = -3

Query: 460 GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 284
           G   + L++K K +K  ++  N      L   V ++ + L   QN L ++PS      EK
Sbjct: 274 GSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAPSSYLAGLEK 333

Query: 283 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 104
                + +    EE FL QKSRV WLK GD N  +F      R   N+I  L+D  G   
Sbjct: 334 EAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRI 393

Query: 103 STHEDMARVATSYYQSILGNSAEV 32
              +++      +++ + G+S+ +
Sbjct: 394 ENTDELQTHCVDFFKELFGSSSHL 417


>ref|XP_002331746.1| predicted protein [Populus trichocarpa] gi|222874272|gb|EEF11403.1|
           predicted protein [Populus trichocarpa]
          Length = 503

 Score = 89.0 bits (219), Expect(2) = 1e-32
 Identities = 50/142 (35%), Positives = 72/142 (50%), Gaps = 1/142 (0%)
 Frame = -1

Query: 948 IYGSITVVERRALWEDLSAFSSTILDSDWTIYGDFNTCLSIDEKQGGNVLWTLGMMEFKD 769
           IYG      R ALW D+ + S     + W + GDFN   +   + GG+  W  G M+  D
Sbjct: 92  IYGDNNASLREALWSDIVSRSDGWESTPWILMGDFNAIRNQSHRLGGSTTWA-GTMDRLD 150

Query: 768 VCLKLG-LSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVFMARGLSD 592
            C++   + DLR +    TW +   EN   +KLDRVL+N  W+  FP +   F+  G+SD
Sbjct: 151 TCIREAKVDDLRYSGMHYTWSNQCPENLIMRKLDRVLVNEKWNLNFPLSEVRFLPSGISD 210

Query: 591 HCPTATSLGMLNQIRNKPFQFF 526
           H P    +   +Q   KPF+FF
Sbjct: 211 HSPMVVKVIGNDQNIKKPFRFF 232



 Score = 78.2 bits (191), Expect(2) = 1e-32
 Identities = 46/142 (32%), Positives = 70/142 (49%), Gaps = 2/142 (1%)
 Frame = -3

Query: 466 NGGDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSS-PSPDQFN 293
           +GG P Y L   LKK+K+ LK  N +   N+   V +++  +   Q  L ++  +P    
Sbjct: 240 SGGCPMYQLCCNLKKLKQELKLFNMAHFSNISDRVKDAKNEMDKAQQALHTAHENPILCM 299

Query: 292 EEKRLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANG 113
            E+ ++  Y      EE F KQK+R+ WL  GD N  YF  S   R N NK+ SL   +G
Sbjct: 300 RERDVVHKYASTVRAEESFFKQKARIQWLSLGDQNTSYFHKSVNGRHNRNKLLSLTREDG 359

Query: 112 DTQSTHEDMARVATSYYQSILG 47
           +    HE +     +Y+  +LG
Sbjct: 360 EVVEGHEAVKSEVIAYFHRVLG 381

