BLASTX nr result

ID: Angelica23_contig00034483 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00034483
         (1154 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   150   5e-47
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           137   2e-43
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   147   3e-42
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           147   4e-42
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               130   2e-40

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
           predicted protein [Populus trichocarpa]
          Length = 517

 Score =  150 bits (379), Expect(2) = 5e-47
 Identities = 88/277 (31%), Positives = 137/277 (49%), Gaps = 4/277 (1%)
 Frame = +3

Query: 18  IMVMSGFARGSLPITYLGLPLITTKLHDRDCALLLSKFCSQIESWTAKFLNFGGRLQLIK 197
           I+ + GF  G LP+ YLG+PL++++L    C  L+ +  S++  WT + L++ GR+QLI 
Sbjct: 43  IIHILGFREGELPMKYLGVPLLSSRLKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLIN 102

Query: 198 SVLSSMLGY*SMFVFLPHSMLKKLNALMFKFLW-GDFYKQNGKCQHKVKWEDCCKPKNEG 374
           SVL S+  Y +    LP  ++K +  +M  FLW G   +  G    KV W+  C PK EG
Sbjct: 103 SVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGA---KVAWDQVCLPKKEG 159

Query: 375 GLGLRNIYEWNFAAILHQL*WISQNESSSI*VAWFNKELLKNKGL*TSKLPYKCPWAVRK 554
           GLG+++I EWN  A+L  +  +  +   SI   W    LL+ +   T K P  C WA  K
Sbjct: 160 GLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGK 219

Query: 555 ILNSRVFASNYIQYHIGANSRFLFWHDPWVRGKSLINIFPNHVISTTESVHMAPASNFLQ 734
           IL  R  A   ++Y IG       W D W     L + +    I  +     A  +  +Q
Sbjct: 220 ILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNVLIQ 279

Query: 735 GSTWRLPSS---NHDNIIELRQLVASVQIHNRDTITW 836
            S W+ P++       IIE     ++ ++  +D + W
Sbjct: 280 NSEWKTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVW 316



 Score = 65.1 bits (157), Expect(2) = 5e-47
 Identities = 32/90 (35%), Positives = 48/90 (53%), Gaps = 2/90 (2%)
 Frame = +2

Query: 833  LEWLAS--H*CSFIRIWHSIRSSSTSVPWFDFVWKSYSIPKCSFILWLSI*NRLFTRDRM 1006
            L WL S  H  S    W  +R     V W D VW   ++P+ SF+LW+++  +L T+D++
Sbjct: 314  LVWLDSPNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKL 373

Query: 1007 LSFGMSTPPGCLLCNCNLE*VHHIFLNCPF 1096
              FG+  P  C LC  N E  +H+F  C +
Sbjct: 374  HRFGIHGPNRCSLCLRNNEDHNHLFFECSY 403


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score =  137 bits (346), Expect(2) = 2e-43
 Identities = 92/282 (32%), Positives = 142/282 (50%), Gaps = 6/282 (2%)
 Frame = +3

Query: 15  DIMVMSGFARGSLPITYLGLPLITTKLHDRDCALLLSKFCSQIESWTAKFLNFGGRLQLI 194
           DI+    FA G+LP+ YLGLPL+T K+   D   L+ K   +I  WTA+ L+F GRLQLI
Sbjct: 11  DILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLI 70

Query: 195 KSVLSSMLGY*SMFVFLPHSMLKKLNALMFKFLWGDFYKQNGKCQHKVKWEDCCKPKNEG 374
            SV+ S+  +      LP + +K+++++   FLW        K   KV W D C PK+EG
Sbjct: 71  SSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKA--KVAWSDVCTPKDEG 128

Query: 375 GLGLRNIYEWNFAAILHQL*WISQNESSSI*VAWFNKELLKNKGL*T-SKLPYKCPWAVR 551
           GLG+R++ E N  ++L +L W     S+S+ V W    LL+     + S       W  +
Sbjct: 129 GLGIRSLKEANKVSLL-KLIW-RMLSSTSLWVQWLRLYLLRKGSFWSISGNTTLGSWMWK 186

Query: 552 KILNSRVFASNYIQYHIGANSRFLFWHDPWVRGKSLINIFPNH-VISTTESVHMAPASNF 728
           KIL  R  AS ++++ I   S   FW D W +   LI++  +   I    ++H + A   
Sbjct: 187 KILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAV 246

Query: 729 LQGSTWRLPSSNHDNIIELRQLVASVQ----IHNRDTITWNG 842
           +     R     HD ++ +  ++A V+        DT+ W G
Sbjct: 247 VNHRPRR---HRHDTLLRIEDVIAEVRHQGLTSGEDTVRWKG 285



 Score = 65.9 bits (159), Expect(2) = 2e-43
 Identities = 29/74 (39%), Positives = 42/74 (56%)
 Frame = +2

Query: 875  WHSIRSSSTSVPWFDFVWKSYSIPKCSFILWLSI*NRLFTRDRMLSFGMSTPPGCLLCNC 1054
            W + R     V W+  VW S++ PK S + W++I NRL T DRMLS+       C+LC+ 
Sbjct: 300  WAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHH 359

Query: 1055 NLE*VHHIFLNCPF 1096
             +E   H+F  CP+
Sbjct: 360  LVETRDHLFFTCPY 373


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  147 bits (371), Expect(2) = 3e-42
 Identities = 84/243 (34%), Positives = 118/243 (48%)
 Frame = +3

Query: 33   GFARGSLPITYLGLPLITTKLHDRDCALLLSKFCSQIESWTAKFLNFGGRLQLIKSVLSS 212
            GF  G+ PI YLGLPL+  KL   D   LL K  +++ SW +K L+F GR QLI SV+  
Sbjct: 614  GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673

Query: 213  MLGY*SMFVFLPHSMLKKLNALMFKFLWGDFYKQNGKCQHKVKWEDCCKPKNEGGLGLRN 392
            ++ +      LP   +KK+ +L  KFLW      +G+   KV W DCC PK+EGGLG R+
Sbjct: 674  LINFWMSTFLLPKGCIKKIESLCSKFLWAG--SIDGRKSSKVSWVDCCLPKSEGGLGFRS 731

Query: 393  IYEWNFAAILHQL*WISQNESSSI*VAWFNKELLKNKGL*TSKLPYKCPWAVRKILNSRV 572
              EWN   +L +L W+  +  +S+   W     L +            PW  + +LN R 
Sbjct: 732  FGEWN-KTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRP 790

Query: 573  FASNYIQYHIGANSRFLFWHDPWVRGKSLINIFPNHVISTTESVHMAPASNFLQGSTWRL 752
             A  +I+  +G      FW D W     LI    +           A  ++ + GS WRL
Sbjct: 791  LAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWRL 850

Query: 753  PSS 761
            P S
Sbjct: 851  PLS 853



 Score = 52.0 bits (123), Expect(2) = 3e-42
 Identities = 26/79 (32%), Positives = 40/79 (50%)
 Frame = +2

Query: 860  SFIRIWHSIRSSSTSVPWFDFVWKSYSIPKCSFILWLSI*NRLFTRDRMLSFGMSTPPGC 1039
            S  + W  +R       W   VW   ++PK +F  W +  NRL TR R++S+G+ +   C
Sbjct: 893  SAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952

Query: 1040 LLCNCNLE*VHHIFLNCPF 1096
             LC+ + E   H+ L C F
Sbjct: 953  CLCSFDTETRDHLLLLCDF 971


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  147 bits (371), Expect(2) = 4e-42
 Identities = 84/243 (34%), Positives = 118/243 (48%)
 Frame = +3

Query: 33   GFARGSLPITYLGLPLITTKLHDRDCALLLSKFCSQIESWTAKFLNFGGRLQLIKSVLSS 212
            GF  G+ PI YLGLPL+  KL   D   LL K  +++ SW +K L+F GR QLI SV+  
Sbjct: 614  GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673

Query: 213  MLGY*SMFVFLPHSMLKKLNALMFKFLWGDFYKQNGKCQHKVKWEDCCKPKNEGGLGLRN 392
            ++ +      LP   +KK+ +L  KFLW      +G+   KV W DCC PK+EGGLG R+
Sbjct: 674  LINFWMSTFLLPKGCIKKIESLCSKFLWAG--SIDGRKSSKVSWVDCCLPKSEGGLGFRS 731

Query: 393  IYEWNFAAILHQL*WISQNESSSI*VAWFNKELLKNKGL*TSKLPYKCPWAVRKILNSRV 572
              EWN   +L +L W+  +  +S+   W     L +            PW  + +LN R 
Sbjct: 732  FGEWN-KTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRP 790

Query: 573  FASNYIQYHIGANSRFLFWHDPWVRGKSLINIFPNHVISTTESVHMAPASNFLQGSTWRL 752
             A  +I+  +G      FW D W     LI    +           A  ++ + GS WRL
Sbjct: 791  LAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWRL 850

Query: 753  PSS 761
            P S
Sbjct: 851  PLS 853



 Score = 51.6 bits (122), Expect(2) = 4e-42
 Identities = 26/79 (32%), Positives = 40/79 (50%)
 Frame = +2

Query: 860  SFIRIWHSIRSSSTSVPWFDFVWKSYSIPKCSFILWLSI*NRLFTRDRMLSFGMSTPPGC 1039
            S  + W  +R       W   VW   ++PK +F  W +  NRL TR R++S+G+ +   C
Sbjct: 893  SAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAEC 952

Query: 1040 LLCNCNLE*VHHIFLNCPF 1096
             LC+ + E   H+ L C F
Sbjct: 953  CLCSFDTETRDHLLLLCDF 971


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  130 bits (328), Expect(2) = 2e-40
 Identities = 85/276 (30%), Positives = 133/276 (48%), Gaps = 7/276 (2%)
 Frame = +3

Query: 36   FARGSLPITYLGLPLITTKLHDRDCALLLSKFCSQIESWTAKFLNFGGRLQLIKSVLSSM 215
            F  G LP+ YLGLPL+T +L   D + LL +   +I +WT +F +F GR  LIKSVL S+
Sbjct: 409  FDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLWSI 468

Query: 216  LGY*SMFVFLPHSMLKKLNALMFKFLWGDFYKQNGKCQHKVKWEDCCKPKNEGGLGLRNI 395
              +      LP   +++++ L   FLW      + K   K+ W+  CKPK EGGLGLRN+
Sbjct: 469  CNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKA--KISWDIVCKPKAEGGLGLRNL 526

Query: 396  YEWNFAAILHQL*WISQNESSSI*VAWFNKELLKNKGL*TSKLPYKC-PWAVRKILNSRV 572
             E N  + L +L W   + S+S+   W  + L++ K + + K       W  RKIL  R 
Sbjct: 527  KEANDVSCL-KLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRD 585

Query: 573  FASNYIQYHIGANSRFLFWHDPWVRGKSLINIFPNHVISTTESVHMAPASNFLQGSTWRL 752
             A ++ +  +G      FW+D W     LI+      +    ++ +           W  
Sbjct: 586  VAKSFSRVEVGNGESASFWYDHWSAHGRLID-----TVGDKGTIDLGIPREASVADAWTR 640

Query: 753  PSSNHDN---IIELRQLVASVQIHN---RDTITWNG 842
             S        + E+ +++A  +IH+    DT+ W G
Sbjct: 641  RSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRG 676



 Score = 62.8 bits (151), Expect(2) = 2e-40
 Identities = 30/83 (36%), Positives = 47/83 (56%), Gaps = 2/83 (2%)
 Frame = +2

Query: 875  WHSIRSSSTSVPWFDFVWKSYSIPKCSFILWLSI*NRLFTRDRMLSFGM--STPPGCLLC 1048
            WH I+++S++V W   VW  ++ PK +   WL+I NRL T DRML +    S    C+LC
Sbjct: 691  WHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLC 750

Query: 1049 NCNLE*VHHIFLNCPFFDLIRCA 1117
              N + + H+F +C +   +  A
Sbjct: 751  TNNSKTLEHLFFSCSYASTVWAA 773

