BLASTX nr result

ID: Angelica23_contig00009561 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00009561
         (1334 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   155   3e-61
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   141   6e-60
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       151   1e-59
gb|AAD12028.1| putative non-LTR retroelement reverse transcripta...   156   3e-55
gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,...   146   4e-55

>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score =  155 bits (392), Expect(2) = 3e-61
 Identities = 98/272 (36%), Positives = 147/272 (54%), Gaps = 13/272 (4%)
 Frame = -3

Query: 777  YLLTRKLKKVKEALKGL-NSRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEKRLIE 601
            +  T+KLK +K  L+GL   ++GNL   V  +REA   Y +L  +  S  Q N  +R +E
Sbjct: 700  FRFTKKLKALKPKLRGLAKEKMGNL---VKRTREA---YLSLCQAQQSNSQ-NPSQRAME 752

Query: 600  VYKDAF-------NMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANG 442
            +  +A+       ++EE +LKQ S+++WLK GD NNK F  +   R   N I  +   +G
Sbjct: 753  IESEAYVRWDRIASIEEKYLKQVSKLHWLKVGDKNNKTFHRAATARAAQNSIREIQKEDG 812

Query: 441  DTQSTHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQAS-----VLTNEFTSDD 277
             T +T +D+      ++Q  L       +     KL S+  +  S     +LT   ++ +
Sbjct: 813  STATTKDDIKNETERFFQEFLQLIPNDYEGITVEKLTSLLPYHCSPAEKDMLTASVSAKE 872

Query: 276  ILKTLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLV 97
            I   L SM  ++SPGPDG+  EFY   W IIG      +  FFE   LP+ VN   + L+
Sbjct: 873  IRGALFSMPNDKSPGPDGYTSEFYKRAWDIIGAEFVLAVKSFFEKGFLPKGVNTTILALI 932

Query: 96   PKQQNASEMKHFRPISCCNVLYKCIAKMLASR 1
            PK+  A EMK +RPISCCNV+YK I+K++A+R
Sbjct: 933  PKKLEAKEMKDYRPISCCNVIYKVISKIIANR 964



 Score =  107 bits (268), Expect(2) = 3e-61
 Identities = 66/183 (36%), Positives = 93/183 (50%), Gaps = 7/183 (3%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLSAF--SSTILDSDWTIYGDFNTCLS 1158
            ++C   L S  + FF  F+Y S    ER+ LW DL     S  I D  W I+GDFN  L 
Sbjct: 504  ITCSVKLESQEEEFFYSFVYASNFAEERKILWNDLRDHMDSPIIRDKPWIIFGDFNEILD 563

Query: 1157 IDE--KQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLIN 984
            +DE  +   +   T GM +F+ +      SDL S     TW +    +P +KKLDRV++N
Sbjct: 564  MDEHSRMEDHPAVTSGMRDFQSLVNYCSFSDLASHGPLFTWCNKRDNDPIWKKLDRVMVN 623

Query: 983  PSWHSVFPNASTVFMARGLSDHCPTATSLGMLN--QIR-NKPFQFFSHLIQDLDFISKVM 813
             +W  V+P +  VF A G SDH     +L M +  Q+R NKPF+F + +    +F   V 
Sbjct: 624  EAWKMVYPQSYNVFEAGGCSDHLRCRINLNMNSGAQVRGNKPFKFVNAVADMEEFKPLVE 683

Query: 812  EAW 804
              W
Sbjct: 684  NFW 686


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  141 bits (355), Expect(2) = 6e-60
 Identities = 86/269 (31%), Positives = 135/269 (50%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789  GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
            G   + L++K K +K  ++  N      L   V ++ + L   QN L ++PS      EK
Sbjct: 274  GSAMFTLSKKSKFLKGTIRTFNREHYSGLEKRVVQAAQNLKTCQNNLLAAPSSYLAGLEK 333

Query: 612  RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
                 + +    EE FL QKSRV WLK GD N  +F      R   N+I  L+D  G   
Sbjct: 334  EAHRSWAELALAEERFLCQKSRVLWLKCGDSNTTFFHRMMTARRAINEIHYLLDQTGRRI 393

Query: 432  STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQAS-----VLTNEFTSDDILK 268
               +++      +++ + G+S+ +       ++ S+T  +       +L  E +  DI  
Sbjct: 394  ENTDELQTHCVDFFKELFGSSSHLISAEGISQINSLTRFKCDENTRQLLEAEVSEADIKS 453

Query: 267  TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
               ++  N+SPGPDG+  EF+  TW I+GP + + +  FF S  L    N+ A+ +VPK+
Sbjct: 454  EFFALPSNKSPGPDGYTSEFFKKTWSIVGPSLIAAVQEFFRSGRLLGQWNSTAVTMVPKK 513

Query: 87   QNASEMKHFRPISCCNVLYKCIAKMLASR 1
             NA  +  FRPISCCN +YK I+K+LA R
Sbjct: 514  PNADRITEFRPISCCNAIYKVISKLLARR 542



 Score =  117 bits (294), Expect(2) = 6e-60
 Identities = 65/179 (36%), Positives = 92/179 (51%), Gaps = 3/179 (1%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLS--AFSSTILDSDWTIYGDFNTCLS 1158
            +SC   L   +  F + F+Y       RR LW +L   A + T  D  W I GDFN  L 
Sbjct: 90   ISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWSELELLAANQTTSDKPWIILGDFNQSLD 149

Query: 1157 -IDEKQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINP 981
             +D   GG+ + T GM EF++  L   +SDL    +  TWW+N   NP  KK+DR+L+N 
Sbjct: 150  PVDASTGGSRI-TRGMEEFRECLLTSNISDLPFRGNHYTWWNNQENNPIAKKIDRILVND 208

Query: 980  SWHSVFPNASTVFMARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
            SW    P +   F A   SDHCP+  ++   +  RNKPF+  + L+   +FI K+   W
Sbjct: 209  SWLIASPLSYGSFCAMEFSDHCPSCVNISNQSGGRNKPFKLSNFLMHHPEFIEKIRVTW 267


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  151 bits (382), Expect(2) = 1e-59
 Identities = 91/269 (33%), Positives = 140/269 (52%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789  GDPWYLLTRKLKKVKEALKGLNS-RVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
            G   + +++KLK +K+ +K  +      L     E+ + L+  Q+   + P+P   + E 
Sbjct: 276  GSSMFRVSKKLKALKKPIKDFSRLNYSELEKRTKEAHDFLIGCQDRTLADPTPINASFEL 335

Query: 612  RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
                 +      EE F +QKSR++W   GDGN KYF      R +SN I++L D NG   
Sbjct: 336  EAERKWHILTAAEESFFRQKSRISWFAEGDGNTKYFHRMADARNSSNSISALYDGNGKLV 395

Query: 432  STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQASV-----LTNEFTSDDILK 268
             + E +  +  SY+ S+LG+  +     E   +  + S++ S      L + F+++DI  
Sbjct: 396  DSQEGILDLCASYFGSLLGDEVDPY-LMEQNDMNLLLSYRCSPAQVCELESTFSNEDIRA 454

Query: 267  TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
             L S+ +N+S GPDGF  EF++ +W I+G  VT  I  FF S  L +  NA  I L+PK 
Sbjct: 455  ALFSLPRNKSCGPDGFTAEFFIDSWSIVGAEVTDAIKEFFSSGCLLKQWNATTIVLIPKI 514

Query: 87   QNASEMKHFRPISCCNVLYKCIAKMLASR 1
             N +    FRPISC N LYK IA++L  R
Sbjct: 515  VNPTCTSDFRPISCLNTLYKVIARLLTDR 543



 Score =  106 bits (265), Expect(2) = 1e-59
 Identities = 56/179 (31%), Positives = 93/179 (51%), Gaps = 3/179 (1%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDL--SAFSSTILDSDWTIYGDFNTCLS 1158
            ++C   L        +  +Y +  V  R+ LW ++     S  I D  W + GDFN  L+
Sbjct: 91   ITCEVLLPGSPSWIIVSVVYAANEVASRKELWIEIVNMVVSGIIGDRPWLVLGDFNQVLN 150

Query: 1157 IDEKQGGNVLWT-LGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINP 981
              E      L   + M +F+D  L   LSDLR   +  TWW+ +   P  KK+DR+L+N 
Sbjct: 151  PQEHSNPVSLNVDINMRDFRDCLLAAELSDLRYKGNTFTWWNKSHTTPVAKKIDRILVND 210

Query: 980  SWHSVFPNASTVFMARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
            SW+++FP++  +F +   SDH      L   +    +PF+FF++L+++LDF++ V + W
Sbjct: 211  SWNALFPSSLGIFGSLDFSDHVSCGVVLEETSIKAKRPFKFFNYLLKNLDFLNLVRDNW 269


>gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1447

 Score =  156 bits (395), Expect(2) = 3e-55
 Identities = 91/267 (34%), Positives = 146/267 (54%), Gaps = 8/267 (2%)
 Frame = -3

Query: 777  YLLTRKLKKVKEALKGL-NSRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEKRLIE 601
            +  ++KLK +K  L+ L   R+GNL     E+ + L   Q    ++P+P+   EE    +
Sbjct: 693  FRFSKKLKSLKPLLRNLAKERLGNLVKKTREAYDTLCKKQESTLNNPTPNAMKEEVEAHD 752

Query: 600  VYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQSTHE 421
             ++    +EE FLK+KS+++WL  GD NNK F  +   R   N I+ +   +G   +  +
Sbjct: 753  RWEHVAGLEEKFLKKKSKLHWLDGGDKNNKAFHRAVVTREAQNSISEIQCQDGSVTAKGD 812

Query: 420  DMARVATSYYQSILG------NSAEVCDFPEDLKLP-SITSHQASVLTNEFTSDDILKTL 262
            ++   A  +++  L           + D  + L    S T H+  +LT   T+++I K L
Sbjct: 813  EIKAYAERFFREFLQLIPNEYEGVTMADLQDLLPFRCSETEHE--LLTRVVTAEEIKKVL 870

Query: 261  KSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQQN 82
             SM  ++SPGPDGF  EF+ +TW I+G      I  FF    LP+ +N   + L+PK++ 
Sbjct: 871  FSMPNDKSPGPDGFTSEFFKATWEILGNEFILAIQSFFAKGFLPKGINTTILALIPKKKE 930

Query: 81   ASEMKHFRPISCCNVLYKCIAKMLASR 1
            A EMK +RPISCCNV+YK I+K++A+R
Sbjct: 931  AKEMKDYRPISCCNVIYKVISKIIANR 957



 Score = 86.7 bits (213), Expect(2) = 3e-55
 Identities = 58/183 (31%), Positives = 89/183 (48%), Gaps = 7/183 (3%)
 Frame = -1

Query: 1331 VSCHATLLSYNKSFFIMFIYGSITVVERRALWEDLSAF--SSTILDSDWTIYGDFNTCLS 1158
            ++C   L + +  FF  F+Y S    +R+ LW +L     S  I    W I+GDFN  L 
Sbjct: 497  ITCSVKLENRDDEFFCSFVYASNFRDDRKVLWNELQDHYDSPIIKKKPWIIFGDFNETLE 556

Query: 1157 IDE--KQGGNVLWTLGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLIN 984
            ++E  K   N + ++GM +F+ +     L+D+       TW +    +   KKLDRV++N
Sbjct: 557  LEEHSKVEDNPVVSMGMRDFRSMVNYCSLTDMAHHGPLYTWSNKREHDLIAKKLDRVMVN 616

Query: 983  PSWHSVFPNASTVFMARGLSDHCPTATSL--GMLNQIRNK-PFQFFSHLIQDLDFISKVM 813
              W   FP + +VF A G  DH     +L  G  + +R K PF+F + L +  DF   V 
Sbjct: 617  DVWTQSFPQSYSVFEAGGCLDHLRGRINLNDGPGSIVRGKRPFKFVNVLTEMEDFKPTVD 676

Query: 812  EAW 804
              W
Sbjct: 677  SYW 679


>gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13)
           [Arabidopsis thaliana]
          Length = 1164

 Score =  146 bits (369), Expect(2) = 4e-55
 Identities = 91/269 (33%), Positives = 137/269 (50%), Gaps = 6/269 (2%)
 Frame = -3

Query: 789 GDPWYLLTRKLKKVKEALKGLN-SRVGNLHLLVSESREALMHYQNLLPSSPSPDQFNEEK 613
           G   Y ++ KLK +K+ ++  +     ++     E+ +AL+  Q++L +SP P     E 
Sbjct: 173 GSAMYRVSVKLKALKKVIRDFSRDNYSDIEKRTKEAHDALLLAQSVLLASPCPSNAAIEA 232

Query: 612 RLIEVYKDAFNMEECFLKQKSRVNWLKHGDGNNKYFFNSCRRRWNSNKITSLVDANGDTQ 433
                ++     E  F  Q+SRVNWL+ GD N+ YF      R + N I  L D  GD  
Sbjct: 233 ETQRKWRILAEAEASFFYQRSRVNWLREGDMNSSYFHKMASARQSLNHIHFLSDPVGDRI 292

Query: 432 STHEDMARVATSYYQSILGNSAEVCDFPEDLKLPSITSHQASV-----LTNEFTSDDILK 268
              +++      Y+QS LG+   +  F E   + ++ S++ S      L   F+S+ I  
Sbjct: 293 EGQQNLENHCVEYFQSNLGSEQGLPLF-EQADISNLLSYRCSPAQQVSLDTPFSSEQIKN 351

Query: 267 TLKSMGKNRSPGPDGFPVEFYLSTWHIIGPYVTSGILYFFESLSLPRVVNAAAICLVPKQ 88
              S+ +N++ GPDGF  EF+ + W IIG  VT  I  FF S  L +  NA  + L+PK 
Sbjct: 352 AFFSLPRNKASGPDGFSPEFFCACWPIIGGEVTEAIHEFFTSGKLLKQWNATNLVLIPKI 411

Query: 87  QNASEMKHFRPISCCNVLYKCIAKMLASR 1
            NAS M  FRPISC N +YK I+K+L  R
Sbjct: 412 TNASSMSDFRPISCLNTVYKVISKLLTDR 440



 Score = 96.3 bits (238), Expect(2) = 4e-55
 Identities = 55/166 (33%), Positives = 84/166 (50%), Gaps = 5/166 (3%)
 Frame = -1

Query: 1286 IMFIYGSITVVERRALWEDLSAFSST--ILDSDWTIYGDFNTCLSIDE---KQGGNVLWT 1122
            + F+Y S   V R+ LW ++  FS+   ++D  WT+ GDFN  L   E     G NV   
Sbjct: 3    LSFVYASTDEVTRQILWNEIVDFSNDPCVIDKPWTVLGDFNQILHPSEHSTSDGFNVDRP 62

Query: 1121 LGMMEFKDVCLKLGLSDLRSTWHFLTWWDNNLENPKFKKLDRVLINPSWHSVFPNASTVF 942
              +  F++  L   L+DL    +  TWW+     P  KKLDR+L+N  W + FP++  +F
Sbjct: 63   TRI--FRETILLASLTDLSFRGNTFTWWNKRSRAPVAKKLDRILVNDKWTTTFPSSLGLF 120

Query: 941  MARGLSDHCPTATSLGMLNQIRNKPFQFFSHLIQDLDFISKVMEAW 804
                 SDH     SL   +    KPF+F + L++D +F+S +   W
Sbjct: 121  GEPDFSDHSSCELSLMSASPRSKKPFRFNNFLLKDENFLSLICLKW 166


Top