BLASTX nr result

ID: Angelica23_contig00024256 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00024256
         (1228 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   147   5e-52
gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00...   126   8e-49
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   141   2e-47
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       130   4e-45
gb|AAC67331.1| putative non-LTR retroelement reverse transcripta...   109   3e-43

>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  147 bits (371), Expect(2) = 5e-52
 Identities = 77/223 (34%), Positives = 123/223 (55%), Gaps = 1/223 (0%)
 Frame = +1

Query: 4   NYDSHPNGRIWLGWDPNLWEVQILASNAQHISCSISRIGNNDSFLISFVYALNTSIERRI 183
           NY+    GRIW+ WDP + EV +L+ + Q ISC++     +  F+++FVYA+N    RR 
Sbjct: 61  NYEFAALGRIWVVWDPAV-EVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRR 119

Query: 184 LWGNLLDFKRQHVDVNLVPWTVLGDFNVCLNMDEMDGGSVSFSRGMIEFKDFLDDAEVFD 363
           LW  L +    +   +  PW +LGDFN  L+  +   G    +RGM EF++ L  + + D
Sbjct: 120 LWSEL-ELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISD 178

Query: 364 LYFSGSFLTWWDSNKTNPTHRKLDRVLVNESWISSFASSRAQFLPRGLSDHCPTLVYIGL 543
           L F G+  TWW++ + NP  +K+DR+LVN+SW+ +   S   F     SDHCP+ V I  
Sbjct: 179 LPFRGNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNISN 238

Query: 544 VVEKIFKPFQVFQHIIQSPDFLSSVQAAWN-VDISGDPWFVLT 669
                 KPF++   ++  P+F+  ++  W+ +   G   F L+
Sbjct: 239 QSGGRNKPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLS 281



 Score = 85.1 bits (209), Expect(2) = 5e-52
 Identities = 58/170 (34%), Positives = 84/170 (49%), Gaps = 6/170 (3%)
 Frame = +2

Query: 737  KVKEAQSNLIAYQESLPCVPSLEQFEEEERLCLSLSHCLKLEETFLKQKSRVNWLKVGDN 916
            +V +A  NL   Q +L   PS      E+    S +     EE FL QKSRV WLK GD+
Sbjct: 305  RVVQAAQNLKTCQNNLLAAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKCGDS 364

Query: 917  NNSCFFKSCKSRWNSNKLLALEDDQGNTFTTHRDISNLTVEYFKSILGTSVSVPSIEDF- 1093
            N + F +   +R   N++  L D  G       ++    V++FK + G+S  + S E   
Sbjct: 365  NTTFFHRMMTARRAINEIHYLLDQTGRRIENTDELQTHCVDFFKELFGSSSHLISAEGIS 424

Query: 1094 QLPGISEDQC-----QLLLAPFTREEIAPVFKKMVKNKSPGPDGFT*EFF 1228
            Q+  ++  +C     QLL A  +  +I   F  +  NKSPGPDG+T EFF
Sbjct: 425  QINSLTRFKCDENTRQLLEAEVSEADIKSEFFALPSNKSPGPDGYTSEFF 474


>gb|AAD21699.1| Contains reverse transcriptase domain (rvt) PF|00078 [Arabidopsis
           thaliana]
          Length = 1253

 Score =  126 bits (317), Expect(2) = 8e-49
 Identities = 70/184 (38%), Positives = 106/184 (57%), Gaps = 2/184 (1%)
 Frame = +1

Query: 124 NDSFLISFVYALNTSIERRILWGNLLDFKRQHVDVNLVPWTVLGDFN-VCLNMDEMDGGS 300
           +DS ++S VYA N +I R+ LW  LL      +  N  PW +LGDFN V    +     S
Sbjct: 50  DDSVVVSIVYAANEAITRKELWEELL-LLSVSLSGNGKPWIMLGDFNQVLCPAEHSQATS 108

Query: 301 VSFSRGMIEFKDFLDDAEVFDLYFSGSFLTWWDSNKTNPTHRKLDRVLVNESWISSFASS 480
           ++ +R M  F+D L +AE+ DL F G+  TWW+ + T P  +KLDR+LVNESW S F S+
Sbjct: 109 LNVNRRMKVFRDCLFEAELCDLVFKGNTFTWWNKSATRPVAKKLDRILVNESWCSRFPSA 168

Query: 481 RAQFLPRGLSDHCPTLVYIGLVVEKIFKPFQVFQHIIQSPDFLSSVQAAW-NVDISGDPW 657
            A F     SDH    V I  ++ +  +PF+ +  ++Q+PDF+S V   W ++++ G   
Sbjct: 169 YAVFGEPDFSDHASCGVIINPLMHREKRPFRFYNFLLQNPDFISLVGELWYSINVVGSSM 228

Query: 658 FVLT 669
           F ++
Sbjct: 229 FKMS 232



 Score = 95.1 bits (235), Expect(2) = 8e-49
 Identities = 66/177 (37%), Positives = 91/177 (51%), Gaps = 9/177 (5%)
 Frame = +2

Query: 725  NMHLKVKEAQSNLIAYQE----SLPCVPSLEQFEEEERLCLSLSHCLKLEETFLKQKSRV 892
            N+  +VKEA  NL+ Y++    S P +P+     E +R  L L   +K EE+F  Q+SRV
Sbjct: 252  NLEKRVKEAH-NLVLYRQNKTLSDPTIPNAALEMEAQRKWLIL---VKAEESFFCQRSRV 307

Query: 893  NWLKVGDNNNSCFFKSCKSRWNSNKLLALEDDQGNTFTTHRDISNLTVEYFKSILGTSVS 1072
             W+  GD+N S F +   SR   N +  + DD G    T   I    +EYF ++LG  V 
Sbjct: 308  TWMGEGDSNTSYFHRMADSRKAVNTIHIIIDDNGVKIDTQLGIKEHCIEYFSNLLGGEVG 367

Query: 1073 VPSI--EDFQLP---GISEDQCQLLLAPFTREEIAPVFKKMVKNKSPGPDGFT*EFF 1228
             P +  EDF L      S DQ + L   F+R++I   F     NK+ GPDGF  EFF
Sbjct: 368  PPMLIQEDFDLLLPFRCSHDQKKELAMSFSRQDIKSAFFSFPSNKTSGPDGFPVEFF 424


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
           putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  141 bits (356), Expect(2) = 2e-47
 Identities = 82/221 (37%), Positives = 123/221 (55%), Gaps = 2/221 (0%)
 Frame = +1

Query: 4   NYDSHPNGRIWLGWDPNLWEVQILASNAQHISCSISRIGNNDSFLISFVYALNTSIERRI 183
           NY     G+IW+ WDP++ EV I+A + Q I+C +    +    +IS VYA N   +R+ 
Sbjct: 54  NYGFSDLGKIWVLWDPSV-EVVIVAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKE 112

Query: 184 LWGNLLDFKRQHVDVNLVPWTVLGDFNVCLNMDEMDGG-SVSFSRGMIEFKDFLDDAEVF 360
           LW  +       V  N  PW +LGDFN  L+  E     S++  R + +F++ L DAE+ 
Sbjct: 113 LWREITALVASPVTFNR-PWILLGDFNQVLHPHEHSRHVSLNVDRRIRDFRECLLDAELS 171

Query: 361 DLYFSGSFLTWWDSNKTNPTHRKLDRVLVNESWISSFASSRAQFLPRGLSDHCPTLVYIG 540
           DL + GS  TWW+ +KT P  +K+DR+LVNESW + F SS   F P   SDH    V + 
Sbjct: 172 DLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPDFSDHASCGVVLE 231

Query: 541 LVVEKIFKPFQVFQHIIQSPDFLSSVQAAW-NVDISGDPWF 660
           L   K  +PF+ F  ++++P+FL+ V   W + ++ G   F
Sbjct: 232 LDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMF 272



 Score = 75.1 bits (183), Expect(2) = 2e-47
 Identities = 49/168 (29%), Positives = 76/168 (45%), Gaps = 5/168 (2%)
 Frame = +2

Query: 725  NMHLKVKEAQSNLIAYQESLPCVPSLEQFEEEERLCLSLSHCLKLEETFLKQKSRVNWLK 904
            N+  + +EA   L+++Q      PSLE    E             EE+F +Q+SRV W  
Sbjct: 295  NLEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRSRVTWFA 354

Query: 905  VGDNNNSCFFKSCKSRWNSNKLLALEDDQGNTFTTHRDISNLTVEYFKSILGTSVSVPSI 1084
             GD N   F +   SR + N +  L DD G    + + I++    YF+++L       S+
Sbjct: 355  EGDGNTRYFHRMADSRKSVNTITTLVDDSGTQIDSQQGIADHCALYFENLLSDDNDPYSL 414

Query: 1085 EDFQLPGISEDQCQL-----LLAPFTREEIAPVFKKMVKNKSPGPDGF 1213
            E   +  +   +C       L A F+ E+I   F  +  NK+ GPDGF
Sbjct: 415  EQDDMNLLLTYRCPYSQVADLEAMFSDEDIKAAFFGLPSNKACGPDGF 462


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  130 bits (327), Expect(2) = 4e-45
 Identities = 75/221 (33%), Positives = 121/221 (54%), Gaps = 2/221 (0%)
 Frame = +1

Query: 4   NYDSHPNGRIWLGWDPNLWEVQILASNAQHISCSISRIGNNDSFLISFVYALNTSIERRI 183
           NY     G+IW+ WDP++ +V ++A + Q I+C +   G+    ++S VYA N    R+ 
Sbjct: 62  NYAFSDLGKIWVMWDPSV-QVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKE 120

Query: 184 LWGNLLDFKRQHVDVNLVPWTVLGDFNVCLNMDEMDGG-SVSFSRGMIEFKDFLDDAEVF 360
           LW  +++     + +   PW VLGDFN  LN  E     S++    M +F+D L  AE+ 
Sbjct: 121 LWIEIVNMVVSGI-IGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELS 179

Query: 361 DLYFSGSFLTWWDSNKTNPTHRKLDRVLVNESWISSFASSRAQFLPRGLSDHCPTLVYIG 540
           DL + G+  TWW+ + T P  +K+DR+LVN+SW + F SS   F     SDH    V + 
Sbjct: 180 DLRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDHVSCGVVLE 239

Query: 541 LVVEKIFKPFQVFQHIIQSPDFLSSVQAAW-NVDISGDPWF 660
               K  +PF+ F +++++ DFL+ V+  W  +++ G   F
Sbjct: 240 ETSIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMF 280



 Score = 79.0 bits (193), Expect(2) = 4e-45
 Identities = 58/172 (33%), Positives = 82/172 (47%), Gaps = 8/172 (4%)
 Frame = +2

Query: 737  KVKEAQSNLIAYQESLPCVPSL--EQFE-EEERLCLSLSHCLKLEETFLKQKSRVNWLKV 907
            + KEA   LI  Q+     P+     FE E ER    L+     EE+F +QKSR++W   
Sbjct: 307  RTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAA---EESFFRQKSRISWFAE 363

Query: 908  GDNNNSCFFKSCKSRWNSNKLLALEDDQGNTFTTHRDISNLTVEYFKSILGTSVSVPSIE 1087
            GD N   F +   +R +SN + AL D  G    +   I +L   YF S+LG  V    +E
Sbjct: 364  GDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDEVDPYLME 423

Query: 1088 DFQLPGISEDQCQ-----LLLAPFTREEIAPVFKKMVKNKSPGPDGFT*EFF 1228
               +  +   +C       L + F+ E+I      + +NKS GPDGFT EFF
Sbjct: 424  QNDMNLLLSYRCSPAQVCELESTFSNEDIRAALFSLPRNKSCGPDGFTAEFF 475


>gb|AAC67331.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1449

 Score =  109 bits (273), Expect(2) = 3e-43
 Identities = 68/217 (31%), Positives = 105/217 (48%), Gaps = 8/217 (3%)
 Frame = +1

Query: 4    NYDSHPNGRIWLGWDPNLWEVQILASNAQHISCSISRIGNNDSFLISFVYALNTSIERRI 183
            NY+ +  GR+W+ W  N+       S+ Q I+CS+      + F  SFVYA N + ER+I
Sbjct: 475  NYEFNRRGRLWVVWRENVRFTPFYKSD-QLITCSVKLESQEEEFFYSFVYASNFAEERKI 533

Query: 184  LWGNLLDFKRQHVDVNLV---PWTVLGDFNVCLNMDEMDG--GSVSFSRGMIEFKDFLDD 348
            LW +L    R H+D  ++   PW + GDFN  L+MDE        + + GM +F+  ++ 
Sbjct: 534  LWNDL----RDHMDSPIIRDKPWIIFGDFNEILDMDEHSRMEDHPAVTSGMRDFQSLVNY 589

Query: 349  AEVFDLYFSGSFLTWWDSNKTNPTHRKLDRVLVNESWISSFASSRAQFLPRGLSDHCPTL 528
                DL   G   TW +    +P  +KLDRV+VNE+W   +  S   F   G SDH    
Sbjct: 590  CSFSDLASHGPLFTWCNKRDNDPIWKKLDRVMVNEAWKMVYPQSYNVFEAGGCSDHLRCR 649

Query: 529  VYIGL---VVEKIFKPFQVFQHIIQSPDFLSSVQAAW 630
            + + +      +  KPF+    +    +F   V+  W
Sbjct: 650  INLNMNSGAQVRGNKPFKFVNAVADMEEFKPLVENFW 686



 Score = 93.6 bits (231), Expect(2) = 3e-43
 Identities = 56/176 (31%), Positives = 87/176 (49%), Gaps = 6/176 (3%)
 Frame = +2

Query: 719  MGNMHLKVKEAQSNLIAYQESLPCVPSLEQFEEEERLCLSLSHCLKLEETFLKQKSRVNW 898
            MGN+  + +EA  +L   Q+S    PS    E E    +       +EE +LKQ S+++W
Sbjct: 721  MGNLVKRTREAYLSLCQAQQSNSQNPSQRAMEIESEAYVRWDRIASIEEKYLKQVSKLHW 780

Query: 899  LKVGDNNNSCFFKSCKSRWNSNKLLALEDDQGNTFTTHRDISNLTVEYFKSILG------ 1060
            LKVGD NN  F ++  +R   N +  ++ + G+T TT  DI N T  +F+  L       
Sbjct: 781  LKVGDKNNKTFHRAATARAAQNSIREIQKEDGSTATTKDDIKNETERFFQEFLQLIPNDY 840

Query: 1061 TSVSVPSIEDFQLPGISEDQCQLLLAPFTREEIAPVFKKMVKNKSPGPDGFT*EFF 1228
              ++V  +        S  +  +L A  + +EI      M  +KSPGPDG+T EF+
Sbjct: 841  EGITVEKLTSLLPYHCSPAEKDMLTASVSAKEIRGALFSMPNDKSPGPDGYTSEFY 896

