BLASTX nr result

ID: Angelica23_contig00009925 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00009925
         (1120 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   119   7e-42
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       127   4e-41
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   134   7e-41
ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|2...   120   1e-39
dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like ...   125   1e-37

>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
           putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  119 bits (298), Expect(2) = 7e-42
 Identities = 75/228 (32%), Positives = 114/228 (50%), Gaps = 7/228 (3%)
 Frame = +2

Query: 5   CSYNIRGLNNKKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHFNYNSH 181
           C +NI    N     + +  VN      ++E HVKQ       KFI+A    W F+ N  
Sbjct: 3   CGFNIPSHRNG---FKKWFKVNRPIFGGVIEKHVKQPKD---KKFINALLPGWFFDENYG 56

Query: 182 YN--GRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWR 355
           ++  G+IWV +DP +    +++ + Q ITC V    S+    +S VYA N   +R+ LWR
Sbjct: 57  FSDLGKIWVLWDPSV-EVVIVAKSLQMITCEVLFPNSRTWIVISVVYAANEDDKRKELWR 115

Query: 356 DLESVRS--LIGNTAWCLSGDFNDCLGPSESSNHANWNAG--MLEFKDASFQLGVTDLKS 523
           ++ ++ +  +  N  W L GDFN  L P E S H + N    + +F++      ++DL  
Sbjct: 116 EITALVASPVTFNRPWILLGDFNQVLHPHEHSRHVSLNVDRRIRDFRECLLDAELSDLVY 175

Query: 524 SGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDH 667
            G  FTWW+     P+ KK+DR LVN+ W + FP +     P   SDH
Sbjct: 176 KGSSFTWWNKSKTRPVAKKIDRILVNESWSNLFPSSFGLFGPPDFSDH 223



 Score = 79.0 bits (193), Expect(2) = 7e-42
 Identities = 45/141 (31%), Positives = 70/141 (49%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAW-NTNIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K  +PF+FF  L+K P F+  V D W +TN+ G     ++ K++ +K+ ++  +  N  N
Sbjct: 236  KAKRPFKFFNFLLKNPEFLNLVWDVWYSTNVVGSSMFRVSKKLKALKKPIKDFSRLNYSN 295

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            L  +   A   LL+FQ+   D PS+     E   +       + EE F +Q+SRV+W   
Sbjct: 296  LEKRTEEAHETLLSFQNLTLDNPSLENAAHELEAQRKWQILATAEESFFRQRSRVTWFAE 355

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GDGN   F      R + N I
Sbjct: 356  GDGNTRYFHRMADSRKSVNTI 376


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  127 bits (319), Expect(2) = 4e-41
 Identities = 84/231 (36%), Positives = 117/231 (50%), Gaps = 9/231 (3%)
 Frame = +2

Query: 2   FCSYNIRGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHF-- 166
           FC +NIRG NN   ++  + ++  N      ++ETHVKQ       KFI+A    W F  
Sbjct: 6   FC-WNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKD---RKFINALLPGWSFVE 61

Query: 167 NYNSHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRL 346
           NY     G+IWV +DP + +  V++ + Q ITC V    S     VS VYA N    R+ 
Sbjct: 62  NYAFSDLGKIWVMWDPSV-QVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKE 120

Query: 347 LWRDLES--VRSLIGNTAWCLSGDFNDCLGPSESSNHANWNA--GMLEFKDASFQLGVTD 514
           LW ++ +  V  +IG+  W + GDFN  L P E SN  + N    M +F+D      ++D
Sbjct: 121 LWIEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSD 180

Query: 515 LKSSGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDH 667
           L+  G  FTWW+     P+ KK+DR LVND W   FP +         SDH
Sbjct: 181 LRYKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLDFSDH 231



 Score = 68.2 bits (165), Expect(2) = 4e-41
 Identities = 42/141 (29%), Positives = 66/141 (46%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAWNT-NIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K  +PF+FF +L+K   F+  V D W T N+ G     ++ K++ +K+ ++  +  N   
Sbjct: 244  KAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNYSE 303

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            L  +   A   L+  QD     P+    + E   +       + EE F +QKSR+SW   
Sbjct: 304  LEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFRQKSRISWFAE 363

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GDGN   F      R +SN I
Sbjct: 364  GDGNTKYFHRMADARNSSNSI 384


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  134 bits (337), Expect(2) = 7e-41
 Identities = 81/236 (34%), Positives = 125/236 (52%), Gaps = 7/236 (2%)
 Frame = +2

Query: 8   SYNIRGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKFNWHF--NYN 175
           S+N+RG NN  ++   R +  ++     +ILET VK+  +      +S+   W    NY 
Sbjct: 6   SWNVRGFNNSVRRRNFRKWFKLSKALFGSILETRVKEHRARR--SLLSSFPGWKSVCNYE 63

Query: 176 SHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWR 355
               GRIWV +DP +   TV+S + Q I+C+V+      +F V+FVYA N    RR LW 
Sbjct: 64  FAALGRIWVVWDPAV-EVTVLSKSDQTISCTVKLPHISTEFVVTFVYAVNCRYGRRRLWS 122

Query: 356 DLE--SVRSLIGNTAWCLSGDFNDCLGPSESSNHANW-NAGMLEFKDASFQLGVTDLKSS 526
           +LE  +      +  W + GDFN  L P ++S   +    GM EF++      ++DL   
Sbjct: 123 ELELLAANQTTSDKPWIILGDFNQSLDPVDASTGGSRITRGMEEFRECLLTSNISDLPFR 182

Query: 527 GQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPLSTSLGH 694
           G  +TWW++   +P+ KK+DR LVND WL + PL++ S      SDHCP   ++ +
Sbjct: 183 GNHYTWWNNQENNPIAKKIDRILVNDSWLIASPLSYGSFCAMEFSDHCPSCVNISN 238



 Score = 60.5 bits (145), Expect(2) = 7e-41
 Identities = 45/138 (32%), Positives = 60/138 (43%), Gaps = 2/138 (1%)
 Frame = +1

Query: 712  KPFQFFKHLIKAPGFMEAVSDAWNT-NIPGDPWLVLTSKIRRVKQAMRTLNA-NTGNLHL 885
            KPF+    L+  P F+E +   W+     G     L+ K + +K  +RT N  +   L  
Sbjct: 245  KPFKLSNFLMHHPEFIEKIRVTWDRLAYQGSAMFTLSKKSKFLKGTIRTFNREHYSGLEK 304

Query: 886  KVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKSGDG 1065
            +V  A   L T Q+NL   PS      E     +       EE FL QKSRV WLK GD 
Sbjct: 305  RVVQAAQNLKTCQNNLLAAPSSYLAGLEKEAHRSWAELALAEERFLCQKSRVLWLKCGDS 364

Query: 1066 NNSCFFNYCKGRWNSNKI 1119
            N + F      R   N+I
Sbjct: 365  NTTFFHRMMTARRAINEI 382


>ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|222873371|gb|EEF10502.1|
            predicted protein [Populus trichocarpa]
          Length = 819

 Score =  120 bits (302), Expect(2) = 1e-39
 Identities = 71/222 (31%), Positives = 113/222 (50%), Gaps = 3/222 (1%)
 Frame = +2

Query: 20   RGLNN--KKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKFNWHFNYNSHYNGR 193
            RGLN+  K + +R  +    I+L  ++ET VK +   ++S+ +   +++ +NY+    GR
Sbjct: 386  RGLNDPIKHSELRRLIHQERIALFGLVETRVKDKNKDNVSQLLLRSWSFLYNYDFSCRGR 445

Query: 194  IWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERRLLWRDLESVR 373
            IWV ++    +  V   + Q I  SV  +A+   F  S +Y  N    R  LW D+ S  
Sbjct: 446  IWVCWNADTVKVDVFGMSDQAIHVSVTILATNISFNTSIIYGDNNASLREALWSDIVSRS 505

Query: 374  SLIGNTAWCLSGDFNDCLGPSESSNHANWNAGMLEFKDASF-QLGVTDLKSSGQKFTWWD 550
                +T W L GDFN     S+    +   AG ++  D    +  V DL+ SG  +TW +
Sbjct: 506  DGWESTLWILIGDFNAIRNQSDRLGGSTTWAGTMDRLDTCIREAKVDDLRYSGMHYTWSN 565

Query: 551  SCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPL 676
             C  + + +KLDR LVN+ W   FPL+    +P G+SDH P+
Sbjct: 566  QCPENLIMRKLDRVLVNEKWNLKFPLSEARFLPSGMSDHSPM 607



 Score = 70.1 bits (170), Expect(2) = 1e-39
 Identities = 46/145 (31%), Positives = 67/145 (46%), Gaps = 9/145 (6%)
 Frame = +1

Query: 712  KPFQFFKHLIKAPGFMEAVSDAWNTNIPGDPWLVLTSKIRRVKQAMRTLN-ANTGNLHLK 888
            KPF+FF   +    FM  V   W+ N  G P   L  K+R++KQ ++  N A+  N+  +
Sbjct: 620  KPFRFFDMWMDHDEFMPLVKKVWDQNSRGCPMYQLCCKLRKLKQELKLFNMAHFSNISDR 679

Query: 889  VSMARSELLTFQDNLPDCPSVAQLTEENRLKC--------NLTAALSEEEIFLKQKSRVS 1044
            V  A++++   Q  L           EN + C           + +  EE F KQK+R+ 
Sbjct: 680  VRDAKNKMDKAQQAL-------HTAHENPILCMRERDVVHKYASTVRAEESFFKQKARIQ 732

Query: 1045 WLKSGDGNNSCFFNYCKGRWNSNKI 1119
            WL  GD N S F     GR N NK+
Sbjct: 733  WLSLGDQNTSYFHKSVNGRQNRNKL 757


>dbj|BAB01845.1| non-LTR retroelement reverse transcriptase-like protein
           [Arabidopsis thaliana]
          Length = 893

 Score =  125 bits (315), Expect(2) = 1e-37
 Identities = 86/243 (35%), Positives = 121/243 (49%), Gaps = 8/243 (3%)
 Frame = +2

Query: 2   FCSYNIRGLN---NKKAYVRDFLTVNNISLLAILETHVKQEASVSISKFISAKF-NWHF- 166
           FC +N+RG N   +++ + + FL +N      ++ETHVKQ       KFIS     W F 
Sbjct: 6   FC-WNVRGFNISSHRRGFKKWFL-LNKPLFGGLIETHVKQPKE---KKFISNLLPGWSFV 60

Query: 167 -NYNSHYNGRIWVGFDPLIWRCTVISNTAQQITCSVQKIASKEKFYVSFVYAFNTPQERR 343
            NY     G+IWV +DP + +  VI  + Q ITC +    S   F VS VYA N    R+
Sbjct: 61  ENYEFSVLGKIWVLWDPSV-KVVVIGRSLQMITCELLLPDSPSWFVVSIVYASNEEGTRK 119

Query: 344 LLWRDLE--SVRSLIGNTAWCLSGDFNDCLGPSESSNHANWNAGMLEFKDASFQLGVTDL 517
            LW +L   ++  ++   +W + GDFN  L P ES+ +AN    +  F+       + DL
Sbjct: 120 ELWNELVQLALSPVVVGRSWIVLGDFNQILNP-ESAINANIGRKIRAFRSCLLDSDLYDL 178

Query: 518 KSSGQKFTWWDSCIRDPLFKKLDRCLVNDYWLHSFPLAHVSIMPRGLSDHCPLSTSLGHA 697
              G  +TWW+ C   PL KK+DR LVND+W   FP A+ +      SDH      L  A
Sbjct: 179 VYKGSSYTWWNKCSSRPLAKKIDRILVNDHWNTLFPSAYANFGEPDFSDHSSCEVVLDPA 238

Query: 698 AEK 706
             K
Sbjct: 239 VLK 241



 Score = 58.2 bits (139), Expect(2) = 1e-37
 Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 2/141 (1%)
 Frame = +1

Query: 703  KIPKPFQFFKHLIKAPGFMEAVSDAW-NTNIPGDPWLVLTSKIRRVKQAMRTLNA-NTGN 876
            K  +PF+FF + +  P F++ + + W + N+ G     ++ K++ +K  +   +  N  +
Sbjct: 241  KAKRPFRFFNYFLHNPDFLQLIRENWYSCNVSGSAMYRVSKKLKHLKLPICCFSRENYSD 300

Query: 877  LHLKVSMARSELLTFQDNLPDCPSVAQLTEENRLKCNLTAALSEEEIFLKQKSRVSWLKS 1056
            +  +VS A + +L  Q      PSV   T E             EE F  QKS +SWL  
Sbjct: 301  IEKRVSEAHAIVLHRQRITLTNPSVVHATLELEATRKWQILAKAEESFFCQKSSISWLYE 360

Query: 1057 GDGNNSCFFNYCKGRWNSNKI 1119
            GD N + F      R + N I
Sbjct: 361  GDNNTAYFHKMADMRKSINTI 381

