BLASTX nr result

ID: Angelica22_contig00001741 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00001741
         (1655 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAD22283.1| putative retroelement pol polyprotein [Arabidopsi...   375   e-101
gb|AAF79618.1|AC027665_19 F5M15.26 [Arabidopsis thaliana]             374   e-101
gb|AAD20101.1| putative retroelement pol polyprotein [Arabidopsi...   356   9e-96
gb|AAR13317.1| gag-pol polyprotein [Phaseolus vulgaris]               356   1e-95
dbj|BAK61840.1| gag-pol polyprotein [Citrus unshiu]                   355   2e-95

>gb|AAD22283.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1787

 Score =  375 bits (963), Expect = e-101
 Identities = 184/332 (55%), Positives = 235/332 (70%), Gaps = 3/332 (0%)
 Frame = -1

Query: 989  EDAVIEDPRNFDFDLDPRM---PQPMEKTGSAEDTASIFVDVEDPSKFLKIGSQLNPDLR 819
            E AV++ P  ++  L P     P P + T        + +D  DPS+ + I   L  +L+
Sbjct: 687  EFAVVDKPIIYNVILVPTQDERPNPQKGT-----VVQVNIDESDPSRCVGIRIDLPSELQ 741

Query: 818  DMLTNFLLKNLDVFAWSHADMVGIDPEVMCHHLNIDPSKRGMRQKRRPMSGERAVALKEE 639
            + L NFL +N   FAWS  DM GID  V CH LN+DP+ + ++QKRR +  +R   + EE
Sbjct: 742  NELVNFLRQNAATFAWSVEDMPGIDSAVTCHELNVDPTYKPLKQKRRKLGPDRTKDVNEE 801

Query: 638  VDRLLDVGLIKESYYPDWLANPVLVKKPNGKWRTCVDFTDLNKACPKDSFPLPRIDQLVD 459
            V +LLD G I E  YPDWL NPV+VKK NGKWR C+DFTDLNKACPKDSFPLP ID+LV+
Sbjct: 802  VKKLLDAGSIVEVRYPDWLRNPVVVKKKNGKWRVCIDFTDLNKACPKDSFPLPHIDRLVE 861

Query: 458  STAGHALLSFMDAYSGYNQIPMYGPDQEHTSFITDRGLYCYVGMPFGLINAGATYQRLVN 279
            +TAG+ LLSFMDA+SGYNQI M+  D+E T FITD+G YCY  MPFGL NAGATY RLVN
Sbjct: 862  ATAGNELLSFMDAFSGYNQILMHQNDREKTVFITDQGTYCYKVMPFGLKNAGATYPRLVN 921

Query: 278  MMFKDLIGKTMEVYVDDMLVKSMKAEDHVKHLEEMFNILRRYKMKLNPQKCVFGVESGKF 99
             MF D +  +MEVY+DDMLVKS++AE+H+ HL + F +L RY MKLNP KC FGV SG+F
Sbjct: 922  QMFTDQLDHSMEVYIDDMLVKSLRAEEHITHLRQCFQVLNRYNMKLNPSKCTFGVTSGEF 981

Query: 98   LGFIVNHRGIEANPAKIKALLDMKSPSSVKQV 3
            LG++V  RGIEANP +I A++D+ SP + ++V
Sbjct: 982  LGYLVTRRGIEANPKQISAIIDLPSPRNTREV 1013


>gb|AAF79618.1|AC027665_19 F5M15.26 [Arabidopsis thaliana]
          Length = 1838

 Score =  374 bits (961), Expect = e-101
 Identities = 175/298 (58%), Positives = 228/298 (76%)
 Frame = -1

Query: 896  TASIFVDVEDPSKFLKIGSQLNPDLRDMLTNFLLKNLDVFAWSHADMVGIDPEVMCHHLN 717
            T  + +D  DP++ + +G++++P +R  L   L +N   FAWS  DM GIDP +  H LN
Sbjct: 758  TEMVNIDESDPTRCVGVGAEISPSIRLELIALLKRNSKTFAWSIEDMKGIDPAITAHELN 817

Query: 716  IDPSKRGMRQKRRPMSGERAVALKEEVDRLLDVGLIKESYYPDWLANPVLVKKPNGKWRT 537
            +DP+ + ++QKRR +  ERA A+ EEV++LL  G I E  YP+WLANPV+VKK NGKWR 
Sbjct: 818  VDPTFKPVKQKRRKLGPERARAVNEEVEKLLKAGQIIEVKYPEWLANPVVVKKKNGKWRV 877

Query: 536  CVDFTDLNKACPKDSFPLPRIDQLVDSTAGHALLSFMDAYSGYNQIPMYGPDQEHTSFIT 357
            CVD+TDLNKACPKDS+PLP ID+LV++T+G+ LLSFMDA+SGYNQI M+  DQE TSF+T
Sbjct: 878  CVDYTDLNKACPKDSYPLPHIDRLVEATSGNGLLSFMDAFSGYNQILMHKDDQEKTSFVT 937

Query: 356  DRGLYCYVGMPFGLINAGATYQRLVNMMFKDLIGKTMEVYVDDMLVKSMKAEDHVKHLEE 177
            DRG YCY  M FGL NAGATYQR VN M  D IG+T+EVY+DDMLVKS+K EDHV+HL +
Sbjct: 938  DRGTYCYKVMSFGLKNAGATYQRFVNKMLADQIGRTVEVYIDDMLVKSLKPEDHVEHLSK 997

Query: 176  MFNILRRYKMKLNPQKCVFGVESGKFLGFIVNHRGIEANPAKIKALLDMKSPSSVKQV 3
             F++L  Y MKLNP KC FGV SG+FLG++V  RGIEANP +I+A+L++ SP + ++V
Sbjct: 998  CFDVLNTYGMKLNPTKCTFGVTSGEFLGYVVTKRGIEANPKQIRAILELPSPRNAREV 1055



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 29/94 (30%), Positives = 59/94 (62%)
 Frame = -1

Query: 1655 LLDKDMEPDNGWIYGFSGEAVKVMGSIRLPVTLGEGALSVTQMMNFMILNQESAHNALVG 1476
            + D+ ++P +  + GF G+ V  +G+I+LP+ +G     +   + F+++ + + +N ++G
Sbjct: 653  ITDRQIKPVSKPLAGFDGDFVMTIGTIKLPIFVG----GLIAWVKFVVIGKPAVYNVILG 708

Query: 1475 RPLLKEMKAVTSIYHLAMKFPTPNGIGTVRGSQQ 1374
             P + +M+A+ S YH  +KFPT NGI T+R  ++
Sbjct: 709  TPWIHQMQAIPSTYHQCVKFPTHNGIFTLRAPKE 742


>gb|AAD20101.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 764

 Score =  356 bits (914), Expect = 9e-96
 Identities = 166/253 (65%), Positives = 203/253 (80%)
 Frame = -1

Query: 761  DMVGIDPEVMCHHLNIDPSKRGMRQKRRPMSGERAVALKEEVDRLLDVGLIKESYYPDWL 582
            DMVGIDPEV CH LN+DP+ + ++QKRR +  ER+ A+ +EVD+LLD G I E  YP+WL
Sbjct: 476  DMVGIDPEVACHELNVDPTFKLVKQKRRKLGPERSKAVNDEVDKLLDAGSIVEVKYPEWL 535

Query: 581  ANPVLVKKPNGKWRTCVDFTDLNKACPKDSFPLPRIDQLVDSTAGHALLSFMDAYSGYNQ 402
            ANPV+VKK N KWR C+DFTDLNKACPKDSFPLP ID++V++T G+ LLSFMDA+SGYNQ
Sbjct: 536  ANPVVVKKKNDKWRVCIDFTDLNKACPKDSFPLPHIDRMVEATTGNELLSFMDAFSGYNQ 595

Query: 401  IPMYGPDQEHTSFITDRGLYCYVGMPFGLINAGATYQRLVNMMFKDLIGKTMEVYVDDML 222
            IPM+  DQE TSFI DRG YCY  MPFGL N GA YQRLVN MF   +GKTMEVY+DDML
Sbjct: 596  IPMHKDDQEKTSFIIDRGTYCYKVMPFGLKNVGARYQRLVNQMFAPQLGKTMEVYIDDML 655

Query: 221  VKSMKAEDHVKHLEEMFNILRRYKMKLNPQKCVFGVESGKFLGFIVNHRGIEANPAKIKA 42
            VKS ++ DH+ HL+  F  L +Y MKLNP KC+FGV SG+FLG+IV  RGIEANP +I+A
Sbjct: 656  VKSTRSADHIDHLKACFETLNKYNMKLNPAKCLFGVTSGEFLGYIVTKRGIEANPKQIRA 715

Query: 41   LLDMKSPSSVKQV 3
            +LD++SP + K+V
Sbjct: 716  ILDLQSPRNKKEV 728



 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 31/96 (32%), Positives = 56/96 (58%)
 Frame = -1

Query: 1649 DKDMEPDNGWIYGFSGEAVKVMGSIRLPVTLGEGALSVTQMMNFMILNQESAHNALVGRP 1470
            D+ ++P    + GF G  +   G+I+LP+ LG  A        F+++++ + +N ++G P
Sbjct: 385  DRHIKPSVRPLTGFDGNTMMTNGTIKLPIYLGGAAT----WHKFVVVDKPTIYNIILGTP 440

Query: 1469 LLKEMKAVTSIYHLAMKFPTPNGIGTVRGSQQDSRD 1362
             + +M+A+ S YH  +K PT  GI T+RG+Q  + D
Sbjct: 441  WIHDMQAIPSSYHQCIKIPTSIGIETIRGNQNLAHD 476


>gb|AAR13317.1| gag-pol polyprotein [Phaseolus vulgaris]
          Length = 1859

 Score =  356 bits (913), Expect = 1e-95
 Identities = 171/316 (54%), Positives = 228/316 (72%)
 Frame = -1

Query: 950  DLDPRMPQPMEKTGSAEDTASIFVDVEDPSKFLKIGSQLNPDLRDMLTNFLLKNLDVFAW 771
            DLDPR+  P  + G  ED   IF+  +D   ++  G+ L PD R+ +   L KN D+FAW
Sbjct: 821  DLDPRLDDPRMEAG--EDLQPIFLRDKDRKTYM--GTSLKPDDRETIGKTLTKNADLFAW 876

Query: 770  SHADMVGIDPEVMCHHLNIDPSKRGMRQKRRPMSGERAVALKEEVDRLLDVGLIKESYYP 591
            + ADM G+  +V+ H L++    R + QK+R +  ER  A +EE D+L+  G I++++Y 
Sbjct: 877  TAADMPGVKSDVITHRLSVYTEARPIAQKKRKLGEERRKAAREETDKLIQAGFIQKAHYT 936

Query: 590  DWLANPVLVKKPNGKWRTCVDFTDLNKACPKDSFPLPRIDQLVDSTAGHALLSFMDAYSG 411
             WLAN V+VKK NGKWR CVD+TDLNKACPKDS+PLP ID+LVD  AGH +LSF+DAYSG
Sbjct: 937  TWLANVVMVKKTNGKWRMCVDYTDLNKACPKDSYPLPTIDRLVDGAAGHQILSFLDAYSG 996

Query: 410  YNQIPMYGPDQEHTSFITDRGLYCYVGMPFGLINAGATYQRLVNMMFKDLIGKTMEVYVD 231
            YNQI MY  D+E T+F TD   + Y  MPFGL NAGATYQRL++ +F D+IG+ +EVYVD
Sbjct: 997  YNQIQMYHRDREKTAFRTDSDNFFYEVMPFGLKNAGATYQRLMDHVFHDMIGRNVEVYVD 1056

Query: 230  DMLVKSMKAEDHVKHLEEMFNILRRYKMKLNPQKCVFGVESGKFLGFIVNHRGIEANPAK 51
            D++VKS   E HV  L+E+F  LR+Y+M+LNP+KC FGVE GKFLGF++ HRGIEANP K
Sbjct: 1057 DIVVKSDSCEQHVSDLKEVFQALRQYRMRLNPEKCAFGVEGGKFLGFMLTHRGIEANPEK 1116

Query: 50   IKALLDMKSPSSVKQV 3
             KA+ +M+SP  +K++
Sbjct: 1117 CKAITEMRSPKGLKEI 1132



 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 42/104 (40%), Positives = 64/104 (61%), Gaps = 1/104 (0%)
 Frame = -1

Query: 1649 DKDMEPDNGWIYGFSGEAVKVMGSIRLPVTLGEGALSVTQMMNFMILNQESAHNALVGRP 1470
            + +++P N  I GFS E V   G I L  T G+  LS T  + ++++N  +++N L+GRP
Sbjct: 677  EAEIQPYNEQIVGFSRERVDTKGFIDLYTTFGDDYLSKTINIRYLLVNANTSYNILLGRP 736

Query: 1469 LLKEMKAVTSIYHLAMKFPTPNG-IGTVRGSQQDSRDCYHKEVK 1341
             +  +KA+ S  HLAMKFP+ NG I TV   Q+ +R+CY   +K
Sbjct: 737  SINRLKAIVSTPHLAMKFPSVNGDIATVHIDQKTARECYVASLK 780


>dbj|BAK61840.1| gag-pol polyprotein [Citrus unshiu]
          Length = 1542

 Score =  355 bits (911), Expect = 2e-95
 Identities = 179/315 (56%), Positives = 226/315 (71%)
 Frame = -1

Query: 947  LDPRMPQPMEKTGSAEDTASIFVDVEDPSKFLKIGSQLNPDLRDMLTNFLLKNLDVFAWS 768
            LD R      +    E    +FV+  +PS+ +KIGS L   ++  L  +L    D+FAWS
Sbjct: 699  LDNRGDSKKGRQEPVEKLDEVFVNKSNPSRMVKIGSGLGETIKGELVKYLQSYADIFAWS 758

Query: 767  HADMVGIDPEVMCHHLNIDPSKRGMRQKRRPMSGERAVALKEEVDRLLDVGLIKESYYPD 588
            H DM  ID  +  H L I      +RQKRR  + ER  A+  EV++LL  G I+++ Y +
Sbjct: 759  HEDMPRIDHGIAYHKLAIRKGAMPVRQKRRCFNQERYEAINAEVEQLLKAGFIRKTKYSE 818

Query: 587  WLANPVLVKKPNGKWRTCVDFTDLNKACPKDSFPLPRIDQLVDSTAGHALLSFMDAYSGY 408
            W++N +LVKK +   R CVDFTDLNKACPKDSFPL +IDQLVDSTAGH+LLSFMD +SGY
Sbjct: 819  WISNVILVKKAS---RMCVDFTDLNKACPKDSFPLQKIDQLVDSTAGHSLLSFMDTFSGY 875

Query: 407  NQIPMYGPDQEHTSFITDRGLYCYVGMPFGLINAGATYQRLVNMMFKDLIGKTMEVYVDD 228
            NQIPM   D+E T+FIT+ GL+CY  MPFGL N GATYQRLVN +FK LIG TMEVYVDD
Sbjct: 876  NQIPMDEQDEESTTFITNIGLFCYRVMPFGLKNTGATYQRLVNKIFKPLIGHTMEVYVDD 935

Query: 227  MLVKSMKAEDHVKHLEEMFNILRRYKMKLNPQKCVFGVESGKFLGFIVNHRGIEANPAKI 48
            M+ KS K +DHVKHLEE F +LR+Y+MKLNP+KC FGV SGKFLGF+V+HR I+ NPAKI
Sbjct: 936  MITKSKKPKDHVKHLEETFELLRKYEMKLNPEKCAFGVSSGKFLGFLVSHRRIKVNPAKI 995

Query: 47   KALLDMKSPSSVKQV 3
            +A+ ++KS  +VK+V
Sbjct: 996  RAVTEIKSLRTVKEV 1010



 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 31/103 (30%), Positives = 55/103 (53%)
 Frame = -1

Query: 1649 DKDMEPDNGWIYGFSGEAVKVMGSIRLPVTLGEGALSVTQMMNFMILNQESAHNALVGRP 1470
            D  +E  N  + GF G  +  MG   LP+T+G        M+ F+++ + S +  ++GRP
Sbjct: 589  DLKLERTNTSLKGFGGGRLTPMGINELPITVGSKPFERIVMLYFVVVEERSPYQMILGRP 648

Query: 1469 LLKEMKAVTSIYHLAMKFPTPNGIGTVRGSQQDSRDCYHKEVK 1341
             ++  + V S ++LA+K+     +G V+G Q+ +R CY    K
Sbjct: 649  FIRISQCVISTHYLALKYRVNGVVGVVKGVQKMTRSCYATAAK 691


Top