BLASTX nr result

ID: Angelica22_contig00026465

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00026465
         (2100 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hy...   211   8e-52
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...   190   1e-45
emb|CAN60374.1| hypothetical protein VITISV_001215 [Vitis vinifera]   173   1e-40
gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]      172   2e-40
gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arab...   168   6e-39

>dbj|BAD99219.1| polypeptide with a gag-like domain [Petunia x hybrida]
          Length = 463

 Score =  211 bits (536), Expect = 8e-52
 Identities = 128/405 (31%), Positives = 214/405 (52%), Gaps = 11/405 (2%)
 Frame = -1

Query: 1572 GVGSTTNAWNRNVTLNSQQDPTGVYYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSIST 1393
            G GS++ + + N  L    D    YY+  SD     L++  F+G+ + NWKR +L+S+S 
Sbjct: 14   GTGSSSGSRDENSPLKVL-DINHPYYLASSDAPGMNLINTSFDGSSYGNWKRGVLISLSA 72

Query: 1392 KNKIGFFNGTEVKPGSGTSDFKGWERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDL 1213
            KNK+GF  G   KP      F+ W RC+D+V +WLLN+L   I++SVL+ +T +E+W +L
Sbjct: 73   KNKLGFITGAYKKPDKEDLLFEQWRRCSDMVLAWLLNSLSKEIAESVLYSQTAQELWQEL 132

Query: 1212 EDRFGYASMTQVYSIEQRLSELNQGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTC 1033
            E R+G    T+++ +++ L+ ++QGT  V  +F    R+   + V+   + C     C C
Sbjct: 133  EQRYGQIDGTKMFQLQRELNNVSQGTNDVAAYFNKLKRIWDQMKVLNTFMVC----SCEC 188

Query: 1032 NVTQKVH--QMQQDHKLIQFMMKLSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQEL 859
            N   K H  +MQ+D +LIQF+M L++ YS +RGNILM +PLP+ + A+ + S EE  + +
Sbjct: 189  NCEAKGHNAKMQEDQQLIQFLMGLNEVYSGIRGNILMMKPLPSTAQAYSIISHEETQRGI 248

Query: 858  SQIGS-QTESLAFIADKKGFDSQRNYRSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANF 682
            +   +  T+S AF A  + +++QRN     +  Q  N   ++Y+              N 
Sbjct: 249  AAGNNVSTDSAAFNASTQNWNNQRNNYDNRARNQNYNNRRNNYEGRRSNQN-------NS 301

Query: 681  FCTHCKISGHSVDRCFKIHG------YPA-NFKSSKDRKVADVFICNNADTVEXXXXXXX 523
            +CT+C+ SGH  + C K++G       PA N  +  +       +  N +T E       
Sbjct: 302  YCTYCRTSGHVREECRKLNGRDNRNRRPAGNPNTQANAAYEGNTMQRNGNTPENTSAGSV 361

Query: 522  XXXXXXVAQYQQLMELL-NKKSLAPSGSTSHTNVDHATMLVGPFA 391
                    Q +QL+++L +  S++ + S S  N + A   VG +A
Sbjct: 362  NAQGFTKEQCEQLIQMLQSAHSISSTASCSEVN-NSAANFVGKYA 405


>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
            gi|7268152|emb|CAB78488.1| retrovirus-related like
            polyprotein [Arabidopsis thaliana]
          Length = 1489

 Score =  190 bits (482), Expect = 1e-45
 Identities = 97/301 (32%), Positives = 165/301 (54%), Gaps = 1/301 (0%)
 Frame = -1

Query: 1500 YYIHPSDTNTTQLVSIKFN-GTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKG 1324
            YY+H +D     LVS +    + F +W+RS+L++++ +NK+GF NGT  KP     DF  
Sbjct: 34   YYLHSADHAGLILVSDRLTTASDFHSWRRSILMALNVRNKLGFINGTITKPPEDHRDFGA 93

Query: 1323 WERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELN 1144
            W RCND+V +WL+N++D  I +S+L+  T + IW++L  RF      +++ IEQ+LS++ 
Sbjct: 94   WSRCNDIVSTWLMNSVDKKIGQSLLYIATVQGIWNNLLSRFKQDDAPRIFDIEQKLSKIE 153

Query: 1143 QGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHKLIQFMMKLS 964
            QG+  +  ++   L L        V +P CTC +C C+   K   +QQ  ++ +F+ +L+
Sbjct: 154  QGSMDISTYYTALLTLWEEH-RNYVELPVCTCGRCECDAAVKWEHLQQRSRVTKFLKELN 212

Query: 963  DKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQRNY 784
            + +   R +ILM +P+P +  AF M +Q+ER + +  + ++ +S+AF             
Sbjct: 213  EGFDQTRRHILMLKPIPTIKEAFNMVTQDERQRNVKPL-TRVDSVAF------------- 258

Query: 783  RSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPANFK 604
                 N    N + ++Y A   T     K      CTHC   GH++ +C+K+HGYP   K
Sbjct: 259  ----QNTSMINEDENAYVAAYNTVRPNQKP----ICTHCGKVGHTIQKCYKVHGYPPGMK 310

Query: 603  S 601
            +
Sbjct: 311  T 311


>emb|CAN60374.1| hypothetical protein VITISV_001215 [Vitis vinifera]
          Length = 1535

 Score =  173 bits (439), Expect = 1e-40
 Identities = 108/364 (29%), Positives = 179/364 (49%), Gaps = 10/364 (2%)
 Frame = -1

Query: 1524 SQQDPTGVYYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGS 1345
            S+ D +  Y+ H SD     L+S   NG  +S W+R+M L+++ KNK+GF NG    P  
Sbjct: 34   SKSDLSNPYFTHHSDHPGLVLISKPLNGDNYSAWRRAMALALNAKNKLGFVNGIIKAPSE 93

Query: 1344 GT--SDFKGWERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYS 1171
             T   D+  W RCND+V SW++N L+  IS SV+++ T  E+W+DL +RF  ++  +++ 
Sbjct: 94   ETHPDDYATWSRCNDMVHSWIVNTLNPEISDSVIYYATAHEVWEDLCERFSQSNAPRIFE 153

Query: 1170 IEQRLSELNQGTKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHK 991
            I++ ++   Q   S+  ++     L   L               + N      Q Q   +
Sbjct: 154  IQREIAYHRQEQLSISVYYTKLKSLWDEL--------------ASYNDASSGAQ-QDQQR 198

Query: 990  LIQFMMKLSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADK 811
            L+QF+M L++ YS +RG IL+  PLP++  A+    QEE+ + LS   +  ES +  A  
Sbjct: 199  LMQFLMGLNESYSAIRGQILLMNPLPSIRQAYSSVCQEEKQRLLSATHTTAESNSSAAMA 258

Query: 810  KGFDSQRNYRSGHSNAQRQN--FNSS--SYKAGPGTNTVGNKKCANFFCTHCKISGHSVD 643
               +  +N  +G++ + R +  +NSS  S +        G+ K     CT+C   GH V+
Sbjct: 259  VRSNQMKNNSAGNARSDRSDRFYNSSQDSRRFDQDKRRSGSSK-GRPQCTYCGEMGHFVE 317

Query: 642  RCFKIHGYPANFKSSKD----RKVADVFICNNADTVEXXXXXXXXXXXXXVAQYQQLMEL 475
            +C+++HGYP     ++      +  + F+ N                    AQ QQL+ L
Sbjct: 318  KCYQLHGYPPGHPKARTGSNFNRHKNTFVANQVSDGANKDGGKSVLTGITEAQLQQLLSL 377

Query: 474  LNKK 463
            LN K
Sbjct: 378  LNDK 381



 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 44/93 (47%), Positives = 57/93 (61%), Gaps = 2/93 (2%)
 Frame = -2

Query: 275 VWHLRLGHVPFQKLKLIPPLSGLKNPSQELI--CQICPLAKQTRVSFPISSIKTKCPFEL 102
           +WH RLGHV   +L  I       N S +    C IC LAKQ R+ F  S I ++ PF+L
Sbjct: 547 LWHSRLGHVSHSRLSFIA--KNFLNFSIQFNNDCPICLLAKQHRLPFGTSEISSEKPFDL 604

Query: 101 IHIDVRGPYKVKAHNGCNQFLTIVDDYSRFTWV 3
           IH D+ G YK  + +G + FLTIVDDY+RFTW+
Sbjct: 605 IHCDIWGRYKHSSLSGAHYFLTIVDDYTRFTWI 637


>gb|ABD32757.1| Integrase, catalytic region [Medicago truncatula]
          Length = 1157

 Score =  172 bits (437), Expect = 2e-40
 Identities = 99/321 (30%), Positives = 172/321 (53%), Gaps = 6/321 (1%)
 Frame = -1

Query: 1500 YYIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKGW 1321
            YYIHPSD  ++ +++ K NG+ +  W RSM  ++  KNK+ F +G+   P     + + W
Sbjct: 15   YYIHPSDGPSSLIITPKLNGSNYLAWHRSMQRALGAKNKLVFLDGSISVPDIDDLNRQAW 74

Query: 1320 ERCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELNQ 1141
            ERCN L+ SW++N++ +SI+++++F  T    WDDL++ F      +V S+   ++ L Q
Sbjct: 75   ERCNHLIHSWIVNSVTESIAQTIVFHDTALSAWDDLKECFSKVDRVRVLSLRSTINNLKQ 134

Query: 1140 GTKSVLDFFLLRLRLCGML*VMLVH--IPCCTC-HKCTCNVTQKVHQMQQDHKLIQFMMK 970
            GTKSVLD+F   + LC +   +  H  IP CTC H C C   +     + + +++QF+  
Sbjct: 135  GTKSVLDYF---IELCTLWDELNSHRPIPNCTCIHPCRCESIRLAKYYRTEDQILQFLTG 191

Query: 969  LSDKYSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQR 790
            L+D +S ++  IL+  PLP ++  + +  QEE           ++++ F       DS  
Sbjct: 192  LNDTFSVVKTQILLMDPLPPINKVYSLVVQEE-----------SQNIVFSTPSISDDSSI 240

Query: 789  NYRSGHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPAN 610
            +  +  S+A++       Y  G GT++    K  + FCT C    H+++ C+  +G+P N
Sbjct: 241  SVNA--SDARK------FYPRGKGTSSTSANKGKDRFCTFCNRYNHTIEFCYLKYGHP-N 291

Query: 609  F---KSSKDRKVADVFICNNA 556
            F    SS +  V++    NN+
Sbjct: 292  FNKGNSSANNAVSEASDTNNS 312


>gb|AAG10817.1|AC011808_5 Putative retroelement polyprotein [Arabidopsis thaliana]
          Length = 1413

 Score =  168 bits (425), Expect = 6e-39
 Identities = 86/301 (28%), Positives = 154/301 (51%)
 Frame = -1

Query: 1497 YIHPSDTNTTQLVSIKFNGTGFSNWKRSMLLSISTKNKIGFFNGTEVKPGSGTSDFKGWE 1318
            ++H +D     +VS++ +G  ++ W  +M +++  KNKI F +G+  +P  G    + W 
Sbjct: 55   FLHNADHPGISIVSVQLDGANYNQWSSAMKIALDAKNKIAFIDGSCPRPEEGNHLLRIWS 114

Query: 1317 RCNDLVCSWLLNNLDDSISKSVLFFKTTKEIWDDLEDRFGYASMTQVYSIEQRLSELNQG 1138
            RCN +V SW+LN+++  I  S+L F    +IW+DL +RF   ++ + + + Q++ +L QG
Sbjct: 115  RCNSMVKSWILNSVNREIYGSILSFDDAAQIWNDLHNRFHMTNLPRTFQLVQQIQDLRQG 174

Query: 1137 TKSVLDFFLLRLRLCGML*VMLVHIPCCTCHKCTCNVTQKVHQMQQDHKLIQFMMKLSDK 958
            + ++  ++     L   L      +PC  C K TC             ++I+F+  L++K
Sbjct: 175  SMNLSTYYTTLKTLRDNLDGAEASVPCHCCKKSTCESQIFAKSNVNRGRIIKFLAGLNEK 234

Query: 957  YSTLRGNILMQQPLPNMSSAFRMFSQEERHQELSQIGSQTESLAFIADKKGFDSQRNYRS 778
            YS +RG I+M++PLP+++  + +  Q++  ++ S   +   S AF   K           
Sbjct: 235  YSIIRGQIIMKKPLPDLAEVYNILDQDDSQRQFS---NNVASAAFQVTK----------- 280

Query: 777  GHSNAQRQNFNSSSYKAGPGTNTVGNKKCANFFCTHCKISGHSVDRCFKIHGYPANFKSS 598
               + Q     SSS    PG      KK     C+H   +GH+ +RC+K+HGYP  +K  
Sbjct: 281  --DDVQPGALASSSNMPQPGMLGAVQKKDKKSICSHYGYTGHTSERCYKLHGYPVGWKKG 338

Query: 597  K 595
            K
Sbjct: 339  K 339



 Score = 63.5 bits (153), Expect = 2e-07
 Identities = 29/62 (46%), Positives = 39/62 (62%)
 Frame = -2

Query: 188 LICQICPLAKQTRVSFPISSIKTKCPFELIHIDVRGPYKVKAHNGCNQFLTIVDDYSRFT 9
           L C IC  AKQ ++++P        PF+L+HIDV GP+      G + FLTIVDD++R T
Sbjct: 564 LHCDICQRAKQKKLTYPSRHNICLAPFDLLHIDVWGPFSEPTQEGYHYFLTIVDDHTRVT 623

Query: 8   WV 3
           WV
Sbjct: 624 WV 625

