BLASTX nr result

ID: Angelica23_contig00007524 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00007524
         (1764 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   155   3e-35
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           141   6e-31
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   141   6e-31
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   129   2e-27
gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]       124   4e-27

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  155 bits (392), Expect = 3e-35
 Identities = 101/365 (27%), Positives = 156/365 (42%), Gaps = 4/365 (1%)
 Frame = -1

Query: 1461 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 1282
            G  EG    K LG+     +L    C  LV +IT ++  WTC  L  A R+QL+ S +F 
Sbjct: 48   GFREGELPMKYLGVPLLSSRLKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFS 107

Query: 1281 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 1102
            I  YW +   LP +VI  V+ +M  FLW GS       K++W+  CLPK  GGLG + + 
Sbjct: 108  IQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIK 167

Query: 1101 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 922
             WN+ ++   +W        S+W   + +  LR R     + P    W +   L  RS+A
Sbjct: 168  EWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLA 227

Query: 921  LPKIYV*CGVKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTNLEEI*NCHLNDQWA 742
             PK+    G        +      H       + G +FI         ++     N +W 
Sbjct: 228  WPKMKYIIGDGM---TTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNVLIQNSEWK 284

Query: 741  VTSSN----HDIAFAVRHSISTTHISGHDEILWDGLQLKHVSVSAVWHSFRQSAPQPPWL 574
              ++     H I  A+  S S   +   DE++W        SV   W   R+      W 
Sbjct: 285  TPTTQAIGWHPIIEAI-PSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQLRRHRQMVEWH 343

Query: 573  GAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNAATESANHLFMQCP 394
              +W    +P+ +  LW A+   + T+D + +F +   N  C LC    E  NHLF +C 
Sbjct: 344  DIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPN-RCSLCLRNNEDHNHLFFECS 402

Query: 393  YAQSV 379
            Y +++
Sbjct: 403  YTKAI 407


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  141 bits (355), Expect = 6e-31
 Identities = 121/478 (25%), Positives = 191/478 (39%), Gaps = 46/478 (9%)
 Frame = -1

Query: 1461 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 1282
            G   G    + LG+     KL +AD  PL+ K++ R+  W    L  A R QL+ S IFG
Sbjct: 614  GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673

Query: 1281 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 1102
            +  +W +   LP   I K++SL   FLW GS+  R   K+SW  CCLPKS GGLG R   
Sbjct: 674  LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733

Query: 1101 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 922
             WN+  +   +W  L  + +S+W        L +            PW +K  L+ R +A
Sbjct: 734  EWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLA 792

Query: 921  LPKIYV*CG------------------VKFKVPRVARPVARLHTAEVKNMADGSQFISIV 796
               I    G                  +K+     +RP+    +A+V +  DGS      
Sbjct: 793  EKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGS------ 846

Query: 795  ESTNLEEI*NCHLNDQWAVTSSNHDIAFAVRHSIST----THISGHDEILW--DGLQLKH 634
                            W +  S    A ++   +++    + +   D   W  D +  + 
Sbjct: 847  ---------------GWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQG 891

Query: 633  VSVSAVWHSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNP 454
             S +  W   R   P   W  ++W    +PK     W A LN + T+  ++ + + V + 
Sbjct: 892  FSAAKTWEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL-VSSA 950

Query: 453  PCLLCNAATESANHLFMQCPYAQSV-----LQASPMTYHFLVSWNPYLQW--QFHVGQVT 295
             C LC+  TE+ +HL + C ++  V     L+  P     L +W   L W  Q      +
Sbjct: 951  ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCP-RQRLLCTWAELLSWTRQSTAAAPS 1009

Query: 294  TMRKQISYLY---------LAVVLQRMIHCNQVTRLIRLHL------MRHYASWQSEL 166
             +RK ++ L          L +     + C+ V RL+   L       RH   W+  L
Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  141 bits (355), Expect = 6e-31
 Identities = 121/478 (25%), Positives = 191/478 (39%), Gaps = 46/478 (9%)
 Frame = -1

Query: 1461 GLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFG 1282
            G   G    + LG+     KL +AD  PL+ K++ R+  W    L  A R QL+ S IFG
Sbjct: 614  GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673

Query: 1281 I*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD 1102
            +  +W +   LP   I K++SL   FLW GS+  R   K+SW  CCLPKS GGLG R   
Sbjct: 674  LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733

Query: 1101 *WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPARTPWCFKNFLSARSIA 922
             WN+  +   +W  L  + +S+W        L +            PW +K  L+ R +A
Sbjct: 734  EWNKTLLLRLIW-VLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLA 792

Query: 921  LPKIYV*CG------------------VKFKVPRVARPVARLHTAEVKNMADGSQFISIV 796
               I    G                  +K+     +RP+    +A+V +  DGS      
Sbjct: 793  EKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGS------ 846

Query: 795  ESTNLEEI*NCHLNDQWAVTSSNHDIAFAVRHSIST----THISGHDEILW--DGLQLKH 634
                            W +  S    A ++   +++    + +   D   W  D +  + 
Sbjct: 847  ---------------GWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQG 891

Query: 633  VSVSAVWHSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNP 454
             S +  W   R   P   W  ++W    +PK     W A LN + T+  ++ + + V + 
Sbjct: 892  FSAAKTWEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGL-VSSA 950

Query: 453  PCLLCNAATESANHLFMQCPYAQSV-----LQASPMTYHFLVSWNPYLQW--QFHVGQVT 295
             C LC+  TE+ +HL + C ++  V     L+  P     L +W   L W  Q      +
Sbjct: 951  ECCLCSFDTETRDHLLLLCDFSSQVWRMVFLRLCP-RQRLLCTWAELLSWTRQSTAAAPS 1009

Query: 294  TMRKQISYLY---------LAVVLQRMIHCNQVTRLIRLHL------MRHYASWQSEL 166
             +RK ++ L          L +     + C+ V RL+   L       RH   W+  L
Sbjct: 1010 LLRKVVAQLVVYNLWRQRNLVLHSSLRVSCSVVFRLVDRELRNVILSRRHKRRWRELL 1067


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  129 bits (325), Expect = 2e-27
 Identities = 104/381 (27%), Positives = 164/381 (43%), Gaps = 8/381 (2%)
 Frame = -1

Query: 1488 KTVSSV*FNGLSEGFSTYKILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARI 1309
            +T+SS  F   + G    + LG+     +++ AD +PL+  +  +++ WT   L  A R+
Sbjct: 1052 QTLSSFPF---ANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRL 1108

Query: 1308 QLLKSTIFGI*WYWTAYLFLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSV 1129
             LL S I  I  +W +   LP   I +++ L   FLW G +      KI+W++ C PK  
Sbjct: 1109 ALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKE 1168

Query: 1128 GGLGPRDLD*WNQASVCYQLW*TLQPQFSSVWLN*VCTIFLRNRIILD-HEIPARTPWCF 952
            GGLG + L   N+ S    +W  L  Q  S+W+  + T  +R       +E  +   W +
Sbjct: 1169 GGLGIKSLAEANKVSCLKLIWRLLSTQ-PSLWVTWIWTFIIRKGTFWSANERSSLGSWMW 1227

Query: 951  KNFLSARSIA--LPKIYV*CG--VKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTN 784
            K  L  R +A  + K+ V  G    F     +     L     + + D    + I   TN
Sbjct: 1228 KKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVID----LGIPLETN 1283

Query: 783  LEEI*NCHLNDQWAVTSSNHDIAFAVRHSISTTHISGHDEILWDGLQ---LKHVSVSAVW 613
            LE +   H + Q      N  I   ++        +G D  LW  L+    K       W
Sbjct: 1284 LETVLRTHQHRQHRAAIYNR-INAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVTW 1342

Query: 612  HSFRQSAPQPPWLGAIWHPWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNA 433
            ++ R   PQ  W   +W P+  PK +  LW  + N + T D +  ++       C LCN 
Sbjct: 1343 NNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSG-QLVTCTLCNN 1401

Query: 432  ATESANHLFMQCPYAQSVLQA 370
            A E+ +HLF  C Y   V +A
Sbjct: 1402 AEETRDHLFFSCQYTSYVWEA 1422


>gb|ABV21212.1| Ty1_Copia-element protein [Arabidopsis thaliana]
          Length = 438

 Score =  124 bits (310), Expect(2) = 4e-27
 Identities = 101/403 (25%), Positives = 170/403 (42%), Gaps = 16/403 (3%)
 Frame = -1

Query: 1434 KILGIAFNFWKLSVADCTPLVLKITERMNCWTCIFLHLAARIQLLKSTIFGI*WYWTAYL 1255
            + LG+     KL +++  PLV+KI  ++N W    L  A R+QLL S I GI  +W +  
Sbjct: 34   RYLGLPLMSRKLKISEFEPLVVKIKAKLNFWAVKSLSFAGRLQLLSSVISGIVVFWMSTF 93

Query: 1254 FLP*KVITKVQSLMLMFLWGGSLTTRAMPKISWNTCCLPKSVGGLGPRDLD*WNQASVCY 1075
             LP   I +++S+   FLW G        K+SW+T CLPK+ GGLG R    WN A    
Sbjct: 94   RLPKGCIREIESMCARFLWSGGTDEHHKAKVSWSTVCLPKAEGGLGVRKFTEWNTALNLK 153

Query: 1074 QLW*TLQPQFSSVWLN*VCTIFLRNRIILDHEIPART--PWCFKNFLSARSIALPKIYV* 901
             +W  L     S+W+       L   +     I   T   W ++  L  R +A   ++  
Sbjct: 154  LIW-LLFSNSGSLWVAWHLFHNLSTSVSNFWLIKEGTTDSWNWRCLLRLRPLASKFLFCS 212

Query: 900  CGVKFKVPRVARPVARLHTAEVKNMADGSQFISIVESTNLEEI*NCHLNDQWAVTSSNHD 721
             G        A              +DG +   I   + + ++ N    ++W + S    
Sbjct: 213  IGNGLTASFWADSWTPFGPLLTFIGSDGPRNQRIPLCSKVADVVN---GNRWLLPSPRSS 269

Query: 720  IAFAVRHSISTTHIS----GHDEILWDGLQLKHVSVSA--VWHSFRQSAPQPPWLGAIWH 559
             A  +   ++T  I       D  LW       +  S+   W++ R    + PW+ ++W 
Sbjct: 270  NALNLHAFLTTLSIPLQPLVEDSYLWKVENCSDIGFSSAHTWNALRHKEVEKPWVSSVWF 329

Query: 558  PWHIPKRTITLWHALLNNVLTKD*MLQFSMRVDNPPCLLCNAATESANHLFMQCPYAQSV 379
                PK    +W    + + TK  M+ +   V +P C LC    E+ +HL + C ++ SV
Sbjct: 330  KGVTPKNAFNMWITHQDRLRTKLRMIAWGFLV-SPVCALCQVGFETRDHLMLSCDFSVSV 388

Query: 378  LQ------ASPMTYHFLVSWNPYLQWQFHVGQV--TTMRKQIS 274
                     +P+T     +W+  + W  +  +   +T+RK ++
Sbjct: 389  WALVRQRIGTPLT--IFQNWSELILWTQNRSKAAPSTLRKLVA 429



 Score = 25.8 bits (55), Expect(2) = 4e-27
 Identities = 10/13 (76%), Positives = 13/13 (100%)
 Frame = -2

Query: 1445 SLPIKYLGLPLIS 1407
            +LPI+YLGLPL+S
Sbjct: 30   NLPIRYLGLPLMS 42


Top