BLASTX nr result

ID: Angelica23_contig00032125

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00032125
         (1025 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   163   2e-41
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   165   4e-41
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           165   5e-41
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       161   3e-37
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   160   5e-37

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
           predicted protein [Populus trichocarpa]
          Length = 517

 Score =  163 bits (412), Expect(2) = 2e-41
 Identities = 79/216 (36%), Positives = 118/216 (54%)
 Frame = +3

Query: 15  FFANVPEDARQFTLHSLGYQLGSLPIKYLGLPLVSTKLKLSDCYPLLLRFCKLMDHWANK 194
           F + V    R+  +H LG++ G LP+KYLG+PL+S++LK   C  L+ R    + HW  +
Sbjct: 31  FLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSRLKAIYCKGLVDRITSKVRHWTCR 90

Query: 195 ALIQAGRLQLLKVVLFGVQSYWAAHLFLQKGFLKRLQSLCTKFLWGGSSSSTKLVKVSWS 374
            L  AGR+QL+  VLF +Q YWA+   L    +K ++ +   FLW GS   T   KV+W 
Sbjct: 91  TLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWD 150

Query: 375 ECCKLKSEGGLGIRDLVEWNRASIFYQLLRITQPASQSIWISWIQKTLLKSCSIWTLKKP 554
           + C  K EGGLGI+ + EWN+ ++   +  +   +  SIW +WI+  LL+  + WT+K P
Sbjct: 151 QVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTP 210

Query: 555 SFATWGLKKILNARPEAIQFINYSVGRRSMFLFWHD 662
              +W   KIL  R  A   + Y +G       W D
Sbjct: 211 QNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFD 246



 Score = 33.5 bits (75), Expect(2) = 2e-41
 Identities = 21/80 (26%), Positives = 32/80 (40%), Gaps = 3/80 (3%)
 Frame = +1

Query: 778  NFAWRLPLSNHAD---VIDLWDRLSNVQIRHLDEIKWQDIPASKAKISDIWHSLRTVNTA 948
            N  W+ P +       +I+     SN ++   DE+ W D P  +  +   W  LR     
Sbjct: 280  NSEWKTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQLRRHRQM 339

Query: 949  PPWLPAV*FPLAIPKCSVTL 1008
              W   V F  A+P+ S  L
Sbjct: 340  VEWHDIVWFKNAVPRHSFLL 359


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  165 bits (417), Expect(2) = 4e-41
 Identities = 82/208 (39%), Positives = 118/208 (56%)
 Frame = +3

Query: 39   ARQFTLHSLGYQLGSLPIKYLGLPLVSTKLKLSDCYPLLLRFCKLMDHWANKALIQAGRL 218
            + + T  + G+  G+ PI+YLGLPL+  KL+++D  PLL +    +  W +KAL  AGR 
Sbjct: 605  SERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRT 664

Query: 219  QLLKVVLFGVQSYWAAHLFLQKGFLKRLQSLCTKFLWGGSSSSTKLVKVSWSECCKLKSE 398
            QL+  V+FG+ ++W +   L KG +K+++SLC+KFLW GS    K  KVSW +CC  KSE
Sbjct: 665  QLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSE 724

Query: 399  GGLGIRDLVEWNRASIFYQLLRITQPASQSIWISWIQKTLLKSCSIWTLKKPSFATWGLK 578
            GGLG R   EWN+ ++  +L+ +      S+W  W +   L   S W +       W  K
Sbjct: 725  GGLGFRSFGEWNK-TLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWK 783

Query: 579  KILNARPEAIQFINYSVGRRSMFLFWHD 662
             +LN RP A +FI   VG      FW D
Sbjct: 784  MLLNLRPLAEKFIKAKVGNGGTVSFWFD 811



 Score = 30.4 bits (67), Expect(2) = 4e-41
 Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 6/83 (7%)
 Frame = +1

Query: 763  SDYQDNFAWRLPLSNHADVIDLWDRLSNV----QIRHLDEIKW--QDIPASKAKISDIWH 924
            +D  D   WRLPLS       +   L+++     +   D   W   D+       +  W 
Sbjct: 840  ADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWE 899

Query: 925  SLRTVNTAPPWLPAV*FPLAIPK 993
             LR       W  +V F  A+PK
Sbjct: 900  VLRPRRPVKRWAKSVWFKGAVPK 922


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  165 bits (417), Expect(2) = 5e-41
 Identities = 82/208 (39%), Positives = 118/208 (56%)
 Frame = +3

Query: 39   ARQFTLHSLGYQLGSLPIKYLGLPLVSTKLKLSDCYPLLLRFCKLMDHWANKALIQAGRL 218
            + + T  + G+  G+ PI+YLGLPL+  KL+++D  PLL +    +  W +KAL  AGR 
Sbjct: 605  SERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRT 664

Query: 219  QLLKVVLFGVQSYWAAHLFLQKGFLKRLQSLCTKFLWGGSSSSTKLVKVSWSECCKLKSE 398
            QL+  V+FG+ ++W +   L KG +K+++SLC+KFLW GS    K  KVSW +CC  KSE
Sbjct: 665  QLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSE 724

Query: 399  GGLGIRDLVEWNRASIFYQLLRITQPASQSIWISWIQKTLLKSCSIWTLKKPSFATWGLK 578
            GGLG R   EWN+ ++  +L+ +      S+W  W +   L   S W +       W  K
Sbjct: 725  GGLGFRSFGEWNK-TLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWK 783

Query: 579  KILNARPEAIQFINYSVGRRSMFLFWHD 662
             +LN RP A +FI   VG      FW D
Sbjct: 784  MLLNLRPLAEKFIKAKVGNGGTVSFWFD 811



 Score = 30.0 bits (66), Expect(2) = 5e-41
 Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 6/83 (7%)
 Frame = +1

Query: 763  SDYQDNFAWRLPLSNHADVIDLWDRLSNV----QIRHLDEIKW--QDIPASKAKISDIWH 924
            +D  D   WRLPLS       +   L+++     +   D   W   D+       +  W 
Sbjct: 840  ADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKTWE 899

Query: 925  SLRTVNTAPPWLPAV*FPLAIPK 993
             LR       W  +V F  A+PK
Sbjct: 900  VLRPRRPVKRWARSVWFKGAVPK 922


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  161 bits (407), Expect = 3e-37
 Identities = 79/199 (39%), Positives = 121/199 (60%)
 Frame = +3

Query: 66   GYQLGSLPIKYLGLPLVSTKLKLSDCYPLLLRFCKLMDHWANKALIQAGRLQLLKVVLFG 245
            G+ +G+LPI+YLGLPL++ KL++++  PLL +       W NK L  AGR+QL+  V+FG
Sbjct: 754  GFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFG 813

Query: 246  VQSYWAAHLFLQKGFLKRLQSLCTKFLWGGSSSSTKLVKVSWSECCKLKSEGGLGIRDLV 425
              ++W +   L KG +KR++SLC++FLW G+    K +KVSW+  C  KSEGGLG+R L+
Sbjct: 814  SINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLL 873

Query: 426  EWNRASIFYQLLRITQPASQSIWISWIQKTLLKSCSIWTLKKPSFATWGLKKILNARPEA 605
            EWN+ ++  +L+     A  S+W  W     L   S W ++     +W  K++L+ RP A
Sbjct: 874  EWNK-TLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLA 932

Query: 606  IQFINYSVGRRSMFLFWHD 662
             QF+   VG      +W+D
Sbjct: 933  HQFLVCKVGNGLKADYWYD 951


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  160 bits (405), Expect = 5e-37
 Identities = 81/221 (36%), Positives = 127/221 (57%), Gaps = 1/221 (0%)
 Frame = +3

Query: 3    KTLWFFANVPEDARQFTLHSLGYQLGSLPIKYLGLPLVSTKLKLSDCYPLLLRFCKLMDH 182
            K+  F A +  +A+   L    ++LG+LP+KYLGLPL++ ++  SD  PL+ +    +  
Sbjct: 887  KSTIFMAGISPNAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITS 946

Query: 183  WANKALIQAGRLQLLKVVLFGVQSYWAAHLFLQKGFLKRLQSLCTKFLWGGSSSSTKLVK 362
            W N+ L  AGRLQL+K VL  + ++W +   L K  L+ ++ + + FLW G   +TK  K
Sbjct: 947  WTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAK 1006

Query: 363  VSWSECCKLKSEGGLGIRDLVEWNRASIFYQLLRITQPASQSIWISWIQKTLLKSCSIWT 542
            ++WSE CKLK EGGLG++ L E N  S+   + RI   A  S+W+ W+ K L++  + W+
Sbjct: 1007 IAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILS-ARDSLWVKWVNKHLIRKETFWS 1065

Query: 543  LKK-PSFATWGLKKILNARPEAIQFINYSVGRRSMFLFWHD 662
            +K+     +W  +KIL  R +A  F    V   +   FWHD
Sbjct: 1066 VKENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHD 1106

