BLASTX nr result

ID: Forsythia22_contig00037132 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00037132
         (512 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    45   6e-10
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    45   6e-07
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    43   1e-06
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    42   2e-06
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    40   8e-06

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 45.1 bits (105), Expect(2) = 6e-10
 Identities = 17/36 (47%), Positives = 24/36 (66%)
 Frame = +2

Query: 8    PDSVRVCFQAWHFSGNCYTPGHISTIILLLIQWFVW 115
            P ++     +W++SG+   PGHI T+ILL I WFVW
Sbjct: 1070 PQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVW 1105



 Score = 45.1 bits (105), Expect(2) = 6e-10
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 4/61 (6%)
 Frame = +3

Query: 198  WKAIWGVAKGFGFHWQTTKKRLPIIVAWIKPLLAFTKLNSDGYPVE----CSDGGILGDH 365
            WK    +A  +GF++   ++  P I+ WIKPL+   KLN DG   +     + GG+L DH
Sbjct: 1145 WKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDH 1204

Query: 366  T 368
            T
Sbjct: 1205 T 1205


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 44.7 bits (104), Expect(2) = 6e-07
 Identities = 36/127 (28%), Positives = 58/127 (45%), Gaps = 7/127 (5%)
 Frame = +3

Query: 147  LLKNLVHLRA-ELLYFKSWKAIWGVAKGFGFHWQTTKKRLPIIVAWIKPLLAFTKLNSDG 323
            ++K L  L A  LL    WK    +A  +GF +     + P I++WIKP +   KLN DG
Sbjct: 1500 IMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDG 1559

Query: 324  ---YPVECSDGGILGDHTIIESHLLTIINMVMAPINWMDYKPSTIVQGHF---NISIHNT 485
                    + GG+L DHT     L    +  + P+  +  +   +++G       +I N 
Sbjct: 1560 SSKSSQNAAGGGVLRDHT---GKLAFAFSENLGPLPSLQAELHALLRGLLLCKERNITNL 1616

Query: 486  WEELDTI 506
            W E+D +
Sbjct: 1617 WIEMDAL 1623



 Score = 35.0 bits (79), Expect(2) = 6e-07
 Identities = 15/36 (41%), Positives = 19/36 (52%)
 Frame = +2

Query: 8    PDSVRVCFQAWHFSGNCYTPGHISTIILLLIQWFVW 115
            P  +     AW FSG+    GHI  +I L I WF+W
Sbjct: 1443 PKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLW 1478


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 42.7 bits (99), Expect(2) = 1e-06
 Identities = 36/127 (28%), Positives = 57/127 (44%), Gaps = 7/127 (5%)
 Frame = +3

Query: 147  LLKNLVHLRA-ELLYFKSWKAIWGVAKGFGFHWQTTKKRLPIIVAWIKPLLAFTKLNSDG 323
            ++K L  L A  LL    WK    +A  +GF +       P I+ WIKP +   KLN DG
Sbjct: 1743 IMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDG 1802

Query: 324  YP---VECSDGGILGDHTIIESHLLTIINMVMAPINWMDYKPSTIVQGHF---NISIHNT 485
                 +  + GG+L DHT     L    +  + P+  +  +   +++G       +I N 
Sbjct: 1803 SSKSNLNAAGGGVLRDHT---GKLAFAFSENLGPLPSLQAELHALLRGLLLCKERNITNL 1859

Query: 486  WEELDTI 506
            W E+D +
Sbjct: 1860 WIEMDAL 1866



 Score = 35.8 bits (81), Expect(2) = 1e-06
 Identities = 15/36 (41%), Positives = 20/36 (55%)
 Frame = +2

Query: 8    PDSVRVCFQAWHFSGNCYTPGHISTIILLLIQWFVW 115
            P+ +     AW FSG+    GHI  +I L I WF+W
Sbjct: 1686 PNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLW 1721


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 42.4 bits (98), Expect(2) = 2e-06
 Identities = 36/130 (27%), Positives = 56/130 (43%), Gaps = 8/130 (6%)
 Frame = +3

Query: 144  WLLKNLVH--LRAELLYFKSWKAIWGVAKGFGFHWQTTKKRLPIIVAWIKPLLAFTKLNS 317
            W +  L+   L   LL+   WK    +A  +G  +Q+  +  P I+ W KP     KLN 
Sbjct: 669  WRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYKLNV 728

Query: 318  DGYPVE---CSDGGILGDHTIIESHLLTIINMVMAPINWMDYKPSTIVQGHF---NISIH 479
            DG        + GGIL DHT     L+   +  +   N +  +   +++G        I 
Sbjct: 729  DGSSRNGHLAASGGILRDHT---GKLIFGFSENIGLCNSLQAELRALLRGLLLCKERHIE 785

Query: 480  NTWEELDTIA 509
            N W E+D +A
Sbjct: 786  NLWIEMDALA 795



 Score = 35.8 bits (81), Expect(2) = 2e-06
 Identities = 14/36 (38%), Positives = 20/36 (55%)
 Frame = +2

Query: 8   PDSVRVCFQAWHFSGNCYTPGHISTIILLLIQWFVW 115
           P  V     AW FSG+    GHI +++ + I WF+W
Sbjct: 614 PQHVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLW 649


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 40.0 bits (92), Expect(2) = 8e-06
 Identities = 15/36 (41%), Positives = 22/36 (61%)
 Frame = +2

Query: 8    PDSVRVCFQAWHFSGNCYTPGHISTIILLLIQWFVW 115
            P ++     AW +SG+   PGHI T++ L I WF+W
Sbjct: 1986 PCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLW 2021



 Score = 35.8 bits (81), Expect(2) = 8e-06
 Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
 Frame = +3

Query: 198  WKAIWGVAKGFGFHWQTTKKRLPIIVAWIKPLLAFTKLNSDGYPVE---CSDGGILGDH 365
            WK    +A+ +G  +Q      P + +W KP L   KLN DG   +    + GGIL DH
Sbjct: 2061 WKGDKQIAQEWGIIFQAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDH 2119

