BLASTX nr result

ID: Lithospermum23_contig00039581

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00039581
         (564 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

EOY34747.1 Uncharacterized protein TCM_042327 [Theobroma cacao]        99   7e-21
EOY06960.1 Uncharacterized protein TCM_021522 [Theobroma cacao]        99   1e-20
EOX96783.1 Uncharacterized protein TCM_005954 [Theobroma cacao]        98   2e-20
EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]        98   3e-20
EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]        98   3e-20
EOY06959.1 Uncharacterized protein TCM_021521 [Theobroma cacao]        97   5e-20
EOY02234.1 Uncharacterized protein TCM_011921 [Theobroma cacao]        97   6e-20
EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]        97   7e-20
EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]        96   9e-20
EOY25451.1 Uncharacterized protein TCM_016759 [Theobroma cacao]        96   2e-19
XP_017972650.1 PREDICTED: uncharacterized protein LOC18606969 [T...    95   2e-19
EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]        95   2e-19
EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]        91   7e-18
EOY34748.1 Uncharacterized protein TCM_042328 [Theobroma cacao]        91   9e-18
EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma...    90   1e-17
EOY25447.1 Uncharacterized protein TCM_016753 [Theobroma cacao]        90   2e-17
EOY06956.1 Uncharacterized protein TCM_021518 [Theobroma cacao]        90   2e-17
XP_019177745.1 PREDICTED: uncharacterized protein LOC109172951 [...    89   2e-17
KZV46870.1 hypothetical protein F511_08631 [Dorcoceras hygrometr...    87   8e-17
KZV43060.1 hypothetical protein F511_04452 [Dorcoceras hygrometr...    86   8e-17

>EOY34747.1 Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 99.4 bits (246), Expect = 7e-21
 Identities = 64/193 (33%), Positives = 92/193 (47%), Gaps = 6/193 (3%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A  F    +  Q+V   + AW  + +   KGHIR +IP+ I W LW  RN +KH  
Sbjct: 735  WNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDAKHRH 794

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKP 209
                  +V+ ++M     +    L+    WKGD  +   +   + +    +  ++ W KP
Sbjct: 795  LGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKP 854

Query: 208  SIGVLKLNVDGSYGLSHSAA-GGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN---WC 41
              G  KLNVDGS   + SAA GG+LRD  G L+  FS    PS+ L+AE+ A L     C
Sbjct: 855  VTGEYKLNVDGSSRHNQSAATGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLC 914

Query: 40   VANNFTLLQVETD 2
               N   L +E D
Sbjct: 915  KDRNIEKLWIEMD 927


>EOY06960.1 Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 98.6 bits (244), Expect = 1e-20
 Identities = 65/193 (33%), Positives = 92/193 (47%), Gaps = 6/193 (3%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLA--VNTKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A  F    +  +++   + AW  +      GHIR +IP+ I W LW  RN +KH  
Sbjct: 1429 WNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRH 1488

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKP 209
                 ++V+ R+M     +    L+    WKGD  +   +  +      ++  +++W KP
Sbjct: 1489 MGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKP 1548

Query: 208  SIGVLKLNVDGSYGLSHSAA-GGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN---WC 41
             IG  KLNVDGS   S +AA GGVLRD  G L  AFS    P   L+AE+ A L     C
Sbjct: 1549 FIGEYKLNVDGSSKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRGLLLC 1608

Query: 40   VANNFTLLQVETD 2
               N T L +E D
Sbjct: 1609 KERNITNLWIEMD 1621



 Score = 95.5 bits (236), Expect = 2e-19
 Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 7/194 (3%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLA--VNTKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A +F         +   + AW  +   +  GHIR ++P+ I+W LW  RN +KH  
Sbjct: 3223 WSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRN 3282

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKP 209
                 ++++ +++     + +   +    W+GD  + + +   +    P    ++ W KP
Sbjct: 3283 LGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKP 3342

Query: 208  SIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDA---ALNW 44
            SIG  KLNVDGS  Y L  +A GG+LRD  G +I  FS        L+AE+ A    L  
Sbjct: 3343 SIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLLL 3402

Query: 43   CVANNFTLLQVETD 2
            C+ +N T L +E D
Sbjct: 3403 CIDHNVTRLWIEMD 3416


>EOX96783.1 Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 98.2 bits (243), Expect = 2e-20
 Identities = 69/196 (35%), Positives = 94/196 (47%), Gaps = 9/196 (4%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A +F       ++V   + AW ++ +   KGH R ++P+ I W LW  RN +KH  
Sbjct: 853  WNFFAKLFQIYILNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRH 912

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALS--VVTWA 215
            T     +V+ R M     +    L+    WKGD  +          PP +  S  ++ W 
Sbjct: 913  TGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAA--MLGFSFPPQQHASPQIIYWK 970

Query: 214  KPSIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN-- 47
            KPSIG  KLNVDGS   GL H+A GGVLRD  G LI  FS    P + L+AE+ A L   
Sbjct: 971  KPSIGEYKLNVDGSSRNGL-HAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRALLRGL 1029

Query: 46   -WCVANNFTLLQVETD 2
              C   +   L +E D
Sbjct: 1030 LLCKERHIEKLWIEMD 1045


>EOY02239.1 Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 97.8 bits (242), Expect = 3e-20
 Identities = 70/197 (35%), Positives = 95/197 (48%), Gaps = 10/197 (5%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A +F       ++V   + AW ++ +   KGH R ++P+ I W LW  RN +KH  
Sbjct: 1849 WNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDAKHRH 1908

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGD---VTMVEHFQAQILVPPPRALSVVTW 218
            T     +V+ R M     +    L+    WKGD    TM+          PP+   ++ W
Sbjct: 1909 TGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQ---IIYW 1965

Query: 217  AKPSIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN- 47
             KPSIG  KLNVDGS   GL H+A GGVLRD  G LI  FS    P + L+AE+ A L  
Sbjct: 1966 KKPSIGEYKLNVDGSSRNGL-HAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRALLRG 2024

Query: 46   --WCVANNFTLLQVETD 2
               C   +   L +E D
Sbjct: 2025 LLLCKERHIEKLWIEMD 2041


>EOY14356.1 Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 97.8 bits (242), Expect = 3e-20
 Identities = 61/163 (37%), Positives = 86/163 (52%), Gaps = 8/163 (4%)
 Frame = -3

Query: 466  GHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDV 287
            GHIR ++P+ I+W LW  RN +KH       ++V+ RV+     +     +    WKGD 
Sbjct: 2006 GHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDK 2065

Query: 286  TMVEH----FQAQILVPPPRALSVVTWAKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNG 122
             + +     FQA+ L PP     V +W KPS+G  KLNVDGS   SH+AA GG+LRD  G
Sbjct: 2066 QIAQEWGIIFQAESLAPP----KVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAG 2121

Query: 121  DLIMAFSLTTHPSSPLEAEVDA---ALNWCVANNFTLLQVETD 2
            +++  FS      + L+AE+ A    L  C   N   L +E D
Sbjct: 2122 EMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMD 2164


>EOY06959.1 Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 97.1 bits (240), Expect = 5e-20
 Identities = 66/195 (33%), Positives = 91/195 (46%), Gaps = 8/195 (4%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLA--VNTKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A  F    +   ++   + AW  +      GHIR +IP+ I W LW  RN +KH  
Sbjct: 1672 WFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRH 1731

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALS--VVTWA 215
                 ++V+ R+M     +    L+    WKGD  +   +  +   PP    S  ++ W 
Sbjct: 1732 MGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKF--PPKYCTSPQIIYWI 1789

Query: 214  KPSIGVLKLNVDGSYGLS-HSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN--- 47
            KP IG  KLNVDGS   + ++A GGVLRD  G L  AFS    P   L+AE+ A L    
Sbjct: 1790 KPFIGEYKLNVDGSSKSNLNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHALLRGLL 1849

Query: 46   WCVANNFTLLQVETD 2
             C   N T L +E D
Sbjct: 1850 LCKERNITNLWIEMD 1864


>EOY02234.1 Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 96.7 bits (239), Expect = 6e-20
 Identities = 67/198 (33%), Positives = 96/198 (48%), Gaps = 11/198 (5%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A+ F       Q+V   + AW  + +   +GHIR ++P+ I W LW  RN +KH  
Sbjct: 648  WAFFANFFQIYIFNPQHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKHRY 707

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVE----HFQAQILVPPPRALSVVT 221
            +     +V+ R+M     +    L+    WKGD  +      + Q ++  PP     +V 
Sbjct: 708  SGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPP----QIVY 763

Query: 220  WAKPSIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN 47
            W KPS G  KLNVDGS  +G  H+A+GGVLRD  G LI  FS      + L+AE+ A L 
Sbjct: 764  WRKPSTGEYKLNVDGSSRHG-QHAASGGVLRDHTGKLIFGFSENIGNCNSLQAELRALLR 822

Query: 46   ---WCVANNFTLLQVETD 2
                C   +   L +E D
Sbjct: 823  GLLLCKERHIEQLWIEMD 840


>EOY02236.1 Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 96.7 bits (239), Expect = 7e-20
 Identities = 70/196 (35%), Positives = 94/196 (47%), Gaps = 9/196 (4%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A+ F    +  QNV   +  W L+ +   KGHIR +IP+ I W LW  RN +KH  
Sbjct: 1675 WNFFANSFQIYISKPQNVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDAKHRH 1734

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPR---ALSVVTW 218
                  +V+ ++M     +    L+    WKGD      +    L  PP+   A  ++ W
Sbjct: 1735 LGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWG---LFSPPKTRAAPQILHW 1791

Query: 217  AKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN-- 47
             KP  G  KLNVDGS   + +AA GGVLRD  G L+  FS    PS+ L+AE+ A L   
Sbjct: 1792 VKPVPGEHKLNVDGSSRQNQTAAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRALLRGL 1851

Query: 46   -WCVANNFTLLQVETD 2
              C   N   L VE D
Sbjct: 1852 LLCKERNIEKLWVEMD 1867


>EOY02238.1 Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 96.3 bits (238), Expect = 9e-20
 Identities = 65/198 (32%), Positives = 96/198 (48%), Gaps = 11/198 (5%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A  F       ++V   + AW  + +   +GHIR ++P+ I W LW  RN +K+  
Sbjct: 1936 WAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRH 1995

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVE----HFQAQILVPPPRALSVVT 221
            +     +++ R+M     +    L+    WKGD  +      +FQ ++  PP     +V 
Sbjct: 1996 SGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPP----QIVY 2051

Query: 220  WAKPSIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN 47
            W KPS G  KLNVDGS  +G  H+A+GGVLRD  G LI  FS      + L+AE+ A L 
Sbjct: 2052 WRKPSTGEYKLNVDGSSRHG-QHAASGGVLRDHTGKLIFGFSENIGTCNSLQAELRALLR 2110

Query: 46   ---WCVANNFTLLQVETD 2
                C   +   L +E D
Sbjct: 2111 GLLLCKERHIEKLWIEMD 2128


>EOY25451.1 Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 95.5 bits (236), Expect = 2e-19
 Identities = 65/197 (32%), Positives = 91/197 (46%), Gaps = 10/197 (5%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W      F       Q+V   + AW  + +   KGHIR ++P+ I W LW  RN +KH  
Sbjct: 600  WAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDAKHRH 659

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMV----EHFQAQILVPPPRALSVVT 221
            T     +V+ R+M     +    L+    WKGD  +       FQ++   PP     ++ 
Sbjct: 660  TRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPP----QIIY 715

Query: 220  WAKPSIGVLKLNVDGSYGLSH-SAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN- 47
            W KP  G  KLNVDGS    H +A+GG+LRD  G LI  FS      + L+AE+ A L  
Sbjct: 716  WRKPFTGEYKLNVDGSSRNGHLAASGGILRDHTGKLIFGFSENIGLCNSLQAELRALLRG 775

Query: 46   --WCVANNFTLLQVETD 2
               C   +   L +E D
Sbjct: 776  LLLCKERHIENLWIEMD 792


>XP_017972650.1 PREDICTED: uncharacterized protein LOC18606969 [Theobroma cacao]
          Length = 431

 Score = 94.7 bits (234), Expect = 2e-19
 Identities = 66/196 (33%), Positives = 90/196 (45%), Gaps = 9/196 (4%)
 Frame = -3

Query: 562 WILVASMFGFTHAPYQNVQGAVIAWTLA--VNTKGHIRQIIPMVIIWALWEARNQSKHDG 389
           W   A  F    +   ++   + AW  +      GHIR +IP+ I W LW  RN +KH  
Sbjct: 152 WFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDAKHRH 211

Query: 388 TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPR---ALSVVTW 218
                 +V+ R+M     +    L+    WKGD  +   +  +    PP+   +  ++ W
Sbjct: 212 MGMYPDRVIWRIMKLLNQLYAGSLLKRWQWKGDTDIATMWGFKF---PPKYYTSPQIIYW 268

Query: 217 AKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN-- 47
            KPSIG  KLNV GS   + +AA GGVLRD  G L   FS    P S L AE+ A L   
Sbjct: 269 IKPSIGEYKLNVYGSSESNQNAAGGGVLRDHTGRLAFVFSENLGPRSSLHAELHALLRGL 328

Query: 46  -WCVANNFTLLQVETD 2
             C   N T L +E D
Sbjct: 329 LLCKERNITNLWIEMD 344


>EOY17513.1 Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 95.1 bits (235), Expect = 2e-19
 Identities = 57/194 (29%), Positives = 93/194 (47%), Gaps = 7/194 (3%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLA--VNTKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A +F         +   + AW  +   +  GHIR ++P+  +W LW  RN +KH  
Sbjct: 1935 WSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRN 1994

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKP 209
                 ++V+ +++     + +   +    W+GD  + + +   +    P    ++ W KP
Sbjct: 1995 LGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKP 2054

Query: 208  SIGVLKLNVDGS--YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDA---ALNW 44
            SIG LKLNVDGS  +    +A GG+LRD  G +I  FS    P   L+AE+ A    L  
Sbjct: 2055 SIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLL 2114

Query: 43   CVANNFTLLQVETD 2
            C+ +N + L +E D
Sbjct: 2115 CIEHNISRLWIEMD 2128


>EOY17514.1 Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 90.9 bits (224), Expect = 7e-18
 Identities = 60/163 (36%), Positives = 83/163 (50%), Gaps = 8/163 (4%)
 Frame = -3

Query: 466  GHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDV 287
            GHIR ++P+  +W LW  RN +KH       ++++ R++     +     +    WKGD 
Sbjct: 2004 GHIRTLVPIFTLWFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDK 2063

Query: 286  TMVEH----FQAQILVPPPRALSVVTWAKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNG 122
             + +     FQA+ L PP     V  W KPSIG  KLNVDGS  LS +AA GGVLRD  G
Sbjct: 2064 QIAQEWGITFQAESLPPP----KVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRDHAG 2119

Query: 121  DLIMAFSLTTHPSSPLEAEVDA---ALNWCVANNFTLLQVETD 2
             ++  FS      + L+AE+ A    L  C   N   L +E D
Sbjct: 2120 VMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMD 2162


>EOY34748.1 Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 90.5 bits (223), Expect = 9e-18
 Identities = 59/163 (36%), Positives = 84/163 (51%), Gaps = 8/163 (4%)
 Frame = -3

Query: 466  GHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDV 287
            GHIR ++P+ I+W LW  RN +KH       ++V+ RV+     +     +    WKGD 
Sbjct: 665  GHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDK 724

Query: 286  TMVEHF----QAQILVPPPRALSVVTWAKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNG 122
             + + +    QA+ L PP     V +W KP+ G  KLNVDGS   SH+AA GG+LRD  G
Sbjct: 725  QIAQEWGIILQAESLAPP----KVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGILRDHAG 780

Query: 121  DLIMAFSLTTHPSSPLEAEVDA---ALNWCVANNFTLLQVETD 2
             ++  FS      + L+AE+ A    L  C   N   L +E D
Sbjct: 781  VMVFGFSENLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMD 823


>EOY19200.1 Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 90.1 bits (222), Expect = 1e-17
 Identities = 61/185 (32%), Positives = 93/185 (50%), Gaps = 7/185 (3%)
 Frame = -3

Query: 535  FTHAPYQNVQGAVIAWTLAVN-TK-GHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVM 362
            + H P QN+   + +W  + + TK GHIR +I + I W +W  RN +KH        +++
Sbjct: 1066 YVHNP-QNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRII 1124

Query: 361  SRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKPSIGVLKLNV 182
             R+M     + +  L+    WKGD+ +  H+             ++ W KP IG LKLNV
Sbjct: 1125 WRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNV 1184

Query: 181  DGSY--GLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDA---ALNWCVANNFTLL 17
            DGS      ++A GGVLRD  G+LI  FS      + L+AE+ A    L  C+  N + +
Sbjct: 1185 DGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLCLCMEYNVSRV 1244

Query: 16   QVETD 2
             +E D
Sbjct: 1245 WIEVD 1249


>EOY25447.1 Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score = 89.7 bits (221), Expect = 2e-17
 Identities = 59/173 (34%), Positives = 83/173 (47%), Gaps = 8/173 (4%)
 Frame = -3

Query: 496  IAWTLAVNTKGHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVMSRVMVTTTYICKSCL 317
            + W  +V  +G IR ++P+ I W LW  RN +KH  +     +V+ R+M     +    L
Sbjct: 876  VLWGNSVAKQGRIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSL 935

Query: 316  ISYKHWKGDVTMVE----HFQAQILVPPPRALSVVTWAKPSIGVLKLNVDG-SYGLSHSA 152
            +    WKGD  +      +FQ +   PP     +V W KP  G  KLNVDG S    H+A
Sbjct: 936  LQQWQWKGDTDIAAMWRYNFQLKQRAPP----QIVYWRKPFTGEYKLNVDGSSRNGQHAA 991

Query: 151  AGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDA---ALNWCVANNFTLLQVETD 2
            +GGVLRD    LI  FS      + L+AE+ A    L  C   +   L +E D
Sbjct: 992  SGGVLRDHTSKLIFCFSENIGTYNSLQAELRALHRGLLLCKERHIEKLWIEMD 1044


>EOY06956.1 Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 89.7 bits (221), Expect = 2e-17
 Identities = 65/197 (32%), Positives = 92/197 (46%), Gaps = 10/197 (5%)
 Frame = -3

Query: 562  WILVASMFGFTHAPYQNVQGAVIAWTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDG 389
            W   A+ F    +  QNV   + AW  + +   KGHIR +IP+ I W LW  RN +K   
Sbjct: 1255 WNFFANFFQIYVSNPQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRH 1314

Query: 388  TTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVE----HFQAQILVPPPRALSVVT 221
                  +V+ ++M     +    ++    WKGD+ +      +F  +I   P     +  
Sbjct: 1315 LGMYSDRVVWKIMKLLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATP----QIFH 1370

Query: 220  WAKPSIGVLKLNVDGSYGLSHSAA-GGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALN- 47
            W K   G  KLNVDGS   + SAA GG+LRD  G L+  FS    PS+ L+AE+ A L  
Sbjct: 1371 WVKLVSGEHKLNVDGSSRQNQSAAIGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRG 1430

Query: 46   --WCVANNFTLLQVETD 2
               C   N   L +E D
Sbjct: 1431 LLLCKERNIEKLWIEMD 1447


>XP_019177745.1 PREDICTED: uncharacterized protein LOC109172951 [Ipomoea nil]
          Length = 418

 Score = 89.0 bits (219), Expect = 2e-17
 Identities = 59/193 (30%), Positives = 86/193 (44%), Gaps = 6/193 (3%)
 Frame = -3

Query: 562 WILVASMFGFTHAPYQNVQGAVIAWTLAVNTKGHIRQII---PMVIIWALWEARNQSKHD 392
           WI     FG + +   +V+    +W L  ++   +R I+   P +I+W +W A N+  HD
Sbjct: 132 WIHFVGFFGLSLSSSASVRATCHSWWLLPSSTSAVRCIVCLLPCLILWFIWIAYNECLHD 191

Query: 391 GTTYTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAK 212
           G+T++   ++ R+   +  I  +  I          +         VPP      V W  
Sbjct: 192 GSTFSPSGLIKRISRESRLIFLATSIRGNGSSDSFLLAARLIVGFDVPPRMQSIWVKWIV 251

Query: 211 PSIGVLKLNVDGSYGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDA---ALNWC 41
           P  G LKLN D S+ L+ +A G  LRDS G L++        SS LEAE  A   AL WC
Sbjct: 252 PPSGRLKLNTDASFSLAGAAGGACLRDSRGGLVVGLCFNLSASSALEAEACALRLALQWC 311

Query: 40  VANNFTLLQVETD 2
            A       VE D
Sbjct: 312 EAMVLLPALVEVD 324


>KZV46870.1 hypothetical protein F511_08631 [Dorcoceras hygrometricum]
          Length = 459

 Score = 87.4 bits (215), Expect = 8e-17
 Identities = 57/191 (29%), Positives = 87/191 (45%), Gaps = 4/191 (2%)
 Frame = -3

Query: 562 WILVASMFGFTHAPYQNVQGAVIAWTLAVNTKGHIRQIIPMVIIWALWEARNQSKHDGTT 383
           W    S+FG       +     I W      +GH+R+ IP +I+W LW  RN SKH    
Sbjct: 179 WTHFGSLFGVAQPKQASNWKININW----RARGHLRECIPFLILWFLWIGRNDSKHRLIC 234

Query: 382 YTFHKVMSRVMVTTTYICKSCLISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKPSI 203
                ++ R+         S L+  +HW+G   + ++F  Q+       +S + W KP  
Sbjct: 235 LRPAVIIRRIRYYIFTAASSGLLKAEHWQGVHALAQNFLVQVRGIRRTTVSTIYWIKPPT 294

Query: 202 GVLKLNVDGS-YGLSHSAAGGVLRDSNGDLIMAFSLTTHPSSPLEAEVDAALNW---CVA 35
              KLN DGS      ++ GG++RDS G +++AF       S L+AE+ A L     C+ 
Sbjct: 295 TWFKLNTDGSRSNQGMTSTGGLVRDSQGQVLVAFHGFLDAGSILKAELTAILQGLLICLH 354

Query: 34  NNFTLLQVETD 2
                + VETD
Sbjct: 355 QQLFPIWVETD 365


>KZV43060.1 hypothetical protein F511_04452 [Dorcoceras hygrometricum]
          Length = 325

 Score = 86.3 bits (212), Expect = 8e-17
 Identities = 53/169 (31%), Positives = 83/169 (49%), Gaps = 6/169 (3%)
 Frame = -3

Query: 490 WTLAVN--TKGHIRQIIPMVIIWALWEARNQSKHDGTTYTFHKVMSRVMVTTTYICKSCL 317
           W + +N   +GH+R+ IP +I+W LW  RN SKH         ++ R+         S L
Sbjct: 71  WKININWRARGHLRECIPFLILWFLWIGRNDSKHRLICLRPAVIIRRIRYYIFTAASSGL 130

Query: 316 ISYKHWKGDVTMVEHFQAQILVPPPRALSVVTWAKPSIGVLKLNVDGS-YGLSHSAAGGV 140
           +  +HW+G   + ++F  Q+       +S + W KP     KLN DGS      ++ GG+
Sbjct: 131 LKAEHWQGVHALAQNFLVQVRGIRRTTVSTIYWIKPPDTWFKLNTDGSRSNQGMTSTGGL 190

Query: 139 LRDSNGDLIMAFSLTTHPSSPLEAEVDAALNW---CVANNFTLLQVETD 2
           +RDS G +++AF       S L+AE+ A L     C+      + VETD
Sbjct: 191 VRDSQGQVLVAFHGFLDAGSILKAELTAILQGLLICLHQQLFPIWVETD 239

