BLASTX nr result

ID: Forsythia21_contig00039037 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00039037
         (764 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    67   4e-17
ref|XP_007022459.1| RNase H family protein [Theobroma cacao] gi|...    60   2e-14
ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167...    45   3e-09
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    67   9e-09
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...    67   2e-08
ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobrom...    66   2e-08
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    66   3e-08
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    65   4e-08
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    64   8e-08
ref|XP_007010351.1| Polynucleotidyl transferase, putative [Theob...    64   1e-07
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...    63   2e-07
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    62   3e-07
ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    62   4e-07
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    62   5e-07
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    61   8e-07
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    61   8e-07
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...    60   1e-06
ref|XP_008377870.1| PREDICTED: putative ribonuclease H protein A...    60   1e-06
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    60   1e-06
ref|XP_009774155.1| PREDICTED: uncharacterized protein LOC104224...    60   2e-06

>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 64.7 bits (156), Expect(2) = 4e-17
 Identities = 29/86 (33%), Positives = 48/86 (55%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F    F ++ ++P  +   +  W FSG++   GHI   IPLFI WF+ +ERNDA
Sbjct: 1425 AKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDA 1484

Query: 181  RHNDIKVGLFGGFNTYSKLNMLSWIH 258
            +H    +G++     +  + +L+ +H
Sbjct: 1485 KHR--HMGMYPNRVIWRIMKLLNQLH 1508



 Score = 50.8 bits (120), Expect(2) = 4e-17
 Identities = 45/144 (31%), Positives = 62/144 (43%), Gaps = 36/144 (25%)
 Frame = +3

Query: 318  KKPIKVSWQKPRKGWVKLNVNGSSKNNPGIGDSGGVI*DQYGHFICAF------------ 461
            + P  +SW KP  G  KLNV+GSSK++      GGV+ D  G    AF            
Sbjct: 1538 QSPQIISWIKPFIGEYKLNVDGSSKSSQNAA-GGGVLRDHTGKLAFAFSENLGPLPSLQA 1596

Query: 462  -------------QLFYTNIWAELDSLVLVNCVN-----------LGICNPLLMRSNT*N 569
                         +   TN+W E+D+LV V  V            L     L +RS +  
Sbjct: 1597 ELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYR 1656

Query: 570  LESA*N*HIYREGNKVAGWLANEG 641
            +      HIYREGN+ A +L+N+G
Sbjct: 1657 IS-----HIYREGNQAADFLSNKG 1675



 Score = 67.4 bits (163), Expect = 9e-09
 Identities = 24/65 (36%), Positives = 42/65 (64%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A+ VW +   +F +H   P ++   +  W +SG++   GHI + +PLFI+WF+ +ERNDA
Sbjct: 3219 ANQVWSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDA 3278

Query: 181  RHNDI 195
            +H ++
Sbjct: 3279 KHRNL 3283


>ref|XP_007022459.1| RNase H family protein [Theobroma cacao]
           gi|508722087|gb|EOY13984.1| RNase H family protein
           [Theobroma cacao]
          Length = 429

 Score = 59.7 bits (143), Expect(2) = 2e-14
 Identities = 24/65 (36%), Positives = 36/65 (55%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW +    F ++     S+   +  W FS ++   GHI   IPLFI WF+ +ERNDA
Sbjct: 176 ASQVWNYFAKFFQIYIIHRKSIYQIIWAWLFSSDYTKKGHIHILIPLFIFWFLWVERNDA 235

Query: 181 RHNDI 195
           +H ++
Sbjct: 236 KHRNL 240



 Score = 46.6 bits (109), Expect(2) = 2e-14
 Identities = 37/143 (25%), Positives = 60/143 (41%), Gaps = 31/143 (21%)
 Frame = +3

Query: 306 QPMIKKPIKVSWQKPRKGWVKLNVNGSSKNNPGIGDSGGVI*DQYGHFICAF-------- 461
           +P + KP   SWQKP  G  KLNV+G SK +      G ++ D  G  I +F        
Sbjct: 247 KPSLPKPKVFSWQKPLTGEFKLNVDGGSKYDCQSAAGGRLLRDHTGTLIFSFVENFGPYN 306

Query: 462 -----------------QLFYTNIWAELDSLVLVNCVNLGICNPLLMRSNT*NLESA*N- 587
                            +     +W E+D+ V++  ++ G      +R    ++    + 
Sbjct: 307 SLQAELMALYRGLLLCIEHNVRRLWIEMDAKVVIQMIHRGHKGSAQIRYLLASIRKCLSV 366

Query: 588 -----*HIYREGNKVAGWLANEG 641
                 HI+REGN+ A  L+N+G
Sbjct: 367 ISFRISHIHREGNQAADLLSNQG 389


>ref|XP_011085143.1| PREDICTED: uncharacterized protein LOC105167219 [Sesamum indicum]
          Length = 1203

 Score = 45.4 bits (106), Expect(2) = 3e-09
 Identities = 20/43 (46%), Positives = 27/43 (62%)
 Frame = +3

Query: 297  SSSQPMIKKPIKVSWQKPRKGWVKLNVNGSSKNNPGIGDSGGV 425
            S  +P IK    V W KP  GW+K+N +G+SK NPG   +GG+
Sbjct: 1155 SQYKPKIKI---VKWTKPELGWIKINTDGASKGNPGRAGAGGI 1194



 Score = 43.5 bits (101), Expect(2) = 3e-09
 Identities = 30/108 (27%), Positives = 48/108 (44%), Gaps = 7/108 (6%)
 Frame = +1

Query: 10   VWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDARHN 189
            VWE     F ++     ++   L  WR S   +   HI   +P+ I+WF  +ERND +H 
Sbjct: 1051 VWEHFARKFNMNLPNTDNIVLLLNYWRISA--LGQNHIRMIVPMLILWFGWLERNDVKHR 1108

Query: 190  D-------IKVGLFGGFNTYSKLNMLSWIHRRGEVGVARNMRIPLHNQ 312
            +       IK  +     T  K      I+ +G+  VA+ M + L +Q
Sbjct: 1109 NKNFNSDRIKWKVHQHIVTTFKSKTTKRINWKGDRFVAKFMGLELGSQ 1156


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 67.4 bits (163), Expect = 9e-09
 Identities = 29/65 (44%), Positives = 41/65 (63%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F  + F ++ ++P +V   L TW  SG++V  GHI   IPLFI WF+ +ERNDA
Sbjct: 1671 AKQVWNFFANSFQIYISKPQNVSQILWTWYLSGDYVRKGHIRILIPLFICWFLWLERNDA 1730

Query: 181  RHNDI 195
            +H  +
Sbjct: 1731 KHRHL 1735


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
           gi|508787491|gb|EOY34747.1| Uncharacterized protein
           TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 28/65 (43%), Positives = 39/65 (60%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW F    F ++ + P  V   +  W +SG+FV  GHI + IPLFI WF+ +ERNDA
Sbjct: 731 AKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKGHIRTLIPLFICWFLWLERNDA 790

Query: 181 RHNDI 195
           +H  +
Sbjct: 791 KHRHL 795


>ref|XP_007028292.1| Uncharacterized protein TCM_023960 [Theobroma cacao]
           gi|508716897|gb|EOY08794.1| Uncharacterized protein
           TCM_023960 [Theobroma cacao]
          Length = 303

 Score = 66.2 bits (160), Expect = 2e-08
 Identities = 27/67 (40%), Positives = 41/67 (61%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW F    F ++   P +V   L  W +SG++V  GHI   +PL I+WF+ +ERNDA
Sbjct: 67  AQQVWNFFAKFFQIYVHNPQNVLHILHPWYYSGDYVKPGHIRILLPLLIMWFLWVERNDA 126

Query: 181 RHNDIKV 201
           +H ++K+
Sbjct: 127 KHKELKM 133


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 65.9 bits (159), Expect = 3e-08
 Identities = 28/65 (43%), Positives = 40/65 (61%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F  + F ++ + P +V   L  W FSG++V  GHI + IPLFI WF+ +ERNDA
Sbjct: 1251 AKQVWNFFANFFQIYVSNPQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDA 1310

Query: 181  RHNDI 195
            +   +
Sbjct: 1311 KQRHL 1315


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
           gi|508778195|gb|EOY25451.1| Uncharacterized protein
           TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 65.1 bits (157), Expect = 4e-08
 Identities = 28/67 (41%), Positives = 39/67 (58%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW F    F ++   P  V   L  W FSG++V  GHI S +P+FI WF+ +ERNDA
Sbjct: 596 AKQVWAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLWLERNDA 655

Query: 181 RHNDIKV 201
           +H   ++
Sbjct: 656 KHRHTRL 662


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 64.3 bits (155), Expect = 8e-08
 Identities = 26/65 (40%), Positives = 39/65 (60%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW +    F ++   P ++   L +W +SG+F   GHI + I LFI WF+ +ERNDA
Sbjct: 1052 AQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDA 1111

Query: 181  RHNDI 195
            +H D+
Sbjct: 1112 KHRDL 1116


>ref|XP_007010351.1| Polynucleotidyl transferase, putative [Theobroma cacao]
           gi|508727264|gb|EOY19161.1| Polynucleotidyl transferase,
           putative [Theobroma cacao]
          Length = 419

 Score = 63.5 bits (153), Expect = 1e-07
 Identities = 25/49 (51%), Positives = 39/49 (79%)
 Frame = +3

Query: 315 IKKPIKVSWQKPRKGWVKLNVNGSSKNNPGIGDSGGVI*DQYGHFICAF 461
           +K+ + ++W+KP+ G+VKLNV+GS+K  PG+  SGGVI D+YG++I  F
Sbjct: 245 LKQEVLIAWEKPKNGYVKLNVDGSAKGQPGLAASGGVIRDEYGNWIAGF 293


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
           gi|508710337|gb|EOY02234.1| Uncharacterized protein
           TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 26/62 (41%), Positives = 38/62 (61%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW F  + F ++   P  V   L  W +SG++V  GHI + +P+FI WF+ +ERNDA
Sbjct: 644 AKQVWAFFANFFQIYIFNPQHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDA 703

Query: 181 RH 186
           +H
Sbjct: 704 KH 705


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 62.4 bits (150), Expect = 3e-07
 Identities = 23/65 (35%), Positives = 40/65 (61%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW +   +F +    P ++   +  W +SG++   GHI + +PLFI+WF+ +ERNDA
Sbjct: 1968 AMQVWNYFAKLFQILIINPCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDA 2027

Query: 181  RHNDI 195
            +H ++
Sbjct: 2028 KHRNL 2032


>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 62.0 bits (149), Expect = 4e-07
 Identities = 26/62 (41%), Positives = 37/62 (59%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F    F ++ ++P  +   +  W FSG++   GHI   IPLFI WF+ +ERNDA
Sbjct: 1668 ATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFLWLERNDA 1727

Query: 181  RH 186
            +H
Sbjct: 1728 KH 1729


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 22/65 (33%), Positives = 40/65 (61%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A+ VW +   +F +    P ++   +  W +SG++   GHI + +PLF +WF+ +ERNDA
Sbjct: 1931 ANQVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDA 1990

Query: 181  RHNDI 195
            +H ++
Sbjct: 1991 KHRNL 1995


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
           gi|508787492|gb|EOY34748.1| Uncharacterized protein
           TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 23/65 (35%), Positives = 39/65 (60%)
 Frame = +1

Query: 1   AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
           A  VW +   +F +    P ++   +  W  SG++   GHI + +PLFI+WF+ +ERNDA
Sbjct: 627 AMQVWNYFAKLFQICIINPCTINQIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDA 686

Query: 181 RHNDI 195
           +H ++
Sbjct: 687 KHRNL 691


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 60.8 bits (146), Expect = 8e-07
 Identities = 25/62 (40%), Positives = 37/62 (59%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F    F ++   P  V   L  W +SG++V  GHI + +P+FI WF+ +ERNDA
Sbjct: 1932 AKQVWAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDA 1991

Query: 181  RH 186
            ++
Sbjct: 1992 KY 1993


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 60.5 bits (145), Expect = 1e-06
 Identities = 25/62 (40%), Positives = 35/62 (56%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F   +F ++   P  V   +  W  SG++V  GH    +PLFI WF+ +ERNDA
Sbjct: 849  AKQVWNFFAKLFQIYILNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDA 908

Query: 181  RH 186
            +H
Sbjct: 909  KH 910


>ref|XP_008377870.1| PREDICTED: putative ribonuclease H protein At1g65750 [Malus
           domestica]
          Length = 591

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 44/142 (30%), Positives = 59/142 (41%), Gaps = 37/142 (26%)
 Frame = +3

Query: 327 IKVSWQKPRKGWVKLNVNGSSKNNPGIGDSGGVI*DQYGHFICAFQL-------FYT--- 476
           + V W  P   WVK+N +G +K NPG    GGV  D  G+F+  F L       FY    
Sbjct: 419 VPVLWHPPPSSWVKVNTDGLAKGNPGPAACGGVFRDSAGYFLGGFSLSLGHRTSFYAELH 478

Query: 477 ---------------NIWAELDSLVLVNCVNLGICNP------------LLMRSNT*NLE 575
                          N+W E DS  +++C   G  +P            LL+++      
Sbjct: 479 AVILAVELAHARGWQNLWLESDSSSVISCFASGSFSPPWSLQTRWNNCTLLLQNMVFRCS 538

Query: 576 SA*N*HIYREGNKVAGWLANEG 641
                HI+REGN VA  LAN G
Sbjct: 539 -----HIFREGNAVADKLANLG 555


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 60.1 bits (144), Expect = 1e-06
 Identities = 25/62 (40%), Positives = 35/62 (56%)
 Frame = +1

Query: 1    AHHVWEFCVSMFGVHYTQPTSVQARLQTWRFSGEFVSYGHICSCIPLFIIWFIQIERNDA 180
            A  VW F   +F ++   P  V   +  W  SG++V  GH    +PLFI WF+ +ERNDA
Sbjct: 1845 AKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWLERNDA 1904

Query: 181  RH 186
            +H
Sbjct: 1905 KH 1906


>ref|XP_009774155.1| PREDICTED: uncharacterized protein LOC104224244 [Nicotiana
           sylvestris]
          Length = 195

 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 35/88 (39%), Positives = 42/88 (47%), Gaps = 25/88 (28%)
 Frame = +3

Query: 333 VSWQKPRKGWVKLNVNGSSKNNPGIGDSGGVI*DQYGHFICAFQLFY------------- 473
           V WQKP  GWVKLNV+G SK NP     GG+I D +G  I AF  FY             
Sbjct: 93  VYWQKPHVGWVKLNVDGCSKGNPSPAGGGGLIRDHHGILIEAFAEFYRDCSCNIAEAKAM 152

Query: 474 ------------TNIWAELDSLVLVNCV 521
                       TN+  E DSL+L+N +
Sbjct: 153 MRGIKMCISKGFTNVIVESDSLILLNLI 180


Top