BLASTX nr result
ID: Sinomenium21_contig00042750
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00042750
         (338 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...   80   4e-13
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...   79   9e-13
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...   78   1e-12
ref|XP_004248946.1| PREDICTED: putative ribonuclease H protein A...   77   2e-12
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   77   3e-12
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           77   3e-12
ref|XP_004253499.1| PREDICTED: putative ribonuclease H protein A...   76   4e-12
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   76   4e-12
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...   76   6e-12
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...   76   6e-12
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   75   9e-12
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...   74   2e-11
ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein A...   74   3e-11
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...   73   4e-11
ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein A...   73   5e-11
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...   72   6e-11
ref|XP_004239566.1| PREDICTED: putative ribonuclease H protein A...   72   6e-11
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       72   6e-11
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...   72   1e-10
emb|CAB72467.1| putative protein [Arabidopsis thaliana]               71   1e-10

>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
 gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 79.7 bits (195), Expect = 4e-13
 Identities = 42/103 (40%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
 Frame = +1

Query: 28   HHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKT 204
            H+ GVP+ +SHL +AD++LIF NG K +++ ++ L+ YE+ SGQRI+ +KS
Sbjct: 1550 HYSSGVPLSVSHLAFADDVLIFTNGSKSALQRILVFLQEYEEISGQRINAQKSCFVTHTN 1609

Query: 205  INLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            I S ++ + + TGF PITYLGA L G +F LV
Sbjct: 1610 IPNSRRQIIAQATGFNHQLLPITYLGAPLYKGHKKVILFNDLV 1652

>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
 gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 78.6 bits (192), Expect = 9e-13
 Identities = 40/103 (38%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
 Frame = +1

Query: 28   HHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKT 204
            H+ G + +SHL +AD+++IF NG K +++ ++ L+ YEK SGQRI+ +KS +
Sbjct: 1513 HYSSGCSLSVSHLAFADDVIIFANGSKSALQKIMAFLQEYEKLSGQRINPQKSCVVTHTN 1572

Query: 205  INLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            + S ++ +L+ TGF PITYLGA L G +F LV
Sbjct: 1573 MASSRRQIILQATGFSHRPLPITYLGAPLYKGHKKVMLFNDLV 1615

>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
 gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 78.2 bits (191), Expect = 1e-12
 Identities = 41/103 (39%), Positives = 62/103 (60%), Gaps = 1/103 (0%)
 Frame = +1

Query: 28   HHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKT 204
            H+ G M ISHL +AD+++IF NG K +++ ++ L+ YE+ SGQRI+ +KS +
Sbjct: 2801 HYSSGCSMPISHLAFADDVIIFANGSKSALQRILAFLQEYEELSGQRINPQKSCVVTHTN 2860

Query: 205  INLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            + S ++ +L+ TGF PITYLGA L G +F LV
Sbjct: 2861 MASSRRQIILQATGFSHRPLPITYLGAPLFKGHKKVILFNDLV 2903

 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 31/90 (34%), Positives = 54/90 (60%)
 Frame = +1

Query: 64   LYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKRDLLRLT 243
            L D+I+IF NG + S++ +++ L+ YE+ SGQ+++ +KS + LS ++ + T
Sbjct: 1020 LQLDDIVIFTNGCRSSLQKILNFLQEYEQVSGQQVNHQKSCFITTNGCALSRRQIISHTT 1079

Query: 244  GFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            GF P+TYLGA L G+ +F+ L+
Sbjct: 1080 GFHHKTLPVTYLGAPLHKGQKKVILFDSLI 1109

>ref|XP_004248946.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Solanum lycopersicum]
          Length = 622

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 40/96 (41%), Positives = 60/96 (62%)
 Frame = +1

Query: 46  PMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKR 225
           P ++HL +AD+I++F +G K++K L+ TL+ YEK SGQ I+ +KS + S +
Sbjct: 6   PQVNHLSFADDIILFTSGKSKTLKILMHTLKEYEKISGQLINGDKSHFMLHSSAFNSTRD 65

Query: 226 DLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + RLTGFK+ + PITYLG L GR + F L+
Sbjct: 66  RIKRLTGFKQNHGPITYLGCPLFVGRPRNVYFSDLI 101

>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like
 [Arabidopsis thaliana]
          Length = 1072

 Score = 76.6 bits (187), Expect = 3e-12
 Identities = 42/112 (37%), Positives = 69/112 (61%)
 Frame = +1

Query: 1   FENGRISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEK 180
           +++G I +H G ISHL++AD+++IF +GG S+ + +TL+ + WSG +++K+K
Sbjct: 536 YDSGYIH-YHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 181 SALFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           S LF + ++LS +R GF G FPI YLG L+ +L + PL+E
Sbjct: 595 SQLFQA-GLDLS-ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLE 644

>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score = 76.6 bits (187), Expect = 3e-12
 Identities = 42/112 (37%), Positives = 69/112 (61%)
 Frame = +1

Query: 1   FENGRISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEK 180
           +++G I +H G ISHL++AD+++IF +GG S+ + +TL+ + WSG +++K+K
Sbjct: 536 YDSGYIH-YHPKAGDLSISHLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDK 594

Query: 181 SALFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           S LF + ++LS +R GF G FPI YLG L+ +L + PL+E
Sbjct: 595 SQLFQA-GLDLS-ERITSAAYGFPAGTFPIRYLGLPLMCRKLRIADYGPLLE 644

>ref|XP_004253499.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Solanum lycopersicum]
          Length = 314

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 39/96 (40%), Positives = 60/96 (62%)
 Frame = +1

Query: 46  PMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKR 225
           P ++HL +AD+I++F +G K++K L+ TL+ YE+ SGQ I+++KS + S +
Sbjct: 6   PQVNHLSFADDIILFTSGRSKTLKLLMSTLKAYEETSGQLINEDKSHFMLHPSAFNSTRD 65

Query: 226 DLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + RLTGFK+ PITYLG L GR + F L+
Sbjct: 66  RIKRLTGFKQKQGPITYLGCPLFVGRPRNVYFSNLI 101

>emb|CAA18234.1| putative protein [Arabidopsis thaliana]
 gi|7269488|emb|CAB79491.1| putative protein [Arabidopsis thaliana]
          Length = 1141

 Score = 76.3 bits (186), Expect = 4e-12
 Identities = 40/112 (35%), Positives = 66/112 (58%)
 Frame = +1

Query: 1   FENGRISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEK 180
           F++G I ++H ISHL++AD+++IF +GG S+ + +TLE + WSG +++ +K
Sbjct: 654 FDSGYI-RYHPKASDLSISHLMFADDVMIFFDGGSSSLHGICETLEDFASWSGLKVNNDK 712

Query: 181 SALFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           S F + +R+ L GF +G PI YLG L+ +L +EPL+E
Sbjct: 713 SHFFCAGLEQA--ERNSLAAYGFPQGCLPIRYLGLPLMCRKLRIAEYEPLLE 762

>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
 gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 75.9 bits (185), Expect = 6e-12
 Identities = 39/106 (36%), Positives = 62/106 (58%)
 Frame = +1

Query: 16  ISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFP 195
           IS +H G ISHL +AD+I+IF NG K ++ +++ L+ YE+ SGQR++ +KS
Sbjct: 631 ISLHYHSGCSLNISHLAFADDIMIFTNGSKSVLEKILEFLQEYEQISGQRVNHQKSCFVT 690

Query: 196 SKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + + S ++ + + GF PITYLGA L G +F+ L+
Sbjct: 691 ANNMPSSRRQIISQTIGFLHKTLPITYLGAPLFKGPKKVMLFDSLI 736

>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
 gi|508710342|gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 75.9 bits (185), Expect = 6e-12
 Identities = 35/94 (37%), Positives = 60/94 (63%)
 Frame = +1

Query: 52   ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKRDL 231
            ISHL +AD+I+IF NGG+ +++ ++ L+ YE+ SGQ+++ +KS + +LS ++ +
Sbjct: 1436 ISHLSFADDIVIFTNGGRSALQKILSFLQEYEQVSGQKVNHQKSCFITANGCSLSRRQII 1495

Query: 232  LRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            TGF+ P+TYLGA L G +F+ L+
Sbjct: 1496 SHTTGFQHKTLPVTYLGAPLHKGPKKVLLFDSLI 1529

>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481
 [Solanum tuberosum]
          Length = 1135

 Score = 75.1 bits (183), Expect = 9e-12
 Identities = 39/112 (34%), Positives = 65/112 (58%)
 Frame = +1

Query: 1   FENGRISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEK 180
           FE+ + P P++SHL YAD+ ++F +G S++ +++ L YEK SGQ I+ +K
Sbjct: 529 FEDPDYIGYGMPKWSPVVSHLSYADDTILFCSGQTTSMRKMINILRGYEKVSGQMINLDK 588

Query: 181 SALFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           S ++ K + + R+TG ++G+FP TYLG + GR FE L++
Sbjct: 589 SMIYLHKQVPNRVCNLVKRITGIRQGSFPFTYLGCPIFYGRKNKGHFENLLK 640

>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
 gi|508778198|gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 39/103 (37%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
 Frame = +1

Query: 28   HHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKT 204
            H+ GV + +SHL +AD++LIF NG K +++ ++ L+ YE+ S QRI+ +KS
Sbjct: 1720 HYSTGVSIPVSHLAFADDVLIFTNGSKSALQRILAFLQEYEEISRQRINAQKSCFVTHTN 1779

Query: 205  INLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            ++ S ++ + + TGF PITYLGA L G +F LV
Sbjct: 1780 VSSSRRQIIAQTTGFNHQLLPITYLGAPLYKGHKKVILFNDLV 1822

>ref|XP_004253372.1| PREDICTED: putative ribonuclease H protein At1g65750-like,
 partial [Solanum lycopersicum]
          Length = 451

 Score = 73.6 bits (179), Expect = 3e-11
 Identities = 39/96 (40%), Positives = 58/96 (60%)
 Frame = +1

Query: 46  PMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKR 225
           P ++HL +AD+I++F +G K+++ L+DTL+ YE SGQ I+ +KS S +
Sbjct: 6   PQVNHLSFADDIILFTSGRGKTLELLMDTLKKYENISGQLINGDKSHFLIHPNAFNSTRD 65

Query: 226 DLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + RLTGFK+ PITYLG L GR + F L+
Sbjct: 66  RIKRLTGFKQKQGPITYLGCPLFVGRPRNTYFSNLI 101

>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
 gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 37/108 (34%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
 Frame = +1

Query: 13   RISKFHHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSAL 189
            R + H+ G M +SHL +AD+I+IF NG +++ ++ L+ YE+ SGQ+++ +KS
Sbjct: 1509 RYNSLHYLSGCSMSVSHLAFADDIVIFTNGCHSALQKILVFLQEYEQVSGQQVNHQKSCF 1568

Query: 190  FPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            + LS ++ + ++TGF+ P+TYLGA L G +F+ L+
Sbjct: 1569 ITANGCPLSRRQIIAQVTGFQHKTLPVTYLGAPLHKGPKKVFLFDSLI 1616

>ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Solanum lycopersicum]
          Length = 775

 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 37/96 (38%), Positives = 60/96 (62%)
 Frame = +1

Query: 46  PMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKR 225
           P ++HL +AD+I++F +G +++K L++TL+ YE+ SGQ I+ +KS + S +
Sbjct: 65  PQVNHLSFADDIILFTSGRSRTLKLLMNTLKVYEETSGQLINGDKSHFMLHPSAFNSTRD 124

Query: 226 DLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + +LTGFK+ PITYLG L GR + F L+
Sbjct: 125 RIKKLTGFKQKQGPITYLGCPLFVGRPRNVYFSYLI 160

>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
 gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 38/103 (36%), Positives = 61/103 (59%), Gaps = 1/103 (0%)
 Frame = +1

Query: 28   HHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKT 204
            H+ GV + +SHL +AD++LIF NG K +++ ++ L+ Y++ SGQRI+ +KS
Sbjct: 1548 HYSSGVSISVSHLAFADDVLIFTNGSKSALQRILAFLQEYQEISGQRINVQKSCFVTHTN 1607

Query: 205  INLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
            ++ S ++ + + TGF ITYLGA L G +F LV
Sbjct: 1608 VSSSRRQIIAQTTGFSHQLLLITYLGAPLYKGHKKVILFNDLV 1650

>ref|XP_004239566.1| PREDICTED: putative ribonuclease H protein At1g65750-like
 [Solanum lycopersicum]
          Length = 606

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 40/96 (41%), Positives = 57/96 (59%)
 Frame = +1

Query: 46  PMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKR 225
           P ++HL +AD+I++F +G K++K L+ TL YEK SGQ I+ +KS + S K
Sbjct: 98  PQVNHLSFADDIILFTSGRSKTLKLLMYTLREYEKVSGQLINGDKSNFMLHPSAFSSTKD 157

Query: 226 DLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLV 333
           + RLTGFK+ ITYLG L GR + F L+
Sbjct: 158 RIRRLTGFKQKQGLITYLGCPLFVGRPRNVYFSDLI 193

>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 39/112 (34%), Positives = 66/112 (58%)
 Frame = +1

Query: 1   FENGRISKFHHPGGVPMISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEK 180
           +E+G I +H ISHL++AD+++IF +GG S+ + +TL+ + WSG +++K+K
Sbjct: 676 YESGLIH-YHPKASNLSISHLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDK 734

Query: 181 SALFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           S L+ + L + + GF G PI YLG L++ +L +EPL+E
Sbjct: 735 SHLYLAGLNQL--ESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLE 784

>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
 gi|508704887|gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 32/83 (38%), Positives = 55/83 (66%)
 Frame = +1

Query: 52  ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSALFPSKTINLSWKRDL 231
           ISH+ +AD+I+IF NGG+ ++++++ L+ YE+ SGQ+++ +KS + LS ++ +
Sbjct: 630 ISHISFADDIVIFTNGGRSALQNILSFLQEYEQVSGQKVNHQKSCFITANGCPLSRRQII 689

Query: 232 LRLTGFKEGNFPITYLGAHLVSG 300
           TGF+ P+TYLGA L G
Sbjct: 690 SHTTGFQHKTLPVTYLGAPLHKG 712

>emb|CAB72467.1| putative protein [Arabidopsis thaliana]
          Length = 762

 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 41/110 (37%), Positives = 67/110 (60%), Gaps = 1/110 (0%)
 Frame = +1

Query: 10  GRISKFHHPGGVPM-ISHLLYADEILIFVNGGKKSIKSLVDTLETYEKWSGQRISKEKSA 186
           GRI +HP M ++HL +AD+++I +G +SI+ +++ + + KWSG +IS EKS
Sbjct: 238 GRIG--YHPHCKRMGLTHLSFADDLMILTDGQCRSIEGIIEVFDLFSKWSGLKISMEKST 295

Query: 187 LFPSKTINLSWKRDLLRLTGFKEGNFPITYLGAHLVSGRLTSCVFEPLVE 336
           +F S ++ + + L F+ G PI YLG LV+ RL+S + PL+E
Sbjct: 296 IF-SAGLSSTSRAQLHTHFPFEVGELPIRYLGLPLVTKRLSSVDYAPLIE 344