BLASTX nr result
ID: Jatropha_contig00035991
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Jatropha_contig00035991 (550 letters) Database: NCBI-nr (updated 2014/02/11) 35,149,712 sequences; 12,374,887,350 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002521111.1| conserved hypothetical protein [Ricinus comm... 140 1e-31 ref|XP_002301776.1| predicted protein [Populus trichocarpa] gi|1... 135 8e-30 gb|EOX93958.1| Uncharacterized protein isoform 1 [Theobroma cacao] 120 3e-25 ref|XP_002267372.1| PREDICTED: uncharacterized protein At4g13200... 119 6e-25 gb|ESR56867.1| hypothetical protein CICLE_v10022209mg [Citrus cl... 114 1e-23 ref|XP_004506658.1| PREDICTED: uncharacterized protein At4g13200... 114 1e-23 gb|EOX93959.1| Uncharacterized protein isoform 2, partial [Theob... 111 1e-22 ref|XP_003520920.1| PREDICTED: uncharacterized protein LOC100800... 107 1e-21 gb|ESW06032.1| hypothetical protein PHAVU_010G014200g [Phaseolus... 105 5e-21 gb|AGV54203.1| hypothetical protein [Phaseolus vulgaris] 105 5e-21 ref|XP_003552081.1| PREDICTED: uncharacterized protein At4g13200... 104 2e-20 gb|EMJ01684.1| hypothetical protein PRUPE_ppa011364mg [Prunus pe... 103 3e-20 ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593... 98 1e-18 ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259... 97 2e-18 ref|XP_004290221.1| PREDICTED: uncharacterized protein LOC101298... 96 4e-18 gb|ESQ56406.1| hypothetical protein EUTSA_v10026263mg [Eutrema s... 96 5e-18 ref|XP_002863161.1| hypothetical protein ARALYDRAFT_497076 [Arab... 93 5e-17 ref|NP_193056.1| uncharacterized protein [Arabidopsis thaliana] ... 92 1e-16 gb|AAM62995.1| unknown [Arabidopsis thaliana] 91 1e-16 gb|ESQ56405.1| hypothetical protein EUTSA_v10026263mg [Eutrema s... 89 9e-16 >ref|XP_002521111.1| conserved hypothetical protein [Ricinus communis] gi|223539680|gb|EEF41262.1| conserved hypothetical protein [Ricinus communis] Length = 215 Score = 140 bits (354), Expect = 1e-31 Identities = 75/131 (57%), Positives = 88/131 (67%), Gaps = 3/131 (2%) Frame = +2 Query: 11 SAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAV 190 S +K+KK F SH +RCNS TG GGPGSGD+E++++LDAFFLGKALAEAV Sbjct: 37 SDLKLKKNLAFGFRNETTQSHTINLRCNSTTGPGGPGSGDNESRSVLDAFFLGKALAEAV 96 Query: 191 NERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--- 361 NER+ES VGEFLS IGRLQAEQQRQIQDFQ DVL QGL+ K Sbjct: 97 NERVESAVGEFLSTIGRLQAEQQRQIQDFQEDVLERARKAKENAAWEAMEAQGLVSKPST 156 Query: 362 VETASATYGVD 394 V+ AS TYG++ Sbjct: 157 VDAASTTYGIN 167 >ref|XP_002301776.1| predicted protein [Populus trichocarpa] gi|118484006|gb|ABK93890.1| unknown [Populus trichocarpa] gi|222843502|gb|EEE81049.1| hypothetical protein POPTR_0002s24210g [Populus trichocarpa] Length = 209 Score = 135 bits (339), Expect = 8e-30 Identities = 74/134 (55%), Positives = 91/134 (67%), Gaps = 5/134 (3%) Frame = +2 Query: 5 SLSAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGS--GDDENKNILDAFFLGKAL 178 S S +K+K GLRF+T + H VRC+S++G GGPGS GD +++++LDAFFLGKA+ Sbjct: 45 SSSHIKLKTHLGLRFETALRGCHKINVRCSSSSGPGGPGSASGDSDSRSVLDAFFLGKAV 104 Query: 179 AEAVNERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIP 358 AEA+NER+ES VGEFLS IGRLQAEQQ+QIQDFQ DVL QG+IP Sbjct: 105 AEALNERVESAVGEFLSTIGRLQAEQQKQIQDFQEDVLGRAKKAKEQAAREAMEGQGIIP 164 Query: 359 K---VETASATYGV 391 K VET S GV Sbjct: 165 KPTTVETTSVNQGV 178 >gb|EOX93958.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 202 Score = 120 bits (300), Expect = 3e-25 Identities = 59/101 (58%), Positives = 73/101 (72%) Frame = +2 Query: 80 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 259 ++ C S+TG G PGSGD+E++N+LDAFFLGKALAEA+NERIEST+GEFL +GRLQAEQQ Sbjct: 60 SIICRSSTGPGAPGSGDNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQ 119 Query: 260 RQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 382 +Q+QDFQ +VL QGLIPK +AT Sbjct: 120 KQVQDFQEEVLERAKRAKEKAAREAMEAQGLIPKSTAVNAT 160 >ref|XP_002267372.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic [Vitis vinifera] gi|297744310|emb|CBI37280.3| unnamed protein product [Vitis vinifera] Length = 195 Score = 119 bits (297), Expect = 6e-25 Identities = 61/100 (61%), Positives = 74/100 (74%) Frame = +2 Query: 77 RTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQ 256 R V+CNS+T PGSGD ++++ILDAFFLGKALAEA+NERIESTVGEFLS++GRLQAEQ Sbjct: 53 RVVQCNSSTNPPPPGSGDSDSRSILDAFFLGKALAEALNERIESTVGEFLSVVGRLQAEQ 112 Query: 257 QRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETAS 376 Q+Q+QDFQ +VL QGLIPK TA+ Sbjct: 113 QKQVQDFQDEVLERAKRAKEKAAREALEAQGLIPKSTTAA 152 >gb|ESR56867.1| hypothetical protein CICLE_v10022209mg [Citrus clementina] Length = 214 Score = 114 bits (286), Expect = 1e-23 Identities = 63/119 (52%), Positives = 78/119 (65%), Gaps = 1/119 (0%) Frame = +2 Query: 29 KQSGLRFDTPAKDSHVRTVRCNSATGRGGP-GSGDDENKNILDAFFLGKALAEAVNERIE 205 K +GL F AK ++CNS T G P GSGD E++ +LDAFFLGKA+AEA+NERIE Sbjct: 48 KLNGLGFFGGAKSPRRIPLQCNSTTKPGPPSGSGDGESRTVLDAFFLGKAVAEALNERIE 107 Query: 206 STVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 382 S VGEFLS +GRLQAEQQ+Q+Q+FQ DVL +GL+PK T +AT Sbjct: 108 SAVGEFLSTVGRLQAEQQKQVQEFQEDVLERAKKAKEKAAREAMEARGLVPKSRTVNAT 166 >ref|XP_004506658.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like [Cicer arietinum] Length = 197 Score = 114 bits (286), Expect = 1e-23 Identities = 65/117 (55%), Positives = 74/117 (63%), Gaps = 9/117 (7%) Frame = +2 Query: 71 HVRTVRCNSATGRGGP--GSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRL 244 H RCNS GGP G GD +KN+LDAFFLGKALAEA+NERIESTVGEFLS +GRL Sbjct: 54 HSTGFRCNSTFFPGGPPSGDGDSSSKNVLDAFFLGKALAEALNERIESTVGEFLSTVGRL 113 Query: 245 QAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK-------VETASATYGVD 394 QAEQQRQ+QDFQ DVL QGL+ K VE+A++ Y D Sbjct: 114 QAEQQRQVQDFQEDVLERAKKAKEKAAREAVEAQGLVYKSAADTEVVESATSNYSTD 170 >gb|EOX93959.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 149 Score = 111 bits (277), Expect = 1e-22 Identities = 50/70 (71%), Positives = 64/70 (91%) Frame = +2 Query: 80 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 259 ++ C S+TG G PGSGD+E++N+LDAFFLGKALAEA+NERIEST+GEFL +GRLQAEQQ Sbjct: 60 SIICRSSTGPGAPGSGDNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQ 119 Query: 260 RQIQDFQVDV 289 +Q+QDFQV++ Sbjct: 120 KQVQDFQVNL 129 >ref|XP_003520920.1| PREDICTED: uncharacterized protein LOC100800588 [Glycine max] Length = 196 Score = 107 bits (268), Expect = 1e-21 Identities = 64/125 (51%), Positives = 77/125 (61%), Gaps = 5/125 (4%) Frame = +2 Query: 11 SAVKVKKQSGLRFDTPAKDSHVRT-VRCNSATGRGGPGSGDDE--NKNILDAFFLGKALA 181 S+ V S + P K T +RCN + GGPGSGD + N++ILDAFFLGKA+A Sbjct: 26 SSFAVPSSSSSHIELPRKRGSQNTGLRCNCSIWPGGPGSGDSDSSNRSILDAFFLGKAVA 85 Query: 182 EAVNERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK 361 EA+NERIESTVGE LS +GRLQAEQQ+Q+QDFQ +VL QG I K Sbjct: 86 EALNERIESTVGEILSTVGRLQAEQQKQVQDFQEEVLERAKKSKENAAREAMEAQGFISK 145 Query: 362 --VET 370 VET Sbjct: 146 SAVET 150 >gb|ESW06032.1| hypothetical protein PHAVU_010G014200g [Phaseolus vulgaris] Length = 197 Score = 105 bits (263), Expect = 5e-21 Identities = 60/107 (56%), Positives = 72/107 (67%), Gaps = 4/107 (3%) Frame = +2 Query: 62 KDSHVRTVRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVNERIESTVGEFLSLI 235 + S +RCN + GGPGSGD + N++ILDAFFLGKA+AEA+NERIESTVGE LS + Sbjct: 44 RGSQNTALRCNCSILPGGPGSGDSDSSNRSILDAFFLGKAVAEALNERIESTVGEILSTV 103 Query: 236 GRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VET 370 GRLQAEQQ+Q+QDFQ DVL +GLI K VET Sbjct: 104 GRLQAEQQKQVQDFQEDVLERAKRAKEKAAREAMEARGLISKSAVET 150 >gb|AGV54203.1| hypothetical protein [Phaseolus vulgaris] Length = 197 Score = 105 bits (263), Expect = 5e-21 Identities = 60/107 (56%), Positives = 72/107 (67%), Gaps = 4/107 (3%) Frame = +2 Query: 62 KDSHVRTVRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVNERIESTVGEFLSLI 235 + S +RCN + GGPGSGD + N++ILDAFFLGKA+AEA+NERIESTVGE LS + Sbjct: 44 RGSQNTALRCNCSILPGGPGSGDSDSSNRSILDAFFLGKAVAEALNERIESTVGEILSTV 103 Query: 236 GRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VET 370 GRLQAEQQ+Q+QDFQ DVL +GLI K VET Sbjct: 104 GRLQAEQQKQVQDFQEDVLERAKRAKEKAAREAMEARGLISKSAVET 150 >ref|XP_003552081.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like [Glycine max] Length = 193 Score = 104 bits (259), Expect = 2e-20 Identities = 61/121 (50%), Positives = 74/121 (61%), Gaps = 5/121 (4%) Frame = +2 Query: 23 VKKQSGLRFDTPAKDSHVRT-VRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVN 193 V + P K T +RC + GGPGSGD + N+++LDAFFLGKA+AEA+N Sbjct: 26 VPSSKSTHIELPRKRGSQNTGLRCKCSIWPGGPGSGDSDSSNRSVLDAFFLGKAVAEALN 85 Query: 194 ERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VE 367 ERIESTVGE LS +GRLQAEQQ+Q+QDFQ +VL QGLI K VE Sbjct: 86 ERIESTVGEILSTVGRLQAEQQKQVQDFQEEVLERAKKSKEKSARQAMEAQGLISKSGVE 145 Query: 368 T 370 T Sbjct: 146 T 146 >gb|EMJ01684.1| hypothetical protein PRUPE_ppa011364mg [Prunus persica] Length = 214 Score = 103 bits (256), Expect = 3e-20 Identities = 51/77 (66%), Positives = 66/77 (85%) Frame = +2 Query: 62 KDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGR 241 K S V+C+S++G PGSGD +++++LDAFFLGKALAEA+NERIES+VGEFLS IGR Sbjct: 54 KGSQRSRVQCSSSSG---PGSGDGDSRSVLDAFFLGKALAEAINERIESSVGEFLSTIGR 110 Query: 242 LQAEQQRQIQDFQVDVL 292 LQAEQQ+Q+++FQ DVL Sbjct: 111 LQAEQQKQVEEFQEDVL 127 >ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593653 [Solanum tuberosum] Length = 180 Score = 97.8 bits (242), Expect = 1e-18 Identities = 54/92 (58%), Positives = 63/92 (68%) Frame = +2 Query: 80 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 259 T CN ++ P G++E+KNILDAFFLGKALAEAV ERIESTVGEFLS +GRLQAEQQ Sbjct: 47 TFTCNFSSNPPPP-PGENESKNILDAFFLGKALAEAVTERIESTVGEFLSTVGRLQAEQQ 105 Query: 260 RQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 355 +Q+QDFQ +VL QGLI Sbjct: 106 KQVQDFQEEVLERAKQAKEKAARETMETQGLI 137 >ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259284 [Solanum lycopersicum] Length = 223 Score = 97.1 bits (240), Expect = 2e-18 Identities = 53/101 (52%), Positives = 69/101 (68%), Gaps = 3/101 (2%) Frame = +2 Query: 89 CNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQRQI 268 CN ++ P G++E+KN+LDAFFLGKALAEAV ERIESTVGEFLS +GRLQ+EQQ+Q+ Sbjct: 94 CNFSSN---PPPGENESKNVLDAFFLGKALAEAVTERIESTVGEFLSTVGRLQSEQQKQV 150 Query: 269 QDFQVDVLXXXXXXXXXXXXXXXXXQGLIP---KVETASAT 382 QDFQ ++L QGLI + +T++AT Sbjct: 151 QDFQEEILERAKQAKEKAARETMETQGLISNSYEADTSTAT 191 >ref|XP_004290221.1| PREDICTED: uncharacterized protein LOC101298383 [Fragaria vesca subsp. vesca] Length = 217 Score = 96.3 bits (238), Expect = 4e-18 Identities = 52/100 (52%), Positives = 65/100 (65%) Frame = +2 Query: 83 VRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQR 262 V CN+ G G+ ++K++LDAFFLGKALAEA+NERIES+VGEFLS IGRLQAEQQ+ Sbjct: 68 VHCNT-------GPGESDSKSVLDAFFLGKALAEALNERIESSVGEFLSTIGRLQAEQQK 120 Query: 263 QIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 382 Q+ +FQ DVL QG++ K T S T Sbjct: 121 QVVEFQADVLERAKKAKEKAAREAAEAQGIVSKPTTESIT 160 >gb|ESQ56406.1| hypothetical protein EUTSA_v10026263mg [Eutrema salsugineum] Length = 205 Score = 95.9 bits (237), Expect = 5e-18 Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%) Frame = +2 Query: 38 GLRFDTPAKDSHVRTVRCNSAT-GRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTV 214 GLR A+ + ++RCN + G G SG++EN+++LDAFFLGKALAE +NERIESTV Sbjct: 39 GLRLSGEAQRT---SLRCNCCSKGNRGTSSGENENRSVLDAFFLGKALAEVINERIESTV 95 Query: 215 GEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI-PKVET 370 GE L IGR QAEQQ+Q+Q+ Q +V QGL+ PK ET Sbjct: 96 GEVLGTIGRFQAEQQKQVQEIQEEVFERAKKAKERAARETMEEQGLVAPKPET 148 >ref|XP_002863161.1| hypothetical protein ARALYDRAFT_497076 [Arabidopsis lyrata subsp. lyrata] gi|297308995|gb|EFH39420.1| hypothetical protein ARALYDRAFT_497076 [Arabidopsis lyrata subsp. lyrata] Length = 181 Score = 92.8 bits (229), Expect = 5e-17 Identities = 50/121 (41%), Positives = 68/121 (56%) Frame = +2 Query: 11 SAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAV 190 S + +S F A ++ R+ + G G SG++EN+++LDAFFLGKALAE + Sbjct: 25 SFIPRNSRSNFEFRRLAVEARRRSTSLRCSNGTHGSDSGENENRSVLDAFFLGKALAEVI 84 Query: 191 NERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVET 370 NERIESTVGE L IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 85 NERIESTVGEVLGTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETKEEQGLVASKSA 144 Query: 371 A 373 A Sbjct: 145 A 145 >ref|NP_193056.1| uncharacterized protein [Arabidopsis thaliana] gi|147742899|sp|Q8LDV3.2|Y4320_ARATH RecName: Full=Uncharacterized protein At4g13200, chloroplastic; Flags: Precursor gi|4753654|emb|CAB41930.1| putative protein [Arabidopsis thaliana] gi|7268022|emb|CAB78362.1| putative protein [Arabidopsis thaliana] gi|17380654|gb|AAL36157.1| unknown protein [Arabidopsis thaliana] gi|21436277|gb|AAM51277.1| unknown protein [Arabidopsis thaliana] gi|332657844|gb|AEE83244.1| uncharacterized protein AT4G13200 [Arabidopsis thaliana] Length = 185 Score = 91.7 bits (226), Expect = 1e-16 Identities = 51/105 (48%), Positives = 64/105 (60%) Frame = +2 Query: 41 LRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGE 220 LR D +S R+ S G SG++ENK++LDAFFLGKALAE +NERIESTVGE Sbjct: 40 LRLDV---ESRRRSTSLRSNCSTKGTDSGENENKSVLDAFFLGKALAEVINERIESTVGE 96 Query: 221 FLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 355 LS IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 97 VLSTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETMEEQGLV 141 >gb|AAM62995.1| unknown [Arabidopsis thaliana] Length = 185 Score = 91.3 bits (225), Expect = 1e-16 Identities = 51/105 (48%), Positives = 64/105 (60%) Frame = +2 Query: 41 LRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGE 220 LR D +S R+ S G SG++ENK++LDAFFLGKALAE +NERIESTVGE Sbjct: 40 LRLDV---ESRRRSTYLRSNCSTKGTDSGENENKSVLDAFFLGKALAEVINERIESTVGE 96 Query: 221 FLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 355 LS IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 97 VLSTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETMEEQGLV 141 >gb|ESQ56405.1| hypothetical protein EUTSA_v10026263mg [Eutrema salsugineum] Length = 201 Score = 88.6 bits (218), Expect = 9e-16 Identities = 51/112 (45%), Positives = 66/112 (58%), Gaps = 1/112 (0%) Frame = +2 Query: 38 GLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVG 217 GLR A+ + +R C+ G++EN+++LDAFFLGKALAE +NERIESTVG Sbjct: 39 GLRLSGEAQRTSLRCNCCSKGN------RGENENRSVLDAFFLGKALAEVINERIESTVG 92 Query: 218 EFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI-PKVET 370 E L IGR QAEQQ+Q+Q+ Q +V QGL+ PK ET Sbjct: 93 EVLGTIGRFQAEQQKQVQEIQEEVFERAKKAKERAARETMEEQGLVAPKPET 144