BLASTX nr result
ID: Jatropha_contig00035971
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00035971
         (550 letters)

Database: NCBI-nr (updated 2014/02/11)
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002521111.1| conserved hypothetical protein [Ricinus comm...  140   1e-31
ref|XP_002301776.1| predicted protein [Populus trichocarpa] gi|1...  135   8e-30
gb|EOX93958.1| Uncharacterized protein isoform 1 [Theobroma cacao]   120   3e-25
ref|XP_002267372.1| PREDICTED: uncharacterized protein At4g13200...  119   6e-25
gb|ESR56867.1| hypothetical protein CICLE_v10022209mg [Citrus cl...  114   1e-23
ref|XP_004506658.1| PREDICTED: uncharacterized protein At4g13200...  114   1e-23
gb|EOX93959.1| Uncharacterized protein isoform 2, partial [Theob...  111   1e-22
ref|XP_003520920.1| PREDICTED: uncharacterized protein LOC100800...  107   1e-21
gb|ESW06032.1| hypothetical protein PHAVU_010G014200g [Phaseolus...  105   5e-21
gb|AGV54203.1| hypothetical protein [Phaseolus vulgaris]             105   5e-21
ref|XP_003552081.1| PREDICTED: uncharacterized protein At4g13200...  104   1e-20
gb|EMJ01684.1| hypothetical protein PRUPE_ppa011364mg [Prunus pe...  103   3e-20
ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593...   98   1e-18
ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259...   97   2e-18
ref|XP_004290221.1| PREDICTED: uncharacterized protein LOC101298...   96   4e-18
gb|ESQ56406.1| hypothetical protein EUTSA_v10026263mg [Eutrema s...   96   5e-18
ref|XP_002863161.1| hypothetical protein ARALYDRAFT_497076 [Arab...   93   4e-17
ref|NP_193056.1| uncharacterized protein [Arabidopsis thaliana] ...
92 1e-16 gb|AAM62995.1| unknown [Arabidopsis thaliana] 91 1e-16 gb|ESQ56405.1| hypothetical protein EUTSA_v10026263mg [Eutrema s... 89 8e-16 >ref|XP_002521111.1| conserved hypothetical protein [Ricinus communis] gi|223539680|gb|EEF41262.1| conserved hypothetical protein [Ricinus communis] Length = 215 Score = 140 bits (354), Expect = 1e-31 Identities = 75/131 (57%), Positives = 88/131 (67%), Gaps = 3/131 (2%) Frame = +3 Query: 9 SAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAV 188 S +K+KK F SH +RCNS TG GGPGSGD+E++++LDAFFLGKALAEAV Sbjct: 37 SDLKLKKNLAFGFRNETTQSHTINLRCNSTTGPGGPGSGDNESRSVLDAFFLGKALAEAV 96 Query: 189 NERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--- 359 NER+ES VGEFLS IGRLQAEQQRQIQDFQ DVL QGL+ K Sbjct: 97 NERVESAVGEFLSTIGRLQAEQQRQIQDFQEDVLERARKAKENAAWEAMEAQGLVSKPST 156 Query: 360 VETASATYGVD 392 V+ AS TYG++ Sbjct: 157 VDAASTTYGIN 167 >ref|XP_002301776.1| predicted protein [Populus trichocarpa] gi|118484006|gb|ABK93890.1| unknown [Populus trichocarpa] gi|222843502|gb|EEE81049.1| hypothetical protein POPTR_0002s24210g [Populus trichocarpa] Length = 209 Score = 135 bits (339), Expect = 8e-30 Identities = 74/134 (55%), Positives = 91/134 (67%), Gaps = 5/134 (3%) Frame = +3 Query: 3 SLSAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGS--GDDENKNILDAFFLGKAL 176 S S +K+K GLRF+T + H VRC+S++G GGPGS GD +++++LDAFFLGKA+ Sbjct: 45 SSSHIKLKTHLGLRFETALRGCHKINVRCSSSSGPGGPGSASGDSDSRSVLDAFFLGKAV 104 Query: 177 AEAVNERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIP 356 AEA+NER+ES VGEFLS IGRLQAEQQ+QIQDFQ DVL QG+IP Sbjct: 105 AEALNERVESAVGEFLSTIGRLQAEQQKQIQDFQEDVLGRAKKAKEQAAREAMEGQGIIP 164 Query: 357 K---VETASATYGV 389 K VET S GV Sbjct: 165 KPTTVETTSVNQGV 178 >gb|EOX93958.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 202 Score = 120 bits (300), Expect = 3e-25 Identities = 59/101 (58%), Positives = 73/101 (72%) Frame = +3 Query: 78 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 257 ++ C S+TG G 
PGSGD+E++N+LDAFFLGKALAEA+NERIEST+GEFL +GRLQAEQQ Sbjct: 60 SIICRSSTGPGAPGSGDNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQ 119 Query: 258 RQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 380 +Q+QDFQ +VL QGLIPK +AT Sbjct: 120 KQVQDFQEEVLERAKRAKEKAAREAMEAQGLIPKSTAVNAT 160 >ref|XP_002267372.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic [Vitis vinifera] gi|297744310|emb|CBI37280.3| unnamed protein product [Vitis vinifera] Length = 195 Score = 119 bits (297), Expect = 6e-25 Identities = 61/100 (61%), Positives = 74/100 (74%) Frame = +3 Query: 75 RTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQ 254 R V+CNS+T PGSGD ++++ILDAFFLGKALAEA+NERIESTVGEFLS++GRLQAEQ Sbjct: 53 RVVQCNSSTNPPPPGSGDSDSRSILDAFFLGKALAEALNERIESTVGEFLSVVGRLQAEQ 112 Query: 255 QRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETAS 374 Q+Q+QDFQ +VL QGLIPK TA+ Sbjct: 113 QKQVQDFQDEVLERAKRAKEKAAREALEAQGLIPKSTTAA 152 >gb|ESR56867.1| hypothetical protein CICLE_v10022209mg [Citrus clementina] Length = 214 Score = 114 bits (286), Expect = 1e-23 Identities = 63/119 (52%), Positives = 78/119 (65%), Gaps = 1/119 (0%) Frame = +3 Query: 27 KQSGLRFDTPAKDSHVRTVRCNSATGRGGP-GSGDDENKNILDAFFLGKALAEAVNERIE 203 K +GL F AK ++CNS T G P GSGD E++ +LDAFFLGKA+AEA+NERIE Sbjct: 48 KLNGLGFFGGAKSPRRIPLQCNSTTKPGPPSGSGDGESRTVLDAFFLGKAVAEALNERIE 107 Query: 204 STVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 380 S VGEFLS +GRLQAEQQ+Q+Q+FQ DVL +GL+PK T +AT Sbjct: 108 SAVGEFLSTVGRLQAEQQKQVQEFQEDVLERAKKAKEKAAREAMEARGLVPKSRTVNAT 166 >ref|XP_004506658.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like [Cicer arietinum] Length = 197 Score = 114 bits (286), Expect = 1e-23 Identities = 65/117 (55%), Positives = 74/117 (63%), Gaps = 9/117 (7%) Frame = +3 Query: 69 HVRTVRCNSATGRGGP--GSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRL 242 H RCNS GGP G GD +KN+LDAFFLGKALAEA+NERIESTVGEFLS +GRL Sbjct: 54 HSTGFRCNSTFFPGGPPSGDGDSSSKNVLDAFFLGKALAEALNERIESTVGEFLSTVGRL 113 Query: 243 
QAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK-------VETASATYGVD 392 QAEQQRQ+QDFQ DVL QGL+ K VE+A++ Y D Sbjct: 114 QAEQQRQVQDFQEDVLERAKKAKEKAAREAVEAQGLVYKSAADTEVVESATSNYSTD 170 >gb|EOX93959.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 149 Score = 111 bits (277), Expect = 1e-22 Identities = 50/70 (71%), Positives = 64/70 (91%) Frame = +3 Query: 78 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 257 ++ C S+TG G PGSGD+E++N+LDAFFLGKALAEA+NERIEST+GEFL +GRLQAEQQ Sbjct: 60 SIICRSSTGPGAPGSGDNESRNVLDAFFLGKALAEALNERIESTIGEFLGAVGRLQAEQQ 119 Query: 258 RQIQDFQVDV 287 +Q+QDFQV++ Sbjct: 120 KQVQDFQVNL 129 >ref|XP_003520920.1| PREDICTED: uncharacterized protein LOC100800588 [Glycine max] Length = 196 Score = 107 bits (268), Expect = 1e-21 Identities = 64/125 (51%), Positives = 77/125 (61%), Gaps = 5/125 (4%) Frame = +3 Query: 9 SAVKVKKQSGLRFDTPAKDSHVRT-VRCNSATGRGGPGSGDDE--NKNILDAFFLGKALA 179 S+ V S + P K T +RCN + GGPGSGD + N++ILDAFFLGKA+A Sbjct: 26 SSFAVPSSSSSHIELPRKRGSQNTGLRCNCSIWPGGPGSGDSDSSNRSILDAFFLGKAVA 85 Query: 180 EAVNERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK 359 EA+NERIESTVGE LS +GRLQAEQQ+Q+QDFQ +VL QG I K Sbjct: 86 EALNERIESTVGEILSTVGRLQAEQQKQVQDFQEEVLERAKKSKENAAREAMEAQGFISK 145 Query: 360 --VET 368 VET Sbjct: 146 SAVET 150 >gb|ESW06032.1| hypothetical protein PHAVU_010G014200g [Phaseolus vulgaris] Length = 197 Score = 105 bits (263), Expect = 5e-21 Identities = 60/107 (56%), Positives = 72/107 (67%), Gaps = 4/107 (3%) Frame = +3 Query: 60 KDSHVRTVRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVNERIESTVGEFLSLI 233 + S +RCN + GGPGSGD + N++ILDAFFLGKA+AEA+NERIESTVGE LS + Sbjct: 44 RGSQNTALRCNCSILPGGPGSGDSDSSNRSILDAFFLGKAVAEALNERIESTVGEILSTV 103 Query: 234 GRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VET 368 GRLQAEQQ+Q+QDFQ DVL +GLI K VET Sbjct: 104 GRLQAEQQKQVQDFQEDVLERAKRAKEKAAREAMEARGLISKSAVET 150 >gb|AGV54203.1| hypothetical protein [Phaseolus vulgaris] Length = 197 Score = 105 bits (263), Expect = 5e-21 Identities = 60/107 (56%), 
Positives = 72/107 (67%), Gaps = 4/107 (3%) Frame = +3 Query: 60 KDSHVRTVRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVNERIESTVGEFLSLI 233 + S +RCN + GGPGSGD + N++ILDAFFLGKA+AEA+NERIESTVGE LS + Sbjct: 44 RGSQNTALRCNCSILPGGPGSGDSDSSNRSILDAFFLGKAVAEALNERIESTVGEILSTV 103 Query: 234 GRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VET 368 GRLQAEQQ+Q+QDFQ DVL +GLI K VET Sbjct: 104 GRLQAEQQKQVQDFQEDVLERAKRAKEKAAREAMEARGLISKSAVET 150 >ref|XP_003552081.1| PREDICTED: uncharacterized protein At4g13200, chloroplastic-like [Glycine max] Length = 193 Score = 104 bits (259), Expect = 1e-20 Identities = 61/121 (50%), Positives = 74/121 (61%), Gaps = 5/121 (4%) Frame = +3 Query: 21 VKKQSGLRFDTPAKDSHVRT-VRCNSATGRGGPGSGDDE--NKNILDAFFLGKALAEAVN 191 V + P K T +RC + GGPGSGD + N+++LDAFFLGKA+AEA+N Sbjct: 26 VPSSKSTHIELPRKRGSQNTGLRCKCSIWPGGPGSGDSDSSNRSVLDAFFLGKAVAEALN 85 Query: 192 ERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPK--VE 365 ERIESTVGE LS +GRLQAEQQ+Q+QDFQ +VL QGLI K VE Sbjct: 86 ERIESTVGEILSTVGRLQAEQQKQVQDFQEEVLERAKKSKEKSARQAMEAQGLISKSGVE 145 Query: 366 T 368 T Sbjct: 146 T 146 >gb|EMJ01684.1| hypothetical protein PRUPE_ppa011364mg [Prunus persica] Length = 214 Score = 103 bits (256), Expect = 3e-20 Identities = 51/77 (66%), Positives = 66/77 (85%) Frame = +3 Query: 60 KDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGR 239 K S V+C+S++G PGSGD +++++LDAFFLGKALAEA+NERIES+VGEFLS IGR Sbjct: 54 KGSQRSRVQCSSSSG---PGSGDGDSRSVLDAFFLGKALAEAINERIESSVGEFLSTIGR 110 Query: 240 LQAEQQRQIQDFQVDVL 290 LQAEQQ+Q+++FQ DVL Sbjct: 111 LQAEQQKQVEEFQEDVL 127 >ref|XP_006364265.1| PREDICTED: uncharacterized protein LOC102593653 [Solanum tuberosum] Length = 180 Score = 97.8 bits (242), Expect = 1e-18 Identities = 54/92 (58%), Positives = 63/92 (68%) Frame = +3 Query: 78 TVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQ 257 T CN ++ P G++E+KNILDAFFLGKALAEAV ERIESTVGEFLS +GRLQAEQQ Sbjct: 47 TFTCNFSSNPPPP-PGENESKNILDAFFLGKALAEAVTERIESTVGEFLSTVGRLQAEQQ 105 Query: 258 
RQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 353 +Q+QDFQ +VL QGLI Sbjct: 106 KQVQDFQEEVLERAKQAKEKAARETMETQGLI 137 >ref|XP_004247093.1| PREDICTED: uncharacterized protein LOC101259284 [Solanum lycopersicum] Length = 223 Score = 97.1 bits (240), Expect = 2e-18 Identities = 53/101 (52%), Positives = 69/101 (68%), Gaps = 3/101 (2%) Frame = +3 Query: 87 CNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQRQI 266 CN ++ P G++E+KN+LDAFFLGKALAEAV ERIESTVGEFLS +GRLQ+EQQ+Q+ Sbjct: 94 CNFSSN---PPPGENESKNVLDAFFLGKALAEAVTERIESTVGEFLSTVGRLQSEQQKQV 150 Query: 267 QDFQVDVLXXXXXXXXXXXXXXXXXQGLIP---KVETASAT 380 QDFQ ++L QGLI + +T++AT Sbjct: 151 QDFQEEILERAKQAKEKAARETMETQGLISNSYEADTSTAT 191 >ref|XP_004290221.1| PREDICTED: uncharacterized protein LOC101298383 [Fragaria vesca subsp. vesca] Length = 217 Score = 96.3 bits (238), Expect = 4e-18 Identities = 52/100 (52%), Positives = 65/100 (65%) Frame = +3 Query: 81 VRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGEFLSLIGRLQAEQQR 260 V CN+ G G+ ++K++LDAFFLGKALAEA+NERIES+VGEFLS IGRLQAEQQ+ Sbjct: 68 VHCNT-------GPGESDSKSVLDAFFLGKALAEALNERIESSVGEFLSTIGRLQAEQQK 120 Query: 261 QIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVETASAT 380 Q+ +FQ DVL QG++ K T S T Sbjct: 121 QVVEFQADVLERAKKAKEKAAREAAEAQGIVSKPTTESIT 160 >gb|ESQ56406.1| hypothetical protein EUTSA_v10026263mg [Eutrema salsugineum] Length = 205 Score = 95.9 bits (237), Expect = 5e-18 Identities = 55/113 (48%), Positives = 71/113 (62%), Gaps = 2/113 (1%) Frame = +3 Query: 36 GLRFDTPAKDSHVRTVRCNSAT-GRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTV 212 GLR A+ + ++RCN + G G SG++EN+++LDAFFLGKALAE +NERIESTV Sbjct: 39 GLRLSGEAQRT---SLRCNCCSKGNRGTSSGENENRSVLDAFFLGKALAEVINERIESTV 95 Query: 213 GEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI-PKVET 368 GE L IGR QAEQQ+Q+Q+ Q +V QGL+ PK ET Sbjct: 96 GEVLGTIGRFQAEQQKQVQEIQEEVFERAKKAKERAARETMEEQGLVAPKPET 148 >ref|XP_002863161.1| hypothetical protein ARALYDRAFT_497076 [Arabidopsis lyrata subsp. 
lyrata] gi|297308995|gb|EFH39420.1| hypothetical protein ARALYDRAFT_497076 [Arabidopsis lyrata subsp. lyrata] Length = 181 Score = 92.8 bits (229), Expect = 4e-17 Identities = 50/121 (41%), Positives = 68/121 (56%) Frame = +3 Query: 9 SAVKVKKQSGLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAV 188 S + +S F A ++ R+ + G G SG++EN+++LDAFFLGKALAE + Sbjct: 25 SFIPRNSRSNFEFRRLAVEARRRSTSLRCSNGTHGSDSGENENRSVLDAFFLGKALAEVI 84 Query: 189 NERIESTVGEFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLIPKVET 368 NERIESTVGE L IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 85 NERIESTVGEVLGTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETKEEQGLVASKSA 144 Query: 369 A 371 A Sbjct: 145 A 145 >ref|NP_193056.1| uncharacterized protein [Arabidopsis thaliana] gi|147742899|sp|Q8LDV3.2|Y4320_ARATH RecName: Full=Uncharacterized protein At4g13200, chloroplastic; Flags: Precursor gi|4753654|emb|CAB41930.1| putative protein [Arabidopsis thaliana] gi|7268022|emb|CAB78362.1| putative protein [Arabidopsis thaliana] gi|17380654|gb|AAL36157.1| unknown protein [Arabidopsis thaliana] gi|21436277|gb|AAM51277.1| unknown protein [Arabidopsis thaliana] gi|332657844|gb|AEE83244.1| uncharacterized protein AT4G13200 [Arabidopsis thaliana] Length = 185 Score = 91.7 bits (226), Expect = 1e-16 Identities = 51/105 (48%), Positives = 64/105 (60%) Frame = +3 Query: 39 LRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGE 218 LR D +S R+ S G SG++ENK++LDAFFLGKALAE +NERIESTVGE Sbjct: 40 LRLDV---ESRRRSTSLRSNCSTKGTDSGENENKSVLDAFFLGKALAEVINERIESTVGE 96 Query: 219 FLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 353 LS IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 97 VLSTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETMEEQGLV 141 >gb|AAM62995.1| unknown [Arabidopsis thaliana] Length = 185 Score = 91.3 bits (225), Expect = 1e-16 Identities = 51/105 (48%), Positives = 64/105 (60%) Frame = +3 Query: 39 LRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVGE 218 LR D +S R+ S G SG++ENK++LDAFFLGKALAE +NERIESTVGE Sbjct: 40 
LRLDV---ESRRRSTYLRSNCSTKGTDSGENENKSVLDAFFLGKALAEVINERIESTVGE 96 Query: 219 FLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI 353 LS IG+ QAEQQ+Q+Q+ Q +VL QGL+ Sbjct: 97 VLSTIGKFQAEQQKQVQEIQEEVLERAKKAKERAARETMEEQGLV 141 >gb|ESQ56405.1| hypothetical protein EUTSA_v10026263mg [Eutrema salsugineum] Length = 201 Score = 88.6 bits (218), Expect = 8e-16 Identities = 51/112 (45%), Positives = 66/112 (58%), Gaps = 1/112 (0%) Frame = +3 Query: 36 GLRFDTPAKDSHVRTVRCNSATGRGGPGSGDDENKNILDAFFLGKALAEAVNERIESTVG 215 GLR A+ + +R C+ G++EN+++LDAFFLGKALAE +NERIESTVG Sbjct: 39 GLRLSGEAQRTSLRCNCCSKGN------RGENENRSVLDAFFLGKALAEVINERIESTVG 92 Query: 216 EFLSLIGRLQAEQQRQIQDFQVDVLXXXXXXXXXXXXXXXXXQGLI-PKVET 368 E L IGR QAEQQ+Q+Q+ Q +V QGL+ PK ET Sbjct: 93 EVLGTIGRFQAEQQKQVQEIQEEVFERAKKAKERAARETMEEQGLVAPKPET 144
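The per-hit summary lines in the report above ("Score = ... bits (...), Expect = ... Identities = a/b (p%)") can be pulled out of the flattened text with a regular expression. The following is a minimal sketch, not part of the BLAST distribution: the names `SUMMARY` and `parse_summaries` are hypothetical, and the pattern assumes the summary fields appear in the order seen in this report.

```python
import re

# Hypothetical helper: matches one alignment summary as it appears in this
# report, e.g. "Score = 140 bits (354), Expect = 1e-31 Identities = 75/131 (57%)".
SUMMARY = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<alen>\d+)\s*\((?P<pct>\d+)%\)"
)


def parse_summaries(text):
    """Return one dict per 'Score = ...' summary found in text."""
    hits = []
    for m in SUMMARY.finditer(text):
        evalue = m["evalue"]
        # Some legacy BLAST reports print values such as "e-100" with no
        # leading digit; treat that as 1e-100 (assumption, not seen here).
        if evalue.lower().startswith("e"):
            evalue = "1" + evalue
        ident, alen = int(m["ident"]), int(m["alen"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "align_len": alen,
            # Recomputed from the counts; the report prints a whole percent.
            "pct_identity": 100.0 * ident / alen,
        })
    return hits


# Summary of the first hit in this report (XP_002521111.1, Ricinus communis).
sample = ("Score = 140 bits (354), Expect = 1e-31 "
          "Identities = 75/131 (57%), Positives = 88/131 (67%), "
          "Gaps = 3/131 (2%) Frame = +3")
hits = parse_summaries(sample)
```

Recomputing the identity percentage from the raw counts gives a value that can be checked against the whole-number percentage printed in the report.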