BLASTX nr result
ID: Perilla23_contig00018653
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00018653
         (400 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_012831128.1| PREDICTED: uncharacterized protein LOC105952...   101   2e-19
ref|XP_011071777.1| PREDICTED: uncharacterized protein LOC105157...    94   4e-17
ref|XP_013589999.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMP...    94   5e-17
ref|XP_002300893.2| hypothetical protein POPTR_0002s06280g [Popu...    92   1e-16
ref|XP_013748933.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMP...    91   4e-16
ref|XP_009144649.1| PREDICTED: uncharacterized protein LOC103868...    90   6e-16
ref|XP_013676590.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMP...    89   1e-15
emb|CDP04629.1| unnamed protein product [Coffea canephora]             89   2e-15
ref|XP_011026313.1| PREDICTED: uncharacterized protein LOC105126...    88   3e-15
ref|XP_011026312.1| PREDICTED: uncharacterized protein LOC105126...    88   3e-15
gb|KJB39230.1| hypothetical protein B456_007G003100, partial [Go...    87   4e-15
ref|XP_012488393.1| PREDICTED: uncharacterized protein LOC105801...    87   4e-15
ref|XP_010450438.1| PREDICTED: uncharacterized protein LOC104732...    86   8e-15
ref|XP_006395918.1| hypothetical protein EUTSA_v10005006mg [Eutr...    86   8e-15
gb|KHG12254.1| putative ycf19 [Gossypium arboreum]                     86   1e-14
dbj|BAA96886.1| unnamed protein product [Arabidopsis thaliana]         86   1e-14
ref|XP_012081559.1| PREDICTED: uncharacterized protein LOC105641...    86   1e-14
ref|XP_006282723.1| hypothetical protein CARUB_v10005757mg, part...    86   1e-14
ref|NP_198461.2| cofactor assembly, complex C (B6F) [Arabidopsis...    86   1e-14
ref|XP_011044826.1| PREDICTED: uncharacterized protein LOC105139...    85   2e-14

>ref|XP_012831128.1| PREDICTED: uncharacterized protein LOC105952158 [Erythranthe guttatus]
 gi|604343661|gb|EYU42515.1| hypothetical protein MIMGU_mgv1a014382mg [Erythranthe guttata]
          Length = 191

 Score = 101 bits (251), Expect = 2e-19
 Identities = 69/136 (50%), Positives = 84/136 (61%), Gaps = 3/136 (2%)
 Frame = -2

Query: 399 AISW-PSNTIPPQNHNLISTKSC-HLSSHKSILKSPVRRKKPTPVCRAHCSSI-ISDSLT 229
           +ISW PS P+N S HLS+HKS R + +CRAH SS I L
Sbjct: 12  SISWLPSK---PRNQTRDHQNSSPHLSTHKS------RNPRNNHICRAHYSSSEIPIPL- 61

Query: 228 NHYFSSSSTAISGDPFSGTSKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFI 49
           SS+++I+ + T+ I S E+LVL DLDPATAK+AI LLGPFLS FGFLFI
Sbjct: 62  -----SSASSINAPLDAITATIPQSKLLENLVLADLDPATAKVAIGLLGPFLSGFGFLFI 116

Query: 48  LRIVMSWYPKLPVEKF 1
           RIVMSWYPKLP+E+F
Sbjct: 117 ARIVMSWYPKLPLEEF 132

>ref|XP_011071777.1| PREDICTED: uncharacterized protein LOC105157163 [Sesamum indicum]
          Length = 217

 Score = 94.0 bits (232), Expect = 4e-17
 Identities = 62/136 (45%), Positives = 72/136 (52%), Gaps = 14/136 (10%)
 Frame = -2

Query: 366 QNHNLISTKSCHLSSHKSI------LKSPVRRK--------KPTPVCRAHCSSIISDSLT 229
           +N S+ S HLS K LKSP R K + +CRAHCSS + S+
Sbjct: 44  KNTTFTSSISWHLSIKKCAAERGLNLKSPGRGKLFPGCNNTRRQHMCRAHCSSQVPTSI- 102

Query: 228 NHYFSSSSTAISGDPFSGTSKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFI 49
           SKI S LVL+DL+P +AKLAI +GP LS FGFLFI
Sbjct: 103 -------------------SKISDSGCVGRLVLVDLEPGSAKLAIGFVGPLLSGFGFLFI 143

Query: 48  LRIVMSWYPKLPVEKF 1
           LRIVMSWYPKLPVEKF
Sbjct: 144 LRIVMSWYPKLPVEKF 159

>ref|XP_013589999.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic [Brassica oleracea var. oleracea]
          Length = 177

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 53/98 (54%), Positives = 65/98 (66%)
 Frame = -2

Query: 294 RRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFSGTSKIMHSVTAESLVLIDLDP 115
           RR P R+ C ++S SLT +++T S P +SK + S ++ L DLDP
Sbjct: 21  RRSCPNIRTRSACLPVVSASLTQIEVDTTTTTTSLYPSIVSSKPI-SEALHNISLADLDP 79

Query: 114 ATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 80  GTAKLAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 117

>ref|XP_002300893.2| hypothetical protein POPTR_0002s06280g [Populus trichocarpa]
 gi|550344395|gb|EEE80166.2| hypothetical protein POPTR_0002s06280g [Populus trichocarpa]
          Length = 284

 Score = 92.4 bits (228), Expect = 1e-16
 Identities = 54/101 (53%), Positives = 64/101 (63%), Gaps = 12/101 (11%)
 Frame = -2

Query: 267 RAHCSSIISDSLTNHYFSSSSTAISGDPFSGTSKIMH------------SVTAESLVLID 124
           R CS+ ++ SL FS S ISG+PFS + + S + L+L D
Sbjct: 126 RTCCSAGVTASLDVD-FSPSHATISGEPFSVLEAVRNIKVDIPTTSEATSNLIQRLMLAD 184

Query: 123 LDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           LDPATAKLAI +LGPFLSAF FLF+LRIVMSWYPKLPV KF
Sbjct: 185 LDPATAKLAIGILGPFLSAFSFLFVLRIVMSWYPKLPVGKF 225

>ref|XP_013748933.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic-like [Brassica napus]
 gi|923773352|ref|XP_013680242.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic-like [Brassica napus]
          Length = 176

 Score = 90.5 bits (223), Expect = 4e-16
 Identities = 52/104 (50%), Positives = 67/104 (64%), Gaps = 6/104 (5%)
 Frame = -2

Query: 294 RRKKPTPVCRAHCSSIISDSLTNHYFSSSSTA------ISGDPFSGTSKIMHSVTAESLV 133
           RR P+ R+ C ++S SLT +++T +S P S + +H+++
Sbjct: 21  RRSCPSIRTRSACLPVVSASLTQIEVDTTTTTSLCSSIVSSKPIS---EALHNIS----- 72

Query: 132 LIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           L DLDP TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 73  LADLDPGTAKLAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 116

>ref|XP_009144649.1| PREDICTED: uncharacterized protein LOC103868299 [Brassica rapa]
          Length = 180

 Score = 90.1 bits (222), Expect = 6e-16
 Identities = 51/99 (51%), Positives = 62/99 (62%), Gaps = 1/99 (1%)
 Frame = -2

Query: 294 RRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFSGT-SKIMHSVTAESLVLIDLD 118
           RR P R+ C ++S SLT +++T + S S S ++ L DLD
Sbjct: 22  RRSCPNIRTRSACLPVVSASLTQIEVDTTTTTTTTSLCSSVVSSKQISEALHNISLADLD 81

Query: 117 PATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           P TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 82  PGTAKLAIGILGPTLSAFGFLFILRIVMSWYPKLPVDKF 120

>ref|XP_013676590.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic-like [Brassica napus]
 gi|923876715|ref|XP_013711339.1| PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic-like [Brassica napus]
          Length = 180

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 53/101 (52%), Positives = 66/101 (65%), Gaps = 3/101 (2%)
 Frame = -2

Query: 294 RRKKPTPVCRAHCSSIISDSLTN---HYFSSSSTAISGDPFSGTSKIMHSVTAESLVLID 124
           RR P R+ C ++S SLT ++++T S P +SK + S ++ L D
Sbjct: 21  RRSCPNIRTRSACLPVVSASLTQIEVDTTTTTTTTTSLYPSIVSSKPI-SEALHNISLAD 79

Query: 123 LDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           LDP TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 80  LDPGTAKLAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 120

>emb|CDP04629.1| unnamed protein product [Coffea canephora]
          Length = 247

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 47/78 (60%), Positives = 58/78 (74%), Gaps = 13/78 (16%)
 Frame = -2

Query: 195 SGDPFSGTSKIMHSVTAES-------------LVLIDLDPATAKLAIALLGPFLSAFGFL 55
           S +PF G K++ +V++++ LV+IDLDPATAK AIA+LGPFLSAF FL
Sbjct: 112 SDNPFCG-QKVVDTVSSKNAEMQNATEALTKRLVIIDLDPATAKAAIAILGPFLSAFSFL 170

Query: 54  FILRIVMSWYPKLPVEKF 1
           FILRI+MSWYPKLPVEKF
Sbjct: 171 FILRIIMSWYPKLPVEKF 188

>ref|XP_011026313.1| PREDICTED: uncharacterized protein LOC105126945 isoform X2 [Populus euphratica]
          Length = 181

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 55/102 (53%), Positives = 65/102 (63%), Gaps = 13/102 (12%)
 Frame = -2

Query: 267 RAHCSSIISDSLTNHYFSSSSTAISGDPFSGTS-------KIMHSVTAES------LVLI 127
           R CS+ ++ SL FS S IS +PFS K+ T+E+ L+L
Sbjct: 22  RTCCSAGVTASLDVD-FSPSHATISVEPFSVLEAVSVRNIKVDIPKTSETSNLIQRLMLA 80

Query: 126 DLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           DLDPATAKLAI +LGPFLSAF FLF+LRIVMSWYPKLPV KF
Sbjct: 81  DLDPATAKLAIGILGPFLSAFSFLFVLRIVMSWYPKLPVGKF 122

>ref|XP_011026312.1| PREDICTED: uncharacterized protein LOC105126945 isoform X1 [Populus euphratica]
          Length = 216

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 55/102 (53%), Positives = 65/102 (63%), Gaps = 13/102 (12%)
 Frame = -2

Query: 267 RAHCSSIISDSLTNHYFSSSSTAISGDPFSGTS-------KIMHSVTAES------LVLI 127
           R CS+ ++ SL FS S IS +PFS K+ T+E+ L+L
Sbjct: 57  RTCCSAGVTASLDVD-FSPSHATISVEPFSVLEAVSVRNIKVDIPKTSETSNLIQRLMLA 115

Query: 126 DLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           DLDPATAKLAI +LGPFLSAF FLF+LRIVMSWYPKLPV KF
Sbjct: 116 DLDPATAKLAIGILGPFLSAFSFLFVLRIVMSWYPKLPVGKF 157

>gb|KJB39230.1| hypothetical protein B456_007G003100, partial [Gossypium raimondii]
          Length = 224

 Score = 87.4 bits (215), Expect = 4e-15
 Identities = 42/55 (76%), Positives = 48/55 (87%)
 Frame = -2

Query: 165 IMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           ++ S + +L+L DLDPATAKLAI +LGPFLSAF FLFILRIVMSWYPKLPVEKF
Sbjct: 111 VVTSDLSRTLLLADLDPATAKLAIGILGPFLSAFAFLFILRIVMSWYPKLPVEKF 165

>ref|XP_012488393.1| PREDICTED: uncharacterized protein LOC105801704 [Gossypium raimondii]
 gi|763772106|gb|KJB39229.1| hypothetical protein B456_007G003100 [Gossypium raimondii]
          Length = 202

 Score = 87.4 bits (215), Expect = 4e-15
 Identities = 42/55 (76%), Positives = 48/55 (87%)
 Frame = -2

Query: 165 IMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           ++ S + +L+L DLDPATAKLAI +LGPFLSAF FLFILRIVMSWYPKLPVEKF
Sbjct: 89  VVTSDLSRTLLLADLDPATAKLAIGILGPFLSAFAFLFILRIVMSWYPKLPVEKF 143

>ref|XP_010450438.1| PREDICTED: uncharacterized protein LOC104732577 isoform X1 [Camelina sativa]
          Length = 175

 Score = 86.3 bits (212), Expect = 8e-15
 Identities = 49/112 (43%), Positives = 72/112 (64%), Gaps = 2/112 (1%)
 Frame = -2

Query: 330 LSSHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGD--PFSGTSKIMH 157
           +S ++L + RR P R+ I+S SL ++ + ++T + + + S ++H
Sbjct: 9   VSFSPALLHAKSRRSVPNFRNRSPSLPIVSASLRSNVEAETTTNLYPNIRETNSVSDLLH 68

Query: 156 SVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           ++T L DLDP TAK+A+ +LGP LSAFGFLFI+RIVMSWYPKLPV+KF
Sbjct: 69  NIT-----LADLDPGTAKVAVGILGPALSAFGFLFIVRIVMSWYPKLPVDKF 115

>ref|XP_006395918.1| hypothetical protein EUTSA_v10005006mg [Eutrema salsugineum]
 gi|557092557|gb|ESQ33204.1| hypothetical protein EUTSA_v10005006mg [Eutrema salsugineum]
          Length = 176

 Score = 86.3 bits (212), Expect = 8e-15
 Identities = 52/115 (45%), Positives = 70/115 (60%), Gaps = 7/115 (6%)
 Frame = -2

Query: 324 SHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTA-------ISGDPFSGTSK 166
           S ++L + RR P R+ ++S S+T +++T +S P S+
Sbjct: 11  SSPALLPAKPRRSYPNNRNRSASLPMVSASVTQIEPDTTTTTTRIYSSIVSSKP---VSE 67

Query: 165 IMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           ++H+++ L DLDP TAKLAI +LGP SAFGFLFILRIVMSWYPKLPVEKF
Sbjct: 68  VLHNIS-----LADLDPGTAKLAIGILGPAFSAFGFLFILRIVMSWYPKLPVEKF 117

>gb|KHG12254.1| putative ycf19 [Gossypium arboreum]
          Length = 202

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 41/55 (74%), Positives = 47/55 (85%)
 Frame = -2

Query: 165 IMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           ++ S + + +L DLDPATAKLAI +LGPFLSAF FLFILRIVMSWYPKLPVEKF
Sbjct: 89  VVTSDLSRTFLLADLDPATAKLAIGILGPFLSAFAFLFILRIVMSWYPKLPVEKF 143

>dbj|BAA96886.1| unnamed protein product [Arabidopsis thaliana]
          Length = 213

 Score = 85.5 bits (210), Expect = 1e-14
 Identities = 52/117 (44%), Positives = 71/117 (60%)
 Frame = -2

Query: 351 ISTKSCHLSSHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFSGT 172
           ++T S I + RR P R+ I+S +L++ ++++T + T
Sbjct: 4   VTTSFVSFSPALMIFQKKSRRSSPNFRNRSTSLPIVSATLSHIEEAATTTNL----IRQT 59

Query: 171 SKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           + I S+ ++ L DLDP TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 60  NSISESL--RNISLADLDPGTAKLAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 114

>ref|XP_012081559.1| PREDICTED: uncharacterized protein LOC105641583 [Jatropha curcas]
 gi|643718744|gb|KDP29870.1| hypothetical protein JCGZ_18445 [Jatropha curcas]
          Length = 203

 Score = 85.5 bits (210), Expect = 1e-14
 Identities = 47/79 (59%), Positives = 55/79 (69%)
 Frame = -2

Query: 237 SLTNHYFSSSSTAISGDPFSGTSKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGF 58
           +L FSS ++ D + + S +SL L DLDPATAKLAI++LGPFLSAF F
Sbjct: 66  TLLTSRFSSLEDVMNIDHNTLEASEATSTFIQSLTLGDLDPATAKLAISILGPFLSAFAF 125

Query: 57  LFILRIVMSWYPKLPVEKF 1
           LFILRIVMSWYPKLPV KF
Sbjct: 126 LFILRIVMSWYPKLPVGKF 144

>ref|XP_006282723.1| hypothetical protein CARUB_v10005757mg, partial [Capsella rubella]
 gi|482551428|gb|EOA15621.1| hypothetical protein CARUB_v10005757mg, partial [Capsella rubella]
          Length = 203

 Score = 85.5 bits (210), Expect = 1e-14
 Identities = 53/118 (44%), Positives = 73/118 (61%)
 Frame = -2

Query: 354 LISTKSCHLSSHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFSG 175
           + + S LS ++L + RR R+ ++S SLTN+ + ++T + S
Sbjct: 33  MTTVPSGFLSFSPALLHAKPRRPFLNFRNRSKSLLLVSASLTNNIEADTTTVQETNSISD 92

Query: 174 TSKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           + + +VT +L DLDP TAK+AI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 93  S---LRNVT----LLADLDPGTAKVAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 143

>ref|NP_198461.2| cofactor assembly, complex C (B6F) [Arabidopsis thaliana]
 gi|75158720|sp|Q8RWM7.1|CCB3_ARATH RecName: Full=Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic; AltName: Full=YGGT family protein YLMG3; AltName: Full=YlmG homolog protein 3; Short=AtYLMG3; Flags: Precursor
 gi|20260188|gb|AAM12992.1| putative protein [Arabidopsis thaliana]
 gi|21387037|gb|AAM47922.1| putative protein [Arabidopsis thaliana]
 gi|62320216|dbj|BAD94460.1| hypothetical protein [Arabidopsis thaliana]
 gi|332006663|gb|AED94046.1| cofactor assembly, complex C (B6F) [Arabidopsis thaliana]
          Length = 174

 Score = 85.5 bits (210), Expect = 1e-14
 Identities = 52/117 (44%), Positives = 71/117 (60%)
 Frame = -2

Query: 351 ISTKSCHLSSHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFSGT 172
           ++T S I + RR P R+ I+S +L++ ++++T + T
Sbjct: 4   VTTSFVSFSPALMIFQKKSRRSSPNFRNRSTSLPIVSATLSHIEEAATTTNL----IRQT 59

Query: 171 SKIMHSVTAESLVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWYPKLPVEKF 1
           + I S+ ++ L DLDP TAKLAI +LGP LSAFGFLFILRIVMSWYPKLPV+KF
Sbjct: 60  NSISESL--RNISLADLDPGTAKLAIGILGPALSAFGFLFILRIVMSWYPKLPVDKF 114

>ref|XP_011044826.1| PREDICTED: uncharacterized protein LOC105139890 [Populus euphratica]
          Length = 209

 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 57/128 (44%), Positives = 72/128 (56%), Gaps = 12/128 (9%)
 Frame = -2

Query: 348 STKSCHLSSHKSILKSPVRRKKPTPVCRAHCSSIISDSLTNHYFSSSSTAISGDPFS--- 178
           +T+ H S + +L S R C + ++ SL N + S ISG+PFS
Sbjct: 35  ATRGGHSSQGRQVLSS-----------RTCCFTRVAASL-NVDLAMSHGDISGEPFSIPE 82

Query: 177 --GTSKIMHSVTAES-------LVLIDLDPATAKLAIALLGPFLSAFGFLFILRIVMSWY 25
           G + T+E+ L+L DLDPA AK A+ +LGPFLSAF FLFILRIVMSWY
Sbjct: 83  AVGNINVGIPKTSEATSNLIQRLMLADLDPAAAKSAVGILGPFLSAFSFLFILRIVMSWY 142

Query: 24  PKLPVEKF 1
           PKLPV KF
Sbjct: 143 PKLPVGKF 150