BLASTX nr result
ID: Atropa21_contig00014166
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00014166 (1146 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum]      371   e-100
ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252...   358   3e-96
ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266...   196   1e-47
emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera]   196   1e-47
gb|EOY24314.1| G2484-1 protein, putative isoform 6 [Theobroma ca...   189   2e-45
gb|EOY24313.1| G2484-1 protein, putative isoform 5 [Theobroma ca...   189   2e-45
gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma ca...   189   2e-45
gb|EOY24309.1| G2484-1 protein, putative isoform 1 [Theobroma ca...   189   2e-45
ref|XP_002530649.1| conserved hypothetical protein [Ricinus comm...   183   1e-43
ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627...   182   2e-43
ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citr...   182   2e-43
emb|CBI37358.3| unnamed protein product [Vitis vinifera]              177   7e-42
ref|XP_006385540.1| agenet domain-containing family protein [Pop...   174   8e-41
ref|XP_006385539.1| hypothetical protein POPTR_0003s07530g [Popu...   174   8e-41
ref|XP_006385538.1| hypothetical protein POPTR_0003s07530g [Popu...   174   8e-41
ref|XP_006385537.1| hypothetical protein POPTR_0003s07530g [Popu...   174   8e-41
gb|EMJ10269.1| hypothetical protein PRUPE_ppa000035mg [Prunus pe...   172   2e-40
ref|XP_003525570.1| PREDICTED: uncharacterized threonine-rich GP...
164 8e-38 ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max] 163 1e-37 ref|XP_006369017.1| hypothetical protein POPTR_0001s15740g [Popu... 162 3e-37 >ref|XP_006355512.1| PREDICTED: mucin-19-like [Solanum tuberosum] Length = 2181 Score = 371 bits (952), Expect = e-100 Identities = 195/246 (79%), Positives = 202/246 (82%), Gaps = 3/246 (1%) Frame = +1 Query: 1 GSNKNDNKPNTLRTMRSGLQKEGSKVFGVPKPGKKRKFMEVSKHYVSDRATKSNAEP--G 174 GSNK+D+KPNTLRTMRSGL KEGSKVFGVPKPGKKRKFMEVSKHYVSDRATKSNA P G Sbjct: 1925 GSNKDDSKPNTLRTMRSGLHKEGSKVFGVPKPGKKRKFMEVSKHYVSDRATKSNAAPAHG 1984 Query: 175 SAKFTKYLMPQTTGVGGWKNNPRTDLKEKQ-AIEARRXXXXXXXXXXXARNLKNNSITSK 351 SAKFTKYLMPQ TG GGWK N RTDLKEKQ IEARR AR LK+NSITS Sbjct: 1985 SAKFTKYLMPQATGTGGWKTNSRTDLKEKQQTIEARRKLPKPSKPPSSARTLKDNSITST 2044 Query: 352 GDARGADHMVGDAIEYDKHEAQQPNVVNFVSNAEEGAEGPVKFRSEALPTNIPKKASTSS 531 GDA GADH VGDAIE KHEAQQPNV NFVSNAEEGAEGP+KFRSEALPTNIPKKASTSS Sbjct: 2045 GDASGADHTVGDAIEDAKHEAQQPNVGNFVSNAEEGAEGPLKFRSEALPTNIPKKASTSS 2104 Query: 532 NRGEGMRXXXXXXXXXXXXVEEKDNMTPEVNEPRRSNRRIQPTSRLLEGLQSSLIISKFP 711 NRGEGM+ +E KD M PEVNEPRRSNR+IQPTSRLLEGLQSSLIISK P Sbjct: 2105 NRGEGMKKRIPISNLKSSKIEVKDKMMPEVNEPRRSNRKIQPTSRLLEGLQSSLIISKLP 2164 Query: 712 SVSHDK 729 SVSHDK Sbjct: 2165 SVSHDK 2170 >ref|XP_004246157.1| PREDICTED: uncharacterized protein LOC101252108 [Solanum lycopersicum] Length = 2155 Score = 358 bits (918), Expect = 3e-96 Identities = 189/244 (77%), Positives = 198/244 (81%), Gaps = 1/244 (0%) Frame = +1 Query: 1 GSNKNDNKPNTLRTMRSGLQKEGSKVFGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 GSNK+D+KPNTLRTMRSGL KEGSKVFGVPKPGKKRKFMEVSKHYVSDR KSNA GSA Sbjct: 1902 GSNKDDSKPNTLRTMRSGLHKEGSKVFGVPKPGKKRKFMEVSKHYVSDRTAKSNAAHGSA 1961 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-QAIEARRXXXXXXXXXXXARNLKNNSITSKGD 357 KFTK+LMPQ TG GGWK N RTDLKEK Q IE RR AR LK+NSITS D Sbjct: 1962 KFTKFLMPQATGTGGWKTNSRTDLKEKQQTIETRRKLPKSSKPSSSARTLKDNSITSTRD 2021 Query: 358 ARGADHMVGDAIEYDKHEAQQPNVVNFVSNAEEGAEGPVKFRSEALPTNIPKKASTSSNR 537 A GA+HMVGDAIEYDK+EAQQPNV 
NFVSNAEEG E VKFRSEALPTNIPKKASTSSNR Sbjct: 2022 ASGAEHMVGDAIEYDKNEAQQPNVGNFVSNAEEGVE-VVKFRSEALPTNIPKKASTSSNR 2080 Query: 538 GEGMRXXXXXXXXXXXXVEEKDNMTPEVNEPRRSNRRIQPTSRLLEGLQSSLIISKFPSV 717 GEGM+ VE KD M PEV+EPRRSNR+IQPTSRLLEGLQSSLIISKFPSV Sbjct: 2081 GEGMKKRIPISNLKSSKVEVKDKMIPEVSEPRRSNRKIQPTSRLLEGLQSSLIISKFPSV 2140 Query: 718 SHDK 729 SHDK Sbjct: 2141 SHDK 2144 >ref|XP_002267137.2| PREDICTED: uncharacterized protein LOC100266068 [Vitis vinifera] Length = 2292 Score = 196 bits (498), Expect = 1e-47 Identities = 117/264 (44%), Positives = 160/264 (60%), Gaps = 22/264 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 + +++NKP+ R +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR+ K + S Sbjct: 2013 NTRDENKPDAPRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSNKISEANDSV 2072 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-------RXXXXXXXXXXXARNLKNNS 339 KF KYL+PQ +G GWKN + D KEK+A+E++ + NL + Sbjct: 2073 KFAKYLIPQGSGPRGWKNTSKIDSKEKRAVESKPKVIRSGKPQNVSSRTVPRKDNLLASG 2132 Query: 340 ITSKGDARGADHM--VGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI 507 ++ D D++ + D++ +D++ + + NV+ F SN E AEGP+ F S LP++ Sbjct: 2133 TSASNDTNVTDNLPNIKDSVSHDENASGKQNVIEFESFSNTEGQAEGPILFSSLPLPSDA 2192 Query: 508 P--KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK-------DNMTPEVNEPRRSNRRIQP 657 P KK S+ + + + + +EE+ PE EPRRSNRRIQP Sbjct: 2193 PSSKKMPVSNVKSQRVSKGKLAPSGGKLAKIEEEKVYNGNPGKSVPEAVEPRRSNRRIQP 2252 Query: 658 TSRLLEGLQSSLIISKFPSVSHDK 729 TSRLLEGLQSSLIISK PSVSHDK Sbjct: 2253 TSRLLEGLQSSLIISKIPSVSHDK 2276 >emb|CAN66568.1| hypothetical protein VITISV_039539 [Vitis vinifera] Length = 2321 Score = 196 bits (498), Expect = 1e-47 Identities = 117/264 (44%), Positives = 160/264 (60%), Gaps = 22/264 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 + +++NKP+ R +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR+ K + S Sbjct: 1999 NTRDENKPDAPRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSNKISEANDSV 2058 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-------RXXXXXXXXXXXARNLKNNS 339 KF KYL+PQ +G GWKN 
+ D KEK+A+E++ + NL + Sbjct: 2059 KFAKYLIPQGSGPRGWKNTSKIDSKEKRAVESKPKVIRSGKPQNVSSRTVPRKDNLLASG 2118 Query: 340 ITSKGDARGADHM--VGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI 507 ++ D D++ + D++ +D++ + + NV+ F SN E AEGP+ F S LP++ Sbjct: 2119 TSASNDTNVTDNLPNIKDSVSHDENASGKQNVIEFESFSNTEGQAEGPILFSSLPLPSDA 2178 Query: 508 P--KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK-------DNMTPEVNEPRRSNRRIQP 657 P KK S+ + + + + +EE+ PE EPRRSNRRIQP Sbjct: 2179 PSSKKMPVSNVKSQRVSKGKLAPSGGKLAKIEEEKVYNGNPGKSVPEAVEPRRSNRRIQP 2238 Query: 658 TSRLLEGLQSSLIISKFPSVSHDK 729 TSRLLEGLQSSLIISK PSVSHDK Sbjct: 2239 TSRLLEGLQSSLIISKIPSVSHDK 2262 >gb|EOY24314.1| G2484-1 protein, putative isoform 6 [Theobroma cacao] Length = 2138 Score = 189 bits (480), Expect = 2e-45 Identities = 122/262 (46%), Positives = 167/262 (63%), Gaps = 20/262 (7%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S ++++KP++LR +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+D+++K++ SA Sbjct: 1864 STRDESKPDSLRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADQSSKTHETSDSA 1923 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-------QAIEARRXXXXXXXXXXXARNLKNNS 339 K TKYLMPQ +G G KN + +LKEK + +++ + NL N Sbjct: 1924 KITKYLMPQRSGPRGTKN--KIELKEKRMAVSKPKVLKSGKPPSVSSRTIPQKDNLSNTM 1981 Query: 340 ITSKGDARGAD-HMVGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNIP 510 ++ DA +D D++ + ++ + + NV+ F S+++ AEGPV F S AL ++ P Sbjct: 1982 VSEPDDAVASDVSKFKDSVSHAENISGKHNVMEFRSFSSSDGAAEGPVLFSSVALSSDAP 2041 Query: 511 -KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK----DNMT---PEVNEPRRSNRRIQPTS 663 KK STS+ + E + + +EE+ DN T EV EPRRSNRRIQPTS Sbjct: 2042 SKKTSTSNAKFERINKGKLAAAAGKLGKIEEEKVFNDNSTKTISEVVEPRRSNRRIQPTS 2101 Query: 664 RLLEGLQSSLIISKFPSVSHDK 729 RLLEGLQSSLIISK PSVSHDK Sbjct: 2102 RLLEGLQSSLIISKIPSVSHDK 2123 >gb|EOY24313.1| G2484-1 protein, putative isoform 5 [Theobroma cacao] Length = 2151 Score = 189 bits (480), Expect = 2e-45 Identities = 122/262 (46%), Positives = 167/262 (63%), Gaps = 20/262 (7%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S 
++++KP++LR +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+D+++K++ SA Sbjct: 1877 STRDESKPDSLRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADQSSKTHETSDSA 1936 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-------QAIEARRXXXXXXXXXXXARNLKNNS 339 K TKYLMPQ +G G KN + +LKEK + +++ + NL N Sbjct: 1937 KITKYLMPQRSGPRGTKN--KIELKEKRMAVSKPKVLKSGKPPSVSSRTIPQKDNLSNTM 1994 Query: 340 ITSKGDARGAD-HMVGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNIP 510 ++ DA +D D++ + ++ + + NV+ F S+++ AEGPV F S AL ++ P Sbjct: 1995 VSEPDDAVASDVSKFKDSVSHAENISGKHNVMEFRSFSSSDGAAEGPVLFSSVALSSDAP 2054 Query: 511 -KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK----DNMT---PEVNEPRRSNRRIQPTS 663 KK STS+ + E + + +EE+ DN T EV EPRRSNRRIQPTS Sbjct: 2055 SKKTSTSNAKFERINKGKLAAAAGKLGKIEEEKVFNDNSTKTISEVVEPRRSNRRIQPTS 2114 Query: 664 RLLEGLQSSLIISKFPSVSHDK 729 RLLEGLQSSLIISK PSVSHDK Sbjct: 2115 RLLEGLQSSLIISKIPSVSHDK 2136 >gb|EOY24312.1| G2484-1 protein, putative isoform 4 [Theobroma cacao] Length = 2110 Score = 189 bits (480), Expect = 2e-45 Identities = 122/262 (46%), Positives = 167/262 (63%), Gaps = 20/262 (7%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S ++++KP++LR +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+D+++K++ SA Sbjct: 1836 STRDESKPDSLRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADQSSKTHETSDSA 1895 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-------QAIEARRXXXXXXXXXXXARNLKNNS 339 K TKYLMPQ +G G KN + +LKEK + +++ + NL N Sbjct: 1896 KITKYLMPQRSGPRGTKN--KIELKEKRMAVSKPKVLKSGKPPSVSSRTIPQKDNLSNTM 1953 Query: 340 ITSKGDARGAD-HMVGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNIP 510 ++ DA +D D++ + ++ + + NV+ F S+++ AEGPV F S AL ++ P Sbjct: 1954 VSEPDDAVASDVSKFKDSVSHAENISGKHNVMEFRSFSSSDGAAEGPVLFSSVALSSDAP 2013 Query: 511 -KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK----DNMT---PEVNEPRRSNRRIQPTS 663 KK STS+ + E + + +EE+ DN T EV EPRRSNRRIQPTS Sbjct: 2014 SKKTSTSNAKFERINKGKLAAAAGKLGKIEEEKVFNDNSTKTISEVVEPRRSNRRIQPTS 2073 Query: 664 RLLEGLQSSLIISKFPSVSHDK 729 RLLEGLQSSLIISK PSVSHDK Sbjct: 2074 RLLEGLQSSLIISKIPSVSHDK 2095 >gb|EOY24309.1| G2484-1 protein, putative isoform 1 
[Theobroma cacao] gi|508777054|gb|EOY24310.1| G2484-1 protein, putative isoform 1 [Theobroma cacao] gi|508777055|gb|EOY24311.1| G2484-1 protein, putative isoform 1 [Theobroma cacao] Length = 2123 Score = 189 bits (480), Expect = 2e-45 Identities = 122/262 (46%), Positives = 167/262 (63%), Gaps = 20/262 (7%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S ++++KP++LR +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+D+++K++ SA Sbjct: 1849 STRDESKPDSLRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADQSSKTHETSDSA 1908 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-------QAIEARRXXXXXXXXXXXARNLKNNS 339 K TKYLMPQ +G G KN + +LKEK + +++ + NL N Sbjct: 1909 KITKYLMPQRSGPRGTKN--KIELKEKRMAVSKPKVLKSGKPPSVSSRTIPQKDNLSNTM 1966 Query: 340 ITSKGDARGAD-HMVGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNIP 510 ++ DA +D D++ + ++ + + NV+ F S+++ AEGPV F S AL ++ P Sbjct: 1967 VSEPDDAVASDVSKFKDSVSHAENISGKHNVMEFRSFSSSDGAAEGPVLFSSVALSSDAP 2026 Query: 511 -KKASTSSNRGEGM-RXXXXXXXXXXXXVEEK----DNMT---PEVNEPRRSNRRIQPTS 663 KK STS+ + E + + +EE+ DN T EV EPRRSNRRIQPTS Sbjct: 2027 SKKTSTSNAKFERINKGKLAAAAGKLGKIEEEKVFNDNSTKTISEVVEPRRSNRRIQPTS 2086 Query: 664 RLLEGLQSSLIISKFPSVSHDK 729 RLLEGLQSSLIISK PSVSHDK Sbjct: 2087 RLLEGLQSSLIISKIPSVSHDK 2108 >ref|XP_002530649.1| conserved hypothetical protein [Ricinus communis] gi|223529782|gb|EEF31718.1| conserved hypothetical protein [Ricinus communis] Length = 2104 Score = 183 bits (464), Expect = 1e-43 Identities = 111/263 (42%), Positives = 157/263 (59%), Gaps = 21/263 (7%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S+K+ N+ + LR R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR++++N S Sbjct: 1828 SSKDGNRTDALRMTRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSSQNNEANDSV 1887 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEK-------QAIEARRXXXXXXXXXXXARNLKNNS 339 KFTKYLMPQ G GWK+ +T+L EK + +++ + NL + S Sbjct: 1888 KFTKYLMPQGAGSRGWKSTSKTELNEKRPAISKPKVLKSGKPQNISGRTIPQRENLTSTS 1947 Query: 340 ITSKGDARGADHM--VGDAIEYDKHEAQQPNVVNFVSNAEEGA-EGPVKFRSEALPTN-- 504 ++ + DH+ 
D++ + ++ ++ N++ F S + GA EGP+ F + ALP++ Sbjct: 1948 VSITDGSALTDHVAKTKDSVSHSENATEKQNLMGFQSFSTSGATEGPILFSALALPSDNF 2007 Query: 505 IPKKASTSSNRGEGMRXXXXXXXXXXXXVEEKD--------NMTPEVNEPRRSNRRIQPT 660 KK +++ E + E+D T + EPRRSNRRIQPT Sbjct: 2008 SSKKMPLPNSKPERVSKGKLAPAGGKFGKIEEDKALNGNSAKSTFDPVEPRRSNRRIQPT 2067 Query: 661 SRLLEGLQSSLIISKFPSVSHDK 729 SRLLEGLQSSL++SK PSVSHDK Sbjct: 2068 SRLLEGLQSSLMVSKIPSVSHDK 2090 >ref|XP_006477174.1| PREDICTED: uncharacterized protein LOC102627454 isoform X1 [Citrus sinensis] gi|568846679|ref|XP_006477175.1| PREDICTED: uncharacterized protein LOC102627454 isoform X2 [Citrus sinensis] gi|568846681|ref|XP_006477176.1| PREDICTED: uncharacterized protein LOC102627454 isoform X3 [Citrus sinensis] Length = 2155 Score = 182 bits (462), Expect = 2e-43 Identities = 117/264 (44%), Positives = 157/264 (59%), Gaps = 22/264 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S ++DNKP+ LR +R+GLQKEGS+V FGVPKPGKKRKFM+VSKHYV D + K S Sbjct: 1876 SGRDDNKPDALRMIRTGLQKEGSRVVFGVPKPGKKRKFMDVSKHYVVDESNKVTEANDSV 1935 Query: 181 KFTKYLMPQTTGV--GGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNL--KNNS--- 339 KF KYLMPQ+ G GWKN RT+ KEK+ +R R + K+NS Sbjct: 1936 KFAKYLMPQSQGSVSRGWKNALRTEPKEKRPAVSRPKVLKSGKPPLSGRTITQKDNSASS 1995 Query: 340 -ITSKGDARGADHM--VGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTN 504 +++ D DH + D + + ++++ + + + F +S +EE AE P+ F S + Sbjct: 1996 AVSASEDGADIDHTAKIKDFVRHAENKSGKHDSMEFRSLSTSEETAETPIVFSSMPSSSG 2055 Query: 505 IP-KKASTSSNRGEGMRXXXXXXXXXXXXVEEKDNM--------TPEVNEPRRSNRRIQP 657 P K+ S S++R E + E+D + + EV+EPRRSNRRIQP Sbjct: 2056 APSKRGSVSNSRTERVTKGKLAPAGGKLNKIEEDKVFNGNSAKTSSEVSEPRRSNRRIQP 2115 Query: 658 TSRLLEGLQSSLIISKFPSVSHDK 729 TSRLLEGLQSSLIISK PSVSH+K Sbjct: 2116 TSRLLEGLQSSLIISKIPSVSHEK 2139 >ref|XP_006440297.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|567895620|ref|XP_006440298.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|567895622|ref|XP_006440299.1| hypothetical protein 
CICLE_v10018443mg [Citrus clementina] gi|557542559|gb|ESR53537.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|557542560|gb|ESR53538.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] gi|557542561|gb|ESR53539.1| hypothetical protein CICLE_v10018443mg [Citrus clementina] Length = 2155 Score = 182 bits (462), Expect = 2e-43 Identities = 117/264 (44%), Positives = 157/264 (59%), Gaps = 22/264 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S ++DNKP+ LR +R+GLQKEGS+V FGVPKPGKKRKFM+VSKHYV D + K S Sbjct: 1876 SGRDDNKPDALRMIRTGLQKEGSRVVFGVPKPGKKRKFMDVSKHYVVDESNKVTEANDSV 1935 Query: 181 KFTKYLMPQTTGV--GGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNL--KNNS--- 339 KF KYLMPQ+ G GWKN RT+ KEK+ +R R + K+NS Sbjct: 1936 KFAKYLMPQSQGSVSRGWKNALRTEPKEKRPAVSRPKVLKSGKPPLSGRTITQKDNSASS 1995 Query: 340 -ITSKGDARGADHM--VGDAIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTN 504 +++ D DH + D + + ++++ + + + F +S +EE AE P+ F S + Sbjct: 1996 AVSASEDGADIDHTAKIKDFVRHAENKSGKHDSMEFRSLSTSEETAETPIVFSSMPSSSG 2055 Query: 505 IP-KKASTSSNRGEGMRXXXXXXXXXXXXVEEKDNM--------TPEVNEPRRSNRRIQP 657 P K+ S S++R E + E+D + + EV+EPRRSNRRIQP Sbjct: 2056 APSKRGSVSNSRTERVTKGKLAPAGGKLNKIEEDKVFNGNSAKTSSEVSEPRRSNRRIQP 2115 Query: 658 TSRLLEGLQSSLIISKFPSVSHDK 729 TSRLLEGLQSSLIISK PSVSH+K Sbjct: 2116 TSRLLEGLQSSLIISKIPSVSHEK 2139 >emb|CBI37358.3| unnamed protein product [Vitis vinifera] Length = 1979 Score = 177 bits (449), Expect = 7e-42 Identities = 113/254 (44%), Positives = 149/254 (58%), Gaps = 12/254 (4%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 + +++NKP+ R +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR+ K + S Sbjct: 1734 NTRDENKPDAPRMIRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSNKISEANDSV 1793 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNLKNNSITSKGDA 360 KF KYL+PQ +G GWKN + D KEK+A+E++ R+ K +++S+ Sbjct: 1794 KFAKYLIPQGSGPRGWKNTSKIDSKEKRAVESK---------PKVIRSGKPQNVSSRTVP 1844 Query: 361 
RGADHMVGDAIEYDKHEAQQPNVVNFVSNAEEG-AEGPVKFRSEALPTNIP--KKASTSS 531 R D+++ SN G AEGP+ F S LP++ P KK S+ Sbjct: 1845 R-KDNLLASGTS--------------ASNDTNGQAEGPILFSSLPLPSDAPSSKKMPVSN 1889 Query: 532 NRGEGM-RXXXXXXXXXXXXVEEK-------DNMTPEVNEPRRSNRRIQPTSRLLEGLQS 687 + + + + +EE+ PE EPRRSNRRIQPTSRLLEGLQS Sbjct: 1890 VKSQRVSKGKLAPSGGKLAKIEEEKVYNGNPGKSVPEAVEPRRSNRRIQPTSRLLEGLQS 1949 Query: 688 SLIISKFPSVSHDK 729 SLIISK PSVSHDK Sbjct: 1950 SLIISKIPSVSHDK 1963 >ref|XP_006385540.1| agenet domain-containing family protein [Populus trichocarpa] gi|566161399|ref|XP_002304281.2| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342637|gb|ERP63337.1| agenet domain-containing family protein [Populus trichocarpa] gi|550342638|gb|EEE79260.2| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] Length = 2107 Score = 174 bits (440), Expect = 8e-41 Identities = 112/265 (42%), Positives = 160/265 (60%), Gaps = 23/265 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S K+ N+P+ LR R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR++K+N Sbjct: 1830 SMKDGNRPDALRMARTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSSKNNEVNDPD 1889 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-RXXXXXXXXXXXARNL--KNNSIT-- 345 KF KYL+PQ +G GWKN +T+ EK+ ++ + R + K+NS+T Sbjct: 1890 KFAKYLLPQGSGSRGWKNTLKTESLEKRTAASKPKVLKLGKPQNVSGRTIAQKDNSLTTA 1949 Query: 346 -SKGDARGADHMVGD--AIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI- 507 S D DH+ + + + ++ +++ + +F +S++ GAEG + F S +L ++ Sbjct: 1950 VSASDGAATDHVAKNKASTSHVENTSEKHALTDFQPLSSSVGGAEGQI-FSSSSLSSDTL 2008 Query: 508 -PKKASTSSNRGE---GMRXXXXXXXXXXXXVEE-------KDNMTPEVNEPRRSNRRIQ 654 KK STS++ + G + +EE T +V EPRRSNRRIQ Sbjct: 2009 SSKKMSTSTSNAKPPRGSKGKLAPADGKFGRIEEDKVLIGSSSKSTSDVAEPRRSNRRIQ 2068 Query: 655 PTSRLLEGLQSSLIISKFPSVSHDK 729 PTSRLLEGLQSSL+++K PSVSHD+ Sbjct: 2069 PTSRLLEGLQSSLMVTKIPSVSHDR 2093 >ref|XP_006385539.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342636|gb|ERP63336.1| hypothetical protein POPTR_0003s07530g 
[Populus trichocarpa] Length = 2105 Score = 174 bits (440), Expect = 8e-41 Identities = 112/265 (42%), Positives = 160/265 (60%), Gaps = 23/265 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S K+ N+P+ LR R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR++K+N Sbjct: 1809 SMKDGNRPDALRMARTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSSKNNEVNDPD 1868 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-RXXXXXXXXXXXARNL--KNNSIT-- 345 KF KYL+PQ +G GWKN +T+ EK+ ++ + R + K+NS+T Sbjct: 1869 KFAKYLLPQGSGSRGWKNTLKTESLEKRTAASKPKVLKLGKPQNVSGRTIAQKDNSLTTA 1928 Query: 346 -SKGDARGADHMVGD--AIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI- 507 S D DH+ + + + ++ +++ + +F +S++ GAEG + F S +L ++ Sbjct: 1929 VSASDGAATDHVAKNKASTSHVENTSEKHALTDFQPLSSSVGGAEGQI-FSSSSLSSDTL 1987 Query: 508 -PKKASTSSNRGE---GMRXXXXXXXXXXXXVEE-------KDNMTPEVNEPRRSNRRIQ 654 KK STS++ + G + +EE T +V EPRRSNRRIQ Sbjct: 1988 SSKKMSTSTSNAKPPRGSKGKLAPADGKFGRIEEDKVLIGSSSKSTSDVAEPRRSNRRIQ 2047 Query: 655 PTSRLLEGLQSSLIISKFPSVSHDK 729 PTSRLLEGLQSSL+++K PSVSHD+ Sbjct: 2048 PTSRLLEGLQSSLMVTKIPSVSHDR 2072 >ref|XP_006385538.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342635|gb|ERP63335.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] Length = 2086 Score = 174 bits (440), Expect = 8e-41 Identities = 112/265 (42%), Positives = 160/265 (60%), Gaps = 23/265 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S K+ N+P+ LR R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR++K+N Sbjct: 1809 SMKDGNRPDALRMARTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSSKNNEVNDPD 1868 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-RXXXXXXXXXXXARNL--KNNSIT-- 345 KF KYL+PQ +G GWKN +T+ EK+ ++ + R + K+NS+T Sbjct: 1869 KFAKYLLPQGSGSRGWKNTLKTESLEKRTAASKPKVLKLGKPQNVSGRTIAQKDNSLTTA 1928 Query: 346 -SKGDARGADHMVGD--AIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI- 507 S D DH+ + + + ++ +++ + +F +S++ GAEG + F S +L ++ Sbjct: 1929 VSASDGAATDHVAKNKASTSHVENTSEKHALTDFQPLSSSVGGAEGQI-FSSSSLSSDTL 1987 Query: 508 
-PKKASTSSNRGE---GMRXXXXXXXXXXXXVEE-------KDNMTPEVNEPRRSNRRIQ 654 KK STS++ + G + +EE T +V EPRRSNRRIQ Sbjct: 1988 SSKKMSTSTSNAKPPRGSKGKLAPADGKFGRIEEDKVLIGSSSKSTSDVAEPRRSNRRIQ 2047 Query: 655 PTSRLLEGLQSSLIISKFPSVSHDK 729 PTSRLLEGLQSSL+++K PSVSHD+ Sbjct: 2048 PTSRLLEGLQSSLMVTKIPSVSHDR 2072 >ref|XP_006385537.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] gi|550342634|gb|ERP63334.1| hypothetical protein POPTR_0003s07530g [Populus trichocarpa] Length = 1591 Score = 174 bits (440), Expect = 8e-41 Identities = 112/265 (42%), Positives = 160/265 (60%), Gaps = 23/265 (8%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S K+ N+P+ LR R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+DR++K+N Sbjct: 1314 SMKDGNRPDALRMARTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADRSSKNNEVNDPD 1373 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEAR-RXXXXXXXXXXXARNL--KNNSIT-- 345 KF KYL+PQ +G GWKN +T+ EK+ ++ + R + K+NS+T Sbjct: 1374 KFAKYLLPQGSGSRGWKNTLKTESLEKRTAASKPKVLKLGKPQNVSGRTIAQKDNSLTTA 1433 Query: 346 -SKGDARGADHMVGD--AIEYDKHEAQQPNVVNF--VSNAEEGAEGPVKFRSEALPTNI- 507 S D DH+ + + + ++ +++ + +F +S++ GAEG + F S +L ++ Sbjct: 1434 VSASDGAATDHVAKNKASTSHVENTSEKHALTDFQPLSSSVGGAEGQI-FSSSSLSSDTL 1492 Query: 508 -PKKASTSSNRGE---GMRXXXXXXXXXXXXVEE-------KDNMTPEVNEPRRSNRRIQ 654 KK STS++ + G + +EE T +V EPRRSNRRIQ Sbjct: 1493 SSKKMSTSTSNAKPPRGSKGKLAPADGKFGRIEEDKVLIGSSSKSTSDVAEPRRSNRRIQ 1552 Query: 655 PTSRLLEGLQSSLIISKFPSVSHDK 729 PTSRLLEGLQSSL+++K PSVSHD+ Sbjct: 1553 PTSRLLEGLQSSLMVTKIPSVSHDR 1577 >gb|EMJ10269.1| hypothetical protein PRUPE_ppa000035mg [Prunus persica] Length = 2263 Score = 172 bits (437), Expect = 2e-40 Identities = 108/257 (42%), Positives = 143/257 (55%), Gaps = 15/257 (5%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 + + +NKP+ RT+R+GLQKEG+KV +G+PKPGKKRKFMEVSKHYV++++TK N S Sbjct: 1995 NTRTENKPDPTRTIRTGLQKEGAKVVYGIPKPGKKRKFMEVSKHYVANQSTKINETNDSM 2054 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNLKNNSITSKGDA 360 KF 
KYLMPQ +G G KN + D +EKQ E++ + + + S+ K + Sbjct: 2055 KFAKYLMPQGSGSRGLKNTSKIDTREKQVTESK----LKGLKSIKPQGVPSKSVPQKDNL 2110 Query: 361 RGADHMVGDAIEYDKHEAQQPNVVNFVSNAE-----EGAEGPVKFRSEALPTNIP--KKA 519 V D H + + V+ V + EGP+ F S A ++ P KK Sbjct: 2111 LTDARTVSDGSSEMDHTGKIKDSVSRVDSVSGKHTLSQPEGPIVFSSLAPSSDFPSSKKV 2170 Query: 520 STSSNRGEGMRXXXXXXXXXXXXVEE-------KDNMTPEVNEPRRSNRRIQPTSRLLEG 678 S S+ + + +EE T EV EPRRSNRRIQPTSRLLEG Sbjct: 2171 SASTAKSRSNKGNLAPAGAKLGKIEEGKVFSGNPAKSTSEVAEPRRSNRRIQPTSRLLEG 2230 Query: 679 LQSSLIISKFPSVSHDK 729 LQSSLII+K PS SHDK Sbjct: 2231 LQSSLIITKIPSGSHDK 2247 >ref|XP_003525570.1| PREDICTED: uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like isoform X1 [Glycine max] gi|571453935|ref|XP_006579634.1| PREDICTED: uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like isoform X2 [Glycine max] gi|571453937|ref|XP_006579635.1| PREDICTED: uncharacterized threonine-rich GPI-anchored glycoprotein PJ4664.02-like isoform X3 [Glycine max] Length = 2242 Score = 164 bits (414), Expect = 8e-38 Identities = 106/254 (41%), Positives = 137/254 (53%), Gaps = 13/254 (5%) Frame = +1 Query: 7 NKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSAK 183 +KN+NK + R +R+GLQKEGS+V FGVPKPGKKRKFMEVSKHYV+D +K N S K Sbjct: 1977 SKNENKSDAHRMVRTGLQKEGSRVIFGVPKPGKKRKFMEVSKHYVADGTSKINDGTDSVK 2036 Query: 184 FTKYLMPQTTGVGGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNL--KNNSITSK-- 351 + +L+PQ TG GWKN+ + D KEK ++R R + K N +++ Sbjct: 2037 LSNFLIPQGTGSRGWKNSSKNDTKEKLGADSRPTFKSGKSQSVLGRVVPPKENPLSNSRT 2096 Query: 352 GDARGADHMVGDAIEYDKHEAQQPNVVN--FVSNAEEGAEGPVKFRSEALPTNIPKKAST 525 D + D+ + K+ +Q N V S + GP+ S T+ T Sbjct: 2097 NDLTSHAERIKDSSSHFKNVSQSENQVERALYSGSTGAGAGPILHSSLVSSTDSHPAKKT 2156 Query: 526 SSNRGEGMRXXXXXXXXXXXXVEEKD------NMTPEVNEPRRSNRRIQPTSRLLEGLQS 687 S++R + EEK T E EPRRS RRIQPTSRLLEGLQS Sbjct: 2157 STSRASKGKLAPAGGGRLGKIDEEKAFSGNPLKSTSENTEPRRSIRRIQPTSRLLEGLQS 2216 Query: 688 SLIISKFPSVSHDK 729 SLIISK PS SH+K Sbjct: 2217 SLIISKIPSASHEK 2230 
>ref|XP_006590567.1| PREDICTED: mucin-17-like [Glycine max] Length = 2135 Score = 163 bits (412), Expect = 1e-37 Identities = 103/248 (41%), Positives = 131/248 (52%), Gaps = 6/248 (2%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S+KN NK + R +R+GLQKEGSKV FGVPKPGKKRKFMEVSKHYV+ +K S Sbjct: 1907 SSKNGNKFDAHRMVRTGLQKEGSKVIFGVPKPGKKRKFMEVSKHYVAHENSKIGDRNDSV 1966 Query: 181 KFTKYLMPQTTGVGGWKNNPRTDLKEKQAIEARRXXXXXXXXXXXARNLKNNSITSKGDA 360 K T +LMP ++G GWKN+ + D KEK +++ + KN + S+ Sbjct: 1967 KLTNFLMPPSSGPRGWKNSSKNDAKEKHGADSKPKTSHTERIKDSSNLFKNAASKSESKV 2026 Query: 361 RGADHMVGDAIEYDKHEAQQPNVVNFVSNAEEGAEGPVKFRSEALPTNIPKKASTSSNRG 540 A H +A +GA GP F S A + SS+R Sbjct: 2027 ERAPH-----------------------SASDGATGPFLFSSLATSVDAHPTKRASSSRA 2063 Query: 541 EGMRXXXXXXXXXXXXVEEKDNMTP-----EVNEPRRSNRRIQPTSRLLEGLQSSLIISK 705 + +E+ N P ++ EPRRSNRRIQPTSRLLEGLQSSLIISK Sbjct: 2064 SKGKLAPARVKSGKVEMEKALNDNPMKSASDMVEPRRSNRRIQPTSRLLEGLQSSLIISK 2123 Query: 706 FPSVSHDK 729 PSVSH++ Sbjct: 2124 IPSVSHNR 2131 >ref|XP_006369017.1| hypothetical protein POPTR_0001s15740g [Populus trichocarpa] gi|550347376|gb|ERP65586.1| hypothetical protein POPTR_0001s15740g [Populus trichocarpa] Length = 2057 Score = 162 bits (409), Expect = 3e-37 Identities = 104/246 (42%), Positives = 139/246 (56%), Gaps = 4/246 (1%) Frame = +1 Query: 4 SNKNDNKPNTLRTMRSGLQKEGSKV-FGVPKPGKKRKFMEVSKHYVSDRATKSNAEPGSA 180 S K+ NKP+ LR R+GLQKEGSKV FGVPKPGKKRKFMEVSKHYV+D+++K++ S Sbjct: 1809 STKDGNKPDVLRMARTGLQKEGSKVIFGVPKPGKKRKFMEVSKHYVADQSSKNDDANDSV 1868 Query: 181 KFTKYLMPQTTGVGGWKNNPRTD-LKEKQAIEARRXXXXXXXXXXXARNL--KNNSITSK 351 KF KYLMP+ +G GWKN RT+ + + A + R + K+NS+T+ Sbjct: 1869 KFAKYLMPRGSGSRGWKNTLRTESIANRTAASKPKVFKSGKPQNVSGRTITQKDNSLTTT 1928 Query: 352 GDARGADHMVGDAIEYDKHEAQQPNVVNFVSNAEEGAEGPVKFRSEALPTNIPKKASTSS 531 A D V D H A+ ++ V N E + + + + P++ S Sbjct: 1929 VSASN-DGAVTD------HVAKTKASISHVENTSE--KRTLSSKKTSTSNAKPQRVSKGK 1979 Query: 532 
NRGEGMRXXXXXXXXXXXXVEEKDNMTPEVNEPRRSNRRIQPTSRLLEGLQSSLIISKFP 711 G + K N +V EPRRSNR++QPTSRLLEGLQSSL++SK P Sbjct: 1980 LAPAGGKLGRIEEDKVFNGDSSKSN--SDVTEPRRSNRKMQPTSRLLEGLQSSLMVSKVP 2037 Query: 712 SVSHDK 729 +VSHDK Sbjct: 2038 AVSHDK 2043