BLASTX nr result
ID: Catharanthus23_contig00012529
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00012529 (1088 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006353908.1| PREDICTED: uncharacterized protein LOC102588... 159 2e-36 ref|XP_004234436.1| PREDICTED: uncharacterized protein LOC101263... 156 2e-35 ref|XP_006363332.1| PREDICTED: uncharacterized protein LOC102601... 145 2e-32 gb|EOY04736.1| Uncharacterized protein isoform 1 [Theobroma caca... 140 7e-31 ref|XP_006421998.1| hypothetical protein CICLE_v10005709mg [Citr... 139 3e-30 gb|EOY23283.1| Uncharacterized protein isoform 3 [Theobroma cacao] 134 8e-29 gb|EOY23281.1| Uncharacterized protein isoform 1 [Theobroma cacao] 134 8e-29 ref|XP_006490659.1| PREDICTED: uncharacterized protein LOC102610... 133 1e-28 ref|XP_002513663.1| conserved hypothetical protein [Ricinus comm... 130 9e-28 gb|EXB66274.1| hypothetical protein L484_003030 [Morus notabilis] 129 2e-27 gb|EXC05979.1| hypothetical protein L484_014249 [Morus notabilis] 127 8e-27 gb|EXC05978.1| hypothetical protein L484_014248 [Morus notabilis] 124 9e-26 gb|EOY23284.1| Uncharacterized protein isoform 4 [Theobroma cacao] 121 6e-25 gb|EMJ04540.1| hypothetical protein PRUPE_ppa020271mg [Prunus pe... 120 9e-25 gb|EOY23282.1| Uncharacterized protein isoform 2, partial [Theob... 118 4e-24 ref|XP_006423148.1| hypothetical protein CICLE_v10030388mg [Citr... 110 1e-21 gb|EMJ04444.1| hypothetical protein PRUPE_ppa019447mg [Prunus pe... 104 7e-20 gb|EPS57956.1| hypothetical protein M569_16861 [Genlisea aurea] 103 1e-19 emb|CAN66027.1| hypothetical protein VITISV_028775 [Vitis vinifera] 101 5e-19 gb|ESW15453.1| hypothetical protein PHAVU_007G073600g [Phaseolus... 92 4e-16 >ref|XP_006353908.1| PREDICTED: uncharacterized protein LOC102588162 [Solanum tuberosum] Length = 260 Score = 159 bits (402), Expect = 2e-36 Identities = 103/216 (47%), Positives = 132/216 (61%), Gaps = 15/216 (6%) Frame = +2 Query: 272 LVQDQNLNIHSTGGALLGGKIDISKAAKKGV-LSGRKALNDISNSGKPSALQASKKHNSI 448 ++QDQN+NIH G +L G K D SKA KKG L GRKALNDISNS KPS+LQASKK NS Sbjct: 3 MLQDQNINIHFDGASLFG-KNDTSKALKKGGGLGGRKALNDISNSAKPSSLQASKK-NSA 60 Query: 449 NVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHSRKALGDLTNSVKPSLHKHLSGKVEEKK 628 +VI + KD+ +K G K NL +KG RKAL DLTNS KPS K S K +KK Sbjct: 61 SVISIGKDLNATKNKFIAGTKDNLAKVPDKG-GRKALTDLTNSSKPSA-KQGSKKGLDKK 118 Query: 629 LNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSPVVV---------- 778 L+ A IP +I EE F+H+H+KCI+ QRK + +FL+ +GL + + V Sbjct: 119 LSAAAAANIPTSIAEEQFLHDHKKCIKAQRKVFDMDFFLKEVGLENDIPVELLASPRVSK 178 Query: 779 ---PQQPLKLKPESPV-KHLEMEEIPVVFLSDQDKK 874 L + E+PV KH E+EE+P + + DQ K Sbjct: 179 LSMKSMSLTYQLETPVKKHFEVEEMPELLMCDQVPK 214 >ref|XP_004234436.1| PREDICTED: uncharacterized protein LOC101263287 [Solanum lycopersicum] Length = 261 Score = 156 bits (394), Expect = 2e-35 Identities = 101/216 (46%), Positives = 134/216 (62%), Gaps = 17/216 (7%) Frame = +2 Query: 272 LVQDQNLNIHSTGGALLGGKIDISKAAKKGV-LSGRKALNDISNSGKPSALQASKKHNSI 448 ++QDQN+NIH G +L G K + SKA KKG L GRKALNDISNS KPS+LQASKK NS Sbjct: 3 MLQDQNINIHFDGASLFG-KNETSKALKKGGGLGGRKALNDISNSAKPSSLQASKK-NST 60 Query: 449 NVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHSRKALGDLTNSVKPSLHKHLSGKVEEKK 628 +VI + KD+ +K G K NL +KG RKAL DLTNS KPS K S K +KK Sbjct: 61 SVISIGKDLNATKNKFIAGTKDNLAKVPDKG-GRKALTDLTNSSKPSA-KQGSKKGFDKK 118 Query: 629 LNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSPVVVPQQP------ 790 + A IP +I EE F+H+H++CI+ QRK ++ +FL+ +GL + +P QP Sbjct: 119 WSAAAAANIPTSIAEEQFLHDHKECIKAQRKVIDMDFFLKEVGLDND--IPVQPLASPHA 176 Query: 791 ---------LKLKPESPV-KHLEMEEIPVVFLSDQD 868 L + E+PV KH E++E+P + + DQD Sbjct: 177 SKLSMKSMSLTYQLETPVKKHFEVDEMPELLMCDQD 212 >ref|XP_006363332.1| PREDICTED: uncharacterized protein LOC102601350 [Solanum tuberosum] Length = 240 Score = 145 bits (367), Expect = 2e-32 Identities = 105/271 (38%), Positives = 141/271 (52%), Gaps = 22/271 (8%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGV--LSGRKALNDISNSGKPSALQ 424 MAT + L+QDQN+++H G +L+G K I KA KKG + GRKALNDISNS KPSALQ Sbjct: 1 MATPGAYLIQDQNISVHYDGASLVG-KNGIYKAQKKGGGGIGGRKALNDISNSAKPSALQ 59 Query: 425 ASKKHNSINVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHSRKALGDLTNSVKPSLHKHL 604 ASKK+NSIN I + KD S+ + G K N + EK RKAL DLTNS K S Sbjct: 60 ASKKNNSINRISIGKDHDASRKKFSAGTKANYSKGLEKKGGRKALADLTNSSKSS----- 114 Query: 605 SGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGS---PVV 775 ++ ++ F+HNHQ C++ QRK M+ FL+ IGL PV Sbjct: 115 -------------------SVAKDQFLHNHQNCVKAQRKVMDMSCFLKEIGLDHDDVPVH 155 Query: 776 VPQQPLKLK-----------PESPVKH-LEMEEIPVVFLSD-----QDKKARFSEPDCXX 904 + P LK P+SP+KH E+EE+P + D + +A S P C Sbjct: 156 LGASPHALKPSMKSKSSTYQPDSPMKHYAEVEEMPELMFYDEVRRCEQNRACASCPPC-- 213 Query: 905 XXXXXXXXXXXAFEFWMDEENLFEFKLIESP 997 + WMD +++ +F LI +P Sbjct: 214 -----VASPKSRYVSWMD-DSVLDFALIGTP 238 >gb|EOY04736.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508712840|gb|EOY04737.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 254 Score = 140 bits (354), Expect = 7e-31 Identities = 95/259 (36%), Positives = 139/259 (53%), Gaps = 10/259 (3%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA+ + L+QDQN N+H G A + GK +I KA +KG + GRK L D+SNS P+ Q S Sbjct: 1 MASRSVGLIQDQNFNVHYNG-ASVAGKANICKAPRKGGIGGRKPLGDLSNSVNPAPNQTS 59 Query: 431 KKHNSINVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHS--RKALGDLTNSVKPSLHKHL 604 KK NS N K+ G SK+ K +++ A+EK + RKAL D++NS KP L + Sbjct: 60 KKENSKNFSFAEKETGASKLTHDSSKKKSVSKASEKVQTGGRKALSDISNSGKPHL-QET 118 Query: 605 SGKVEEKKLNVTAEE-TIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL------G 763 S K + KLN+ AE+ P I EEGF+HNH++CI+ QR+ ++ FL+ +GL Sbjct: 119 SRKNQTAKLNILAEDPRQPKDIAEEGFLHNHEECIKAQRRALSTNQFLQILGLDGFSKQS 178 Query: 764 SPVVVPQQPLKLKPESPVKHLEMEEIPVVFLSD-QDKKARFSEPDCXXXXXXXXXXXXXA 940 + P K+K SP + E+ ++P + + D K + S Sbjct: 179 ASAKEPPMSNKMKHGSPPRCSELGQMPELLIEDLSPPKHKLSS---KFDSAPPSPEPLDN 235 Query: 941 FEFWMDEENLFEFKLIESP 997 + W D + + FKLIESP Sbjct: 236 YMHWNDPKYIPSFKLIESP 254 >ref|XP_006421998.1| hypothetical protein CICLE_v10005709mg [Citrus clementina] gi|557523871|gb|ESR35238.1| hypothetical protein CICLE_v10005709mg [Citrus clementina] Length = 250 Score = 139 bits (349), Expect = 3e-30 Identities = 106/268 (39%), Positives = 136/268 (50%), Gaps = 19/268 (7%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA+ L++DQNLN H G + GGK ISK KKG L GRK L D+SNS P+ Q+ Sbjct: 1 MASQLGGLIRDQNLNAHLNGASAGGGKSTISKVPKKGALGGRKPLGDLSNSVNPTPNQSL 60 Query: 431 KKHN----SINVIPVAKDIGGSKIAKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPSL 592 KK N S NVI +K SKI K + + A EK RKAL D++NS K L Sbjct: 61 KKQNSNVFSDNVIGASK----SKIKIDGSKKKSFSRAPEKLQTSGRKALSDISNSGKSHL 116 Query: 593 HKHLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL--GS 766 H+ K KL+V EE + AI EEG++HNHQ+CI+ Q K M+ L T+GL G Sbjct: 117 HE-APKKNMNPKLSVLTEEDLS-AIAEEGYLHNHQECIKAQTKSMDIDELLRTVGLDKGF 174 Query: 767 P-VVVPQQPLKLKPESPVKHLEMEEIPVVFL----------SDQDKKARFSEPDCXXXXX 913 P P Q K+ P SP ++LE+EE+P L SD D R + P Sbjct: 175 PKQAEPPQLSKVMPASPPRYLELEELPEDQLDLSPWKYDQFSDLDSPPRCASPKSPNH-- 232 Query: 914 XXXXXXXXAFEFWMDEENLFEFKLIESP 997 + W D + F+LIESP Sbjct: 233 ---------YMLWKDHDEA-NFRLIESP 250 >gb|EOY23283.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 254 Score = 134 bits (336), Expect = 8e-29 Identities = 97/260 (37%), Positives = 130/260 (50%), Gaps = 11/260 (4%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA A RL+QDQNLN+H G ++ GG+ +SKA KKG +GRK L D+SNS P QA Sbjct: 1 MALRAGRLIQDQNLNVHYNGVSV-GGQKKVSKAPKKGGTAGRKPLGDLSNSVNPIQKQAP 59 Query: 431 KKHNSINVIPVAK-DIGGSKIAKAVGGKLNLTNATEK---GHSRKALGDLTNSVKPSLHK 598 KK N K I SKI K +++NA+E+ SRKAL D++NSVKP + Sbjct: 60 KKENGHGFSIADKGTITTSKIPVDANRKNSVSNASERVLQNDSRKALSDISNSVKPCMR- 118 Query: 599 HLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSPV-- 772 EK LN I I+EE F+HNHQ+CI+ Q++ M+ FL+ +GL Sbjct: 119 ----VTAEKNLNAKRS----IVIEEECFLHNHQECIKAQKQAMHMDEFLQMVGLDKDFSR 170 Query: 773 -----VVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEPDCXXXXXXXXXXXXX 937 P K KP+S +K LE EIP + + DQ Sbjct: 171 QSTLSKTPPISNKTKPKSSLKSLEPLEIPGLLIEDQSPLKHNLCSKLVSPSATRTPEPPN 230 Query: 938 AFEFWMDEENLFEFKLIESP 997 F W D + + F+LIE+P Sbjct: 231 HFVHWADHD-IVSFRLIETP 249 >gb|EOY23281.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 349 Score = 134 bits (336), Expect = 8e-29 Identities = 97/260 (37%), Positives = 130/260 (50%), Gaps = 11/260 (4%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA A RL+QDQNLN+H G ++ GG+ +SKA KKG +GRK L D+SNS P QA Sbjct: 96 MALRAGRLIQDQNLNVHYNGVSV-GGQKKVSKAPKKGGTAGRKPLGDLSNSVNPIQKQAP 154 Query: 431 KKHNSINVIPVAK-DIGGSKIAKAVGGKLNLTNATEK---GHSRKALGDLTNSVKPSLHK 598 KK N K I SKI K +++NA+E+ SRKAL D++NSVKP + Sbjct: 155 KKENGHGFSIADKGTITTSKIPVDANRKNSVSNASERVLQNDSRKALSDISNSVKPCMR- 213 Query: 599 HLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSPV-- 772 EK LN I I+EE F+HNHQ+CI+ Q++ M+ FL+ +GL Sbjct: 214 ----VTAEKNLNAKRS----IVIEEECFLHNHQECIKAQKQAMHMDEFLQMVGLDKDFSR 265 Query: 773 -----VVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEPDCXXXXXXXXXXXXX 937 P K KP+S +K LE EIP + + DQ Sbjct: 266 QSTLSKTPPISNKTKPKSSLKSLEPLEIPGLLIEDQSPLKHNLCSKLVSPSATRTPEPPN 325 Query: 938 AFEFWMDEENLFEFKLIESP 997 F W D + + F+LIE+P Sbjct: 326 HFVHWADHD-IVSFRLIETP 344 >ref|XP_006490659.1| PREDICTED: uncharacterized protein LOC102610843 [Citrus sinensis] Length = 249 Score = 133 bits (335), Expect = 1e-28 Identities = 106/268 (39%), Positives = 135/268 (50%), Gaps = 19/268 (7%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA+ L++DQNLN H GA GGK ISK KKG L GRK L D+SNS P+ Q+ Sbjct: 1 MASHLGGLIRDQNLNAH-LNGASAGGKSTISKVPKKGALGGRKPLGDLSNSVNPTPNQSL 59 Query: 431 KKHN----SINVIPVAKDIGGSKIAKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPSL 592 KK N S NVI +K SKI K + + A EK RKAL D++NS K L Sbjct: 60 KKQNSNVFSDNVIGASK----SKIKIDGSKKKSFSRAPEKLQTSGRKALSDISNSGKSHL 115 Query: 593 HKHLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL--GS 766 H+ K L+V EE + AI EEG++HNHQ+CI+ Q K M+ L T+GL G Sbjct: 116 HE-APKKNFNPTLSVLTEEDLS-AIAEEGYLHNHQECIKAQTKSMDIDELLRTVGLDKGF 173 Query: 767 P-VVVPQQPLKLKPESPVKHLEMEEIPVVFL----------SDQDKKARFSEPDCXXXXX 913 P P Q K+ P SP ++LE+EE+P L SD D R + P Sbjct: 174 PKQAEPTQLSKVMPASPPRYLELEELPEDQLHLSPWKYDQFSDLDSPPRCASPKSPNH-- 231 Query: 914 XXXXXXXXAFEFWMDEENLFEFKLIESP 997 + W D + F+LIESP Sbjct: 232 ---------YMLWKDHDEA-NFRLIESP 249 >ref|XP_002513663.1| conserved hypothetical protein [Ricinus communis] gi|223547571|gb|EEF49066.1| conserved hypothetical protein [Ricinus communis] Length = 250 Score = 130 bits (327), Expect = 9e-28 Identities = 89/213 (41%), Positives = 126/213 (59%), Gaps = 8/213 (3%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA+ A +VQDQNLNIH ++ G K ++SKA +KGVL GR L D+SNS KPS QAS Sbjct: 1 MASRAGGVVQDQNLNIHFNETSV-GWKTNVSKAPRKGVLGGRTPLGDLSNSLKPSLNQAS 59 Query: 431 KKHNSINVIPVAKDIGGSKIA-KAVGGKLNLTNATEKGHS--RKALGDLTNSVKPSLHKH 601 KK NS K+IG S+ A A + A+ K H+ RK L D++NS K + ++ Sbjct: 60 KKQNSSIFSFTEKEIGASQNALDATKNRSTCKKASGKAHTTGRKPLSDISNSGKQNRNEG 119 Query: 602 LSGKVEEKKLNVTAEETIPI-AIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSPVVV 778 S + KL+V AEE I AI E F+HNH++CI+ Q + MN FL+ IGL + ++ Sbjct: 120 -SKRSYNAKLSVVAEEPIDANAIAGEQFLHNHEECIKVQSRVMNLDQFLQMIGLDNDIIK 178 Query: 779 PQQ---PLKLKPESPVK-HLEMEEIPVVFLSDQ 865 +K+K ESP + HLE+EE+ + ++ Sbjct: 179 QHANTVSIKVKAESPPRQHLELEEMTEELIEEE 211 >gb|EXB66274.1| hypothetical protein L484_003030 [Morus notabilis] Length = 290 Score = 129 bits (325), Expect = 2e-27 Identities = 92/264 (34%), Positives = 136/264 (51%), Gaps = 11/264 (4%) Frame = +2 Query: 239 KQSLMATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSA 418 ++S MA++ QDQN N+ +G A GGK+ +K+ KKG L GRK L +ISNS + Sbjct: 41 RKSTMASAIGVPFQDQNFNVQYSG-ASAGGKMHTNKSQKKGGLGGRKPLGEISNSTNIAP 99 Query: 419 LQASKKHNSINVIPVAKDIGGSK-IAKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPS 589 QASKK NS K+ G K + + + ++ ++K SRKAL D++NS K Sbjct: 100 TQASKKQNS-------KNFGFIKEVTREESNRKSIAKTSDKMQTRSRKALSDISNSGKAH 152 Query: 590 LHKHLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL--- 760 LH+ + K V E P I EE F+H+HQ+CI+ + K M+ FL +IGL Sbjct: 153 LHEASKNNLSLKLSAVEEEHLFPSCIAEEQFLHDHQECIKAKTKPMDVEQFLVSIGLTNG 212 Query: 761 -----GSPVVVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEPDCXXXXXXXXX 925 SP V P + K+ P++P+ LE EEI + D K + + P C Sbjct: 213 SSQQVESPRVPPVKLSKMMPQNPLSTLEPEEITEHLIEDDLWKMKMNSPTC-----RSPK 267 Query: 926 XXXXAFEFWMDEENLFEFKLIESP 997 + FW D +++ FKL++SP Sbjct: 268 SPIYSSAFWKDCDSI-NFKLMDSP 290 >gb|EXC05979.1| hypothetical protein L484_014249 [Morus notabilis] Length = 246 Score = 127 bits (319), Expect = 8e-27 Identities = 91/260 (35%), Positives = 133/260 (51%), Gaps = 11/260 (4%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA++ QDQN N+ +G A GGK+ +K+ KKG L GRK L +ISNS + QAS Sbjct: 1 MASAIGVPFQDQNFNVQYSG-ASAGGKMHTNKSQKKGGLGGRKPLGEISNSTNIAPTQAS 59 Query: 431 KKHNSINVIPVAKDIGGSK-IAKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPSLHKH 601 KK NS K+ G K + + + ++ ++K SRKAL D++NS K LH+ Sbjct: 60 KKQNS-------KNFGFIKEVTREESNRKSIAKTSDKVQTRSRKALSDISNSGKAHLHEA 112 Query: 602 LSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL------- 760 + K V E P I EE F+H+HQ+CI+ + K M+ FL +IGL Sbjct: 113 SKNNLSLKLSAVEEEHLFPSCIAEEQFLHDHQECIKAKTKPMDVEQFLVSIGLTNGSSQQ 172 Query: 761 -GSPVVVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEPDCXXXXXXXXXXXXX 937 SP V P + K+ P++P+ LE EEI + D K + + P C Sbjct: 173 VESPRVPPVKLSKMMPQNPLSTLEPEEITEHLIEDDLWKMKMNSPTC-----RSPKSPIY 227 Query: 938 AFEFWMDEENLFEFKLIESP 997 + FW D +++ FKL++SP Sbjct: 228 SSAFWKDCDSI-NFKLMDSP 246 >gb|EXC05978.1| hypothetical protein L484_014248 [Morus notabilis] Length = 246 Score = 124 bits (310), Expect = 9e-26 Identities = 90/260 (34%), Positives = 132/260 (50%), Gaps = 11/260 (4%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA++ QDQN N+ +G A GGK+ +K+ KK L GRK L +ISNS + QAS Sbjct: 1 MASAIGVPFQDQNFNVQYSG-ASAGGKMHANKSQKKVGLGGRKPLGEISNSTNIAPTQAS 59 Query: 431 KKHNSINVIPVAKDIGGSK-IAKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPSLHKH 601 KK NS K+ G K + + + ++ ++K SRKAL D++NS K LH+ Sbjct: 60 KKQNS-------KNFGFIKEVTREESNRKSIAKTSDKMQTRSRKALSDISNSGKAHLHEA 112 Query: 602 LSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL------- 760 + K V E P I EE F+H+HQ+CI+ + K M+ FL +IGL Sbjct: 113 SKNNLSLKLSAVEEEHLFPSCIAEEQFLHDHQECIKAKTKPMDVEQFLVSIGLTNGSSQQ 172 Query: 761 -GSPVVVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEPDCXXXXXXXXXXXXX 937 SP V P + K+ P++P+ LE EEI + D K + + P C Sbjct: 173 VESPRVPPVKLSKMMPQNPLSTLEPEEITEHLIEDDLWKMKMNSPTC-----RSPKSPIY 227 Query: 938 AFEFWMDEENLFEFKLIESP 997 + FW D +++ FKL++SP Sbjct: 228 SSAFWKDCDSI-NFKLMDSP 246 >gb|EOY23284.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 290 Score = 121 bits (303), Expect = 6e-25 Identities = 98/296 (33%), Positives = 134/296 (45%), Gaps = 47/296 (15%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA A RL+QDQNLN+H G ++ GG+ +SKA KKG +GRK L D+SNS P QA Sbjct: 1 MALRAGRLIQDQNLNVHYNGVSV-GGQKKVSKAPKKGGTAGRKPLGDLSNSVNPIQKQAP 59 Query: 431 KKHNSINVIPVAK-DIGGSKIAKAVGGKLNLTNATEK---GHSRKALGDLTNSVKPSLHK 598 KK N K I SKI K +++NA+E+ SRKAL D++NSVKP + Sbjct: 60 KKENGHGFSIADKGTITTSKIPVDANRKNSVSNASERVLQNDSRKALSDISNSVKPCMR- 118 Query: 599 HLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL------ 760 EK LN I I+EE F+HNHQ+CI+ Q++ M+ FL+ +GL Sbjct: 119 ----VTAEKNLNAKRS----IVIEEECFLHNHQECIKAQKQAMHMDEFLQMVGLDKGKEN 170 Query: 761 ----GSPVVVPQQPL---------------------------------KLKPESPVKHLE 829 G + + + L K KP+S +K LE Sbjct: 171 LNLSGLTIQMSKSSLWFSLPKTPNFECMFFCLDFSRQSTLSKTPPISNKTKPKSSLKSLE 230 Query: 830 MEEIPVVFLSDQDKKARFSEPDCXXXXXXXXXXXXXAFEFWMDEENLFEFKLIESP 997 EIP + + DQ F W D + + F+LIE+P Sbjct: 231 PLEIPGLLIEDQSPLKHNLCSKLVSPSATRTPEPPNHFVHWADHD-IVSFRLIETP 285 >gb|EMJ04540.1| hypothetical protein PRUPE_ppa020271mg [Prunus persica] Length = 233 Score = 120 bits (301), Expect = 9e-25 Identities = 83/209 (39%), Positives = 113/209 (54%), Gaps = 12/209 (5%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA++ L QDQNL +HS G A LG K D + +KG L RK L D+SNSGKP+ QAS Sbjct: 1 MASTIGHLFQDQNLIVHSHG-ASLGRKGDTFRKQRKGGLGARKPLGDLSNSGKPALTQAS 59 Query: 431 KKHNSINVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHSRKALGDLTNSVKPSLHKHLSG 610 KK S ++ D K K+ + SRKAL D++NS P L Sbjct: 60 KKQLSKEMV---HDASNKKAFSKASDKV-------QTRSRKALSDISNSQAP-----LVQ 104 Query: 611 KVEEKKLNVTAEETI-PIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIG---------- 757 K KL+V AEE + P AI EE F+HNH++CI+ Q + M+ +FL T+G Sbjct: 105 KKHNMKLSVVAEEALCPGAIAEERFLHNHEECIKAQTQAMDMDHFLMTLGIHKVNSFFVQ 164 Query: 758 -LGSPVVVPQQPLKLKPESPVKHLEMEEI 841 L + +P Q L +PESP ++L +EE+ Sbjct: 165 ILARILHLPGQHLHRQPESPSRYLHLEEM 193 >gb|EOY23282.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 244 Score = 118 bits (296), Expect = 4e-24 Identities = 76/174 (43%), Positives = 101/174 (58%), Gaps = 4/174 (2%) Frame = +2 Query: 251 MATSASRLVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQAS 430 MA A RL+QDQNLN+H G ++ GG+ +SKA KKG +GRK L D+SNS P QA Sbjct: 59 MALRAGRLIQDQNLNVHYNGVSV-GGQKKVSKAPKKGGTAGRKPLGDLSNSVNPIQKQAP 117 Query: 431 KKHNSINVIPVAK-DIGGSKIAKAVGGKLNLTNATEK---GHSRKALGDLTNSVKPSLHK 598 KK N K I SKI K +++NA+E+ SRKAL D++NSVKP + Sbjct: 118 KKENGHGFSIADKGTITTSKIPVDANRKNSVSNASERVLQNDSRKALSDISNSVKPCMR- 176 Query: 599 HLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL 760 EK LN I I+EE F+HNHQ+CI+ Q++ M+ FL+ +GL Sbjct: 177 ----VTAEKNLNAKRS----IVIEEECFLHNHQECIKAQKQAMHMDEFLQMVGL 222 >ref|XP_006423148.1| hypothetical protein CICLE_v10030388mg [Citrus clementina] gi|557525082|gb|ESR36388.1| hypothetical protein CICLE_v10030388mg [Citrus clementina] Length = 258 Score = 110 bits (274), Expect = 1e-21 Identities = 94/261 (36%), Positives = 127/261 (48%), Gaps = 19/261 (7%) Frame = +2 Query: 272 LVQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQASKKHNSIN 451 ++ DQNLNI S G A GGK +SKA+KKG L GRK L D+SNS + Q+ KK NS N Sbjct: 9 IIHDQNLNIRSNGAAA-GGKSTVSKASKKGGLGGRKPLADLSNSVNLTLNQSLKKQNSNN 67 Query: 452 VIPVAKDIGGSKIAKAVGG--KLNLTNATEK--GHSRKALGDLTNSVKPSLHKHLSGKVE 619 + IG SK + G K + + A EK RKAL D++N KP LH+ K Sbjct: 68 F--ADRVIGASKSKIRIDGSEKKSFSKALEKLQTSGRKALSDISNWEKPHLHE-APKKNL 124 Query: 620 EKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLET-----------IGLGS 766 KLN+ EE + I EGF+H+HQ+CI+ Q K ++ L T + S Sbjct: 125 NAKLNIATEEDVS-DIAGEGFLHDHQECIKAQTKAVDIDEILRTSSFHFRCFFVMVNFRS 183 Query: 767 PVVVPQQP--LKLKPESPVKHLEMEEIPVVFLSDQD--KKARFSEPDCXXXXXXXXXXXX 934 +V L +P SP ++L ++E+P L D K RFS+ D Sbjct: 184 FGIVQINACFLLFQPVSPPRYLGLQELPEEQLEDPSPWKYDRFSDLDSPPPCRSLKSPNI 243 Query: 935 XAFEFWMDEENLFEFKLIESP 997 W D + +F L ESP Sbjct: 244 ----LWKDHD--ADFMLTESP 258 >gb|EMJ04444.1| hypothetical protein PRUPE_ppa019447mg [Prunus persica] Length = 292 Score = 104 bits (259), Expect = 7e-20 Identities = 69/148 (46%), Positives = 85/148 (57%), Gaps = 3/148 (2%) Frame = +2 Query: 311 GALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQASKKHNSINVIPVAKDIGGSKI 490 GA + GK D+ K +G L GRK L DISNSGKP QASKK +S NV V + KI Sbjct: 100 GASIVGKGDVLKTQMRG-LGGRKPLGDISNSGKPVLSQASKKQSSKNVPVVEEATSLPKI 158 Query: 491 AKAVGGKLNLTNATEK--GHSRKALGDLTNSVKPSLHKHLSGKVEEKKLNVTAEETI-PI 661 + A+EK HSR L ++NSVKP+L K+ S KL V AEE + P Sbjct: 159 IHDASTGKGVFKASEKVQTHSRNTLSHISNSVKPNLQKNHS-----MKLKVMAEEPLCPS 213 Query: 662 AIKEEGFMHNHQKCIETQRKGMNFGYFL 745 AI EEGF+HNHQ+CI+ Q K M+ L Sbjct: 214 AIAEEGFLHNHQECIKAQNKAMDLDQLL 241 >gb|EPS57956.1| hypothetical protein M569_16861 [Genlisea aurea] Length = 238 Score = 103 bits (257), Expect = 1e-19 Identities = 78/205 (38%), Positives = 111/205 (54%), Gaps = 7/205 (3%) Frame = +2 Query: 251 MATSASRL-VQDQNLNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQA 427 MATSA DQNLN+ S G+ K + KA KKG + R+ALNDISNS P + Sbjct: 1 MATSAHHTGTHDQNLNV-SHNGSTPARKTNFRKADKKGGCTSRRALNDISNSRNPLIREP 59 Query: 428 SKKHNSINV-IPVAKDI----GGSKIAKAVGGKLNLTNATEKG-HSRKALGDLTNSVKPS 589 KK N NV IP+ K+ G +K++ V T ++G SRK L D+TNS +P Sbjct: 60 VKKTNFTNVFIPIDKNNPSTPGTTKLSTRV------TEKKKRGVGSRKPLIDVTNSAEPC 113 Query: 590 LHKHLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGLGSP 769 L +H K +A+E +I +E F+H+HQKCIE + ++ YFL ++GL + Sbjct: 114 LQQH-------HKSTKSADECTSSSILDERFLHDHQKCIEILDQSVDKDYFLTSVGLSNA 166 Query: 770 VVVPQQPLKLKPESPVKHLEMEEIP 844 ++ L E K+L++EEIP Sbjct: 167 EDKHKEELSTTNELEEKNLKIEEIP 191 >emb|CAN66027.1| hypothetical protein VITISV_028775 [Vitis vinifera] Length = 459 Score = 101 bits (252), Expect = 5e-19 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 15/207 (7%) Frame = +2 Query: 290 LNIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGKPSALQASKKHNSINVIPVAK 469 +N T G + ++ K KK ++GRKAL D++NSGKPS ++ASK+H+S V + Sbjct: 76 MNFVKTFGCSQWKEANVYKVQKKQGIAGRKALGDLTNSGKPSPIKASKRHDSKIFTSVGE 135 Query: 470 DIGGSKIAKAVGGKLNLTNATEKGHS---RKALGDLTNSVKPSLHKHLSGKVEEKKLNVT 640 +I + + GK +++ A EK + RK L D++N+ + K +V Sbjct: 136 EIDAFRSKDTIRGKKSISKAQEKVQTSGRRKPLSDVSNT---------KNQKNVKLTSVM 186 Query: 641 AEETIPIAIKEEGFMHNHQKCIETQRKG-MNFGYFLETIGLGS-----------PVVVPQ 784 E +P +I EE F+H+H++CI+ Q M+ YFLETIGL P V + Sbjct: 187 KEYFLPNSIAEEQFLHDHEECIKAQNMNMMSKDYFLETIGLHKDFSMQLPSCHVPPVSSR 246 Query: 785 QPLKLKPESPVKHLEMEEIPVVFLSDQ 865 +P K++P SP K LE+EEI + + D+ Sbjct: 247 KP-KVQPGSPRK-LELEEIAELMIEDK 271 >gb|ESW15453.1| hypothetical protein PHAVU_007G073600g [Phaseolus vulgaris] Length = 257 Score = 92.0 bits (227), Expect = 4e-16 Identities = 86/271 (31%), Positives = 122/271 (45%), Gaps = 18/271 (6%) Frame = +2 Query: 251 MATSASRLVQDQNL-NIHSTGGALLGGKIDISKAAKKGVLSGRKALNDISNSGK------ 409 MA RL+Q+QNL N+H G + GK D+ +KG + GRK L D+SN+G Sbjct: 1 MAARNGRLLQNQNLINVHVHGAGSVSGKADLP-GQRKGRVGGRKPLGDLSNAGNLINQFD 59 Query: 410 -PSALQASKKHN--SINVIPVAKDIGGSKIAKAVGGKLNLTNATEKGHSRKALGDLTNSV 580 AL S S+N P + + K +G K + + T SRKAL D++NS Sbjct: 60 GKKALDGSLNIGKPSVNKAPKLQKSKNLETDKRIGNKASGKSLTG---SRKALSDISNSG 116 Query: 581 KPSLHKHLSGKVEEKKLNVTAEETIPIAIKEEGFMHNHQKCIETQRKGMNFGYFLETIGL 760 KP + ++ K K ++ E P AI EE +H+H+KCI++Q + + F +T+GL Sbjct: 117 KPQVPEN-KDKHTLKPCSLIEESLPPSAIAEEWILHDHKKCIKSQLENEDVHQFFKTVGL 175 Query: 761 GSPV-------VVPQQPLKLKPESPVKHLEMEEIPVVFLSDQDKKARFSEP-DCXXXXXX 916 KLK ES + E+EEIP Q A P +C Sbjct: 176 EDDADDHMAMSFELSAISKLKSES--AYFELEEIPEWLPERQSLSALCGSPTNCKTPGLS 233 Query: 917 XXXXXXXAFEFWMDEENLFEFKLIESPKSQK 1009 W D + FKLIE+PK K Sbjct: 234 TYR------TMWSD--STVNFKLIETPKLSK 256