BLASTX nr result
ID: Catharanthus22_contig00004751
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00004751
         (1806 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao]   336  2e-89
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...  335  5e-89
ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...  330  2e-87
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...  326  2e-86
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...  325  3e-86
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...  314  9e-83
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...  309  2e-81
gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theob...  307  1e-80
gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus pe...  301  5e-79
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...  301  6e-79
gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao]   300  1e-78
gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao]   299  3e-78
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...  276  2e-71
gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma caca...  268  8e-69
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...  268  8e-69
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...  260  1e-66
gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]    258  5e-66
gb|EXC33904.1| hypothetical protein L484_012794 [Morus notabilis]    255  5e-65
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ... 
249 2e-63 gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theob... 248 6e-63 >gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 430 Score = 336 bits (862), Expect = 2e-89 Identities = 197/442 (44%), Positives = 278/442 (62%), Gaps = 12/442 (2%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIR---SNFEEVPQLNSSEVDELVKSCAQQLESKVDQ 1492 +E S+S++ALDL+ IRSRI EL +I N +E L+ + ++L+K C+ ESKV Q Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNS-EKLLKDCSLHFESKVKQ 63 Query: 1491 IFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELG 1312 I DE+L H SR ++E+S+ LE L Sbjct: 64 IIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLE 123 Query: 1311 QLSCSLE---------VHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIK 1159 L +L+ V E DS +L++ K + Sbjct: 124 GLKYALDSIASQGMEGVEEDPCLDSSM---------------NDEDQSNLMHSNEEQKFE 168 Query: 1158 ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIP 979 I+EL+ IEK LKSL+DLD F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T+IP Sbjct: 169 IMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIP 228 Query: 978 DIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ 799 + +L + +ED+ E P E NHELL+E+ DGTME+K VE+FPNDVY+G+IIDA K FRQ Sbjct: 229 KLEGLLCQKTIEDISE-PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQ 287 Query: 798 LDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVG 619 L + L V +++SSLEWFV +VQDRI+LSTLRRF+VK NKSR+S EYL+RDE ++AHLVG Sbjct: 288 LSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVG 347 Query: 618 GIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSF 439 GIDA IK++QGWP+S +PL L+++KS + SR ISLS L K E+ NSLD H+R+++++F Sbjct: 348 GIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAF 407 Query: 438 VDGIEEILVHQM*DEIQPDHVS 373 VD +E++L+ QM ++Q D S Sbjct: 408 VDAVEKLLLEQMRLDLQSDDAS 429 >ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] gi|222847415|gb|EEE84962.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] Length = 429 Score = 335 bits (858), 
Expect = 5e-89 Identities = 190/434 (43%), Positives = 276/434 (63%), Gaps = 2/434 (0%) Frame = -1 Query: 1668 MEVESSNSAQALDLNCIRSRIQELKDIRS--NFEEVPQLNSSEVDELVKSCAQQLESKVD 1495 ME+ S + ++L+LN IRSRI EL++I N + ++NSS+ DEL+K AQQL SKV Sbjct: 1 MEISPSTTQESLNLNTIRSRINELEEIYRDCNADSFSEINSSDSDELMKDSAQQLVSKVS 60 Query: 1494 QIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESEL 1315 Q D +L H +R +EDSS+LE++L Sbjct: 61 QTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDL 120 Query: 1314 GQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLI 1135 + CSL++ S + R K +L+N K +IL+LD+ I Sbjct: 121 EWMKCSLDL-----ISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQI 175 Query: 1134 EKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSL 955 E+ LKS++DLD + + IE+IE++LS LKV+E++G CIRLSL+T+IP +L L Sbjct: 176 EESTRILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFL 234 Query: 954 QRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVV 775 Q++E+ P E NHE L+E+ +G+ME+KKVE+FPND+YIG+I+DA K FRQ+ L ++ Sbjct: 235 QKIEET-NVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALM 293 Query: 774 ESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKV 595 E+ SSLEWFVR+ QDRI+ STLRR + + + SR S+EYLDRDEI++AH+VGG+DA ++V Sbjct: 294 ETSSSLEWFVRKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEV 353 Query: 594 AQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEIL 415 +QGWPI+++PL L++LK+ + ++EISL FL KV E NSLD H R++++SFVD +E+IL Sbjct: 354 SQGWPITNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKIL 413 Query: 414 VHQM*DEIQPDHVS 373 V QM E+ D S Sbjct: 414 VEQMHLELHSDGTS 427 >ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera] gi|298205214|emb|CBI17273.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 330 bits (845), Expect = 2e-87 Identities = 189/415 (45%), Positives = 260/415 (62%) Frame = -1 Query: 1647 SAQALDLNCIRSRIQELKDIRSNFEEVPQLNSSEVDELVKSCAQQLESKVDQIFXXXXXX 1468 +A +DL+ IRSR+ EL I +N+ + N + L + + L+S+V+QI Sbjct: 6 
AAGTMDLDTIRSRMSELNRIHTNYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSDV 65 Query: 1467 XXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQLSCSLEV 1288 D +L H +R YVEDS++LES+L L S++ Sbjct: 66 ESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDF 125 Query: 1287 HELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEKKKDTLKS 1108 V G R + + +IL+L++ +K K TLKS Sbjct: 126 ----VASQGLKRAEAGALVDYSSSVEDQLDSRTAHGDNN--FEILDLNYQTQKNKITLKS 179 Query: 1107 LEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQRMEDLIEK 928 L+DLDY F+R E IEKIE+ L+ LKV+++EGNCIRLSL TFIP++ +L +++E + + Sbjct: 180 LQDLDYTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLLCEEKIE-AVNE 238 Query: 927 PLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVESRSSLEWF 748 P E NHELL+E+ D +MELK VEIFPNDVY+GEIIDA K R+L + + ++E+RSSLEWF Sbjct: 239 PSELNHELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSILETRSSLEWF 298 Query: 747 VRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQGWPISDA 568 VR+VQD+I+L LR+ +VK NKSR+SLEYLDRDEI++AH+VGG+DA IKV QGWP+S+ Sbjct: 299 VRKVQDKIILCALRQSIVKGANKSRHSLEYLDRDEIIVAHMVGGVDAYIKVCQGWPVSNN 358 Query: 567 PLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILVHQM 403 L L +LKS S+ ISLSFL KV E+ NSLD +R++I+SFVD IEEILV QM Sbjct: 359 ALKLKSLKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEEILVQQM 413 >ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus sinensis] Length = 444 Score = 326 bits (836), Expect = 2e-86 Identities = 182/437 (41%), Positives = 274/437 (62%), Gaps = 4/437 (0%) Frame = -1 Query: 1671 KMEVESS---NSAQALDLNCIRSRIQELKDI-RSNFEEVPQLNSSEVDELVKSCAQQLES 1504 ++EVE++ +S+ LDL+ +RS ++EL +I RS E+ P SS+ + L+K A ES Sbjct: 8 EVEVEATATPSSSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFES 67 Query: 1503 KVDQIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLE 1324 KV +I D +L+H +R VEDS +LE Sbjct: 68 KVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLE 127 Query: 1323 SELGQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELD 1144 S+L +L+C++++ + + DL+ + +ILEL+ Sbjct: 128 
SDLEELNCAIDLIVSENAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELE 187 Query: 1143 HLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSI 964 IEK K L SL+DLD+ +R + +E+IE+ L+ LKV++++G C RLS++T+IP + Sbjct: 188 SQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEES 247 Query: 963 LSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPL 784 ++ED+IE P E NHELL+E+ DGTME+K VE+FPNDV+I +++DA K FRQ L Sbjct: 248 SFQHKIEDVIE-PSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQL 306 Query: 783 PVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDAS 604 +E+ SSL+WF+R VQDRI+LSTLRRF+VK NKSR+ EY +RDE+++AHLVGG+DA Sbjct: 307 DSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHLVGGVDAF 366 Query: 603 IKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIE 424 IK +QGWP+S++PL +I+LK+ + S+ ISLSF +V E NSLD H+R++++SFVDG+E Sbjct: 367 IKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVE 426 Query: 423 EILVHQM*DEIQPDHVS 373 +IL+ QM E+ D+ S Sbjct: 427 KILLEQMRVELHYDNAS 443 >ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus sinensis] Length = 447 Score = 325 bits (834), Expect = 3e-86 Identities = 183/444 (41%), Positives = 274/444 (61%), Gaps = 11/444 (2%) Frame = -1 Query: 1671 KMEVESS---NSAQALDLNCIRSRIQELKDI-RSNFEEVPQLNSSEVDELVKSCAQQLES 1504 ++EVE++ +S+ LDL+ +RS ++EL +I RS E+ P SS+ + L+K A ES Sbjct: 8 EVEVEATATPSSSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFES 67 Query: 1503 KVDQIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLE 1324 KV +I D +L+H +R VEDS +LE Sbjct: 68 KVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLE 127 Query: 1323 SELGQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXD-------LLNPCRTCK 1165 S+L +L+C++++ + G K L+ + Sbjct: 128 SDLEELNCAIDL----IVSEGSQNAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHR 183 Query: 1164 IKILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTF 985 +ILEL+ IEK K L SL+DLD+ +R + +E+IE+ L+ LKV++++G C RLS++T+ Sbjct: 184 FEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTY 243 Query: 984 
IPDIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEF 805 IP + ++ED+IE P E NHELL+E+ DGTME+K VE+FPNDV+I +++DA K F Sbjct: 244 IPTLEESSFQHKIEDVIE-PSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSF 302 Query: 804 RQLDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHL 625 RQ L +E+ SSL+WF+R VQDRI+LSTLRRF+VK NKSR+ EY +RDE+++AHL Sbjct: 303 RQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHL 362 Query: 624 VGGIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSIT 445 VGG+DA IK +QGWP+S++PL +I+LK+ + S+ ISLSF +V E NSLD H+R++++ Sbjct: 363 VGGVDAFIKPSQGWPLSNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLS 422 Query: 444 SFVDGIEEILVHQM*DEIQPDHVS 373 SFVDG+E+IL+ QM E+ D+ S Sbjct: 423 SFVDGVEKILLEQMRVELHYDNAS 446 >ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis] gi|223542639|gb|EEF44176.1| conserved hypothetical protein [Ricinus communis] Length = 415 Score = 314 bits (804), Expect = 9e-83 Identities = 184/420 (43%), Positives = 255/420 (60%), Gaps = 2/420 (0%) Frame = -1 Query: 1635 LDLNCIRSRIQELKDIRSNFEEVPQLNSSEVDELVKSCAQQLESKVDQIFXXXXXXXXXX 1456 LDLN I I++L++I S ++ SS D++++ CA LESKV QI Sbjct: 5 LDLNSIICGIKDLEEIYSGCNGDTEMLSSHSDQVLEDCALHLESKVQQIMSECSDFNFLG 64 Query: 1455 XXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQLSCSLEVHELK 1276 D F++H +R ++ED ++LES++ L CSL+ K Sbjct: 65 IEDLDAFVEHLKEELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSK 124 Query: 1275 VFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEKKKDTLKSLEDL 1096 + K + R + +I +LD I K K LKSL+D Sbjct: 125 DVEKEK-------------EVACREDLYSTDAHRDYEFEISKLDDQIAKSKMILKSLQDF 171 Query: 1095 DYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQRMEDLIEKPLEQ 916 D F+R++ +E+IE LS LKV+E++G+CIRLSL+T++P + ++ + ED E P E Sbjct: 172 DSVFKRVDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTEDTAE-PSEV 230 Query: 915 NHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ--LDAPLPVVESRSSLEWFVR 742 NHELL+E+ GTMELK VEIFPND+YI +I+DA K FR+ L + L E+RSSL W VR Sbjct: 231 NHELLIEVVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETRSSLGWLVR 290 Query: 741 
RVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQGWPISDAPL 562 +VQDRI+ TLRR +VK NKSRYS EYLDRDE V+AHLVGG+DA IK++QGWP+S +PL Sbjct: 291 KVQDRIIQFTLRRLVVKSSNKSRYSFEYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPL 350 Query: 561 VLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILVHQM*DEIQPD 382 LI+LKS + S+EISLSFL +V E+ NSLD +R ++ SFV+ IE++LV QM E+ D Sbjct: 351 KLISLKSSNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHSD 410 >ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum lycopersicum] Length = 415 Score = 309 bits (792), Expect = 2e-81 Identities = 187/422 (44%), Positives = 253/422 (59%) Frame = -1 Query: 1668 MEVESSNSAQALDLNCIRSRIQELKDIRSNFEEVPQLNSSEVDELVKSCAQQLESKVDQI 1489 ME S N A +L R IQEL+DI+ + EE P+ E+ + ++ C Q ESKV+Q+ Sbjct: 1 MENRSYNDADSL-----RREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFESKVEQL 54 Query: 1488 FXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQ 1309 DEF ++ SR YVE SKL +E+ Sbjct: 55 LCDASEVNFSSDQDLDEFWNYLKNELSTEEAKNAKIADEIEGLSREYVEGYSKLVNEVEG 114 Query: 1308 LSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEK 1129 LSC LE+ E + G+ N KI EL + +EK Sbjct: 115 LSCLLELIESLGIEQGRALTNFPCSTPGEDKGNLSSAPVEHN------FKIFELGNQLEK 168 Query: 1128 KKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQR 949 K L+SLE+L+ F R E IEKIE+ S LK+V++EGN IRLSL+TFIP++ ++L Q Sbjct: 169 SKLNLESLEELESTFNRFEAIEKIEDAFSGLKIVQFEGNRIRLSLRTFIPNLENLLHNQT 228 Query: 948 MEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVES 769 + + +P EQNHELL+EL DGTMELK VEIFPNDV I EI D K RQ+ P+ V+E+ Sbjct: 229 IG--VAEPPEQNHELLIELVDGTMELKHVEIFPNDVSISEITDTAKSLRQVYFPVGVLEN 286 Query: 768 RSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQ 589 RSSLEW V+RVQDRI+LSTLRRFLVK N SR+S +Y++R+E ++AH+VGGIDA +K+ Q Sbjct: 287 RSSLEWLVKRVQDRIILSTLRRFLVKSANSSRHSFDYVEREETIVAHMVGGIDAFVKLPQ 346 Query: 588 GWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILVH 409 GWP++ + L L++LKS S++ISL+ L KV E NSLD + R++I+ F D +EEIL+ Sbjct: 347 
GWPLTCSGLTLMSLKSSSQYSQQISLTLLCKVAEAANSLDTNARQTISGFTDRVEEILMQ 406 Query: 408 QM 403 QM Sbjct: 407 QM 408 >gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 372 Score = 307 bits (786), Expect = 1e-80 Identities = 152/265 (57%), Positives = 210/265 (79%) Frame = -1 Query: 1167 KIKILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKT 988 K +I+EL+ IEK LKSL+DLD F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T Sbjct: 108 KFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQT 167 Query: 987 FIPDIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKE 808 +IP + +L + +ED+ E P E NHELL+E+ DGTME+K VE+FPNDVY+G+IIDA K Sbjct: 168 YIPKLEGLLCQKTIEDISE-PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKS 226 Query: 807 FRQLDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAH 628 FRQL + L V +++SSLEWFV +VQDRI+LSTLRRF+VK NKSR+S EYL+RDE ++AH Sbjct: 227 FRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAH 286 Query: 627 LVGGIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSI 448 LVGGIDA IK++QGWP+S +PL L+++KS + SR ISLS L K E+ NSLD H+R+++ Sbjct: 287 LVGGIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNL 346 Query: 447 TSFVDGIEEILVHQM*DEIQPDHVS 373 ++FVD +E++L+ QM ++Q D S Sbjct: 347 SAFVDAVEKLLLEQMRLDLQSDDAS 371 >gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] Length = 416 Score = 301 bits (772), Expect = 5e-79 Identities = 180/433 (41%), Positives = 259/433 (59%), Gaps = 4/433 (0%) Frame = -1 Query: 1668 MEVESSNSAQALDLNCIRSRIQELKDIRSNF--EEVPQLNSSEVDELVKSCAQQLESKVD 1495 ME + S++ LDLN I+ +++EL++I + ++ +L+ S+ D+L+++C L+S+V+ Sbjct: 1 MEEDPIPSSEPLDLNTIQRQVRELEEIIESCRQDDASELSPSDSDDLIRNCGLLLQSRVE 60 Query: 1494 QIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESEL 1315 QI + ++ R + ED ++L ++L Sbjct: 61 QIVSECSDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDL 120 Query: 1314 GQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTC--KIKILELDH 1141 QL CSL+ E K + K LL+P K ++LEL++ Sbjct: 121 
AQLKCSLDFVEEKDLEKAKLGADVDYHKCGKD---------LLDPMNVNADKFELLELEN 171 Query: 1140 LIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSIL 961 IEK LKSL+DL+ + L+ E+IE+ ++ LKV+ +EGNC+RLSL+T+IP + + Sbjct: 172 QIEKNNIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLF 231 Query: 960 SLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLP 781 S +++ D E P E NHELL+EL +GTM L+ VEIFPNDVYI +I+DA K R Sbjct: 232 SPKKVGDATE-PSEVNHELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR------- 283 Query: 780 VVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASI 601 +SSL+WFV +VQDRIVL T+RR +VK ENKSR+SLEYLD+DE V+AH+VGG+DA I Sbjct: 284 ----KSSLQWFVTKVQDRIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFI 339 Query: 600 KVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEE 421 KV QGWP+ +PL LI LKS S+ ISLSFL V EL NSL +R++++SFVD IE+ Sbjct: 340 KVPQGWPLLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEK 399 Query: 420 ILVHQM*DEIQPD 382 ILV QM EI D Sbjct: 400 ILVEQMCSEIHGD 412 >ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum] Length = 428 Score = 301 bits (771), Expect = 6e-79 Identities = 183/423 (43%), Positives = 246/423 (58%), Gaps = 13/423 (3%) Frame = -1 Query: 1632 DLNCIRSRIQELKDIRSNFEEVPQLNSSEVDELVKSCAQQLESKVDQIFXXXXXXXXXXX 1453 D++ R IQEL+DI+ + EE P+ E+ + ++ C Q E KV+QI Sbjct: 8 DVDSFRREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFERKVEQILCDASEISFSSD 66 Query: 1452 XXXD-------------EFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELG 1312 EF + SR YVE SKL +E+ Sbjct: 67 QDLGRKKAVHIFFFPPYEFWKYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIE 126 Query: 1311 QLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIE 1132 LSC LE+ E + G+ N K+ EL + +E Sbjct: 127 GLSCPLELIESLGLEQGRVLTNFPCSTPGEDKGNVSSAPVEQN------FKVFELGNQLE 180 Query: 1131 KKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQ 952 K K LKSLE+L+ F R E IEKIE+ S LK+VE+EGN IRLSL+TFIP++ ++L Q Sbjct: 181 KSKLNLKSLEELESTFNRFEAIEKIEDAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQ 240 Query: 951 
RMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVE 772 ++ + +P EQNHELL+EL DGTMELK VEIFPNDV I I D K RQ+ P+ V+E Sbjct: 241 TID--VAEPPEQNHELLIELMDGTMELKHVEIFPNDVSISYITDTAKSLRQVYFPVGVLE 298 Query: 771 SRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVA 592 +RSSLEWFV+ VQDRIVLSTLRRFLVK N SR+S +Y+DR+E ++AH+VGGIDA IK+ Sbjct: 299 NRSSLEWFVKGVQDRIVLSTLRRFLVKSANSSRHSFDYVDREETIVAHMVGGIDAFIKLP 358 Query: 591 QGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILV 412 QGWP++ + L L++LKS S++ISL+ L KV E+ N LD + R++I+ F D +EEIL+ Sbjct: 359 QGWPLTSSGLTLMSLKSSSQYSQQISLTLLCKVAEVANLLDTNERQTISGFTDRVEEILM 418 Query: 411 HQM 403 QM Sbjct: 419 QQM 421 >gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 432 Score = 300 bits (768), Expect = 1e-78 Identities = 181/404 (44%), Positives = 249/404 (61%), Gaps = 12/404 (2%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIR---SNFEEVPQLNSSEVDELVKSCAQQLESKVDQ 1492 +E S+S++ALDL+ IRSRI EL +I N +E L+ + ++L+K C+ ESKV Q Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNS-EKLLKDCSLHFESKVKQ 63 Query: 1491 IFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELG 1312 I DE+L H SR ++E+S+ LE L Sbjct: 64 IIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLE 123 Query: 1311 QLSCSLE---------VHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIK 1159 L +L+ V E DS +L++ K + Sbjct: 124 GLKYALDSIASQGMEGVEEDPCLDSSM---------------NDEDQSNLMHSNEEQKFE 168 Query: 1158 ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIP 979 I+EL+ IEK LKSL+DLD F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T+IP Sbjct: 169 IMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIP 228 Query: 978 DIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ 799 + +L + +ED+ E P E NHELL+E+ DGTME+K VE+FPNDVY+G+IIDA K FRQ Sbjct: 229 KLEGLLCQKTIEDISE-PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQ 287 Query: 798 LDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVG 619 L + L V +++SSLEWFV +VQDRI+LSTLRRF+VK NKSR+S EYL+RDE ++AHLVG Sbjct: 288 
LSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVG 347 Query: 618 GIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVE 487 GIDA IK++QGWP+S +PL L+++KS + SR ISLS L K E Sbjct: 348 GIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEE 391 >gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 392 Score = 299 bits (765), Expect = 3e-78 Identities = 180/401 (44%), Positives = 248/401 (61%), Gaps = 12/401 (2%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIR---SNFEEVPQLNSSEVDELVKSCAQQLESKVDQ 1492 +E S+S++ALDL+ IRSRI EL +I N +E L+ + ++L+K C+ ESKV Q Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNS-EKLLKDCSLHFESKVKQ 63 Query: 1491 IFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELG 1312 I DE+L H SR ++E+S+ LE L Sbjct: 64 IIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLE 123 Query: 1311 QLSCSLE---------VHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIK 1159 L +L+ V E DS +L++ K + Sbjct: 124 GLKYALDSIASQGMEGVEEDPCLDSSM---------------NDEDQSNLMHSNEEQKFE 168 Query: 1158 ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIP 979 I+EL+ IEK LKSL+DLD F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T+IP Sbjct: 169 IMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIP 228 Query: 978 DIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ 799 + +L + +ED+ E P E NHELL+E+ DGTME+K VE+FPNDVY+G+IIDA K FRQ Sbjct: 229 KLEGLLCQKTIEDISE-PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQ 287 Query: 798 LDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVG 619 L + L V +++SSLEWFV +VQDRI+LSTLRRF+VK NKSR+S EYL+RDE ++AHLVG Sbjct: 288 LSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVG 347 Query: 618 GIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRK 496 GIDA IK++QGWP+S +PL L+++KS + SR ISLS L K Sbjct: 348 GIDAFIKLSQGWPLSKSPLKLLSIKSSDHHSRGISLSLLCK 388 >ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus] gi|449527675|ref|XP_004170835.1| PREDICTED: uncharacterized protein LOC101229419 [Cucumis sativus] Length = 414 Score = 276 bits (706), 
Expect = 2e-71 Identities = 174/439 (39%), Positives = 253/439 (57%), Gaps = 8/439 (1%) Frame = -1 Query: 1674 EKMEVESSNSAQALDLNCIRSRIQEL-KDIRSNFEEVPQLNSSEVDELVKSCAQQLESKV 1498 E ME S +LDL +RS ++EL + + N E SE +L++ CA LES++ Sbjct: 4 ESMEATPS-VPPSLDLQAVRSELEELQRSLEENEESTTDSLGSE--KLLRECALHLESRI 60 Query: 1497 DQIFXXXXXXXXXXXXXXDE-FLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLES 1321 Q+ + +++H R +EDS+KL+ Sbjct: 61 QQVLSEYSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKM 120 Query: 1320 ELGQLSCSLEVH-----ELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCK-IK 1159 +L L SL+ E F+ G+ ++N R C + Sbjct: 121 DLEVLKLSLDRFPSQDPEEATFNCSSMNGEDPMNV-------------IVN--RECNAFE 165 Query: 1158 ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIP 979 +LEL+ IEK K LKSL+++D F+ L+ IE++E + +KV++ N IRLSL T IP Sbjct: 166 VLELESQIEKNKKILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIP 225 Query: 978 DIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ 799 ++ +LQR+E LIEK E +HEL++E+ DGTMELK EIFP DV++ +II+A+K Sbjct: 226 NVEDFSTLQRLEGLIEKS-ELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSI-- 282 Query: 798 LDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVG 619 S SSLEWFVR+VQDRIVL TLRRF VK NKS +S EYLD+DE+++ ++G Sbjct: 283 ---------SNSSLEWFVRKVQDRIVLCTLRRFAVKSANKSCHSFEYLDQDEMIMCSMIG 333 Query: 618 GIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSF 439 GIDA IKV+QGWP++D+PL LI+LKS + ++ +SLS + KV ++ NSLDAH+RR+++SF Sbjct: 334 GIDACIKVSQGWPLADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSF 393 Query: 438 VDGIEEILVHQM*DEIQPD 382 D +E+IL QM E+Q D Sbjct: 394 ADAVEKILKEQMHLELQAD 412 >gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713298|gb|EOY05195.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 369 Score = 268 bits (684), Expect = 8e-69 Identities = 163/370 (44%), Positives = 225/370 (60%), Gaps = 12/370 (3%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIR---SNFEEVPQLNSSEVDELVKSCAQQLESKVDQ 1492 +E S+S++ALDL+ IRSRI EL +I N +E L+ + ++L+K C+ ESKV Q Sbjct: 5 
MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNS-EKLLKDCSLHFESKVKQ 63 Query: 1491 IFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELG 1312 I DE+L H SR ++E+S+ LE L Sbjct: 64 IIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLE 123 Query: 1311 QLSCSLE---------VHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIK 1159 L +L+ V E DS +L++ K + Sbjct: 124 GLKYALDSIASQGMEGVEEDPCLDSSM---------------NDEDQSNLMHSNEEQKFE 168 Query: 1158 ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIP 979 I+EL+ IEK LKSL+DLD F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T+IP Sbjct: 169 IMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIP 228 Query: 978 DIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQ 799 + +L + +ED+ E P E NHELL+E+ DGTME+K VE+FPNDVY+G+IIDA K FRQ Sbjct: 229 KLEGLLCQKTIEDISE-PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQ 287 Query: 798 LDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVG 619 L + L V +++SSLEWFV +VQDRI+LSTLRRF+VK NKSR+S EYL+RDE ++AHLVG Sbjct: 288 LSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVG 347 Query: 618 GIDASIKVAQ 589 GIDA IK++Q Sbjct: 348 GIDAFIKLSQ 357 >ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] gi|482566470|gb|EOA30659.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] Length = 420 Score = 268 bits (684), Expect = 8e-69 Identities = 160/433 (36%), Positives = 248/433 (57%), Gaps = 1/433 (0%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIRSNFEEVP-QLNSSEVDELVKSCAQQLESKVDQIF 1486 +E +LDL IRSR++EL+ I N + P + +S+ + LV+ Q E+KV++I Sbjct: 1 MEEDTHDGSLDLQQIRSRVKELESIHRNCKYEPGESCTSDSENLVQDFVLQFETKVNEIV 60 Query: 1485 XXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQL 1306 D +L++ SR + EDSS+LE +L L Sbjct: 61 EDYSDVDILDVEDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSSRLERDLEGL 120 Query: 1305 SCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEKK 1126 SL+ + + K K K+ EL++ +E+K Sbjct: 121 LLSLDSMSSQDVNKSKESPPSCSSMEVCEVNDDD------------KFKMFELENQMEEK 168 Query: 1125 
KDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQRM 946 + LKSLEDLD +R + E++E+ L+ LKV+E++GN IRL L+T+IP++ + + + Sbjct: 169 RMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPELDGLPAQHKF 228 Query: 945 EDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVESR 766 E KP E HELL+ L D T E+ K+E+FPNDVYIG+II+A FRQ+ V+++R Sbjct: 229 EHTT-KPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEAADSFRQVRLHSAVLDTR 287 Query: 765 SSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQG 586 SS++W V +VQDRI+ +TLR+++V R++ +Y D+DE ++AH+ GGIDA +KV+ G Sbjct: 288 SSVQWVVAKVQDRIITTTLRKYIVTSSKTMRHTFKYYDKDETIVAHIAGGIDAFLKVSDG 347 Query: 585 WPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILVHQ 406 WP+ ++PL L +LK+ N S+ ISLS + KV EL NSLD R++++ F+D IE+ILVHQ Sbjct: 348 WPLLNSPLKLASLKNSDNQSKGISLSLICKVEELANSLDLQTRQNLSGFIDAIEKILVHQ 407 Query: 405 M*DEIQPDHVS*K 367 +E+Q + S K Sbjct: 408 TREELQSNDSSQK 420 >ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. 
lyrata] Length = 421 Score = 260 bits (665), Expect = 1e-66 Identities = 157/416 (37%), Positives = 236/416 (56%), Gaps = 1/416 (0%) Frame = -1 Query: 1635 LDLNCIRSRIQELKDIRSNFEEVP-QLNSSEVDELVKSCAQQLESKVDQIFXXXXXXXXX 1459 LDL IRSR++EL+ I N + P + SS+ + LV+ Q E KV +I Sbjct: 10 LDLQEIRSRVKELEFIHRNCRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLL 69 Query: 1458 XXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQLSCSLEVHEL 1279 D +L++ S+ + +DSS+LE +L L SL+ Sbjct: 70 DVEDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSS 129 Query: 1278 KVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEKKKDTLKSLED 1099 + + K K K+ EL++ +E+K+ LKSLED Sbjct: 130 QDVEKSKENQPSSSSMEVCEVNDDD------------KFKMFELENQMEEKRSILKSLED 177 Query: 1098 LDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQRMEDLIEKPLE 919 LD +R + E++E+ L+ LKV+E++GN IRL L+T+IP + S+L Q+ E E P E Sbjct: 178 LDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTE-PSE 236 Query: 918 QNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVESRSSLEWFVRR 739 HELL+ L D T E+ K E+FPNDVYIG+II+A FRQ+ V+++RSS++W V + Sbjct: 237 LIHELLIYLKDKTTEITKFEMFPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAK 296 Query: 738 VQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQGWPISDAPLV 559 VQDRI+ STLR++LV R++ EY ++DE ++ H+ GGIDA +KV+ GWP+ + PL Sbjct: 297 VQDRIISSTLRKYLVTSSKTIRHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLK 356 Query: 558 LITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEILVHQM*DEI 391 L +LK+ N S+ ISLS + KV +L NSLD R++++ F+D IE+ILV Q +E+ Sbjct: 357 LESLKNSDNQSKGISLSLICKVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREEL 412 >gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis] Length = 550 Score = 258 bits (660), Expect = 5e-66 Identities = 166/434 (38%), Positives = 248/434 (57%), Gaps = 2/434 (0%) Frame = -1 Query: 1677 EEKMEVESSNSAQA-LDLNCIRSRIQELKDIRSNFEE-VPQLNSSEVDELVKSCAQQLES 1504 E ME+ +S LDL+ IRSR +EL+++ S+ E+ +L S++++LVK CA + +S Sbjct: 135 ENAMEIVPPSSEHLDLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQS 194 Query: 1503 
KVDQIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLE 1324 ++++I D L+H +R Y EDS++LE Sbjct: 195 RMEEIGSEWSDVSFLEDKDFDACLEHLGEELNLVEAENSRMSEEIEILTRTYAEDSNQLE 254 Query: 1323 SELGQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELD 1144 EL L ++++ L+ ++ K + + +LEL+ Sbjct: 255 IELEGLKSAMDLTALQDLENAKLGACDDYPRNTEDK-------------QHLVLHLLELE 301 Query: 1143 HLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSI 964 + I+KK LKSLEDLD + + IE+IE++L+ +KV+ E NCIR SL+T+IP++ SI Sbjct: 302 NEIKKKNIILKSLEDLDGICKWFDAIEQIEDILTSVKVIALEENCIRFSLQTYIPNLESI 361 Query: 963 LSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPL 784 LS Q +E + P E ELL+EL + T++ K EIFPNDVYI I +A K F Sbjct: 362 LSQQTIE-AVNVPFEVKLELLIELLEWTLDQKNAEIFPNDVYINNISNAAKCF------- 413 Query: 783 PVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDAS 604 S+ SL+WFV +VQDRIV T+R+ +VK NKS YSLEY D+DE+++AHL GG+DA Sbjct: 414 ----SKCSLQWFVTKVQDRIVSCTMRQLVVKSANKSGYSLEYFDKDEVMVAHLAGGVDAF 469 Query: 603 IKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIE 424 IKV+QGWP+S++PL L +LKS + ++ I FL KV E NSL H+ +++SFVD ++ Sbjct: 470 IKVSQGWPLSNSPLKLTSLKSSDHNTKGIPSIFLCKVEERVNSLAVHICHNLSSFVDAVD 529 Query: 423 EILVHQM*DEIQPD 382 +IL Q EI D Sbjct: 530 KILTEQKQLEIGYD 543 >gb|EXC33904.1| hypothetical protein L484_012794 [Morus notabilis] Length = 412 Score = 255 bits (651), Expect = 5e-65 Identities = 167/440 (37%), Positives = 251/440 (57%), Gaps = 8/440 (1%) Frame = -1 Query: 1677 EEKMEVESSNSAQALDLNCIRSRIQELKDIRSNFEEVP-QLNSSEVDELVKSCAQQLESK 1501 E ME+ +S + LDL+ IRSR +EL+++ S+ E+ +L S++++LVK CA + +S+ Sbjct: 2 ENAMEIVPPSS-EHLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQSR 60 Query: 1500 VDQIFXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLES 1321 +++I D L+H +R Y EDS++LE Sbjct: 61 MEEIGSEWSDVSFLEDKGFDACLEHLGEELNLVEAENSIMSEKIEVLTRTYAEDSNQLEI 120 Query: 1320 ELGQLSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPC----RTCKIK-- 1159 EL L +++ L+ + K L C R + K Sbjct: 121 
ELEGLKNVMDLTALQDLGNAK-----------------------LGACDDYPRNTEDKQH 157 Query: 1158 -ILELDHLIEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFI 982 +LEL+ I++K LKSLEDLD + + IE+IE++L+ +KV+ E NCIR SL+T+I Sbjct: 158 SLLELEKEIKQKNIILKSLEDLDGICKWFDAIEQIEDILTGVKVIALEENCIRFSLQTYI 217 Query: 981 PDIGSILSLQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFR 802 P++ S L LQ+ + + P E HELL+EL + T++ K VEIFPNDVY+ I +A K+F Sbjct: 218 PNLESFL-LQQTIEAVNVPFEVKHELLIELLEWTLDQKNVEIFPNDVYLNNISNAAKDF- 275 Query: 801 QLDAPLPVVESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLV 622 S+ SL+WFV +VQDRIV T+R+ +VK N S YSLEY D+DE+++AHL Sbjct: 276 ----------SKCSLQWFVTKVQDRIVSCTMRQLVVKSANTSGYSLEYFDKDEVMVAHLA 325 Query: 621 GGIDASIKVAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITS 442 GG+DA IKV+QGWP+S++PL L +LKS + ++ I FL KV E NSL H+ ++++S Sbjct: 326 GGVDAFIKVSQGWPLSNSPLKLTSLKSSDHNTKGIPSIFLFKVKERVNSLAVHICQNLSS 385 Query: 441 FVDGIEEILVHQM*DEIQPD 382 FVD +++IL Q EI D Sbjct: 386 FVDAVDKILTEQKQLEIGYD 405 >ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1| uncharacterized protein AT3G23910 [Arabidopsis thaliana] Length = 421 Score = 249 bits (637), Expect = 2e-63 Identities = 158/437 (36%), Positives = 240/437 (54%), Gaps = 5/437 (1%) Frame = -1 Query: 1662 VESSNSAQALDLNCIRSRIQELKDIRSNFEEVP-QLNSSEVDELV-KSCAQQLESKVDQI 1489 +E +LDL IR R++EL N E P + SS+ + LV + Q E KV +I Sbjct: 1 MEEETHDGSLDLQEIRRRVKELDFFPRNCREEPVESCSSDYETLVVQDFVLQFEPKVKEI 60 Query: 1488 FXXXXXXXXXXXXXXDEFLDHXXXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQ 1309 D +L++ S+ + +DSS+L+ +L Sbjct: 61 VEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEG 120 Query: 1308 LSCSLEVHELKVFDSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTC---KIKILELDHL 1138 
L SL+ + + K + C K K+ EL++ Sbjct: 121 LLLSLDSMSSQDVEKSKENQPSSSS---------------MEVCEVIDDDKFKMFELENQ 165 Query: 1137 IEKKKDTLKSLEDLDYKFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILS 958 +E+K+ LKSLEDLD +R + E++E+ L+ LKV+E++GN IRL L+T+I + L Sbjct: 166 MEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLG 225 Query: 957 LQRMEDLIEKPLEQNHELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPV 778 Q D I +P E HELL+ L D T E+ K E+FPND+YIG+II+A FRQ+ V Sbjct: 226 -QHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAV 284 Query: 777 VESRSSLEWFVRRVQDRIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIK 598 +++RSS++W V +VQD+I+ +TLR+++V RY+ EY D+DE ++AH+ GGIDA +K Sbjct: 285 LDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTIRYTFEYYDKDETIVAHIAGGIDAFLK 344 Query: 597 VAQGWPISDAPLVLITLKSPINASREISLSFLRKVVELGNSLDAHVRRSITSFVDGIEEI 418 V+ GWP+ + PL L +LK+ N S+ ISLS + KV EL NSLD R++++ F+D IE+I Sbjct: 345 VSDGWPLLNTPLKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKI 404 Query: 417 LVHQM*DEIQPDHVS*K 367 LV Q +E+Q + S K Sbjct: 405 LVEQTREELQSNKSSQK 421 >gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] Length = 343 Score = 248 bits (633), Expect = 6e-63 Identities = 150/347 (43%), Positives = 207/347 (59%), Gaps = 9/347 (2%) Frame = -1 Query: 1602 ELKDIRSNFEEVPQLNSSEVDELVKSCAQQLESKVDQIFXXXXXXXXXXXXXXDEFLDHX 1423 E+ I N +E L+ + ++L+K C+ ESKV QI DE+L H Sbjct: 2 EIHRIDKNKDEGEALSLNS-EKLLKDCSLHFESKVKQIIEEYSDVGFLGIEDLDEYLAHL 60 Query: 1422 XXXXXXXXXXXXXXXXXXXXXSRRYVEDSSKLESELGQLSCSLE---------VHELKVF 1270 SR ++E+S+ LE L L +L+ V E Sbjct: 61 KEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEGLKYALDSIASQGMEGVEEDPCL 120 Query: 1269 DSGKYRGKXXXXXXXXXXXXXXXXXDLLNPCRTCKIKILELDHLIEKKKDTLKSLEDLDY 1090 DS +L++ K +I+EL+ IEK LKSL+DLD Sbjct: 121 DSSM---------------NDEDQSNLMHSNEEQKFEIMELESQIEKNNIILKSLQDLDS 165 Query: 1089 KFRRLETIEKIENLLSCLKVVEYEGNCIRLSLKTFIPDIGSILSLQRMEDLIEKPLEQNH 910 F+RL+T+E+IE+ L+ LKV+ ++GNCIRLSL+T+IP + +L + +ED+ E P E NH Sbjct: 166 MFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE-PSEMNH 224 Query: 909 
ELLLELGDGTMELKKVEIFPNDVYIGEIIDATKEFRQLDAPLPVVESRSSLEWFVRRVQD 730 ELL+E+ DGTME+K VE+FPNDVY+G+IIDA K FRQL + L V +++SSLEWFV +VQD Sbjct: 225 ELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWFVGKVQD 284 Query: 729 RIVLSTLRRFLVKCENKSRYSLEYLDRDEIVIAHLVGGIDASIKVAQ 589 RI+LSTLRRF+VK NKSR+S EYL+RDE ++AHLVGGIDA IK++Q Sbjct: 285 RIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 331