BLASTX nr result
ID: Akebia24_contig00028311
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00028311
         (1483 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   419   e-114
ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma...   400   e-109
ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [...   388   e-105
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   377   e-102
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   373   e-100
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   369   3e-99
ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma...   361   4e-97
ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma...   359   2e-96
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   344   5e-92
ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun...   342   3e-91
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   335   4e-89
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   334   5e-89
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   325   2e-86
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...   325   2e-86
ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma...   321   6e-85
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...   319   2e-84
ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [...   318   3e-84
dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana]        315   4e-83
ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr...
308 5e-81 ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part... 305 3e-80 >ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera] gi|298205214|emb|CBI17273.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 419 bits (1078), Expect = e-114 Identities = 221/390 (56%), Positives = 290/390 (74%), Gaps = 4/390 (1%) Frame = -1 Query: 1378 DFTELSSSS--ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205 +++ +S S+ +S +F++F L+S++ QI DAY+ H K+ELNLV Sbjct: 28 NYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSDVESLEADDLDAYLGHLKKELNLV 87 Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025 E+EN + SNEIE TRTY+ED+ +LES+LE L +S+ ++SQGL + E A V+ S E Sbjct: 88 ESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDFVASQGLKRAEAGALVDYSSSVE 147 Query: 1024 NQ-GSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848 +Q S AH D NFE+L+L+++ +KNK+ L SL DLDYTFKR E++ KIED L GLKVI+ Sbjct: 148 DQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVID 207 Query: 847 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668 FEGNCI+LSL T++P LEGLLC++K+E +P ++HEL IE+MD +MELKN EIFP+D+ Sbjct: 208 FEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDV 267 Query: 667 FIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491 ++GEIIDAAK+ R+ + +S+ E RSSLEW VRKVQ +IIL LRQ +VK AN SRH E Sbjct: 268 YLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQDKIILCALRQSIVKGANKSRHSLE 327 Query: 490 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311 Y DRD I AHMVGG+DA+IK+ Q WPV N+ALKL SL SS+ SKGISLSFLCKVEE+ Sbjct: 328 YLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKSL-KSSDQQSKGISLSFLCKVEEM 386 Query: 310 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 221 NSLDV R+N+SSF DAIEEILVQQM+S+ Sbjct: 387 ANSLDVSIRKNISSFVDAIEEILVQQMQSK 416 >ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508713296|gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 430 Score = 400 bits (1028), Expect = e-109 Identities = 213/428 (49%), Positives = 
298/428 (69%), Gaps = 3/428 (0%) Frame = -1 Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298 E ME S SSE +D+ + + E + S S K+ KD L ESK+K Sbjct: 3 EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62 Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118 QI D Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NL Sbjct: 63 QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122 Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941 EGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + Sbjct: 123 EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182 Query: 940 LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761 L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E Sbjct: 183 LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242 Query: 760 TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584 ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLE Sbjct: 243 SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302 Query: 583 WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404 W V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ Sbjct: 303 WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362 Query: 403 NSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRS 224 S LKL+S+ SS++HS+GISLS LCK EE+ NSLD+ RQNLS+F DA+E++L++QMR Sbjct: 363 KSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMRL 421 Query: 223 ERQSGHIS 200 + QS S Sbjct: 422 DLQSDDAS 429 >ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] gi|508713299|gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 372 Score = 388 bits (996), Expect = e-105 Identities = 197/372 (52%), Positives = 276/372 (74%), Gaps = 2/372 (0%) Frame = -1 Query: 1309 SKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRL 1130 SK+KQI D Y+ H KEELN VEAE+ + SNEIE +R +IE++ L Sbjct: 1 
SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60 Query: 1129 ESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEK 953 E NLEGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEK Sbjct: 61 EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120 Query: 952 NKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQK 773 N + L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ Sbjct: 121 NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180 Query: 772 MEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPR 596 +E ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + + Sbjct: 181 IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240 Query: 595 SSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQS 416 SSLEW V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q Sbjct: 241 SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300 Query: 415 WPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQ 236 WP+ S LKL+S+ SS++HS+GISLS LCK EE+ NSLD+ RQNLS+F DA+E++L++ Sbjct: 301 WPLSKSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLE 359 Query: 235 QMRSERQSGHIS 200 QMR + QS S Sbjct: 360 QMRLDLQSDDAS 371 >ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis] gi|223542639|gb|EEF44176.1| conserved hypothetical protein [Ricinus communis] Length = 415 Score = 377 bits (968), Expect = e-102 Identities = 206/394 (52%), Positives = 276/394 (70%), Gaps = 3/394 (0%) Frame = -1 Query: 1384 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205 CN TE+ SS S ++ +D L LESK++QI DA+VEH KEEL+ Sbjct: 24 CNGDTEMLSS-HSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKEELSTT 82 Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025 +E + S EIE R ++ED TRLES++E L SL ISS+ ++K + A E ST+ Sbjct: 83 MSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSKDVEKEKEVACREDLYSTD 142 Query: 1024 NQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEF 845 AH+DY FE+ +LD +I K+K+ L SL D D FKR+++V +IE+ L GLKVIEF 
Sbjct: 143 ------AHRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEF 196 Query: 844 EGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIF 665 +G+CI+LSL+TY+P L+ ++CQ K E +P V+HEL IE++ GTMELKN EIFP+DI+ Sbjct: 197 DGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIEVVSGTMELKNVEIFPNDIY 256 Query: 664 IGEIIDAAKALRQ---YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWF 494 I +I+DAAK+ R+ Y L+ SE RSSL W+VRKVQ +II TLR+L+VK +N SR+ F Sbjct: 257 ISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRIIQFTLRRLVVKSSNKSRYSF 316 Query: 493 EYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEE 314 EY DRD T+ AH+VGG+DAFIK+ Q WPV S LKL+SL SS+ +HSK ISLSFLC+VEE Sbjct: 317 EYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKSSN-HHSKEISLSFLCRVEE 375 Query: 313 LVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 212 +VNSLD++ R NL SF + IE++LV+QMR E S Sbjct: 376 VVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409 >ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus sinensis] Length = 447 Score = 373 bits (957), Expect = e-100 Identities = 201/390 (51%), Positives = 270/390 (69%), Gaps = 11/390 (2%) Frame = -1 Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178 SS+S + K++ D ESK+K+I DAY+EH KEEL VEAE+ + SN Sbjct: 51 SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110 Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD------ASVEGFIS---TE 1025 EIE TRT +ED+ RLES+LE LN ++ I S+G + D A E + TE Sbjct: 111 EIETLTRTQVEDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTE 170 Query: 1024 NQGSSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848 +Q H+D+ FE+LEL+ +IEKNK+ L+SL DLD+ KR ++V +IED+L GLKVI+ Sbjct: 171 DQSDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVID 230 Query: 847 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668 F+G C +LS++TY+PTLE Q K+E +P V+HEL IE++DGTME+KN E+FP+D+ Sbjct: 231 FDGKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDV 290 Query: 667 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491 I +++DAAK+ RQ L E SSL+W +R VQ 
+IILSTLR+ +VK AN SRH+FE Sbjct: 291 HISDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFE 350 Query: 490 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311 Y +RD I AH+VGG+DAFIK Q WP+ NS LK++SL +S++HSKGISLSF C+VEE Sbjct: 351 YFERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEA 409 Query: 310 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 221 NSLDV RQNLSSF D +E+IL++QMR E Sbjct: 410 ANSLDVHIRQNLSSFVDGVEKILLEQMRVE 439 >ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus sinensis] Length = 444 Score = 369 bits (946), Expect = 3e-99 Identities = 199/387 (51%), Positives = 268/387 (69%), Gaps = 8/387 (2%) Frame = -1 Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178 SS+S + K++ D ESK+K+I DAY+EH KEEL VEAE+ + SN Sbjct: 51 SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110 Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEM---DASVEGFIS---TENQG 1016 EIE TRT +ED+ RLES+LE LN ++ I S+ + A E + TE+Q Sbjct: 111 EIETLTRTQVEDSDRLESDLEELNCAIDLIVSENAKEDRQAVCPARGEDQVCPTHTEDQS 170 Query: 1015 SSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEG 839 H+D+ FE+LEL+ +IEKNK+ L+SL DLD+ KR ++V +IED+L GLKVI+F+G Sbjct: 171 DLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDG 230 Query: 838 NCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIG 659 C +LS++TY+PTLE Q K+E +P V+HEL IE++DGTME+KN E+FP+D+ I Sbjct: 231 KCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHIS 290 Query: 658 EIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSD 482 +++DAAK+ RQ L E SSL+W +R VQ +IILSTLR+ +VK AN SRH+FEY + Sbjct: 291 DLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFE 350 Query: 481 RDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNS 302 RD I AH+VGG+DAFIK Q WP+ NS LK++SL +S++HSKGISLSF C+VEE NS Sbjct: 351 RDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEAANS 409 Query: 301 LDVKTRQNLSSFADAIEEILVQQMRSE 221 LDV RQNLSSF D +E+IL++QMR E Sbjct: 410 LDVHIRQNLSSFVDGVEKILLEQMRVE 
436 >ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508713301|gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 432 Score = 361 bits (927), Expect = 4e-97 Identities = 193/392 (49%), Positives = 270/392 (68%), Gaps = 3/392 (0%) Frame = -1 Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298 E ME S SSE +D+ + + E + S S K+ KD L ESK+K Sbjct: 3 EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62 Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118 QI D Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NL Sbjct: 63 QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122 Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941 EGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + Sbjct: 123 EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182 Query: 940 LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761 L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E Sbjct: 183 LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242 Query: 760 TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584 ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLE Sbjct: 243 SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302 Query: 583 WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404 W V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ Sbjct: 303 WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362 Query: 403 NSALKLVSLNSSSENHSKGISLSFLCKVEELV 308 S LKL+S+ SS++HS+GISLS LCK EE + Sbjct: 363 KSPLKLLSI-KSSDHHSRGISLSLLCKAEEAI 393 >ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508713300|gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 392 Score = 359 bits (921), Expect = 2e-96 Identities = 192/391 (49%), Positives = 269/391 (68%), Gaps = 3/391 (0%) Frame = -1 Query: 1474 
ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298 E ME S SSE +D+ + + E + S S K+ KD L ESK+K Sbjct: 3 EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62 Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118 QI D Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NL Sbjct: 63 QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122 Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941 EGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + Sbjct: 123 EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182 Query: 940 LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761 L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E Sbjct: 183 LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242 Query: 760 TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584 ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLE Sbjct: 243 SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302 Query: 583 WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVL 404 W V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ Sbjct: 303 WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLS 362 Query: 403 NSALKLVSLNSSSENHSKGISLSFLCKVEEL 311 S LKL+S+ SS++HS+GISLS LCK E + Sbjct: 363 KSPLKLLSI-KSSDHHSRGISLSLLCKAERV 392 >ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] gi|222847415|gb|EEE84962.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] Length = 429 Score = 344 bits (883), Expect = 5e-92 Identities = 196/421 (46%), Positives = 277/421 (65%), Gaps = 5/421 (1%) Frame = -1 Query: 1459 SHSSERVDIETXXXXXXXXXXXXXSCN--DFTELSSSSESVKIFKDFFLDLESKIKQITX 1286 S + E +++ T CN F+E++SS +S ++ KD L SK+ Q Sbjct: 6 STTQESLNLNTIRSRINELEEIYRDCNADSFSEINSS-DSDELMKDSAQQLVSKVSQTVT 64 Query: 1285 XXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLN 1106 DAY+ H KEEL+ EAE+ + SNEIE+ RT +ED++ LE++LE + Sbjct: 65 
EYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDLEWMK 124 Query: 1105 YSLQSISSQ-GLDKLEMDASVEGFISTENQGSSY-AHQDYNFELLELDHEIEKNKVALSS 932 SL ISSQ +K + D +E F S ENQ + +++ FE+L+LD++IE++ L S Sbjct: 125 CSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTRILKS 184 Query: 931 LHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDP 752 + DLD K +++ +IED L GLKVIEF+G CI+LSL+TY+P + +L QK+E P Sbjct: 185 MQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEETNVP 243 Query: 751 FAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVV 575 + ++HE IE+ +G+ME+K E+FP+DI+IG+I+DAAK+ RQ + L++ E SSLEW V Sbjct: 244 YEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSLEWFV 303 Query: 574 RKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSA 395 RK Q +II STLR+L+ + A+ SR EY DRD I AHMVGG+DAF+++ Q WP+ NS Sbjct: 304 RKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPITNSP 363 Query: 394 LKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQ 215 LKLVSL +S+ +H+K ISL FLCKVEE NSLDV TRQNLSSF D++E+ILV+QM E Sbjct: 364 LKLVSLKNSN-HHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHLELH 422 Query: 214 S 212 S Sbjct: 423 S 423 >ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] gi|462422632|gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] Length = 416 Score = 342 bits (877), Expect = 3e-91 Identities = 195/413 (47%), Positives = 271/413 (65%), Gaps = 2/413 (0%) Frame = -1 Query: 1453 SSERVDIETXXXXXXXXXXXXXSC--NDFTELSSSSESVKIFKDFFLDLESKIKQITXXX 1280 SSE +D+ T SC +D +ELS S +S + ++ L L+S+++QI Sbjct: 8 SSEPLDLNTIQRQVRELEEIIESCRQDDASELSPS-DSDDLIRNCGLLLQSRVEQIVSEC 66 Query: 1279 XXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYS 1100 +AYV ++ELN VEAE+ + SN IE RT+ ED RL ++L L S Sbjct: 67 SDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCS 126 Query: 1099 LQSISSQGLDKLEMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDL 920 L + + L+K ++ A V+ ++ FELLEL+++IEKN + L SL DL Sbjct: 127 
LDFVEEKDLEKAKLGADVDYHKCGKDLLDPMNVNADKFELLELENQIEKNNIILKSLQDL 186 Query: 919 DYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVD 740 + T K +++ +IED + GLKVI FEGNC++LSL+TY+P LE L +K+ AT+P V+ Sbjct: 187 ECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKVGDATEPSEVN 246 Query: 739 HELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCPLSVSEPRSSLEWVVRKVQH 560 HEL IEL++GTM L+N EIFP+D++I +I+DAAK+LR +SSL+W V KVQ Sbjct: 247 HELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSSLQWFVTKVQD 296 Query: 559 QIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVS 380 +I+L T+R+L+VK+ N SRH EY D+D T+ AH+VGG+DAFIK+PQ WP+L+S LKL+ Sbjct: 297 RIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWPLLSSPLKLIY 356 Query: 379 LNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 221 L SS+ HSKGISLSFLC V+EL NSL V+ RQ LSSF DAIE+ILV+QM SE Sbjct: 357 L-KSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMCSE 408 >ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] gi|482566470|gb|EOA30659.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] Length = 420 Score = 335 bits (858), Expect = 4e-89 Identities = 183/389 (47%), Positives = 258/389 (66%), Gaps = 1/389 (0%) Frame = -1 Query: 1363 SSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRT 1184 S +S+S + +DF L E+K+ +I DAY+E+ ++EL+ VEAE+ + Sbjct: 36 SCTSDSENLVQDFVLQFETKVNEIVEDYSDVDILDVEDSDAYLEYLRKELHSVEAESAKV 95 Query: 1183 SNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSSYA 1004 S EIE +R++ ED++RLE +LEGL SL S+SSQ ++K S E S + Sbjct: 96 SEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQDVNK-----SKESPPSCSSMEVCEV 150 Query: 1003 HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKL 824 + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+EF+GN I+L Sbjct: 151 NDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRL 210 Query: 823 SLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDA 644 L+TY+P L+GL Q K E+ T P + HEL I L D T E+ E+FP+D++IG+II+A Sbjct: 211 QLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEA 270 Query: 
643 AKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATI 467 A + RQ +V + RSS++WVV KVQ +II +TLR+ +V + RH F+Y D+D TI Sbjct: 271 ADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVTSSKTMRHTFKYYDKDETI 330 Query: 466 TAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKT 287 AH+ GGIDAF+K+ WP+LNS LKL SL +S+N SKGISLS +CKVEEL NSLD++T Sbjct: 331 VAHIAGGIDAFLKVSDGWPLLNSPLKLASL-KNSDNQSKGISLSLICKVEELANSLDLQT 389 Query: 286 RQNLSSFADAIEEILVQQMRSERQSGHIS 200 RQNLS F DAIE+ILV Q R E QS S Sbjct: 390 RQNLSGFIDAIEKILVHQTREELQSNDSS 418 >ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] Length = 421 Score = 334 bits (857), Expect = 5e-89 Identities = 187/395 (47%), Positives = 260/395 (65%), Gaps = 7/395 (1%) Frame = -1 Query: 1384 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLV 1205 C D S SS+S + +DF L E K+K+I DAY+E+ ++EL V Sbjct: 29 CRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLLDVEDSDAYLEYLRKELQSV 88 Query: 1204 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 1025 EAE+ + S EIE ++++ +D++RLE +LEGL SL S+SSQ ++K S E Sbjct: 89 EAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQDVEK-----------SKE 137 Query: 1024 NQGSSYA------HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMG 863 NQ SS + + D F++ EL++++E+ + L SL DLD KR ++ ++ED L G Sbjct: 138 NQPSSSSMEVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTG 197 Query: 862 LKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEI 683 LKV+EF+GN I+L L+TY+P L+ LL QQK E+ T+P + HEL I L D T E+ E+ Sbjct: 198 LKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEM 257 Query: 682 FPHDIFIGEIIDAAKALRQYCPLS-VSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNS 506 FP+D++IG+II+AA + RQ S V + RSS++WVV KVQ +II STLR+ LV + Sbjct: 258 FPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVTSSKTI 317 Query: 505 RHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLC 326 RH FEY ++D TI H+ GGIDAF+K+ WP+LN+ LKL SL +S+N 
SKGISLS +C Sbjct: 318 RHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESL-KNSDNQSKGISLSLIC 376 Query: 325 KVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 221 KVE+L NSLD++TRQNLS F DAIE+ILVQQ R E Sbjct: 377 KVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREE 411 >ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1| uncharacterized protein AT3G23910 [Arabidopsis thaliana] Length = 421 Score = 325 bits (834), Expect = 2e-86 Identities = 182/397 (45%), Positives = 258/397 (64%), Gaps = 7/397 (1%) Frame = -1 Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENL 1190 E SS + +DF L E K+K+I DAY+E+ + EL VEAE+ Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESA 94 Query: 1189 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 1010 + S EIE ++++ +D++RL+ +LEGL SL S+SSQ ++K S ENQ SS Sbjct: 95 KVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143 Query: 1009 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848 + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+E Sbjct: 144 SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203 Query: 847 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668 F+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ E+FP+DI Sbjct: 204 FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263 Query: 667 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491 +IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ +V + R+ FE Sbjct: 264 YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTIRYTFE 323 Query: 490 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311 Y D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKGISLS +CKVEEL Sbjct: 324 
YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGISLSLICKVEEL 382 Query: 310 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200 NSLD++TRQNLS F DAIE+ILV+Q R E QS S Sbjct: 383 ANSLDLETRQNLSGFMDAIEKILVEQTREELQSNKSS 419 >ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum lycopersicum] Length = 415 Score = 325 bits (834), Expect = 2e-86 Identities = 180/376 (47%), Positives = 254/376 (67%), Gaps = 2/376 (0%) Frame = -1 Query: 1351 ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEI 1172 E K +D L ESK++Q+ D + + K EL+ EA+N + ++EI Sbjct: 35 ELKKSLEDCTLQFESKVEQLLCDASEVNFSSDQDLDEFWNYLKNELSTEEAKNAKIADEI 94 Query: 1171 EVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQD 995 E +R Y+E ++L + +EGL+ L+ I S G+++ + E++G+ S A + Sbjct: 95 EGLSREYVEGYSKLVNEVEGLSCLLELIESLGIEQGRALTNFPCSTPGEDKGNLSSAPVE 154 Query: 994 YNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLK 815 +NF++ EL +++EK+K+ L SL +L+ TF R E++ KIED GLK+++FEGN I+LSL+ Sbjct: 155 HNFKIFELGNQLEKSKLNLESLEELESTFNRFEAIEKIEDAFSGLKIVQFEGNRIRLSLR 214 Query: 814 TYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKA 635 T++P LE LL Q + A P +HEL IEL+DGTMELK+ EIFP+D+ I EI D AK+ Sbjct: 215 TFIPNLENLLHNQTIGVAEPP-EQNHELLIELVDGTMELKHVEIFPNDVSISEITDTAKS 273 Query: 634 LRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAH 458 LRQ Y P+ V E RSSLEW+V++VQ +IILSTLR+ LVK AN+SRH F+Y +R+ TI AH Sbjct: 274 LRQVYFPVGVLENRSSLEWLVKRVQDRIILSTLRRFLVKSANSSRHSFDYVEREETIVAH 333 Query: 457 MVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQN 278 MVGGIDAF+K+PQ WP+ S L L+SL SSS+ +S+ ISL+ LCKV E NSLD RQ Sbjct: 334 MVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ-YSQQISLTLLCKVAEAANSLDTNARQT 392 Query: 277 LSSFADAIEEILVQQM 230 +S F D +EEIL+QQM Sbjct: 393 ISGFTDRVEEILMQQM 408 >ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590656431|ref|XP_007034269.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao] 
gi|508713298|gb|EOY05195.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 369 Score = 321 bits (822), Expect = 6e-85 Identities = 172/355 (48%), Positives = 242/355 (68%), Gaps = 3/355 (0%) Frame = -1 Query: 1474 ESMENSHSSERVDIETXXXXXXXXXXXXXSCNDFTELSSSS-ESVKIFKDFFLDLESKIK 1298 E ME S SSE +D+ + + E + S S K+ KD L ESK+K Sbjct: 3 EPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVK 62 Query: 1297 QITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNL 1118 QI D Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NL Sbjct: 63 QIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNL 122 Query: 1117 EGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVA 941 EGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + Sbjct: 123 EGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNII 182 Query: 940 LSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYA 761 L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E Sbjct: 183 LKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDI 242 Query: 760 TDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLE 584 ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLE Sbjct: 243 SEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLE 302 Query: 583 WVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQ 419 W V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q Sbjct: 303 WFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357 >ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum] Length = 428 Score = 319 bits (818), Expect = 2e-84 Identities = 173/337 (51%), Positives = 239/337 (70%), Gaps = 2/337 (0%) Frame = -1 Query: 1234 EHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD 1055 ++ K EL+ EA N + ++EIE +R Y+E ++L + +EGL+ L+ I S GL++ + Sbjct: 87 KYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIEGLSCPLELIESLGLEQGRVL 146 Query: 1054 ASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIE 878 + E++G+ S A + NF++ EL +++EK+K+ L SL +L+ TF R E++ KIE 
Sbjct: 147 TNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNLKSLEELESTFNRFEAIEKIE 206 Query: 877 DTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMEL 698 D GLK++EFEGN I+LSL+T++P LE LL Q ++ A P +HEL IELMDGTMEL Sbjct: 207 DAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVAEPP-EQNHELLIELMDGTMEL 265 Query: 697 KNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVK 521 K+ EIFP+D+ I I D AK+LRQ Y P+ V E RSSLEW V+ VQ +I+LSTLR+ LVK Sbjct: 266 KHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEWFVKGVQDRIVLSTLRRFLVK 325 Query: 520 DANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGIS 341 AN+SRH F+Y DR+ TI AHMVGGIDAFIK+PQ WP+ +S L L+SL SSS+ +S+ IS Sbjct: 326 SANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTSSGLTLMSLKSSSQ-YSQQIS 384 Query: 340 LSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQM 230 L+ LCKV E+ N LD RQ +S F D +EEIL+QQM Sbjct: 385 LTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421 >ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] gi|508713302|gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] Length = 343 Score = 318 bits (816), Expect = 3e-84 Identities = 163/315 (51%), Positives = 228/315 (72%), Gaps = 2/315 (0%) Frame = -1 Query: 1357 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENLRTSN 1178 S S K+ KD L ESK+KQI D Y+ H KEELN VEAE+ + SN Sbjct: 17 SLNSEKLLKDCSLHFESKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISN 76 Query: 1177 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAH 1001 EIE +R +IE++ LE NLEGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++ Sbjct: 77 EIEDLSRNHIEESNILEGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSN 136 Query: 1000 QDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLS 821 ++ FE++EL+ +IEKN + L SL DLD FKR++++ +IED L GLKVI F+GNCI+LS Sbjct: 137 EEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLS 196 Query: 820 LKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAA 641 L+TY+P LEGLLCQ+ +E ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAA Sbjct: 197 LQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAA 256 Query: 640 
KALRQYCP-LSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATIT 464 K+ RQ L+V + +SSLEW V KVQ +IILSTLR+ +VK N SRH FEY +RD TI Sbjct: 257 KSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIV 316 Query: 463 AHMVGGIDAFIKIPQ 419 AH+VGGIDAFIK+ Q Sbjct: 317 AHLVGGIDAFIKLSQ 331 >dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana] Length = 421 Score = 315 bits (806), Expect = 4e-83 Identities = 178/397 (44%), Positives = 254/397 (63%), Gaps = 7/397 (1%) Frame = -1 Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXDAYVEHAKEELNLVEAENL 1190 E SS + +DF L E K+K+I DAY+E+ + EL VEAE+ Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDQTDAYLEYLRNELQSVEAESA 94 Query: 1189 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 1010 + S EIE ++++ D++RL+ +LEGL SL S+SSQ ++K S ENQ SS Sbjct: 95 KVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143 Query: 1009 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 848 + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+E Sbjct: 144 SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203 Query: 847 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 668 F+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ E+FP+DI Sbjct: 204 FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263 Query: 667 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 491 +IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ V + R+ FE Sbjct: 264 YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTIRYTFE 323 Query: 490 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 311 Y D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKG SLS + K+EEL Sbjct: 324 YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSLISKLEEL 382 Query: 310 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200 NSLD++TRQNLS F DA+E+ILVQQ R E +S S Sbjct: 383 ANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 419 >ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1| RNA-directed DNA 
polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] Length = 428 Score = 308 bits (788), Expect = 5e-81 Identities = 177/404 (43%), Positives = 253/404 (62%), Gaps = 14/404 (3%) Frame = -1 Query: 1369 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXD-------AYVEHAKEELN 1211 E SS + +DF L E K+K+I AY+E+ + EL Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAYLEYLRNELQ 94 Query: 1210 LVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFIS 1031 VEAE+ + S EIE ++++ D++RL+ +LEGL SL S+SSQ ++K S Sbjct: 95 SVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------S 143 Query: 1030 TENQGSSYAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTL 869 ENQ SS + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L Sbjct: 144 KENQPSSSSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDAL 203 Query: 868 MGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNA 689 GLKV+EF+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ Sbjct: 204 TGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKF 263 Query: 688 EIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDAN 512 E+FP+DI+IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ V + Sbjct: 264 EMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSK 323 Query: 511 NSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSF 332 R+ FEY D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKG SLS Sbjct: 324 TIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSL 382 Query: 331 LCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 200 + K+EEL NSLD++TRQNLS F DA+E+ILVQQ R E +S S Sbjct: 383 ISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 426 >ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] gi|557096755|gb|ESQ37263.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] Length = 355 Score = 305 bits (781), Expect = 3e-80 Identities = 164/346 (47%), Positives = 235/346 (67%), Gaps = 2/346 (0%) Frame = -1 Query: 1243 AYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKL 1064 AY+E+ ++EL+ 
VEAE+ + S EIE + ++ ED++RL+ +LEGL SL +SSQ + K Sbjct: 6 AYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQKS 65 Query: 1063 -EMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVG 887 E S + + D F++ EL+++IE+ + L SL +LD KR ++ Sbjct: 66 KENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAAE 125 Query: 886 KIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGT 707 ++ED L GLKV+EF+GN I+L L+TY+P L+GLL Q K+ + T+P + HEL I+L D T Sbjct: 126 QVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDKT 185 Query: 706 MELKNAEIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQL 530 E+ E+ P+D++IG+I DAA + RQ ++ + RSSL+W+V KVQ +II + LR+ Sbjct: 186 TEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRKH 245 Query: 529 LVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSK 350 +VK + RH FEY D+D TI AH+ GGIDAF+K+ WP+L++ LKL SL +S+N S Sbjct: 246 IVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSL-KNSDNQSN 304 Query: 349 GISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 212 GISLS +CKVEEL NSLD++TRQNLS F DAIE+ILVQQ R E S Sbjct: 305 GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350