BLASTX nr result
ID: Akebia25_contig00003444
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00003444 (2005 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245... 419 e-114 ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma... 399 e-108 ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [... 388 e-105 ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm... 377 e-101 ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620... 373 e-100 ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620... 369 4e-99 ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma... 360 2e-96 ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma... 357 9e-96 ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu... 344 8e-92 ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun... 342 4e-91 ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps... 335 6e-89 ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab... 334 8e-89 ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ... 325 4e-86 ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244... 325 4e-86 ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592... 319 3e-84 ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma... 319 3e-84 ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [... 318 4e-84 dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana] 315 6e-83 ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcr... 308 8e-81 ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part... 305 5e-80 >ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera] gi|298205214|emb|CBI17273.3| unnamed protein product [Vitis vinifera] Length = 425 Score = 419 bits (1078), Expect = e-114 Identities = 220/390 (56%), Positives = 289/390 (74%), Gaps = 4/390 (1%) Frame = +3 Query: 627 DFTELSSSS--ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800 +++ +S S+ +S +F++F L+S++ QI AY+ H K+ELNLV Sbjct: 28 NYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSDVESLEADDLDAYLGHLKKELNLV 87 Query: 801 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980 E+EN + SNEIE TRTY+ED+ +LES+LE L +S+ ++SQGL + E A V+ S E Sbjct: 88 ESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVDFVASQGLKRAEAGALVDYSSSVE 147 Query: 981 NQ-GSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157 +Q S AH D NFE+L+L+++ +KNK+ L SL DLDYTFKR E++ KIED L GLKVI+ Sbjct: 148 DQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLDYTFKRFEAIEKIEDALTGLKVID 207 Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337 FEGNCI+LSL T++P LEGLLC++K+E +P ++HEL IE+MD +MELKN EIFP+D+ Sbjct: 208 FEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNHELLIEVMDQSMELKNVEIFPNDV 267 Query: 1338 FIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514 ++GEIIDAAK+ R+ + +S+ E RSSLEW VRKVQ +IIL LRQ +VK AN SRH E Sbjct: 268 YLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQDKIILCALRQSIVKGANKSRHSLE 327 Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694 Y DRD I AHMVGG+DA+IK+ Q WPV N+ALKL SL SS+ SKGISLSFLCKVEE+ Sbjct: 328 YLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKSL-KSSDQQSKGISLSFLCKVEEM 386 Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784 NSLDV R+N+SSF DAIEEILVQQM+S+ Sbjct: 387 ANSLDVSIRKNISSFVDAIEEILVQQMQSK 416 >ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508713296|gb|EOY05193.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 430 Score = 399 bits (1024), Expect = e-108 Identities = 211/426 (49%), Positives = 296/426 (69%), Gaps = 3/426 (0%) Frame = +3 Query: 537 MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713 ME S SSE +D+ + + E + S S K+ KD L ESK+KQI Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64 Query: 714 TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893 Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NLEG Sbjct: 65 IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124 Query: 894 LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070 L Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + L Sbjct: 125 LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184 Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250 SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E ++ Sbjct: 185 SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244 Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427 P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLEW Sbjct: 245 PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304 Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607 V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ S Sbjct: 305 VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364 Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSER 1787 LKL+S+ SS++HS+GISLS LCK EE+ NSLD+ RQNLS+F DA+E++L++QMR + Sbjct: 365 PLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMRLDL 423 Query: 1788 QSGHIS 1805 QS S Sbjct: 424 QSDDAS 429 >ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] gi|508713299|gb|EOY05196.1| Uncharacterized protein isoform 4, partial [Theobroma cacao] Length = 372 Score = 388 bits (996), Expect = e-105 Identities = 196/372 (52%), Positives = 275/372 (73%), Gaps = 2/372 (0%) Frame = +3 Query: 696 SKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRL 875 SK+KQI Y+ H KEELN VEAE+ + SNEIE +R +IE++ L Sbjct: 1 SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60 Query: 876 ESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEK 1052 E NLEGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEK Sbjct: 61 EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120 Query: 1053 NKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQK 1232 N + L SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ Sbjct: 121 NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180 Query: 1233 MEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPR 1409 +E ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + + Sbjct: 181 IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240 Query: 1410 SSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQS 1589 SSLEW V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q Sbjct: 241 SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300 Query: 1590 WPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQ 1769 WP+ S LKL+S+ SS++HS+GISLS LCK EE+ NSLD+ RQNLS+F DA+E++L++ Sbjct: 301 WPLSKSPLKLLSI-KSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLE 359 Query: 1770 QMRSERQSGHIS 1805 QMR + QS S Sbjct: 360 QMRLDLQSDDAS 371 >ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis] gi|223542639|gb|EEF44176.1| conserved hypothetical protein [Ricinus communis] Length = 415 Score = 377 bits (968), Expect = e-101 Identities = 205/394 (52%), Positives = 275/394 (69%), Gaps = 3/394 (0%) Frame = +3 Query: 621 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800 CN TE+ SS S ++ +D L LESK++QI A+VEH KEEL+ Sbjct: 24 CNGDTEMLSS-HSDQVLEDCALHLESKVQQIMSECSDFNFLGIEDLDAFVEHLKEELSTT 82 Query: 801 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980 +E + S EIE R ++ED TRLES++E L SL ISS+ ++K + A E ST+ Sbjct: 83 MSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSKDVEKEKEVACREDLYSTD 142 Query: 981 NQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEF 1160 AH+DY FE+ +LD +I K+K+ L SL D D FKR+++V +IE+ L GLKVIEF Sbjct: 143 ------AHRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKRVDAVEQIEEALSGLKVIEF 196 Query: 1161 EGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIF 1340 +G+CI+LSL+TY+P L+ ++CQ K E +P V+HEL IE++ GTMELKN EIFP+DI+ Sbjct: 197 DGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIEVVSGTMELKNVEIFPNDIY 256 Query: 1341 IGEIIDAAKALRQ---YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWF 1511 I +I+DAAK+ R+ Y L+ SE RSSL W+VRKVQ +II TLR+L+VK +N SR+ F Sbjct: 257 ISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRIIQFTLRRLVVKSSNKSRYSF 316 Query: 1512 EYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEE 1691 EY DRD T+ AH+VGG+DAFIK+ Q WPV S LKL+SL SS+ +HSK ISLSFLC+VEE Sbjct: 317 EYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKSSN-HHSKEISLSFLCRVEE 375 Query: 1692 LVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 1793 +VNSLD++ R NL SF + IE++LV+QMR E S Sbjct: 376 VVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409 >ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus sinensis] Length = 447 Score = 373 bits (957), Expect = e-100 Identities = 200/390 (51%), Positives = 269/390 (68%), Gaps = 11/390 (2%) Frame = +3 Query: 648 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827 SS+S + K++ D ESK+K+I AY+EH KEEL VEAE+ + SN Sbjct: 51 SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110 Query: 828 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD------ASVEGFIS---TE 980 EIE TRT +ED+ RLES+LE LN ++ I S+G + D A E + TE Sbjct: 111 EIETLTRTQVEDSDRLESDLEELNCAIDLIVSEGSQNAKEDRQAVCPARGEDQVCPTHTE 170 Query: 981 NQGSSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157 +Q H+D+ FE+LEL+ +IEKNK+ L+SL DLD+ KR ++V +IED+L GLKVI+ Sbjct: 171 DQSDLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVID 230 Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337 F+G C +LS++TY+PTLE Q K+E +P V+HEL IE++DGTME+KN E+FP+D+ Sbjct: 231 FDGKCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDV 290 Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514 I +++DAAK+ RQ L E SSL+W +R VQ +IILSTLR+ +VK AN SRH+FE Sbjct: 291 HISDLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFE 350 Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694 Y +RD I AH+VGG+DAFIK Q WP+ NS LK++SL +S++HSKGISLSF C+VEE Sbjct: 351 YFERDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEA 409 Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784 NSLDV RQNLSSF D +E+IL++QMR E Sbjct: 410 ANSLDVHIRQNLSSFVDGVEKILLEQMRVE 439 >ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus sinensis] Length = 444 Score = 369 bits (946), Expect = 4e-99 Identities = 198/387 (51%), Positives = 267/387 (68%), Gaps = 8/387 (2%) Frame = +3 Query: 648 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827 SS+S + K++ D ESK+K+I AY+EH KEEL VEAE+ + SN Sbjct: 51 SSDSENLLKEYAHDFESKVKEIITEYADVSFLGIEDLDAYLEHLKEELKTVEAESSKISN 110 Query: 828 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEM---DASVEGFIS---TENQG 989 EIE TRT +ED+ RLES+LE LN ++ I S+ + A E + TE+Q Sbjct: 111 EIETLTRTQVEDSDRLESDLEELNCAIDLIVSENAKEDRQAVCPARGEDQVCPTHTEDQS 170 Query: 990 SSYA-HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEG 1166 H+D+ FE+LEL+ +IEKNK+ L+SL DLD+ KR ++V +IED+L GLKVI+F+G Sbjct: 171 DLIKIHEDHRFEILELESQIEKNKIILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDG 230 Query: 1167 NCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIG 1346 C +LS++TY+PTLE Q K+E +P V+HEL IE++DGTME+KN E+FP+D+ I Sbjct: 231 KCFRLSMQTYIPTLEESSFQHKIEDVIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHIS 290 Query: 1347 EIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSD 1523 +++DAAK+ RQ L E SSL+W +R VQ +IILSTLR+ +VK AN SRH+FEY + Sbjct: 291 DLVDAAKSFRQSGTQLDSLETSSSLQWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFE 350 Query: 1524 RDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNS 1703 RD I AH+VGG+DAFIK Q WP+ NS LK++SL +S++HSKGISLSF C+VEE NS Sbjct: 351 RDEMIVAHLVGGVDAFIKPSQGWPLSNSPLKVISL-KNSDHHSKGISLSFFCRVEEAANS 409 Query: 1704 LDVKTRQNLSSFADAIEEILVQQMRSE 1784 LDV RQNLSSF D +E+IL++QMR E Sbjct: 410 LDVHIRQNLSSFVDGVEKILLEQMRVE 436 >ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508713301|gb|EOY05198.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 432 Score = 360 bits (923), Expect = 2e-96 Identities = 191/390 (48%), Positives = 268/390 (68%), Gaps = 3/390 (0%) Frame = +3 Query: 537 MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713 ME S SSE +D+ + + E + S S K+ KD L ESK+KQI Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64 Query: 714 TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893 Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NLEG Sbjct: 65 IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124 Query: 894 LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070 L Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + L Sbjct: 125 LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184 Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250 SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E ++ Sbjct: 185 SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244 Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427 P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLEW Sbjct: 245 PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304 Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607 V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ S Sbjct: 305 VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364 Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEELV 1697 LKL+S+ SS++HS+GISLS LCK EE + Sbjct: 365 PLKLLSI-KSSDHHSRGISLSLLCKAEEAI 393 >ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508713300|gb|EOY05197.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 392 Score = 357 bits (917), Expect = 9e-96 Identities = 190/389 (48%), Positives = 267/389 (68%), Gaps = 3/389 (0%) Frame = +3 Query: 537 MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713 ME S SSE +D+ + + E + S S K+ KD L ESK+KQI Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64 Query: 714 TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893 Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NLEG Sbjct: 65 IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124 Query: 894 LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070 L Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + L Sbjct: 125 LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184 Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250 SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E ++ Sbjct: 185 SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244 Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427 P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLEW Sbjct: 245 PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304 Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNS 1607 V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q WP+ S Sbjct: 305 VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWPLSKS 364 Query: 1608 ALKLVSLNSSSENHSKGISLSFLCKVEEL 1694 LKL+S+ SS++HS+GISLS LCK E + Sbjct: 365 PLKLLSI-KSSDHHSRGISLSLLCKAERV 392 >ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] gi|222847415|gb|EEE84962.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa] Length = 429 Score = 344 bits (883), Expect = 8e-92 Identities = 195/421 (46%), Positives = 276/421 (65%), Gaps = 5/421 (1%) Frame = +3 Query: 546 SHSSERVDIETXXXXXXXXXXXXXXCN--DFTELSSSSESVKIFKDFFLDLESKIKQITX 719 S + E +++ T CN F+E++SS +S ++ KD L SK+ Q Sbjct: 6 STTQESLNLNTIRSRINELEEIYRDCNADSFSEINSS-DSDELMKDSAQQLVSKVSQTVT 64 Query: 720 XXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLN 899 AY+ H KEEL+ EAE+ + SNEIE+ RT +ED++ LE++LE + Sbjct: 65 EYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDLEWMK 124 Query: 900 YSLQSISSQ-GLDKLEMDASVEGFISTENQGSSY-AHQDYNFELLELDHEIEKNKVALSS 1073 SL ISSQ +K + D +E F S ENQ + +++ FE+L+LD++IE++ L S Sbjct: 125 CSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTRILKS 184 Query: 1074 LHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDP 1253 + DLD K +++ +IED L GLKVIEF+G CI+LSL+TY+P + +L QK+E P Sbjct: 185 MQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEETNVP 243 Query: 1254 FAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVV 1430 + ++HE IE+ +G+ME+K E+FP+DI+IG+I+DAAK+ RQ + L++ E SSLEW V Sbjct: 244 YEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSLEWFV 303 Query: 1431 RKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSA 1610 RK Q +II STLR+L+ + A+ SR EY DRD I AHMVGG+DAF+++ Q WP+ NS Sbjct: 304 RKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPITNSP 363 Query: 1611 LKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQ 1790 LKLVSL +S+ +H+K ISL FLCKVEE NSLDV TRQNLSSF D++E+ILV+QM E Sbjct: 364 LKLVSLKNSN-HHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHLELH 422 Query: 1791 S 1793 S Sbjct: 423 S 423 >ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] gi|462422632|gb|EMJ26895.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica] Length = 416 Score = 342 bits (877), Expect = 4e-91 Identities = 194/413 (46%), Positives = 269/413 (65%), Gaps = 2/413 (0%) Frame = +3 Query: 552 SSERVDIETXXXXXXXXXXXXXXC--NDFTELSSSSESVKIFKDFFLDLESKIKQITXXX 725 SSE +D+ T C +D +ELS S +S + ++ L L+S+++QI Sbjct: 8 SSEPLDLNTIQRQVRELEEIIESCRQDDASELSPS-DSDDLIRNCGLLLQSRVEQIVSEC 66 Query: 726 XXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYS 905 AYV ++ELN VEAE+ + SN IE RT+ ED RL ++L L S Sbjct: 67 SDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGTDLAQLKCS 126 Query: 906 LQSISSQGLDKLEMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDL 1085 L + + L+K ++ A V+ ++ FELLEL+++IEKN + L SL DL Sbjct: 127 LDFVEEKDLEKAKLGADVDYHKCGKDLLDPMNVNADKFELLELENQIEKNNIILKSLQDL 186 Query: 1086 DYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVD 1265 + T K +++ +IED + GLKVI FEGNC++LSL+TY+P LE L +K+ AT+P V+ Sbjct: 187 ECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKVGDATEPSEVN 246 Query: 1266 HELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCPLSVSEPRSSLEWVVRKVQH 1445 HEL IEL++GTM L+N EIFP+D++I +I+DAAK+LR +SSL+W V KVQ Sbjct: 247 HELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSSLQWFVTKVQD 296 Query: 1446 QIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVS 1625 +I+L T+R+L+VK+ N SRH EY D+D T+ AH+VGG+DAFIK+PQ WP+L+S LKL+ Sbjct: 297 RIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWPLLSSPLKLIY 356 Query: 1626 LNSSSENHSKGISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784 L SS+ HSKGISLSFLC V+EL NSL V+ RQ LSSF DAIE+ILV+QM SE Sbjct: 357 L-KSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMCSE 408 >ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] gi|482566470|gb|EOA30659.1| hypothetical protein CARUB_v10013795mg [Capsella rubella] Length = 420 Score = 335 bits (858), Expect = 6e-89 Identities = 182/389 (46%), Positives = 257/389 (66%), Gaps = 1/389 (0%) Frame = +3 Query: 642 SSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRT 821 S +S+S + +DF L E+K+ +I AY+E+ ++EL+ VEAE+ + Sbjct: 36 SCTSDSENLVQDFVLQFETKVNEIVEDYSDVDILDVEDSDAYLEYLRKELHSVEAESAKV 95 Query: 822 SNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSSYA 1001 S EIE +R++ ED++RLE +LEGL SL S+SSQ ++K S E S + Sbjct: 96 SEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSSQDVNK-----SKESPPSCSSMEVCEV 150 Query: 1002 HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKL 1181 + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+EF+GN I+L Sbjct: 151 NDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRL 210 Query: 1182 SLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDA 1361 L+TY+P L+GL Q K E+ T P + HEL I L D T E+ E+FP+D++IG+II+A Sbjct: 211 QLRTYIPELDGLPAQHKFEHTTKPSELIHELLIYLKDKTTEITKLEMFPNDVYIGDIIEA 270 Query: 1362 AKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATI 1538 A + RQ +V + RSS++WVV KVQ +II +TLR+ +V + RH F+Y D+D TI Sbjct: 271 ADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITTTLRKYIVTSSKTMRHTFKYYDKDETI 330 Query: 1539 TAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKT 1718 AH+ GGIDAF+K+ WP+LNS LKL SL +S+N SKGISLS +CKVEEL NSLD++T Sbjct: 331 VAHIAGGIDAFLKVSDGWPLLNSPLKLASL-KNSDNQSKGISLSLICKVEELANSLDLQT 389 Query: 1719 RQNLSSFADAIEEILVQQMRSERQSGHIS 1805 RQNLS F DAIE+ILV Q R E QS S Sbjct: 390 RQNLSGFIDAIEKILVHQTREELQSNDSS 418 >ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata] Length = 421 Score = 334 bits (857), Expect = 8e-89 Identities = 186/395 (47%), Positives = 259/395 (65%), Gaps = 7/395 (1%) Frame = +3 Query: 621 CNDFTELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLV 800 C D S SS+S + +DF L E K+K+I AY+E+ ++EL V Sbjct: 29 CRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLLDVEDSDAYLEYLRKELQSV 88 Query: 801 EAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTE 980 EAE+ + S EIE ++++ +D++RLE +LEGL SL S+SSQ ++K S E Sbjct: 89 EAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSSQDVEK-----------SKE 137 Query: 981 NQGSSYA------HQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMG 1142 NQ SS + + D F++ EL++++E+ + L SL DLD KR ++ ++ED L G Sbjct: 138 NQPSSSSMEVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKRFDAAEQVEDALTG 197 Query: 1143 LKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEI 1322 LKV+EF+GN I+L L+TY+P L+ LL QQK E+ T+P + HEL I L D T E+ E+ Sbjct: 198 LKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIYLKDKTTEITKFEM 257 Query: 1323 FPHDIFIGEIIDAAKALRQYCPLS-VSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNS 1499 FP+D++IG+II+AA + RQ S V + RSS++WVV KVQ +II STLR+ LV + Sbjct: 258 FPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISSTLRKYLVTSSKTI 317 Query: 1500 RHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLC 1679 RH FEY ++D TI H+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKGISLS +C Sbjct: 318 RHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESL-KNSDNQSKGISLSLIC 376 Query: 1680 KVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSE 1784 KVE+L NSLD++TRQNLS F DAIE+ILVQQ R E Sbjct: 377 KVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREE 411 >ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1| uncharacterized protein AT3G23910 [Arabidopsis thaliana] Length = 421 Score = 325 bits (834), Expect = 4e-86 Identities = 181/397 (45%), Positives = 257/397 (64%), Gaps = 7/397 (1%) Frame = +3 Query: 636 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENL 815 E SS + +DF L E K+K+I AY+E+ + EL VEAE+ Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESA 94 Query: 816 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 995 + S EIE ++++ +D++RL+ +LEGL SL S+SSQ ++K S ENQ SS Sbjct: 95 KVSEEIERLSQSHAQDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143 Query: 996 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157 + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+E Sbjct: 144 SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203 Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337 F+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ E+FP+DI Sbjct: 204 FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263 Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514 +IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ +V + R+ FE Sbjct: 264 YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKYIVMSSKTIRYTFE 323 Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694 Y D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKGISLS +CKVEEL Sbjct: 324 YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGISLSLICKVEEL 382 Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805 NSLD++TRQNLS F DAIE+ILV+Q R E QS S Sbjct: 383 ANSLDLETRQNLSGFMDAIEKILVEQTREELQSNKSS 419 >ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum lycopersicum] Length = 415 Score = 325 bits (834), Expect = 4e-86 Identities = 179/376 (47%), Positives = 253/376 (67%), Gaps = 2/376 (0%) Frame = +3 Query: 654 ESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEI 833 E K +D L ESK++Q+ + + K EL+ EA+N + ++EI Sbjct: 35 ELKKSLEDCTLQFESKVEQLLCDASEVNFSSDQDLDEFWNYLKNELSTEEAKNAKIADEI 94 Query: 834 EVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQD 1010 E +R Y+E ++L + +EGL+ L+ I S G+++ + E++G+ S A + Sbjct: 95 EGLSREYVEGYSKLVNEVEGLSCLLELIESLGIEQGRALTNFPCSTPGEDKGNLSSAPVE 154 Query: 1011 YNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLK 1190 +NF++ EL +++EK+K+ L SL +L+ TF R E++ KIED GLK+++FEGN I+LSL+ Sbjct: 155 HNFKIFELGNQLEKSKLNLESLEELESTFNRFEAIEKIEDAFSGLKIVQFEGNRIRLSLR 214 Query: 1191 TYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKA 1370 T++P LE LL Q + A P +HEL IEL+DGTMELK+ EIFP+D+ I EI D AK+ Sbjct: 215 TFIPNLENLLHNQTIGVAEPP-EQNHELLIELVDGTMELKHVEIFPNDVSISEITDTAKS 273 Query: 1371 LRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAH 1547 LRQ Y P+ V E RSSLEW+V++VQ +IILSTLR+ LVK AN+SRH F+Y +R+ TI AH Sbjct: 274 LRQVYFPVGVLENRSSLEWLVKRVQDRIILSTLRRFLVKSANSSRHSFDYVEREETIVAH 333 Query: 1548 MVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEELVNSLDVKTRQN 1727 MVGGIDAF+K+PQ WP+ S L L+SL SSS+ +S+ ISL+ LCKV E NSLD RQ Sbjct: 334 MVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ-YSQQISLTLLCKVAEAANSLDTNARQT 392 Query: 1728 LSSFADAIEEILVQQM 1775 +S F D +EEIL+QQM Sbjct: 393 ISGFTDRVEEILMQQM 408 >ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum] Length = 428 Score = 319 bits (818), Expect = 3e-84 Identities = 173/337 (51%), Positives = 239/337 (70%), Gaps = 2/337 (0%) Frame = +3 Query: 771 EHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMD 950 ++ K EL+ EA N + ++EIE +R Y+E ++L + +EGL+ L+ I S GL++ + Sbjct: 87 KYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIEGLSCPLELIESLGLEQGRVL 146 Query: 951 ASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIE 1127 + E++G+ S A + NF++ EL +++EK+K+ L SL +L+ TF R E++ KIE Sbjct: 147 TNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNLKSLEELESTFNRFEAIEKIE 206 Query: 1128 DTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMEL 1307 D GLK++EFEGN I+LSL+T++P LE LL Q ++ A P +HEL IELMDGTMEL Sbjct: 207 DAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVAEPP-EQNHELLIELMDGTMEL 265 Query: 1308 KNAEIFPHDIFIGEIIDAAKALRQ-YCPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVK 1484 K+ EIFP+D+ I I D AK+LRQ Y P+ V E RSSLEW V+ VQ +I+LSTLR+ LVK Sbjct: 266 KHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEWFVKGVQDRIVLSTLRRFLVK 325 Query: 1485 DANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGIS 1664 AN+SRH F+Y DR+ TI AHMVGGIDAFIK+PQ WP+ +S L L+SL SSS+ +S+ IS Sbjct: 326 SANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTSSGLTLMSLKSSSQ-YSQQIS 384 Query: 1665 LSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQM 1775 L+ LCKV E+ N LD RQ +S F D +EEIL+QQM Sbjct: 385 LTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421 >ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590656431|ref|XP_007034269.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508713298|gb|EOY05195.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 369 Score = 319 bits (818), Expect = 3e-84 Identities = 170/353 (48%), Positives = 240/353 (67%), Gaps = 3/353 (0%) Frame = +3 Query: 537 MENSHSSERVDIETXXXXXXXXXXXXXXCNDFTELSSSS-ESVKIFKDFFLDLESKIKQI 713 ME S SSE +D+ + + E + S S K+ KD L ESK+KQI Sbjct: 5 MEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESKVKQI 64 Query: 714 TXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEG 893 Y+ H KEELN VEAE+ + SNEIE +R +IE++ LE NLEG Sbjct: 65 IEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEGNLEG 124 Query: 894 LNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAHQDYNFELLELDHEIEKNKVALS 1070 L Y+L SI+SQG++ +E D ++ ++ E+Q + +++++ FE++EL+ +IEKN + L Sbjct: 125 LKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNNIILK 184 Query: 1071 SLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATD 1250 SL DLD FKR++++ +IED L GLKVI F+GNCI+LSL+TY+P LEGLLCQ+ +E ++ Sbjct: 185 SLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIEDISE 244 Query: 1251 PFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAAKALRQYCP-LSVSEPRSSLEWV 1427 P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAAK+ RQ L+V + +SSLEW Sbjct: 245 PSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSSLEWF 304 Query: 1428 VRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQ 1586 V KVQ +IILSTLR+ +VK N SRH FEY +RD TI AH+VGGIDAFIK+ Q Sbjct: 305 VGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357 >ref|XP_007034273.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] gi|508713302|gb|EOY05199.1| Uncharacterized protein isoform 7, partial [Theobroma cacao] Length = 343 Score = 318 bits (816), Expect = 4e-84 Identities = 162/315 (51%), Positives = 227/315 (72%), Gaps = 2/315 (0%) Frame = +3 Query: 648 SSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENLRTSN 827 S S K+ KD L ESK+KQI Y+ H KEELN VEAE+ + SN Sbjct: 17 SLNSEKLLKDCSLHFESKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISN 76 Query: 828 EIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGS-SYAH 1004 EIE +R +IE++ LE NLEGL Y+L SI+SQG++ +E D ++ ++ E+Q + +++ Sbjct: 77 EIEDLSRNHIEESNILEGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSN 136 Query: 1005 QDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIEFEGNCIKLS 1184 ++ FE++EL+ +IEKN + L SL DLD FKR++++ +IED L GLKVI F+GNCI+LS Sbjct: 137 EEQKFEIMELESQIEKNNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLS 196 Query: 1185 LKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDIFIGEIIDAA 1364 L+TY+P LEGLLCQ+ +E ++P ++HEL +E++DGTME+KN E+FP+D+++G+IIDAA Sbjct: 197 LQTYIPKLEGLLCQKTIEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAA 256 Query: 1365 KALRQYCP-LSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFEYSDRDATIT 1541 K+ RQ L+V + +SSLEW V KVQ +IILSTLR+ +VK N SRH FEY +RD TI Sbjct: 257 KSFRQLSSNLTVQQTQSSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIV 316 Query: 1542 AHMVGGIDAFIKIPQ 1586 AH+VGGIDAFIK+ Q Sbjct: 317 AHLVGGIDAFIKLSQ 331 >dbj|BAB02924.1| unnamed protein product [Arabidopsis thaliana] Length = 421 Score = 315 bits (806), Expect = 6e-83 Identities = 177/397 (44%), Positives = 253/397 (63%), Gaps = 7/397 (1%) Frame = +3 Query: 636 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXXAYVEHAKEELNLVEAENL 815 E SS + +DF L E K+K+I AY+E+ + EL VEAE+ Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDQTDAYLEYLRNELQSVEAESA 94 Query: 816 RTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFISTENQGSS 995 + S EIE ++++ D++RL+ +LEGL SL S+SSQ ++K S ENQ SS Sbjct: 95 KVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------SKENQPSS 143 Query: 996 YAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTLMGLKVIE 1157 + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L GLKV+E Sbjct: 144 SSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDALTGLKVLE 203 Query: 1158 FEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNAEIFPHDI 1337 F+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ E+FP+DI Sbjct: 204 FDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKFEMFPNDI 263 Query: 1338 FIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDANNSRHWFE 1514 +IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ V + R+ FE Sbjct: 264 YIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSKTIRYTFE 323 Query: 1515 YSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSFLCKVEEL 1694 Y D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKG SLS + K+EEL Sbjct: 324 YYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSLISKLEEL 382 Query: 1695 VNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805 NSLD++TRQNLS F DA+E+ILVQQ R E +S S Sbjct: 383 ANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 419 >ref|NP_001154643.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] gi|332643360|gb|AEE76881.1| RNA-directed DNA polymerase (reverse transcriptase)-related protein [Arabidopsis thaliana] Length = 428 Score = 308 bits (788), Expect = 8e-81 Identities = 177/404 (43%), Positives = 253/404 (62%), Gaps = 14/404 (3%) Frame = +3 Query: 636 ELSSSSESVKIFKDFFLDLESKIKQITXXXXXXXXXXXXXXX-------AYVEHAKEELN 794 E SS + +DF L E K+K+I AY+E+ + EL Sbjct: 35 ESCSSDYETLVVQDFVLQFEPKVKEIVEDYGDVDLLDVDHTLVDGNLTDAYLEYLRNELQ 94 Query: 795 LVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKLEMDASVEGFIS 974 VEAE+ + S EIE ++++ D++RL+ +LEGL SL S+SSQ ++K S Sbjct: 95 SVEAESAKVSEEIERLSQSHALDSSRLQRDLEGLLLSLDSMSSQDVEK-----------S 143 Query: 975 TENQGSSYAHQ------DYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVGKIEDTL 1136 ENQ SS + + D F++ EL++++E+ ++ L SL DLD KR ++ ++ED L Sbjct: 144 KENQPSSSSMEVCEVIDDDKFKMFELENQMEEKRMILKSLEDLDSLRKRFDAAEQVEDAL 203 Query: 1137 MGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGTMELKNA 1316 GLKV+EF+GN I+L L+TY+ L+G L Q K ++ T+P + HEL I L D T E+ Sbjct: 204 TGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEPSELIHELLIYLKDKTTEITKF 263 Query: 1317 EIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQLLVKDAN 1493 E+FP+DI+IG+II+AA + RQ +V + RSS++WVV KVQ +II +TLR+ V + Sbjct: 264 EMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDKIISTTLRKDFVMSSK 323 Query: 1494 NSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSKGISLSF 1673 R+ FEY D+D TI AH+ GGIDAF+K+ WP+LN+ LKL SL +S+N SKG SLS Sbjct: 324 TIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTPLKLASL-KNSDNQSKGFSLSL 382 Query: 1674 LCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQSGHIS 1805 + K+EEL NSLD++TRQNLS F DA+E+ILVQQ R E +S S Sbjct: 383 ISKLEELANSLDLETRQNLSGFMDAVEKILVQQTREELKSNESS 426 >ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] gi|557096755|gb|ESQ37263.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum] Length = 355 Score = 305 bits (781), Expect = 5e-80 Identities = 164/346 (47%), Positives = 235/346 (67%), Gaps = 2/346 (0%) Frame = +3 Query: 762 AYVEHAKEELNLVEAENLRTSNEIEVHTRTYIEDATRLESNLEGLNYSLQSISSQGLDKL 941 AY+E+ ++EL+ VEAE+ + S EIE + ++ ED++RL+ +LEGL SL +SSQ + K Sbjct: 6 AYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQKS 65 Query: 942 -EMDASVEGFISTENQGSSYAHQDYNFELLELDHEIEKNKVALSSLHDLDYTFKRIESVG 1118 E S + + D F++ EL+++IE+ + L SL +LD KR ++ Sbjct: 66 KENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAAE 125 Query: 1119 KIEDTLMGLKVIEFEGNCIKLSLKTYVPTLEGLLCQQKMEYATDPFAVDHELFIELMDGT 1298 ++ED L GLKV+EF+GN I+L L+TY+P L+GLL Q K+ + T+P + HEL I+L D T Sbjct: 126 QVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDKT 185 Query: 1299 MELKNAEIFPHDIFIGEIIDAAKALRQY-CPLSVSEPRSSLEWVVRKVQHQIILSTLRQL 1475 E+ E+ P+D++IG+I DAA + RQ ++ + RSSL+W+V KVQ +II + LR+ Sbjct: 186 TEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRKH 245 Query: 1476 LVKDANNSRHWFEYSDRDATITAHMVGGIDAFIKIPQSWPVLNSALKLVSLNSSSENHSK 1655 +VK + RH FEY D+D TI AH+ GGIDAF+K+ WP+L++ LKL SL +S+N S Sbjct: 246 IVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSL-KNSDNQSN 304 Query: 1656 GISLSFLCKVEELVNSLDVKTRQNLSSFADAIEEILVQQMRSERQS 1793 GISLS +CKVEEL NSLD++TRQNLS F DAIE+ILVQQ R E S Sbjct: 305 GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350