BLASTX nr result
ID: Forsythia23_contig00026417
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00026417 (1937 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011072663.1| PREDICTED: uncharacterized protein LOC105157...   611   e-172
ref|XP_011072662.1| PREDICTED: uncharacterized protein LOC105157...   611   e-172
ref|XP_011079454.1| PREDICTED: uncharacterized protein LOC105162...   568   e-159
ref|XP_011079449.1| PREDICTED: uncharacterized protein LOC105162...   568   e-159
ref|XP_011079438.1| PREDICTED: uncharacterized protein LOC105162...   568   e-159
ref|XP_012831833.1| PREDICTED: uncharacterized protein LOC105952...   556   e-155
emb|CDO97814.1| unnamed protein product [Coffea canephora]            508   e-141
ref|XP_011079460.1| PREDICTED: uncharacterized protein LOC105162...   493   e-136
ref|XP_007024720.1| Uncharacterized protein isoform 6 [Theobroma...   469   e-129
ref|XP_007024719.1| Uncharacterized protein isoform 5 [Theobroma...   469   e-129
ref|XP_007024718.1| Uncharacterized protein isoform 4 [Theobroma...   469   e-129
ref|XP_007024717.1| Uncharacterized protein isoform 3 [Theobroma...   469   e-129
ref|XP_007024715.1| Uncharacterized protein isoform 1 [Theobroma...   469   e-129
ref|XP_012068836.1| PREDICTED: uncharacterized protein LOC105631...   469   e-129
ref|XP_012068835.1| PREDICTED: uncharacterized protein LOC105631...   469   e-129
ref|XP_012068833.1| PREDICTED: uncharacterized protein LOC105631...   469   e-129
gb|KDP40661.1| hypothetical protein JCGZ_24660 [Jatropha curcas]      469   e-129
ref|XP_012856437.1| PREDICTED: uncharacterized protein LOC105975...   461   e-127
ref|XP_009763170.1| PREDICTED: uncharacterized protein LOC104215...
460 e-126 ref|XP_012454538.1| PREDICTED: uncharacterized protein LOC105776... 457 e-125 >ref|XP_011072663.1| PREDICTED: uncharacterized protein LOC105157864 isoform X2 [Sesamum indicum] Length = 1381 Score = 611 bits (1575), Expect = e-172 Identities = 348/613 (56%), Positives = 417/613 (68%), Gaps = 12/613 (1%) Frame = -1 Query: 1805 KMKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQS 1626 KMKS+ PLDYAVFQLSP+RSRCELFVSS G+ EKLASGL+KPFVA+LK AEEQV S AQS Sbjct: 8 KMKSDAPLDYAVFQLSPKRSRCELFVSSGGSTEKLASGLLKPFVANLKVAEEQVSSAAQS 67 Query: 1625 VKLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQ 1446 V+LE GR+K ++ WFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQ Sbjct: 68 VRLEVGRRKNAEAWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQ 127 Query: 1445 LSGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAE 1266 LSGGG GVT + D TKKELLRAID+RLV VQQDLT AGFN D VSELQ+FAE Sbjct: 128 LSGGGGPGVTAADDATKKELLRAIDLRLVAVQQDLTAAAARAAAAGFNVDAVSELQIFAE 187 Query: 1265 FFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPF 1086 FG HRLNEAC ++SLCERR DLI+ W+SG +D AVRSS GSDMSID+DP SPQP Sbjct: 188 RFGAHRLNEACGMFISLCERRPDLINQWKSGPEDRAVRSSCGSDMSIDDDPPSPQP---- 243 Query: 1085 PVQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXS 906 +++ + CQQP P + +FP++ FSRESS ERDD NK ND V Sbjct: 244 ------RQEAATCQQPNPAAPTFPLRRAFSRESSVERDDGNKANDTVGEKDGKDETLTPD 297 Query: 905 DL--THTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVE 732 T SQ RRLSVQDRI+LFENKQKEN SGGKPVV KS ELRRL SDVS+ AA E Sbjct: 298 QTGSTQASQPARRLSVQDRINLFENKQKEN--SGGKPVVVKSAELRRLPSDVSTTGAAAE 355 Query: 731 KAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLND--TMASSVK 558 KAVLRRWSGASDMS+DLS+EKKD +SPL TPSSA + S E K LNLND T +S VK Sbjct: 356 KAVLRRWSGASDMSIDLSAEKKDAQSPLSTPSSA----TVSHENKVLNLNDDTTKSSFVK 411 Query: 557 PESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGT-----TEFDGSKDQTHGKSQSRSFIV 393 PE ++IP +++ + E DG K+Q GK+QSRSFI+ Sbjct: 412 PEIKVIPSLSRGSDSRLKEGFNKSEQCSESSKSNFNLLPGESDGLKNQVLGKTQSRSFII 471 Query: 392 
RTEDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKD 213 + ++QENSEEK R+ D K E F Q +++Q+ V+D Sbjct: 472 KADNQENSEEKLRNLVDGKTESASLFGHQGKLKDSQIGEDLS-------GAQSQIAGVRD 524 Query: 212 QGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHS---TSQKMVADSGQFEGLAGSR 42 QG+ + +R G+KGGG V+I N ++ E D+ V + +++K V +S EG++GSR Sbjct: 525 QGSSLSHVRRIGSKGGGGVDILNQRQDSESWDESVVETSLKSTRKAVGESRVIEGVSGSR 584 Query: 41 IREAFAAHYKGTE 3 IREAFA YKG E Sbjct: 585 IREAFAVRYKGIE 597 >ref|XP_011072662.1| PREDICTED: uncharacterized protein LOC105157864 isoform X1 [Sesamum indicum] Length = 1409 Score = 611 bits (1575), Expect = e-172 Identities = 348/613 (56%), Positives = 417/613 (68%), Gaps = 12/613 (1%) Frame = -1 Query: 1805 KMKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQS 1626 KMKS+ PLDYAVFQLSP+RSRCELFVSS G+ EKLASGL+KPFVA+LK AEEQV S AQS Sbjct: 8 KMKSDAPLDYAVFQLSPKRSRCELFVSSGGSTEKLASGLLKPFVANLKVAEEQVSSAAQS 67 Query: 1625 VKLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQ 1446 V+LE GR+K ++ WFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQ Sbjct: 68 VRLEVGRRKNAEAWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQ 127 Query: 1445 LSGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAE 1266 LSGGG GVT + D TKKELLRAID+RLV VQQDLT AGFN D VSELQ+FAE Sbjct: 128 LSGGGGPGVTAADDATKKELLRAIDLRLVAVQQDLTAAAARAAAAGFNVDAVSELQIFAE 187 Query: 1265 FFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPF 1086 FG HRLNEAC ++SLCERR DLI+ W+SG +D AVRSS GSDMSID+DP SPQP Sbjct: 188 RFGAHRLNEACGMFISLCERRPDLINQWKSGPEDRAVRSSCGSDMSIDDDPPSPQP---- 243 Query: 1085 PVQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXS 906 +++ + CQQP P + +FP++ FSRESS ERDD NK ND V Sbjct: 244 ------RQEAATCQQPNPAAPTFPLRRAFSRESSVERDDGNKANDTVGEKDGKDETLTPD 297 Query: 905 DL--THTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVE 732 T SQ RRLSVQDRI+LFENKQKEN SGGKPVV KS ELRRL SDVS+ AA E Sbjct: 298 QTGSTQASQPARRLSVQDRINLFENKQKEN--SGGKPVVVKSAELRRLPSDVSTTGAAAE 355 Query: 731 KAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLND--TMASSVK 558 
KAVLRRWSGASDMS+DLS+EKKD +SPL TPSSA + S E K LNLND T +S VK Sbjct: 356 KAVLRRWSGASDMSIDLSAEKKDAQSPLSTPSSA----TVSHENKVLNLNDDTTKSSFVK 411 Query: 557 PESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGT-----TEFDGSKDQTHGKSQSRSFIV 393 PE ++IP +++ + E DG K+Q GK+QSRSFI+ Sbjct: 412 PEIKVIPSLSRGSDSRLKEGFNKSEQCSESSKSNFNLLPGESDGLKNQVLGKTQSRSFII 471 Query: 392 RTEDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKD 213 + ++QENSEEK R+ D K E F Q +++Q+ V+D Sbjct: 472 KADNQENSEEKLRNLVDGKTESASLFGHQGKLKDSQIGEDLS-------GAQSQIAGVRD 524 Query: 212 QGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHS---TSQKMVADSGQFEGLAGSR 42 QG+ + +R G+KGGG V+I N ++ E D+ V + +++K V +S EG++GSR Sbjct: 525 QGSSLSHVRRIGSKGGGGVDILNQRQDSESWDESVVETSLKSTRKAVGESRVIEGVSGSR 584 Query: 41 IREAFAAHYKGTE 3 IREAFA YKG E Sbjct: 585 IREAFAVRYKGIE 597 >ref|XP_011079454.1| PREDICTED: uncharacterized protein LOC105162955 isoform X3 [Sesamum indicum] Length = 1399 Score = 568 bits (1463), Expect = e-159 Identities = 339/611 (55%), Positives = 405/611 (66%), Gaps = 11/611 (1%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS TPLDYAVFQLSP+ SRCELFVS +G+ EKLASGL+KPFVAHL+ AEEQV S AQSV Sbjct: 1 MKSETPLDYAVFQLSPKCSRCELFVSGDGSTEKLASGLLKPFVAHLRIAEEQVASAAQSV 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE GR K + TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQL Sbjct: 61 KLEVGRSKHAATWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQL 120 Query: 1442 SGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAEF 1263 SGGG SGVT + D TKKELLRAIDVRL V+QDL+T AGFN DTVSELQMFA+ Sbjct: 121 SGGGGSGVTAADDATKKELLRAIDVRLAAVRQDLSTACTRAAAAGFNIDTVSELQMFADK 180 Query: 1262 FGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPFP 1083 FG RLNEAC K++S+ + R +LI+P +SGT+ A+RSS GSDMSIDEDP++P P Sbjct: 181 FGADRLNEACGKFISVSDSRPELINPCKSGTRGRALRSSCGSDMSIDEDPTTPPP----- 235 Query: 1082 VQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXSD 903 HQ P+ QQP PP L+FP++ TFSRESS ERDD NK NDAV + Sbjct: 236 
----HQGPPTF-QQPNPPPLTFPLRPTFSRESSVERDDGNKPNDAVPEKDRKDETSTSDE 290 Query: 902 LT--HTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVEK 729 +Q RRLSVQDRI+LFENKQKEN SGG PVV KSVELRRLSSD+SS+ AVEK Sbjct: 291 TVSIQAAQPARRLSVQDRINLFENKQKEN--SGGNPVVVKSVELRRLSSDLSSSAGAVEK 348 Query: 728 AVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLND--TMASSV-K 558 AVLRRWSGASDMS+DLS+EKKD+ESPLCTP+S S++K NLN T +SSV K Sbjct: 349 AVLRRWSGASDMSIDLSAEKKDSESPLCTPAST----VVSQDKNVFNLNGEITESSSVAK 404 Query: 557 PESRIIPG---VAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQSRSFIVRT 387 PE ++IP V+ +G+ E DG KDQ GK+QSRS + R Sbjct: 405 PEIKVIPSLSRVSDSRLKGVSFNNSELASESNSSLGSGENDGLKDQVCGKNQSRSSLSRA 464 Query: 386 EDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQG 207 +D+E+ E K E ++GF D V+ KDQ Sbjct: 465 DDRESLGEDSTGV---KTEGILGFGD--------LGKLKDPRTGQEVSGPQAHIASKDQV 513 Query: 206 ALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTH---STSQKMVADSGQFEGLAGSRIR 36 + +Q+R F +KG Q EI N KE + ++ V QK + E +AGS+IR Sbjct: 514 SSSSQVRGFVSKGSEQFEIPNHKEDSRLGNEAVQQMKVKIVQKAAVEPRVLEEVAGSKIR 573 Query: 35 EAFAAHYKGTE 3 EAFA+H+KGT+ Sbjct: 574 EAFASHHKGTD 584 >ref|XP_011079449.1| PREDICTED: uncharacterized protein LOC105162955 isoform X2 [Sesamum indicum] Length = 1400 Score = 568 bits (1463), Expect = e-159 Identities = 339/611 (55%), Positives = 405/611 (66%), Gaps = 11/611 (1%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS TPLDYAVFQLSP+ SRCELFVS +G+ EKLASGL+KPFVAHL+ AEEQV S AQSV Sbjct: 1 MKSETPLDYAVFQLSPKCSRCELFVSGDGSTEKLASGLLKPFVAHLRIAEEQVASAAQSV 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE GR K + TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQL Sbjct: 61 KLEVGRSKHAATWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQL 120 Query: 1442 SGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAEF 1263 SGGG SGVT + D TKKELLRAIDVRL V+QDL+T AGFN DTVSELQMFA+ Sbjct: 121 SGGGGSGVTAADDATKKELLRAIDVRLAAVRQDLSTACTRAAAAGFNIDTVSELQMFADK 180 Query: 1262 
FGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPFP 1083 FG RLNEAC K++S+ + R +LI+P +SGT+ A+RSS GSDMSIDEDP++P P Sbjct: 181 FGADRLNEACGKFISVSDSRPELINPCKSGTRGRALRSSCGSDMSIDEDPTTPPP----- 235 Query: 1082 VQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXSD 903 HQ P+ QQP PP L+FP++ TFSRESS ERDD NK NDAV + Sbjct: 236 ----HQGPPTF-QQPNPPPLTFPLRPTFSRESSVERDDGNKPNDAVPEKDRKDETSTSDE 290 Query: 902 LT--HTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVEK 729 +Q RRLSVQDRI+LFENKQKEN SGG PVV KSVELRRLSSD+SS+ AVEK Sbjct: 291 TVSIQAAQPARRLSVQDRINLFENKQKEN--SGGNPVVVKSVELRRLSSDLSSSAGAVEK 348 Query: 728 AVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLND--TMASSV-K 558 AVLRRWSGASDMS+DLS+EKKD+ESPLCTP+S S++K NLN T +SSV K Sbjct: 349 AVLRRWSGASDMSIDLSAEKKDSESPLCTPAST----VVSQDKNVFNLNGEITESSSVAK 404 Query: 557 PESRIIPG---VAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQSRSFIVRT 387 PE ++IP V+ +G+ E DG KDQ GK+QSRS + R Sbjct: 405 PEIKVIPSLSRVSDSRLKGVSFNNSELASESNSSLGSGENDGLKDQVCGKNQSRSSLSRA 464 Query: 386 EDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQG 207 +D+E+ E K E ++GF D V+ KDQ Sbjct: 465 DDRESLGEDSTGV---KTEGILGFGD--------LGKLKDPRTGQEVSGPQAHIASKDQV 513 Query: 206 ALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTH---STSQKMVADSGQFEGLAGSRIR 36 + +Q+R F +KG Q EI N KE + ++ V QK + E +AGS+IR Sbjct: 514 SSSSQVRGFVSKGSEQFEIPNHKEDSRLGNEAVQQMKVKIVQKAAVEPRVLEEVAGSKIR 573 Query: 35 EAFAAHYKGTE 3 EAFA+H+KGT+ Sbjct: 574 EAFASHHKGTD 584 >ref|XP_011079438.1| PREDICTED: uncharacterized protein LOC105162955 isoform X1 [Sesamum indicum] gi|747042620|ref|XP_011079444.1| PREDICTED: uncharacterized protein LOC105162955 isoform X1 [Sesamum indicum] Length = 1401 Score = 568 bits (1463), Expect = e-159 Identities = 339/611 (55%), Positives = 405/611 (66%), Gaps = 11/611 (1%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS TPLDYAVFQLSP+ SRCELFVS +G+ EKLASGL+KPFVAHL+ AEEQV S AQSV Sbjct: 1 MKSETPLDYAVFQLSPKCSRCELFVSGDGSTEKLASGLLKPFVAHLRIAEEQVASAAQSV 60 Query: 1622 
KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE GR K + TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQL Sbjct: 61 KLEVGRSKHAATWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQL 120 Query: 1442 SGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAEF 1263 SGGG SGVT + D TKKELLRAIDVRL V+QDL+T AGFN DTVSELQMFA+ Sbjct: 121 SGGGGSGVTAADDATKKELLRAIDVRLAAVRQDLSTACTRAAAAGFNIDTVSELQMFADK 180 Query: 1262 FGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPFP 1083 FG RLNEAC K++S+ + R +LI+P +SGT+ A+RSS GSDMSIDEDP++P P Sbjct: 181 FGADRLNEACGKFISVSDSRPELINPCKSGTRGRALRSSCGSDMSIDEDPTTPPP----- 235 Query: 1082 VQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXSD 903 HQ P+ QQP PP L+FP++ TFSRESS ERDD NK NDAV + Sbjct: 236 ----HQGPPTF-QQPNPPPLTFPLRPTFSRESSVERDDGNKPNDAVPEKDRKDETSTSDE 290 Query: 902 LT--HTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVEK 729 +Q RRLSVQDRI+LFENKQKEN SGG PVV KSVELRRLSSD+SS+ AVEK Sbjct: 291 TVSIQAAQPARRLSVQDRINLFENKQKEN--SGGNPVVVKSVELRRLSSDLSSSAGAVEK 348 Query: 728 AVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLND--TMASSV-K 558 AVLRRWSGASDMS+DLS+EKKD+ESPLCTP+S S++K NLN T +SSV K Sbjct: 349 AVLRRWSGASDMSIDLSAEKKDSESPLCTPAST----VVSQDKNVFNLNGEITESSSVAK 404 Query: 557 PESRIIPG---VAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQSRSFIVRT 387 PE ++IP V+ +G+ E DG KDQ GK+QSRS + R Sbjct: 405 PEIKVIPSLSRVSDSRLKGVSFNNSELASESNSSLGSGENDGLKDQVCGKNQSRSSLSRA 464 Query: 386 EDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQG 207 +D+E+ E K E ++GF D V+ KDQ Sbjct: 465 DDRESLGEDSTGV---KTEGILGFGD--------LGKLKDPRTGQEVSGPQAHIASKDQV 513 Query: 206 ALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTH---STSQKMVADSGQFEGLAGSRIR 36 + +Q+R F +KG Q EI N KE + ++ V QK + E +AGS+IR Sbjct: 514 SSSSQVRGFVSKGSEQFEIPNHKEDSRLGNEAVQQMKVKIVQKAAVEPRVLEEVAGSKIR 573 Query: 35 EAFAAHYKGTE 3 EAFA+H+KGT+ Sbjct: 574 EAFASHHKGTD 584 >ref|XP_012831833.1| PREDICTED: uncharacterized protein LOC105952765 [Erythranthe guttatus] gi|848847834|ref|XP_012831908.1| PREDICTED: uncharacterized 
protein LOC105952765 [Erythranthe guttatus] gi|604347749|gb|EYU45904.1| hypothetical protein MIMGU_mgv1a000216mg [Erythranthe guttata] Length = 1420 Score = 556 bits (1434), Expect = e-155 Identities = 329/614 (53%), Positives = 400/614 (65%), Gaps = 14/614 (2%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS++ LDYA FQLSP+ SRCELFVSS G+ EKLASGL+KPFVAHL+ AEE+V S + SV Sbjct: 1 MKSDSTLDYAEFQLSPKHSRCELFVSSGGSTEKLASGLLKPFVAHLQIAEERVASASLSV 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE G+ K ++TWFTKGTLERFVRFVSTPEVLELV+T DAEMSQLEAAR+IY++ GDQL Sbjct: 61 KLEVGKNKNAETWFTKGTLERFVRFVSTPEVLELVSTLDAEMSQLEAARRIYSQGAGDQL 120 Query: 1442 SGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAEF 1263 SGGG SG T + D TKKELLRAIDVRLV V+QDL+T AGFN DTVSELQMFA+ Sbjct: 121 SGGGGSGATAADDATKKELLRAIDVRLVAVRQDLSTACARAAAAGFNADTVSELQMFADR 180 Query: 1262 FGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPFP 1083 FG HRLNEAC K++SL ER +LI P +SG +D AVRSSYGSDMSID+DP+SP P Sbjct: 181 FGAHRLNEACSKFISLSERGPELIHPRKSGHEDRAVRSSYGSDMSIDDDPTSPPP----- 235 Query: 1082 VQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQNDAVVXXXXXXXXXXXSD 903 + + QQP PP ++FP++ TFSRESS +R+D NK ND V Sbjct: 236 -----DPETATYQQPNPPPVTFPLRRTFSRESSVDREDGNKTNDTVPEKDRKDESSSPDQ 290 Query: 902 LT--HTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLSSDVSSAPAAVEK 729 SQ RRLSVQDRIS+FENKQK+ SGGKPVV K+VELRR+SSD+SS+ VEK Sbjct: 291 SVPISASQPARRLSVQDRISMFENKQKDT--SGGKPVVVKAVELRRMSSDLSSSSTVVEK 348 Query: 728 AVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLNDTMA--SSV-K 558 VLRRWSGASDMS+DLS+EKKDTESP CTP+SA S++KK L LND A SSV K Sbjct: 349 GVLRRWSGASDMSIDLSAEKKDTESPSCTPTSA----VVSQDKKVLRLNDDNAEISSVSK 404 Query: 557 PESRIIPGVAKDA------XXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQSRSFI 396 PE ++IPG+ + + +G E DG +D GKS+S I Sbjct: 405 PEIKVIPGLVRGSDSRLKGISFNNSEQYFESTKSNSNLGLGESDGLEDAVRGKSRSSPSI 464 Query: 395 VRTEDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVK 216 EDQE+ +E F++ K +GF +Q + 
S+ ++T Sbjct: 465 SGGEDQESPKENFKTLTGGKKSGSVGFGNQGRSTGEELIG---------LGSQKKITGGN 515 Query: 215 DQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTH---STSQKMVADSGQFEGLAGS 45 D TQIR F KG Q+EI N KE E +++ V SQ+ + G EG GS Sbjct: 516 D----PTQIRPFLRKGDEQLEIPNQKEDSEPKNESVKKIPLKASQRSAVELGVLEGGPGS 571 Query: 44 RIREAFAAHYKGTE 3 RIR+AFA+ YKG E Sbjct: 572 RIRKAFASRYKGIE 585 >emb|CDO97814.1| unnamed protein product [Coffea canephora] Length = 1372 Score = 508 bits (1309), Expect = e-141 Identities = 319/621 (51%), Positives = 390/621 (62%), Gaps = 23/621 (3%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+TPLDY FQLSP+RSRCEL VSS GN EKLASGLVKPFVA+L+ AEEQV S+ Sbjct: 1 MKSDTPLDYVAFQLSPKRSRCELVVSSGGNTEKLASGLVKPFVANLRVAEEQVAMSVHSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE RQ+ ++ WFTKGTLERFVRFVSTPE+LEL NTFD EMSQLE+AR+IY++ G QL Sbjct: 61 KLEIERQQNAEVWFTKGTLERFVRFVSTPEILELANTFDTEMSQLESARRIYSQGTGQQL 120 Query: 1442 SGGG--KSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFA 1269 SG G SG +AD TKKELLRAIDVRL+ VQQDLTT AGFN DTV +LQMFA Sbjct: 121 SGSGGLGSGAAAAADATKKELLRAIDVRLLAVQQDLTTACARATAAGFNPDTVLDLQMFA 180 Query: 1268 EFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSP----- 1104 ++FG RLNEAC K++SLCERR DLI W++G D A+RSSYGSDMS+D++P+SP Sbjct: 181 DYFGALRLNEACGKFISLCERRPDLILTWKAGGDDPAIRSSYGSDMSVDDEPTSPDSLRF 240 Query: 1103 ---QPTGPFPVQIQHQEDPSICQQPKPPSLSFPVQCTF----SRESSTERDDSNKQNDAV 945 QP Q++ Q+ + P+L+ ++ +F S E+STE ++ +KQND + Sbjct: 241 GSRQPPRHEQQHSGQQQETDASQKYQHPNLATTLKPSFSLRKSGEASTEPEERSKQNDPL 300 Query: 944 VXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVELRRLS 765 SQ RRLSVQDRI+LFENKQKEN SGGKP V KS+E++RLS Sbjct: 301 ATEKEKK--------KEMSQPSRRLSVQDRINLFENKQKEN--SGGKPAVGKSIEIKRLS 350 Query: 764 SDVSS--APAAVEKAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKAL 591 SDVSS + AAVEKAVLRRWSGASDMS+DLS EK+DTESPLCTPSS+ E + A+ Sbjct: 351 SDVSSSASAAAVEKAVLRRWSGASDMSIDLSGEKRDTESPLCTPSSSE---IVEERQSAV 407 Query: 590 
NLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQ 411 + + + +S +S GV +G T + KDQT GK+Q Sbjct: 408 SSDKSGEASEGGKSNSTLGV----------------------IGVTAW---KDQTRGKTQ 442 Query: 410 SRSFIVRTEDQE-----NSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRV 246 SRSF+ R ED NSE KFRS P K E G D G+V Sbjct: 443 SRSFLNRAEDSRLDDLANSEPKFRSLPSGKAEE--GRSDNQPKFKGPEKRDDLVKTEGQV 500 Query: 245 ASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLV-THSTS-QKMVADS 72 SE QV KD+G Q Q F KG +E+S+ KE D L T+S + Q+ V Sbjct: 501 LSEAQVAGHKDKGTSQPQFGYFAGKG---IELSDQKEVGIRDDSLAQTYSRAPQRPVGKY 557 Query: 71 GQFEGLAGSRIREAFAAHYKG 9 EG +GSRIR+AFAA +KG Sbjct: 558 APQEGGSGSRIRDAFAAQHKG 578 >ref|XP_011079460.1| PREDICTED: uncharacterized protein LOC105162955 isoform X4 [Sesamum indicum] Length = 1363 Score = 493 bits (1270), Expect = e-136 Identities = 302/567 (53%), Positives = 363/567 (64%), Gaps = 11/567 (1%) Frame = -1 Query: 1670 HLKFAEEQVLSDAQSVKLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQ 1491 HL EEQV S AQSVKLE GR K + TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQ Sbjct: 7 HLSAHEEQVASAAQSVKLEVGRSKHAATWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQ 66 Query: 1490 LEAARKIYAEDRGDQLSGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXA 1311 LEAAR+IY++ GDQLSGGG SGVT + D TKKELLRAIDVRL V+QDL+T A Sbjct: 67 LEAARRIYSQGAGDQLSGGGGSGVTAADDATKKELLRAIDVRLAAVRQDLSTACTRAAAA 126 Query: 1310 GFNGDTVSELQMFAEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDM 1131 GFN DTVSELQMFA+ FG RLNEAC K++S+ + R +LI+P +SGT+ A+RSS GSDM Sbjct: 127 GFNIDTVSELQMFADKFGADRLNEACGKFISVSDSRPELINPCKSGTRGRALRSSCGSDM 186 Query: 1130 SIDEDPSSPQPTGPFPVQIQHQEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQND 951 SIDEDP++P P HQ P+ QQP PP L+FP++ TFSRESS ERDD NK ND Sbjct: 187 SIDEDPTTPPP---------HQGPPTF-QQPNPPPLTFPLRPTFSRESSVERDDGNKPND 236 Query: 950 AVVXXXXXXXXXXXSDLT--HTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVEL 777 AV + +Q RRLSVQDRI+LFENKQKEN SGG PVV KSVEL Sbjct: 237 AVPEKDRKDETSTSDETVSIQAAQPARRLSVQDRINLFENKQKEN--SGGNPVVVKSVEL 294 Query: 776 RRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKK 597 RRLSSD+SS+ 
AVEKAVLRRWSGASDMS+DLS+EKKD+ESPLCTP+S S++K Sbjct: 295 RRLSSDLSSSAGAVEKAVLRRWSGASDMSIDLSAEKKDSESPLCTPAST----VVSQDKN 350 Query: 596 ALNLND--TMASSV-KPESRIIPG---VAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSK 435 NLN T +SSV KPE ++IP V+ +G+ E DG K Sbjct: 351 VFNLNGEITESSSVAKPEIKVIPSLSRVSDSRLKGVSFNNSELASESNSSLGSGENDGLK 410 Query: 434 DQTHGKSQSRSFIVRTEDQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXX 255 DQ GK+QSRS + R +D+E+ E K E ++GF D Sbjct: 411 DQVCGKNQSRSSLSRADDRESLGEDSTGV---KTEGILGFGD--------LGKLKDPRTG 459 Query: 254 GRVASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTH---STSQKM 84 V+ KDQ + +Q+R F +KG Q EI N KE + ++ V QK Sbjct: 460 QEVSGPQAHIASKDQVSSSSQVRGFVSKGSEQFEIPNHKEDSRLGNEAVQQMKVKIVQKA 519 Query: 83 VADSGQFEGLAGSRIREAFAAHYKGTE 3 + E +AGS+IREAFA+H+KGT+ Sbjct: 520 AVEPRVLEEVAGSKIREAFASHHKGTD 546 >ref|XP_007024720.1| Uncharacterized protein isoform 6 [Theobroma cacao] gi|508780086|gb|EOY27342.1| Uncharacterized protein isoform 6 [Theobroma cacao] Length = 1415 Score = 469 bits (1207), Expect = e-129 Identities = 315/666 (47%), Positives = 385/666 (57%), Gaps = 66/666 (9%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+T LDYAVFQLSP+RSRCELFVSSNGN EKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE ++K ++TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAA++IY++ GDQ Sbjct: 61 KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG +G+T +AD TKKELLRAIDVRL+TVQQDL T AGFN DTVSELQ F Sbjct: 121 SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---------- 1122 A+ FG HRL+EAC K++SLC+RR +LISPW+ G D VR+S+GSDMSID Sbjct: 181 ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240 Query: 1121 ---------EDPSSPQPTGPFPVQIQH---QEDPSICQQPKPPSLSFPVQCTFSRESSTE 978 ++ Q P Q QH Q 
P+I QQPKP T + S E Sbjct: 241 VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKP-------SITTQQRSQNE 293 Query: 977 RDDSNKQNDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPV 798 + K+++ V + SQ RRLSVQDRI+LFENKQKE+S SGGKP+ Sbjct: 294 NKEEEKKDEGVTESSP----------SQVSQPARRLSVQDRINLFENKQKESSSSGGKPI 343 Query: 797 -VRKSVELRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD--TESPLCTPSSAS 627 V KSVELRRLSS+VSSAPA VEKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 344 AVGKSVELRRLSSEVSSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGSTDSPLCTPSSSS 403 Query: 626 GFLSKS----------EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXX 477 KS E+K L+D + SSVK E + G +DA Sbjct: 404 ASQGKSNVFQGLSEDKEQKDEKGLSDKV-SSVKVEPK--SGSGRDA-DSGLKDHGEVQVQ 459 Query: 476 XXXXMGTTEFDGSKDQTHGKSQ--------SRSFIVRTE-----DQENSEEKF------- 357 +G E G K + + K Q +SF ++E DQ S+EK Sbjct: 460 VGNSLGKEEDVGLKGRMNLKDQLGSQYNQYHQSFTSKSEQLELGDQVVSQEKVKGSLTGE 519 Query: 356 --------RSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGAL 201 R FPD V++G ++Q +V V D +G L Sbjct: 520 RGGSEVQSRVFPDK--AVIVGVKNQ-------------PTSQAQVGVADTVGDAMSEGEL 564 Query: 200 QTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIREAFAA 21 + ++ G ++Q M +L S+ + SGQFEG G + +E A Sbjct: 565 KNRVEAQG------------EDQSTMHLRLRAQGHSRTL---SGQFEGSIGLKTKE---A 606 Query: 20 HYKGTE 3 Y GTE Sbjct: 607 QYIGTE 612 >ref|XP_007024719.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508780085|gb|EOY27341.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1444 Score = 469 bits (1207), Expect = e-129 Identities = 315/666 (47%), Positives = 385/666 (57%), Gaps = 66/666 (9%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+T LDYAVFQLSP+RSRCELFVSSNGN EKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE ++K ++TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAA++IY++ GDQ Sbjct: 61 KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120 Query: 
1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG +G+T +AD TKKELLRAIDVRL+TVQQDL T AGFN DTVSELQ F Sbjct: 121 SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---------- 1122 A+ FG HRL+EAC K++SLC+RR +LISPW+ G D VR+S+GSDMSID Sbjct: 181 ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240 Query: 1121 ---------EDPSSPQPTGPFPVQIQH---QEDPSICQQPKPPSLSFPVQCTFSRESSTE 978 ++ Q P Q QH Q P+I QQPKP T + S E Sbjct: 241 VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKP-------SITTQQRSQNE 293 Query: 977 RDDSNKQNDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPV 798 + K+++ V + SQ RRLSVQDRI+LFENKQKE+S SGGKP+ Sbjct: 294 NKEEEKKDEGVTESSP----------SQVSQPARRLSVQDRINLFENKQKESSSSGGKPI 343 Query: 797 -VRKSVELRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD--TESPLCTPSSAS 627 V KSVELRRLSS+VSSAPA VEKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 344 AVGKSVELRRLSSEVSSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGSTDSPLCTPSSSS 403 Query: 626 GFLSKS----------EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXX 477 KS E+K L+D + SSVK E + G +DA Sbjct: 404 ASQGKSNVFQGLSEDKEQKDEKGLSDKV-SSVKVEPK--SGSGRDA-DSGLKDHGEVQVQ 459 Query: 476 XXXXMGTTEFDGSKDQTHGKSQ--------SRSFIVRTE-----DQENSEEKF------- 357 +G E G K + + K Q +SF ++E DQ S+EK Sbjct: 460 VGNSLGKEEDVGLKGRMNLKDQLGSQYNQYHQSFTSKSEQLELGDQVVSQEKVKGSLTGE 519 Query: 356 --------RSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGAL 201 R FPD V++G ++Q +V V D +G L Sbjct: 520 RGGSEVQSRVFPDK--AVIVGVKNQ-------------PTSQAQVGVADTVGDAMSEGEL 564 Query: 200 QTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIREAFAA 21 + ++ G ++Q M +L S+ + SGQFEG G + +E A Sbjct: 565 KNRVEAQG------------EDQSTMHLRLRAQGHSRTL---SGQFEGSIGLKTKE---A 606 Query: 20 HYKGTE 3 Y GTE Sbjct: 607 QYIGTE 612 >ref|XP_007024718.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508780084|gb|EOY27340.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 1400 Score = 469 bits (1207), Expect = e-129 Identities = 315/666 
(47%), Positives = 385/666 (57%), Gaps = 66/666 (9%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+T LDYAVFQLSP+RSRCELFVSSNGN EKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE ++K ++TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAA++IY++ GDQ Sbjct: 61 KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG +G+T +AD TKKELLRAIDVRL+TVQQDL T AGFN DTVSELQ F Sbjct: 121 SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---------- 1122 A+ FG HRL+EAC K++SLC+RR +LISPW+ G D VR+S+GSDMSID Sbjct: 181 ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240 Query: 1121 ---------EDPSSPQPTGPFPVQIQH---QEDPSICQQPKPPSLSFPVQCTFSRESSTE 978 ++ Q P Q QH Q P+I QQPKP T + S E Sbjct: 241 VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKP-------SITTQQRSQNE 293 Query: 977 RDDSNKQNDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPV 798 + K+++ V + SQ RRLSVQDRI+LFENKQKE+S SGGKP+ Sbjct: 294 NKEEEKKDEGVTESSP----------SQVSQPARRLSVQDRINLFENKQKESSSSGGKPI 343 Query: 797 -VRKSVELRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD--TESPLCTPSSAS 627 V KSVELRRLSS+VSSAPA VEKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 344 AVGKSVELRRLSSEVSSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGSTDSPLCTPSSSS 403 Query: 626 GFLSKS----------EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXX 477 KS E+K L+D + SSVK E + G +DA Sbjct: 404 ASQGKSNVFQGLSEDKEQKDEKGLSDKV-SSVKVEPK--SGSGRDA-DSGLKDHGEVQVQ 459 Query: 476 XXXXMGTTEFDGSKDQTHGKSQ--------SRSFIVRTE-----DQENSEEKF------- 357 +G E G K + + K Q +SF ++E DQ S+EK Sbjct: 460 VGNSLGKEEDVGLKGRMNLKDQLGSQYNQYHQSFTSKSEQLELGDQVVSQEKVKGSLTGE 519 Query: 356 --------RSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGAL 201 R FPD V++G ++Q +V V D +G L Sbjct: 520 
RGGSEVQSRVFPDK--AVIVGVKNQ-------------PTSQAQVGVADTVGDAMSEGEL 564 Query: 200 QTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIREAFAA 21 + ++ G ++Q M +L S+ + SGQFEG G + +E A Sbjct: 565 KNRVEAQG------------EDQSTMHLRLRAQGHSRTL---SGQFEGSIGLKTKE---A 606 Query: 20 HYKGTE 3 Y GTE Sbjct: 607 QYIGTE 612 >ref|XP_007024717.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508780083|gb|EOY27339.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1431 Score = 469 bits (1207), Expect = e-129 Identities = 315/666 (47%), Positives = 385/666 (57%), Gaps = 66/666 (9%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+T LDYAVFQLSP+RSRCELFVSSNGN EKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE ++K ++TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAA++IY++ GDQ Sbjct: 61 KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG +G+T +AD TKKELLRAIDVRL+TVQQDL T AGFN DTVSELQ F Sbjct: 121 SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---------- 1122 A+ FG HRL+EAC K++SLC+RR +LISPW+ G D VR+S+GSDMSID Sbjct: 181 ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240 Query: 1121 ---------EDPSSPQPTGPFPVQIQH---QEDPSICQQPKPPSLSFPVQCTFSRESSTE 978 ++ Q P Q QH Q P+I QQPKP T + S E Sbjct: 241 VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKP-------SITTQQRSQNE 293 Query: 977 RDDSNKQNDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPV 798 + K+++ V + SQ RRLSVQDRI+LFENKQKE+S SGGKP+ Sbjct: 294 NKEEEKKDEGVTESSP----------SQVSQPARRLSVQDRINLFENKQKESSSSGGKPI 343 Query: 797 -VRKSVELRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD--TESPLCTPSSAS 627 V KSVELRRLSS+VSSAPA VEKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 344 
AVGKSVELRRLSSEVSSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGSTDSPLCTPSSSS 403 Query: 626 GFLSKS----------EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXX 477 KS E+K L+D + SSVK E + G +DA Sbjct: 404 ASQGKSNVFQGLSEDKEQKDEKGLSDKV-SSVKVEPK--SGSGRDA-DSGLKDHGEVQVQ 459 Query: 476 XXXXMGTTEFDGSKDQTHGKSQ--------SRSFIVRTE-----DQENSEEKF------- 357 +G E G K + + K Q +SF ++E DQ S+EK Sbjct: 460 VGNSLGKEEDVGLKGRMNLKDQLGSQYNQYHQSFTSKSEQLELGDQVVSQEKVKGSLTGE 519 Query: 356 --------RSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGAL 201 R FPD V++G ++Q +V V D +G L Sbjct: 520 RGGSEVQSRVFPDK--AVIVGVKNQ-------------PTSQAQVGVADTVGDAMSEGEL 564 Query: 200 QTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIREAFAA 21 + ++ G ++Q M +L S+ + SGQFEG G + +E A Sbjct: 565 KNRVEAQG------------EDQSTMHLRLRAQGHSRTL---SGQFEGSIGLKTKE---A 606 Query: 20 HYKGTE 3 Y GTE Sbjct: 607 QYIGTE 612 >ref|XP_007024715.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590621133|ref|XP_007024716.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508780081|gb|EOY27337.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508780082|gb|EOY27338.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1428 Score = 469 bits (1207), Expect = e-129 Identities = 315/666 (47%), Positives = 385/666 (57%), Gaps = 66/666 (9%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+T LDYAVFQLSP+RSRCELFVSSNGN EKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE ++K ++TWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAA++IY++ GDQ Sbjct: 61 KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG +G+T +AD TKKELLRAIDVRL+TVQQDL T AGFN DTVSELQ F Sbjct: 121 SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---------- 
1122 A+ FG HRL+EAC K++SLC+RR +LISPW+ G D VR+S+GSDMSID Sbjct: 181 ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240 Query: 1121 ---------EDPSSPQPTGPFPVQIQH---QEDPSICQQPKPPSLSFPVQCTFSRESSTE 978 ++ Q P Q QH Q P+I QQPKP T + S E Sbjct: 241 VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKP-------SITTQQRSQNE 293 Query: 977 RDDSNKQNDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPV 798 + K+++ V + SQ RRLSVQDRI+LFENKQKE+S SGGKP+ Sbjct: 294 NKEEEKKDEGVTESSP----------SQVSQPARRLSVQDRINLFENKQKESSSSGGKPI 343 Query: 797 -VRKSVELRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD--TESPLCTPSSAS 627 V KSVELRRLSS+VSSAPA VEKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 344 AVGKSVELRRLSSEVSSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGSTDSPLCTPSSSS 403 Query: 626 GFLSKS----------EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXX 477 KS E+K L+D + SSVK E + G +DA Sbjct: 404 ASQGKSNVFQGLSEDKEQKDEKGLSDKV-SSVKVEPK--SGSGRDA-DSGLKDHGEVQVQ 459 Query: 476 XXXXMGTTEFDGSKDQTHGKSQ--------SRSFIVRTE-----DQENSEEKF------- 357 +G E G K + + K Q +SF ++E DQ S+EK Sbjct: 460 VGNSLGKEEDVGLKGRMNLKDQLGSQYNQYHQSFTSKSEQLELGDQVVSQEKVKGSLTGE 519 Query: 356 --------RSFPDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGAL 201 R FPD V++G ++Q +V V D +G L Sbjct: 520 RGGSEVQSRVFPDK--AVIVGVKNQ-------------PTSQAQVGVADTVGDAMSEGEL 564 Query: 200 QTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIREAFAA 21 + ++ G ++Q M +L S+ + SGQFEG G + +E A Sbjct: 565 KNRVEAQG------------EDQSTMHLRLRAQGHSRTL---SGQFEGSIGLKTKE---A 606 Query: 20 HYKGTE 3 Y GTE Sbjct: 607 QYIGTE 612 >ref|XP_012068836.1| PREDICTED: uncharacterized protein LOC105631354 isoform X3 [Jatropha curcas] Length = 1409 Score = 469 bits (1206), Expect = e-129 Identities = 307/634 (48%), Positives = 377/634 (59%), Gaps = 34/634 (5%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+TPLDYAVFQLSP+ SRCELFVS +GN EKLASGLVKPFV HLK AEEQV S+ Sbjct: 1 MKSDTPLDYAVFQLSPKHSRCELFVSRSGNTEKLASGLVKPFVTHLKVAEEQVAQAVHSI 60 Query: 1622 
KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE R K + TWFTKGTLERFVRFVSTPEVLE+VNTFDAEMSQLE ARKIY++ DQL Sbjct: 61 KLEVERHKNADTWFTKGTLERFVRFVSTPEVLEMVNTFDAEMSQLEGARKIYSQGTSDQL 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG ++G +AD TKKELLRAIDVRL V+QDLTT AGFN +TVSEL +F Sbjct: 121 SSALGGDETGTVAAADATKKELLRAIDVRLAAVRQDLTTACARASAAGFNPETVSELHLF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---EDPSSPQ 1101 ++ FG RLNEAC K++S+CERR DL++ W++ +D +R+S GSDMSID EDP+ P Sbjct: 181 SDCFGARRLNEACTKFISVCERRPDLVNTWKTRVEDQVLRASCGSDMSIDDPTEDPNGPH 240 Query: 1100 PTGPFPVQIQH-QEDPSICQQPKPPSLSFPVQ-------CTFSRESSTERDDSNKQNDAV 945 P Q+ Q++ Q+ K P+L+ +Q TF SS + NK+ Sbjct: 241 DVKPHQSSFQNKQQNQQAGQEQKQPNLTQTLQHLNQSKPSTFHSSSSVSTQNENKEGYKK 300 Query: 944 VXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKP-VVRKSVELRRL 768 + SQ RRLSVQDRI+LFENKQKEN SGGKP VV KSVELRRL Sbjct: 301 EESTTESLP------SQPSQPARRLSVQDRINLFENKQKEN--SGGKPAVVGKSVELRRL 352 Query: 767 SSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD---TESPLCTPSSASGFLSKS---- 609 SSDVSSAP EKAVLRRWSGASDMS+DL ++KKD +SP+CTPSS+S SKS Sbjct: 353 SSDVSSAP---EKAVLRRWSGASDMSIDLGNDKKDFNSADSPICTPSSSSVSQSKSDVFP 409 Query: 608 ----EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDG 441 + K LNDT+ SSVK E++ + G G + Sbjct: 410 SSSADYKDHKGLNDTV-SSVKVEAKNVSGFKDQDELQTPPGGFIGKDEEVGLKGKVNW-- 466 Query: 440 SKDQTHGKSQSRSFIVRTE----DQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXX 273 KDQ + Q R+F R E DQ +EKF+SF + E + G + Q Sbjct: 467 -KDQVGSQPQLRAFAGRGEQVGVDQGVRDEKFKSFL-GRDEKITGIKFQGGFDGKLRD-- 522 Query: 272 XXXXXXGRVASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTS 93 + + V DQ LQT++ F K G+VE N E ++RDQ +HS Sbjct: 523 --------YSDREETAGVNDQSELQTEVGNFVGK-LGEVESGNRVEDVKVRDQPQSHSRF 573 Query: 92 Q----KMVADSGQFEGLAGSRIREAFAAHYKGTE 3 + + SGQFEG G +++E YK TE Sbjct: 574 RGSHIHTRSLSGQFEGGFGGKVKE---VGYKETE 604 >ref|XP_012068835.1| PREDICTED: uncharacterized protein LOC105631354 isoform X2 [Jatropha curcas] Length = 1409 Score = 469 
bits (1206), Expect = e-129 Identities = 307/634 (48%), Positives = 377/634 (59%), Gaps = 34/634 (5%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+TPLDYAVFQLSP+ SRCELFVS +GN EKLASGLVKPFV HLK AEEQV S+ Sbjct: 1 MKSDTPLDYAVFQLSPKHSRCELFVSRSGNTEKLASGLVKPFVTHLKVAEEQVAQAVHSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE R K + TWFTKGTLERFVRFVSTPEVLE+VNTFDAEMSQLE ARKIY++ DQL Sbjct: 61 KLEVERHKNADTWFTKGTLERFVRFVSTPEVLEMVNTFDAEMSQLEGARKIYSQGTSDQL 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG ++G +AD TKKELLRAIDVRL V+QDLTT AGFN +TVSEL +F Sbjct: 121 SSALGGDETGTVAAADATKKELLRAIDVRLAAVRQDLTTACARASAAGFNPETVSELHLF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---EDPSSPQ 1101 ++ FG RLNEAC K++S+CERR DL++ W++ +D +R+S GSDMSID EDP+ P Sbjct: 181 SDCFGARRLNEACTKFISVCERRPDLVNTWKTRVEDQVLRASCGSDMSIDDPTEDPNGPH 240 Query: 1100 PTGPFPVQIQH-QEDPSICQQPKPPSLSFPVQ-------CTFSRESSTERDDSNKQNDAV 945 P Q+ Q++ Q+ K P+L+ +Q TF SS + NK+ Sbjct: 241 DVKPHQSSFQNKQQNQQAGQEQKQPNLTQTLQHLNQSKPSTFHSSSSVSTQNENKEGYKK 300 Query: 944 VXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKP-VVRKSVELRRL 768 + SQ RRLSVQDRI+LFENKQKEN SGGKP VV KSVELRRL Sbjct: 301 EESTTESLP------SQPSQPARRLSVQDRINLFENKQKEN--SGGKPAVVGKSVELRRL 352 Query: 767 SSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD---TESPLCTPSSASGFLSKS---- 609 SSDVSSAP EKAVLRRWSGASDMS+DL ++KKD +SP+CTPSS+S SKS Sbjct: 353 SSDVSSAP---EKAVLRRWSGASDMSIDLGNDKKDFNSADSPICTPSSSSVSQSKSDVFP 409 Query: 608 ----EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDG 441 + K LNDT+ SSVK E++ + G G + Sbjct: 410 SSSADYKDHKGLNDTV-SSVKVEAKNVSGFKDQDELQTPPGGFIGKDEEVGLKGKVNW-- 466 Query: 440 SKDQTHGKSQSRSFIVRTE----DQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXX 273 KDQ + Q R+F R E DQ +EKF+SF + E + G + Q Sbjct: 467 -KDQVGSQPQLRAFAGRGEQVGVDQGVRDEKFKSFL-GRDEKITGIKFQGGFDGKLRD-- 522 Query: 272 XXXXXXGRVASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTS 93 + + V DQ LQT++ F K G+VE N 
E ++RDQ +HS Sbjct: 523 --------YSDREETAGVNDQSELQTEVGNFVGK-LGEVESGNRVEDVKVRDQPQSHSRF 573 Query: 92 Q----KMVADSGQFEGLAGSRIREAFAAHYKGTE 3 + + SGQFEG G +++E YK TE Sbjct: 574 RGSHIHTRSLSGQFEGGFGGKVKE---VGYKETE 604 >ref|XP_012068833.1| PREDICTED: uncharacterized protein LOC105631354 isoform X1 [Jatropha curcas] Length = 1416 Score = 469 bits (1206), Expect = e-129 Identities = 307/634 (48%), Positives = 377/634 (59%), Gaps = 34/634 (5%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+TPLDYAVFQLSP+ SRCELFVS +GN EKLASGLVKPFV HLK AEEQV S+ Sbjct: 1 MKSDTPLDYAVFQLSPKHSRCELFVSRSGNTEKLASGLVKPFVTHLKVAEEQVAQAVHSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE R K + TWFTKGTLERFVRFVSTPEVLE+VNTFDAEMSQLE ARKIY++ DQL Sbjct: 61 KLEVERHKNADTWFTKGTLERFVRFVSTPEVLEMVNTFDAEMSQLEGARKIYSQGTSDQL 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG ++G +AD TKKELLRAIDVRL V+QDLTT AGFN +TVSEL +F Sbjct: 121 SSALGGDETGTVAAADATKKELLRAIDVRLAAVRQDLTTACARASAAGFNPETVSELHLF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---EDPSSPQ 1101 ++ FG RLNEAC K++S+CERR DL++ W++ +D +R+S GSDMSID EDP+ P Sbjct: 181 SDCFGARRLNEACTKFISVCERRPDLVNTWKTRVEDQVLRASCGSDMSIDDPTEDPNGPH 240 Query: 1100 PTGPFPVQIQH-QEDPSICQQPKPPSLSFPVQ-------CTFSRESSTERDDSNKQNDAV 945 P Q+ Q++ Q+ K P+L+ +Q TF SS + NK+ Sbjct: 241 DVKPHQSSFQNKQQNQQAGQEQKQPNLTQTLQHLNQSKPSTFHSSSSVSTQNENKEGYKK 300 Query: 944 VXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKP-VVRKSVELRRL 768 + SQ RRLSVQDRI+LFENKQKEN SGGKP VV KSVELRRL Sbjct: 301 EESTTESLP------SQPSQPARRLSVQDRINLFENKQKEN--SGGKPAVVGKSVELRRL 352 Query: 767 SSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD---TESPLCTPSSASGFLSKS---- 609 SSDVSSAP EKAVLRRWSGASDMS+DL ++KKD +SP+CTPSS+S SKS Sbjct: 353 SSDVSSAP---EKAVLRRWSGASDMSIDLGNDKKDFNSADSPICTPSSSSVSQSKSDVFP 409 Query: 608 ----EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDG 441 + K LNDT+ SSVK E++ + G G + Sbjct: 410 
SSSADYKDHKGLNDTV-SSVKVEAKNVSGFKDQDELQTPPGGFIGKDEEVGLKGKVNW-- 466 Query: 440 SKDQTHGKSQSRSFIVRTE----DQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXX 273 KDQ + Q R+F R E DQ +EKF+SF + E + G + Q Sbjct: 467 -KDQVGSQPQLRAFAGRGEQVGVDQGVRDEKFKSFL-GRDEKITGIKFQGGFDGKLRD-- 522 Query: 272 XXXXXXGRVASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTS 93 + + V DQ LQT++ F K G+VE N E ++RDQ +HS Sbjct: 523 --------YSDREETAGVNDQSELQTEVGNFVGK-LGEVESGNRVEDVKVRDQPQSHSRF 573 Query: 92 Q----KMVADSGQFEGLAGSRIREAFAAHYKGTE 3 + + SGQFEG G +++E YK TE Sbjct: 574 RGSHIHTRSLSGQFEGGFGGKVKE---VGYKETE 604 >gb|KDP40661.1| hypothetical protein JCGZ_24660 [Jatropha curcas] Length = 1419 Score = 469 bits (1206), Expect = e-129 Identities = 307/634 (48%), Positives = 377/634 (59%), Gaps = 34/634 (5%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+TPLDYAVFQLSP+ SRCELFVS +GN EKLASGLVKPFV HLK AEEQV S+ Sbjct: 1 MKSDTPLDYAVFQLSPKHSRCELFVSRSGNTEKLASGLVKPFVTHLKVAEEQVAQAVHSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE R K + TWFTKGTLERFVRFVSTPEVLE+VNTFDAEMSQLE ARKIY++ DQL Sbjct: 61 KLEVERHKNADTWFTKGTLERFVRFVSTPEVLEMVNTFDAEMSQLEGARKIYSQGTSDQL 120 Query: 1442 S---GGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 S GG ++G +AD TKKELLRAIDVRL V+QDLTT AGFN +TVSEL +F Sbjct: 121 SSALGGDETGTVAAADATKKELLRAIDVRLAAVRQDLTTACARASAAGFNPETVSELHLF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSID---EDPSSPQ 1101 ++ FG RLNEAC K++S+CERR DL++ W++ +D +R+S GSDMSID EDP+ P Sbjct: 181 SDCFGARRLNEACTKFISVCERRPDLVNTWKTRVEDQVLRASCGSDMSIDDPTEDPNGPH 240 Query: 1100 PTGPFPVQIQH-QEDPSICQQPKPPSLSFPVQ-------CTFSRESSTERDDSNKQNDAV 945 P Q+ Q++ Q+ K P+L+ +Q TF SS + NK+ Sbjct: 241 DVKPHQSSFQNKQQNQQAGQEQKQPNLTQTLQHLNQSKPSTFHSSSSVSTQNENKEGYKK 300 Query: 944 VXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKP-VVRKSVELRRL 768 + SQ RRLSVQDRI+LFENKQKEN SGGKP VV KSVELRRL Sbjct: 301 EESTTESLP------SQPSQPARRLSVQDRINLFENKQKEN--SGGKPAVVGKSVELRRL 352 Query: 767 
SSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD---TESPLCTPSSASGFLSKS---- 609 SSDVSSAP EKAVLRRWSGASDMS+DL ++KKD +SP+CTPSS+S SKS Sbjct: 353 SSDVSSAP---EKAVLRRWSGASDMSIDLGNDKKDFNSADSPICTPSSSSVSQSKSDVFP 409 Query: 608 ----EEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDG 441 + K LNDT+ SSVK E++ + G G + Sbjct: 410 SSSADYKDHKGLNDTV-SSVKVEAKNVSGFKDQDELQTPPGGFIGKDEEVGLKGKVNW-- 466 Query: 440 SKDQTHGKSQSRSFIVRTE----DQENSEEKFRSFPDSKHEVLIGFRDQXXXXXXXXXXX 273 KDQ + Q R+F R E DQ +EKF+SF + E + G + Q Sbjct: 467 -KDQVGSQPQLRAFAGRGEQVGVDQGVRDEKFKSFL-GRDEKITGIKFQGGFDGKLRD-- 522 Query: 272 XXXXXXGRVASETQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTS 93 + + V DQ LQT++ F K G+VE N E ++RDQ +HS Sbjct: 523 --------YSDREETAGVNDQSELQTEVGNFVGK-LGEVESGNRVEDVKVRDQPQSHSRF 573 Query: 92 Q----KMVADSGQFEGLAGSRIREAFAAHYKGTE 3 + + SGQFEG G +++E YK TE Sbjct: 574 RGSHIHTRSLSGQFEGGFGGKVKE---VGYKETE 604 >ref|XP_012856437.1| PREDICTED: uncharacterized protein LOC105975755 [Erythranthe guttatus] Length = 1230 Score = 461 bits (1187), Expect = e-127 Identities = 299/610 (49%), Positives = 359/610 (58%), Gaps = 9/610 (1%) Frame = -1 Query: 1805 KMKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQS 1626 KMK + PLD+AVFQLSP+RSRCELFVS G+ EKLASGLVKPF+AHLK AEEQV S+AQS Sbjct: 3 KMKMDAPLDFAVFQLSPKRSRCELFVSRGGSTEKLASGLVKPFIAHLKVAEEQVASNAQS 62 Query: 1625 VKLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQ 1446 VKLE GR++ + WFTKGTLERFVRFVSTPE+LELVNTFDAEMSQLEAAR+IY++ GDQ Sbjct: 63 VKLEIGRRRNGEAWFTKGTLERFVRFVSTPEILELVNTFDAEMSQLEAARRIYSQGAGDQ 122 Query: 1445 LSGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFAE 1266 LSGG SG + D TKKELLRAID+RL VQQDL+ AGFN DTVSELQMFA+ Sbjct: 123 LSGGSGSGAKAADDATKKELLRAIDLRLAAVQQDLSATCARADAAGFNVDTVSELQMFAD 182 Query: 1265 FFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTGPF 1086 FG HRLNEAC K++SL ERR +LI+ W+ G +D A+RSS GSDMSID+D Sbjct: 183 RFGAHRLNEACGKFISLSERRPNLINQWKPGPEDRALRSSCGSDMSIDDD---------- 232 Query: 1085 
PVQIQHQEDPSICQ-QPKPPSLSFPVQCTFSRESSTER--DDSNKQNDAVVXXXXXXXXX 915 + + D + CQ PP+ +FP + FSRESS E D NK NDA Sbjct: 233 --SLPTRHDSATCQPSDPPPATTFPSRRPFSRESSVEEKDDGDNKWNDAFGEKETKDDAP 290 Query: 914 XXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPVV--RKSVELRRLSSDVSSAPA 741 S H RRLSVQDRISLFENKQKEN SGGKPVV K VELRRLSSDVS+ + Sbjct: 291 -----VQASHHARRLSVQDRISLFENKQKEN--SGGKPVVPPAKPVELRRLSSDVSAMGS 343 Query: 740 AVEKAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEKKALNLNDTM---A 570 A VLRRWSGASDMS+DL EKKD E P + S+E K LNLND + + Sbjct: 344 AAAAVVLRRWSGASDMSLDLGVEKKDAEIP-----------AVSQENKGLNLNDGIVKNS 392 Query: 569 SSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTTEFDGSKDQTHGKSQSRSFIVR 390 S VK E ++IPG+ ++ M F GSK Q S+S S I Sbjct: 393 SVVKTEIKVIPGLIRNNSEHFTKSNSDLVSGGSSGMNDRMF-GSKTQ----SRSSSTISL 447 Query: 389 TEDQENSEEKFRSF-PDSKHEVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKD 213 E+ +NSEE+ F +S + L G + +S + + VK Sbjct: 448 AENLDNSEERSTVFRGESVSDFLYG--------------------HYQGSSVEKSSSVKQ 487 Query: 212 QGALQTQIRTFGTKGGGQVEISNCKEQYEMRDQLVTHSTSQKMVADSGQFEGLAGSRIRE 33 +G + T +++S G +GSRIR+ Sbjct: 488 RGGREDSESPVDTDEQTNLKLSRSN-------------------------TGESGSRIRD 522 Query: 32 AFAAHYKGTE 3 AFAAH K TE Sbjct: 523 AFAAHSKETE 532 >ref|XP_009763170.1| PREDICTED: uncharacterized protein LOC104215122 isoform X1 [Nicotiana sylvestris] gi|698532705|ref|XP_009763171.1| PREDICTED: uncharacterized protein LOC104215122 isoform X1 [Nicotiana sylvestris] Length = 1451 Score = 460 bits (1183), Expect = e-126 Identities = 313/653 (47%), Positives = 381/653 (58%), Gaps = 53/653 (8%) Frame = -1 Query: 1802 MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 M+S+T LD+AVFQLSP+RSRCELFVSS+GN EKLASGL+KPFV HLK AEEQV QS+ Sbjct: 1 MESSTLLDFAVFQLSPKRSRCELFVSSDGNTEKLASGLLKPFVTHLKVAEEQVALAVQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQL 1443 KLE R+K S+TWFTKGTLERFVRFVSTPEVLELVN D EMSQLEAAR+IY++ G Q Sbjct: 61 KLEVKRRKNSETWFTKGTLERFVRFVSTPEVLELVNILDVEMSQLEAARRIYSQGEGYQF 120 Query: 1442 
--SGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMFA 1269 +G G SGV +AD TKKELLRAIDVRL V+QDLTT AGFN DTVSELQMFA Sbjct: 121 NSTGSGGSGVMVAADATKKELLRAIDVRLTAVRQDLTTASSRAAAAGFNLDTVSELQMFA 180 Query: 1268 EFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDG-AVRSSYGSDMSIDEDP--SSPQP 1098 + FG HRL+EAC K++SL ERR DLI+PW+ +D AVR SYGSDMSID+DP S+ P Sbjct: 181 DQFGAHRLSEACNKFISLTERRPDLINPWKGVQRDNQAVRCSYGSDMSIDDDPAISNQPP 240 Query: 1097 T-------GPFPVQIQH------QEDPSICQQPKPPSLSFPVQCTFSRESSTERDDSNKQ 957 T G + ++ Q Q PSICQQP P SRESS E ++ +K+ Sbjct: 241 TLPHSTSRGAYSIKQQRHPQHLDQYMPSICQQPTP-------LLQHSRESSVESEEKSKE 293 Query: 956 NDAVV-XXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKPVVRKSVE 780 D +V + S+H RRLSVQDRISLFENKQKE+SGS GKPVV K E Sbjct: 294 RDVIVEKEKEDDTSSQKAKSAELSRHKRRLSVQDRISLFENKQKESSGSVGKPVVGKLAE 353 Query: 779 LRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKDTESPLCTPSSASGFLSKSEEK 600 L+RLSSDV SAP VEKAVLRRWSGASDMS+DLS + KDTESP CTPSSAS S S+++ Sbjct: 354 LQRLSSDV-SAPPVVEKAVLRRWSGASDMSIDLSGD-KDTESPQCTPSSASVSQSNSKDQ 411 Query: 599 KALNLNDTMA---------SSVKPESRIIP------GVAKDAXXXXXXXXXXXXXXXXXX 465 K L D ++ S+ ESR+ VA Sbjct: 412 KTSVLTDGVSFGGSNSCNVPSMVSESRLNEQTDANLRVAYTNEKEEVAGAERLTGSCGNI 471 Query: 464 MGTTEF-----------DGSKDQTHGKSQSRSFIVRTEDQENSEE-----KFRSFPDSKH 333 ++EF DG KDQ GK++S S I R ED+ + +F + P+SK Sbjct: 472 DDSSEFTPNSNSRIFDSDGWKDQACGKTKSISLIRRDEDKSLKNQLKPGGQFLTSPESKS 531 Query: 332 EVLIGFRDQXXXXXXXXXXXXXXXXXGRVASETQVTDVKDQGALQTQIRTFGTKGGGQVE 153 + + +V QV +K+ GA Q Q + Sbjct: 532 DEIA----LTSNSEFTGSQGGNELGGSKVLLVHQVPGLKNNGAQQGPESV-------QAK 580 Query: 152 ISNCKEQYEMRDQLVTH---STSQKMVADSGQFEGLAGSRIREAFAAHYKGTE 3 I N +E + V+ SQ+ DS Q + + + E+FAA KG E Sbjct: 581 IRNHQEVLGSSNHSVSQLRDKASQRTTEDSVQLDSSSRLEVAESFAA--KGVE 631 >ref|XP_012454538.1| PREDICTED: uncharacterized protein LOC105776438 isoform X3 [Gossypium raimondii] Length = 1397 Score = 457 bits (1177), Expect = e-125 Identities = 304/647 (46%), Positives = 374/647 (57%), Gaps = 47/647 (7%) Frame = -1 Query: 1802 
MKSNTPLDYAVFQLSPRRSRCELFVSSNGNMEKLASGLVKPFVAHLKFAEEQVLSDAQSV 1623 MKS+ P+DYAVFQLSP+RSRCELFVSSNGNMEKLASGLVKPFV HLK AEEQV QS+ Sbjct: 1 MKSDMPIDYAVFQLSPKRSRCELFVSSNGNMEKLASGLVKPFVTHLKVAEEQVALSLQSI 60 Query: 1622 KLEAGRQKRSKTWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYAEDRGDQ- 1446 KLE ++K +KTWFTKGT+ERFVRFVSTPEVLELVNTFDAEMSQLEAAR+IY++ GDQ Sbjct: 61 KLEVEKRKDAKTWFTKGTVERFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGAGDQP 120 Query: 1445 --LSGGGKSGVTTSADVTKKELLRAIDVRLVTVQQDLTTXXXXXXXAGFNGDTVSELQMF 1272 SGG +G+T +AD TKKELLRAIDVRL+ VQQDL T AGFN DTVSELQ F Sbjct: 121 SGASGGDAAGMTAAADATKKELLRAIDVRLIAVQQDLATAFARASSAGFNSDTVSELQQF 180 Query: 1271 AEFFGGHRLNEACCKYMSLCERRADLISPWRSGTQDGAVRSSYGSDMSIDEDPSSPQPTG 1092 A++FG HRLNEAC K+MSLC+RR +LI W+ D VR+S+GSDMSID DP Q Sbjct: 181 ADWFGAHRLNEACTKFMSLCQRRPELICRWKPSLDDQVVRASWGSDMSID-DPDEDQVGS 239 Query: 1091 PF------PVQIQHQEDPS---------ICQQPKPPSLSFPVQCTFSRESSTERDDSNKQ 957 P Q +HQE I Q S + + S +E+ + K+ Sbjct: 240 NVNSRPHQPSQNRHQEQQQSNTMQTQHHIGQSKSATSQQPKLSSATQQHSESEKREEEKK 299 Query: 956 NDAVVXXXXXXXXXXXSDLTHTSQHVRRLSVQDRISLFENKQKENSGSGGKP-VVRKSVE 780 + S + SQ RRLSVQDRI+LFENKQKE+S SGGKP V KSVE Sbjct: 300 EEG----------RFESSPSQISQPARRLSVQDRINLFENKQKESSSSGGKPTAVGKSVE 349 Query: 779 LRRLSSDVSSAPAAVEKAVLRRWSGASDMSVDLSSEKKD-TESPLCTPSSAS-------- 627 L+RL SDVS+A A EKAVLRRWSGASDMS+DL ++KKD T+SPLCTPSS+S Sbjct: 350 LKRLPSDVSAAAAVAEKAVLRRWSGASDMSIDLGNDKKDNTDSPLCTPSSSSVSQGKNYM 409 Query: 626 --GFLSKSEEKKALNLNDTMASSVKPESRIIPGVAKDAXXXXXXXXXXXXXXXXXXMGTT 453 G E K L+D + SSVK E + + G A D+ G Sbjct: 410 FQGLSEDKERKDEKGLSDKV-SSVKVEPKSVSGRAADS-------------------GLK 449 Query: 452 EFDGSKDQTH----GKSQS--RSFIVRTEDQENSEEK-FRSFPDSKHEVLIGFRDQXXXX 294 + DG + Q GK + + +DQ S+ + ++SF + +G DQ Sbjct: 450 DQDGVQAQIANNLLGKEEDLVSKGRMNLKDQSGSQNRYYQSFTSKSEQAELG--DQVVSQ 507 Query: 293 XXXXXXXXXXXXXGRVASE-----TQVTDVKDQGALQTQIRTFGTKGGGQVEISNCKEQY 129 V S+ T + VK+Q A + Q F G K++ Sbjct: 508 EKVKGSLTRERGVSDVQSQVVPDRTAIVGVKNQPAYRFQDGVFVDAVGDATPEGELKKRV 567 Query: 128 
EM--RDQLVTH---STSQKMVADSGQFEGLAGSRIREAFAAHYKGTE 3 E+ +DQ V+ T SGQF+G G + +E A YKG+E Sbjct: 568 ELQGKDQSVSQLQFRTKGHSRTLSGQFQGGIGLKTKE---AQYKGSE 611