BLASTX nr result
ID: Mentha26_contig00012157
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00012157
         (1693 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU23579.1| hypothetical protein MIMGU_mgv1a000155mg [Mimulus...  749   0.0
gb|EPS70544.1| hypothetical protein M569_04216 [Genlisea aurea]      668   0.0
ref|XP_004236487.1| PREDICTED: uncharacterized protein LOC101264...  650   0.0
ref|XP_006477617.1| PREDICTED: uncharacterized protein LOC102610...  648   0.0
ref|XP_006440689.1| hypothetical protein CICLE_v10018469mg [Citr...  646   0.0
ref|XP_006345163.1| PREDICTED: uncharacterized protein LOC102602...  646   0.0
ref|XP_002317968.2| hypothetical protein POPTR_0012s06850g [Popu...  641   0.0
ref|XP_007037486.1| Uncharacterized protein isoform 8 [Theobroma...  640   0.0
ref|XP_007037485.1| Uncharacterized protein isoform 7 [Theobroma...  640   0.0
ref|XP_007037484.1| Uncharacterized protein isoform 6, partial [...  640   0.0
ref|XP_007037483.1| Uncharacterized protein isoform 5 [Theobroma...  640   0.0
ref|XP_007037482.1| Uncharacterized protein isoform 4 [Theobroma...  640   0.0
ref|XP_007037481.1| Uncharacterized protein isoform 3 [Theobroma...  640   0.0
ref|XP_007037480.1| Uncharacterized protein isoform 2 [Theobroma...  640   0.0
ref|XP_007037479.1| Uncharacterized protein isoform 1 [Theobroma...  640   0.0
ref|XP_002514697.1| hypothetical protein RCOM_1470550 [Ricinus c...  639   e-180
emb|CBI15156.3| unnamed protein product [Vitis vinifera]             636   e-180
ref|XP_006579526.1| PREDICTED: GRIP and coiled-coil domain-conta...
628 e-177 gb|EXC11028.1| hypothetical protein L484_015248 [Morus notabilis] 626 e-176 ref|XP_003549556.2| PREDICTED: uncharacterized protein LOC100792... 622 e-175 >gb|EYU23579.1| hypothetical protein MIMGU_mgv1a000155mg [Mimulus guttatus] Length = 1553 Score = 749 bits (1934), Expect = 0.0 Identities = 405/567 (71%), Positives = 442/567 (77%), Gaps = 4/567 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +++K +E +KN KS ++ GIS KSMDAWKEKR+WE++LA P Sbjct: 491 KRNKILAEASTDKNAKSVESSRRNIPFPEREREKKIGISSKSMDAWKEKRDWEDILATPH 550 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVSSRFSYSPG++RKSAERVRVLHDKLMSP HARATRIRTQLEH Sbjct: 551 RVSSRFSYSPGMNRKSAERVRVLHDKLMSPEKKKKSAVDVKKEAEEKHARATRIRTQLEH 610 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKLNRV+EWQ+VRSNKLRESMFARHQR ESRHEA++AQVVRRAGDE+SKV Sbjct: 611 ERVQKLQRTSEKLNRVNEWQSVRSNKLRESMFARHQRGESRHEAHLAQVVRRAGDESSKV 670 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKKHILRKK DSELRRAEKLQVIK KQKED+AREEAVLERKR+IEAE Sbjct: 671 NEVRFITSLNEENKKHILRKKHQDSELRRAEKLQVIKIKQKEDIAREEAVLERKRLIEAE 730 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQRRKEEAQV MEQ+RRKEI Sbjct: 731 KLQRLAETQRRKEEAQVRREEERKASSAAREAKAMEQIRRKEIRAKARQEEAELLAQKLA 790 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGK--SSPYSNGDDN-L 1073 SESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQG+ S+P SNGDDN L Sbjct: 791 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGRLNSNPCSNGDDNNL 850 Query: 1074 XXXXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGT 1253 E LQ SL QRLMSLKHEFPEPS LESSSLGYRTAVGT Sbjct: 851 GNDSSCTSGSGILTSEALQQSLKRRIKKIRQRLMSLKHEFPEPSGGLESSSLGYRTAVGT 910 Query: 1254 ARRKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASAL 1433 AR K+GRWLQDLQKLRQARKDGAANFGLITAE+IK+LEGRDAELQASR+AGLLDFIASAL Sbjct: 911 ARGKIGRWLQDLQKLRQARKDGAANFGLITAEMIKFLEGRDAELQASRQAGLLDFIASAL 970 Query: 1434 
PASHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAAS 1613 PASHTSKP+ACQVTIYLLRLL++VL TPSNKCYFLVQNLLPP+IP+LA ALENYIKMAAS Sbjct: 971 PASHTSKPDACQVTIYLLRLLRVVLGTPSNKCYFLVQNLLPPIIPLLAAALENYIKMAAS 1030 Query: 1614 -SNLPGATNIVSSKIATGNLEYISEIV 1691 +N+PG TNI S K +TGN+E +SEIV Sbjct: 1031 ANNIPGPTNIASIKTSTGNMESVSEIV 1057 >gb|EPS70544.1| hypothetical protein M569_04216 [Genlisea aurea] Length = 1346 Score = 668 bits (1723), Expect = 0.0 Identities = 360/564 (63%), Positives = 414/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +K+ +E+ +KN+KS D+ IS +S+DAWKEKRNWE++L P Sbjct: 554 KKNMMLAESIVQKNIKSDDSAKKNIQFAERERERKIAISTRSLDAWKEKRNWEDILNSPH 613 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 R+S+ FSYSPG+SR+S ERVR LHDKLM+P HARATRIRTQLE+ Sbjct: 614 RMSASFSYSPGMSRRSVERVRFLHDKLMTPEKKKKSALDLKREADEKHARATRIRTQLEN 673 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ER Q+LQRTSEKLNRVSEWQTVRSNKLRESMFARH+R ESRHEAY+A+VVRRAGDE+SKV Sbjct: 674 ERAQKLQRTSEKLNRVSEWQTVRSNKLRESMFARHRRGESRHEAYLAEVVRRAGDESSKV 733 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKKHIL+KKL DSELRRAEKLQ+IK KQKEDMAREEAVLER+R+IE E Sbjct: 734 NEVRFITSLNEENKKHILQKKLQDSELRRAEKLQIIKIKQKEDMAREEAVLERRRLIEVE 793 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQR ETQRRKEEAQV MEQMRRKEI Sbjct: 794 KLQRHAETQRRKEEAQVRREEERKASTAAREAKAMEQMRRKEIRARARQEEAELLAQKLA 853 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSP-YSNGDDNLXX 1079 SESEQRRKFYLEQIRE+ASMDFRDQS P RRF K+GQ+ GKS+P + N +D Sbjct: 854 ERLSESEQRRKFYLEQIREKASMDFRDQSLPFFRRFPVKDGQSPGKSAPLFCNKEDTHVN 913 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 + L+HSL QRLMSLK++FPEP ESSSLGYRTAVGTAR Sbjct: 914 DNYASSGSCTLTSDALKHSLKRRIKKVRQRLMSLKYDFPEPPFNAESSSLGYRTAVGTAR 973 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+ R 
LQDLQKLRQARK+GAANFGLITAE+IK+LEG+DAELQASR++GL+DFIAS LPA Sbjct: 974 AKISRSLQDLQKLRQARKEGAANFGLITAEMIKFLEGKDAELQASRQSGLIDFIASTLPA 1033 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SH+SKPEAC VT+YLLRLL+IVLAT +NKCYFLVQNLLPP+IPMLA ALENYI+MAA N Sbjct: 1034 SHSSKPEACLVTVYLLRLLRIVLATQTNKCYFLVQNLLPPIIPMLAAALENYIRMAALLN 1093 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 +N +SK N+E ISEI+ Sbjct: 1094 TTAPSNSSTSKTMAENVETISEIL 1117 >ref|XP_004236487.1| PREDICTED: uncharacterized protein LOC101264110 [Solanum lycopersicum] Length = 1631 Score = 650 bits (1677), Expect = 0.0 Identities = 355/553 (64%), Positives = 408/553 (73%), Gaps = 1/553 (0%) Frame = +3 Query: 36 EKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPPRVSSRFSYSPG 215 EKNLKS D L NG S +SMDAWKEKRNWE+VL+ P R+SSRFSYSPG Sbjct: 553 EKNLKSIDHLKRHYERDKEKR---NGSSWRSMDAWKEKRNWEDVLSTPQRISSRFSYSPG 609 Query: 216 LSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEHERVQRLQRTSE 395 LSR+SAER R LHDKLMSP HARA RIRT+LE+ERVQ+LQRTSE Sbjct: 610 LSRRSAERARTLHDKLMSPEKKKKSAIDLKKEAEEKHARAMRIRTELENERVQKLQRTSE 669 Query: 396 KLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKVNEVRFITSLNE 575 KLNRVSEWQTVRS KLRE M+ARHQRSESRHEA++A+VVRRAGDE+ KVNEVRFITSLNE Sbjct: 670 KLNRVSEWQTVRSLKLREVMYARHQRSESRHEAHLAEVVRRAGDESIKVNEVRFITSLNE 729 Query: 576 ENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAEKLQRLVETQRR 755 ENKK ILR+KLHDSELRRAEKLQV+KTKQKEDMAREEAVLERK++IEAEKLQRL ETQR+ Sbjct: 730 ENKKLILRQKLHDSELRRAEKLQVLKTKQKEDMAREEAVLERKKLIEAEKLQRLAETQRK 789 Query: 756 KEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXXXXXSESEQRRK 935 KEEAQV MEQMRRKE+ ESEQRRK Sbjct: 790 KEEAQVRREEERKASSAAREAKTMEQMRRKEVRAKAQQEEAELLAQKLAERLRESEQRRK 849 Query: 936 FYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDNLXXXXXXXXXXXXXX 1115 YLEQIRERASMDFRDQSSPL RR KE QG+S+ +N +DN Sbjct: 850 IYLEQIRERASMDFRDQSSPLFRRSVAKE--VQGRSTSINNCEDNNENNGSTPEGSMLAP 907 Query: 1116 XE-TLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTARRKVGRWLQDLQ 1292 T QHSL QRLM+LK++ PE S + E++ YRTAV TAR K+ +WLQ+LQ 
Sbjct: 908 GHITTQHSLKRRIKKIRQRLMALKYDCPELSISTENAGFVYRTAVSTARAKIAKWLQELQ 967 Query: 1293 KLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPASHTSKPEACQV 1472 +LRQARK+GAA+FG+ITAEIIK+LEGRDAELQASR+AGL+DFIASALPASHTSKPE+CQV Sbjct: 968 RLRQARKEGAASFGIITAEIIKFLEGRDAELQASRQAGLVDFIASALPASHTSKPESCQV 1027 Query: 1473 TIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSNLPGATNIVSSK 1652 T+YLLRLLK+VL+ +NK YFL QNLLPP+IPMLA ALE YIK+AASSN + N+V+SK Sbjct: 1028 TVYLLRLLKVVLSAAANKSYFLAQNLLPPIIPMLAAALETYIKIAASSNGSASANLVTSK 1087 Query: 1653 IATGNLEYISEIV 1691 +T LE +SE++ Sbjct: 1088 ASTERLELMSEVL 1100 >ref|XP_006477617.1| PREDICTED: uncharacterized protein LOC102610780 [Citrus sinensis] Length = 1688 Score = 648 bits (1672), Expect = 0.0 Identities = 350/564 (62%), Positives = 413/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +K K +E +KN KS D L N S KSMDAWKEKRNWE++L+ P Sbjct: 614 KKEKILAEIVTDKNFKSTDPLKRQIALTEKDKEKRNAASWKSMDAWKEKRNWEDILSSPF 673 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVSSR S+SPG+SRKSAER R+LHDKLM+P HARA RIR++LE+ Sbjct: 674 RVSSRISHSPGMSRKSAERARILHDKLMTPEKKKKTALDLKKEAAEKHARAMRIRSELEN 733 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKLNRV+EWQ VR+ KLRE M+ARHQRSE RHEA++AQVVRRAGDE+SKV Sbjct: 734 ERVQKLQRTSEKLNRVNEWQAVRTMKLREDMYARHQRSELRHEAFLAQVVRRAGDESSKV 793 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK ILR+KLHDSELRRAEKLQV++TKQKED+AREEAVLER+++IEAE Sbjct: 794 NEVRFITSLNEENKKLILRQKLHDSELRRAEKLQVLRTKQKEDIAREEAVLERRKLIEAE 853 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQ++KEEAQV +EQ+RRKE Sbjct: 854 KLQRLAETQKKKEEAQVRREEERKASSAAREARAIEQLRRKEERAKAQQEEAELLAQKLA 913 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KEG QG+S+P +N DD Sbjct: 914 EKLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSINKEG--QGRSTPINNNDDCQSSV 
971 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 +LQHSL QRLM+LK+EFPEP E++ +GYRTAV TAR Sbjct: 972 VTGAGVSNLATGNVSLQHSLKRRIKRIRQRLMALKYEFPEPPVGSENAGIGYRTAVATAR 1031 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK GAA+ GLITAE+IK+LEG+D ELQASR+AGLLDFIASALPA Sbjct: 1032 AKIGRWLQELQKLRQARK-GAASIGLITAEMIKFLEGKDPELQASRQAGLLDFIASALPA 1090 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQV I+LL+LL++VL+ PSN+ YFL QNLLPP+IPML+ ALENYIK+ AS N Sbjct: 1091 SHTSKPEACQVMIHLLKLLRVVLSVPSNRSYFLAQNLLPPIIPMLSAALENYIKITASLN 1150 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 P +T+ SSK++ N E I+E++ Sbjct: 1151 APCSTSSSSSKVSVENFESITEVL 1174 >ref|XP_006440689.1| hypothetical protein CICLE_v10018469mg [Citrus clementina] gi|557542951|gb|ESR53929.1| hypothetical protein CICLE_v10018469mg [Citrus clementina] Length = 1688 Score = 646 bits (1667), Expect = 0.0 Identities = 349/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +K K +E +KN K D L N S KSMDAWKEKRNWE++L+ P Sbjct: 614 KKEKILAEIVTDKNFKPTDPLKRQIALTERDKEKRNAASWKSMDAWKEKRNWEDILSSPF 673 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVSSR S+SPG+SRKSAER R+LHDKLM+P HARA RIR++LE+ Sbjct: 674 RVSSRISHSPGMSRKSAERARILHDKLMTPEKKKKTALDLKKEAAEKHARAMRIRSELEN 733 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKLNRV+EWQ VR+ KLRE M+ARHQRSE RHEA++AQVVRRAGDE+SKV Sbjct: 734 ERVQKLQRTSEKLNRVNEWQAVRTMKLREDMYARHQRSELRHEAFLAQVVRRAGDESSKV 793 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK ILR+KLHDSELRRAEKLQV++TKQKED+AREEAVLER+++IEAE Sbjct: 794 NEVRFITSLNEENKKLILRQKLHDSELRRAEKLQVLRTKQKEDIAREEAVLERRKLIEAE 853 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQ++KEEAQV +EQ+RRKE Sbjct: 854 
KLQRLAETQKKKEEAQVRREEERKASSAAREARAIEQLRRKEERAKAQQEEAELLAQKLA 913 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KEG QG+S+P +N DD Sbjct: 914 EKLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSINKEG--QGRSTPINNNDDCQSSV 971 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 +LQHSL QRLM+LK+EFPEP E++ +GYRTAV TAR Sbjct: 972 VTGAGVSNLATGNVSLQHSLKRRIKRIRQRLMALKYEFPEPPVGSENAGIGYRTAVATAR 1031 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK GAA+ GLITAE+IK+LEG+D ELQASR+AGLLDFIASALPA Sbjct: 1032 AKIGRWLQELQKLRQARK-GAASIGLITAEMIKFLEGKDPELQASRQAGLLDFIASALPA 1090 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQV I+LL+LL++VL+ PSN+ YFL QNLLPP+IPML+ ALENYIK+ AS N Sbjct: 1091 SHTSKPEACQVMIHLLKLLRVVLSVPSNRSYFLAQNLLPPIIPMLSAALENYIKITASLN 1150 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 P +T+ SSK++ N E I+E++ Sbjct: 1151 APCSTSSSSSKVSVENFESITEVL 1174 >ref|XP_006345163.1| PREDICTED: uncharacterized protein LOC102602693 [Solanum tuberosum] Length = 1631 Score = 646 bits (1666), Expect = 0.0 Identities = 354/553 (64%), Positives = 406/553 (73%), Gaps = 1/553 (0%) Frame = +3 Query: 36 EKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPPRVSSRFSYSPG 215 EKNLK D L NG S +SMDAWKEKRNWE+VL+ P RVSSRFSYSPG Sbjct: 553 EKNLKPIDHLKRHYERDKEKR---NGSSWRSMDAWKEKRNWEDVLSTPHRVSSRFSYSPG 609 Query: 216 LSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEHERVQRLQRTSE 395 LSR+SAER R LHDKLMSP HARA RIRT+LE+ERVQ+LQRTSE Sbjct: 610 LSRRSAERARTLHDKLMSPEKKKKSAIDLKKEAEEKHARAMRIRTELENERVQKLQRTSE 669 Query: 396 KLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKVNEVRFITSLNE 575 KLNRVSEWQTVRS KLRE M+ARHQRSESRHEA++A+VVRRAGDE+ KVNEVRFITSLNE Sbjct: 670 KLNRVSEWQTVRSMKLREVMYARHQRSESRHEAHLAEVVRRAGDESIKVNEVRFITSLNE 729 Query: 576 ENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAEKLQRLVETQRR 755 ENKK ILR+KLHDSELRRAEKLQV+KTKQKEDMAREEAVLERK++IEAEKLQRL ETQR+ Sbjct: 730 
ENKKLILRQKLHDSELRRAEKLQVLKTKQKEDMAREEAVLERKKLIEAEKLQRLAETQRK 789 Query: 756 KEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXXXXXSESEQRRK 935 KEEAQV MEQMRRKE+ ESEQRRK Sbjct: 790 KEEAQVRREEERKASSAAREAKTMEQMRRKEVRAKAQQEEAELLAQKLAERLRESEQRRK 849 Query: 936 FYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXXXXXXXXXXXXX 1112 YLEQIRERASMDFRDQSSPL RR KE QG+S+P SN +D N Sbjct: 850 IYLEQIRERASMDFRDQSSPLFRRSVAKE--VQGRSTPISNCEDYNENNGFAPEGSMLAP 907 Query: 1113 XXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTARRKVGRWLQDLQ 1292 T Q SL QRLM+LK++ PEPS + E++ YRTAV AR K+ +WLQ+LQ Sbjct: 908 GHITTQQSLKRRIKKIRQRLMALKYDCPEPSTSTENAGFVYRTAVAIARVKIAKWLQELQ 967 Query: 1293 KLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPASHTSKPEACQV 1472 +LRQARK+GAA+FGLITAEIIK+LEGRDAELQASR+AGL+DFIASALPASHTSKPE+CQV Sbjct: 968 RLRQARKEGAASFGLITAEIIKFLEGRDAELQASRQAGLVDFIASALPASHTSKPESCQV 1027 Query: 1473 TIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSNLPGATNIVSSK 1652 T++LLRLLK+VL+ +NK YFL QNLLPP+IPMLA ALE YIK+AASSN + N+V+ K Sbjct: 1028 TVFLLRLLKVVLSAAANKSYFLAQNLLPPIIPMLAAALETYIKIAASSNGSASANLVTCK 1087 Query: 1653 IATGNLEYISEIV 1691 +T LE ++E++ Sbjct: 1088 ASTERLELMAEVL 1100 >ref|XP_002317968.2| hypothetical protein POPTR_0012s06850g [Populus trichocarpa] gi|550326532|gb|EEE96188.2| hypothetical protein POPTR_0012s06850g [Populus trichocarpa] Length = 1427 Score = 641 bits (1654), Expect = 0.0 Identities = 348/565 (61%), Positives = 416/565 (73%), Gaps = 2/565 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSAD-TLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIP 179 +K KT SE EKNLKSA+ T N S KSMDAWKE+RNWE++L+ P Sbjct: 332 KKDKTFSETAIEKNLKSAENTTKKQIPLSEKDKERRNSSSRKSMDAWKERRNWEDILSSP 391 Query: 180 PRVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLE 359 VSSR S SPG+SRKSAER R+LH KLMSP HARA RIR++LE Sbjct: 392 FCVSSRLSNSPGISRKSAERARILHAKLMSPDKKKKTAFDLKREAEEKHARAMRIRSELE 451 Query: 360 HERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSK 539 +ERVQ+LQRTSEKLNRV+EWQ VR+ KLRE M+ARHQRSESRHEA++AQVVRRAGDE+SK Sbjct: 452 
NERVQKLQRTSEKLNRVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVRRAGDESSK 511 Query: 540 VNEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEA 719 VNEVRFITSLNEENKK +LR+KLHDSELRRAEKLQVIKTKQKEDMAREEAVLER+++IEA Sbjct: 512 VNEVRFITSLNEENKKLMLRQKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERRKLIEA 571 Query: 720 EKLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXX 899 EKLQRL ETQR+KEEAQV + Q+RR+E Sbjct: 572 EKLQRLAETQRKKEEAQVRREEERKASNAAREARAIIQLRRREERAKAQQEEAELLAQKL 631 Query: 900 XXXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLX 1076 SESEQRRKFYLEQIRERASMDFRDQSSPL+RR KEGQ G+++P ++ +D + Sbjct: 632 AERLSESEQRRKFYLEQIRERASMDFRDQSSPLMRRSMYKEGQ--GRTTPTNSSEDYQVN 689 Query: 1077 XXXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTA 1256 LQHS+ QRLM+L++EF EP A+ E++S+GYR AVGTA Sbjct: 690 NVTGAGSSTLAAGKALLQHSMKRRIKKIRQRLMALRYEFTEPLASSENTSIGYRMAVGTA 749 Query: 1257 RRKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALP 1436 R K GRWLQ+LQ+LRQARK GAA+ GLITAE+IK++EG+D ELQASR+AGLLDFIA+ALP Sbjct: 750 RAKFGRWLQELQRLRQARKKGAASIGLITAEMIKFVEGKDPELQASRQAGLLDFIAAALP 809 Query: 1437 ASHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASS 1616 ASHTS PE CQVTI+LL+LL++VL+ P+N+ YFL QNLLPP+IPML+ ALENYIK+AAS Sbjct: 810 ASHTSNPETCQVTIHLLKLLRVVLSAPANRSYFLSQNLLPPIIPMLSAALENYIKIAASL 869 Query: 1617 NLPGATNIVSSKIATGNLEYISEIV 1691 N+PG+TN+ SSK + N E ISE++ Sbjct: 870 NVPGSTNLQSSKTSVENFESISEVL 894 >ref|XP_007037486.1| Uncharacterized protein isoform 8 [Theobroma cacao] gi|508774731|gb|EOY21987.1| Uncharacterized protein isoform 8 [Theobroma cacao] Length = 1481 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ 
Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037485.1| Uncharacterized protein isoform 7 [Theobroma cacao] gi|508774730|gb|EOY21986.1| Uncharacterized protein isoform 7 [Theobroma cacao] Length = 1529 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D 
+ N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037484.1| Uncharacterized protein isoform 6, partial [Theobroma cacao] gi|508774729|gb|EOY21985.1| Uncharacterized protein isoform 6, partial 
[Theobroma cacao] Length = 1525 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 
LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037483.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508774728|gb|EOY21984.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 1571 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 
1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037482.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508774727|gb|EOY21983.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 1540 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 
Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037481.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508774726|gb|EOY21982.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1707 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 
ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037480.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508774725|gb|EOY21981.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1550 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ 
+EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_007037479.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508774724|gb|EOY21980.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1684 Score = 640 bits (1652), Expect = 0.0 Identities = 348/564 (61%), Positives = 412/564 (73%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 RK KT +EN EKN KS D + N S KSMDAWKEKRNWE++L+ P Sbjct: 616 RKDKTLTENIVEKNSKSVDHIKRQIPSEKDKDRR-NTTSWKSMDAWKEKRNWEDILSSPF 674 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVS R S+SP + +KSAERVR+LH+KLMSP HARA RIR++LE+ Sbjct: 675 RVSYRVSHSPNVGKKSAERVRILHEKLMSPEKKRKTALDLKKEAEEKHARALRIRSELEN 734 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKL RV+EWQ VR+ KLRE M AR QRSESRHEA++A+VVRRAGDE+SKV Sbjct: 735 ERVQKLQRTSEKLIRVNEWQAVRTMKLREGMHARQQRSESRHEAFLAEVVRRAGDESSKV 794 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 
NEVRFITSLNEENKK +LR+KL DSELRRAEKLQV+KTKQKEDMAREEAVLER+++IEAE Sbjct: 795 NEVRFITSLNEENKKLMLRQKLQDSELRRAEKLQVMKTKQKEDMAREEAVLERRKLIEAE 854 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL ETQR+KEEAQ+ +EQ+RR+E Sbjct: 855 KLQRLAETQRKKEEAQIRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 914 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDD-NLXX 1079 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR KE +QG+S+P +N DD Sbjct: 915 ERLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSVNKE--SQGRSTPTNNSDDCQANG 972 Query: 1080 XXXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 LQHSL QRLM+LK EF EP AA E++ +GYRT VGTAR Sbjct: 973 SVILGNSALATGNGALQHSLKRRIKRIRQRLMALKFEFSEPPAAPENTGIGYRTTVGTAR 1032 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQKLRQARK+GA++ GLITAE++K+LEG++ ELQASR+AGLLDFIASALPA Sbjct: 1033 AKIGRWLQELQKLRQARKEGASSIGLITAEMVKFLEGKEPELQASRQAGLLDFIASALPA 1092 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVTI+LL+LL++VL+TP N+ YFL QNLLPP+IPML+ ALENYIK+AAS N Sbjct: 1093 SHTSKPEACQVTIHLLKLLRVVLSTPVNRSYFLAQNLLPPMIPMLSAALENYIKIAASLN 1152 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 LPG+TN +S K N E +SE++ Sbjct: 1153 LPGSTNSLSCKTLLENFESVSEVL 1176 >ref|XP_002514697.1| hypothetical protein RCOM_1470550 [Ricinus communis] gi|223546301|gb|EEF47803.1| hypothetical protein RCOM_1470550 [Ricinus communis] Length = 1809 Score = 639 bits (1647), Expect = e-180 Identities = 343/564 (60%), Positives = 411/564 (72%), Gaps = 1/564 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 ++ K E EKNLKS D S K MDAWKEKRNWE++L+ P Sbjct: 718 KRDKALVEGTVEKNLKSIDPPRKQIPLSEKDKEKRKETSWKYMDAWKEKRNWEDILSSPF 777 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVSSR S+SPG+SRKSAER R+LHDKLMSP HARA RIR++LE+ Sbjct: 778 RVSSRVSHSPGMSRKSAERARILHDKLMSPEKKKKTALDLKKEAEEKHARAMRIRSELEN 837 Query: 363 
ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTSEKLN+V+EWQ VR+ KLRE M+ARHQRSESRHEA++AQVVRRAGDE+SKV Sbjct: 838 ERVQKLQRTSEKLNKVNEWQAVRTMKLREGMYARHQRSESRHEAFLAQVVRRAGDESSKV 897 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK ILR+KL DSELRRAEKLQVIKTKQKEDMAREEAVLER+++IEAE Sbjct: 898 NEVRFITSLNEENKKLILRQKLQDSELRRAEKLQVIKTKQKEDMAREEAVLERRKLIEAE 957 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KL RL ETQR+KEEAQV +EQ+RR+E Sbjct: 958 KLHRLAETQRKKEEAQVRREEERKASSAAREARAIEQLRRREERAKAQQEEAELLAQKLA 1017 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDNLXXX 1082 SES+QRRKFYLEQIRERASMDFRDQSSPL+RR KEGQ G+S+P ++G+ Sbjct: 1018 ERLSESKQRRKFYLEQIRERASMDFRDQSSPLMRRSMNKEGQ--GRSTPTNSGEVYQENS 1075 Query: 1083 XXXXXXXXXXXXE-TLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTAR 1259 TLQHSL QRLM+LK+EFPE + E++ +GYRTAV TAR Sbjct: 1076 VAGIGGSTLATGNATLQHSLKRRIKKIRQRLMALKYEFPEAPVSAENAGIGYRTAVATAR 1135 Query: 1260 RKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPA 1439 K+GRWLQ+LQ+LRQARK+GA + GLIT ++IK+LEG+D ELQASR+AGLLDFIASALPA Sbjct: 1136 AKLGRWLQELQRLRQARKEGATSIGLITTDMIKFLEGKDPELQASRQAGLLDFIASALPA 1195 Query: 1440 SHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSN 1619 SHTSKPEACQVT++LL+LL++VL+ P+N+ YFL QNLLPP+IPM++TALENYIK+AAS N Sbjct: 1196 SHTSKPEACQVTVHLLKLLRVVLSVPANRSYFLAQNLLPPIIPMVSTALENYIKIAASLN 1255 Query: 1620 LPGATNIVSSKIATGNLEYISEIV 1691 + G +N+ SSK + N E ISE++ Sbjct: 1256 VSGISNLPSSKTSVENFESISEVL 1279 >emb|CBI15156.3| unnamed protein product [Vitis vinifera] Length = 1617 Score = 636 bits (1641), Expect = e-180 Identities = 344/568 (60%), Positives = 412/568 (72%), Gaps = 5/568 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXX----NGISGKSMDAWKEKRNWEEVL 170 +K +E+ EKN K D L N S KSMDAWKEKRNWE++L Sbjct: 608 KKDTMLTESNIEKNPKPMDHLKRQIPIAEKDKDKEKEKRNAPSWKSMDAWKEKRNWEDIL 667 Query: 171 AIPPRVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRT 350 A P 
RVSSR S+SPG+SR+S ER R+LHDKLM+P HARA RIR+ Sbjct: 668 ASPFRVSSRVSHSPGMSRRSVERARILHDKLMTPEKRKKTALDLKKEAEEKHARAMRIRS 727 Query: 351 QLEHERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDE 530 +LE+ERVQ+LQRTSEKLNRV+EWQ VRS KLRE M+ARHQRSESRHEA++AQVVRRAGDE Sbjct: 728 ELENERVQKLQRTSEKLNRVNEWQAVRSMKLREGMYARHQRSESRHEAFLAQVVRRAGDE 787 Query: 531 TSKVNEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRI 710 +SKVNEVRFITSLNEENKK +LR+KLHDSE+RRAEKLQVIKTKQKEDMAREEAVLER+++ Sbjct: 788 SSKVNEVRFITSLNEENKKLMLRQKLHDSEVRRAEKLQVIKTKQKEDMAREEAVLERRKL 847 Query: 711 IEAEKLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXX 890 IEAEKLQRL ETQR+KEEA +EQ+RR+E+ Sbjct: 848 IEAEKLQRLAETQRKKEEALFRREEERKASSAAREAKAIEQLRRREVRAKAQQEEAELLA 907 Query: 891 XXXXXXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDN 1070 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR K+ +QG+S+P +N +D Sbjct: 908 QKLAEKLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSLNKD--SQGRSTPTNNNEDY 965 Query: 1071 LXXXXXXXXXXXXXXXET-LQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAV 1247 LQ S+ Q+LM+LK+EF EP E++ +GYRTA+ Sbjct: 966 QATSISGLGSATIPTGNVGLQQSMRRRIKRIRQKLMALKYEFLEPPVGNENAGIGYRTAM 1025 Query: 1248 GTARRKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIAS 1427 GTAR K+GRWLQ+LQKLRQARK+GAA+ GLITAE+IK+LEG+D EL ASR+AGL+DFIAS Sbjct: 1026 GTARAKIGRWLQELQKLRQARKEGAASIGLITAEMIKFLEGKDPELNASRQAGLVDFIAS 1085 Query: 1428 ALPASHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMA 1607 ALPASHTSKPEACQVTIYLLRLL++VL+ P+ + YFL QNLLPP+IPML+ ALENYIK+A Sbjct: 1086 ALPASHTSKPEACQVTIYLLRLLRVVLSVPATRSYFLAQNLLPPIIPMLSAALENYIKIA 1145 Query: 1608 ASSNLPGATNIVSSKIATGNLEYISEIV 1691 AS N+PG+T++ SSK + N E ISE++ Sbjct: 1146 ASLNIPGSTSLSSSKASVENFESISEVL 1173 >ref|XP_006579526.1| PREDICTED: GRIP and coiled-coil domain-containing protein 2-like [Glycine max] Length = 1427 Score = 628 bits (1619), Expect = e-177 Identities = 339/563 (60%), Positives = 406/563 (72%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +K K P+E EKN + D L + 
GKS++AWKEKRNWE++L+ P Sbjct: 607 KKDKAPTEVVNEKNARCTDNLRRQMPVPEKDKEKRSSAPGKSLNAWKEKRNWEDILSSPF 666 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 RVSSR YSP LSRKSAERVR LHDKLMSP HARA RIR++LE+ Sbjct: 667 RVSSRVPYSPSLSRKSAERVRTLHDKLMSPDKKKKTTSDLKREAEEKHARAMRIRSELEN 726 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTS+KLNRV+EW VR KLRE M+ARHQRSESRHEA++AQVV+RAGDE+SKV Sbjct: 727 ERVQKLQRTSQKLNRVNEWHAVRHMKLREGMYARHQRSESRHEAFLAQVVKRAGDESSKV 786 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KLH+SELRRAEKLQV+K+KQKED+AREEAVLER+++IEAE Sbjct: 787 NEVRFITSLNEENKKLMLRQKLHESELRRAEKLQVLKSKQKEDLAREEAVLERRKLIEAE 846 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL E QRRKEEAQV +EQ+RRKE Sbjct: 847 KLQRLAEIQRRKEEAQVRREEERKASSAAREARAIEQLRRKEERAKAQQEEAELLAQKLA 906 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDNLXXX 1082 +ESEQRRK YLEQIRERA++ RDQSSPLLRR KEGQ G+S+P ++ DD+ Sbjct: 907 ERLNESEQRRKIYLEQIRERANL--RDQSSPLLRRSINKEGQ--GRSTPTNSSDDSQTNI 962 Query: 1083 XXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTARR 1262 TLQHS+ QRLM+LK+EF EP ES+SLGYR AVG AR Sbjct: 963 VSGIGSSLRIGNVTLQHSIKRRIKRIRQRLMALKYEFLEPLLGGESASLGYRVAVGAARA 1022 Query: 1263 KVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPAS 1442 KVGRWLQ+LQ+LRQARK+GA + GLI +E+IKYLEG+D ELQASR+AGLLDFIASALPAS Sbjct: 1023 KVGRWLQELQRLRQARKEGATSIGLIISEMIKYLEGKDPELQASRQAGLLDFIASALPAS 1082 Query: 1443 HTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSNL 1622 HTSKPEACQV ++LL+LL++VL+TP+N+ YFL QNLLPP+IPML+ ALENYIK+AAS ++ Sbjct: 1083 HTSKPEACQVMLHLLKLLRVVLSTPANRSYFLAQNLLPPIIPMLSAALENYIKIAASLSI 1142 Query: 1623 PGATNIVSSKIATGNLEYISEIV 1691 PG ++ SSK N E ISEI+ Sbjct: 1143 PGNVSLPSSKALVENFESISEIL 1165 >gb|EXC11028.1| hypothetical protein L484_015248 [Morus notabilis] Length = 1663 Score = 626 bits (1614), Expect = e-176 Identities = 345/566 (60%), Positives = 408/566 
(72%), Gaps = 3/566 (0%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXX--NGISGKSMDAWKEKRNWEEVLAI 176 +K+KT + +EKN K D N KSMDAWKEKRNWE++LA Sbjct: 600 KKAKTLAGVVSEKNFKVTDHYKRQIPQSEQDKEKEKRNSAPWKSMDAWKEKRNWEDILAS 659 Query: 177 PPRVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQL 356 P RVSSR S+SPG+SRKSAER R+LHDKLMSP HARA RIR +L Sbjct: 660 PFRVSSRVSHSPGMSRKSAERARMLHDKLMSPEKKKKNAMDLKREAAEKHARAMRIRGEL 719 Query: 357 EHERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETS 536 E+ERVQ+LQRTSEKLNRVSEWQ VR+ KLRE M+AR QRSESRHEA++AQVV+RAGDE+S Sbjct: 720 ENERVQKLQRTSEKLNRVSEWQAVRNMKLREGMYARQQRSESRHEAFLAQVVKRAGDESS 779 Query: 537 KVNEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIE 716 KVNEVRFITSLNEENKK +LR+KLHDSELRRAEKLQV+K+KQKEDMAREEAVLER+++IE Sbjct: 780 KVNEVRFITSLNEENKKLMLRQKLHDSELRRAEKLQVMKSKQKEDMAREEAVLERRKLIE 839 Query: 717 AEKLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXX 896 AEKLQRL ETQRRKEEA E + +K Sbjct: 840 AEKLQRLAETQRRKEEA----------------LEEAELLAQK----------------- 866 Query: 897 XXXXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDNLX 1076 SESEQRRKFYLEQIRERASMDFRDQSSPLLRR K+GQ G+S P + G+DN Sbjct: 867 LAEKLSESEQRRKFYLEQIRERASMDFRDQSSPLLRRSINKDGQ--GRSPPTNTGEDNQA 924 Query: 1077 XXXXXXXXXXXXXXET-LQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGT 1253 LQHS QRLM+LK+EFPEP E++ +GYRT +G+ Sbjct: 925 SSLLGLGGSTLVTSNVALQHSTKRRIKRIRQRLMALKYEFPEPPGGAENAGIGYRTTMGS 984 Query: 1254 ARRKVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASAL 1433 AR K+GRWLQ+LQ+LRQARK+GAA+ GLITAE++KYLEG+DAELQASR+AGL+DFIASAL Sbjct: 985 ARVKIGRWLQELQRLRQARKEGAASIGLITAEMVKYLEGKDAELQASRQAGLIDFIASAL 1044 Query: 1434 PASHTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAAS 1613 PASHTSKPEACQVTI+LL+LL++VL+ +N+ YFL QNLLPP+IPML+ ALENYIK+AAS Sbjct: 1045 PASHTSKPEACQVTIHLLKLLRVVLSVSANRSYFLAQNLLPPIIPMLSAALENYIKIAAS 1104 Query: 1614 SNLPGATNIVSSKIATGNLEYISEIV 1691 NLPG TN++SSK + + E ISEI+ Sbjct: 1105 LNLPGNTNLLSSKTSAEHFELISEIL 1130 >ref|XP_003549556.2| PREDICTED: 
uncharacterized protein LOC100792269 [Glycine max] Length = 1699 Score = 622 bits (1604), Expect = e-175 Identities = 335/563 (59%), Positives = 404/563 (71%) Frame = +3 Query: 3 RKSKTPSENPAEKNLKSADTLXXXXXXXXXXXXXXNGISGKSMDAWKEKRNWEEVLAIPP 182 +K K P+E EKN +S D L + GKS++AWKEKRNWE++L+ P Sbjct: 608 KKDKAPTEVVNEKNPRSTDNLRRQMPLPEKDKEKRSSAPGKSLNAWKEKRNWEDILSSPF 667 Query: 183 RVSSRFSYSPGLSRKSAERVRVLHDKLMSPXXXXXXXXXXXXXXXXXHARATRIRTQLEH 362 R+SSR YSP LSRKSAERVR LHDKLMSP HARA RIR++LE+ Sbjct: 668 RISSRLPYSPSLSRKSAERVRTLHDKLMSPDKKKKTTSDLKREAEEKHARAMRIRSELEN 727 Query: 363 ERVQRLQRTSEKLNRVSEWQTVRSNKLRESMFARHQRSESRHEAYIAQVVRRAGDETSKV 542 ERVQ+LQRTS+KLNRV+EW R KLRE M+ARHQRSESRHEA++AQV +RAGDE+SKV Sbjct: 728 ERVQKLQRTSQKLNRVNEWHADRHMKLREGMYARHQRSESRHEAFLAQVAKRAGDESSKV 787 Query: 543 NEVRFITSLNEENKKHILRKKLHDSELRRAEKLQVIKTKQKEDMAREEAVLERKRIIEAE 722 NEVRFITSLNEENKK +LR+KLH+SELRRAEKLQV+K+KQKED+AREEAVLER+++IEAE Sbjct: 788 NEVRFITSLNEENKKLMLRQKLHESELRRAEKLQVLKSKQKEDLAREEAVLERRKLIEAE 847 Query: 723 KLQRLVETQRRKEEAQVXXXXXXXXXXXXXXXXXMEQMRRKEIXXXXXXXXXXXXXXXXX 902 KLQRL E QRRKEEAQV +EQ+RRKE Sbjct: 848 KLQRLAEIQRRKEEAQVRREEERKASSAAREARAIEQLRRKEERAKAQQEEAELLAQKLA 907 Query: 903 XXXSESEQRRKFYLEQIRERASMDFRDQSSPLLRRFAGKEGQAQGKSSPYSNGDDNLXXX 1082 +ESEQRRK YLEQIRERA++ RDQSSPLLRR KEGQ G+S+P ++ DD+ Sbjct: 908 ERLNESEQRRKIYLEQIRERANL--RDQSSPLLRRSINKEGQ--GRSTPTNSSDDSQTNI 963 Query: 1083 XXXXXXXXXXXXETLQHSLXXXXXXXXQRLMSLKHEFPEPSAALESSSLGYRTAVGTARR 1262 TLQHS+ QRLM+LK+EF EP ES+SLGYR AVG AR Sbjct: 964 VSGIGSSLGIGNVTLQHSIKRRIKRIRQRLMALKYEFLEPPLGGESASLGYRVAVGAARA 1023 Query: 1263 KVGRWLQDLQKLRQARKDGAANFGLITAEIIKYLEGRDAELQASREAGLLDFIASALPAS 1442 KVGRWLQ+LQ+LRQARK+GA + GLI +E+IKYLEG+D ELQASR+AGLLDFIAS LPAS Sbjct: 1024 KVGRWLQELQRLRQARKEGATSIGLIISEMIKYLEGKDPELQASRQAGLLDFIASTLPAS 1083 Query: 1443 HTSKPEACQVTIYLLRLLKIVLATPSNKCYFLVQNLLPPLIPMLATALENYIKMAASSNL 1622 HTSKPEACQV ++LL+LL++VL+TP+N+ YFL QNLLPP+IPML+ ALENYIK+AAS ++ Sbjct: 1084 HTSKPEACQVMLHLLKLLRVVLSTPANRSYFLAQNLLPPIIPMLSAALENYIKIAASLSI 1143 Query: 
1623 PGATNIVSSKIATGNLEYISEIV 1691 PG ++ SK + N E ISEI+ Sbjct: 1144 PGNISLPPSKASVENFESISEIL 1166