BLASTX nr result
ID: Glycyrrhiza30_contig00003518
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza30_contig00003518 (5463 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KHN18480.1 hypothetical protein glysoja_006893 [Glycine soja] 2058 0.0 KHN48614.1 hypothetical protein glysoja_015762 [Glycine soja] 2013 0.0 XP_006581932.1 PREDICTED: uncharacterized protein LOC100788512 [... 2013 0.0 KHN19325.1 hypothetical protein glysoja_012245 [Glycine soja] 2008 0.0 XP_006578618.1 PREDICTED: uncharacterized protein LOC100780436 [... 1990 0.0 KHN16310.1 hypothetical protein glysoja_030285 [Glycine soja] 1988 0.0 XP_007138108.1 hypothetical protein PHAVU_009G180900g [Phaseolus... 1967 0.0 XP_017421635.1 PREDICTED: uncharacterized protein LOC108331459 [... 1947 0.0 BAT79550.1 hypothetical protein VIGAN_02245700 [Vigna angularis ... 1946 0.0 XP_014625222.1 PREDICTED: uncharacterized protein LOC100792096 [... 1916 0.0 XP_014630930.1 PREDICTED: uncharacterized protein LOC100806155 [... 1898 0.0 XP_014501022.1 PREDICTED: uncharacterized protein LOC106761917 [... 1895 0.0 KRH03341.1 hypothetical protein GLYMA_17G092200 [Glycine max] 1729 0.0 XP_015882642.1 PREDICTED: uncharacterized protein LOC107418459 [... 1687 0.0 XP_017973244.1 PREDICTED: uncharacterized protein LOC18603853 [T... 1637 0.0 EOY20634.1 BAH domain,TFIIS helical bundle-like domain isoform 1... 1635 0.0 XP_018820881.1 PREDICTED: uncharacterized protein LOC108991179 i... 1630 0.0 XP_018831488.1 PREDICTED: uncharacterized protein LOC108999147 [... 1610 0.0 XP_018820884.1 PREDICTED: uncharacterized protein LOC108991180 [... 1600 0.0 EOY20638.1 BAH domain,TFIIS helical bundle-like domain isoform 5... 1600 0.0 >KHN18480.1 hypothetical protein glysoja_006893 [Glycine soja] Length = 1555 Score = 2058 bits (5332), Expect = 0.0 Identities = 1112/1546 (71%), Positives = 1215/1546 (78%), Gaps = 14/1546 (0%) Frame = -1 Query: 4884 PIEVRLGKGVPLKTTPNEIFFSFHTDKVSAASLLHPCKVAFLPKGVELPSGISSFVCRRV 4705 PIEVRLGK + L++ PNEIF+SFH ++ A SLLHPCKVAFLPKGVEL SGISSFVCRRV Sbjct: 34 PIEVRLGKDLLLESAPNEIFYSFHKAEIPATSLLHPCKVAFLPKGVELQSGISSFVCRRV 93 Query: 4704 YDIANKCLWWLSDQDYTNDCQEEIDKLLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKS 4525 YD+ N+C WLS+QDY +DCQEE+DKLL+RTCVEMQ PGGR SQLKS Sbjct: 94 YDVLNRCSRWLSNQDYIDDCQEEVDKLLHRTCVEMQ----PGGRSPKPVSSPTSTSQLKS 149 Query: 4524 GSDSIQNSASSFPSQVKGRKRERVDQGPESVKRECSAKNDDGDSGHVRQDSILKTEIAKV 4345 GSDS+QNS SSF S VKGRKRERVD+G ESVKR+ S K DDGDSGH +QDSILKTEIAKV Sbjct: 150 GSDSVQNSVSSFSSHVKGRKRERVDKGQESVKRDRSTKIDDGDSGHFKQDSILKTEIAKV 209 Query: 4344 TEKGGLVDTEGVEKLVQLMVPARNEKKIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPV 4165 TEKGGL+DTEGVEKLVQLMVP NEKKIDLASRSMLAAVIAATDKFDCLSRFVQL+GLP+ Sbjct: 210 TEKGGLIDTEGVEKLVQLMVPDGNEKKIDLASRSMLAAVIAATDKFDCLSRFVQLKGLPI 269 Query: 4164 FDEWLQEAHKGKIGDGTGSRDGDKSVDEFLLGLLRALDKLPVNLQALQACNIGKSVNHLR 3985 FDEWLQEAHKGKIGDG GSRDGDKSVD+FL LLRALDKLPVNLQALQACNIGKSVNHLR Sbjct: 270 FDEWLQEAHKGKIGDGLGSRDGDKSVDDFLFVLLRALDKLPVNLQALQACNIGKSVNHLR 329 Query: 3984 THKNLEIQKKARGLVDTWKKRVEAEMIINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGG 3805 THKNLEIQKKARGLVDTWKKRVEAEMIINDA+SGSVQ+ PWPARQRL EVAQGGN+HS G Sbjct: 330 THKNLEIQKKARGLVDTWKKRVEAEMIINDARSGSVQTVPWPARQRLSEVAQGGNKHSSG 389 Query: 3804 SSDVAVKSSVTQLSASKNASAKIVQGENNTRSVLTSASPGSAKSVLSPATATANSKDGQP 3625 S+DVA+KSSVTQLSASK ASAKIV GE NTRS TSASPGS KSV SPA ATAN KDGQP Sbjct: 390 SADVAMKSSVTQLSASKTASAKIVPGE-NTRS--TSASPGSTKSVPSPAPATANLKDGQP 446 Query: 3624 RVVAISGSSDLPVASARDEKXXXXXXXXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXX 3445 V SGS DLPVA+ARDEK SDH KAGGLSGKEDA Sbjct: 447 CVATASGSFDLPVANARDEK-SSSSSPSHNSQSYSSDHVKAGGLSGKEDARSSTAMSVNK 505 Query: 3444 XXXXXXXXXXSMNGFPGSTPSRGQRETGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLE 3265 SM GFPGSTPSR QRETGS +NSS HKNL SEK SQSGL EKA DGT+LE Sbjct: 506 VSGGSSRHRKSMKGFPGSTPSRRQRETGSRQNSS-HKNLISEKISQSGLREKANDGTTLE 564 Query: 3264 GHIPKLIVKIPNRGRSPAQSASAGSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYR 3085 GH PKLIVKI NR RSPAQSA+AGS DDP+IMNSRASSPVLSEKHDQFDRCSKEK DFYR Sbjct: 565 GHTPKLIVKIQNRSRSPAQSATAGSSDDPAIMNSRASSPVLSEKHDQFDRCSKEKSDFYR 624 Query: 3084 ANIGADINTESWQSNDFKDILTGSDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXS 2905 ANIGADI TESWQSNDFKD+LTGSDEGDGSP A TDE+ CRTG++CKK+LDV S Sbjct: 625 ANIGADIYTESWQSNDFKDVLTGSDEGDGSPEAITDEQ-CRTGEDCKKVLDVSKAASSSS 683 Query: 2904 GNESKTRNLQDASYSSINALIEGVKYSEADDVGMNLLASVAAGEISKSEMGMPAGSPERN 2725 GNES+ RNLQDASYSSINALIEGVKY+EADDVGMNLLA+VAAGEISKSE+GMP GSPE++ Sbjct: 684 GNESRARNLQDASYSSINALIEGVKYTEADDVGMNLLANVAAGEISKSELGMPVGSPEKD 743 Query: 2724 FTTIEQSRPGD-AVVKSSGEDLVQDKCYANDGEQKKQVSISSDLGTDDVNDSGFLASGGK 2548 TTIEQS GD AVVKSS E LVQDKCY N E KQ S DLG DD LAS GK Sbjct: 744 TTTIEQSYAGDAAVVKSSEEYLVQDKCYTN-VEHSKQDSRPGDLGADD----DILASEGK 798 Query: 2547 -SAEHNMDTNSCSMDLQVTETCLESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXX 2371 + EH N+ SMDL VTET LESKGKLIEK +GTSSA PES+ QE RD+DSS Sbjct: 799 VTGEH----NTSSMDLWVTETSLESKGKLIEKSSGTSSAGIPESTIQEVRDSDSSKLVKE 854 Query: 2370 XXXXXXXXXXGNLDVKVGDVAGESEAENPSVE--------VGVKCDNSATEGLNSGRETT 2215 GN+DVKV VA ESE E ++E V VKCDN A+EGL+ +ET Sbjct: 855 KKVVVRVDAVGNVDVKVNVVASESETE--AIEKFSCTCEVVDVKCDNRASEGLSGDKETA 912 Query: 2214 QKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRN 2035 KSPAI M D V T EN Q SG VDKV E +NERE EKNDDM QDH +S KQ+N Sbjct: 913 GKSPAIRMSSDYVIATDENAPQSSGDIVDKVLEHVNERESEKNDDMVAQDHAKESIKQKN 972 Query: 2034 ESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYDQ--DARQMPHTVSPSFPSQEID 1861 ESE+DA++VP+NRGLCSG TGL AEYVEENSGTKEV DQ A Q+ HT PSFPS+E+D Sbjct: 973 ESENDAIMVPKNRGLCSGATGLDAEYVEENSGTKEVCDQVAGAGQIVHTDLPSFPSREMD 1032 Query: 1860 QCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGE 1681 QC+G K SKL ++ E+AEEC KVEFDLNEG + DDGKC E Sbjct: 1033 QCSGHKDSKLTAMESEEAEECTSTTGDTSSASVAGVSEVDTKVEFDLNEGFNADDGKCSE 1092 Query: 1680 LKNTALPGCAXXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGWKGSAA 1501 + + A ITVAAAAK PFV PEDLLK K ELGWKGSAA Sbjct: 1093 MPGST--PAARLVSPVPFSASSMSFGILSITVAAAAKSPFVAPEDLLKSKKELGWKGSAA 1150 Query: 1500 TSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQ 1321 TSAFRPAEPRK+ME+PL ++TTPIPN A+ QSRAPLD DLNV DE L+D+SSQNC+ Q Sbjct: 1151 TSAFRPAEPRKVMEIPLDMSTTPIPNDEARKQSRAPLDFDLNVSDEVILDDVSSQNCARQ 1210 Query: 1320 TVCVSHAVDGHDPHKSTPSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKS 1141 T C +H+ +GHDP+KS S V CSGGLGLDLNLVD ASDVGNCT S+SHK DVPL+QVKS Sbjct: 1211 TDCGTHSDNGHDPNKSMASHVSCSGGLGLDLNLVDGASDVGNCTLSSSHKMDVPLMQVKS 1270 Query: 1140 SADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSSLPSQPNVSGLSMSNA-EVA 964 +A PPN EMSFRRDFDLN+GP VDEVT+EPL+STQ AR+S+PSQP +SGL MSNA EV Sbjct: 1271 AASGPPNGEMSFRRDFDLNDGPVVDEVTSEPLMSTQPARNSVPSQPPISGLRMSNAEEVG 1330 Query: 963 NFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTV 784 NFSSWFPST NTYSAVTISSIM DRGD+SFSI APNGPQRMLGPATG NPFG DIY+G V Sbjct: 1331 NFSSWFPSTANTYSAVTISSIMSDRGDRSFSIVAPNGPQRMLGPATGSNPFGPDIYKGAV 1390 Query: 783 L-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGP 607 L SSPAVPY SAPFQYPVFPFNSSFPLPSASFSGGST YVD+TSGGRLCFPAVNSQL+G Sbjct: 1391 LSSSPAVPYQSAPFQYPVFPFNSSFPLPSASFSGGSTPYVDTTSGGRLCFPAVNSQLIGS 1450 Query: 606 VNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQ 427 V +S YPRPYVVSLPDG+NS+SAEN R+ RQGLDLNAGPG S+LEGRD+SSPLVPRQ Sbjct: 1451 VGNVSVHYPRPYVVSLPDGSNSSSAENCRRRTRQGLDLNAGPG-SNLEGRDDSSPLVPRQ 1509 Query: 426 LSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQYHSL 289 LSVASS A EEQAR+FHL S VLKRKEPDGG DGYKQ+SWQYH+L Sbjct: 1510 LSVASSQAQLEEQARIFHLSSDVLKRKEPDGGWDGYKQTSWQYHNL 1555 >KHN48614.1 hypothetical protein glysoja_015762 [Glycine soja] Length = 1504 Score = 2013 bits (5214), Expect = 0.0 Identities = 1085/1517 (71%), Positives = 1181/1517 (77%), Gaps = 10/1517 (0%) Frame = -1 Query: 4809 DKVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEID 4630 +++ A SLLHPCKV FLPKGVEL SGISSF+CRRVYD+ N+CL WLSDQDY +DCQEE+D Sbjct: 8 EEIPATSLLHPCKVVFLPKGVELQSGISSFICRRVYDVLNRCLCWLSDQDYIDDCQEEVD 67 Query: 4629 KLLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVD 4450 KL++RTCVEMQ PGGR SQLKSGSDS+QNS SSF S VKGRKRERVD Sbjct: 68 KLVHRTCVEMQ----PGGRSPKPVSSPTSTSQLKSGSDSVQNSVSSFSSHVKGRKRERVD 123 Query: 4449 QGPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNE 4270 QG ESVKR+ S KNDDGDSG+ + DSILKTEIAKVTEKGGL+D EGVEKLVQLMVP NE Sbjct: 124 QGQESVKRDRSTKNDDGDSGNFKADSILKTEIAKVTEKGGLIDPEGVEKLVQLMVPDGNE 183 Query: 4269 KKIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKS 4090 KKIDLASRSMLAAVIAAT+KFDCLSRFVQL+GLPVFDEWLQEAHKGK+G+G GSRDGDKS Sbjct: 184 KKIDLASRSMLAAVIAATEKFDCLSRFVQLKGLPVFDEWLQEAHKGKLGEGIGSRDGDKS 243 Query: 4089 VDEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAE 3910 VD+FL LLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAE Sbjct: 244 VDDFLFVLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAE 303 Query: 3909 MIINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQ 3730 MIINDA+S SVQ+ PWPARQRL EVAQGGN+HS GS+DVA+KSSVTQLSASK AS KI Sbjct: 304 MIINDARSCSVQTVPWPARQRLSEVAQGGNKHSSGSADVAMKSSVTQLSASKTASPKIAP 363 Query: 3729 GENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXX 3550 GEN TRS TSASPGS KSV SPA ATAN KDGQP A+SGSSDLPVA+ARDEK Sbjct: 364 GENTTRS--TSASPGSTKSVPSPAPATANLKDGQPHAAAVSGSSDLPVANARDEK-SSSS 420 Query: 3549 XXXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQR 3370 SDHAKAGG SGKEDA SMNGFPGSTPSR QR Sbjct: 421 SPSHNSQSYSSDHAKAGGFSGKEDARSSTAMSVNKVSGGSSRHRRSMNGFPGSTPSRRQR 480 Query: 3369 ETGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGS 3190 ETGSSRNSS HKNL SEK SQSGL EKA DGT LEGH PKLIVKIPNRGRSPAQSA+AGS Sbjct: 481 ETGSSRNSS-HKNLISEKISQSGLREKANDGTLLEGHTPKLIVKIPNRGRSPAQSATAGS 539 Query: 3189 FDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSD 3010 DDPSIMNSRASSP LSEKHDQFDRCSKEK DFYRANIGADINTESWQSNDFKD+LTGSD Sbjct: 540 SDDPSIMNSRASSPALSEKHDQFDRCSKEKSDFYRANIGADINTESWQSNDFKDVLTGSD 599 Query: 3009 EGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVK 2830 EGDGSPAA TDE+ CRTG++CKK+ DV G+ESK RNLQDASYSSINALIEGVK Sbjct: 600 EGDGSPAAITDEQ-CRTGEDCKKVSDVSKTTSSSPGSESKARNLQDASYSSINALIEGVK 658 Query: 2829 YSEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD-AVVKSSGEDLVQD 2653 Y+EADDVGMNLLA VAAGEISKSE GMPAGSP++N TTIEQS G+ AVVKSS E LVQD Sbjct: 659 YTEADDVGMNLLARVAAGEISKSESGMPAGSPKKNTTTIEQSYAGNAAVVKSSEEYLVQD 718 Query: 2652 KCYANDGEQKKQVSISSDLGTDDVNDSGFLASGGKSAEHNMDTNSCSMDLQVTETCLESK 2473 KCY+N E KQ S S D GTDD + F G + EH N SMDLQVTET LESK Sbjct: 719 KCYSN-AEHMKQDSRSGDFGTDD-DIRAF--EGKATGEH----NPSSMDLQVTETSLESK 770 Query: 2472 GKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGESEA 2293 GKLI K +GT SA PES+ QE RD DSS N+D +V VA E E Sbjct: 771 GKLIVKSSGT-SAGIPESTFQEVRDIDSSKLVKEKKVVVRVDAVNNVD-EVNVVAREGET 828 Query: 2292 E------NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSV 2131 E + EV VKCDN A+EGL+ +ET KSPA + D+VK T EN Q SGY V Sbjct: 829 EAIEKLSHTCEEVDVKCDNHASEGLSCDKETAGKSPATCVPSDSVKATDENALQSSGYIV 888 Query: 2130 DKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVE 1951 DKVPE LNERE EKNDDMA QDH QS KQ+NESE+DA++VPENRGLCSG TGL AEYVE Sbjct: 889 DKVPEYLNERESEKNDDMAAQDHAKQSLKQKNESENDAIMVPENRGLCSGATGLDAEYVE 948 Query: 1950 ENSGTKEVYDQD--ARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXX 1777 ENSGTKEV DQD A Q+ HT PSFPS+E+DQ +G + SKLA ++ E+AEEC Sbjct: 949 ENSGTKEVCDQDAGAGQILHTDLPSFPSREMDQHSGQRDSKLAAMESEEAEECTSTTGDA 1008 Query: 1776 XXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPC 1597 KVEFDLNE L+ DDGKC E+ + Sbjct: 1009 SSASVAGVSEVDTKVEFDLNERLNADDGKCSEIPGSTPAARLVSPVPFSASSMSFGILSI 1068 Query: 1596 VITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAA 1417 + AAAAKGPFVP EDLLK K ELGWKGSAATSAFRPAEPRK+ME+PL ++TTPIPN Sbjct: 1069 TVAAAAAAKGPFVPHEDLLKSKKELGWKGSAATSAFRPAEPRKVMEIPLDMSTTPIPNDE 1128 Query: 1416 AKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKSTPSPVCCSGGLG 1237 A+ QSR PLD DLNV DE L+D+SSQNC+ QT CV+ + DGHDP+KS S V CSGGLG Sbjct: 1129 ARKQSRVPLDFDLNVSDEIILDDLSSQNCARQTDCVTRSDDGHDPNKSMASHVRCSGGLG 1188 Query: 1236 LDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVT 1057 LDLNLVD ASDVGNCT S+SHK DVPL Q KS+A PPN +MS RDFDLN+GP VDEVT Sbjct: 1189 LDLNLVDGASDVGNCTLSSSHKMDVPLTQFKSAASGPPNGKMSVLRDFDLNDGPIVDEVT 1248 Query: 1056 TEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKS 877 TE L+ST+ AR+S+PSQP +SGL MSNAEV N SSWFPSTGNTYSAVTISSIM DRGDK Sbjct: 1249 TEHLMSTRSARNSVPSQPPISGLRMSNAEVGNVSSWFPSTGNTYSAVTISSIMSDRGDKP 1308 Query: 876 FSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPS 700 FSI APN +R+LGPATG NPFG DIYRG VL SSPAVPY SAPFQYPVFPFNSSFPLPS Sbjct: 1309 FSIVAPNVSERVLGPATGSNPFGPDIYRGAVLSSSPAVPYQSAPFQYPVFPFNSSFPLPS 1368 Query: 699 ASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSR 520 ASFSGGST YVD+TSGGRLCFP VNSQL+G V +S+ YPRPYVVS PDG+NS+ AENSR Sbjct: 1369 ASFSGGSTPYVDTTSGGRLCFPVVNSQLIGSVGNVSAHYPRPYVVSFPDGSNSSGAENSR 1428 Query: 519 KWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEP 340 K ARQGLDLNAGPG SDLEGRDESSPLVPRQLSVASS A EEQARMFHL S VLKRKEP Sbjct: 1429 KRARQGLDLNAGPG-SDLEGRDESSPLVPRQLSVASSQAQLEEQARMFHLSSDVLKRKEP 1487 Query: 339 DGGQDGYKQSSWQYHSL 289 DGG DGYKQSSWQYH+L Sbjct: 1488 DGGWDGYKQSSWQYHNL 1504 >XP_006581932.1 PREDICTED: uncharacterized protein LOC100788512 [Glycine max] KRH54431.1 hypothetical protein GLYMA_06G184600 [Glycine max] Length = 1613 Score = 2013 bits (5214), Expect = 0.0 Identities = 1086/1634 (66%), Positives = 1210/1634 (74%), Gaps = 12/1634 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHGCGGE + K RHMWK+P RG+ + SF KDGRKISVG+CAL Sbjct: 1 MHGCGGE-KGKGTRHMWKAPVRGDSSLNADVSSSSSSSSST-VKSFCKDGRKISVGECAL 58 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PPFIGII +T GKE KLK+GV+WLYR IEV+L KGVPL+ PNEIF++FH D Sbjct: 59 FKPSEDRPPFIGIIHCLTFGKEKKLKLGVSWLYRSIEVKLNKGVPLEAAPNEIFYTFHKD 118 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 + A SLLHPCKVAFL KG ELPSG SSFVCRRVYDIANKCLWWL+DQDY NDCQEE+D+ Sbjct: 119 ETDAESLLHPCKVAFLRKGAELPSGFSSFVCRRVYDIANKCLWWLNDQDYINDCQEEVDQ 178 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYRTCV M ATV PGGR SQLKS SDS+QN+ SSFPS +KGRKRER DQ Sbjct: 179 LLYRTCVRMHATVQPGGRSPKPMSSPTSTSQLKSVSDSVQNNTSSFPSHIKGRKRERADQ 238 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E VKRE S K +DGDSGH R D+ILKTEIAK+TEKGGLVD EGVEKLVQLMVP RNEK Sbjct: 239 GSEPVKRERSIKTEDGDSGHFRHDNILKTEIAKITEKGGLVDNEGVEKLVQLMVPDRNEK 298 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASRS+LAAVIAAT+K DCLS+FVQLRGLPVFDEWLQE HKGKIGDG GSRDGDKSV Sbjct: 299 KIDLASRSLLAAVIAATEKLDCLSQFVQLRGLPVFDEWLQEVHKGKIGDGVGSRDGDKSV 358 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 359 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRTHKNTEIQRKARGLVDTWKKRVEAEM 418 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 I DAKSGS + WPA+ R +V GGNRHSG SSD+A+KSSVTQLSASK AS KIVQG Sbjct: 419 NIKDAKSGSGPTVHWPAKSRSSDVGHGGNRHSGASSDIAMKSSVTQLSASKTASVKIVQG 478 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN RS TS PG AKSVLSPA+ TAN KDGQP + A+SG SDLP+ +ARDEK Sbjct: 479 ENTIRSASTSTFPGPAKSVLSPASVTANLKDGQPCIAAVSGGSDLPMVNARDEKSSSSSQ 538 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG SGKEDA S+NGFPGSTPS GQRE Sbjct: 539 SHNNSQSCSSDHAKTGGHSGKEDARSSTAMSVNKISGGSSRHRKSINGFPGSTPSGGQRE 598 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSSLHKNLTSEK SQ GLM+KALDGTSLEG KLIVKIP++GRSPAQSASAGSF Sbjct: 599 TGSSRNSSLHKNLTSEKISQPGLMDKALDGTSLEGVTCKLIVKIPSQGRSPAQSASAGSF 658 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQFD CSKEK D YRANIG+DINTESWQSNDFKD+LTGSDE Sbjct: 659 DDPTIMNSRASSPVLPEKHDQFDHCSKEKSDLYRANIGSDINTESWQSNDFKDVLTGSDE 718 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 DGSPAA TDEE CR ++CKK +V SGNE+K NLQDASYSSINALIEGVKY Sbjct: 719 ADGSPAAVTDEERCRIVNDCKKTFEVPKAASSSSGNENKAGNLQDASYSSINALIEGVKY 778 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADDVGMNLLASVAAGEI KSE+ P GSPERN +EQS G+ +VKSS E+LV+D+C Sbjct: 779 SEADDVGMNLLASVAAGEILKSELLTPTGSPERNTAAVEQSCTGNDMVKSSEENLVRDEC 838 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N DGE K Q S++ DLG +D +DS F ASG K+A E N N+CSMDL QV+E L Sbjct: 839 HSNNGLDGEHKNQGSVTDDLGANDESDSDFRASGEKAARELNKSVNACSMDLQQVSEIIL 898 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK T+ ESS QEARD D S +DVKV VA E Sbjct: 899 ESKGKLNEKSVSTALRGLSESSVQEARDGDRSKQLQEVGRGVNGGEI--VDVKVSSVA-E 955 Query: 2301 SEAE------NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSG 2140 EAE + +V+V V+ DN EG + G T A+ + D +G ENV S Sbjct: 956 VEAEATEKLSHIAVKVDVQSDNCTAEGSSGGGRTA----AVLVPSDLARGKDENVLHSSA 1011 Query: 2139 YSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAE 1960 YSVDKVPEDL ERE EK DD+ ++ +QSKK+RNE ESD + +PENRGLCS VTG+AAE Sbjct: 1012 YSVDKVPEDLTERESEKADDVDAENLPSQSKKERNECESDTLTMPENRGLCSIVTGIAAE 1071 Query: 1959 YVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXX 1780 +VEEN TKEV+DQ AR+ SPS SQE+D+ KGSKL ++ E+AEEC Sbjct: 1072 HVEENLETKEVHDQPAREELPKDSPSVRSQEMDKHLDSKGSKLTAMEAEEAEECTSTTAD 1131 Query: 1779 XXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXP 1600 AKVEFDLNEGL+ DD KCGE N++ P P Sbjct: 1132 ASSVSAAAVSDADAKVEFDLNEGLNADDEKCGEF-NSSAPAGRLVSPVPFPASSMSCGIP 1190 Query: 1599 CVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +T AAAAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAE RK+MEMP G T+ IP+A Sbjct: 1191 APVTGAAAAKGRFVPPEDLLRSKGEIGWKGSAATSAFRPAELRKVMEMPFGALTSSIPDA 1250 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DISSQ C+ T VS DGHDP S SPV CSGG Sbjct: 1251 PAGKQSRAPLDIDLNVADERILDDISSQPCARHTDSVSLTTDGHDPVSSKMASPVRCSGG 1310 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VDEASDVGNC SN HK DVP+++VKSS PPNRE++ RDFDLNNGP+VDE Sbjct: 1311 LGLDLNQVDEASDVGNCLSSN-HKIDVPIMKVKSSLGGPPNREVNVHRDFDLNNGPSVDE 1369 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 VTTE + +Q ARSS+PSQP VSGL +S AE NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1370 VTTESSLFSQHARSSVPSQPPVSGLRVSTAEPVNF-SWLPSSGNTYSAVTISSIMPDRGD 1428 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVLSSPAVPYPSAPFQYPVFPFNSSFPLP 703 + FSI APNGPQR+L PA GGNPFG D+Y+G VLSS PF+YPVFPFNSSFPLP Sbjct: 1429 QPFSIVAPNGPQRLLTPAAGGNPFGPDVYKGPVLSS--------PFEYPVFPFNSSFPLP 1480 Query: 702 SASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENS 523 SASFS GSTTYV TSG RLCFP VNSQLMGP +SS YPRPYVV L +G+NS SAE S Sbjct: 1481 SASFSAGSTTYVYPTSGNRLCFPVVNSQLMGPAGAVSSHYPRPYVVGLTEGSNSGSAETS 1540 Query: 522 RKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKE 343 RKWARQGLDLNAGPG SD+EGRD++SPL RQLSVASS A AEEQAR+ L V KRKE Sbjct: 1541 RKWARQGLDLNAGPGGSDMEGRDDNSPLPSRQLSVASSQALAEEQARI-QLAGSVCKRKE 1599 Query: 342 PDGGQDGYKQSSWQ 301 PDGG DGY QSSWQ Sbjct: 1600 PDGGWDGYNQSSWQ 1613 >KHN19325.1 hypothetical protein glysoja_012245 [Glycine soja] Length = 1613 Score = 2008 bits (5202), Expect = 0.0 Identities = 1085/1634 (66%), Positives = 1207/1634 (73%), Gaps = 12/1634 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHGCGGE + K RHMWK+P RG+ + SF KDGRKISVG+CAL Sbjct: 1 MHGCGGE-KGKGTRHMWKAPVRGDSSLNADVSSSSSSSSST-VKSFCKDGRKISVGECAL 58 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PPFIGII +T GKE KLK+GV+WLYR IEV+L KGVPL+ PNEIF++FH D Sbjct: 59 FKPSEDRPPFIGIIHCLTFGKEKKLKLGVSWLYRSIEVKLNKGVPLEAAPNEIFYTFHKD 118 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 + A SLLHPCKVAFL KG ELPSG SSFVCRRVYDIANKCLWWL+DQDY NDCQEE+D+ Sbjct: 119 ETDAESLLHPCKVAFLRKGAELPSGFSSFVCRRVYDIANKCLWWLNDQDYINDCQEEVDQ 178 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYRTCV M ATV PGGR SQLKS SDS+QN+ SSFPS +KGRKRER DQ Sbjct: 179 LLYRTCVRMHATVQPGGRSPKPMSSPTSTSQLKSVSDSVQNNTSSFPSHIKGRKRERADQ 238 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E VKRE S K +DGDSGH R D+ILKTEIAK+TEKG LVD EGVEKLVQLMVP RNEK Sbjct: 239 GSEPVKRERSIKTEDGDSGHFRHDNILKTEIAKITEKGRLVDNEGVEKLVQLMVPDRNEK 298 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASRS+LAAVIAAT+K DCLS+FVQLRGLPVFDEWLQE HKGKIGDG GSRDGDKSV Sbjct: 299 KIDLASRSLLAAVIAATEKLDCLSQFVQLRGLPVFDEWLQEVHKGKIGDGVGSRDGDKSV 358 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 359 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRTHKNTEIQRKARGLVDTWKKRVEAEM 418 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 I DAKSGS + WPA+ R +V GGNRHSG SSD+A+KSSVTQLSASK AS KIVQG Sbjct: 419 NIKDAKSGSGPTVHWPAKSRSSDVGHGGNRHSGASSDIAMKSSVTQLSASKTASVKIVQG 478 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN RS TS PG AKSVLSPA+ TAN KDGQP + A+SG SDLP+ +ARDEK Sbjct: 479 ENTIRSASTSTFPGPAKSVLSPASVTANLKDGQPCIAAVSGGSDLPMVNARDEKSSSSSQ 538 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG SGKEDA S+NGF GSTPS GQRE Sbjct: 539 SHNNSQSCSSDHAKTGGHSGKEDARSSTAMSVNKISGGSSRHRKSINGFSGSTPSGGQRE 598 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSSLHKNLTSEK SQ GLM+KALDGTSLEG KLIVKIP++GRSPAQSASAGSF Sbjct: 599 TGSSRNSSLHKNLTSEKISQPGLMDKALDGTSLEGVTCKLIVKIPSQGRSPAQSASAGSF 658 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQFD CSKEK D YRANIG+DINTESWQSNDFKD+LTGSDE Sbjct: 659 DDPTIMNSRASSPVLPEKHDQFDHCSKEKSDLYRANIGSDINTESWQSNDFKDVLTGSDE 718 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 DGSPAA TDEE CR ++CKK +V SGNE+K NLQDASY+SINALIEGVKY Sbjct: 719 ADGSPAAVTDEERCRIVNDCKKTFEVPKAASSSSGNENKAGNLQDASYTSINALIEGVKY 778 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADDVGMNLLASVAAGEI KSE+ P GSPERN +EQS G+ +VKSS E+LV+D+C Sbjct: 779 SEADDVGMNLLASVAAGEILKSELLTPTGSPERNTAAVEQSCTGNGMVKSSEENLVRDEC 838 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N DGE K Q S++ DLG +D +DS F ASG K+A E N N+CSMDL QV+E L Sbjct: 839 HSNNGLDGEHKNQGSVTDDLGANDESDSDFRASGEKAARELNKSVNACSMDLQQVSEIIL 898 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK T+ ESS QEARD D S +DVKV VA E Sbjct: 899 ESKGKLNEKSVSTALRGLSESSVQEARDGDRSKQLQEVGRGVNGGEI--VDVKVSSVA-E 955 Query: 2301 SEAE------NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSG 2140 EAE + +VEV V+ DN EG + G T A+ + D +G ENV S Sbjct: 956 VEAEATEKLSHIAVEVDVQSDNCTAEGSSGGGRTA----AVLVPSDLARGKDENVLHSSA 1011 Query: 2139 YSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAE 1960 YSVDKVPEDL ERE EK DD+ ++ +QSKK+RNE ESD + +PENRGLCS VTG+AAE Sbjct: 1012 YSVDKVPEDLTERESEKADDVDAENLPSQSKKERNECESDTLTMPENRGLCSIVTGIAAE 1071 Query: 1959 YVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXX 1780 +VEEN TKEV+DQ AR+ SPS SQE+D+ KGSKL ++ E+AEEC Sbjct: 1072 HVEENLETKEVHDQPAREELPKDSPSVRSQEMDKHLDSKGSKLTAMEAEEAEECTSTTAD 1131 Query: 1779 XXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXP 1600 AKVEFDLNEGL+ DD KCGE N++ P P Sbjct: 1132 ASSVSAAAVSDADAKVEFDLNEGLNADDEKCGEF-NSSAPAGRLVSPVPFPASSMSCGIP 1190 Query: 1599 CVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVAAAAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAE RK+MEMP G T+ IP+A Sbjct: 1191 APVTVAAAAKGHFVPPEDLLRSKGEIGWKGSAATSAFRPAELRKVMEMPFGALTSSIPDA 1250 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DISSQ C+ T VS DGHDP S SPV C GG Sbjct: 1251 PAGKQSRAPLDIDLNVADERILDDISSQPCARHTDSVSLTTDGHDPVSSKMASPVRCYGG 1310 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VDEASDVGNC SN HK DVP+ QVKSS PPNRE++ RDFDLNNGP+VDE Sbjct: 1311 LGLDLNQVDEASDVGNCLSSN-HKIDVPIKQVKSSLGGPPNREVNVHRDFDLNNGPSVDE 1369 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 VTTE + +Q ARSS+PSQP VSGL +S AE NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1370 VTTESSLFSQHARSSVPSQPPVSGLRVSTAEPVNF-SWLPSSGNTYSAVTISSIMPDRGD 1428 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVLSSPAVPYPSAPFQYPVFPFNSSFPLP 703 + FSI APNGPQR+L PA GGNPFG D+Y+G VLSS PF+YPVFPFNSSFPLP Sbjct: 1429 QPFSIVAPNGPQRLLTPAAGGNPFGPDVYKGPVLSS--------PFEYPVFPFNSSFPLP 1480 Query: 702 SASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENS 523 SASFS GSTTYV TSG RLCFP VNSQLMGP +SS YPRPYVV L +G+NS SAE S Sbjct: 1481 SASFSAGSTTYVYPTSGNRLCFPVVNSQLMGPAGAVSSHYPRPYVVGLTEGSNSGSAETS 1540 Query: 522 RKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKE 343 RKWARQGLDLNAGPG SD+EGRD++SPL RQLSVASS A AEEQAR+ L V KRKE Sbjct: 1541 RKWARQGLDLNAGPGGSDMEGRDDNSPLPSRQLSVASSQALAEEQARI-QLAGSVCKRKE 1599 Query: 342 PDGGQDGYKQSSWQ 301 PDGG DGY QSSWQ Sbjct: 1600 PDGGWDGYNQSSWQ 1613 >XP_006578618.1 PREDICTED: uncharacterized protein LOC100780436 [Glycine max] XP_006578619.1 PREDICTED: uncharacterized protein LOC100780436 [Glycine max] XP_014630262.1 PREDICTED: uncharacterized protein LOC100780436 [Glycine max] KRH63484.1 hypothetical protein GLYMA_04G180100 [Glycine max] KRH63485.1 hypothetical protein GLYMA_04G180100 [Glycine max] KRH63486.1 hypothetical protein GLYMA_04G180100 [Glycine max] KRH63487.1 hypothetical protein GLYMA_04G180100 [Glycine max] KRH63488.1 hypothetical protein GLYMA_04G180100 [Glycine max] Length = 1616 Score = 1990 bits (5156), Expect = 0.0 Identities = 1090/1635 (66%), Positives = 1212/1635 (74%), Gaps = 13/1635 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GGE + KR RHMWKS TRG+ + SF KDGRKISVG+CAL Sbjct: 1 MHGRGGE-KGKRTRHMWKSSTRGDSSLNADVSSSSSSSSSS-VKSFCKDGRKISVGECAL 58 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PPFIGIIR +T GKE KLKVGV+WLYR IEV+L KG PL PNEIF++FH D Sbjct: 59 FKPSEDRPPFIGIIRCLTFGKEKKLKVGVSWLYRSIEVKLNKGGPLVAAPNEIFYTFHKD 118 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A SLLHPCKVAFL KG ELPSGISSFVCRRVYDIANKCLWWL+DQDY NDCQEE+D+ Sbjct: 119 EIDAESLLHPCKVAFLRKGAELPSGISSFVCRRVYDIANKCLWWLNDQDYINDCQEEVDQ 178 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYRTCV M A V PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 179 LLYRTCVGMHAAVQPGGRSPKPMSSPTSTSQLKSVSDSVQNSTSSFPSHIKGRKRERADQ 238 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 E KRE S K +DGDSGH R D+ILKTEIAK+TEKGGLVD EGVEKLVQLMVP +NE Sbjct: 239 SSEPAKRERSIKTEDGDSGHFRHDNILKTEIAKITEKGGLVDNEGVEKLVQLMVPEKNE- 297 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASR++LAAVIAATDK DCLS+FVQLRGLPVFDEWLQE HKGKIGDG GSRDGDKSV Sbjct: 298 KIDLASRALLAAVIAATDKLDCLSQFVQLRGLPVFDEWLQEVHKGKIGDGVGSRDGDKSV 357 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 358 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRTHKNTEIQRKARGLVDTWKKRVEAEM 417 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 I DAKSGS + W A+ R V QGGNRHSG SDVA+KSSVTQLSASK AS KIVQG Sbjct: 418 NIKDAKSGSGPTVHWSAKSRSSVVGQGGNRHSGALSDVAMKSSVTQLSASKTASVKIVQG 477 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS TSA PG AKSV SPA+AT N KDGQPR+ A +G SDLP+ +ARDEK Sbjct: 478 ENTTRSASTSAFPGPAKSVPSPASATTNLKDGQPRIAAANGGSDLPMVNARDEKSSSSSQ 537 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG SGKEDA S+NGF GSTPS GQRE Sbjct: 538 SHNNSQSCSSDHAKTGGHSGKEDARSSTAMSVNKISGGSSRHRKSINGFSGSTPSVGQRE 597 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNS LHKNLTSEK SQ GLMEKAL+G S KLIVKIP++ RSPAQSASAGSF Sbjct: 598 TGSSRNSPLHKNLTSEKISQPGLMEKALEGVSC-----KLIVKIPSQVRSPAQSASAGSF 652 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQFD+ SKEK D YRANI +DINTESWQSNDFKD+LTGSDE Sbjct: 653 DDPTIMNSRASSPVLPEKHDQFDQSSKEKSDLYRANIVSDINTESWQSNDFKDVLTGSDE 712 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 DGSPAA TDEEHC+ G++CKK L+V SGNE+K+ NLQDASY+SINALIEGVKY Sbjct: 713 ADGSPAAVTDEEHCQIGNDCKKTLEVPKAASSSSGNENKSGNLQDASYTSINALIEGVKY 772 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADDVGMNLLASVAAGEI KSE+ PAGSPERN T +EQS G+ VVKSS E+LV+D+C Sbjct: 773 SEADDVGMNLLASVAAGEILKSELLTPAGSPERNTTAVEQSCTGNGVVKSSEENLVRDEC 832 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N DGE K Q S++ DLG +D +DS F AS K+A E N N+CSMDL QV+ET L Sbjct: 833 HSNNGLDGEHKNQGSVTGDLGVNDESDSDFRASEEKAARELNKCVNACSMDLQQVSETIL 892 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL +K T+ ESS QEARD D S +DVKV VA E Sbjct: 893 ESKGKLNKKSVSTALGGLSESSVQEARDGDRSKQLQEVGRGVNADEI--VDVKVSSVA-E 949 Query: 2301 SEAE------NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSG 2140 +AE + +VEV V+ DN TE G +T AI +Q D+ +G ENV S Sbjct: 950 VKAEATEKLSHIAVEVDVQSDNCTTEVSTGGGQTA----AILVQSDSARGKDENVLHSSA 1005 Query: 2139 YSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAE 1960 YSVDKVPEDL ERE EK DD+ ++H +QSKKQRNE ESDA+ +PE+RGLCS VTG+AAE Sbjct: 1006 YSVDKVPEDLTEREFEKADDVDAENHSSQSKKQRNECESDALTMPEDRGLCSIVTGIAAE 1065 Query: 1959 YVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXX 1780 +VEEN TKEV+DQ AR+ SPS SQE+D+ KGSKL ++ E+AEEC Sbjct: 1066 HVEENLETKEVHDQPAREELPKDSPSVLSQEMDKHLDSKGSKLIAMEAEEAEECTSTTAD 1125 Query: 1779 XXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXP 1600 AKVEFDLNEGL+ DDGK GE +A GC P Sbjct: 1126 ASSMSSAAVSDADAKVEFDLNEGLNADDGKSGEFNCSAPAGC-LVSPVPFPASSMSCGIP 1184 Query: 1599 CVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVAAAAKGPFVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEMPLG TT IP+A Sbjct: 1185 APVTVAAAAKGPFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMPLGALTTSIPDA 1244 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DISSQ + T S A D HDP S SP+ CSGG Sbjct: 1245 PAGKQSRAPLDIDLNVADERILDDISSQTYARHTDSASLATDDHDPVCSKMSSPLRCSGG 1304 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VDEASDVGNC SN HK DVP++QVK S PPNRE++ RDFDLNNGP+VDE Sbjct: 1305 LGLDLNQVDEASDVGNCLSSN-HKIDVPIMQVKPSLGGPPNREVNVHRDFDLNNGPSVDE 1363 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 VTTE + + ARSS+PSQP VSGL +S AE NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1364 VTTESSLFSLHARSSVPSQPLVSGLRVSTAEPVNF-SWLPSSGNTYSAVTISSIMPDRGD 1422 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPL 706 FSI APNGPQR+L PA GGNPFG DIYRG VL SSPAV Y SAPF+YPVFPFNSSFPL Sbjct: 1423 HPFSIVAPNGPQRLLTPAAGGNPFGPDIYRGPVLSSSPAVSYASAPFEYPVFPFNSSFPL 1482 Query: 705 PSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAEN 526 PSASFS GSTTYV TSG +LCFPAVNSQLMGP +SS YPRP+VV L +G+NS SAE Sbjct: 1483 PSASFSSGSTTYVYPTSGNQLCFPAVNSQLMGPAGAVSSHYPRPFVVGLAEGSNSGSAET 1542 Query: 525 SRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRK 346 SRKWARQGLDLNAGPG SD++GRD +SPL RQLSVASS A AEEQ R+ L V KRK Sbjct: 1543 SRKWARQGLDLNAGPGGSDVDGRDGNSPLPSRQLSVASSQALAEEQVRV-QLAGSVRKRK 1601 Query: 345 EPDGGQDGYKQSSWQ 301 EPDGG DG+ QSSWQ Sbjct: 1602 EPDGGWDGHNQSSWQ 1616 >KHN16310.1 hypothetical protein glysoja_030285 [Glycine soja] Length = 1616 Score = 1988 bits (5150), Expect = 0.0 Identities = 1089/1635 (66%), Positives = 1212/1635 (74%), Gaps = 13/1635 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GGE + KR RHMWKS TRG+ + SF KDGRKISVG+CAL Sbjct: 1 MHGRGGE-KGKRTRHMWKSSTRGDSSLNADVSSSSSSSSSS-VKSFCKDGRKISVGECAL 58 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PPFIGIIR +T GKE KLKVGV+WLYR IEV+L KG PL PNEIF++FH D Sbjct: 59 FKPSEDRPPFIGIIRCLTFGKEKKLKVGVSWLYRSIEVKLNKGGPLVAAPNEIFYTFHKD 118 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A SLLHPCKVAFL KG ELPSGISSFVCRRVYDIANKCLWWL+DQDY NDCQEE+D+ Sbjct: 119 EIDAESLLHPCKVAFLRKGAELPSGISSFVCRRVYDIANKCLWWLNDQDYINDCQEEVDQ 178 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYRTCV M A V PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 179 LLYRTCVGMHAAVQPGGRSPKPMSSPTSTSQLKSVSDSVQNSTSSFPSHIKGRKRERADQ 238 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 E KRE S K +DGDSGH R D+ILKTEIAK+TEKGGLVD EGVEKLVQLMVP +NE Sbjct: 239 SSEPAKRERSIKTEDGDSGHFRHDNILKTEIAKITEKGGLVDNEGVEKLVQLMVPEKNE- 297 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASR++LAAVIAATDK DCLS+FVQLRGLPVFDEWLQE HKGKIGDG GSRDGDKSV Sbjct: 298 KIDLASRALLAAVIAATDKLDCLSQFVQLRGLPVFDEWLQEVHKGKIGDGVGSRDGDKSV 357 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 358 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRTHKNTEIQRKARGLVDTWKKRVEAEM 417 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 I DAKSGS + W A+ R V QGGNRHSG SDVA+KSSVTQLSASK AS KIVQG Sbjct: 418 NIKDAKSGSGPTVHWSAKSRSSVVGQGGNRHSGALSDVAMKSSVTQLSASKTASVKIVQG 477 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS TSA PG AKSV SPA+AT N KDGQPR+ A +G SDLP+ +ARDEK Sbjct: 478 ENTTRSASTSAFPGPAKSVPSPASATTNLKDGQPRIAAANGGSDLPMVNARDEKSSSSSQ 537 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG SGKEDA S+NGF GSTPS GQRE Sbjct: 538 SHNNSQSCSSDHAKTGGHSGKEDARSSTAMSVNKISGGSSRHRKSINGFSGSTPSVGQRE 597 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNS LHKNLTSEK SQ GLMEKAL+G S KLIVKIP++ RSPAQSASAGSF Sbjct: 598 TGSSRNSPLHKNLTSEKISQPGLMEKALEGVSC-----KLIVKIPSQVRSPAQSASAGSF 652 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQFD+ SKEK D YRANI +DINTESWQSNDFKD+LTGSDE Sbjct: 653 DDPTIMNSRASSPVLPEKHDQFDQSSKEKSDLYRANIVSDINTESWQSNDFKDVLTGSDE 712 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 DGSPAA TDEE C+ G++CKK L+V SGNE+K+ NLQDASY+SINALIEGVKY Sbjct: 713 ADGSPAAVTDEERCQIGNDCKKTLEVPKAASSSSGNENKSGNLQDASYTSINALIEGVKY 772 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADDVGMNLLASVAAGEI KSE+ PAGSPERN T +EQS G+ VVKSS E+LV+D+C Sbjct: 773 SEADDVGMNLLASVAAGEILKSELLTPAGSPERNTTAVEQSCTGNGVVKSSEENLVRDEC 832 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N DGE K Q S++ DLG +D +DS F AS K+A E N N+CSMDL QV+ET L Sbjct: 833 HSNNGLDGEHKNQGSVTGDLGVNDESDSDFRASEEKAARELNKCVNACSMDLQQVSETIL 892 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL +K T+ ESS QEARD D S +DVKV VA E Sbjct: 893 ESKGKLNKKSVSTALGGLSESSVQEARDGDRSKQLQEVGRGVNADEI--VDVKVSSVA-E 949 Query: 2301 SEAE------NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSG 2140 +AE + +VEV V+ DN TE G +T AI +Q D+ +G ENV S Sbjct: 950 VKAEATEKLSHIAVEVDVQSDNCTTEVSTGGGQTA----AILVQSDSARGKDENVLHSSA 1005 Query: 2139 YSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAE 1960 YSVDKVPEDL ERE EK DD+ ++H +QSKKQRNE ESDA+ +PE+RGLCS VTG+AAE Sbjct: 1006 YSVDKVPEDLTEREFEKADDVDAENHSSQSKKQRNECESDALTMPEDRGLCSIVTGIAAE 1065 Query: 1959 YVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXX 1780 +VEEN TKEV+DQ AR+ SPS SQE+D+ KGSKL ++ E+AEEC Sbjct: 1066 HVEENLETKEVHDQPAREELPKDSPSVLSQEMDKHLDSKGSKLIAMEAEEAEECTSTTAD 1125 Query: 1779 XXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXP 1600 AKVEFDLNEGL+ DDGK GE +A GC P Sbjct: 1126 ASSMSSAAVSDADAKVEFDLNEGLNADDGKSGEFNCSAPAGC-LVSPVPFPASSMSCGIP 1184 Query: 1599 CVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVAAAAKGPFVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEMPLG TT IP+A Sbjct: 1185 APVTVAAAAKGPFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMPLGALTTSIPDA 1244 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DISSQ + T S A D HDP S SP+ CSGG Sbjct: 1245 PAGKQSRAPLDIDLNVADERILDDISSQTYARHTDSASLATDDHDPVCSKMSSPLRCSGG 1304 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VDEASDVGNC SN HK DVP++QVK S PPNRE++ RDFDLNNGP+VDE Sbjct: 1305 LGLDLNQVDEASDVGNCLSSN-HKIDVPIMQVKPSLGGPPNREVNVHRDFDLNNGPSVDE 1363 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 VTTE + + ARSS+PSQP VSGL +S AE NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1364 VTTESSLFSLHARSSVPSQPLVSGLRVSTAEPVNF-SWLPSSGNTYSAVTISSIMPDRGD 1422 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPL 706 + FSI APNGPQR+L PA GGNPFG DIYRG VL SSPAV Y SAPF+YPVFPFNSSFPL Sbjct: 1423 QPFSIVAPNGPQRLLTPAAGGNPFGPDIYRGPVLSSSPAVSYASAPFEYPVFPFNSSFPL 1482 Query: 705 PSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAEN 526 PSASFS GSTTYV TSG +LCFPAVNSQLMGP +SS YPRP+VV L +G+NS SAE Sbjct: 1483 PSASFSSGSTTYVYPTSGNQLCFPAVNSQLMGPAGAVSSHYPRPFVVGLAEGSNSGSAET 1542 Query: 525 SRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRK 346 SRKWARQGLDLNAGPG SD++GRD +SPL RQLSVASS A AEEQ R+ L V KRK Sbjct: 1543 SRKWARQGLDLNAGPGGSDVDGRDGNSPLPSRQLSVASSQALAEEQVRV-QLAGSVRKRK 1601 Query: 345 EPDGGQDGYKQSSWQ 301 EPDGG DG+ QSSWQ Sbjct: 1602 EPDGGWDGHNQSSWQ 1616 >XP_007138108.1 hypothetical protein PHAVU_009G180900g [Phaseolus vulgaris] ESW10102.1 hypothetical protein PHAVU_009G180900g [Phaseolus vulgaris] Length = 1617 Score = 1967 bits (5097), Expect = 0.0 Identities = 1064/1634 (65%), Positives = 1194/1634 (73%), Gaps = 12/1634 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG G E K RHMWK+PTRG+ + F+KD RKISVG+CAL Sbjct: 1 MHGRGCE---KGTRHMWKAPTRGDSSLNADVSSSSSSSSST-VKLFFKDRRKISVGECAL 56 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FK D PPFIGIIR +T+GK+ KLK GV+WLYR IEV+L KGVPL+ PNEIF++FH D Sbjct: 57 FKVSEDCPPFIGIIRYLTIGKDKKLKFGVSWLYRSIEVKLSKGVPLEAAPNEIFYTFHKD 116 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A +LLHPCKVAFLPKG EL GISSF+CRRVYDIANKCLWWL+DQDY NDCQEE+DK Sbjct: 117 EIDAEALLHPCKVAFLPKGAELQPGISSFLCRRVYDIANKCLWWLNDQDYINDCQEEVDK 176 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYR CVEM ATV PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 177 LLYRNCVEMHATVQPGGRSPKPMSSPTSTSQLKSASDSVQNSTSSFPSHIKGRKRERADQ 236 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G ESVKRE S K +DGDSG+ R D+I KTEIAK+TEKGGLVD EGVEKLVQLMVP RNEK Sbjct: 237 GSESVKRERSIKAEDGDSGNFRHDNIFKTEIAKITEKGGLVDGEGVEKLVQLMVPDRNEK 296 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KID+ASRS+LAAVIAATDK DCLS+FVQLRGLPVFDEWLQE HKGKIG DGDKS Sbjct: 297 KIDIASRSLLAAVIAATDKLDCLSQFVQLRGLPVFDEWLQEVHKGKIG------DGDKSA 350 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 351 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRTHKNTEIQRKARGLVDTWKKRVEAEM 410 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 INDAKSGS + WPA+ R +V QGGNRHSG SSDV +KSSVTQLSASK AS KIVQG Sbjct: 411 NINDAKSGSGPNVHWPAKSRPSDVGQGGNRHSGASSDVGMKSSVTQLSASKTASVKIVQG 470 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS LTSA PG AKS SPA TAN KDGQPR+VA++G SDLP+A+ARDEK Sbjct: 471 ENITRSALTSAFPGPAKSAPSPAAVTANLKDGQPRIVAVNGGSDLPMANARDEKSCSSSQ 530 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG S KEDA S+NGF GSTPS GQRE Sbjct: 531 SHNNSQSCSSDHAKTGGHSVKEDARSSTAMSVNKISGGSSRHRKSINGFSGSTPSGGQRE 590 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSSLHKNLTSEK S GLM+KA+DGTSL+G+IPKLIVKIPN+GRSPAQS SAGSF Sbjct: 591 TGSSRNSSLHKNLTSEKISPPGLMDKAVDGTSLDGNIPKLIVKIPNQGRSPAQSVSAGSF 650 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL +KHDQ D KEK D YR NIG+DINTESWQSNDFKD+LTGSDE Sbjct: 651 DDPTIMNSRASSPVLPDKHDQSDHSPKEKSDLYRVNIGSDINTESWQSNDFKDVLTGSDE 710 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 GDGSPAA TDEEHCRTG++CKK L+V SGNE K N+QDASYSSINALIEGVKY Sbjct: 711 GDGSPAAVTDEEHCRTGNDCKKALEVSKAASSSSGNEHKAGNMQDASYSSINALIEGVKY 770 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADD GMNLLASVAAGEI KSE+ PAGSPERN T +EQS + V KSS E+LV+D+C Sbjct: 771 SEADDGGMNLLASVAAGEILKSELLTPAGSPERNTTAVEQSCTDNGVNKSSEENLVRDEC 830 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N DGE K S++ DLG +D +DS F ASGGK+A E N N+CSMDL QVTET L Sbjct: 831 HSNNGLDGEHKNLASVTGDLGANDESDSDFQASGGKAARELNKRVNACSMDLQQVTETTL 890 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK TS E+S QEA D D S + V +V E Sbjct: 891 ESKGKLNEKSGPTSLGGLAENSVQEAGDADRSKQLQEVVQGVNAGETHDKVSCVAEVEAE 950 Query: 2301 SEAE--NPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVD 2128 + + + +VEV + DN EG + + +K PAI +Q D G +N SGYSVD Sbjct: 951 AAKKLLHTAVEVDAQSDNCTAEGSSGCGQLVKKPPAILVQSDLASGKDDNALHSSGYSVD 1010 Query: 2127 KVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEE 1948 +VP+D +RE EK DD+ ++HV+QSK +RNESESDA+ +PEN+GLCS VTGL AE+VEE Sbjct: 1011 EVPKDFTDRESEKTDDVDAENHVSQSKNKRNESESDALTMPENKGLCSVVTGLVAEHVEE 1070 Query: 1947 NSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXX 1768 N KEV DQ AR+ P SPS SQEID+ K KL + E+AEEC Sbjct: 1071 NLEAKEVRDQPAREDPPEDSPSVRSQEIDKHLDSKRLKLTSTETEEAEECTSTTADASSM 1130 Query: 1767 XXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPCV-- 1594 AKV FDLNEGL+ DDG+C E + GCA + Sbjct: 1131 SAAAVSDVDAKVGFDLNEGLNADDGRC-EFNSIVTSGCAPAGQLISPVPFPASSMSGILA 1189 Query: 1593 -ITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAA 1417 +TVA+AAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEMPLG + TPI +A Sbjct: 1190 PVTVASAAKGHFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMPLGTSATPIADAP 1249 Query: 1416 AKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGGL 1240 A QSRAPL+IDLNV DER L+DIS C+ T +S A D HDP S PSPV SGGL Sbjct: 1250 AGKQSRAPLNIDLNVADERILDDIS---CARHTNSISLATDCHDPVCSKIPSPVRSSGGL 1306 Query: 1239 GLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEV 1060 GLDLN D+ASD+ C SN HK DVP +Q KSS PPNRE + RDFDLNNGP+VDEV Sbjct: 1307 GLDLNQADDASDIDICLSSN-HKIDVPTMQGKSSLGGPPNREANVHRDFDLNNGPSVDEV 1365 Query: 1059 TTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDK 880 TTE +Q ARSS+PSQ VSGL ++ AE NF SW PS+GNTYSAVTISSIMPDRGD+ Sbjct: 1366 TTESSFFSQYARSSVPSQLPVSGLRVTTAEPGNF-SWLPSSGNTYSAVTISSIMPDRGDQ 1424 Query: 879 SFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLP 703 FS+ PNGPQR+L PA GGNPFG DIYR VL SSPAV YPSAPF+YPVFPFNSSFPLP Sbjct: 1425 PFSVVTPNGPQRLLTPAAGGNPFGPDIYRAPVLSSSPAVSYPSAPFEYPVFPFNSSFPLP 1484 Query: 702 SASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENS 523 SASFS GST YV TS RLCFPAVNSQLMGP T+SS YPRPYVV L +G+NS SAE S Sbjct: 1485 SASFSAGSTAYVYPTSANRLCFPAVNSQLMGPAGTVSSHYPRPYVVGLTEGSNSGSAETS 1544 Query: 522 RKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKE 343 RKW RQGLDLNAGPG SD+EGRDESSPL RQLSVASS A AEEQAR+ L V KRKE Sbjct: 1545 RKWTRQGLDLNAGPGCSDMEGRDESSPLPSRQLSVASSQALAEEQARI-QLAGSVRKRKE 1603 Query: 342 PDGGQDGYKQSSWQ 301 PDGG DGY QSSWQ Sbjct: 1604 PDGGWDGYNQSSWQ 1617 >XP_017421635.1 PREDICTED: uncharacterized protein LOC108331459 [Vigna angularis] KOM40325.1 hypothetical protein LR48_Vigan04g052300 [Vigna angularis] Length = 1615 Score = 1947 bits (5045), Expect = 0.0 Identities = 1054/1635 (64%), Positives = 1194/1635 (73%), Gaps = 13/1635 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GGE K RHMWK+PT GN + SF+KD RKISVG+CAL Sbjct: 1 MHGRGGE---KGTRHMWKAPTLGNSSLNADVSSSSSSSS---VKSFFKDRRKISVGECAL 54 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PP+IGIIR +T+GK+ KLK GV+WLYR IEV+L K VPL+ PNEIF++FH D Sbjct: 55 FKPSEDRPPYIGIIRYLTIGKDKKLKFGVSWLYRSIEVKLSKSVPLEAAPNEIFYTFHKD 114 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A +LLHPCKVAFLPKG EL GISSF+CRRVYDIANKCLWWL+DQDY NDCQEE+DK Sbjct: 115 EIDAEALLHPCKVAFLPKGAELQPGISSFLCRRVYDIANKCLWWLNDQDYINDCQEEVDK 174 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYR CVEM ATV PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 175 LLYRNCVEMHATVQPGGRSPKPMSSPTSTSQLKSASDSVQNSTSSFPSHIKGRKRERADQ 234 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G ES KRE S K +DGDSG+ + D+I KTEIAK+TEKGGLVD EGVEKLVQLMVP RNEK Sbjct: 235 GSESAKRERSIKTEDGDSGNFKHDNIFKTEIAKITEKGGLVDGEGVEKLVQLMVPDRNEK 294 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASRS+LAAV+AATDK DCLS+FVQ RGLPVFDEWLQE HKGKIG D DKS Sbjct: 295 KIDLASRSLLAAVVAATDKLDCLSQFVQQRGLPVFDEWLQEVHKGKIG------DSDKSA 348 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLR+HKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 349 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRSHKNAEIQRKARGLVDTWKKRVEAEM 408 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 INDAKSGS + WPA+ R +V+QGGNRHSG SSDVA+KSSVTQLSASK S KIVQG Sbjct: 409 NINDAKSGSGTNVHWPAKSRPSDVSQGGNRHSGASSDVAMKSSVTQLSASKTTSVKIVQG 468 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS TSA PG AKSV SPA ATAN KDGQPR+VA++G SD P+A+ARDEK Sbjct: 469 ENITRSASTSAFPGPAKSVPSPAAATANLKDGQPRIVAVNGGSDPPMANARDEKSCSSSQ 528 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG S KEDA S+NGF GS+PS GQRE Sbjct: 529 SHNNSQSCSSDHAKTGGHSVKEDARSSTAMSANKISGGSSRHRKSINGFSGSSPSGGQRE 588 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSS+HKNLTSEK SQ GLM+KA+DGTSLEG+IPKLIVKIPN+GRSPAQS SAGSF Sbjct: 589 TGSSRNSSMHKNLTSEKISQPGLMDKAVDGTSLEGNIPKLIVKIPNQGRSPAQSVSAGSF 648 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQ D KEK D YR NIG+DINTESWQSNDFK++LTGSDE Sbjct: 649 DDPTIMNSRASSPVLPEKHDQSDHSPKEKSDLYRVNIGSDINTESWQSNDFKEVLTGSDE 708 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 GDGSPAA TDEEHCRTG++CKK L+V SGNE+K NLQDASYSSINALIEGVKY Sbjct: 709 GDGSPAAVTDEEHCRTGNDCKKALEVSKAASSSSGNENKAGNLQDASYSSINALIEGVKY 768 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADD GMNLLASVAAGEI KSE+ P GSPERN +EQS D VKSS E+LV+D+C Sbjct: 769 SEADDGGMNLLASVAAGEILKSELLTPTGSPERNTAAVEQSCT-DNGVKSSEENLVRDEC 827 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N D E K S++ DLG + +DS F ASGGK+A E N ++C+MDL QV ET L Sbjct: 828 HSNNALDVEHKNLASVTGDLGANGESDSDFQASGGKTARELNKRVDACNMDLQQVAETTL 887 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK T E+S QEA D D S + V DV E Sbjct: 888 ESKGKLNEKSGPTPLVGLAENSVQEAGDGDGSKQLQEVVQGVNAGEIHDKINCVADVEAE 947 Query: 2301 S--EAENPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVD 2128 + + + + EV V+ DN EG N +T + PA+ +Q D +G + SGYSVD Sbjct: 948 AAKKLSHTAAEVDVQSDNYTAEGSNGSGQTVKNPPAVLVQSDLARGKDDKALHSSGYSVD 1007 Query: 2127 KVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEE 1948 KV +D ERE +K DD+ ++HVNQ + +R+ESESD + +PENRG+CS VTGL AE+VEE Sbjct: 1008 KVLKDFPERESDKIDDVDAENHVNQCRSKRSESESDTLTMPENRGICSIVTGLVAEHVEE 1067 Query: 1947 NSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXX 1768 N T++V DQ +R+ SPS SQEID+ K KL + E+AEEC Sbjct: 1068 NLETRDVRDQPSREDLPEDSPSVLSQEIDKHLNSKRLKLTSSEAEEAEECTSTTADASSM 1127 Query: 1767 XXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPC--- 1597 AKVEFDLNEGL+ +DG+C E + A GCA C Sbjct: 1128 SAAAVSDADAKVEFDLNEGLNANDGRC-EFNSIANSGCAPSGRLISPVPFPASSMSCGIL 1186 Query: 1596 -VITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVA+AAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEMPLG + TPIP+A Sbjct: 1187 APVTVASAAKGHFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMPLGTSATPIPDA 1246 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DIS C+ T +S A D HDP S PSPV SGG Sbjct: 1247 PAGKQSRAPLDIDLNVADERILDDIS---CARHTNSISLATDSHDPVCSKIPSPVRSSGG 1303 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VD+ASD+G C SN HK DVP++Q KSS PP RE++ RDFDLNNGP+VDE Sbjct: 1304 LGLDLNQVDDASDMGICLNSN-HKIDVPIMQGKSSLGGPPIREVNAHRDFDLNNGPSVDE 1362 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 +TTE + +Q +RS++PSQ VSG M+ AE NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1363 MTTESSLFSQHSRSNVPSQLPVSGHRMTTAEPGNF-SWLPSSGNTYSAVTISSIMPDRGD 1421 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPL 706 + FSI APNGPQR+L PA GGNPFG DIYR VL SSPAV YPSAPF+YPVFPFNSSFPL Sbjct: 1422 QPFSIVAPNGPQRLLTPAAGGNPFGPDIYRAPVLSSSPAVSYPSAPFEYPVFPFNSSFPL 1481 Query: 705 PSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAEN 526 PSASFS GSTTYV TS RLCFPAVNSQLMGP T+SS YPRPYVV LP+GNNS SAE Sbjct: 1482 PSASFSAGSTTYVYPTSANRLCFPAVNSQLMGPAGTVSSHYPRPYVVGLPEGNNSGSAET 1541 Query: 525 SRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRK 346 SRKW RQGLDLNAGPG SD+EGRDES PL RQLSVASS A AEEQAR+ L V KRK Sbjct: 1542 SRKWTRQGLDLNAGPGCSDMEGRDESLPLPSRQLSVASSQALAEEQARI-RLAGSVRKRK 1600 Query: 345 EPDGGQDGYKQSSWQ 301 EPDGG DGY QSSWQ Sbjct: 1601 EPDGGWDGYNQSSWQ 1615 >BAT79550.1 hypothetical protein VIGAN_02245700 [Vigna angularis var. angularis] Length = 1615 Score = 1946 bits (5041), Expect = 0.0 Identities = 1053/1635 (64%), Positives = 1194/1635 (73%), Gaps = 13/1635 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GGE K RHMWK+PT GN + SF+KD RKISVG+CAL Sbjct: 1 MHGRGGE---KGTRHMWKAPTLGNSSLNADVSSSSSSSS---VKSFFKDRRKISVGECAL 54 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PP+IGIIR +T+GK+ KLK GV+WLYR IEV+L K VPL+ PNEIF++FH D Sbjct: 55 FKPSEDRPPYIGIIRYLTIGKDKKLKFGVSWLYRSIEVKLSKSVPLEAAPNEIFYTFHKD 114 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A +LLHPCKVAFLPKG EL GISSF+CRRVYDIANKCLWWL+DQDY NDCQEE+DK Sbjct: 115 EIDAEALLHPCKVAFLPKGAELQPGISSFLCRRVYDIANKCLWWLNDQDYINDCQEEVDK 174 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYR CVEM ATV PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 175 LLYRNCVEMHATVQPGGRSPKPMSSPTSTSQLKSASDSVQNSTSSFPSHIKGRKRERADQ 234 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G ES KRE S K +DGDSG+ + D+I KTEIAK+TEKGGLVD EGVEKLVQLMVP RNEK Sbjct: 235 GSESAKRERSIKTEDGDSGNFKHDNIFKTEIAKITEKGGLVDGEGVEKLVQLMVPDRNEK 294 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASRS+LAAV+AATDK DCLS+FVQ RGLPVFDEWLQE HKGKIG D DKS Sbjct: 295 KIDLASRSLLAAVVAATDKLDCLSQFVQQRGLPVFDEWLQEVHKGKIG------DSDKSA 348 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLR+HKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 349 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRSHKNAEIQRKARGLVDTWKKRVEAEM 408 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 INDAKSGS + WPA+ R +V+QGGNRHSG SSDVA+KSSVTQLSASK S KIVQG Sbjct: 409 NINDAKSGSGTNVHWPAKSRPSDVSQGGNRHSGASSDVAMKSSVTQLSASKTTSVKIVQG 468 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS TSA PG AKSV SPA ATAN KDGQPR+VA++G SD P+A+ARDEK Sbjct: 469 ENITRSASTSAFPGPAKSVPSPAAATANLKDGQPRIVAVNGGSDPPMANARDEKSCSSSQ 528 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG S KEDA S+NGF GS+PS GQRE Sbjct: 529 SHNNSQSCSSDHAKTGGHSVKEDARSSTAMSANKISGGSSRHRKSINGFSGSSPSGGQRE 588 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSS+HKNLTSEK SQ GLM+KA+DGTSLEG+IPKLIVKIPN+GRSPAQS SAGSF Sbjct: 589 TGSSRNSSMHKNLTSEKISQPGLMDKAVDGTSLEGNIPKLIVKIPNQGRSPAQSVSAGSF 648 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQ D KEK D YR NIG+DINTESWQSNDFK++LTGSDE Sbjct: 649 DDPTIMNSRASSPVLPEKHDQSDHSPKEKSDLYRVNIGSDINTESWQSNDFKEVLTGSDE 708 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 GDGSPAA TDEEHCRTG++CKK L+V SGNE+K NLQDASYSSINALIEGVKY Sbjct: 709 GDGSPAAVTDEEHCRTGNDCKKALEVSKAASSSSGNENKAGNLQDASYSSINALIEGVKY 768 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADD GMNLLASVAAGEI KSE+ P GSPERN +EQS D VKSS E+LV+D+C Sbjct: 769 SEADDGGMNLLASVAAGEILKSELLTPTGSPERNTAAVEQSCT-DNGVKSSEENLVRDEC 827 Query: 2646 YAN---DGEQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N D E K S++ DLG + +DS F ASGGK+A E N ++C+MDL QV ET L Sbjct: 828 HSNNALDVEHKNLASVTGDLGANGESDSDFQASGGKTARELNKRVDACNMDLQQVAETTL 887 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK T E+S QEA D D S + V DV E Sbjct: 888 ESKGKLNEKSGPTPLVGLAENSVQEAGDGDGSKQLQEVVQGVNAGEIHDKINCVADVEAE 947 Query: 2301 S--EAENPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVD 2128 + + + + EV V+ DN EG N +T + PA+ +Q D +G + SGYSVD Sbjct: 948 AAKKLSHTAAEVDVQSDNYTAEGSNGSGQTVKNPPAVLVQSDLARGKDDKALHSSGYSVD 1007 Query: 2127 KVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEE 1948 KV +D ERE +K DD+ ++HVNQ + +R+ESESD + +PENRG+CS VTGL AE+VEE Sbjct: 1008 KVLKDFPERESDKIDDVDAENHVNQCRSKRSESESDTLTMPENRGICSIVTGLVAEHVEE 1067 Query: 1947 NSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXX 1768 N T++V DQ +R+ SPS SQEID+ K KL + E+AEEC Sbjct: 1068 NLETRDVRDQPSREDLPEDSPSVLSQEIDKHLNSKRLKLTSSEAEEAEECTSTTADASSM 1127 Query: 1767 XXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPC--- 1597 AKVEFDLNEGL+ +DG+C E + A GCA C Sbjct: 1128 SAAAVSDADAKVEFDLNEGLNANDGRC-EFNSIANSGCAPSGRLISPVPFPASSMSCGIL 1186 Query: 1596 -VITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVA+AAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEMPLG + TPIP+A Sbjct: 1187 APVTVASAAKGHFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMPLGTSATPIPDA 1246 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DIS C+ T +S A D HDP S PSPV SGG Sbjct: 1247 PAGKQSRAPLDIDLNVADERILDDIS---CARHTNSISLATDSHDPVCSKIPSPVRSSGG 1303 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VD+ASD+G C SN HK DVP++Q KSS PP RE++ RDFDLNNGP+VDE Sbjct: 1304 LGLDLNQVDDASDMGICLNSN-HKIDVPIMQGKSSLGGPPIREVNAHRDFDLNNGPSVDE 1362 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 +TTE + +Q +RS++PSQ VSG M+ AE NF SW PS+GNTYSAV+ISSIMPDRGD Sbjct: 1363 MTTESSLFSQHSRSNVPSQLPVSGHRMTTAEPGNF-SWLPSSGNTYSAVSISSIMPDRGD 1421 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPL 706 + FSI APNGPQR+L PA GGNPFG DIYR VL SSPAV YPSAPF+YPVFPFNSSFPL Sbjct: 1422 QPFSIVAPNGPQRLLTPAAGGNPFGPDIYRAPVLSSSPAVSYPSAPFEYPVFPFNSSFPL 1481 Query: 705 PSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAEN 526 PSASFS GSTTYV TS RLCFPAVNSQLMGP T+SS YPRPYVV LP+GNNS SAE Sbjct: 1482 PSASFSAGSTTYVYPTSANRLCFPAVNSQLMGPAGTVSSHYPRPYVVGLPEGNNSGSAET 1541 Query: 525 SRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRK 346 SRKW RQGLDLNAGPG SD+EGRDES PL RQLSVASS A AEEQAR+ L V KRK Sbjct: 1542 SRKWTRQGLDLNAGPGCSDMEGRDESLPLPSRQLSVASSQALAEEQARI-QLAGSVRKRK 1600 Query: 345 EPDGGQDGYKQSSWQ 301 EPDGG DGY QSSWQ Sbjct: 1601 EPDGGWDGYNQSSWQ 1615 >XP_014625222.1 PREDICTED: uncharacterized protein LOC100792096 [Glycine max] KRH03340.1 hypothetical protein GLYMA_17G092200 [Glycine max] Length = 1427 Score = 1916 bits (4963), Expect = 0.0 Identities = 1040/1447 (71%), Positives = 1132/1447 (78%), Gaps = 13/1447 (0%) Frame = -1 Query: 4590 VHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQGPESVKRECSAK 4411 + PGGR SQLKSGSDS+QNS SSF S VKGRKRERVD+G ESVKR+ S K Sbjct: 1 MQPGGRSPKPVSSPTSTSQLKSGSDSVQNSVSSFSSHVKGRKRERVDKGQESVKRDRSTK 60 Query: 4410 NDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEKKIDLASRSMLAA 4231 DDGDSGH +QDSILKTEIAKVTEKGGL+DTEGVEKLVQLMVP NEKKIDLASRSMLAA Sbjct: 61 IDDGDSGHFKQDSILKTEIAKVTEKGGLIDTEGVEKLVQLMVPDGNEKKIDLASRSMLAA 120 Query: 4230 VIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSVDEFLLGLLRALD 4051 VIAATDKFDCLSRFVQL+GLP+FDEWLQEAHKGKIGDG GSRDGDKSVD+FL LLRALD Sbjct: 121 VIAATDKFDCLSRFVQLKGLPIFDEWLQEAHKGKIGDGLGSRDGDKSVDDFLFVLLRALD 180 Query: 4050 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDAKSGSVQS 3871 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDA+SGSVQ+ Sbjct: 181 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDARSGSVQT 240 Query: 3870 GPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQGENNTRSVLTSAS 3691 PWPARQRL EVAQGGN+HS GS+DVA+KSSVTQLSASK ASAKIV GE NTRS TSAS Sbjct: 241 VPWPARQRLSEVAQGGNKHSSGSADVAMKSSVTQLSASKTASAKIVPGE-NTRS--TSAS 297 Query: 3690 PGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXXXXXXXXXXXSDH 3511 PGS KSV SPA ATAN KDGQP V SGS DLPVA+ARDEK SDH Sbjct: 298 PGSTKSVPSPAPATANLKDGQPCVATASGSFDLPVANARDEK-SSSSSPSHNSQSYSSDH 356 Query: 3510 AKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRETGSSRNSSLHKN 3331 KAGGLSGKEDA SM GFPGSTPSR QRETGS +NSS HKN Sbjct: 357 VKAGGLSGKEDARSSTAMSVNKVSGGSSRHRKSMKGFPGSTPSRRQRETGSRQNSS-HKN 415 Query: 3330 LTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSFDDPSIMNSRASS 3151 L SEK SQSGL EKA DGT+LEGH PKLIVKI NR RSPAQSA+AGS DDP+IMNSRASS Sbjct: 416 LISEKISQSGLREKANDGTTLEGHTPKLIVKIQNRSRSPAQSATAGSSDDPAIMNSRASS 475 Query: 3150 PVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDEGDGSPAATTDEE 2971 PVLSEKHDQFDRCSKEK DFYRANIGADI TESWQSNDFKD+LTGSDEGDGSP A TDE+ Sbjct: 476 PVLSEKHDQFDRCSKEKSDFYRANIGADIYTESWQSNDFKDVLTGSDEGDGSPEAITDEQ 535 Query: 2970 HCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKYSEADDVGMNLLA 2791 CRTG++CKK+LDV SGNES+ RNLQDASYSSINALIEGVKY+EADDVGMNLLA Sbjct: 536 -CRTGEDCKKVLDVSKAASSSSGNESRARNLQDASYSSINALIEGVKYTEADDVGMNLLA 594 Query: 2790 SVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD-AVVKSSGEDLVQDKCYANDGEQKKQV 2614 +VAAGEISKSE+GMP GSPE++ TTIEQS GD AVVKSS E LVQDKCY N E KQ Sbjct: 595 NVAAGEISKSELGMPVGSPEKDTTTIEQSYAGDAAVVKSSEEYLVQDKCYTN-VEHSKQD 653 Query: 2613 SISSDLGTDDVNDSGFLASGGK-SAEHNMDTNSCSMDLQVTETCLESKGKLIEKPAGTSS 2437 S DLG DD LAS GK + EH N+ SMDL VTET LESKGKLIEK +GTSS Sbjct: 654 SRPGDLGADD----DILASEGKATGEH----NTSSMDLWVTETSLESKGKLIEKSSGTSS 705 Query: 2436 ASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGESEAENPSVE------ 2275 A PES+ QE RD+DSS GN+DVKV VA ESE E ++E Sbjct: 706 AGIPESTIQEVRDSDSSKLVKEKKVVVRVDAVGNVDVKVNVVASESETE--AIENFSCTC 763 Query: 2274 --VGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNER 2101 V VKCDN A+EGL+ +ET KSPAI M D V T EN Q SG VDKV E +NER Sbjct: 764 EVVDVKCDNRASEGLSGDKETAGKSPAIRMSSDYVIATDENAPQSSGDIVDKVLEHVNER 823 Query: 2100 EPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYD 1921 E EKNDDM QDH +S KQ+NESE+DA++VP+NRGLCSG TGL AEYVEENSGTKEV D Sbjct: 824 ESEKNDDMVAQDHAKESIKQKNESENDAIMVPKNRGLCSGATGLDAEYVEENSGTKEVCD 883 Query: 1920 Q--DARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXX 1747 Q A Q+ HT PSFPS+E+DQC+G K SKL ++ E+AEEC Sbjct: 884 QVAGAGQIVHTDLPSFPSREMDQCSGHKDSKLTAMESEEAEECTSTTGDTSSASVAGVSE 943 Query: 1746 XXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPCVITVAAAAKG 1567 KVEFDLNEG + DDGKC E+ + A ITVAAAAK Sbjct: 944 VDTKVEFDLNEGFNADDGKCSEMPGST--PAARLVSPVPFSASSMSFGILSITVAAAAKS 1001 Query: 1566 PFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLD 1387 PFV PEDLLK K ELGWKGSAATSAFRPAEPRK+ME+PL ++TTPIPN A+ QSRAPLD Sbjct: 1002 PFVAPEDLLKSKKELGWKGSAATSAFRPAEPRKVMEIPLDMSTTPIPNDEARKQSRAPLD 1061 Query: 1386 IDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKSTPSPVCCSGGLGLDLNLVDEAS 1207 DLNV DE L+D+SSQNC+ QT C +H+ +GHDP+KS S V CSGGLGLDLNLVD AS Sbjct: 1062 FDLNVSDEVILDDVSSQNCARQTDCGTHSDNGHDPNKSMASHVSCSGGLGLDLNLVDGAS 1121 Query: 1206 DVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLA 1027 DVGNCT S+SHK DVPL+QVKS+A PPN EMSFRRDFDLN+GP VDEVT+EPL+STQ A Sbjct: 1122 DVGNCTLSSSHKMDVPLMQVKSAASGPPNGEMSFRRDFDLNDGPVVDEVTSEPLMSTQPA 1181 Query: 1026 RSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQ 847 R+S+PSQP +SGL MSNAEV NFSSWFPST NTYSAVTISSIM DRGD+SFSI APNGPQ Sbjct: 1182 RNSVPSQPPISGLRMSNAEVGNFSSWFPSTANTYSAVTISSIMSDRGDRSFSIVAPNGPQ 1241 Query: 846 RMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTY 670 RMLGPATG NPFG DIY+G VL SSPAVPY SAPFQYPVFPFNSSFPLPSASFSGGST Y Sbjct: 1242 RMLGPATGSNPFGPDIYKGAVLSSSPAVPYQSAPFQYPVFPFNSSFPLPSASFSGGSTPY 1301 Query: 669 VDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLN 490 VD+TSGGRLCFPAVNSQL+G V +S YPRPYVVSLPDG+NS+SAEN R+ RQGLDLN Sbjct: 1302 VDTTSGGRLCFPAVNSQLIGSVGNVSVHYPRPYVVSLPDGSNSSSAENCRRRTRQGLDLN 1361 Query: 489 AGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQS 310 AGPG S+LEGRD+SSPLVPRQLSVASS A EEQAR+FHL S VLKRKEPDGG DGYKQ+ Sbjct: 1362 AGPG-SNLEGRDDSSPLVPRQLSVASSQAQLEEQARIFHLSSDVLKRKEPDGGWDGYKQT 1420 Query: 309 SWQYHSL 289 SWQYH+L Sbjct: 1421 SWQYHNL 1427 >XP_014630930.1 PREDICTED: uncharacterized protein LOC100806155 [Glycine max] KRH57034.1 hypothetical protein GLYMA_05G035100 [Glycine max] Length = 1428 Score = 1898 bits (4916), Expect = 0.0 Identities = 1031/1444 (71%), Positives = 1118/1444 (77%), Gaps = 10/1444 (0%) Frame = -1 Query: 4590 VHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQGPESVKRECSAK 4411 + PGGR SQLKSGSDS+QNS SSF S VKGRKRERVDQG ESVKR+ S K Sbjct: 1 MQPGGRSPKPVSSPTSTSQLKSGSDSVQNSVSSFSSHVKGRKRERVDQGQESVKRDRSTK 60 Query: 4410 NDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEKKIDLASRSMLAA 4231 NDDGDSG+ + DSILKTEIAKVTEKGGL+D EGVEKLVQLMVP NEKKIDLASRSMLAA Sbjct: 61 NDDGDSGNFKADSILKTEIAKVTEKGGLIDPEGVEKLVQLMVPDGNEKKIDLASRSMLAA 120 Query: 4230 VIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSVDEFLLGLLRALD 4051 VIAAT+KFDCLSRFVQL+GLPVFDEWLQEAHKGK+G+G GSRDGDKSVD+FL LLRALD Sbjct: 121 VIAATEKFDCLSRFVQLKGLPVFDEWLQEAHKGKLGEGIGSRDGDKSVDDFLFVLLRALD 180 Query: 4050 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDAKSGSVQS 3871 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDA+S SVQ+ Sbjct: 181 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDARSCSVQT 240 Query: 3870 GPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQGENNTRSVLTSAS 3691 PWPARQRL EVAQGGN+HS GS+DVA+KSSVTQLSASK AS KI GEN TRS TSAS Sbjct: 241 VPWPARQRLSEVAQGGNKHSSGSADVAMKSSVTQLSASKTASPKIAPGENTTRS--TSAS 298 Query: 3690 PGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXXXXXXXXXXXSDH 3511 PGS KSV SPA ATAN KDGQP A+SGSSDLPVA+ARDEK SDH Sbjct: 299 PGSTKSVPSPAPATANLKDGQPHAAAVSGSSDLPVANARDEK-SSSSSPSHNSQSYSSDH 357 Query: 3510 AKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRETGSSRNSSLHKN 3331 AKAGG SGKEDA SMNGFPGSTPSR QRETGSSRNSS HKN Sbjct: 358 AKAGGFSGKEDARSSTAMSVNKVSGGSSRHRRSMNGFPGSTPSRRQRETGSSRNSS-HKN 416 Query: 3330 LTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSFDDPSIMNSRASS 3151 L SEK SQSGL EKA DGT LEGH PKLIVKIPNRGRSPAQSA+AGS DDPSIMNSRASS Sbjct: 417 LISEKISQSGLREKANDGTLLEGHTPKLIVKIPNRGRSPAQSATAGSSDDPSIMNSRASS 476 Query: 3150 PVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDEGDGSPAATTDEE 2971 P LSEKHDQFDRCSKEK DFYRANIGADINTESWQSNDFKD+LTGSDEGDGSPAA TDE+ Sbjct: 477 PALSEKHDQFDRCSKEKSDFYRANIGADINTESWQSNDFKDVLTGSDEGDGSPAAITDEQ 536 Query: 2970 HCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKYSEADDVGMNLLA 2791 CRTG++CKK+ DV G+ESK RNLQDASYSSINALIEGVKY+EADDVGMNLLA Sbjct: 537 -CRTGEDCKKVSDVSKTTSSSPGSESKARNLQDASYSSINALIEGVKYTEADDVGMNLLA 595 Query: 2790 SVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD-AVVKSSGEDLVQDKCYANDGEQKKQV 2614 VAAGEISKSE GMPAGSP++N TTIEQS G+ AVVKSS E LVQDKCY+N E KQ Sbjct: 596 RVAAGEISKSESGMPAGSPKKNTTTIEQSYAGNAAVVKSSEEYLVQDKCYSN-AEHMKQD 654 Query: 2613 SISSDLGTDDVNDSGFLASGGKSAEHNMDTNSCSMDLQVTETCLESKGKLIEKPAGTSSA 2434 S S D GTDD + F G + EH N SMDLQVTET LESKGKLI K +GT SA Sbjct: 655 SRSGDFGTDD-DIRAF--EGKATGEH----NPSSMDLQVTETSLESKGKLIVKSSGT-SA 706 Query: 2433 SFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGESEAE------NPSVEV 2272 PES+ QE RD DSS N+D +V VA E E E + EV Sbjct: 707 GIPESTFQEVRDIDSSKLVKEKKVVVRVDAVNNVD-EVNVVAREGETEAIEKLSHTCEEV 765 Query: 2271 GVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPE 2092 VKCDN A+EGL+ +ET KSPA + D+VK T EN Q SGY VDKVPE LNERE E Sbjct: 766 DVKCDNHASEGLSCDKETAGKSPATCVPSDSVKATDENALQSSGYIVDKVPEYLNERESE 825 Query: 2091 KNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYDQD- 1915 KNDDMA QDH QS KQ+NESE+DA++VPENRGLCSG TGL AEYVEENSGTKEV DQD Sbjct: 826 KNDDMAAQDHAKQSLKQKNESENDAIMVPENRGLCSGATGLDAEYVEENSGTKEVCDQDA 885 Query: 1914 -ARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXA 1738 A Q+ HT PSFPS+E+DQ +G + SKLA ++ E+AEEC Sbjct: 886 GAGQILHTDLPSFPSREMDQHSGQRDSKLAAMESEEAEECTSTTGDASSASVAGVSEVDT 945 Query: 1737 KVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPCVITVAAAAKGPFV 1558 KVEFDLNE L+ DDGKC E+ + + AAAAKGPFV Sbjct: 946 KVEFDLNERLNADDGKCSEIPGSTPAARLVSPVPFSASSMSFGILSITVAAAAAAKGPFV 1005 Query: 1557 PPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDL 1378 P EDLLK K ELGWKGSAATSAFRPAEPRK+ME+PL ++TTPIPN A+ QSR PLD DL Sbjct: 1006 PHEDLLKSKKELGWKGSAATSAFRPAEPRKVMEIPLDMSTTPIPNDEARKQSRVPLDFDL 1065 Query: 1377 NVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKSTPSPVCCSGGLGLDLNLVDEASDVG 1198 NV DE L+D+SSQNC+ QT CV+ + DGHDP+KS S V CSGGLGLDLNLVD ASDVG Sbjct: 1066 NVSDEIILDDLSSQNCARQTDCVTRSDDGHDPNKSMASHVRCSGGLGLDLNLVDGASDVG 1125 Query: 1197 NCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSS 1018 NCT S+SHK DVPL Q KS+A PPN +MS RDFDLN+GP VDEVTTE L+ST+ AR+S Sbjct: 1126 NCTLSSSHKMDVPLTQFKSAASGPPNGKMSVLRDFDLNDGPIVDEVTTEHLMSTRSARNS 1185 Query: 1017 LPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRML 838 +PSQP +SGL MSNAEV N SSWFPSTGNTYSAVTISSIM DRGDK FSI APN +R+L Sbjct: 1186 VPSQPPISGLRMSNAEVGNVSSWFPSTGNTYSAVTISSIMSDRGDKPFSIVAPNVSERVL 1245 Query: 837 GPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDS 661 GPATG NPFG DIYRG VL SSPAVPY SAPFQYPVFPFNSSFPLPSASFSGGST YVD+ Sbjct: 1246 GPATGSNPFGPDIYRGAVLSSSPAVPYQSAPFQYPVFPFNSSFPLPSASFSGGSTPYVDT 1305 Query: 660 TSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGP 481 TSGGRLCFP VNSQL+G V +S+ YPRPYVVS PDG+NS+ AENSRK ARQGLDLNAGP Sbjct: 1306 TSGGRLCFPVVNSQLIGSVGNVSAHYPRPYVVSFPDGSNSSGAENSRKRARQGLDLNAGP 1365 Query: 480 GSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 G SDLEGRDESSPLVPRQLSVASS A EEQARMFHL S VLKRKEPDGG DGYKQSSWQ Sbjct: 1366 G-SDLEGRDESSPLVPRQLSVASSQAQLEEQARMFHLSSDVLKRKEPDGGWDGYKQSSWQ 1424 Query: 300 YHSL 289 YH+L Sbjct: 1425 YHNL 1428 >XP_014501022.1 PREDICTED: uncharacterized protein LOC106761917 [Vigna radiata var. radiata] Length = 1586 Score = 1895 bits (4910), Expect = 0.0 Identities = 1032/1635 (63%), Positives = 1168/1635 (71%), Gaps = 13/1635 (0%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GGE K RHMWK+PT GN + SF+KD RKISVG+CAL Sbjct: 1 MHGRGGE---KGTRHMWKAPTLGNSSLNADVSSSSSSSS---VKSFFKDRRKISVGECAL 54 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP D PP+IGIIR +T+GK+ KLK GV+WLYR IEV+L K VPL+ PNEIF++FH D Sbjct: 55 FKPSEDRPPYIGIIRYLTIGKDKKLKFGVSWLYRSIEVKLSKSVPLEAAPNEIFYTFHKD 114 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A +LLHPCKVAFLPKG EL GISSF+CRRVYDIANKCLWWL+DQDY NDCQEE+DK Sbjct: 115 EIDAEALLHPCKVAFLPKGAELQPGISSFLCRRVYDIANKCLWWLNDQDYINDCQEEVDK 174 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLYR CVEM ATV PGGR SQLKS SDS+QNS SSFPS +KGRKRER DQ Sbjct: 175 LLYRNCVEMHATVQPGGRSPKPMSSPTSTSQLKSASDSVQNSTSSFPSHIKGRKRERADQ 234 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G ES KRE S K +DGDSG + D+I KTEIAK+TEKGGLVD EGVEKLVQLMVP RNEK Sbjct: 235 GSESAKRERSIKTEDGDSGIFKHDNIFKTEIAKITEKGGLVDGEGVEKLVQLMVPDRNEK 294 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDLASRS+LAAV+AATDK DCLS+FVQ RGLPVFDEWLQE HKGKIG DGDKS Sbjct: 295 KIDLASRSLLAAVVAATDKLDCLSQFVQQRGLPVFDEWLQEVHKGKIG------DGDKSA 348 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ CNIGKSVNHLR+HKN EIQ+KARGLVDTWKKRVEAEM Sbjct: 349 EEFLLVLLRALDKLPVNLQALQTCNIGKSVNHLRSHKNAEIQRKARGLVDTWKKRVEAEM 408 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 INDAKSGS + WPA+ R +V QGGNRHSG SSDVA+KSSVTQLSASK S KIVQG Sbjct: 409 NINDAKSGSGTNVHWPAKSRPSDVGQGGNRHSGASSDVAMKSSVTQLSASKTTSVKIVQG 468 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 EN TRS TSA PG AKSV SPA TAN KDGQPR+VA++G SD P+A+ARDEK Sbjct: 469 ENITRSASTSAFPGPAKSVPSPAAVTANLKDGQPRIVAVNGGSDPPMANARDEKSCSSSQ 528 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRE 3367 SDHAK GG S KEDA S+NGF GS+PS GQRE Sbjct: 529 SHNNSQSCSSDHAKTGGHSVKEDARSSTAMSANKISGGSSRHRKSINGFSGSSPSGGQRE 588 Query: 3366 TGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSF 3187 TGSSRNSSLHKNLTSEK SQ GLM+KA+DGTSLEG+IPKLIVKIPN+GRSP QS SAGSF Sbjct: 589 TGSSRNSSLHKNLTSEKISQPGLMDKAVDGTSLEGNIPKLIVKIPNQGRSPTQSVSAGSF 648 Query: 3186 DDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDE 3007 DDP+IMNSRASSPVL EKHDQ D KEK D YR NIG+DINTESWQSNDFK++LTGSDE Sbjct: 649 DDPTIMNSRASSPVLPEKHDQSDHSPKEKSDLYRVNIGSDINTESWQSNDFKEVLTGSDE 708 Query: 3006 GDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKY 2827 GDGSPAA TDEEHCRTG++CKK L++ SGNE+K NLQDASYSSINALIEGVKY Sbjct: 709 GDGSPAAGTDEEHCRTGNDCKKALEISKAASSSSGNENKAGNLQDASYSSINALIEGVKY 768 Query: 2826 SEADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVKSSGEDLVQDKC 2647 SEADD GMNLLASVAAGEI KSE+ P GSPERN +EQS D VKSS E+LV+D+C Sbjct: 769 SEADDGGMNLLASVAAGEILKSELLTPTGSPERNTAAVEQSCT-DNGVKSSEENLVRDEC 827 Query: 2646 YANDG---EQKKQVSISSDLGTDDVNDSGFLASGGKSA-EHNMDTNSCSMDL-QVTETCL 2482 ++N+G E K S++ DLG + +DS F ASGGK+A E N ++CSMDL QV+ET L Sbjct: 828 HSNNGLDVEHKNLASVTGDLGANGESDSDFQASGGKTARELNKRVDACSMDLQQVSETTL 887 Query: 2481 ESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGE 2302 ESKGKL EK TS E+S QEA D D S + V +V E Sbjct: 888 ESKGKLNEKSGPTSLGGLAENSVQEAGDGDGSKQLQEVVQGVNAGEIHDKISCVAEVEAE 947 Query: 2301 S--EAENPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVD 2128 + + + EV V+ DN EG + +T +K P + +Q D +G + SGYSVD Sbjct: 948 AAKKLSHTPAEVDVQSDNYTAEGSSGSGQTVKKPPPVLVQSDLARGKDDKALHSSGYSVD 1007 Query: 2127 KVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEE 1948 KV +D ERE +K DD+ ++HVNQS+ +RNESESD + +PENRG CS VTGL AE+VEE Sbjct: 1008 KVLKDFPERESDKTDDVDAENHVNQSRSKRNESESDTLTMPENRGTCSIVTGLVAEHVEE 1067 Query: 1947 NSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXX 1768 N T++V DQ +R+ SPS SQEID+ K KL + E+AEEC Sbjct: 1068 NLETRDVRDQPSREDLPEDSPSVLSQEIDKHLNSKRLKLTSSEAEEAEECTSTTADASSM 1127 Query: 1767 XXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPC--- 1597 AKV FDLNEGL+ DDG+C E + A GCA C Sbjct: 1128 SAAAVSDADAKVGFDLNEGLNADDGRC-EFNSIANSGCAPCGRLISPVPFPASSMSCGIL 1186 Query: 1596 -VITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNA 1420 +TVA+AAKG FVPPEDLL+ KGE+GWKGSAATSAFRPAEPRK+MEM LG + PIP+A Sbjct: 1187 APVTVASAAKGHFVPPEDLLRSKGEIGWKGSAATSAFRPAEPRKVMEMSLGTSAAPIPDA 1246 Query: 1419 AAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKS-TPSPVCCSGG 1243 A QSRAPLDIDLNV DER L+DIS C+ T +S A D HDP S PSPV SGG Sbjct: 1247 PAGKQSRAPLDIDLNVADERILDDIS---CARHTNSISLATDSHDPVCSKIPSPVRSSGG 1303 Query: 1242 LGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDE 1063 LGLDLN VD+ASD+G C SN HK DVP++Q KSS PP RE++ RDFDLNNGP+VDE Sbjct: 1304 LGLDLNQVDDASDMGICLNSN-HKIDVPIMQGKSSLGGPPIREVNVHRDFDLNNGPSVDE 1362 Query: 1062 VTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGD 883 +TTE + + +RS++PSQ VSGL M+ E NF SW PS+GNTYSAVTISSIMPDRGD Sbjct: 1363 ITTESSLFSHHSRSNVPSQLPVSGLRMT-TEPGNF-SWLPSSGNTYSAVTISSIMPDRGD 1420 Query: 882 KSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPL 706 + FSI APNGPQR+L PA GGNPFG DIYR VL SSPAV YPSAPF+YPVFPFNSSFPL Sbjct: 1421 QPFSIVAPNGPQRLLTPAAGGNPFGPDIYRAPVLSSSPAVSYPSAPFEYPVFPFNSSFPL 1480 Query: 705 PSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAEN 526 PSASFS GSTTYV TS RLCFPAVNSQLMGP T+SS YPRPYVV L +GNNS SAE Sbjct: 1481 PSASFSAGSTTYVYPTSANRLCFPAVNSQLMGPAGTVSSHYPRPYVVGLTEGNNSGSAET 1540 Query: 525 SRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRK 346 SRKW RQGLDLNAGPG SD+EGRDES V KRK Sbjct: 1541 SRKWTRQGLDLNAGPGCSDMEGRDES-----------------------------VRKRK 1571 Query: 345 EPDGGQDGYKQSSWQ 301 EPDGG D Y QSSWQ Sbjct: 1572 EPDGGWDAYNQSSWQ 1586 >KRH03341.1 hypothetical protein GLYMA_17G092200 [Glycine max] Length = 1333 Score = 1729 bits (4477), Expect = 0.0 Identities = 945/1327 (71%), Positives = 1027/1327 (77%), Gaps = 13/1327 (0%) Frame = -1 Query: 4590 VHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQGPESVKRECSAK 4411 + PGGR SQLKSGSDS+QNS SSF S VKGRKRERVD+G ESVKR+ S K Sbjct: 1 MQPGGRSPKPVSSPTSTSQLKSGSDSVQNSVSSFSSHVKGRKRERVDKGQESVKRDRSTK 60 Query: 4410 NDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEKKIDLASRSMLAA 4231 DDGDSGH +QDSILKTEIAKVTEKGGL+DTEGVEKLVQLMVP NEKKIDLASRSMLAA Sbjct: 61 IDDGDSGHFKQDSILKTEIAKVTEKGGLIDTEGVEKLVQLMVPDGNEKKIDLASRSMLAA 120 Query: 4230 VIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSVDEFLLGLLRALD 4051 VIAATDKFDCLSRFVQL+GLP+FDEWLQEAHKGKIGDG GSRDGDKSVD+FL LLRALD Sbjct: 121 VIAATDKFDCLSRFVQLKGLPIFDEWLQEAHKGKIGDGLGSRDGDKSVDDFLFVLLRALD 180 Query: 4050 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDAKSGSVQS 3871 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDA+SGSVQ+ Sbjct: 181 KLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEMIINDARSGSVQT 240 Query: 3870 GPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQGENNTRSVLTSAS 3691 PWPARQRL EVAQGGN+HS GS+DVA+KSSVTQLSASK ASAKIV GE NTRS TSAS Sbjct: 241 VPWPARQRLSEVAQGGNKHSSGSADVAMKSSVTQLSASKTASAKIVPGE-NTRS--TSAS 297 Query: 3690 PGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXXXXXXXXXXXSDH 3511 PGS KSV SPA ATAN KDGQP V SGS DLPVA+ARDEK SDH Sbjct: 298 PGSTKSVPSPAPATANLKDGQPCVATASGSFDLPVANARDEK-SSSSSPSHNSQSYSSDH 356 Query: 3510 AKAGGLSGKEDAXXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQRETGSSRNSSLHKN 3331 KAGGLSGKEDA SM GFPGSTPSR QRETGS +NSS HKN Sbjct: 357 VKAGGLSGKEDARSSTAMSVNKVSGGSSRHRKSMKGFPGSTPSRRQRETGSRQNSS-HKN 415 Query: 3330 LTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAGSFDDPSIMNSRASS 3151 L SEK SQSGL EKA DGT+LEGH PKLIVKI NR RSPAQSA+AGS DDP+IMNSRASS Sbjct: 416 LISEKISQSGLREKANDGTTLEGHTPKLIVKIQNRSRSPAQSATAGSSDDPAIMNSRASS 475 Query: 3150 PVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGSDEGDGSPAATTDEE 2971 PVLSEKHDQFDRCSKEK DFYRANIGADI TESWQSNDFKD+LTGSDEGDGSP A TDE+ Sbjct: 476 PVLSEKHDQFDRCSKEKSDFYRANIGADIYTESWQSNDFKDVLTGSDEGDGSPEAITDEQ 535 Query: 2970 HCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEGVKYSEADDVGMNLLA 2791 CRTG++CKK+LDV SGNES+ RNLQDASYSSINALIEGVKY+EADDVGMNLLA Sbjct: 536 -CRTGEDCKKVLDVSKAASSSSGNESRARNLQDASYSSINALIEGVKYTEADDVGMNLLA 594 Query: 2790 SVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD-AVVKSSGEDLVQDKCYANDGEQKKQV 2614 +VAAGEISKSE+GMP GSPE++ TTIEQS GD AVVKSS E LVQDKCY N E KQ Sbjct: 595 NVAAGEISKSELGMPVGSPEKDTTTIEQSYAGDAAVVKSSEEYLVQDKCYTN-VEHSKQD 653 Query: 2613 SISSDLGTDDVNDSGFLASGGK-SAEHNMDTNSCSMDLQVTETCLESKGKLIEKPAGTSS 2437 S DLG DD LAS GK + EH N+ SMDL VTET LESKGKLIEK +GTSS Sbjct: 654 SRPGDLGADD----DILASEGKATGEH----NTSSMDLWVTETSLESKGKLIEKSSGTSS 705 Query: 2436 ASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGNLDVKVGDVAGESEAENPSVE------ 2275 A PES+ QE RD+DSS GN+DVKV VA ESE E ++E Sbjct: 706 AGIPESTIQEVRDSDSSKLVKEKKVVVRVDAVGNVDVKVNVVASESETE--AIENFSCTC 763 Query: 2274 --VGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNER 2101 V VKCDN A+EGL+ +ET KSPAI M D V T EN Q SG VDKV E +NER Sbjct: 764 EVVDVKCDNRASEGLSGDKETAGKSPAIRMSSDYVIATDENAPQSSGDIVDKVLEHVNER 823 Query: 2100 EPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYD 1921 E EKNDDM QDH +S KQ+NESE+DA++VP+NRGLCSG TGL AEYVEENSGTKEV D Sbjct: 824 ESEKNDDMVAQDHAKESIKQKNESENDAIMVPKNRGLCSGATGLDAEYVEENSGTKEVCD 883 Query: 1920 Q--DARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXX 1747 Q A Q+ HT PSFPS+E+DQC+G K SKL ++ E+AEEC Sbjct: 884 QVAGAGQIVHTDLPSFPSREMDQCSGHKDSKLTAMESEEAEECTSTTGDTSSASVAGVSE 943 Query: 1746 XXAKVEFDLNEGLSVDDGKCGELKNTALPGCAXXXXXXXXXXXXXXXXPCVITVAAAAKG 1567 KVEFDLNEG + DDGKC E+ + A ITVAAAAK Sbjct: 944 VDTKVEFDLNEGFNADDGKCSEMPGST--PAARLVSPVPFSASSMSFGILSITVAAAAKS 1001 Query: 1566 PFVPPEDLLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLD 1387 PFV PEDLLK K ELGWKGSAATSAFRPAEPRK+ME+PL ++TTPIPN A+ QSRAPLD Sbjct: 1002 PFVAPEDLLKSKKELGWKGSAATSAFRPAEPRKVMEIPLDMSTTPIPNDEARKQSRAPLD 1061 Query: 1386 IDLNVPDERTLNDISSQNCSDQTVCVSHAVDGHDPHKSTPSPVCCSGGLGLDLNLVDEAS 1207 DLNV DE L+D+SSQNC+ QT C +H+ +GHDP+KS S V CSGGLGLDLNLVD AS Sbjct: 1062 FDLNVSDEVILDDVSSQNCARQTDCGTHSDNGHDPNKSMASHVSCSGGLGLDLNLVDGAS 1121 Query: 1206 DVGNCTKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLA 1027 DVGNCT S+SHK DVPL+QVKS+A PPN EMSFRRDFDLN+GP VDEVT+EPL+STQ A Sbjct: 1122 DVGNCTLSSSHKMDVPLMQVKSAASGPPNGEMSFRRDFDLNDGPVVDEVTSEPLMSTQPA 1181 Query: 1026 RSSLPSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQ 847 R+S+PSQP +SGL MSNAEV NFSSWFPST NTYSAVTISSIM DRGD+SFSI APNGPQ Sbjct: 1182 RNSVPSQPPISGLRMSNAEVGNFSSWFPSTANTYSAVTISSIMSDRGDRSFSIVAPNGPQ 1241 Query: 846 RMLGPATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTY 670 RMLGPATG NPFG DIY+G VL SSPAVPY SAPFQYPVFPFNSSFPLPSASFSGGST Y Sbjct: 1242 RMLGPATGSNPFGPDIYKGAVLSSSPAVPYQSAPFQYPVFPFNSSFPLPSASFSGGSTPY 1301 Query: 669 VDSTSGG 649 VD+TSGG Sbjct: 1302 VDTTSGG 1308 >XP_015882642.1 PREDICTED: uncharacterized protein LOC107418459 [Ziziphus jujuba] Length = 1647 Score = 1687 bits (4370), Expect = 0.0 Identities = 952/1670 (57%), Positives = 1123/1670 (67%), Gaps = 48/1670 (2%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG G E RKR RHMW PTR S NSF+KDGRKI++GDCAL Sbjct: 1 MHGREGGEERKRTRHMWTVPTR------VATPDGSSSSSSSSANSFFKDGRKINIGDCAL 54 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKPP DSPPFIGIIR +T GK+NKLK+GVNWLYRP EV+LGKG L PNEIF+SFH D Sbjct: 55 FKPPQDSPPFIGIIRWLTTGKDNKLKLGVNWLYRPAEVKLGKGTLLDAAPNEIFYSFHKD 114 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ AASLLHPCKVAFL KGVELP+GISSFVCRRVYDI NKCLWWL+DQDY N+ QEE+D+ Sbjct: 115 EIPAASLLHPCKVAFLSKGVELPTGISSFVCRRVYDITNKCLWWLTDQDYINERQEEVDQ 174 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLY+T VEM ATV GGR SQLKSGSDSIQNS SSF SQVKG+KRER DQ Sbjct: 175 LLYKTKVEMHATVQSGGRSPKPLNGPTSTSQLKSGSDSIQNSVSSFSSQVKGKKRERGDQ 234 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E +KRE +K DDGDSG RQ+S K+EIAK+TEKGGLV++EGVEKLVQLM P RNEK Sbjct: 235 GSEPIKRERISKTDDGDSGRCRQESAFKSEIAKITEKGGLVESEGVEKLVQLMQPDRNEK 294 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDL RSMLA VIAATDKFDCLS+FVQLRGL VFDEWLQE HKGKIGD S++ DKSV Sbjct: 295 KIDLGGRSMLAGVIAATDKFDCLSQFVQLRGLNVFDEWLQEVHKGKIGDAGASKENDKSV 354 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNLQALQ+CNIGKSVNHLR+HKNLEIQKKAR LVDTWKKRVEAEM Sbjct: 355 EEFLLVLLRALDKLPVNLQALQSCNIGKSVNHLRSHKNLEIQKKARSLVDTWKKRVEAEM 414 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 IND KSGS Q+ PWP R RLPEV GNRHSGGSSDVA+KSSVTQ+SASK+AS K+VQG Sbjct: 415 NINDVKSGSNQAVPWPGRSRLPEV---GNRHSGGSSDVAMKSSVTQVSASKSASVKLVQG 471 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 E+ TRS SASPGS KSV SP +A N K+GQPR A G+SD+P+A+ARDEK Sbjct: 472 ESATRS--ASASPGSIKSVPSPVSAGTNLKEGQPRNTAAVGTSDVPLATARDEKSSSSSQ 529 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQ 3373 +DH + GG SGK+DA ++NGF GSTPS GQ Sbjct: 530 SHNNSQSCSNDHTRTGGTSGKDDARSSSVGSMNVNKLSGGSSRPRKTINGFQGSTPSGGQ 589 Query: 3372 RETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASA 3196 RE GSSR SSLHKN EK S SGL EK L+G S E + K IVKIPNRGRSPAQS S Sbjct: 590 REIGSSRGSSLHKNPAPEKLSHSGLSSEKILEGPSAESNSHKFIVKIPNRGRSPAQSVSG 649 Query: 3195 GSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTG 3016 GSF++PSIMNSRASSPVLSEKHDQFDR KEK A + +D+N ESWQSNDFKD+LTG Sbjct: 650 GSFEEPSIMNSRASSPVLSEKHDQFDRNLKEK----SAAVTSDVNAESWQSNDFKDVLTG 705 Query: 3015 SDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEG 2836 SDEGDGSPAA T EEHCRTGD KK+ +V SGNE K+RNLQ+AS+SSI+ALIE Sbjct: 706 SDEGDGSPAAVTGEEHCRTGDETKKVAEVSKTGSSSSGNEHKSRNLQEASFSSIHALIES 765 Query: 2835 -VKYSEA-------DDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGDAVVK 2680 VKYSEA DDVGMNLLASVAAGE+SKS + P SP+RN T +E S G+ Sbjct: 766 CVKYSEANASVSVVDDVGMNLLASVAAGEMSKSNLVSPTDSPQRN-TPVELSCSGNDTKV 824 Query: 2679 SSGEDLVQDKCYANDGEQKKQVSISSDLGT----DDVNDSGFLASGGKSAEHNMDTNSCS 2512 S +D Q++ DG + + + G + + S FL+ G AE NS Sbjct: 825 KSSDDHGQEQTQCVDGADDEPGNNCKESGNFVAKNGDDKSSFLSEGKPVAESGGHLNSSG 884 Query: 2511 MDLQVTE-TCLESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXGN 2335 ++LQ E TC +S GK E S + P ++ + DT+ N Sbjct: 885 IELQQAEGTCEDSNGKSKEMILAPSMVASPSVTSNKTMDTEGFKPAKEKMADGGVDDDSN 944 Query: 2334 LD------------VKVGDVAGESEA-----ENPSVEVGVKCDNS----ATEGLNSGRET 2218 LD V V D+ E E P G + DN+ EG+NS T Sbjct: 945 LDNKHSMSGSLLDEVNVNDLGSNKEVKAETIEGPLPHPGSEIDNNIKYCENEGMNSDVHT 1004 Query: 2217 TQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQR 2038 +K P + + ++VKGT E V S D + + ++E +PEK D M + HVN ++KQR Sbjct: 1005 KEKPPILIVHSESVKGTGEEVLLSSDSCKDLILQKIHELKPEKADAMDPKSHVNLTEKQR 1064 Query: 2037 NESESDAVVVPENR--GLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQEI 1864 E S+A + P+N+ G SGV E+VEEN ++E +Q +PH VSP +E Sbjct: 1065 TELVSNAHMAPDNQVVGFSSGVAHHNGEHVEENLQSRET-EQCGGAIPHKVSPVVNVRET 1123 Query: 1863 DQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCG 1684 + S + ++ ++AEEC AK++FDLNEG + DDGK G Sbjct: 1124 EHVR----STVVDMETDEAEECTSTTADASSVSAAGGSDADAKIKFDLNEGFNADDGKDG 1179 Query: 1683 ELKNTALPG----CAXXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGW 1516 + N ++ G + P ITVAAAAKGPFVPPEDLL+ KGELGW Sbjct: 1180 DPNNLSVLGSSSAVSLISPLSLPVSSVSSGLPASITVAAAAKGPFVPPEDLLRSKGELGW 1239 Query: 1515 KGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQ 1336 KGSAATSAFRPAEPRK++EMPLG T + A Q R LDIDLNVPDER L D++ + Sbjct: 1240 KGSAATSAFRPAEPRKVLEMPLGATNISHTCSTAGKQGRPLLDIDLNVPDERILEDLALR 1299 Query: 1335 NCSDQTVCVSHAVDGHD---PHKSTPSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHKKD 1165 N ++ V+ + HD +PV SGGL LDLN VDEASD+GN + S++ K D Sbjct: 1300 NSIHESGSVADTLSNHDLGRDELKVAAPVRSSGGLDLDLNRVDEASDLGNYSISSTCKID 1359 Query: 1164 VPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSSLPSQPNVSGLS 985 L QVKSS+ + E+ RRDFDL NGP DEV EP + +Q RSS+PSQP VSG Sbjct: 1360 PTLSQVKSSSGGTLSSEVCVRRDFDL-NGPVADEVGAEPAVFSQRGRSSVPSQPPVSGHR 1418 Query: 984 MSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGS 805 M+NAEV N+SSWFP G+TY AVTI SIMPDRG++ F + P GPQR+LGP G NPF Sbjct: 1419 MNNAEVGNYSSWFP-PGSTYPAVTIPSIMPDRGEQPFPVVPPGGPQRILGPPAGSNPFSP 1477 Query: 804 DIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAV 628 D+YRG+VL SSPA+P+P FQYPVF F ++FPLP+A+F+GGSTTY+DSTSGGRLCFPAV Sbjct: 1478 DVYRGSVLSSSPAMPFPPPQFQYPVFNFGATFPLPAATFTGGSTTYMDSTSGGRLCFPAV 1537 Query: 627 NSQLMGPVNTISSQYPR-PYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDE 451 SQL+GP +SS Y R PYVVSLPDGNN+++ E+SRKW RQGLDLNAGPG D+EGRDE Sbjct: 1538 PSQLLGPAGLVSSSYTRPPYVVSLPDGNNNSNGESSRKWGRQGLDLNAGPGGPDIEGRDE 1597 Query: 450 SSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 SSPL RQLSVASS A AEEQ RMF + G LKRKEP+GG DGYKQSSWQ Sbjct: 1598 SSPLASRQLSVASSQALAEEQFRMFQIPGGTLKRKEPEGGWDGYKQSSWQ 1647 >XP_017973244.1 PREDICTED: uncharacterized protein LOC18603853 [Theobroma cacao] Length = 1630 Score = 1637 bits (4238), Expect = 0.0 Identities = 940/1667 (56%), Positives = 1111/1667 (66%), Gaps = 45/1667 (2%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GG E RKR RHMW PTR ++NSF KDGRKISVGDCAL Sbjct: 1 MHGRGGGEERKRARHMWTVPTRATAVLSGDGGASPSSSSSSTVNSFCKDGRKISVGDCAL 60 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKPP +SPPFIGIIR + GKENKL++GVNWLYRP EV+LGKG+ L+ PNEIF+SFH D Sbjct: 61 FKPPQESPPFIGIIRCLIAGKENKLRLGVNWLYRPAEVKLGKGILLEAAPNEIFYSFHKD 120 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ AASLLHPCKVAFLPK VELPSGI SFVCRRVYDI NKCLWWL+DQDY N+ QEE+D+ Sbjct: 121 EIPAASLLHPCKVAFLPKDVELPSGICSFVCRRVYDITNKCLWWLTDQDYINERQEEVDQ 180 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LL +T +EM ATV PGGR SQ+K GSDS+QNSASSFPSQ KG+KRER DQ Sbjct: 181 LLDKTRLEMHATVQPGGRSPKPMNGPTSTSQIKPGSDSVQNSASSFPSQGKGKKRERGDQ 240 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E VKRE ++K DDGDSGH R + LK+EIAK+TEKGGL D+EGVEKLVQLMVP RNEK Sbjct: 241 GSEPVKRERTSKMDDGDSGHGRPEINLKSEIAKITEKGGLEDSEGVEKLVQLMVPERNEK 300 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDL SRSMLA VIAATDKFDCLSRFVQLRGLPVFDEWLQE HKGKIGDG+GS+D D+SV Sbjct: 301 KIDLVSRSMLAGVIAATDKFDCLSRFVQLRGLPVFDEWLQEVHKGKIGDGSGSKD-DRSV 359 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 D+FLL LLRALDKLPVNL ALQ CNIGKSVNHLR+HKNLEIQKKARGLVDTWKKRVEAEM Sbjct: 360 DDFLLTLLRALDKLPVNLTALQMCNIGKSVNHLRSHKNLEIQKKARGLVDTWKKRVEAEM 419 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 DAKSGS Q+ PW AR R+ EV+ G++HS GSS+VAVKSSVTQ SASK S K+ QG Sbjct: 420 ---DAKSGSNQAVPWSARPRISEVSHSGSKHS-GSSEVAVKSSVTQFSASKTGSVKLAQG 475 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 E T+S SASPGS K+ SP +A+ N KDGQ R G+SD P +ARDEK Sbjct: 476 ETPTKS--ASASPGSMKAATSPVSASTNLKDGQARNATAVGTSD-PQTTARDEKSSSSSQ 532 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQ 3373 SDHAK GG+SGKE+A S+NGFPGS S Q Sbjct: 533 SHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTVTKISGSSSRHRKSINGFPGS--SGVQ 590 Query: 3372 RETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASA 3196 RETGSS+NSSLH+N SEK SQSGL EKA+D EG+ K IVKIPNRGRSPAQS S Sbjct: 591 RETGSSKNSSLHRNPASEKISQSGLTCEKAVDAPMAEGNSHKFIVKIPNRGRSPAQSVSG 650 Query: 3195 GSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTG 3016 GS +D S+MNSRASSPVLSEKH+Q DR +KEK + YRAN+ D+NTESWQSNDFKD+LTG Sbjct: 651 GSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSETYRANVTTDVNTESWQSNDFKDVLTG 710 Query: 3015 SDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEG 2836 SDEGDGSPAA DEEHCR G++ +K +V SGNE K+ LQ+AS+SSINALI+ Sbjct: 711 SDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTASSSSGNELKSGKLQEASFSSINALIDS 770 Query: 2835 -VKYSEA-------DDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD--AV 2686 VKYSEA DD GMNLLASVAAGEISKS++ P SP+RN +E S G+ + Sbjct: 771 CVKYSEANACMPVGDDAGMNLLASVAAGEISKSDVASPIDSPQRNNPVVEHSSTGNDTRL 830 Query: 2685 VKSSGEDLVQDK--CYAN-DGEQKKQVSISSD---LGTDDVNDSGFLASGGKSAEHNMDT 2524 S+G+D+V+D+ C D E KQ +++ + D S SGG+ EH + Sbjct: 831 KPSAGDDVVRDRHQCVEGADDEHLKQGTVAGNSWAKNADCKTGSSQEKSGGELNEHLI-- 888 Query: 2523 NSCSMDL-QVTETCLES---------------KGKLIEKPAGTSSASFPESSAQEARDTD 2392 S SM L Q + CLE+ G +EK + D D Sbjct: 889 -SSSMGLPQTADQCLENGKLKEIVTAALVNLPSGSTVEKTTAVGDSKEHLEKKAGGVDDD 947 Query: 2391 SSXXXXXXXXXXXXXXXGNLD--VKVGDVAGESEAENPSVEVGVKCDNSATEGLNSGRET 2218 SS +D VKV A + + PS+EV V+ + TEGL+ +T Sbjct: 948 SSLDTKQKGSTSLVNEDKVVDPGVKVEKEAVDGSSSVPSMEVDVEDKKNVTEGLDRSLQT 1007 Query: 2217 TQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQR 2038 + S A+ ++ KG A+ A G + D V E + E +PEK+ + + HV ++KQ+ Sbjct: 1008 HENSAAV--TGNSTKG-ADKEALPPGSAKDIVLEKVGEVKPEKDVETDARSHVAHTEKQK 1064 Query: 2037 NESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQ 1858 E E+ VT E VEEN EV+ + R P S E +Q Sbjct: 1065 PEWET--------------VTARKGEQVEENLECGEVH--EPRGGPSPCRASSTVMETEQ 1108 Query: 1857 CTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGEL 1678 T +GSKL V + ++AEE AKVEFDLNEG + D+ K GE Sbjct: 1109 PTRSRGSKLTVAEADEAEE---RTSTTSDAPATGGADADAKVEFDLNEGFNADEAKFGEP 1165 Query: 1677 KNTALPGCA----XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGWKG 1510 N PGC+ P ITVAAAAKGPFVPP+DLL+ KG LGWKG Sbjct: 1166 NNLTAPGCSAPVQLISPLPFPISSVSSSLPASITVAAAAKGPFVPPDDLLRTKGVLGWKG 1225 Query: 1509 SAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQNC 1330 SAATSAFRPAEPRK ++MPLG + +P+A QSR PLDIDLNVPDER L D++S++ Sbjct: 1226 SAATSAFRPAEPRKSLDMPLGTSNASMPDATTSKQSRPPLDIDLNVPDERVLEDLASRSS 1285 Query: 1329 SDQTVCVSHAVDGHDPHKST--PSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHKKDVPL 1156 + T + D +P+ SGGL LDLN VDE D+GN + S + DVP+ Sbjct: 1286 AQGTDSAPDLTNNRDLTCGLMGSAPIRSSGGLDLDLNRVDEPIDLGNHSTGTSRRLDVPM 1345 Query: 1155 LQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSS-LPSQPNVSGLSMS 979 +KSS+ N E S RRDFDLNNGPAVDEV+ EP + +Q RSS +PSQP VS L ++ Sbjct: 1346 QPLKSSSGGILNGEASVRRDFDLNNGPAVDEVSAEPSLFSQHNRSSNVPSQPPVSSLRIN 1405 Query: 978 NAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGSDI 799 N E+ANFSSWFP TGNTYSAVTI SI+PDRG++ F I A GP R+LGP T PF D+ Sbjct: 1406 NTEMANFSSWFP-TGNTYSAVTIPSILPDRGEQPFPIVATGGPPRVLGPPTAATPFNPDV 1464 Query: 798 YRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAVNS 622 YRG VL SSPAVP+PSAPFQYPVFPF ++FPLPS SFSGGSTTYVDS+ GRLCFP V S Sbjct: 1465 YRGPVLSSSPAVPFPSAPFQYPVFPFGTTFPLPSTSFSGGSTTYVDSSPSGRLCFPPV-S 1523 Query: 621 QLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDESSP 442 QL+GP + S Y RPYVVSLPDG+N++ AE+ RKW RQGLDLNAGPG D+EGRDE+SP Sbjct: 1524 QLLGPAGAVPSHYARPYVVSLPDGSNNSGAESGRKWGRQGLDLNAGPGGPDIEGRDETSP 1583 Query: 441 LVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 L RQLSVASS A AEEQARM+ + G+LKRKEP+GG DGYKQSSWQ Sbjct: 1584 LASRQLSVASSQALAEEQARMYQVPGGILKRKEPEGGWDGYKQSSWQ 1630 >EOY20634.1 BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] EOY20635.1 BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] EOY20636.1 BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] EOY20639.1 BAH domain,TFIIS helical bundle-like domain isoform 1 [Theobroma cacao] Length = 1630 Score = 1635 bits (4235), Expect = 0.0 Identities = 939/1667 (56%), Positives = 1112/1667 (66%), Gaps = 45/1667 (2%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG GG E RKR RHMW PTR ++NSF KDGRKISVGDCAL Sbjct: 1 MHGRGGGEERKRARHMWTVPTRATAVLSGDGGASPSSSSSSTVNSFCKDGRKISVGDCAL 60 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKPP DSPPFIGIIR + GKENKL++GVNWLYRP EV+LGKG+ L+ PNEIF+SFH D Sbjct: 61 FKPPQDSPPFIGIIRCLIAGKENKLRLGVNWLYRPAEVKLGKGILLEAAPNEIFYSFHKD 120 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ AASLLHPCKVAFLPK VELPSGI SFVCRRVYDI NKCLWWL+DQDY N+ QEE+D+ Sbjct: 121 EIPAASLLHPCKVAFLPKDVELPSGICSFVCRRVYDITNKCLWWLTDQDYINERQEEVDQ 180 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LL +T +EM ATV PGGR SQ+K GSDS+QNSASSFPSQ KG+KRER DQ Sbjct: 181 LLDKTRLEMHATVQPGGRSPKPMNGPTSTSQIKPGSDSVQNSASSFPSQGKGKKRERGDQ 240 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E VKRE ++K DDGDSGH R + LK+EIAK+TEKGGL D+EGVEKLVQLMVP RNEK Sbjct: 241 GSEPVKRERTSKMDDGDSGHGRPEINLKSEIAKITEKGGLEDSEGVEKLVQLMVPERNEK 300 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDL SRSMLA VIAATDKFDCLSRFVQLRGLPVFDEWLQE HKGKIGDG+GS+D D+SV Sbjct: 301 KIDLVSRSMLAGVIAATDKFDCLSRFVQLRGLPVFDEWLQEVHKGKIGDGSGSKD-DRSV 359 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 D+FLL LLRALDKLPVNL ALQ CNIGKSVNHLR+HKNLEIQKKARGLVDTWKKRVEAEM Sbjct: 360 DDFLLTLLRALDKLPVNLTALQMCNIGKSVNHLRSHKNLEIQKKARGLVDTWKKRVEAEM 419 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 DAKSGS Q+ PW AR R+ EV+ G++HS GSS+VAVKSSVTQ SASK S K+ QG Sbjct: 420 ---DAKSGSNQAVPWSARPRISEVSHSGSKHS-GSSEVAVKSSVTQFSASKTGSVKLAQG 475 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 E T+S SASPGS K+ SP +A+ N KDGQ R G+SD P +ARDEK Sbjct: 476 ETPTKS--ASASPGSMKAATSPVSASTNLKDGQARNATAVGTSD-PQTTARDEKSSSSSQ 532 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQ 3373 SDHAK GG+SGKE+A S+NGFPGS S Q Sbjct: 533 SHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTVTKISGSSSRHRKSINGFPGS--SGVQ 590 Query: 3372 RETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASA 3196 RETGSS+NSSLH+N SEK SQSGL EKA+D EG+ K IVKIPNRGRSPAQS S Sbjct: 591 RETGSSKNSSLHRNPASEKISQSGLTCEKAVDAPMAEGNSHKFIVKIPNRGRSPAQSVSG 650 Query: 3195 GSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTG 3016 GS +D S+MNSRASSPVLSEKH+Q DR +KEK + YRAN+ D+NTESWQSNDFKD+LTG Sbjct: 651 GSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSETYRANVTTDVNTESWQSNDFKDVLTG 710 Query: 3015 SDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEG 2836 SDEGDGSPAA DEEHCR G++ +K +V SGNE K+ LQ+AS+SSINALI+ Sbjct: 711 SDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTASSSSGNELKSGKLQEASFSSINALIDS 770 Query: 2835 -VKYSEA-------DDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPGD--AV 2686 VKYSEA DD GMNLLASVAAGEISKS++ P SP+RN +E S G+ + Sbjct: 771 CVKYSEANACMPVGDDAGMNLLASVAAGEISKSDVASPIDSPQRNTPVVEHSSTGNDTRL 830 Query: 2685 VKSSGEDLVQDKCYANDG---EQKKQVSISSD---LGTDDVNDSGFLASGGKSAEHNMDT 2524 S+G+D+V+D+ + +G E KQ +++ + D S SGG+ EH + Sbjct: 831 KPSAGDDVVRDRHQSVEGADDEHLKQGTVAGNSWAKNADCKTGSSQEKSGGELNEHLI-- 888 Query: 2523 NSCSMDL-QVTETCLES---------------KGKLIEKPAGTSSASFPESSAQEARDTD 2392 S SM L Q + CLE+ G +EK + D D Sbjct: 889 -SSSMGLPQTADQCLENGKLKEIVAAALVNLPSGSTVEKTTDVGDSKEHLEKKAGGVDDD 947 Query: 2391 SSXXXXXXXXXXXXXXXGNLD--VKVGDVAGESEAENPSVEVGVKCDNSATEGLNSGRET 2218 SS +D VKV A + + PS+EV V+ + TEGL+ +T Sbjct: 948 SSLDTKQKGSTSLVNEDKVVDPGVKVEKEAVDGSSSVPSMEVDVEDKKNVTEGLDRSLQT 1007 Query: 2217 TQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQR 2038 + S A+ ++ KG A+ A G + D V E + E + EK+ + + HV ++KQ+ Sbjct: 1008 HENSAAV--TGNSTKG-ADKEASPPGSAKDIVLEKVGEVKLEKDVETDARSHVAHTEKQK 1064 Query: 2037 NESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQ 1858 E E+ VT E VEEN EV+ + R P S E +Q Sbjct: 1065 PEWET--------------VTARKGEQVEENLECSEVH--EPRGGPSPCRASSTVMETEQ 1108 Query: 1857 CTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGEL 1678 T +GSKL V + ++AEE AKVEFDLNEG + D+ K GE Sbjct: 1109 PTRSRGSKLTVAEADEAEE---RTSTTSDAPATGGADADAKVEFDLNEGFNADEAKFGEP 1165 Query: 1677 KNTALPGCA----XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGWKG 1510 N PGC+ P ITVAAAAKGPFVPP+DLL+ KG LGWKG Sbjct: 1166 NNLTAPGCSPPVQLISPLPFPVSSVSSSLPASITVAAAAKGPFVPPDDLLRTKGVLGWKG 1225 Query: 1509 SAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQNC 1330 SAATSAFRPAEPRK ++MPLG + +P+A QSR PLDIDLNVPDER L D++S++ Sbjct: 1226 SAATSAFRPAEPRKSLDMPLGTSNASMPDATTCKQSRPPLDIDLNVPDERVLEDLASRSS 1285 Query: 1329 SDQTVCVSHAVDGHDPHKST--PSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHKKDVPL 1156 + T + D +P+ SGGL LDLN VDE D+GN + +S + DVP+ Sbjct: 1286 AQGTDSAPDLTNNRDLTCGLMGSAPIRSSGGLDLDLNRVDEPIDLGNHSTGSSRRLDVPM 1345 Query: 1155 LQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSS-LPSQPNVSGLSMS 979 +KSS+ N E S RRDFDLNNGPAVDEV+ EP + +Q RSS +PSQP VS L ++ Sbjct: 1346 QPLKSSSGGILNGEASVRRDFDLNNGPAVDEVSAEPSLFSQHNRSSNVPSQPPVSSLRIN 1405 Query: 978 NAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGSDI 799 N E+ANFSSWFP TGNTYSAVTI SI+PDRG++ F I A GP R+LGP T PF D+ Sbjct: 1406 NTEMANFSSWFP-TGNTYSAVTIPSILPDRGEQPFPIVATGGPPRVLGPPTAATPFNPDV 1464 Query: 798 YRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAVNS 622 YRG VL SSPAVP+PSAPFQYPVFPF ++FPLPS SFSGGSTTYVDS+ GRLCFP V S Sbjct: 1465 YRGPVLSSSPAVPFPSAPFQYPVFPFGTTFPLPSTSFSGGSTTYVDSSPSGRLCFPPV-S 1523 Query: 621 QLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDESSP 442 QL+GP + S Y RPYVVSLPDG+N++ AE+ RKW RQGLDLNAGPG D+EGRDE+SP Sbjct: 1524 QLLGPAGAVPSHYARPYVVSLPDGSNNSGAESGRKWGRQGLDLNAGPGGPDIEGRDETSP 1583 Query: 441 LVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 L RQLSVASS A AEEQARM+ + G+LKRKEP+GG DGYKQSSWQ Sbjct: 1584 LASRQLSVASSQALAEEQARMYQVPGGILKRKEPEGGWDGYKQSSWQ 1630 >XP_018820881.1 PREDICTED: uncharacterized protein LOC108991179 isoform X1 [Juglans regia] Length = 1636 Score = 1630 bits (4220), Expect = 0.0 Identities = 932/1655 (56%), Positives = 1105/1655 (66%), Gaps = 33/1655 (1%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG EE KR RH+W PTR S NSF KDGRKISVG+ AL Sbjct: 1 MHGLAAEEG-KRSRHVWTIPTRS---IVAATADGSSSSTLSSANSFSKDGRKISVGETAL 56 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKP DSPPF+G+I +TV KENK+ +GVNWLYRP EV+LGKGV L+ PNEIF+SFH D Sbjct: 57 FKPAQDSPPFVGLIHWLTVSKENKITLGVNWLYRPAEVKLGKGVLLEAAPNEIFYSFHKD 116 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++SA SLLHPCKVAFL KGVELPSG+SSFVCRRVYDI NK +WWL+DQD+ N+ QEEID+ Sbjct: 117 EISAESLLHPCKVAFLRKGVELPSGVSSFVCRRVYDITNKRIWWLTDQDFMNEQQEEIDQ 176 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLY+T + M AT+ PGGR SQLK+GSDS+QNSASS PSQVKG+KRER DQ Sbjct: 177 LLYKTRIRMHATLQPGGRSPKPMSGPTSTSQLKAGSDSVQNSASSLPSQVKGKKRERADQ 236 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 G E VKRE + K DDGDSG R +S LK+EI K+TEKGGL D+EGVE+LVQLM+P RNEK Sbjct: 237 GSEPVKRERTTKTDDGDSGLSRVESNLKSEILKITEKGGLTDSEGVERLVQLMLPERNEK 296 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDL RSMLA VIAATDKFDCL +FVQL+G+PVFDEWLQE HKGKIG DG+K+V Sbjct: 297 KIDLNGRSMLAGVIAATDKFDCLGQFVQLKGVPVFDEWLQEVHKGKIG------DGEKNV 350 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRALDKLPVNL ALQ CNIGKSVNHLRTHKNLEIQKKAR LVDTWK+RVEAEM Sbjct: 351 EEFLLVLLRALDKLPVNLHALQMCNIGKSVNHLRTHKNLEIQKKARSLVDTWKRRVEAEM 410 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 IND+KSGS Q+ PW +R RLPEV+ GGNRHSG S++ +KSSVTQLSASK AS K++QG Sbjct: 411 NINDSKSGSNQAVPWSSRLRLPEVSHGGNRHSGTSTEAVIKSSVTQLSASKTASVKLIQG 470 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 E+ TRS SASPGS KSV SPA A N +DGQ + A G+SDLP+ + RDEK Sbjct: 471 ESVTRSPSASASPGSMKSVPSPAPANTNVRDGQIQNAA-GGTSDLPLTTVRDEK-SSSSS 528 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQ 3373 SDHAK GGLS KEDA S+NGFPGS S Q Sbjct: 529 QSHNSQSCSSDHAKTGGLSVKEDARSSSAGSMNGNKVSGGSSRHRKSVNGFPGSAASGVQ 588 Query: 3372 RETGSSRNSSLHKNLTSEKTSQSGLMEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASAG 3193 RE SS+ SSLHKN SEK QS EK LDG E + KLIVKI NRGRSPAQSAS G Sbjct: 589 REITSSK-SSLHKNSASEKLIQS---EKTLDGPVSEANSLKLIVKISNRGRSPAQSASGG 644 Query: 3192 SFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTGS 3013 S +DPSIMNSRASSPVLSEKHDQFDR KEK D YRAN+ +D+NTESWQSNDFKD+LTGS Sbjct: 645 SVEDPSIMNSRASSPVLSEKHDQFDRTLKEKSDTYRANVTSDVNTESWQSNDFKDVLTGS 704 Query: 3012 DEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEG- 2836 D+GDGSP A TDEE CRTG++ KKI SGN ++ NLQDAS SINALI+ Sbjct: 705 DDGDGSPVAITDEERCRTGEDSKKIAKA-KAASSSSGNGLESANLQDASLRSINALIDSC 763 Query: 2835 VKYSE-------ADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIE----QSRPGDA 2689 VKYSE DDVGMNLLASVAAGE+SKS++ P SP RN T+E S P Sbjct: 764 VKYSETNASMSIGDDVGMNLLASVAAGEMSKSDLVSPINSPLRNTPTVEHCCTDSDPKVK 823 Query: 2688 VVKSSGEDLVQDKCYANDG---EQKKQVSISSDLGTDDVNDSG--FLASGGKSAEHNMDT 2524 V S +D +++ ++DG + +KQ IS++L DS ++ E+ Sbjct: 824 VKSSLVDDFTREESQSHDGADEDHEKQGVISTNLRPKVGEDSSVDLVSEEKPMGEYPGYL 883 Query: 2523 NSCSMDLQVT-ETCLESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXX 2347 S DLQ T + C+ES GK E TS AS S+ ++ + + S Sbjct: 884 TSSGRDLQQTAKACVESDGKSAEIKVATSMASSTASTVEKTMNIEGSQPPQEKKMDGAVS 943 Query: 2346 XXGNLDVK-VGDVAGESEAENPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQFDTVKG 2170 DVK + E + V +K + A EGL +T QK A M D VKG Sbjct: 944 MGATPDVKEKASCSLLKEDDGKDEIVSLKVEMKAVEGLGDAVQTEQKPSASMMHSDDVKG 1003 Query: 2169 TAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVVPENR-- 1996 + + V SG D + E++++ + E ++ V+ V+Q +RNE +S+A PENR Sbjct: 1004 SNQEVVLPSGGGKDVLSENVSKLKAENIEETDVRGFVSQIDDRRNEQDSNAFSSPENRIS 1063 Query: 1995 -GLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGSKLAVVD 1819 GL ++ E+VEEN +K+ R PHTVS + QE +Q + SKL + Sbjct: 1064 VGLAPILSDRDGEHVEENLESKDDLALRGRAAPHTVSSALAVQETEQPESSRRSKLIGTE 1123 Query: 1818 GEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPGCA---- 1651 EDAEEC AKVEFDLNEG SVDDGK GE N A PGC+ Sbjct: 1124 TEDAEECISTSAHAASIPVSGVSDMDAKVEFDLNEGFSVDDGKFGEHNNLAAPGCSAAVR 1183 Query: 1650 XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAFRPAEPR 1471 P IT+ AAAKGPFVPP+DLLK KGELGWKGSAATSAFRPAEPR Sbjct: 1184 LVSPLPFSLPSVSSGLPASITITAAAKGPFVPPDDLLKSKGELGWKGSAATSAFRPAEPR 1243 Query: 1470 KIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCVSHAVDG 1291 K +EM LG P+P+A A QSR PLDIDLNVPDER L D+ S+NC+ + +S ++ Sbjct: 1244 KPLEMSLGTANIPLPDATAGKQSRLPLDIDLNVPDERFLEDLVSRNCTQEPGTLSGPMNS 1303 Query: 1290 HD---PHKSTPSPVCCSGGLGLDLNLVDEASDVGN-CTKSNSHKKDVPLLQVKSSADSPP 1123 + + + + SGGLGLDLNLVD+ASD+GN T SN+ + DVPLL KS++ S Sbjct: 1304 RELAREQQIGSNLLRGSGGLGLDLNLVDDASDMGNYATSSNTRRVDVPLLPRKSTSSSAL 1363 Query: 1122 NREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANFSSWFP 943 N MS RRDFDLNNGP VDEV EP +Q ARSSLPSQP +SGL M+N E+ NF+ WFP Sbjct: 1364 NGAMSGRRDFDLNNGPVVDEVCAEPSQFSQQARSSLPSQPPLSGLRMNNTEMGNFAPWFP 1423 Query: 942 STGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVLSS-PAV 766 S G+TYSA+ I SIMPDRG++ F I AP GPQRMLGP +PF D+YRG VLSS AV Sbjct: 1424 S-GSTYSAIAIPSIMPDRGEQPFPIVAPGGPQRMLGPTGSSSPFSPDVYRGPVLSSATAV 1482 Query: 765 PYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVNTISSQ 586 P+PS+PF YPVFPF +SFPLPSA+FSGGSTTYVDS SGG++CFP V +Q +GP +SSQ Sbjct: 1483 PFPSSPFPYPVFPFGTSFPLPSATFSGGSTTYVDSPSGGKVCFPTVRTQFLGPAGAVSSQ 1542 Query: 585 YPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLSVASSL 406 +PRP+VVS PDGN + S E+SRKW R GLDLNAGPG D++GRDE S L R LSV+ S Sbjct: 1543 FPRPFVVSFPDGNINGSGESSRKWGRHGLDLNAGPGGLDVDGRDEPS-LPLRHLSVSGSQ 1601 Query: 405 APAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 A A+EQARM + GV KRKEP+GG DGYKQSSWQ Sbjct: 1602 ALADEQARMIQMAGGVSKRKEPEGGWDGYKQSSWQ 1636 >XP_018831488.1 PREDICTED: uncharacterized protein LOC108999147 [Juglans regia] Length = 1644 Score = 1610 bits (4169), Expect = 0.0 Identities = 925/1660 (55%), Positives = 1099/1660 (66%), Gaps = 38/1660 (2%) Frame = -1 Query: 5166 MHGCGGEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCAL 4987 MHG G+ER KR RHMW PTR S NSF KDGRKISVGDCAL Sbjct: 1 MHGQVGDER-KRSRHMWTVPTR---VIVAATADGSSSSTLSSANSFTKDGRKISVGDCAL 56 Query: 4986 FKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHTD 4807 FKPP DSPPFIGIIR + VGKENK+K+GVNWLYRP EV+LGKGV L+ TPNE+F+SFH D Sbjct: 57 FKPPQDSPPFIGIIRWLIVGKENKIKLGVNWLYRPAEVKLGKGVLLEATPNELFYSFHKD 116 Query: 4806 KVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEIDK 4627 ++ A SLLHPCKVAFLPKGVELP GI SFVCRRVYDI NKCLWWL+DQDY N+ QEE+D+ Sbjct: 117 EIPAESLLHPCKVAFLPKGVELPPGILSFVCRRVYDITNKCLWWLTDQDYINERQEEVDQ 176 Query: 4626 LLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVDQ 4447 LLY+T ++M AT+ P GR SQLK+GSDS+QNSASS P+Q KG+KRER DQ Sbjct: 177 LLYKTQMQMHATLQPSGRSPKPMSGPTSTSQLKAGSDSVQNSASSVPAQAKGKKRERGDQ 236 Query: 4446 GPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNEK 4267 E VKRE + K DDGDSGH R ++ LK+EI K+TEKGGL D+EGVE+LVQLM RNEK Sbjct: 237 VSEPVKRERATKTDDGDSGHGRPENNLKSEILKITEKGGLTDSEGVERLVQLMQSERNEK 296 Query: 4266 KIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKSV 4087 KIDL RSMLA VIAATDKFDCLSRFVQLRGL VFDEWLQE HKGK GDG+ +DGDK+V Sbjct: 297 KIDLNCRSMLAGVIAATDKFDCLSRFVQLRGLLVFDEWLQEVHKGKTGDGSSPKDGDKTV 356 Query: 4086 DEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAEM 3907 +EFLL LLRAL+KLPVNL ALQ CNIGKSVNHLRT KNLEIQKKAR LVDTWKKRVEAEM Sbjct: 357 EEFLLVLLRALEKLPVNLHALQMCNIGKSVNHLRTQKNLEIQKKARSLVDTWKKRVEAEM 416 Query: 3906 IINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQG 3727 INDA+SGS Q+ WP+R RL EV+ GNRHSG S+D A+KSSVTQLSASK S K+ G Sbjct: 417 NINDARSGSNQAVTWPSRPRLSEVSHIGNRHSGASTDAAMKSSVTQLSASKTTSVKLAHG 476 Query: 3726 ENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXXX 3547 E+ RS SASPGS KSV SPA N +DGQ R A+ +SDL + +ARDEK Sbjct: 477 ESAIRS--PSASPGSLKSVPSPAPTNTNVRDGQAR-NAVGVNSDLHMTTARDEKSSSSSQ 533 Query: 3546 XXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRGQ 3373 SDHAK GGLS KEDA S+NGFPGS S Q Sbjct: 534 SHNNSQSCSSDHAKTGGLSVKEDARSSTAGSMSGNKISGGSSRYRKSVNGFPGSAVSGVQ 593 Query: 3372 RETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIPNRGRSPAQSASA 3196 +E SS+ +SLHKN SEK +QS L EK LDG + +G KLIVKI NR RSPAQSAS Sbjct: 594 KEITSSK-TSLHKNPASEKLTQSALSSEKTLDGPAADGSSHKLIVKITNRSRSPAQSASG 652 Query: 3195 GSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILTG 3016 GS +DPSIMNSRASSPVLSEKHDQ DR KEK D YRANI +D+NTESWQSNDFKD+LTG Sbjct: 653 GSIEDPSIMNSRASSPVLSEKHDQSDRNLKEKSDAYRANITSDVNTESWQSNDFKDVLTG 712 Query: 3015 SDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIEG 2836 SDEGDGSPAA TDEE CRTG++CKKI + E + NLQ+AS SINALIE Sbjct: 713 SDEGDGSPAAITDEERCRTGEDCKKIAET--KAAFSPSGEHELGNLQEASLRSINALIES 770 Query: 2835 -VKYSE-------ADDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPG-DAVV 2683 VKYSE DDVGMNLLASVAAGE+SKS++ P+ SP+RN E G D +V Sbjct: 771 CVKYSETNGSMSIGDDVGMNLLASVAAGEMSKSDLVSPSDSPQRNTIAAEHCCTGSDPMV 830 Query: 2682 KSS-------GEDLVQDKCYANDG---EQKKQVSISSDL--GTDDVNDSGFLASGGKSAE 2539 KSS +DL +++ +NDG E +KQ SS+L + N + + E Sbjct: 831 KSSEADDLARADDLAREESQSNDGADDEHEKQGVNSSNLEPKAGEGNSAYLVCEEKPVRE 890 Query: 2538 HNMDTNSCSMDLQVT-ETCLESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXX 2362 ++ NS +Q T E C+E K E TS S P SS ++ D + Sbjct: 891 YHGHLNSSGRHMQQTAEPCVEIDEKSTEITVATSMDSSPVSSTEKGVDIEGGQPLQEKKA 950 Query: 2361 XXXXXXXGNLDV--KVGDVAGESEAENPSVEVGVKCDNSATEGLNSGRETTQKSPAIHMQ 2188 LD+ K + E N V VG++ + A EGL+ + QK A M Sbjct: 951 VGGVVLDATLDILEKTSCTLLKEEKLNDEV-VGLEVEMEAVEGLDGDVQAEQKPSASTMH 1009 Query: 2187 FDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQRNESESDAVVV 2008 + VKG + + SG D + ED++E + EK D+ + +Q+ K+RNE +S+A Sbjct: 1010 SEDVKGRNQELMLPSGGGKDLLSEDVSELKAEKIDETDARSFASQTDKERNEWDSNAPTS 1069 Query: 2007 PENR---GLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQEIDQCTGPKGS 1837 PEN GL + T + EN +K+ Q PH VS + QE +Q +G Sbjct: 1070 PENLISVGLAATHTDQNGVQLGENLESKDDLAQRGIPAPHIVSSALTVQETEQPENSRGP 1129 Query: 1836 KLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGKCGELKNTALPG 1657 K G +AE C AKVEFDLNEG +VDDGK E ++ PG Sbjct: 1130 KWT---GTEAEACTTTNGNTTTLSVAGASDMDAKVEFDLNEGFTVDDGKFAEQNDSPAPG 1186 Query: 1656 CA----XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGELGWKGSAATSAF 1489 C+ P IT+ AAAKGPFVPPEDLL+ KGELGWKGSAATSAF Sbjct: 1187 CSTAIRLVSPLPFSVSSVSSGLPASITITAAAKGPFVPPEDLLRSKGELGWKGSAATSAF 1246 Query: 1488 RPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDISSQNCSDQTVCV 1309 RPAEPRK E+ G + +P+A A QSR PLDIDLNVPDER L D++S+NC+ + + Sbjct: 1247 RPAEPRKGPELSQGTASISLPDATACKQSRPPLDIDLNVPDERILEDLASRNCTQEPDSL 1306 Query: 1308 SHAVDGHD-PHKST--PSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHKKDVPLLQVKSS 1138 S H+ HK + V CSGGL DLN VD+ASD+GN T SNS + DV L +KSS Sbjct: 1307 SRHPYSHEMAHKLLIGSNTVRCSGGLDFDLNRVDDASDIGNYTTSNSFRVDVTLFPMKSS 1366 Query: 1137 ADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSSLPSQPNVSGLSMSNAEVANF 958 + S N S RDFDLNNGP DEV EP +Q ARSSLPSQ ++SGL M+N+E+ NF Sbjct: 1367 SGSALNGAASGHRDFDLNNGPVSDEVCAEPSPFSQQARSSLPSQRSLSGLRMNNSEMGNF 1426 Query: 957 SSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPFGSDIYRGTVL- 781 S WFPS G+TYSAV I SIMPDRG++ F I A GPQRMLGP +G +PF D+YRG VL Sbjct: 1427 SPWFPS-GSTYSAVAIPSIMPDRGEQPFPIVAAGGPQRMLGPTSGSSPFSPDVYRGPVLS 1485 Query: 780 SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFPAVNSQLMGPVN 601 SSPAVP+PS+ FQYPVF F +SFPLPSA+F GGST Y+DS+SGG++CFPAV +Q +GP Sbjct: 1486 SSPAVPFPSSHFQYPVFSFGTSFPLPSATFPGGSTAYMDSSSGGKVCFPAVRTQFLGPAG 1545 Query: 600 TISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRDESSPLVPRQLS 421 +SSQ+PRPYVVS PDG++++S E+SRKW RQG DLNAGPG D+EGRD+ + PRQLS Sbjct: 1546 AVSSQFPRPYVVSFPDGSSNSSGESSRKWGRQGFDLNAGPGGLDVEGRDDMN-FPPRQLS 1604 Query: 420 VASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 V+S+ A A+EQARM + GVLKRKEPDG DGYKQSSWQ Sbjct: 1605 VSSTQALADEQARMLQMTGGVLKRKEPDGSWDGYKQSSWQ 1644 >XP_018820884.1 PREDICTED: uncharacterized protein LOC108991180 [Juglans regia] Length = 1652 Score = 1600 bits (4144), Expect = 0.0 Identities = 939/1671 (56%), Positives = 1113/1671 (66%), Gaps = 49/1671 (2%) Frame = -1 Query: 5166 MHGCG-GEERRKRIRHMWKSPTRGNXXXXXXXXXXXXXXXXXSINSFYKDGRKISVGDCA 4990 MHG G E RKR HMW PTR S+NSF KDGR+ISVGDCA Sbjct: 1 MHGIGRAVEERKRSPHMWTVPTR------TIVAATADGSSSSSVNSFCKDGRRISVGDCA 54 Query: 4989 LFKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKTTPNEIFFSFHT 4810 LFKPP DSPPFIGIIR + GKE+KLK+GVNWLYRP EV+LGKGV L+ PNEIF+SFH Sbjct: 55 LFKPPQDSPPFIGIIRWLAAGKEDKLKLGVNWLYRPAEVKLGKGVLLEAAPNEIFYSFHK 114 Query: 4809 DKVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQDYTNDCQEEID 4630 D++ AASLLHPCKVAFLPKGVELPSGISSFVCR+VYDI NKCLWWL+DQDY ++ QEE+D Sbjct: 115 DEIPAASLLHPCKVAFLPKGVELPSGISSFVCRQVYDITNKCLWWLTDQDYIDERQEEVD 174 Query: 4629 KLLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPSQVKGRKRERVD 4450 +LLY+T +EM AT+ R SQLK+GSDS+QNS SS P+QVKG+KRER D Sbjct: 175 QLLYKTQIEMHATLQSDIRSPKPMSGPTSTSQLKAGSDSVQNSGSSVPTQVKGKKRERGD 234 Query: 4449 QGPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEKLVQLMVPARNE 4270 Q E VKRE +AK DDGDS R +S LK+E AK+TEKGGL+D+EGVE+LVQ+M P R E Sbjct: 235 QSSEPVKRERTAKADDGDSNRSRLESNLKSETAKITEKGGLIDSEGVERLVQVMQPDRTE 294 Query: 4269 KKIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIGDGTGSRDGDKS 4090 KIDLA RSML V+AATDKFDCLSRFVQL+GLPV DEWLQE HKG+IGDG+GS+DGDKS Sbjct: 295 -KIDLAGRSMLVGVLAATDKFDCLSRFVQLKGLPVLDEWLQEVHKGRIGDGSGSKDGDKS 353 Query: 4089 VDEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGLVDTWKKRVEAE 3910 ++EFLL LLRALDKLPVNLQALQ CNIGKSVNHLRTHKNLEIQKKAR LVDTWKKRVEAE Sbjct: 354 IEEFLLVLLRALDKLPVNLQALQMCNIGKSVNHLRTHKNLEIQKKARSLVDTWKKRVEAE 413 Query: 3909 MIINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLSASKNASAKIVQ 3730 M INDAKSGS Q+ PWPA+ RLPEV+ GGNR +G S+DVA+KSSV QLSASK A AK+V Sbjct: 414 MNINDAKSGSNQAVPWPAKARLPEVSHGGNRQTGASADVAMKSSVPQLSASKIAPAKVVL 473 Query: 3729 GENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVASARDEKXXXXX 3550 GE+ +R SAS GS KSV SPA A+ N +D QP+ A+ +SDLP+ + RDEK Sbjct: 474 GESTSR--CASASLGSMKSVPSPAQASTNFRDSQPQ-HAVGVASDLPLTTMRDEK-SSSS 529 Query: 3549 XXXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSMNGFPGSTPSRG 3376 SDH K GLS KEDA S+NG PGST S Sbjct: 530 SQSHNSQSCSSDHTKT-GLSVKEDARSPTAGSMSGNKISGGSSRHRKSINGLPGSTLSGV 588 Query: 3375 QRETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIPNRGRSPAQSAS 3199 RE+GSSR S L+KN SE+ SQSG EKALDG +EG+ KL VKIPNR RSPAQSAS Sbjct: 589 LRESGSSR-SPLNKNPASERQSQSGFSCEKALDGPIMEGNSHKLTVKIPNRVRSPAQSAS 647 Query: 3198 AGSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTESWQSNDFKDILT 3019 F+DPS++NSRASSPVLSEKHDQFDR KEK D YRAN +D+NTESWQSNDFKD+LT Sbjct: 648 GALFEDPSMVNSRASSPVLSEKHDQFDRSLKEKSDGYRANNTSDVNTESWQSNDFKDVLT 707 Query: 3018 GSDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQDASYSSINALIE 2839 GSD GDGSPAA TDE+ +T D KK+ + SGNE K N +AS SINALIE Sbjct: 708 GSDGGDGSPAAITDEKCRKTDDESKKLAEA-KGASLSSGNELKVWNSHEASLRSINALIE 766 Query: 2838 G-VKYSEA------DDVGMNLLASVAAGEISKSEMGMPAGSPERNFTTIEQSRPG-DAVV 2683 VKYSE+ DDVGMNLLASVAAGE+SK+++ P SP+RN +E S G D V Sbjct: 767 SCVKYSESNVSMSGDDVGMNLLASVAAGEMSKTDLVSPTDSPQRNTPAVEPSCTGSDLKV 826 Query: 2682 KSSG-EDLVQDKCYANDG-EQKKQVSISSDLG-TDDVNDSGFLASGGKS-AEHNMDTNSC 2515 +SS DL G E ++QV ISS++G D N+ L SG K A + NS Sbjct: 827 ESSPVNDLTPSHTIDGAGVEDEEQVVISSNVGLKDGGNEPASLMSGEKPVAGDSGHFNSS 886 Query: 2514 SMDLQVT-ETCLESKGKLIEKPAGTSSASFPESSAQEARDTDSSXXXXXXXXXXXXXXXG 2338 SM+LQ+T + LES GK E + AS P S+ ++ D + Sbjct: 887 SMELQLTADRFLESDGKSTETTVAATVASSPASAMEKTMDIEGGKPLHNKKAISEVNANA 946 Query: 2337 NLDVK------------VGDV---------AGESEAENPSVEVGVKCDNSATEGLNSGRE 2221 +D K V DV A E + PS+E+ K +EGLNSG + Sbjct: 947 IVDAKEKESGSLLDKDMVSDVVASPEVQMEAIEGSSSYPSLEIDGKNKKLMSEGLNSGVK 1006 Query: 2220 TTQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMAVQDHVNQSKKQ 2041 T +K A+ ++ + VKG E V SG D VPE E + EKN++ HV +++ + Sbjct: 1007 TEEKPLALIIRSEAVKGIDE-VLHSSGGGKDLVPEKGIELKTEKNEERDATIHV-KTETE 1064 Query: 2040 RNESESDAVVVPENR---GLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHTVSPSFPSQ 1870 NE E +A PENR GL S T +Y+E+N KEV+ + R H +SP+FP Q Sbjct: 1065 SNELEGNAPSSPENRMLVGLGSADTSHDDKYLEKNLACKEVHKKRGRPASHKLSPAFPMQ 1124 Query: 1869 EIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNEGLSVDDGK 1690 E DQ +GSKL + +DAEE AKVEFDLNEG +VDDGK Sbjct: 1125 ETDQHERSRGSKLTGAEADDAEEFASTTADASCLSVAGVSDMEAKVEFDLNEGFTVDDGK 1184 Query: 1689 CGELKNTALPGCA----XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPEDLLKIKGEL 1522 GE N GC+ P ITV AAAKGPFVPP DLLK KGEL Sbjct: 1185 LGETNNFTQVGCSAAICLVSPLPFPVSSVSTGIPASITVTAAAKGPFVPPVDLLKSKGEL 1244 Query: 1521 GWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPDERTLNDIS 1342 GWKGSAATSAFRPAEPRK EMP T + +A A R PLDIDLNVPDER L D++ Sbjct: 1245 GWKGSAATSAFRPAEPRKAPEMPQETVTISLLDATAGKNGRFPLDIDLNVPDERILEDLA 1304 Query: 1341 SQNCSDQTVCVSHAVDGHDPHKST---PSPVCCSGGLGLDLNLVDEASDVGNCTKSNSHK 1171 SQ+ +++ +S + H+ + +P CS L LDLN VD+ASD+GN S+ + Sbjct: 1305 SQDSANELGNLSSLTNNHEMAREELMGSAPARCSEALDLDLNRVDDASDMGNYPTSSGRR 1364 Query: 1170 KDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSSLPSQPNVSG 991 DVP + VKS + P N +S RDFDLNNGPAVDE+ EP QLAR+SLP+Q +VSG Sbjct: 1365 MDVPPVPVKSKSGGPFNDAVSACRDFDLNNGPAVDEMNAEPSPFVQLARNSLPAQLSVSG 1424 Query: 990 LSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLGPATGGNPF 811 L MSNAE+ NFS WF S G+ YSAV I SIMPDRG++ F + A G QR LGP NPF Sbjct: 1425 LRMSNAEMGNFSPWFHS-GSNYSAVAIPSIMPDRGEQPFPVIATGGLQRWLGPTGSSNPF 1483 Query: 810 GSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDSTSGGRLCFP 634 DIYRG L SSPAVP+PS+PFQYPVFPF +SFPLPSA+FSGGSTTY DS+SGG++CFP Sbjct: 1484 SPDIYRGPGLSSSPAVPFPSSPFQYPVFPFGTSFPLPSATFSGGSTTYADSSSGGKVCFP 1543 Query: 633 AVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPGSSDLEGRD 454 AV+ Q +GP +SS YPRPY VS PDG+N++S E+SRKW R LDLNAGPGS D+EGRD Sbjct: 1544 AVHPQFLGPAGAVSSHYPRPY-VSFPDGSNNSSGESSRKWGRHALDLNAGPGSLDIEGRD 1602 Query: 453 ESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 E+S L PRQLSVASS A A+EQAR++ + GVLKRKEP+GG DGYKQSSWQ Sbjct: 1603 EAS-LPPRQLSVASSQAIADEQARIYPMAGGVLKRKEPEGGWDGYKQSSWQ 1652 >EOY20638.1 BAH domain,TFIIS helical bundle-like domain isoform 5 [Theobroma cacao] Length = 1583 Score = 1600 bits (4143), Expect = 0.0 Identities = 919/1619 (56%), Positives = 1090/1619 (67%), Gaps = 45/1619 (2%) Frame = -1 Query: 5022 DGRKISVGDCALFKPPIDSPPFIGIIRRVTVGKENKLKVGVNWLYRPIEVRLGKGVPLKT 4843 DGRKISVGDCALFKPP DSPPFIGIIR + GKENKL++GVNWLYRP EV+LGKG+ L+ Sbjct: 2 DGRKISVGDCALFKPPQDSPPFIGIIRCLIAGKENKLRLGVNWLYRPAEVKLGKGILLEA 61 Query: 4842 TPNEIFFSFHTDKVSAASLLHPCKVAFLPKGVELPSGISSFVCRRVYDIANKCLWWLSDQ 4663 PNEIF+SFH D++ AASLLHPCKVAFLPK VELPSGI SFVCRRVYDI NKCLWWL+DQ Sbjct: 62 APNEIFYSFHKDEIPAASLLHPCKVAFLPKDVELPSGICSFVCRRVYDITNKCLWWLTDQ 121 Query: 4662 DYTNDCQEEIDKLLYRTCVEMQATVHPGGRXXXXXXXXXXXSQLKSGSDSIQNSASSFPS 4483 DY N+ QEE+D+LL +T +EM ATV PGGR SQ+K GSDS+QNSASSFPS Sbjct: 122 DYINERQEEVDQLLDKTRLEMHATVQPGGRSPKPMNGPTSTSQIKPGSDSVQNSASSFPS 181 Query: 4482 QVKGRKRERVDQGPESVKRECSAKNDDGDSGHVRQDSILKTEIAKVTEKGGLVDTEGVEK 4303 Q KG+KRER DQG E VKRE ++K DDGDSGH R + LK+EIAK+TEKGGL D+EGVEK Sbjct: 182 QGKGKKRERGDQGSEPVKRERTSKMDDGDSGHGRPEINLKSEIAKITEKGGLEDSEGVEK 241 Query: 4302 LVQLMVPARNEKKIDLASRSMLAAVIAATDKFDCLSRFVQLRGLPVFDEWLQEAHKGKIG 4123 LVQLMVP RNEKKIDL SRSMLA VIAATDKFDCLSRFVQLRGLPVFDEWLQE HKGKIG Sbjct: 242 LVQLMVPERNEKKIDLVSRSMLAGVIAATDKFDCLSRFVQLRGLPVFDEWLQEVHKGKIG 301 Query: 4122 DGTGSRDGDKSVDEFLLGLLRALDKLPVNLQALQACNIGKSVNHLRTHKNLEIQKKARGL 3943 DG+GS+D D+SVD+FLL LLRALDKLPVNL ALQ CNIGKSVNHLR+HKNLEIQKKARGL Sbjct: 302 DGSGSKD-DRSVDDFLLTLLRALDKLPVNLTALQMCNIGKSVNHLRSHKNLEIQKKARGL 360 Query: 3942 VDTWKKRVEAEMIINDAKSGSVQSGPWPARQRLPEVAQGGNRHSGGSSDVAVKSSVTQLS 3763 VDTWKKRVEAEM DAKSGS Q+ PW AR R+ EV+ G++HS GSS+VAVKSSVTQ S Sbjct: 361 VDTWKKRVEAEM---DAKSGSNQAVPWSARPRISEVSHSGSKHS-GSSEVAVKSSVTQFS 416 Query: 3762 ASKNASAKIVQGENNTRSVLTSASPGSAKSVLSPATATANSKDGQPRVVAISGSSDLPVA 3583 ASK S K+ QGE T+S SASPGS K+ SP +A+ N KDGQ R G+SD P Sbjct: 417 ASKTGSVKLAQGETPTKS--ASASPGSMKAATSPVSASTNLKDGQARNATAVGTSD-PQT 473 Query: 3582 SARDEKXXXXXXXXXXXXXXXSDHAKAGGLSGKEDA--XXXXXXXXXXXXXXXXXXXXSM 3409 +ARDEK SDHAK GG+SGKE+A S+ Sbjct: 474 TARDEKSSSSSQSHNNSQSCSSDHAKTGGVSGKEEARSSAAGSGTVTKISGSSSRHRKSI 533 Query: 3408 NGFPGSTPSRGQRETGSSRNSSLHKNLTSEKTSQSGL-MEKALDGTSLEGHIPKLIVKIP 3232 NGFPGS S QRETGSS+NSSLH+N SEK SQSGL EKA+D EG+ K IVKIP Sbjct: 534 NGFPGS--SGVQRETGSSKNSSLHRNPASEKISQSGLTCEKAVDAPMAEGNSHKFIVKIP 591 Query: 3231 NRGRSPAQSASAGSFDDPSIMNSRASSPVLSEKHDQFDRCSKEKIDFYRANIGADINTES 3052 NRGRSPAQS S GS +D S+MNSRASSPVLSEKH+Q DR +KEK + YRAN+ D+NTES Sbjct: 592 NRGRSPAQSVSGGSLEDLSVMNSRASSPVLSEKHEQSDRNTKEKSETYRANVTTDVNTES 651 Query: 3051 WQSNDFKDILTGSDEGDGSPAATTDEEHCRTGDNCKKILDVXXXXXXXSGNESKTRNLQD 2872 WQSNDFKD+LTGSDEGDGSPAA DEEHCR G++ +K +V SGNE K+ LQ+ Sbjct: 652 WQSNDFKDVLTGSDEGDGSPAAVPDEEHCRIGEDARKTTEVTKTASSSSGNELKSGKLQE 711 Query: 2871 ASYSSINALIEG-VKYSEA-------DDVGMNLLASVAAGEISKSEMGMPAGSPERNFTT 2716 AS+SSINALI+ VKYSEA DD GMNLLASVAAGEISKS++ P SP+RN Sbjct: 712 ASFSSINALIDSCVKYSEANACMPVGDDAGMNLLASVAAGEISKSDVASPIDSPQRNTPV 771 Query: 2715 IEQSRPGD--AVVKSSGEDLVQDKCYANDG---EQKKQVSISSD---LGTDDVNDSGFLA 2560 +E S G+ + S+G+D+V+D+ + +G E KQ +++ + D S Sbjct: 772 VEHSSTGNDTRLKPSAGDDVVRDRHQSVEGADDEHLKQGTVAGNSWAKNADCKTGSSQEK 831 Query: 2559 SGGKSAEHNMDTNSCSMDL-QVTETCLES---------------KGKLIEKPAGTSSASF 2428 SGG+ EH + S SM L Q + CLE+ G +EK + Sbjct: 832 SGGELNEHLI---SSSMGLPQTADQCLENGKLKEIVAAALVNLPSGSTVEKTTDVGDSKE 888 Query: 2427 PESSAQEARDTDSSXXXXXXXXXXXXXXXGNLD--VKVGDVAGESEAENPSVEVGVKCDN 2254 D DSS +D VKV A + + PS+EV V+ Sbjct: 889 HLEKKAGGVDDDSSLDTKQKGSTSLVNEDKVVDPGVKVEKEAVDGSSSVPSMEVDVEDKK 948 Query: 2253 SATEGLNSGRETTQKSPAIHMQFDTVKGTAENVAQCSGYSVDKVPEDLNEREPEKNDDMA 2074 + TEGL+ +T + S A+ ++ KG A+ A G + D V E + E + EK+ + Sbjct: 949 NVTEGLDRSLQTHENSAAV--TGNSTKG-ADKEASPPGSAKDIVLEKVGEVKLEKDVETD 1005 Query: 2073 VQDHVNQSKKQRNESESDAVVVPENRGLCSGVTGLAAEYVEENSGTKEVYDQDARQMPHT 1894 + HV ++KQ+ E E+ VT E VEEN EV+ + R P Sbjct: 1006 ARSHVAHTEKQKPEWET--------------VTARKGEQVEENLECSEVH--EPRGGPSP 1049 Query: 1893 VSPSFPSQEIDQCTGPKGSKLAVVDGEDAEECXXXXXXXXXXXXXXXXXXXAKVEFDLNE 1714 S E +Q T +GSKL V + ++AEE AKVEFDLNE Sbjct: 1050 CRASSTVMETEQPTRSRGSKLTVAEADEAEE---RTSTTSDAPATGGADADAKVEFDLNE 1106 Query: 1713 GLSVDDGKCGELKNTALPGCA----XXXXXXXXXXXXXXXXPCVITVAAAAKGPFVPPED 1546 G + D+ K GE N PGC+ P ITVAAAAKGPFVPP+D Sbjct: 1107 GFNADEAKFGEPNNLTAPGCSPPVQLISPLPFPVSSVSSSLPASITVAAAAKGPFVPPDD 1166 Query: 1545 LLKIKGELGWKGSAATSAFRPAEPRKIMEMPLGITTTPIPNAAAKNQSRAPLDIDLNVPD 1366 LL+ KG LGWKGSAATSAFRPAEPRK ++MPLG + +P+A QSR PLDIDLNVPD Sbjct: 1167 LLRTKGVLGWKGSAATSAFRPAEPRKSLDMPLGTSNASMPDATTCKQSRPPLDIDLNVPD 1226 Query: 1365 ERTLNDISSQNCSDQTVCVSHAVDGHDPHKST--PSPVCCSGGLGLDLNLVDEASDVGNC 1192 ER L D++S++ + T + D +P+ SGGL LDLN VDE D+GN Sbjct: 1227 ERVLEDLASRSSAQGTDSAPDLTNNRDLTCGLMGSAPIRSSGGLDLDLNRVDEPIDLGNH 1286 Query: 1191 TKSNSHKKDVPLLQVKSSADSPPNREMSFRRDFDLNNGPAVDEVTTEPLISTQLARSS-L 1015 + +S + DVP+ +KSS+ N E S RRDFDLNNGPAVDEV+ EP + +Q RSS + Sbjct: 1287 STGSSRRLDVPMQPLKSSSGGILNGEASVRRDFDLNNGPAVDEVSAEPSLFSQHNRSSNV 1346 Query: 1014 PSQPNVSGLSMSNAEVANFSSWFPSTGNTYSAVTISSIMPDRGDKSFSIGAPNGPQRMLG 835 PSQP VS L ++N E+ANFSSWFP TGNTYSAVTI SI+PDRG++ F I A GP R+LG Sbjct: 1347 PSQPPVSSLRINNTEMANFSSWFP-TGNTYSAVTIPSILPDRGEQPFPIVATGGPPRVLG 1405 Query: 834 PATGGNPFGSDIYRGTVL-SSPAVPYPSAPFQYPVFPFNSSFPLPSASFSGGSTTYVDST 658 P T PF D+YRG VL SSPAVP+PSAPFQYPVFPF ++FPLPS SFSGGSTTYVDS+ Sbjct: 1406 PPTAATPFNPDVYRGPVLSSSPAVPFPSAPFQYPVFPFGTTFPLPSTSFSGGSTTYVDSS 1465 Query: 657 SGGRLCFPAVNSQLMGPVNTISSQYPRPYVVSLPDGNNSNSAENSRKWARQGLDLNAGPG 478 GRLCFP V SQL+GP + S Y RPYVVSLPDG+N++ AE+ RKW RQGLDLNAGPG Sbjct: 1466 PSGRLCFPPV-SQLLGPAGAVPSHYARPYVVSLPDGSNNSGAESGRKWGRQGLDLNAGPG 1524 Query: 477 SSDLEGRDESSPLVPRQLSVASSLAPAEEQARMFHLGSGVLKRKEPDGGQDGYKQSSWQ 301 D+EGRDE+SPL RQLSVASS A AEEQARM+ + G+LKRKEP+GG DGYKQSSWQ Sbjct: 1525 GPDIEGRDETSPLASRQLSVASSQALAEEQARMYQVPGGILKRKEPEGGWDGYKQSSWQ 1583