BLASTX nr result

ID: Astragalus23_contig00004660 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00004660
         (3259 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004501561.1| PREDICTED: THO complex subunit 5A [Cicer ari...  1283   0.0  
ref|XP_003603050.1| THO complex, protein [Medicago truncatula] >...  1246   0.0  
ref|XP_020218905.1| THO complex subunit 5B [Cajanus cajan]           1217   0.0  
gb|KYP65083.1| THO complex subunit 5 isogeny [Cajanus cajan]         1212   0.0  
gb|KHN22326.1| THO complex subunit 5 like [Glycine soja]             1200   0.0  
ref|XP_017436837.1| PREDICTED: THO complex subunit 5A [Vigna ang...  1196   0.0  
gb|KHN04496.1| THO complex subunit 5 like [Glycine soja]             1196   0.0  
ref|XP_007136690.1| hypothetical protein PHAVU_009G065600g [Phas...  1196   0.0  
ref|XP_003528289.1| PREDICTED: THO complex subunit 5A-like isofo...  1196   0.0  
dbj|BAT77850.1| hypothetical protein VIGAN_02045000 [Vigna angul...  1194   0.0  
ref|XP_003523934.1| PREDICTED: THO complex subunit 5A-like [Glyc...  1192   0.0  
ref|XP_014501896.1| THO complex subunit 5A isoform X1 [Vigna rad...  1192   0.0  
ref|XP_019414808.1| PREDICTED: THO complex subunit 5A [Lupinus a...  1186   0.0  
gb|OIV97645.1| hypothetical protein TanjilG_12402 [Lupinus angus...  1162   0.0  
gb|ABC47853.1| expressed protein-like protein [Glycine max]          1148   0.0  
ref|XP_015934253.1| THO complex subunit 5A [Arachis duranensis] ...  1133   0.0  
ref|XP_016163247.1| LOW QUALITY PROTEIN: THO complex subunit 5A ...  1125   0.0  
gb|PNY16156.1| THO complex subunit 5 B [Trifolium pratense]          1115   0.0  
dbj|GAU42192.1| hypothetical protein TSUD_305370 [Trifolium subt...  1108   0.0  
ref|XP_014632732.1| PREDICTED: THO complex subunit 5A-like isofo...  1058   0.0  

>ref|XP_004501561.1| PREDICTED: THO complex subunit 5A [Cicer arietinum]
          Length = 807

 Score = 1283 bits (3321), Expect = 0.0
 Identities = 642/799 (80%), Positives = 701/799 (87%), Gaps = 1/799 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EEDEQ F HSSDD K EESPYETLHNSK+SIENI+++ILSIKKD+KPKQLLRDL
Sbjct: 9    GSITAPEEDEQSFLHSSDDSKSEESPYETLHNSKSSIENIISDILSIKKDSKPKQLLRDL 68

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSIL+EEDRVK+ETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC
Sbjct: 69   VTQMFLHFITLRQANRSILIEEDRVKTETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 128

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDI+LVPE+EFFRDAP++IK SVLSKD+AHNLMLKRLNFELYQRKELCKHH K
Sbjct: 129  KDFKSKYPDIELVPEEEFFRDAPKDIKDSVLSKDSAHNLMLKRLNFELYQRKELCKHHAK 188

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLG++HTK+ KQH +AELLPPA
Sbjct: 189  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGIMHTKKLKQHHSAELLPPA 248

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF RNQAH  TGIST MESSK+EDD+PDD
Sbjct: 249  LYVIYSQLLAQKEAFVEPIDLEIVGSLKDAQAFARNQAHKDTGISTVMESSKVEDDIPDD 308

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ESPDQGG FQ HPLKI +HVYEDEAS+PKPAKL+TLRFEYMVK
Sbjct: 309  EEDGQRRRKRPRRVQVKESPDQGGIFQSHPLKITVHVYEDEASNPKPAKLITLRFEYMVK 368

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN   +NDILCNLFPNDTGLELPHQSAKL VQDAI FNTQRTSRPYKWAQ
Sbjct: 369  LNVVCVGIEGSNDGLDNDILCNLFPNDTGLELPHQSAKLFVQDAIEFNTQRTSRPYKWAQ 428

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVSPLL T N+E A++EDVISG SLYRQQNRVQTVL RIRSRR        
Sbjct: 429  HLAGIDFLPEVSPLLPTDNSEAAKNEDVISGFSLYRQQNRVQTVLQRIRSRRKAQLALLE 488

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP+LSC+SVPWA+HTPLCKLDGWSPIR LPV SEAS   +IDKEEHVQE M
Sbjct: 489  QLESLTKLEWPILSCKSVPWALHTPLCKLDGWSPIRALPVPSEASPPAIIDKEEHVQESM 548

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            DVD+ E SG TK EL+++TEDGELPTLLP  +KFDHS QASLIS SIIP LNKVRSQSFK
Sbjct: 549  DVDVMENSGATKEELDSMTEDGELPTLLPKKTKFDHSKQASLISKSIIPSLNKVRSQSFK 608

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K DD++DFLLDT+SDFDEP+Q E + ENIVSDYCAR S+SW DSGV+EF  VLSRK+NA 
Sbjct: 609  KADDSSDFLLDTDSDFDEPSQIESDRENIVSDYCARNSLSWKDSGVKEFFFVLSRKTNAD 668

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E+ V+LEAKIKISMEYPLRPPLFALSL       NH  N+GL+WYNELRA+EAEVNLH+L
Sbjct: 669  EKTVSLEAKIKISMEYPLRPPLFALSLRCTPFEGNHLENNGLEWYNELRAIEAEVNLHVL 728

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K LP  EQN+VLAHQV+CLAMLFDYYLDDA SSSERTN +S+VDVGLC P+SGRFL RSF
Sbjct: 729  KTLPVIEQNYVLAHQVNCLAMLFDYYLDDAGSSSERTNSSSLVDVGLCPPISGRFLGRSF 788

Query: 2449 RGRDRRKMISWKDMNFISN 2505
            RGRD RKMISWKDM F S+
Sbjct: 789  RGRDHRKMISWKDMKFTSS 807


>ref|XP_003603050.1| THO complex, protein [Medicago truncatula]
 gb|AES73301.1| THO complex, protein [Medicago truncatula]
          Length = 807

 Score = 1246 bits (3225), Expect = 0.0
 Identities = 631/800 (78%), Positives = 696/800 (87%), Gaps = 2/800 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKL-EESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRD 288
            GSI M E++EQ F +SS+D K  EESPYE L NSK+SIE+I+++ILSIKK+AKPKQLLRD
Sbjct: 9    GSIAMEEQNEQAFPNSSEDSKSDEESPYELLQNSKSSIESIISDILSIKKEAKPKQLLRD 68

Query: 289  LVTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKA 468
            LVTQMFLHFITLRQ NRSIL+EEDRVK +TERAKAPVDFTTLQLHNLVYEKSHYLKAIKA
Sbjct: 69   LVTQMFLHFITLRQANRSILIEEDRVKMQTERAKAPVDFTTLQLHNLVYEKSHYLKAIKA 128

Query: 469  CKDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHE 648
            CKDFKSKYPDI+LVPE+EFFRDAP++IK  VLSKD+AHNLMLKRLNFELYQRKELCKHH 
Sbjct: 129  CKDFKSKYPDIELVPEEEFFRDAPKDIKDLVLSKDSAHNLMLKRLNFELYQRKELCKHHA 188

Query: 649  KLEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPP 828
            KLE QKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLG+ HTK+ KQH +AELLPP
Sbjct: 189  KLELQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGITHTKKLKQHHSAELLPP 248

Query: 829  ALYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPD 1008
            ALYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF R+QA+  TGIST M+SSKLEDDV D
Sbjct: 249  ALYVIYSQLLAQKEAFAEPIDLEIVGSLKDAQAFARSQANKDTGISTVMDSSKLEDDVHD 308

Query: 1009 DEEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMV 1188
            DEEDG             ESPDQGG F+ HPLKIII+VYEDE+SDPKPAKL+TLRFEY+V
Sbjct: 309  DEEDGQRRRKRPRRVEVKESPDQGGIFKSHPLKIIINVYEDESSDPKPAKLITLRFEYVV 368

Query: 1189 KLNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWA 1368
            KLN VCVG+EGSN  P+NDILCNLFPNDTGLELPHQSAKL VQ+A+ FNTQRTSRPYKWA
Sbjct: 369  KLNTVCVGVEGSNDGPDNDILCNLFPNDTGLELPHQSAKLFVQNAMAFNTQRTSRPYKWA 428

Query: 1369 QHLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXX 1548
            QHLAGIDFLPEVSPLL   N+E A+SEDV+SGL+LYRQQNRV TVL RIRSRR       
Sbjct: 429  QHLAGIDFLPEVSPLLPADNSEAAKSEDVVSGLTLYRQQNRVHTVLQRIRSRRKAQLALL 488

Query: 1549 XXXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEP 1728
               ESLTK+EWPLLSC+SVPWA+HTPLCKLDGWSPIR LPV SEASS  +IDKEEHVQE 
Sbjct: 489  EQLESLTKLEWPLLSCKSVPWALHTPLCKLDGWSPIRALPVPSEASSPAIIDKEEHVQES 548

Query: 1729 MDVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSF 1905
             DVD+ E SG TK EL+++TEDGELPTLLP  +KFDHS QASLIS SIIP LNKVRS SF
Sbjct: 549  TDVDVIENSGVTKGELDSMTEDGELPTLLPKRTKFDHSKQASLISKSIIPSLNKVRSLSF 608

Query: 1906 KKVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNA 2085
            KK DD++DFLLDT+SDFDEPAQ E E+E+ VSDYCA KS+SWM+SG +EF+LVLSRK+NA
Sbjct: 609  KKGDDSSDFLLDTDSDFDEPAQVESEHEHSVSDYCATKSLSWMESGAKEFVLVLSRKTNA 668

Query: 2086 GERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHI 2265
             ERNV LEAKIKISMEYPLRPPLFALS  G  SGE H  NDGL+WYNELRA+EAEVNLH+
Sbjct: 669  DERNVNLEAKIKISMEYPLRPPLFALSFCGRPSGEYHIENDGLEWYNELRAIEAEVNLHM 728

Query: 2266 LKILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRS 2445
            LK LP +E N+VLAHQVSCLAMLFDYYLDD  SSSERTNCT++VDVGLCKPVSG FL RS
Sbjct: 729  LKTLPVNEHNYVLAHQVSCLAMLFDYYLDDG-SSSERTNCTTLVDVGLCKPVSGGFLGRS 787

Query: 2446 FRGRDRRKMISWKDMNFISN 2505
            FRGRD RK ISWKD  F SN
Sbjct: 788  FRGRDHRKTISWKDTKFTSN 807


>ref|XP_020218905.1| THO complex subunit 5B [Cajanus cajan]
          Length = 800

 Score = 1217 bits (3148), Expect = 0.0
 Identities = 617/801 (77%), Positives = 687/801 (85%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            G +   +EDEQ    SS+ RKLEESP+E L  SK S+ENIVA+ILSIKK+AKPKQ LRDL
Sbjct: 9    GPMGGVDEDEQ----SSEPRKLEESPFEILRTSKTSVENIVADILSIKKEAKPKQHLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHF+TLRQ NR+ILLEEDRVK+ETERAKAPVDFT+LQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFVTLRQANRNILLEEDRVKTETERAKAPVDFTSLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSSDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETI+ RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LEQQKKILLETISNRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF R QAH  T +S+  +SSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFREPIDLEIVGSLKDAQAFARQQAHKDTDMSSTADSSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             +S DQGG FQ HPLKIIIH+YEDE S P  AKL+TLRFEYMVK
Sbjct: 305  EEDGQRRRKRPRRVQVKDSLDQGGIFQAHPLKIIIHLYEDETSGPNSAKLITLRFEYMVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN  PENDILCNLFPNDTGLELPHQSAKL+VQDA TF+TQRTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGSNDGPENDILCNLFPNDTGLELPHQSAKLIVQDAKTFDTQRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVSPL+LT N+   ++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEVSPLILTDNDGAVKNENVISGLSLYRQQNRVLTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP LSC+SVPWA+HTPLC LD WSP+R  PV  E+SS  V+DKEEHVQEPM
Sbjct: 485  QLESLTKLEWPHLSCKSVPWALHTPLCNLDSWSPVRVPPVPRESSSPAVVDKEEHVQEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+NERSG TK ELE++TEDGELPTLLPN+SK DHS Q +LIS SIIPPLNK+RSQSFK
Sbjct: 545  DADINERSGATKAELESITEDGELPTLLPNMSKLDHSTQLNLISKSIIPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLD ESD DEPAQ E+E+ENI+S    RKSVSWMD G++EF LVL RK  A 
Sbjct: 605  KLDDSSDFLLDIESDLDEPAQIEQEHENILS----RKSVSWMDYGLREFRLVLCRKITAD 660

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E NV LEAKIKISMEYPLRPPLFALS+  +SSGENH G  GL+WYNELRAMEAEVNLHI+
Sbjct: 661  ESNVNLEAKIKISMEYPLRPPLFALSICCISSGENHDGT-GLEWYNELRAMEAEVNLHII 719

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            ++LP  +QN++LAHQVSCLAMLFDYYLD+A  SSERTNCTSVVDVGLCKPV+GRFL RSF
Sbjct: 720  RMLPVDQQNYILAHQVSCLAMLFDYYLDEASPSSERTNCTSVVDVGLCKPVNGRFLGRSF 779

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDMNF S+ P
Sbjct: 780  RGRDRRKMISWKDMNFNSSRP 800


>gb|KYP65083.1| THO complex subunit 5 isogeny [Cajanus cajan]
          Length = 809

 Score = 1212 bits (3135), Expect = 0.0
 Identities = 618/810 (76%), Positives = 688/810 (84%), Gaps = 10/810 (1%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            G +   +EDEQ    SS+ RKLEESP+E L  SK S+ENIVA+ILSIKK+AKPKQ LRDL
Sbjct: 9    GPMGGVDEDEQ----SSEPRKLEESPFEILRTSKTSVENIVADILSIKKEAKPKQHLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHF+TLRQ NR+ILLEEDRVK+ETERAKAPVDFT+LQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFVTLRQANRNILLEEDRVKTETERAKAPVDFTSLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSSDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETI+ RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LEQQKKILLETISNRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATG---------ISTAMESS 984
            LYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF R QAH  TG         +S+  +SS
Sbjct: 245  LYVIYSQLLAQKEAFREPIDLEIVGSLKDAQAFARQQAHKDTGFTMMFYYSDMSSTADSS 304

Query: 985  KLEDDVPDDEEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLV 1164
            KLEDD PD+EEDG             +S DQGG FQ HPLKIIIH+YEDE S P  AKL+
Sbjct: 305  KLEDDAPDEEEDGQRRRKRPRRVQVKDSLDQGGIFQAHPLKIIIHLYEDETSGPNSAKLI 364

Query: 1165 TLRFEYMVKLNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQR 1344
            TLRFEYMVKLN+VCVGIEGSN  PENDILCNLFPNDTGLELPHQSAKL+VQDA TF+TQR
Sbjct: 365  TLRFEYMVKLNVVCVGIEGSNDGPENDILCNLFPNDTGLELPHQSAKLIVQDAKTFDTQR 424

Query: 1345 TSRPYKWAQHLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSR 1524
            TSRPYKWAQHLAGIDFLPEVSPL+LT N+   ++E+VISGLSLYRQQNRV TVL RIR+R
Sbjct: 425  TSRPYKWAQHLAGIDFLPEVSPLILTDNDGAVKNENVISGLSLYRQQNRVLTVLQRIRAR 484

Query: 1525 RXXXXXXXXXXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVID 1704
            R          ESLTK+EWP LSC+SVPWA+HTPLC LD WSP+R  PV  E+SS  V+D
Sbjct: 485  RKAQLALLEQLESLTKLEWPHLSCKSVPWALHTPLCNLDSWSPVRVPPVPRESSSPAVVD 544

Query: 1705 KEEHVQEPMDVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPL 1881
            KEEHVQEPMD D+NERSG TK ELE++TEDGELPTLLPN+SK DHS Q +LIS SIIPPL
Sbjct: 545  KEEHVQEPMDADINERSGATKAELESITEDGELPTLLPNMSKLDHSTQLNLISKSIIPPL 604

Query: 1882 NKVRSQSFKKVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLL 2061
            NK+RSQSFKK+DD++DFLLD ESD DEPAQ E+E+ENI+S    RKSVSWMD G++EF L
Sbjct: 605  NKIRSQSFKKLDDSSDFLLDIESDLDEPAQIEQEHENILS----RKSVSWMDYGLREFRL 660

Query: 2062 VLSRKSNAGERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAM 2241
            VL RK  A E NV LEAKIKISMEYPLRPPLFALS+  +SSGENH G  GL+WYNELRAM
Sbjct: 661  VLCRKITADESNVNLEAKIKISMEYPLRPPLFALSICCISSGENHDGT-GLEWYNELRAM 719

Query: 2242 EAEVNLHILKILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPV 2421
            EAEVNLHI+++LP  +QN++LAHQVSCLAMLFDYYLD+A  SSERTNCTSVVDVGLCKPV
Sbjct: 720  EAEVNLHIIRMLPVDQQNYILAHQVSCLAMLFDYYLDEASPSSERTNCTSVVDVGLCKPV 779

Query: 2422 SGRFLDRSFRGRDRRKMISWKDMNFISNYP 2511
            +GRFL RSFRGRDRRKMISWKDMNF S+ P
Sbjct: 780  NGRFLGRSFRGRDRRKMISWKDMNFNSSRP 809


>gb|KHN22326.1| THO complex subunit 5 like [Glycine soja]
          Length = 802

 Score = 1200 bits (3105), Expect = 0.0
 Identities = 611/801 (76%), Positives = 681/801 (85%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPYE L NSKAS+E+IVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYEMLRNSKASVESIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE++FFRDAPQ+I+ SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEDFFRDAPQDIQDSVLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTK+ KQH +AELLPP 
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKKLKQHHSAELLPPG 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST MESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTMESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKI+IHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQTKESLDQGGLYQVHPLKIVIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN  PENDILCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGSNDGPENDILCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVS LLLT N+   ++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEVSRLLLTDNSGAVKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC L  WSP+R  PVL E+SS  VIDKEEH+QEPM
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLGSWSPVRLPPVLRESSSPAVIDKEEHIQEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E +++TEDGELPTLLP +SK D S Q +LIS SIIPPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPQSITEDGELPTLLPKVSKLDLSAQLNLISKSIIPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLD ESD DEPAQ E+E+E   S+Y ARKSVSWM+ G++EF LVL RK +A 
Sbjct: 605  KIDDSSDFLLDIESDIDEPAQIEQEHEK--SNYHARKSVSWMNYGLKEFRLVLCRKISAD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E  + LEAKIKISMEYPLRPPLFALS+  +SSGENH    GL+WYNELRAMEA VNLHIL
Sbjct: 663  ESKLNLEAKIKISMEYPLRPPLFALSISCISSGENH-DETGLEWYNELRAMEAAVNLHIL 721

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K+L  ++QN+VLAHQV+CLAMLFDYYLD+A  SSERTNCTSVVD+GLCKPV+GRFL RSF
Sbjct: 722  KMLLVNQQNYVLAHQVNCLAMLFDYYLDEASPSSERTNCTSVVDIGLCKPVTGRFLGRSF 781

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM   S+ P
Sbjct: 782  RGRDRRKMISWKDMKLNSSCP 802


>ref|XP_017436837.1| PREDICTED: THO complex subunit 5A [Vigna angularis]
 gb|KOM51504.1| hypothetical protein LR48_Vigan09g016300 [Vigna angularis]
          Length = 804

 Score = 1196 bits (3095), Expect = 0.0
 Identities = 609/801 (76%), Positives = 676/801 (84%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+E    HSS+  K EESPYE L NSKA +ENIVA+ILSIKKD KPKQ LRDL
Sbjct: 9    GSIGGVEEEE----HSSEPHKSEESPYEILRNSKAFVENIVADILSIKKDRKPKQHLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
             TQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEK+HY+KAIKAC
Sbjct: 65   ATQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKNHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCKHHEK
Sbjct: 125  KDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSNDAAHNLMLRRLNFELFQRKELCKHHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LE QKK+LL+TIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LELQKKILLQTIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH     ST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDNDNSTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG FQ+HPLKIIIHVYED+ SD K +KL TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIFQVHPLKIIIHVYEDQHSDSKSSKL-TLRFEYLVK 363

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN SPENDILCNLFPNDTGLELP QSAKL VQ+AI FNT RTSRPYKWAQ
Sbjct: 364  LNVVCVGIEGSNESPENDILCNLFPNDTGLELPQQSAKLFVQEAIAFNTLRTSRPYKWAQ 423

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGI+FLPEVSPLLLT +N  A++E+VISGLSLYRQQNR+ TVL RIR RR        
Sbjct: 424  HLAGIEFLPEVSPLLLTDDNGAAKNENVISGLSLYRQQNRISTVLQRIRDRRKAQLALLE 483

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESL K++WP LSC+S+PWA HTPLC LD WSP+R  PV  E+SS TVID+EEHVQEPM
Sbjct: 484  QLESLAKLDWPRLSCRSIPWAFHTPLCNLDSWSPVRLPPVPRESSSPTVIDEEEHVQEPM 543

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+NERS  TK E E++TEDGELPTLLPN+SK DHS Q +LIS SI+PPLNK+RSQSFK
Sbjct: 544  DADVNERSDVTKAEPESITEDGELPTLLPNMSKLDHSTQLNLISKSILPPLNKIRSQSFK 603

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K DD++DFLLDTESD DEPAQ E E+EN +S+Y  R SVSWMD G++EF LVL RK +A 
Sbjct: 604  KFDDSSDFLLDTESDLDEPAQTELEHENTLSNYHDRNSVSWMDHGLKEFCLVLCRKISAD 663

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E NV LEAKIKISMEYPLRPPLFALS+  +SSG+      GL+WYNELRAMEAEVNLHIL
Sbjct: 664  ENNVNLEAKIKISMEYPLRPPLFALSVRCISSGQKS-DKLGLEWYNELRAMEAEVNLHIL 722

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            ++LP ++QN+VLAHQVSCLAMLFDYYLD+A+ SSERTNCTSVVDVGLCKPV+GRFL R F
Sbjct: 723  RMLPVNQQNYVLAHQVSCLAMLFDYYLDEAFPSSERTNCTSVVDVGLCKPVTGRFLGRCF 782

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM    + P
Sbjct: 783  RGRDRRKMISWKDMKLNFSCP 803


>gb|KHN04496.1| THO complex subunit 5 like [Glycine soja]
          Length = 802

 Score = 1196 bits (3095), Expect = 0.0
 Identities = 608/799 (76%), Positives = 680/799 (85%), Gaps = 1/799 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPY+ L NSKAS++NIVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYQMLRNSKASVQNIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE++FFRDAPQ+I+ S LS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEDFFRDAPQDIQDSFLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKIIIHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIYQVHPLKIIIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN +PEND+LCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPY+WAQ
Sbjct: 365  LNVVCVGIEGSNDAPENDLLCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYRWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPE+SPLLLT N+  A++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEMSPLLLTDNSGAAKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC LD WSP++  PV  E+SS  VIDKEEH+QE M
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLDSWSPVKLPPVPRESSSPAVIDKEEHIQEAM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E E++TEDGELPTLLP +SK   S Q +LIS SI+PPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPESITEDGELPTLLPKVSKLGLSAQLNLISKSIVPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLDTESD DEPAQ E+E+E   S+Y ARKSVSWM+ G++EF LVL RK    
Sbjct: 605  KIDDSSDFLLDTESDLDEPAQIEQEHEK--SNYHARKSVSWMNYGLKEFHLVLCRKIGTD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E N+ LEAKI+ISMEYPLRPPLF LS+  +SSGENH    GL+WYNELRAMEAEVNLH+L
Sbjct: 663  ESNLNLEAKIQISMEYPLRPPLFLLSISCISSGENH-DETGLEWYNELRAMEAEVNLHLL 721

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K+L  ++QN VLAHQVSCLAMLFDYYLD+A  SSERTNCTSVVDVGLCKPVSGRFL RSF
Sbjct: 722  KMLTVNQQNFVLAHQVSCLAMLFDYYLDEASPSSERTNCTSVVDVGLCKPVSGRFLGRSF 781

Query: 2449 RGRDRRKMISWKDMNFISN 2505
            RGRDRRKMISWKDM   S+
Sbjct: 782  RGRDRRKMISWKDMKLNSS 800


>ref|XP_007136690.1| hypothetical protein PHAVU_009G065600g [Phaseolus vulgaris]
 gb|ESW08684.1| hypothetical protein PHAVU_009G065600g [Phaseolus vulgaris]
          Length = 805

 Score = 1196 bits (3094), Expect = 0.0
 Identities = 607/801 (75%), Positives = 672/801 (83%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+E+    SS+  K EESPYE L NSKAS+ENI+A++L+IKKD KPKQ LRDL
Sbjct: 9    GSIGGVEEEER----SSEPHKSEESPYEMLRNSKASVENIIADVLAIKKDGKPKQHLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEK+HY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKNHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
             DFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  TDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LL+TIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +A LLPPA
Sbjct: 185  LEQQKKILLQTIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAGLLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQL AQKEAF EPIDLE++GSLKDAQAF R QAH  T  ST MESSKLEDD PD+
Sbjct: 245  LYVIYSQLFAQKEAFAEPIDLEIIGSLKDAQAFARQQAHKDTDNSTTMESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG FQ+HPLKII+HVYEDE SD K AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIFQVHPLKIIVHVYEDEDSDSKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEG N  PENDILCNLFPNDTGLELP QSAKL VQDA TFN+QRTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGCNEGPENDILCNLFPNDTGLELPQQSAKLFVQDATTFNSQRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVSPLLLT ++  A++E+VISGLSLYRQQNRV TVLHRIR RR        
Sbjct: 425  HLAGIDFLPEVSPLLLTEDSGAAKNENVISGLSLYRQQNRVMTVLHRIRDRRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              E L K++WP LSC+ VPWA HTPLC LD WSP+R  PV  E+SS  VID+EEHV EPM
Sbjct: 485  QLEFLAKLDWPCLSCKIVPWAFHTPLCNLDSWSPVRLPPVPRESSSPAVIDEEEHVHEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+NE S  TK E E++TEDGELPTLLPN+SK DHS Q +LIS SI+PPLNK+RSQSFK
Sbjct: 545  DADVNEHSDVTKAEPESITEDGELPTLLPNMSKLDHSTQLNLISKSIVPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K DD++DFLLDTESD DEPAQ E E+ENI+S+Y  R SVSWM  G++EF LVL RK +A 
Sbjct: 605  KYDDSSDFLLDTESDLDEPAQTELEHENILSNYHDRNSVSWMHHGLKEFCLVLCRKISAD 664

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E NV LEAKIKISMEYPLRPPLFALS+  +SSGE      GL+WYNELRAMEAEVNLHIL
Sbjct: 665  ESNVKLEAKIKISMEYPLRPPLFALSIRCISSGEKR-DKLGLEWYNELRAMEAEVNLHIL 723

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K+LP ++QN+VLAHQVSCLAMLFDYYLD+A+ SSERTNCTSVVDVGLCKPV+GRFL R F
Sbjct: 724  KMLPINQQNYVLAHQVSCLAMLFDYYLDEAFPSSERTNCTSVVDVGLCKPVTGRFLGRCF 783

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM F S  P
Sbjct: 784  RGRDRRKMISWKDMKFNSRCP 804


>ref|XP_003528289.1| PREDICTED: THO complex subunit 5A-like isoform X1 [Glycine max]
 gb|KRH52004.1| hypothetical protein GLYMA_06G040600 [Glycine max]
 gb|KRH52005.1| hypothetical protein GLYMA_06G040600 [Glycine max]
          Length = 802

 Score = 1196 bits (3093), Expect = 0.0
 Identities = 609/801 (76%), Positives = 679/801 (84%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPYE L NSKAS+E+IVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYEMLRNSKASVESIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
             DFKSKYPDIDLVPE++FFRDAPQ+I+ SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  NDFKSKYPDIDLVPEEDFFRDAPQDIQDSVLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTK+ KQH +AELLPP 
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKKLKQHHSAELLPPG 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST MESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTMESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKI+IHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQTKESLDQGGLYQVHPLKIVIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN  PENDILCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGSNDGPENDILCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVS LLLT N+   ++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEVSRLLLTDNSGAVKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC L  WSP+R  PVL E+SS  VIDKEEH+QEPM
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLGSWSPVRLPPVLRESSSPAVIDKEEHIQEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E +++TEDGELPTLLP +SK D S Q +LIS SIIPPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPQSITEDGELPTLLPKVSKLDLSAQLNLISKSIIPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLD ESD DEPAQ E+E+E   S+Y ARKS SWM+ G++EF LVL RK +A 
Sbjct: 605  KIDDSSDFLLDIESDIDEPAQIEQEHEK--SNYHARKSGSWMNYGLKEFRLVLCRKISAD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E  + LEAKIKISMEYPLRPPLFALS+  +SSGENH    GL+WYNELRAMEA VNLHIL
Sbjct: 663  ESKLNLEAKIKISMEYPLRPPLFALSISCISSGENH-DETGLEWYNELRAMEAAVNLHIL 721

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K+L  ++QN+VLAHQV+CLAMLFDYYLD+A  SSERTNCTSVVD+GLCKPV+GRFL RSF
Sbjct: 722  KMLLVNQQNYVLAHQVNCLAMLFDYYLDEASPSSERTNCTSVVDIGLCKPVTGRFLGRSF 781

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM   S+ P
Sbjct: 782  RGRDRRKMISWKDMKLNSSCP 802


>dbj|BAT77850.1| hypothetical protein VIGAN_02045000 [Vigna angularis var. angularis]
          Length = 804

 Score = 1194 bits (3090), Expect = 0.0
 Identities = 608/801 (75%), Positives = 675/801 (84%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+E    HSS+  K EESPYE L NSKA +ENIVA+ILSIKKD KPKQ LRDL
Sbjct: 9    GSIGGVEEEE----HSSEPHKSEESPYEILRNSKAFVENIVADILSIKKDRKPKQHLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
             TQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEK+HY+KAIKAC
Sbjct: 65   ATQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKNHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCKHHEK
Sbjct: 125  KDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSNDAAHNLMLRRLNFELFQRKELCKHHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LE QKK+LL+TIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LELQKKILLQTIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH     ST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDNDNSTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG FQ+HPLKIIIHVYED+ SD K +KL TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIFQVHPLKIIIHVYEDQHSDSKSSKL-TLRFEYLVK 363

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN SPENDILCNLFPNDTGLELP QSAKL VQ+AI FNT RTSRPYKWAQ
Sbjct: 364  LNVVCVGIEGSNESPENDILCNLFPNDTGLELPQQSAKLFVQEAIAFNTLRTSRPYKWAQ 423

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGI+FLPEVSPLLLT +N  A++E+VISGLSLYRQQNR+ TVL RI  RR        
Sbjct: 424  HLAGIEFLPEVSPLLLTDDNGAAKNENVISGLSLYRQQNRISTVLQRIHDRRKAQLALLE 483

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESL K++WP LSC+S+PWA HTPLC LD WSP+R  PV  E+SS TVID+EEHVQEPM
Sbjct: 484  QLESLAKLDWPRLSCRSIPWAFHTPLCNLDSWSPVRLPPVPRESSSPTVIDEEEHVQEPM 543

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+NERS  TK E E++TEDGELPTLLPN+SK DHS Q +LIS SI+PPLNK+RSQSFK
Sbjct: 544  DADVNERSDVTKAEPESITEDGELPTLLPNMSKLDHSTQLNLISKSILPPLNKIRSQSFK 603

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K DD++DFLLDTESD DEPAQ E E+EN +S+Y  R SVSWMD G++EF LVL RK +A 
Sbjct: 604  KFDDSSDFLLDTESDLDEPAQTELEHENTLSNYHDRNSVSWMDHGLKEFCLVLCRKISAD 663

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E NV LEAKIKISMEYPLRPPLFALS+  +SSG+      GL+WYNELRAMEAEVNLHIL
Sbjct: 664  ENNVNLEAKIKISMEYPLRPPLFALSVRCISSGQKS-DKLGLEWYNELRAMEAEVNLHIL 722

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            ++LP ++QN+VLAHQVSCLAMLFDYYLD+A+ SSERTNCTSVVDVGLCKPV+GRFL R F
Sbjct: 723  RMLPVNQQNYVLAHQVSCLAMLFDYYLDEAFPSSERTNCTSVVDVGLCKPVTGRFLGRCF 782

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM    + P
Sbjct: 783  RGRDRRKMISWKDMKLNFSCP 803


>ref|XP_003523934.1| PREDICTED: THO complex subunit 5A-like [Glycine max]
 gb|KRH61301.1| hypothetical protein GLYMA_04G039400 [Glycine max]
          Length = 802

 Score = 1192 bits (3084), Expect = 0.0
 Identities = 605/799 (75%), Positives = 679/799 (84%), Gaps = 1/799 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPY+ L NSKAS++NIVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYQMLRNSKASVQNIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE++FFRDAPQ+I+   LS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEDFFRDAPQDIQDCFLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKIIIHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIYQVHPLKIIIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN +PEND+LCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPY+WAQ
Sbjct: 365  LNVVCVGIEGSNDAPENDLLCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYRWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPE+SPLLLT N+  A++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEMSPLLLTDNSGAAKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC LD WSP++  PV  E+SS  VIDKEEH+QE M
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLDSWSPVKLPPVPRESSSPAVIDKEEHIQEAM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E E++TEDGELPTLLP +SK   S Q +LIS SI+PPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPESITEDGELPTLLPKVSKLGLSAQLNLISKSIVPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLDTESD DEPAQ E+E+E   S+Y ARKSVSWM+ G++EF LV+ RK    
Sbjct: 605  KIDDSSDFLLDTESDLDEPAQIEQEHEK--SNYHARKSVSWMNYGLKEFHLVICRKIGTD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E N+ LEAKI+ISMEYPLRPPLF LS+  +SSGENH    GL+WYNELRAMEAEVNLH+L
Sbjct: 663  ESNLNLEAKIQISMEYPLRPPLFLLSISCISSGENH-DETGLEWYNELRAMEAEVNLHLL 721

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            K+L  +++N VLAHQVSCLAMLFDYYLD+A  SSERTNCTSVVDVGLCKPVSGRFL RSF
Sbjct: 722  KMLTVNQKNFVLAHQVSCLAMLFDYYLDEASPSSERTNCTSVVDVGLCKPVSGRFLGRSF 781

Query: 2449 RGRDRRKMISWKDMNFISN 2505
            RGRDRRKMISWKDM   S+
Sbjct: 782  RGRDRRKMISWKDMKLNSS 800


>ref|XP_014501896.1| THO complex subunit 5A isoform X1 [Vigna radiata var. radiata]
          Length = 804

 Score = 1192 bits (3083), Expect = 0.0
 Identities = 606/801 (75%), Positives = 673/801 (84%), Gaps = 1/801 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+E    HSS+  K EESPYE L NSKAS+ENIVA+ILSIKKD KPKQ L+DL
Sbjct: 9    GSIGGVEEEE----HSSEPHKSEESPYEMLRNSKASVENIVADILSIKKDRKPKQHLQDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
             TQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEK+HY+KAIKAC
Sbjct: 65   ATQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKNHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE+EFFRDAPQ+IK SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEEFFRDAPQDIKDSVLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LE QKK+LL+TIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LELQKKILLQTIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH     ST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDNDNSTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG FQ+HPLKIII+VYED+ SD K +KL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIFQVHPLKIIIYVYEDQHSDSKSSKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN  PENDILCNLFPNDTGLELP QSAKL VQ+AI FNT RTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGSNEGPENDILCNLFPNDTGLELPQQSAKLFVQEAIAFNTLRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGI+FLPEVSPLLLT ++   ++E++ISGLSLYRQQNRV TVL RIR RR        
Sbjct: 425  HLAGIEFLPEVSPLLLTDDSGAVKNENIISGLSLYRQQNRVSTVLKRIRDRRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESL K++WP LSC SVPWA HTPLC LD WSP+R  PV  E+SS  VID+EEHVQEPM
Sbjct: 485  QLESLAKLDWPRLSCSSVPWAFHTPLCNLDSWSPVRLPPVPRESSSPAVIDEEEHVQEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+NERS  TK E E++TEDGELPTLLPN+SK DHS Q +LIS SIIPPLNK+RS SFK
Sbjct: 545  DADVNERSDVTKTEPESITEDGELPTLLPNMSKLDHSTQLNLISKSIIPPLNKIRSHSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K DD++DFLLDTESD DEPAQ E E+EN +S+Y  R SVSWMD G++EF LVL RK +A 
Sbjct: 605  KFDDSSDFLLDTESDIDEPAQTELEHENTLSNYHDRNSVSWMDHGLKEFCLVLCRKISAD 664

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E NV LEAKIKISMEYPLRPPLFALS+  +SSGE      GL+WYNELRAMEAEVNLHIL
Sbjct: 665  ENNVNLEAKIKISMEYPLRPPLFALSVRCISSGEKR-DKLGLEWYNELRAMEAEVNLHIL 723

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSF 2448
            ++LP ++QN+VLAHQVSCLAMLFDYYLD+A+ SSERTNCTSVVDVGLCKPV+GRFL R F
Sbjct: 724  RMLPVNQQNYVLAHQVSCLAMLFDYYLDEAFPSSERTNCTSVVDVGLCKPVTGRFLGRCF 783

Query: 2449 RGRDRRKMISWKDMNFISNYP 2511
            RGRDRRKMISWKDM   S+ P
Sbjct: 784  RGRDRRKMISWKDMKLNSSCP 804


>ref|XP_019414808.1| PREDICTED: THO complex subunit 5A [Lupinus angustifolius]
          Length = 817

 Score = 1186 bits (3069), Expect = 0.0
 Identities = 602/798 (75%), Positives = 670/798 (83%), Gaps = 6/798 (0%)
 Frame = +1

Query: 136  DEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDLVTQMFLHF 315
            DEQG  +S   R ++ESPYE L N+KAS+ENI++E+LSIK++ K K  + +LVTQMFLHF
Sbjct: 21   DEQGSTNSEQQRIIDESPYEILQNTKASVENIISELLSIKREGKSKSHVPELVTQMFLHF 80

Query: 316  ITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKACKDFKSKYP 495
            ITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKACKDFKSKYP
Sbjct: 81   ITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKACKDFKSKYP 140

Query: 496  DIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEKLEQQKKVL 675
            DIDLVPED FFRDAPQ+IK SV+S D+AHNLML RL FEL+QRK LCK HEKLEQQKK L
Sbjct: 141  DIDLVPEDRFFRDAPQDIKDSVVSNDSAHNLMLNRLTFELFQRKGLCKLHEKLEQQKKSL 200

Query: 676  LETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPALYVVYSQL 855
            LETIA RKKFLTSLPSHLKSLKKASLPVQNQLG+LHTK+ KQH +AELLPPALYV+YSQL
Sbjct: 201  LETIANRKKFLTSLPSHLKSLKKASLPVQNQLGVLHTKKLKQHHSAELLPPALYVIYSQL 260

Query: 856  LAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDDEEDGXXXX 1035
            LAQKEAF EPID+E++GSLKDAQ F RNQAH  TGI T ++SSKLEDD PD+EEDG    
Sbjct: 261  LAQKEAFREPIDVEIIGSLKDAQTFARNQAHKDTGIPTNVDSSKLEDDAPDEEEDGQRRR 320

Query: 1036 XXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVKLNIVCVGI 1215
                     +S D+ G FQ+HPLKI IHVY+DEASDPKPAKL+TL+FEY+VKL +VCVGI
Sbjct: 321  KRPKRVQGKDSADRAGVFQVHPLKITIHVYDDEASDPKPAKLITLKFEYLVKLKVVCVGI 380

Query: 1216 EGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQHLAGIDFL 1395
            EG+N   ENDILCNLFPNDTG ELPHQSAKL V D I F++QR SRPYKWAQHLAGIDFL
Sbjct: 381  EGTNDGSENDILCNLFPNDTGNELPHQSAKLFVGDDILFDSQRASRPYKWAQHLAGIDFL 440

Query: 1396 PEVSPLLL-----TVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXXXXE 1560
            PEVSPLLL     + N E AR EDVISGLSLYRQQNRVQTVL RIRSRR          E
Sbjct: 441  PEVSPLLLAGRETSDNGEAARGEDVISGLSLYRQQNRVQTVLQRIRSRRKSQLALLEQLE 500

Query: 1561 SLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPMDVD 1740
            SL+K+EWPLLSC+SVPWA+H+PLC L GWS IR LPV SEASS+ V DKEEHVQE MDVD
Sbjct: 501  SLSKLEWPLLSCKSVPWALHSPLCNLVGWSVIRALPVPSEASSVAVTDKEEHVQESMDVD 560

Query: 1741 LNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFKKVD 1917
            + E  G TK ELE++TEDGELPTL PN+SKFDHS Q SLIS S I PLNKVRSQSFKK D
Sbjct: 561  VIECPGDTKDELESITEDGELPTLRPNMSKFDHSKQLSLISKS-ITPLNKVRSQSFKKSD 619

Query: 1918 DNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAGERN 2097
            DN+DFLLD ESD DEPAQ ERE+E+ +S+YC +KSVSWMD GV+EF LVL RK +A ERN
Sbjct: 620  DNSDFLLDIESDLDEPAQVEREDES-MSNYCVKKSVSWMDYGVKEFSLVLRRKISADERN 678

Query: 2098 VTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHILKIL 2277
            V LEAKIKISMEYPLRPPLFAL L  +SSGENH G DGL+WYNELRAMEAEVNLH+LK+L
Sbjct: 679  VNLEAKIKISMEYPLRPPLFALGLCCISSGENHKGKDGLEWYNELRAMEAEVNLHMLKML 738

Query: 2278 PDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSFRGR 2457
            P+ +QN+VLAHQV CLAMLFDYYL DA+ SSER NCTSV DVGLC P+SGRFL RSFRGR
Sbjct: 739  PNDQQNYVLAHQVCCLAMLFDYYLKDAYPSSERINCTSVFDVGLCNPISGRFLSRSFRGR 798

Query: 2458 DRRKMISWKDMNFISNYP 2511
            D RKMISWKDM F SN P
Sbjct: 799  DHRKMISWKDMKFTSNCP 816


>gb|OIV97645.1| hypothetical protein TanjilG_12402 [Lupinus angustifolius]
          Length = 860

 Score = 1162 bits (3006), Expect = 0.0
 Identities = 592/795 (74%), Positives = 664/795 (83%), Gaps = 8/795 (1%)
 Frame = +1

Query: 136  DEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDLVTQMFLHF 315
            DEQG  +S   R ++ESPYE L N+KAS+ENI++E+LSIK++ K K  + +LVTQMFLHF
Sbjct: 21   DEQGSTNSEQQRIIDESPYEILQNTKASVENIISELLSIKREGKSKSHVPELVTQMFLHF 80

Query: 316  ITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKACKDFKSKYP 495
            ITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKACKDFKSKYP
Sbjct: 81   ITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKACKDFKSKYP 140

Query: 496  DIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEKLEQQKKVL 675
            DIDLVPED FFRDAPQ+IK SV+S D+AHNLML RL FEL+QRK LCK HEKLEQQKK L
Sbjct: 141  DIDLVPEDRFFRDAPQDIKDSVVSNDSAHNLMLNRLTFELFQRKGLCKLHEKLEQQKKSL 200

Query: 676  LETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPALYVVYSQL 855
            LETIA RKKFLTSLPSHLKSLKKASLPVQNQLG+LHTK+ KQH +AELLPPALYV+YSQL
Sbjct: 201  LETIANRKKFLTSLPSHLKSLKKASLPVQNQLGVLHTKKLKQHHSAELLPPALYVIYSQL 260

Query: 856  LAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDDEEDGXXXX 1035
            LAQKEAF EPID+E++GSLKDAQ F RNQAH  TGI T ++SSKLEDD PD+EEDG    
Sbjct: 261  LAQKEAFREPIDVEIIGSLKDAQTFARNQAHKDTGIPTNVDSSKLEDDAPDEEEDGQRRR 320

Query: 1036 XXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVKLNIVCVGI 1215
                     +S D+ G FQ+HPLKI IHVY+DEASDPKPAKL+TL+FEY+VKL +VCVGI
Sbjct: 321  KRPKRVQGKDSADRAGVFQVHPLKITIHVYDDEASDPKPAKLITLKFEYLVKLKVVCVGI 380

Query: 1216 EGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQHLAGIDFL 1395
            EG+N   ENDILCNLFPNDTG ELPHQSAKL V D I F++QR SRPYKWAQHLAGIDFL
Sbjct: 381  EGTNDGSENDILCNLFPNDTGNELPHQSAKLFVGDDILFDSQRASRPYKWAQHLAGIDFL 440

Query: 1396 PEVSPLLL-----TVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXXXXE 1560
            PEVSPLLL     + N E AR EDVISGLSLYRQQNRVQTVL RIRSRR          E
Sbjct: 441  PEVSPLLLAGRETSDNGEAARGEDVISGLSLYRQQNRVQTVLQRIRSRRKSQLALLEQLE 500

Query: 1561 SLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPMDVD 1740
            SL+K+EWPLLSC+SVPWA+H+PLC L GWS IR LPV SEASS+ V DKEEHVQE MDVD
Sbjct: 501  SLSKLEWPLLSCKSVPWALHSPLCNLVGWSVIRALPVPSEASSVAVTDKEEHVQESMDVD 560

Query: 1741 LNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFKKVD 1917
            + E  G TK ELE++TEDGELPTL PN+SKFDHS Q SLIS S I PLNKVRSQSFKK D
Sbjct: 561  VIECPGDTKDELESITEDGELPTLRPNMSKFDHSKQLSLISKS-ITPLNKVRSQSFKKSD 619

Query: 1918 DNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAGERN 2097
            DN+DFLLD ESD DEPAQ ERE+E+ +S+YC +KSVSWMD GV+EF LVL RK +A ERN
Sbjct: 620  DNSDFLLDIESDLDEPAQVEREDES-MSNYCVKKSVSWMDYGVKEFSLVLRRKISADERN 678

Query: 2098 VTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHILKIL 2277
            V LEAKIKISMEYPLRPPLFAL L  +SSGENH G DGL+WYNELRAMEAEVNLH+LK+L
Sbjct: 679  VNLEAKIKISMEYPLRPPLFALGLCCISSGENHKGKDGLEWYNELRAMEAEVNLHMLKML 738

Query: 2278 PDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPV--SGRFLDRSFR 2451
            P+ +QN+VLAHQV CLAMLFDYYL DA+ SSER NCTSV DVGLC P+  SG+FL  +FR
Sbjct: 739  PNDQQNYVLAHQVCCLAMLFDYYLKDAYPSSERINCTSVFDVGLCNPISGSGKFLAATFR 798

Query: 2452 GRDRRKMISWKDMNF 2496
             RD RK +SWKD+ F
Sbjct: 799  VRDHRKAMSWKDVKF 813


>gb|ABC47853.1| expressed protein-like protein [Glycine max]
          Length = 817

 Score = 1148 bits (2970), Expect = 0.0
 Identities = 583/772 (75%), Positives = 656/772 (84%), Gaps = 1/772 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPY+ L NSKAS++NIVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYQMLRNSKASVQNIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
            KDFKSKYPDIDLVPE++FFRDAPQ+I+   LS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  KDFKSKYPDIDLVPEEDFFRDAPQDIQDCFLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTKR KQH +AELLPPA
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKRLKQHHSAELLPPA 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST +ESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTVESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKIIIHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQAKESLDQGGIYQVHPLKIIIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN +PEND+LCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPY+WAQ
Sbjct: 365  LNVVCVGIEGSNDAPENDLLCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYRWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPE+SPLLLT N+  A++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEMSPLLLTDNSGAAKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC LD WSP++  PV  E+SS  VIDKEEH+QE M
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLDSWSPVKLPPVPRESSSPAVIDKEEHIQEAM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E E++TEDGELPTLLP +SK   S Q +LIS SI+PPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPESITEDGELPTLLPKVSKLGLSAQLNLISKSIVPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLDTESD DEPAQ E+E+E   S+Y ARKSVSWM+ G++EF LV+ RK    
Sbjct: 605  KIDDSSDFLLDTESDLDEPAQIEQEHEK--SNYHARKSVSWMNYGLKEFHLVICRKIGTD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHIL 2268
            E N+ LEAKI+ISMEYPLRPPLF LS+  +SSGENH    GL+WYNELRAMEAEVNLH+L
Sbjct: 663  ESNLNLEAKIQISMEYPLRPPLFLLSISCISSGENH-DETGLEWYNELRAMEAEVNLHLL 721

Query: 2269 KILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVS 2424
            K+L  +++N VLAHQVSCLAMLFDYYLD+A  SSERTNCTSVVDVGLCKPVS
Sbjct: 722  KMLTVNQKNFVLAHQVSCLAMLFDYYLDEASPSSERTNCTSVVDVGLCKPVS 773


>ref|XP_015934253.1| THO complex subunit 5A [Arachis duranensis]
 ref|XP_015934254.1| THO complex subunit 5A [Arachis duranensis]
          Length = 807

 Score = 1133 bits (2931), Expect = 0.0
 Identities = 578/798 (72%), Positives = 654/798 (81%), Gaps = 6/798 (0%)
 Frame = +1

Query: 130  EEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDLVTQMFL 309
            EED+QGF+HSS +R  EESPYE L NSK S+ENIV EILSIKK AKPK LLR+L+TQMFL
Sbjct: 15   EEDDQGFSHSSQNRTPEESPYEMLQNSKTSVENIVTEILSIKKHAKPKHLLRELITQMFL 74

Query: 310  HFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKACKDFKSK 489
            HF+TLRQ NRSIL EEDRVKSETERAKAPVD TTLQLHNL+YEKSHY+KAIKAC DFKSK
Sbjct: 75   HFVTLRQANRSILTEEDRVKSETERAKAPVDLTTLQLHNLMYEKSHYVKAIKACNDFKSK 134

Query: 490  YPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEKLEQQKK 669
            YPDI+LVPE+EFF DAP++IK SVLS D AHNLML+RLNFEL+QRKELCK HEKLEQ+KK
Sbjct: 135  YPDIELVPEEEFFHDAPKDIKDSVLSNDAAHNLMLQRLNFELFQRKELCKLHEKLEQKKK 194

Query: 670  VLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPALYVVYS 849
             LLE+IA RKKFLTSLPSHLKSLKKASLPVQNQLG+LHTK+ KQH +AELLPP+LYV+YS
Sbjct: 195  SLLESIANRKKFLTSLPSHLKSLKKASLPVQNQLGVLHTKKLKQHHSAELLPPSLYVIYS 254

Query: 850  QLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDDEEDGXX 1029
            QL AQKEAFGE IDLE++GSLKDAQAF R QAHN TGIS+A ESSKLEDD PD+EED   
Sbjct: 255  QLFAQKEAFGESIDLEIIGSLKDAQAFARQQAHNDTGISSATESSKLEDDAPDEEEDDQR 314

Query: 1030 XXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVKLNIVCV 1209
                       ES DQ G FQ HPLK+IIHVY+DEAS+ K  +L+TL+FEY+VKLNIVCV
Sbjct: 315  RRKRPRRVQGKESLDQAGIFQAHPLKVIIHVYDDEASETKSTRLITLKFEYLVKLNIVCV 374

Query: 1210 GIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQHLAGID 1389
            GIEGSN  PENDILCNLFPNDTGLELPHQSAKL V D  TFN QRTSRPYKWAQHLAG+D
Sbjct: 375  GIEGSNDGPENDILCNLFPNDTGLELPHQSAKLFVGDDATFNNQRTSRPYKWAQHLAGLD 434

Query: 1390 FLPEVSPLLLT-----VNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXXX 1554
            FL EVSPLL T      N+  A+S+DV+SGLSLYRQQNRVQTVL RIRSRR         
Sbjct: 435  FLSEVSPLLHTGHGTSDNSGAAKSDDVVSGLSLYRQQNRVQTVLQRIRSRRKAHLALLEQ 494

Query: 1555 XESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPMD 1734
             ESLTK+EWP LSC+SVPWA+H PLC L+ WSPIR + V   AS+  ++DK+EH  E MD
Sbjct: 495  LESLTKLEWPFLSCKSVPWALHAPLCNLNVWSPIRAIDVPRVASTPALMDKDEHAPEAMD 554

Query: 1735 VDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFKK 1911
            +D  E S  TK ELE++TEDGELPTL+PN+SK DHS Q +LIS S+ P L  VRSQSFKK
Sbjct: 555  IDRAEHSVATKEELESITEDGELPTLIPNMSKLDHSKQITLISKSVTPSLINVRSQSFKK 614

Query: 1912 VDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAGE 2091
             +       + ESD DEPAQ E+E+E+I S +  RKSVSWMD GV+EF LVLSRK +A E
Sbjct: 615  FE-------EIESDIDEPAQIEQEDEDIESYHYGRKSVSWMDCGVKEFSLVLSRKFSADE 667

Query: 2092 RNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVNLHILK 2271
             NV LEAKI ISMEYPLRPPLFALSL  LS+G +H+ NDGL+WYNELRAMEAEVNLHILK
Sbjct: 668  SNVNLEAKINISMEYPLRPPLFALSLRCLSTGGDHYQNDGLEWYNELRAMEAEVNLHILK 727

Query: 2272 ILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVSGRFLDRSFR 2451
            +LP  +QN+VLAHQV CLAMLFD+YLDDA  +SERT+ T++VDVGLCKPV+G FL RSFR
Sbjct: 728  MLPVDQQNYVLAHQVRCLAMLFDHYLDDASPTSERTSSTTLVDVGLCKPVTGSFLARSFR 787

Query: 2452 GRDRRKMISWKDMNFISN 2505
            GRDRRKMISWK   F S+
Sbjct: 788  GRDRRKMISWKGTKFTSS 805


>ref|XP_016163247.1| LOW QUALITY PROTEIN: THO complex subunit 5A [Arachis ipaensis]
          Length = 816

 Score = 1125 bits (2910), Expect = 0.0
 Identities = 577/804 (71%), Positives = 653/804 (81%), Gaps = 15/804 (1%)
 Frame = +1

Query: 130  EEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDLVTQMFL 309
            EED+QG +HSS +R  EESPYE L NSK S+ENIV EILSIKK AKPK LLR+L+TQMFL
Sbjct: 15   EEDDQGSSHSSQNRTPEESPYEMLQNSKTSVENIVTEILSIKKHAKPKHLLRELITQMFL 74

Query: 310  HFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKACKDFKSK 489
            HF+TLRQ NRSIL EEDRVKSETERAKAPVD TTLQLHNL+YEKSHY+KAIKAC DFKSK
Sbjct: 75   HFVTLRQANRSILTEEDRVKSETERAKAPVDLTTLQLHNLMYEKSHYVKAIKACNDFKSK 134

Query: 490  YPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEKLEQQKK 669
            YPDI+LVPE+EFF DAP++IK SVLS D AHNLML+RLNFEL+QRKELCK HEKLEQ+KK
Sbjct: 135  YPDIELVPEEEFFHDAPKDIKDSVLSNDAAHNLMLQRLNFELFQRKELCKLHEKLEQKKK 194

Query: 670  VLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPALYVVYS 849
             LLE+IA RKKFLTSLPSHLKSLKKASLPVQNQLG+LHTK+ KQH +AELLPP+LYV+YS
Sbjct: 195  SLLESIANRKKFLTSLPSHLKSLKKASLPVQNQLGVLHTKKLKQHHSAELLPPSLYVIYS 254

Query: 850  QLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDDEEDGXX 1029
            QL AQKEAFGE IDLE++GSLKDAQAF R QAHN TGIS+A ESSKLEDD PD+E+D   
Sbjct: 255  QLFAQKEAFGESIDLEIIGSLKDAQAFARQQAHNDTGISSATESSKLEDDAPDEEDDDQR 314

Query: 1030 XXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVKLNIVCV 1209
                       ES DQ G FQ HPLK+IIHVY+DEASD K A+L+TL+FEY+VKLNIVCV
Sbjct: 315  RRKRPRRVQGKESLDQAGIFQAHPLKVIIHVYDDEASDTKSARLITLKFEYLVKLNIVCV 374

Query: 1210 GIEGSNGSPENDILCNLFPNDTGLELPHQ---------SAKLLVQDAITFNTQRTSRPYK 1362
            GIEGSN  PENDILCNLFPNDTGLELPHQ         SAKL V D  TFN QRTSRPYK
Sbjct: 375  GIEGSNDGPENDILCNLFPNDTGLELPHQVRLPSLHMXSAKLFVGDDATFNNQRTSRPYK 434

Query: 1363 WAQHLAGIDFLPEVSPLLLT-----VNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRR 1527
            WAQHLAG+DFL EVSPLL T      N+  A+S+DV+SGLSLYRQQNRVQTVL RIRSRR
Sbjct: 435  WAQHLAGLDFLSEVSPLLHTGHGTSDNSGAAKSDDVVSGLSLYRQQNRVQTVLQRIRSRR 494

Query: 1528 XXXXXXXXXXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDK 1707
                      ESLTK+EWP LSC+SVPWA+H PLC L+ WSPIR + V   AS+  ++DK
Sbjct: 495  KAHLALLEQLESLTKLEWPFLSCKSVPWALHAPLCNLNVWSPIRAVDVPRVASTPALMDK 554

Query: 1708 EEHVQEPMDVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLN 1884
            +EH  E MD+D+ E S  TK ELE++TEDGELPTL+PN+SK DHS Q +LIS S+ P L 
Sbjct: 555  DEHAPEAMDIDIAEHSAATKEELESITEDGELPTLIPNMSKLDHSKQITLISKSVTPSLI 614

Query: 1885 KVRSQSFKKVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLV 2064
             VRSQSFKK +       + ESD DEPAQ E+E+E+I S +  RKSVSWMD GV+EF LV
Sbjct: 615  NVRSQSFKKFE-------EIESDIDEPAQIEQEDEDIESYHYGRKSVSWMDCGVKEFSLV 667

Query: 2065 LSRKSNAGERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAME 2244
            LSRK +A E NV LEAKI ISMEYPLRPPLFALSL  LS+G +H+ NDGL+WYNELRAME
Sbjct: 668  LSRKFSADESNVNLEAKINISMEYPLRPPLFALSLRCLSTGGDHYQNDGLEWYNELRAME 727

Query: 2245 AEVNLHILKILPDSEQNHVLAHQVSCLAMLFDYYLDDAWSSSERTNCTSVVDVGLCKPVS 2424
            AEVNLHILK+LP  +QN+VLAHQV CLAMLFD+YLDDA  +SERT+ T++VDVGLCKPV+
Sbjct: 728  AEVNLHILKMLPVDQQNYVLAHQVRCLAMLFDHYLDDASPTSERTSSTTLVDVGLCKPVT 787

Query: 2425 GRFLDRSFRGRDRRKMISWKDMNF 2496
            G FL RSFRGRDRRKMISWK   F
Sbjct: 788  GSFLARSFRGRDRRKMISWKGTKF 811


>gb|PNY16156.1| THO complex subunit 5 B [Trifolium pratense]
          Length = 792

 Score = 1115 bits (2884), Expect = 0.0
 Identities = 570/725 (78%), Positives = 626/725 (86%), Gaps = 5/725 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFA---HSSDDRKLEE-SPYETLHNSKASIENIVAEILSIKKDAKPKQL 279
            GS PMAEE++Q      HSS+D + EE SPYETLHNSK+SIE+I+++ILSIKK AKPKQL
Sbjct: 9    GSFPMAEEEQQQQTFPPHSSEDNQSEEESPYETLHNSKSSIESIISDILSIKKQAKPKQL 68

Query: 280  LRDLVTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKA 459
            LRDLVTQMFLHFITLRQ NRSIL+EEDRVK ETERAKAPVDFTTLQLHNLVYEKSHYLKA
Sbjct: 69   LRDLVTQMFLHFITLRQANRSILIEEDRVKMETERAKAPVDFTTLQLHNLVYEKSHYLKA 128

Query: 460  IKACKDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCK 639
            IKACKDFKSKYPDIDLVPE+EFFRDAP++IK SVLSKDNAHNLMLKRLNFELYQRKELCK
Sbjct: 129  IKACKDFKSKYPDIDLVPEEEFFRDAPKDIKDSVLSKDNAHNLMLKRLNFELYQRKELCK 188

Query: 640  HHEKLEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAEL 819
            HH KLEQ+KK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLG++HTK+ KQH +AEL
Sbjct: 189  HHAKLEQKKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGMMHTKKLKQHHSAEL 248

Query: 820  LPPALYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDD 999
            LPPALYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF RNQA+  TGIST MESSKLEDD
Sbjct: 249  LPPALYVIYSQLLAQKEAFVEPIDLEIVGSLKDAQAFARNQANKDTGISTVMESSKLEDD 308

Query: 1000 VPDDEEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFE 1179
            V DDEEDG             E+ DQGG F+ HPLKIIIHVYEDE+SDPKPAKL+TL+FE
Sbjct: 309  VHDDEEDGQRRRKRPRRVEVKENHDQGGLFKSHPLKIIIHVYEDESSDPKPAKLITLKFE 368

Query: 1180 YMVKLNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPY 1359
            Y+VKLNIVCVGIEGS+  P+NDILCNLFPNDTGLELPHQSAKL VQD  TFNTQRTSRPY
Sbjct: 369  YLVKLNIVCVGIEGSSDGPDNDILCNLFPNDTGLELPHQSAKLFVQDVTTFNTQRTSRPY 428

Query: 1360 KWAQHLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXX 1539
            KWAQHLAGIDFLPEVSPLL        +SEDV+SGLSLYRQQNRVQT+L RIRSRR    
Sbjct: 429  KWAQHLAGIDFLPEVSPLL-----PADKSEDVVSGLSLYRQQNRVQTILQRIRSRRKAQL 483

Query: 1540 XXXXXXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHV 1719
                  ESLTK+EWP+LSC+SVPWA+HTPLCKLDGWS IR LPV  EASS  +IDKEEHV
Sbjct: 484  ALLEQLESLTKLEWPVLSCKSVPWALHTPLCKLDGWSSIRALPVPGEASSPNIIDKEEHV 543

Query: 1720 QEPMDVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRS 1896
            QE MDVD+ E SG TK E +++TEDGELPTLLP  +KFDHS QASLIS SI P LNKVRS
Sbjct: 544  QESMDVDVIENSGVTKEEPDSMTEDGELPTLLPKTTKFDHSTQASLISKSITPSLNKVRS 603

Query: 1897 QSFKKVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRK 2076
            QSFKKVDD++DFLLDT+SDFDEPAQ E E+ENI+S+Y ARKS+SWM+SGV+EFLLVLS K
Sbjct: 604  QSFKKVDDSSDFLLDTDSDFDEPAQIESEHENIISEYLARKSLSWMESGVKEFLLVLSSK 663

Query: 2077 SNAGERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVN 2256
            +N  ERNV LEAKIKISMEYPLRPP FALSL    SGENH  NDGL+WYNELRA+EAE+ 
Sbjct: 664  TNVVERNVNLEAKIKISMEYPLRPPHFALSLCCKPSGENHLENDGLEWYNELRAIEAEMR 723

Query: 2257 LHILK 2271
             H LK
Sbjct: 724  GHHLK 728


>dbj|GAU42192.1| hypothetical protein TSUD_305370 [Trifolium subterraneum]
          Length = 724

 Score = 1108 bits (2867), Expect = 0.0
 Identities = 568/718 (79%), Positives = 621/718 (86%), Gaps = 5/718 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQ---GFAHSSDDRKLE-ESPYETLHNSKASIENIVAEILSIKKDAKPKQL 279
            GSIPMAEE++Q    F HSS+D   E ESPYETLHNSK+SIE+I+++ILSIKK AKPKQ 
Sbjct: 9    GSIPMAEEEQQQQQSFPHSSEDNHSEGESPYETLHNSKSSIESIISDILSIKKQAKPKQQ 68

Query: 280  LRDLVTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKA 459
            LRDLVTQMFLHFITLRQ NRSIL+EEDRVK ETERAKAPVDFTTLQLHNLVYEKSHYLKA
Sbjct: 69   LRDLVTQMFLHFITLRQANRSILIEEDRVKMETERAKAPVDFTTLQLHNLVYEKSHYLKA 128

Query: 460  IKACKDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCK 639
            IKACKDFKSK+PDI+LVPE+EFFR AP++IK SVLSKD+AHNLMLKRLNFELYQRKELCK
Sbjct: 129  IKACKDFKSKFPDIELVPEEEFFRGAPKDIKDSVLSKDSAHNLMLKRLNFELYQRKELCK 188

Query: 640  HHEKLEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAEL 819
            HH KLEQ+KK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLG++HTK+ KQH +AEL
Sbjct: 189  HHAKLEQKKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGMMHTKKLKQHHSAEL 248

Query: 820  LPPALYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDD 999
            LPPALYV+YSQLLAQKEAF EPIDLE+VGSLKDAQAF RNQA+  TGIST MESSKLEDD
Sbjct: 249  LPPALYVIYSQLLAQKEAFVEPIDLEIVGSLKDAQAFARNQANKDTGISTVMESSKLEDD 308

Query: 1000 VPDDEEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFE 1179
            V DDEEDG             E+ DQGG F+ HPLKIIIHVYEDE+SDPKPAKL+TL+FE
Sbjct: 309  VHDDEEDGQRRRKRPRRVEVKENHDQGGLFKSHPLKIIIHVYEDESSDPKPAKLITLKFE 368

Query: 1180 YMVKLNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPY 1359
            YMVKLNIVCVGIEGSN  P+NDILCNLFPNDTGLELPHQSAKL VQD  TFN QRTSRPY
Sbjct: 369  YMVKLNIVCVGIEGSNDGPDNDILCNLFPNDTGLELPHQSAKLFVQDVTTFNAQRTSRPY 428

Query: 1360 KWAQHLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXX 1539
            KWAQHLAGIDFLPEVSPLL        +SEDV+SGLSLYRQQNRVQTVL RIRSRR    
Sbjct: 429  KWAQHLAGIDFLPEVSPLL-----PADKSEDVVSGLSLYRQQNRVQTVLQRIRSRRKAQL 483

Query: 1540 XXXXXXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHV 1719
                  ESLTK+EWPLLSC+SVPWA+HTPLCKL+GWS IR LP   EASS T+IDKEEHV
Sbjct: 484  ALLEQLESLTKLEWPLLSCKSVPWALHTPLCKLEGWSAIRALP--GEASSPTIIDKEEHV 541

Query: 1720 QEPMDVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRS 1896
            QE MDVD+ E SG TK E +++ EDGELPTLLP  +KFDHS QASLIS +IIP LNKVRS
Sbjct: 542  QESMDVDVIENSGVTKEEPDSMAEDGELPTLLPKTTKFDHSTQASLISKTIIPSLNKVRS 601

Query: 1897 QSFKKVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRK 2076
            QSFKKVDD++DFLLDT+SDFDEPAQ E E+ENI+SDY ARKS+SWMDSGV+EFLLVLS K
Sbjct: 602  QSFKKVDDSSDFLLDTDSDFDEPAQIESEHENIISDYLARKSLSWMDSGVKEFLLVLSSK 661

Query: 2077 SNAGERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAE 2250
            +N  ERNV LEAKIKISMEYPLRPP FALSL    SGENHF NDGL+WYNELRA+EAE
Sbjct: 662  TNVVERNVNLEAKIKISMEYPLRPPHFALSLCSKPSGENHFENDGLEWYNELRAIEAE 719


>ref|XP_014632732.1| PREDICTED: THO complex subunit 5A-like isoform X2 [Glycine max]
 gb|KRH52006.1| hypothetical protein GLYMA_06G040600 [Glycine max]
 gb|KRH52007.1| hypothetical protein GLYMA_06G040600 [Glycine max]
          Length = 724

 Score = 1058 bits (2735), Expect = 0.0
 Identities = 541/716 (75%), Positives = 603/716 (84%), Gaps = 1/716 (0%)
 Frame = +1

Query: 112  GSIPMAEEDEQGFAHSSDDRKLEESPYETLHNSKASIENIVAEILSIKKDAKPKQLLRDL 291
            GSI   EE+EQ    S +  K EESPYE L NSKAS+E+IVA++LSIKK+ KPKQLLRDL
Sbjct: 9    GSIGGVEEEEQ----SPEPHKSEESPYEMLRNSKASVESIVADMLSIKKEGKPKQLLRDL 64

Query: 292  VTQMFLHFITLRQTNRSILLEEDRVKSETERAKAPVDFTTLQLHNLVYEKSHYLKAIKAC 471
            VTQMFLHFITLRQ NRSILLEEDRVK+ETERAKAPVDFTTLQLHNL+YEKSHY+KAIKAC
Sbjct: 65   VTQMFLHFITLRQANRSILLEEDRVKTETERAKAPVDFTTLQLHNLMYEKSHYVKAIKAC 124

Query: 472  KDFKSKYPDIDLVPEDEFFRDAPQNIKGSVLSKDNAHNLMLKRLNFELYQRKELCKHHEK 651
             DFKSKYPDIDLVPE++FFRDAPQ+I+ SVLS D AHNLML+RLNFEL+QRKELCK HEK
Sbjct: 125  NDFKSKYPDIDLVPEEDFFRDAPQDIQDSVLSNDAAHNLMLRRLNFELFQRKELCKLHEK 184

Query: 652  LEQQKKVLLETIATRKKFLTSLPSHLKSLKKASLPVQNQLGLLHTKRTKQHSAAELLPPA 831
            LEQQKK+LLETIA RKKFLTSLPSHLKSLKKASLPVQNQLGL HTK+ KQH +AELLPP 
Sbjct: 185  LEQQKKILLETIANRKKFLTSLPSHLKSLKKASLPVQNQLGLHHTKKLKQHHSAELLPPG 244

Query: 832  LYVVYSQLLAQKEAFGEPIDLEVVGSLKDAQAFTRNQAHNATGISTAMESSKLEDDVPDD 1011
            LYV+YSQLLAQKEAFGEPIDLE++GSLKDAQAF R QAH  T IST MESSKLEDD PD+
Sbjct: 245  LYVIYSQLLAQKEAFGEPIDLEIIGSLKDAQAFARQQAHKDTDISTTMESSKLEDDAPDE 304

Query: 1012 EEDGXXXXXXXXXXXXXESPDQGGAFQIHPLKIIIHVYEDEASDPKPAKLVTLRFEYMVK 1191
            EEDG             ES DQGG +Q+HPLKI+IHVYEDEAS PK AKL+TLRFEY+VK
Sbjct: 305  EEDGQRRRKRPRRVQTKESLDQGGLYQVHPLKIVIHVYEDEASGPKSAKLITLRFEYLVK 364

Query: 1192 LNIVCVGIEGSNGSPENDILCNLFPNDTGLELPHQSAKLLVQDAITFNTQRTSRPYKWAQ 1371
            LN+VCVGIEGSN  PENDILCNLFPNDTGLELPHQSAKL VQDAITFNTQRTSRPYKWAQ
Sbjct: 365  LNVVCVGIEGSNDGPENDILCNLFPNDTGLELPHQSAKLFVQDAITFNTQRTSRPYKWAQ 424

Query: 1372 HLAGIDFLPEVSPLLLTVNNETARSEDVISGLSLYRQQNRVQTVLHRIRSRRXXXXXXXX 1551
            HLAGIDFLPEVS LLLT N+   ++E+VISGLSLYRQQNRV TVL RIR+RR        
Sbjct: 425  HLAGIDFLPEVSRLLLTDNSGAVKNENVISGLSLYRQQNRVPTVLQRIRARRKAQLALLE 484

Query: 1552 XXESLTKVEWPLLSCQSVPWAVHTPLCKLDGWSPIRTLPVLSEASSLTVIDKEEHVQEPM 1731
              ESLTK+EWP L C+SVPWA+HTPLC L  WSP+R  PVL E+SS  VIDKEEH+QEPM
Sbjct: 485  QLESLTKLEWPRLPCKSVPWALHTPLCNLGSWSPVRLPPVLRESSSPAVIDKEEHIQEPM 544

Query: 1732 DVDLNERSGPTK-ELETVTEDGELPTLLPNISKFDHSMQASLISTSIIPPLNKVRSQSFK 1908
            D D+ ERSG TK E +++TEDGELPTLLP +SK D S Q +LIS SIIPPLNK+RSQSFK
Sbjct: 545  DADVIERSGATKAEPQSITEDGELPTLLPKVSKLDLSAQLNLISKSIIPPLNKIRSQSFK 604

Query: 1909 KVDDNTDFLLDTESDFDEPAQNERENENIVSDYCARKSVSWMDSGVQEFLLVLSRKSNAG 2088
            K+DD++DFLLD ESD DEPAQ E+E+E   S+Y ARKS SWM+ G++EF LVL RK +A 
Sbjct: 605  KIDDSSDFLLDIESDIDEPAQIEQEHEK--SNYHARKSGSWMNYGLKEFRLVLCRKISAD 662

Query: 2089 ERNVTLEAKIKISMEYPLRPPLFALSLHGLSSGENHFGNDGLDWYNELRAMEAEVN 2256
            E  + LEAKIKISMEYPLRPPLFALS+  +SSGENH    GL+WYNELRAMEA V+
Sbjct: 663  ESKLNLEAKIKISMEYPLRPPLFALSISCISSGENH-DETGLEWYNELRAMEAAVS 717


Top