BLASTX nr result

ID: Rehmannia26_contig00003762 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00003762
         (3101 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solan...  1079   0.0  
ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solan...  1072   0.0  
gb|EOY01325.1| THO complex subunit 2 isoform 1 [Theobroma cacao]     1036   0.0  
gb|EOY01326.1| THO complex subunit 2 isoform 2 [Theobroma cacao]     1032   0.0  
gb|EOY01328.1| THO complex subunit 2 isoform 4 [Theobroma cacao]     1031   0.0  
gb|EOY01327.1| THO2 isoform 3 [Theobroma cacao]                      1009   0.0  
ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis...  1008   0.0  
ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citr...  1005   0.0  
ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citru...  1005   0.0  
ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glyci...   994   0.0  
gb|EMJ18294.1| hypothetical protein PRUPE_ppa000084mg [Prunus pe...   993   0.0  
gb|ESW32460.1| hypothetical protein PHAVU_002G324500g [Phaseolus...   992   0.0  
ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isofor...   988   0.0  
gb|EOY01329.1| THO complex subunit 2 isoform 5 [Theobroma cacao]      987   0.0  
ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucum...   984   0.0  
ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isofor...   970   0.0  
emb|CBI26799.3| unnamed protein product [Vitis vinifera]              937   0.0  
ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi...   931   0.0  
ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi...   926   0.0  
ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fraga...   898   0.0  

>ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solanum tuberosum]
          Length = 1859

 Score = 1079 bits (2791), Expect = 0.0
 Identities = 595/972 (61%), Positives = 668/972 (68%), Gaps = 62/972 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G S KPI+W +LLDT+KTMLP KAWNSLSPDLYATFWGLTLYDL+VPR+RYESEI     
Sbjct: 898  GSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKKDKERIQESLDRLT E Q+HEEHV SVRRRL  EKDTWLS
Sbjct: 958  ALKALEELSDNSSSAITKRKKDKERIQESLDRLTTELQRHEEHVTSVRRRLTREKDTWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FVNTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYEVGRLGRFL+ETLKTAY+WK DESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALI+LTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++K A  PASKS + N+V + N
Sbjct: 1198 IKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAAPASKSSAGNSVAIPN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEE-------------- 1258
            G+G SVSQ E   GRTV +G +    G L R    +   +L Q ++              
Sbjct: 1258 GSGASVSQGEPSIGRTVVAGIV--VDGKLDRPDSSMPKPDLGQTKQKGSQSINGLDVQSM 1315

Query: 1259 -SANKQSEENSK--------------VTKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKD 1393
             SA  QS+  S+               +K   E E R   KR+  AGSL+KQ K D++KD
Sbjct: 1316 PSATLQSDTPSQNSTCRPLEESTIKAASKMSGEQEGRATGKRATPAGSLSKQQKHDIAKD 1375

Query: 1394 EDKSGKAVGR----------------------XXXXXXXXXXXXXXXXKLTNSSTRSSDH 1507
             DKSGKAVGR                                      K   S TR  D 
Sbjct: 1376 -DKSGKAVGRASGAASGDVSYPSESRASGSVNVSTTVSGNGSMFSAAPKGAASLTRLLDP 1434

Query: 1508 NTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHK 1687
            + E  AE+T +KS+D RV  GKD+ +E +D HK+ T R   SPR +    ASK+ +K  K
Sbjct: 1435 SNESNAELTTTKSADLRVSAGKDDVSESSDVHKESTLRLVHSPRHD----ASKANEKVQK 1490

Query: 1688 RLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDD-- 1861
            R  PAEE DRLNKRRKGEID RDI+  + R SEKER  D RA DKLH A +D+ GSDD  
Sbjct: 1491 RSIPAEELDRLNKRRKGEIDGRDIECGDARSSEKERLIDARAADKLHPADYDRHGSDDQI 1550

Query: 1862 ------KPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDR 2023
                  KPLDR+K+K G               +SRGDD   EK RDRS ERHGRERS++R
Sbjct: 1551 LNRASEKPLDRSKDKGGERLERDPRERGDRPDRSRGDDAF-EKSRDRSTERHGRERSIER 1609

Query: 2024 VQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            V ER ADRNFDRL+KDER KDDRSK+R+ EAS+EKS  DDR                   
Sbjct: 1610 VHERVADRNFDRLSKDERIKDDRSKLRHSEASVEKSLTDDRLYNQNLPPPPPLPPHLVPQ 1669

Query: 2204 SVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXX 2383
            S+++GRRD+D+DRRFG ARH+Q+LSP            N + LQ                
Sbjct: 1670 SINAGRRDDDSDRRFGTARHSQRLSPRHDERERRRSEENNTLLQ-DDLKRRREDDFRDRK 1728

Query: 2384 XXXXALSIKMD--ERERDKANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPAPPPVS 2557
                 LSIK++  ERER+KA + KED+D NASKRRKLKREHM SEPGEY PA+  PPP+S
Sbjct: 1729 REERELSIKVEEREREREKAILVKEDMDPNASKRRKLKREHMASEPGEYSPAA-HPPPLS 1787

Query: 2558 INLLQSHDGRDRGDRKGVIV-QRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDK 2734
            IN+ Q  DGRDRG+RKGVIV QRPGY ++PGLR H KE+ASKA RRDAD MYDREWDDDK
Sbjct: 1788 INMTQPSDGRDRGERKGVIVQQRPGYLDEPGLRIHGKESASKAPRRDADSMYDREWDDDK 1847

Query: 2735 RQRAEPKRRHRK 2770
            RQRAEPKRRHRK
Sbjct: 1848 RQRAEPKRRHRK 1859


>ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solanum lycopersicum]
          Length = 1858

 Score = 1072 bits (2773), Expect = 0.0
 Identities = 596/972 (61%), Positives = 663/972 (68%), Gaps = 62/972 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G S KPI+W +LLDT+KTMLP KAWNSLSPDLYATFWGLTLYDL+VPR+RYESEI     
Sbjct: 898  GSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKKDKERIQESLDRLT E Q+HEEHV SVRRRL  EKDTWLS
Sbjct: 958  ALKALEELSDNSSSAITKRKKDKERIQESLDRLTAELQRHEEHVTSVRRRLTREKDTWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FVNTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYEVGRLGRFL+ETLKTAY+WK DESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALI+LTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++K A  PASKS + N+V + N
Sbjct: 1198 IKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAVPASKSSAGNSVAIAN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEE-------------- 1258
            G+G SVSQ E   GRTV +G +    G L R    +   +L Q +               
Sbjct: 1258 GSGASVSQGEPSIGRTVVAGRV--VDGKLDRPDSSMPKPDLGQAKHKGSQSINGLDVQSM 1315

Query: 1259 -SANKQSEENSK--------------VTKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKD 1393
             SA  QS+  S+               +K   E E R   KRS   GSL+KQ K D++KD
Sbjct: 1316 PSATLQSDTPSQNSMCRPLEESTIKAASKMSGEQEGRGTGKRSTPVGSLSKQQKHDIAKD 1375

Query: 1394 EDKSGKAVGR----------------------XXXXXXXXXXXXXXXXKLTNSSTRSSDH 1507
            E KSGK VGR                                      K     TR  D 
Sbjct: 1376 E-KSGKTVGRASGAASGDVSYPSESRASGSVNVSTTVSGNGSMFSAAPKGAAPLTRLLDP 1434

Query: 1508 NTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHK 1687
            + E  AE T +KS+D RV  GKD+ TE +D HK+ T R   SPRQ+    ASK+ +K  K
Sbjct: 1435 SNESNAEHTTTKSADLRVSAGKDDVTESSDVHKESTLRLVHSPRQD----ASKANEKVQK 1490

Query: 1688 RLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDD-- 1861
            R  PAEE DRLNKRRKGEID RD + ++ R SEKE   D RA DKLH A +DK GSDD  
Sbjct: 1491 RSIPAEELDRLNKRRKGEIDGRDTECADARSSEKEWLIDARAADKLHPADYDKHGSDDQI 1550

Query: 1862 ------KPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDR 2023
                  KPLDR+KEK G               +SRGDD   EK RDRS ERHGRERS++R
Sbjct: 1551 LNRASEKPLDRSKEKGGERPERDPRERGDRPDRSRGDDAF-EKSRDRSTERHGRERSIER 1609

Query: 2024 VQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            V ER ADRNFDRL+KDER KDDRSK+R+ EAS+EKS  DDRF                  
Sbjct: 1610 VHERVADRNFDRLSKDERIKDDRSKLRHNEASVEKSLTDDRFHNQNLPPPPPLPPHLVPQ 1669

Query: 2204 SVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXX 2383
            S+S+GRR++D+DRRFG ARH+Q+LSP            N + LQ                
Sbjct: 1670 SISAGRREDDSDRRFGTARHSQRLSPRHDERERRRSEENNALLQ-DDLKRRREDDFRDRK 1728

Query: 2384 XXXXALSIKMD--ERERDKANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPAPPPVS 2557
                 LSIK++  ERER+KA + KED+D NASKRRKLKREHM SEPGEY PA  A PP+S
Sbjct: 1729 REERELSIKVEEREREREKAILVKEDMDPNASKRRKLKREHMASEPGEYSPA--AHPPLS 1786

Query: 2558 INLLQSHDGRDRGDRKGVIV-QRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDK 2734
            IN+ Q  DGRDRG+RKGVIV QRPGY ++PGLR H KE+ASKA RRDAD MYDREWDDDK
Sbjct: 1787 INMTQPSDGRDRGERKGVIVQQRPGYLDEPGLRIHGKESASKAPRRDADSMYDREWDDDK 1846

Query: 2735 RQRAEPKRRHRK 2770
            RQRAEPKRRHRK
Sbjct: 1847 RQRAEPKRRHRK 1858


>gb|EOY01325.1| THO complex subunit 2 isoform 1 [Theobroma cacao]
          Length = 1853

 Score = 1036 bits (2680), Expect = 0.0
 Identities = 564/976 (57%), Positives = 656/976 (67%), Gaps = 66/976 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KPI W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI     
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNSSSAI KRKKDKERIQE+LDRLT E  KHEE+V SVRRRL +EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RK SWVTDEEFGMGYL++KPA   ASKSL+ N V +QN
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234
            G+ ++VSQ+E  G R V+ G+  SD                     + +LG+   +  G 
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317

Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336
                                     +N KQ++ES+NK  E  +KV  K   E E++   K
Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377

Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516
            RSA AGSL K  KQD  KD+ KSGKAVGR                 + + +       T 
Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRD-------VPSHTEGRQGGTTN 1430

Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696
            + + +T++ ++ S    GKD+G+E  DA + P+SR   SPR ++    SKS DK  KR +
Sbjct: 1431 VPSAVTSNGNAVSAPPKGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1489

Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858
            P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D +  D      FDK G+D      
Sbjct: 1490 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1542

Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032
              DKPLDR+K+K                 KSR DD+L+EK RDRS+ER+GRERSV    E
Sbjct: 1543 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1598

Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            R  DRN +RL   AKDER+KD+RSKVRY + S EKSHVDDRF                  
Sbjct: 1599 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1658

Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380
            SV++ GRRD+D DRRFG+ RH+Q+LSP            N+   Q               
Sbjct: 1659 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1718

Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542
                  LS+K++ER+RD      KA++ KED+D N +KRRKLKREH+PSEPGEY P +P 
Sbjct: 1719 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1778

Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722
            PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK  RRD DPMYDREW
Sbjct: 1779 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1837

Query: 2723 DDDKRQRAEPKRRHRK 2770
            DD+KRQR EPKRRHRK
Sbjct: 1838 DDEKRQRPEPKRRHRK 1853


>gb|EOY01326.1| THO complex subunit 2 isoform 2 [Theobroma cacao]
          Length = 1844

 Score = 1032 bits (2669), Expect = 0.0
 Identities = 567/976 (58%), Positives = 654/976 (67%), Gaps = 66/976 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KPI W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI     
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNSSSAI KRKKDKERIQE+LDRLT E  KHEE+V SVRRRL +EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RK SWVTDEEFGMGYL++KPA   ASKSL+ N V +QN
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234
            G+ ++VSQ+E  G R V+ G+  SD                     + +LG+   +  G 
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317

Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336
                                     +N KQ++ES+NK  E  +KV  K   E E++   K
Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377

Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516
            RSA AGSL K  KQD  KD+ KSGKAVGR                 +T    R    +TE
Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRT---------------SVTCVIDRDVPSHTE 1422

Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696
             +   T +  S +    GKD+G+E  DA + P+SR   SPR ++    SKS DK  KR +
Sbjct: 1423 GRQGGTTNVPS-AVTSNGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1480

Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858
            P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D +  D      FDK G+D      
Sbjct: 1481 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1533

Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032
              DKPLDR+K+K                 KSR DD+L+EK RDRS+ER+GRERSV    E
Sbjct: 1534 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1589

Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            R  DRN +RL   AKDER+KD+RSKVRY + S EKSHVDDRF                  
Sbjct: 1590 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1649

Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380
            SV++ GRRD+D DRRFG+ RH+Q+LSP            N+   Q               
Sbjct: 1650 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1709

Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542
                  LS+K++ER+RD      KA++ KED+D N +KRRKLKREH+PSEPGEY P +P 
Sbjct: 1710 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1769

Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722
            PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK  RRD DPMYDREW
Sbjct: 1770 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1828

Query: 2723 DDDKRQRAEPKRRHRK 2770
            DD+KRQR EPKRRHRK
Sbjct: 1829 DDEKRQRPEPKRRHRK 1844


>gb|EOY01328.1| THO complex subunit 2 isoform 4 [Theobroma cacao]
          Length = 1831

 Score = 1031 bits (2666), Expect = 0.0
 Identities = 564/976 (57%), Positives = 649/976 (66%), Gaps = 66/976 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KPI W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI     
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNSSSAI KRKKDKERIQE+LDRLT E  KHEE+V SVRRRL +EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RK SWVTDEEFGMGYL++KPA   ASKSL+ N V +QN
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234
            G+ ++VSQ+E  G R V+ G+  SD                     + +LG+   +  G 
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317

Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336
                                     +N KQ++ES+NK  E  +KV  K   E E++   K
Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377

Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516
            RSA AGSL K  KQD  KD+ KSGKAVGR                             T 
Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGR-----------------------------TS 1408

Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696
            +   I     S +    GKD+G+E  DA  +P+SR   SPR ++    SKS DK  KR +
Sbjct: 1409 VTCVIDRDVPSHTEGRQGKDDGSELPDA-SRPSSRIVHSPRHDSSATVSKSSDKLQKRTT 1467

Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858
            P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D +      +A FDK G+D      
Sbjct: 1468 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQ------LADFDKPGTDELTSHR 1520

Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032
              DKPLDR+K+K                 KSR DD+L+EK RDRS+ER+GRERSV    E
Sbjct: 1521 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1576

Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            R  DRN +RL   AKDER+KD+RSKVRY + S EKSHVDDRF                  
Sbjct: 1577 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1636

Query: 2204 SV-SSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380
            SV ++GRRD+D DRRFG+ RH+Q+LSP            N+   Q               
Sbjct: 1637 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1696

Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542
                  LS+K++ER+RD      KA++ KED+D N +KRRKLKREH+PSEPGEY P +P 
Sbjct: 1697 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1756

Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722
            PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK  RRD DPMYDREW
Sbjct: 1757 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1815

Query: 2723 DDDKRQRAEPKRRHRK 2770
            DD+KRQR EPKRRHRK
Sbjct: 1816 DDEKRQRPEPKRRHRK 1831


>gb|EOY01327.1| THO2 isoform 3 [Theobroma cacao]
          Length = 1762

 Score = 1009 bits (2609), Expect = 0.0
 Identities = 554/929 (59%), Positives = 633/929 (68%), Gaps = 19/929 (2%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KPI W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI     
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNSSSAI KRKKDKERIQE+LDRLT E  KHEE+V SVRRRL +EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RK SWVTDEEFGMGYL++KPA   ASKSL+A +   Q 
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAATS---QA 1254

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEESANKQSEENSKV-T 1297
            G G S+                                +N KQ++ES+NK  E  +KV  
Sbjct: 1255 GTGKSL--------------------------------ENQKQLDESSNKLDEHLAKVPA 1282

Query: 1298 KTVVEPEARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKL 1477
            K   E E++   KRSA AGSL K  KQD  KD+ KSGKAVGR                 +
Sbjct: 1283 KNSAELESKASAKRSAPAGSLTKTQKQDPGKDDGKSGKAVGR---------------TSV 1327

Query: 1478 TNSSTRSSDHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTA 1657
            T    R    +TE +   T +  S +    GKD+G+E  DA  +P+SR   SPR ++   
Sbjct: 1328 TCVIDRDVPSHTEGRQGGTTNVPS-AVTSNGKDDGSELPDA-SRPSSRIVHSPRHDSSAT 1385

Query: 1658 ASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAP 1837
             SKS DK  KR +P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D +      +A 
Sbjct: 1386 VSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQ------LAD 1438

Query: 1838 FDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLE 1993
            FDK G+D        DKPLDR+K+K                 KSR DD+L+EK RDRS+E
Sbjct: 1439 FDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIE 1498

Query: 1994 RHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXX 2164
            R+GRERSV    ER  DRN +RL   AKDER+KD+RSKVRY + S EKSHVDDRF     
Sbjct: 1499 RYGRERSV----ERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSL 1554

Query: 2165 XXXXXXXXXXXXXSV-SSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341
                         SV ++GRRD+D DRRFG+ RH+Q+LSP            N+   Q  
Sbjct: 1555 PPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDD 1614

Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHM 2503
                               LS+K++ER+RD      KA++ KED+D N +KRRKLKREH+
Sbjct: 1615 GKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHL 1674

Query: 2504 PSEPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKA 2683
            PSEPGEY P +P PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK 
Sbjct: 1675 PSEPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKM 1733

Query: 2684 TRRDADPMYDREWDDDKRQRAEPKRRHRK 2770
             RRD DPMYDREWDD+KRQR EPKRRHRK
Sbjct: 1734 ARRDTDPMYDREWDDEKRQRPEPKRRHRK 1762


>ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis vinifera]
          Length = 1849

 Score = 1008 bits (2606), Expect = 0.0
 Identities = 555/955 (58%), Positives = 642/955 (67%), Gaps = 72/955 (7%)
 Frame = +2

Query: 53   KPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXXXX 232
            KPI W +LLDT +TMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI         
Sbjct: 902  KPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHSALKA 961

Query: 233  XXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSCPD 412
                SDNS+SAI KRKKDKERIQESLDRLT E QKHEE+V SVRRRLA EKD WLSSCPD
Sbjct: 962  LEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLSSCPD 1021

Query: 413  TLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQPMI 592
            TLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQPMI
Sbjct: 1022 TLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQPMI 1081

Query: 593  CCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQFI 772
            CCCTEYE GRLGRFL+ET+K AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQFI
Sbjct: 1082 CCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQFI 1141

Query: 773  KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 952
            KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD
Sbjct: 1142 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 1201

Query: 953  EREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGAGL 1132
            EREDLK            RKPSWVTDEEFGMGYL++KPAP  ASKSL+ N V + NG+GL
Sbjct: 1202 EREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKSLAGNLVAVPNGSGL 1261

Query: 1133 SVSQAEQMGGRTVSSGSLHSDSGNLGRE----PRRIDG---------------------- 1234
            ++ Q E  GGRTV+SG+ H D+GN  +E     + +DG                      
Sbjct: 1262 NIFQNESSGGRTVASGTQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKG 1321

Query: 1235 --------------------------DNLKQVEESANKQSEENS-KV-TKTVVEPEARPV 1330
                                      +N + V+ES N+  +E++ KV ++   E E R  
Sbjct: 1322 GSSVNGSDIQQSMPSAASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESELRAT 1381

Query: 1331 VKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSD-- 1504
             KRS  +GSL KQ K D++KD+ KSGK VGR                  + SST   D  
Sbjct: 1382 GKRSLPSGSLTKQPKLDVAKDDSKSGKGVGR-----------------TSGSSTSDRDLP 1424

Query: 1505 -HNTEIKAEITNSKSSDSRVYGG--KDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGD 1675
             H  E +     + SS     G   KD+G E +D  + P+SR   SPR +N +A  KSGD
Sbjct: 1425 AHQLEGRQSGVTNVSSAGTADGSVVKDDGNEVSD--RAPSSRPIHSPRHDN-SATIKSGD 1481

Query: 1676 KPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGS 1855
            K  KR SPAEE +R+NKRRKG+ + RD +G EVR S+KERS D R LDK H    DK+G+
Sbjct: 1482 KQQKRTSPAEEPERVNKRRKGDTEVRDFEG-EVRFSDKERSMDPR-LDKSHAVDLDKSGT 1539

Query: 1856 D--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRER 2011
            D        DKP DR K+K                 KSRGD++++EK RDRS+ERHGRER
Sbjct: 1540 DEQGISRATDKPSDRLKDKGSERYERDHRERLERPDKSRGDEMIAEKSRDRSMERHGRER 1599

Query: 2012 SVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXX 2182
            SV+RVQER ++R+FDRL    KDERNKDDR K+RY E S+EKSH DDRF           
Sbjct: 1600 SVERVQERSSERSFDRLTDKVKDERNKDDRGKMRYSETSVEKSHADDRFHGQSLPPPPPL 1659

Query: 2183 XXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXX 2362
                   SV++ RRDEDADRRFG ARH Q+LSP            +    Q         
Sbjct: 1660 PPHMVPQSVTASRRDEDADRRFGTARHAQRLSP---RHEEKERRRSEEISQDDAKRRRED 1716

Query: 2363 XXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLPAS 2536
                        LSIK+++RER+KA++ KED+D + ASKRRKLKREHMPS E GEY PA+
Sbjct: 1717 DIRERKREEREGLSIKVEDREREKASLLKEDMDPSAASKRRKLKREHMPSGEAGEYTPAA 1776

Query: 2537 PAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDAD 2701
            P PPP +I++ Q++DGR+RGDRKG +VQR GY ++PGLR H KE   K  RRDAD
Sbjct: 1777 PPPPPPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLRIHGKEVTGKMARRDAD 1831


>ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citrus clementina]
            gi|557550732|gb|ESR61361.1| hypothetical protein
            CICLE_v10014076mg [Citrus clementina]
          Length = 1193

 Score = 1005 bits (2599), Expect = 0.0
 Identities = 555/984 (56%), Positives = 654/984 (66%), Gaps = 76/984 (7%)
 Frame = +2

Query: 47   SSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXX 226
            S KP+ W +LLDTVKTMLP KAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI       
Sbjct: 219  SQKPVMWSDLLDTVKTMLPSKAWNSLSPDLYTTFWGLTLYDLYVPRDRYESEIAKQHAAL 278

Query: 227  XXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSC 406
                  SDNSSSAI KRKKDKERIQESLDRLT E  KHEE+V SVRRRL+ EKD WLSSC
Sbjct: 279  KALEELSDNSSSAITKRKKDKERIQESLDRLTNELHKHEENVASVRRRLSREKDRWLSSC 338

Query: 407  PDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQP 586
            PDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQP
Sbjct: 339  PDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQP 398

Query: 587  MICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQ 766
            MICCCTEYE GRLG+FLFETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQ
Sbjct: 399  MICCCTEYEAGRLGKFLFETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQ 458

Query: 767  FIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIK 946
            FIKVHWKWSQRITRLLIQCLES EYMEIRNALI+LTKIS VFPVTRKSGINLEKRVAKIK
Sbjct: 459  FIKVHWKWSQRITRLLIQCLESAEYMEIRNALILLTKISGVFPVTRKSGINLEKRVAKIK 518

Query: 947  SDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGA 1126
            +DEREDLK            RK  WVTDEEFGMGYL++KPAP  ASKSLS N V +Q G+
Sbjct: 519  NDEREDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQ-GS 577

Query: 1127 GLSVSQAEQMGGRTVSSGSLHSDSGNLGREPR-----RIDGDNLKQVEESANKQSEENSK 1291
             ++VSQ+E   G +V      +  G+ GR  R      +  DN+K    S    S+ +S 
Sbjct: 578  AINVSQSEPGTGNSVKDHISRAKPGD-GRLERTESISHVKSDNVKLKGSSLTNGSDIHSS 636

Query: 1292 VTKTVV-----------------------------EPEARPVVKRSAAAGSLAKQAKQDL 1384
            +  T V                             E E++  VKRS  + SL K  KQDL
Sbjct: 637  MPSTAVQAEMSRVVENQKQVDEDENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQDL 696

Query: 1385 SKDEDKSGKAVGR-----------------------XXXXXXXXXXXXXXXXKLTNSSTR 1495
            +KD++KS KAVGR                                       K ++SS+R
Sbjct: 697  AKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSSR 756

Query: 1496 SSD-HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSG 1672
            +SD H  E K +   +KSS+ R+  GK +G E +DA K  +SR+  SPR ++  AASKSG
Sbjct: 757  ASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRTMHSPRHDSSVAASKSG 816

Query: 1673 DKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTG 1852
            D+  KR SP+E+ DR +KR KG+ + RD DG EVR+ ++ERS+D R  D       DK G
Sbjct: 817  DRLQKRTSPSEDPDRPSKRYKGDTELRDSDG-EVRVPDRERSADPRFAD------LDKIG 869

Query: 1853 SDDKPL----DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVD 2020
            +D++ +    DR+K+K                 KSR DD++ EK RDRS+ER+GRERSV+
Sbjct: 870  TDEQSMYRTTDRSKDKGNERYERDHRERLDRLDKSRVDDIIPEKQRDRSMERYGRERSVE 929

Query: 2021 RVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXX 2191
            R QERGADR FDRL   AKD+RNKDDRSK+RY ++S EKSHVD+RF              
Sbjct: 930  RGQERGADRAFDRLAEKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQSLPPPPPLPPH 989

Query: 2192 XXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXX 2371
                SV++GRRDEDAD+RFG+ RH+Q+LSP            N+   Q            
Sbjct: 990  IVPQSVNAGRRDEDADKRFGSTRHSQRLSPRHDEKERRRSEENSLVSQDDAKRRREDDFR 1049

Query: 2372 XXXXXXXXALSIKMDERE--------RDKANMNKEDIDLN--ASKRRKLKREHMPS-EPG 2518
                     LS+KMDERE        R+KAN+ KE++D N  ASKRRKLKREH+PS E G
Sbjct: 1050 DRKREDREGLSLKMDERERERDRDRDREKANLLKEEMDANAAASKRRKLKREHLPSGEAG 1109

Query: 2519 EYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDA 2698
            EY P +P  PP++I + QS+DGRDRGDRKG  +QR GY E+  +R H KE A+K  RRD+
Sbjct: 1110 EYSPVAPPYPPLAIGISQSYDGRDRGDRKGAAMQRTGYMEEQSMRIHGKEVATKMARRDS 1169

Query: 2699 DPMYDREWDDDKRQRAEPKRRHRK 2770
            + +Y+REW+D+KRQRAE KRRHRK
Sbjct: 1170 ELIYEREWEDEKRQRAEQKRRHRK 1193


>ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citrus sinensis]
          Length = 1874

 Score = 1005 bits (2598), Expect = 0.0
 Identities = 555/984 (56%), Positives = 653/984 (66%), Gaps = 76/984 (7%)
 Frame = +2

Query: 47   SSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXX 226
            S KP+ W +LLDTVKTMLP KAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI       
Sbjct: 900  SQKPVMWSDLLDTVKTMLPSKAWNSLSPDLYTTFWGLTLYDLYVPRDRYESEIAKQHAAL 959

Query: 227  XXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSC 406
                  SDNSSSAI KRKKDKERIQESLDRLT E  KHEE+V SVRRRL+ EKD WLSSC
Sbjct: 960  KALEELSDNSSSAITKRKKDKERIQESLDRLTNELHKHEENVASVRRRLSREKDRWLSSC 1019

Query: 407  PDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQP 586
            PDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQP
Sbjct: 1020 PDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQP 1079

Query: 587  MICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQ 766
            MICCCTEYE GRLG+FLFETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQ
Sbjct: 1080 MICCCTEYEAGRLGKFLFETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQ 1139

Query: 767  FIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIK 946
            FIKVHWKWSQRITRLLIQCLES EYMEIRNALI+LTKIS VFPVTRKSGINLEKRVAKIK
Sbjct: 1140 FIKVHWKWSQRITRLLIQCLESAEYMEIRNALILLTKISGVFPVTRKSGINLEKRVAKIK 1199

Query: 947  SDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGA 1126
            +DEREDLK            RK  WVTDEEFGMGYL++KPAP  ASKSLS N V +Q G+
Sbjct: 1200 NDEREDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQ-GS 1258

Query: 1127 GLSVSQAEQMGGRTVSSGSLHSDSGNLGREPR-----RIDGDNLKQVEESANKQSEENSK 1291
             ++VSQ+E   G +V      +  G+ GR  R      +  DN+K    S    S+ +S 
Sbjct: 1259 AINVSQSEPGTGNSVKDHISRAKPGD-GRLERTESISHVKSDNVKLKGSSLTNGSDIHSS 1317

Query: 1292 VTKTVV-----------------------------EPEARPVVKRSAAAGSLAKQAKQDL 1384
            V  T V                             E E++  VKRS  + SL K  KQDL
Sbjct: 1318 VPSTAVQAEMSRVVENQKQVDEDENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQDL 1377

Query: 1385 SKDEDKSGKAVGR-----------------------XXXXXXXXXXXXXXXXKLTNSSTR 1495
            +KD++KS KAVGR                                       K ++SS+R
Sbjct: 1378 AKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSSR 1437

Query: 1496 SSD-HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSG 1672
            +SD H  E K +   +KSS+ R+  GK +G E +DA K  +SR+  SPR ++  A SKSG
Sbjct: 1438 ASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRAMHSPRHDSSVATSKSG 1497

Query: 1673 DKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTG 1852
            D+  KR SP+E+ DR +KR KG+ + RD DG EVR+ ++ERS+D R  D       DK G
Sbjct: 1498 DRLQKRTSPSEDPDRPSKRYKGDTELRDSDG-EVRVPDRERSADPRFAD------LDKIG 1550

Query: 1853 SDDKPL----DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVD 2020
            +D++ +    DR+K+K                 KSR DD++ EK RDRS+ER+GRERSV+
Sbjct: 1551 TDEQSMYRTTDRSKDKGNERYERDHRERLDRLDKSRVDDIIPEKQRDRSMERYGRERSVE 1610

Query: 2021 RVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXX 2191
            R QERGADR FDRL   AKD+RNKDDRSK+RY ++S EKSHVD+RF              
Sbjct: 1611 RGQERGADRAFDRLADKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQSLPPPPPLPPH 1670

Query: 2192 XXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXX 2371
                SV++GRRDEDAD+RFG+ RH+Q+LSP            N+   Q            
Sbjct: 1671 IVPQSVNAGRRDEDADKRFGSTRHSQRLSPRHDEKERRRSEENSLVSQDDAKRRREDDFR 1730

Query: 2372 XXXXXXXXALSIKMDERE--------RDKANMNKEDIDLN--ASKRRKLKREHMPS-EPG 2518
                     LS+KMDERE        R+KAN+ KE++D N  ASKRRKLKREH+PS E G
Sbjct: 1731 DRKREDREGLSLKMDERERERDRDRDREKANLLKEEMDANAAASKRRKLKREHLPSGEAG 1790

Query: 2519 EYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDA 2698
            EY P +P  PP++I + QS+DGRDRGDRKG  +QR GY E+  +R H KE A+K  RRD+
Sbjct: 1791 EYSPVAPPYPPLAIGISQSYDGRDRGDRKGATMQRTGYMEEQSMRIHGKEVATKMARRDS 1850

Query: 2699 DPMYDREWDDDKRQRAEPKRRHRK 2770
            + +Y+REW+D+KRQRAE KRRHRK
Sbjct: 1851 ELIYEREWEDEKRQRAEQKRRHRK 1874


>ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glycine max]
          Length = 1778

 Score =  994 bits (2571), Expect = 0.0
 Identities = 553/992 (55%), Positives = 649/992 (65%), Gaps = 82/992 (8%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G +  PI+W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI     
Sbjct: 805  GSAQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 864

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKK+KERIQESLDRL  E  KHEE+V SVRRRL+HEKD WLS
Sbjct: 865  NLKSLEELSDNSSSAITKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 924

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 925  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 984

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 985  QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1044

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1045 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1104

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++KPAP   +KS + N+  +Q+
Sbjct: 1105 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPS-VTKSSAGNSATVQS 1163

Query: 1121 GAGLSVSQAEQMGGRTVSSGSL------------------------HSDSGNLG-REPRR 1225
            G  L+VSQ E   G+ V SG++                         SD+G++  +    
Sbjct: 1164 GINLNVSQTESASGKHVDSGNIVKDQAMRTKTADGRSERTESITVTKSDTGHIKLKSSSM 1223

Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345
            ++G                    +N KQVEES N+ S+E+   T+T    E R   KRS 
Sbjct: 1224 VNGLDAQSSLAPSSVQSGTSKSMENPKQVEESINRASDEHG--TRTT---ELRTSAKRSV 1278

Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513
             AGSL+K +KQD  K++ +SGK V R                +   + T    SS+ NT 
Sbjct: 1279 PAGSLSKPSKQDPVKEDGRSGKPVARTSGSSSSDKELQTHALEGRYTGTTNVPSSNGNTI 1338

Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633
                                E KAE+  +KSSD R    KD+G + TD  +  +SR   S
Sbjct: 1339 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGASSRVVHS 1398

Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRA 1813
            PR EN    SKS DK  KR S AEE DRL KRRKG+++ RD + +EVR SE+E+  D R 
Sbjct: 1399 PRYENTGVTSKSNDKVQKRASSAEEPDRLGKRRKGDVELRDFE-TEVRFSEREKMMDPRF 1457

Query: 1814 LDKLHVAPFDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSE 1969
             D       DK+G +        DKPL+RAK+K                 KSRGDD ++E
Sbjct: 1458 AD-------DKSGPEEHGLYRAGDKPLERAKDKGNERYERDHRERMDRLDKSRGDDFVAE 1510

Query: 1970 KLRDRSLERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVD 2140
            K RDRS+ER+GRERSV+R+QERG+DR+F+RL   AKDERNKDDR+K+RY +AS+EKSH D
Sbjct: 1511 KPRDRSIERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASVEKSHGD 1570

Query: 2141 DRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXN 2320
            DRF                  SV +GRRDED DRR+G  RH+Q+LSP             
Sbjct: 1571 DRFHGQSLPPPPPLPPNVVPQSVGAGRRDEDVDRRYGATRHSQRLSP----RHEEKERRR 1626

Query: 2321 ASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKRE 2497
            +                         + ++  ERER+KAN+ KE++DLN ASKRRK KRE
Sbjct: 1627 SEETVVSQDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKPKRE 1686

Query: 2498 HMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAA 2674
            H+P+ EPGEY P +  P    I +  ++DGRDRGDRKG I+Q P Y ++  LR H KE A
Sbjct: 1687 HLPTGEPGEYSPVAHPPSSAGIGMSLAYDGRDRGDRKGPIMQHPSYVDESSLRIHGKEVA 1746

Query: 2675 SKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770
            SK  RRD+DP+YDREW+D+KRQRA+ KRRHRK
Sbjct: 1747 SKLNRRDSDPLYDREWEDEKRQRADQKRRHRK 1778


>gb|EMJ18294.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica]
          Length = 1878

 Score =  993 bits (2568), Expect = 0.0
 Identities = 552/996 (55%), Positives = 651/996 (65%), Gaps = 86/996 (8%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G  SKP+ WL+LL+TVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRN YESEI     
Sbjct: 896  GSPSKPVTWLDLLNTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNCYESEIAKQHA 955

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKKDKERIQESLDRLT E +KHEE+V SVR+RL+ EKD WLS
Sbjct: 956  ALKALEELSDNSSSAITKRKKDKERIQESLDRLTSELRKHEENVASVRKRLSREKDKWLS 1015

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKIN+EFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHID+LIC+TL
Sbjct: 1016 SCPDTLKINVEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDILICRTL 1075

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYEVGR G+FL ETLK AY+WK DESIYE+ECGNMPGFAVYYR+PNSQRV Y
Sbjct: 1076 QPMICCCTEYEVGRFGKFLQETLKIAYYWKKDESIYERECGNMPGFAVYYRHPNSQRVAY 1135

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
             QF+KVHWKWSQRIT+LLIQCLESTEYMEIRNALI+L+KISSVFPVTRK+G+NLEKRV+K
Sbjct: 1136 FQFMKVHWKWSQRITKLLIQCLESTEYMEIRNALILLSKISSVFPVTRKTGVNLEKRVSK 1195

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IK+DEREDLK            RK SW+TDEEFG GYL++K AP  ASKS + N+    +
Sbjct: 1196 IKADEREDLKVLATGVAAALAARKSSWITDEEFGNGYLELKSAP-LASKSSAGNSAATHS 1254

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGN-------------------------------- 1204
            G+ +++SQ+E +GG+  +  S H +S N                                
Sbjct: 1255 GSTINISQSEPIGGKVGALPSQHPESSNSVKDQILKTKTSDGRLERVESISTVKSDQGHL 1314

Query: 1205 ---LGREPRRIDGDNL-----------------KQVEESANKQSEEN--SKVTKTVVEPE 1318
               +G      DG +L                 KQV ES+N+ S+EN      K   E E
Sbjct: 1315 KLKVGSLVSGSDGQSLMSSPALQSGTSRSMENKKQVNESSNRTSDENMGKAAPKNSSESE 1374

Query: 1319 ARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRS 1498
             R   KRS  AGSLAK  KQDL+KD+ +SGK +GR                   N +T S
Sbjct: 1375 LRAQAKRSGPAGSLAKPPKQDLAKDDGRSGKGIGRDVLCHASAVSTNVSPAIAANGNTVS 1434

Query: 1499 SD------------HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQ 1642
            +             H  + K ++  +K+S++RV   K++G E +DA +  +SR   SPR 
Sbjct: 1435 ASAKGSFAKTSVEIHGIDSKVDVGAAKASNTRVSAPKEDGPETSDALRPHSSRLVHSPRH 1494

Query: 1643 ENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDK 1822
            +N  +ASKS DK  KR SPAEE DR +KRRKGE + RD +G E RLS++ERS D R LD 
Sbjct: 1495 DNSASASKSSDKLQKRTSPAEETDRQSKRRKGETEMRDFEG-EARLSDRERSVDARLLD- 1552

Query: 1823 LHVAPFDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLR 1978
                  DK+G+D        DKP DR+K+K                 KSRGDD L E+ R
Sbjct: 1553 -----LDKSGTDDQSVYKATDKPSDRSKDKGSERHDKDYRERLDRPDKSRGDD-LGERSR 1606

Query: 1979 DRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXX 2158
            DRS+ERHGRE SV++VQERG DR+ DRL+  +++KDDR KVRY + S EKSHVD+R+   
Sbjct: 1607 DRSMERHGREHSVEKVQERGMDRSVDRLS--DKSKDDRGKVRYNDISTEKSHVDERYHGQ 1664

Query: 2159 XXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQX 2338
                           SVSSGRRDEDADRRFG  RHTQ+LSP            N+   Q 
Sbjct: 1665 SLPPPPPLPPHMVPHSVSSGRRDEDADRRFGTTRHTQRLSPRHDEKERRRSEDNSLISQD 1724

Query: 2339 XXXXXXXXXXXXXXXXXXXALSIKMD----ERERDKANMNKEDID-LNASKRRKLKREHM 2503
                                LSIK++    ERER+KAN+ KE+ D + ASKRRKLKREH 
Sbjct: 1725 DSKRRREDDFRDRKREDREGLSIKVEEREREREREKANLLKEETDAIAASKRRKLKREHP 1784

Query: 2504 PS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASK 2680
            PS EPGEY P  P PPP+SI+L QS+DGRDRGDRKG  VQR GY E+P +R H KEAASK
Sbjct: 1785 PSGEPGEYSPVPPPPPPLSISLSQSYDGRDRGDRKGPPVQRAGYLEEPSVRIHGKEAASK 1844

Query: 2681 ATRRDADP------MYDREWDDDKRQRAEPKRRHRK 2770
             TRRD DP      MY  EW+D+KRQRAE KRRHRK
Sbjct: 1845 MTRRDPDPYPSCCRMY--EWEDEKRQRAEQKRRHRK 1878


>gb|ESW32460.1| hypothetical protein PHAVU_002G324500g [Phaseolus vulgaris]
          Length = 1864

 Score =  992 bits (2564), Expect = 0.0
 Identities = 548/980 (55%), Positives = 642/980 (65%), Gaps = 70/980 (7%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G +  PI+W  LLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI     
Sbjct: 896  GSAQNPISWSYLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 955

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKK+KERIQESLDRL  E  KHEE+V SV  RL+ EKD WLS
Sbjct: 956  NLKSLEELSDNSSSAITKRKKEKERIQESLDRLISELHKHEENVASVHSRLSREKDRWLS 1015

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1016 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1075

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1076 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1135

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLES+EYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1136 GQFIKVHWKWSQRITRLLIQCLESSEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1195

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++KPAP   +KS + N   + +
Sbjct: 1196 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPS-GTKSSAGNPSTVHS 1254

Query: 1121 GAGLSVSQAEQMGGRTVSSG------------------------SLHSDSGN----LGRE 1216
            G  L+VSQ E   G+ V SG                        +  SDSG+     G  
Sbjct: 1255 GMNLNVSQTESASGKHVDSGNTVKDQVIRTKTTDGKSERTESMTATKSDSGHTKVKTGAM 1314

Query: 1217 PRRIDG-----------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345
                DG                 +N KQVEE  N+ S+++   T+T    E+R   KRS 
Sbjct: 1315 VNGFDGQTSSISSSIQSGMSKSMENSKQVEELINRASDDHG--TRTA---ESRASAKRSV 1369

Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNS----------STR 1495
              GSL+K +KQD  K++ +SGK V R                 +T+S          ST+
Sbjct: 1370 PTGSLSKPSKQDPLKEDSRSGKPVARTSGSLSSDKDLHSGTTNVTSSVSANGNTITGSTK 1429

Query: 1496 SSD---------HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQEN 1648
             S+            E KAE+  SKSSD R    KD+G +  D  +  +SR   SPR EN
Sbjct: 1430 GSNAPVRISLDGPGNESKAEVGVSKSSDIRASVVKDDGNDTADLTRGSSSRVVHSPRHEN 1489

Query: 1649 LTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR-ALDKL 1825
               ASKS +K  KR S AEE DRL KRRKG+++ RD + SEVR S++++  D R A DKL
Sbjct: 1490 TGVASKSNEKVQKRASSAEEPDRLGKRRKGDVELRDFE-SEVRFSDRDKLMDPRFADDKL 1548

Query: 1826 HVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGR 2005
                     + DK L+R K+K                 KSRGDD ++EK RDRS+ER+GR
Sbjct: 1549 GPEEHGLYRAGDKSLERPKDKGNERYERDHRERLDRVDKSRGDDSVAEKPRDRSIERYGR 1608

Query: 2006 ERSVDRVQERGADRNFDR---LAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXX 2176
            ERSV+R+QERG++R+F+R    AKDER+KDDR+K+RY +AS+EKSH DDRF         
Sbjct: 1609 ERSVERMQERGSERSFNRPPEKAKDERSKDDRNKLRYSDASVEKSHADDRFHGQSLPPPP 1668

Query: 2177 XXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXX 2356
                     SV +GRRDEDADRR+G  RH+Q+LSP             +           
Sbjct: 1669 PLPPNMVPQSVGAGRRDEDADRRYGATRHSQRLSP----RHEEKERRRSEETVVSQDDAK 1724

Query: 2357 XXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLP 2530
                          + ++  ERER+KAN+ KED+DLN ASKRRKLKREH+ + EPGEY P
Sbjct: 1725 RRKEDDFRERKREEIKVEEREREREKANVLKEDLDLNAASKRRKLKREHLSTGEPGEYSP 1784

Query: 2531 ASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMY 2710
             +P PPP  I +   +DGRDRGDRKG ++Q P Y ++P +R H KE ASK  RRD+DP+Y
Sbjct: 1785 VAPPPPPTGIGMPLGYDGRDRGDRKGPVIQHPNYIDEPNIRIHGKEVASKLNRRDSDPLY 1844

Query: 2711 DREWDDDKRQRAEPKRRHRK 2770
            DREWDD+KRQRA+ KRRHRK
Sbjct: 1845 DREWDDEKRQRADQKRRHRK 1864


>ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isoform X1 [Glycine max]
          Length = 1870

 Score =  988 bits (2553), Expect = 0.0
 Identities = 548/985 (55%), Positives = 645/985 (65%), Gaps = 75/985 (7%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G    PI+W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI     
Sbjct: 897  GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAIAKRKK+KERIQESLDRL  E  KHEE+V SVRRRL+HEKD WLS
Sbjct: 957  NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++KP+P   +KS + N+  +Q+
Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPS-MTKSSAGNSATVQS 1255

Query: 1121 GAGLSVSQAEQMGGRTVSSGS------------------------LHSDSGNLG-REPRR 1225
            G  L+VSQ E + G+ V SG+                          SD+G++  +    
Sbjct: 1256 GINLNVSQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSM 1315

Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345
            ++G                    +N KQVEES N+ S+E+   T++    E R   KRS 
Sbjct: 1316 VNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHG--TRST---ELRTSAKRSV 1370

Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513
             A SLAK +KQD  K++ +SGK V R                +  ++ T    SS+ NT 
Sbjct: 1371 PASSLAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSSNGNTI 1430

Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633
                                E KAE+  +KSSD R    KD+G + TD  +  +SR   S
Sbjct: 1431 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHS 1490

Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR- 1810
            PR EN    SKS D+  KR S  EE DRL KRRKG+++ RD + +E+R SE+E+  D R 
Sbjct: 1491 PRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFE-TELRFSEREKMMDPRF 1549

Query: 1811 ALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSL 1990
            A DKL         + DKPL+R K+K                 KSRGDD ++EK RDRS+
Sbjct: 1550 ADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEKPRDRSI 1609

Query: 1991 ERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXX 2161
            ER+GRERSV+R+QERG+DR+F+RL   AKDERNKDDR+K+RY +AS EKSH DDRF    
Sbjct: 1610 ERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASAEKSHGDDRFHGQS 1669

Query: 2162 XXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341
                          SV +GRRDED DRR+G  RH+Q+LSP               +    
Sbjct: 1670 LPPPPPLPPNVVPQSVGAGRRDEDVDRRYGATRHSQRLSPRHEEKERRWSEETVVS---- 1725

Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EP 2515
                               + ++  ERER+KAN+ KE++DLN ASKRRKLKREH+P+ EP
Sbjct: 1726 QDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLPTDEP 1785

Query: 2516 GEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRD 2695
            GEY   +  P      +  ++DGRDRGDRKG I+Q P Y ++  LR H KEAASK  RRD
Sbjct: 1786 GEYSAVAHPPSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKLNRRD 1845

Query: 2696 ADPMYDREWDDDKRQRAEPKRRHRK 2770
            +DP+YDREW+D+KRQRA+ KRRHRK
Sbjct: 1846 SDPLYDREWEDEKRQRADQKRRHRK 1870


>gb|EOY01329.1| THO complex subunit 2 isoform 5 [Theobroma cacao]
          Length = 1824

 Score =  987 bits (2552), Expect = 0.0
 Identities = 547/954 (57%), Positives = 633/954 (66%), Gaps = 66/954 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KPI W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI     
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNSSSAI KRKKDKERIQE+LDRLT E  KHEE+V SVRRRL +EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RK SWVTDEEFGMGYL++KPA   ASKSL+ N V +QN
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234
            G+ ++VSQ+E  G R V+ G+  SD                     + +LG+   +  G 
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317

Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336
                                     +N KQ++ES+NK  E  +KV  K   E E++   K
Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377

Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516
            RSA AGSL K  KQD  KD+ KSGKAVGR                 +T    R    +TE
Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRT---------------SVTCVIDRDVPSHTE 1422

Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696
             +   T +  S +    GKD+G+E  DA + P+SR   SPR ++    SKS DK  KR +
Sbjct: 1423 GRQGGTTNVPS-AVTSNGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1480

Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858
            P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D +  D      FDK G+D      
Sbjct: 1481 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1533

Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032
              DKPLDR+K+K                 KSR DD+L+EK RDRS+ER+GRERSV    E
Sbjct: 1534 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1589

Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203
            R  DRN +RL   AKDER+KD+RSKVRY + S EKSHVDDRF                  
Sbjct: 1590 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1649

Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380
            SV++ GRRD+D DRRFG+ RH+Q+LSP            N+   Q               
Sbjct: 1650 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1709

Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542
                  LS+K++ER+RD      KA++ KED+D N +KRRKLKREH+PSEPGEY P +P 
Sbjct: 1710 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1769

Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADP 2704
            PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK  RRD DP
Sbjct: 1770 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDP 1822


>ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus]
            gi|449506883|ref|XP_004162874.1| PREDICTED: THO complex
            subunit 2-like [Cucumis sativus]
          Length = 1887

 Score =  984 bits (2545), Expect = 0.0
 Identities = 547/994 (55%), Positives = 646/994 (64%), Gaps = 84/994 (8%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G   KP+ W +LLDTVK+MLPPKAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI     
Sbjct: 899  GSLQKPVRWSDLLDTVKSMLPPKAWNSLSPDLYTTFWGLTLYDLYVPRSRYESEIAKQHA 958

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKKDKERIQESLDRL+ E  KHEE+V SVRRRL+ EKD WLS
Sbjct: 959  ALKALEELSDNSSSAINKRKKDKERIQESLDRLSNELVKHEENVASVRRRLSREKDKWLS 1018

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1019 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1078

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1079 QPMICCCTEYEAGRLGRFLYETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1138

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKIS+VFPVTRKSGINLEKRVAK
Sbjct: 1139 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISNVFPVTRKSGINLEKRVAK 1198

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++K  P  ASK  ++N    QN
Sbjct: 1199 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELK-TPSLASKPSASNLASSQN 1257

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGRE-----------PRRIDG----------- 1234
             + + VSQ E +GG+T +    +SDSGN+ ++             +IDG           
Sbjct: 1258 NS-IFVSQNEPVGGKTSALPIPNSDSGNMAKDHSLRSRTSDVRTDKIDGLSVPKSELGHG 1316

Query: 1235 -----------------------------DNLKQVEESANKQSEENSK-VTKTVVEPEAR 1324
                                         D+ K  ++S     E +SK V+KT  E E R
Sbjct: 1317 KQKGMSLNGPDSQPLVPSTSVHSGSLKMVDSQKPGDDSTRTLDEGSSKVVSKTSSESELR 1376

Query: 1325 PVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXK---------- 1474
               KRS    SL K  KQD++KDE +SGKA  +                           
Sbjct: 1377 GSTKRSGPVTSLNKAPKQDITKDEIRSGKAASKNPGSSTSERELPVHATDGGRHGGPSNS 1436

Query: 1475 ---LTNSSTRSS-------------DHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHK 1606
               ++N +T++S              H  E KAE    ++SD RV   KD+G E  D  +
Sbjct: 1437 PSIMSNGNTQNSLTKGSSLTVKASDGHTIESKAESGVGRTSDGRVSSVKDDGPEALDVSR 1496

Query: 1607 QPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSE 1786
              +SR   SPR +N  + S+S DK  KR SPAEE DR  KRRKG+ + RD+DG + R+S+
Sbjct: 1497 SSSSRLGHSPRHDNSASGSRSSDKLQKRASPAEEPDRQGKRRKGDGEIRDVDG-DFRISD 1555

Query: 1787 KERSSDVRALDKLHVAPFDKTG--SDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDL 1960
            K+RS D R++D   +   +++G    DKPLDR K+K                 KSRGDD 
Sbjct: 1556 KDRSMDPRSIDADKIGMEEQSGYRGLDKPLDRTKDKVNERYDRDYRDRAERPEKSRGDDP 1615

Query: 1961 LSEKLRDRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVD 2140
              E+ RDRS+ER+GRERSV++V ER +DR +   +KDERNKDDRSK+RY +++++KSH D
Sbjct: 1616 QVERTRDRSIERYGRERSVEKV-ERVSDR-YPEKSKDERNKDDRSKLRYSDSTVDKSHTD 1673

Query: 2141 DRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXN 2320
            DRF                  SV+SGRR+EDADRRFG ARH Q+LSP            N
Sbjct: 1674 DRFHGQSLPPPPPLPPHLVPQSVNSGRREEDADRRFGTARHAQRLSPRHEEKERRRSEEN 1733

Query: 2321 ASALQXXXXXXXXXXXXXXXXXXXXALSIKMD--ERERDKANMNKEDIDLN-ASKRRKLK 2491
              +                       +S+K+D  ERER+KAN+ KED+D + ASKRRKLK
Sbjct: 1734 LISQDDAKRRREEEFRERKREERDVGMSLKVDDREREREKANLLKEDMDASAASKRRKLK 1793

Query: 2492 REHMP-SEPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKE 2668
            REH+   E GEY P  P PPP+   + QS+DGR+RGDRKGV++QRPGY +DPGLR H KE
Sbjct: 1794 REHLSLVEAGEYSPVGPPPPPMGGGVSQSYDGRERGDRKGVMMQRPGYLDDPGLRIHGKE 1853

Query: 2669 AASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770
              +K TRR+AD MY+REWDD+KR RA+ KRRHRK
Sbjct: 1854 VVNKMTRREADLMYEREWDDEKRMRADQKRRHRK 1887


>ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isoform X2 [Glycine max]
          Length = 1845

 Score =  970 bits (2507), Expect = 0.0
 Identities = 542/985 (55%), Positives = 639/985 (64%), Gaps = 75/985 (7%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G    PI+W  LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI     
Sbjct: 897  GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAIAKRKK+KERIQESLDRL  E  KHEE+V SVRRRL+HEKD WLS
Sbjct: 957  NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK
Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYL++KP+P   +KS + N+  +Q+
Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPS-MTKSSAGNSATVQS 1255

Query: 1121 GAGLSVSQAEQMGGRTVSSGS------------------------LHSDSGNLG-REPRR 1225
            G  L+VSQ E + G+ V SG+                          SD+G++  +    
Sbjct: 1256 GINLNVSQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSM 1315

Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345
            ++G                    +N KQVEES N+ S+E+   T++    E R   KRS 
Sbjct: 1316 VNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHG--TRST---ELRTSAKRSV 1370

Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513
             A SLAK +KQD  K++ +SGK V R                +  ++ T    SS+ NT 
Sbjct: 1371 PASSLAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSSNGNTI 1430

Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633
                                E KAE+  +KSSD R    KD+G + TD  +  +SR   S
Sbjct: 1431 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHS 1490

Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR- 1810
            PR EN    SKS D+  KR S  EE DRL KRRKG+++ RD + +E+R SE+E+  D R 
Sbjct: 1491 PRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFE-TELRFSEREKMMDPRF 1549

Query: 1811 ALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSL 1990
            A DKL         + DKPL+R K+K                 KSRGDD ++EK RDRS+
Sbjct: 1550 ADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEKPRDRSI 1609

Query: 1991 ERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXX 2161
            ER+GRERSV+R+QERG+DR+F+RL   AKDERNKDDR+K+RY +AS EKSH         
Sbjct: 1610 ERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASAEKSH--------- 1660

Query: 2162 XXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341
                             +GRRDED DRR+G  RH+Q+LSP               +    
Sbjct: 1661 ----------------GAGRRDEDVDRRYGATRHSQRLSPRHEEKERRWSEETVVS---- 1700

Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EP 2515
                               + ++  ERER+KAN+ KE++DLN ASKRRKLKREH+P+ EP
Sbjct: 1701 QDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLPTDEP 1760

Query: 2516 GEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRD 2695
            GEY   +  P      +  ++DGRDRGDRKG I+Q P Y ++  LR H KEAASK  RRD
Sbjct: 1761 GEYSAVAHPPSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKLNRRD 1820

Query: 2696 ADPMYDREWDDDKRQRAEPKRRHRK 2770
            +DP+YDREW+D+KRQRA+ KRRHRK
Sbjct: 1821 SDPLYDREWEDEKRQRADQKRRHRK 1845


>emb|CBI26799.3| unnamed protein product [Vitis vinifera]
          Length = 1767

 Score =  937 bits (2422), Expect = 0.0
 Identities = 526/939 (56%), Positives = 612/939 (65%), Gaps = 33/939 (3%)
 Frame = +2

Query: 53   KPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXXXX 232
            KPI W +LLDT +TMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI         
Sbjct: 902  KPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHSALKA 961

Query: 233  XXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSCPD 412
                SDNS+SAI KRKKDKERIQESLDRLT E QKHEE+V SVRRRLA EKD WLSSCPD
Sbjct: 962  LEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLSSCPD 1021

Query: 413  TLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQPMI 592
            TLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQPMI
Sbjct: 1022 TLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQPMI 1081

Query: 593  CCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQFI 772
            CCCTEYE GRLGRFL+ET+K AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQFI
Sbjct: 1082 CCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQFI 1141

Query: 773  KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 952
            KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD
Sbjct: 1142 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 1201

Query: 953  EREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGAGL 1132
            EREDLK            RKPSWVTDEEFGMGYL++KPAP  ASK++++    L  G  +
Sbjct: 1202 EREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKTVASGTQHLDAGNSV 1261

Query: 1133 ------------------SVS-------QAEQMGGRTVSSG----SLHSDSGNLGREPRR 1225
                              SVS        A+  GG +V+      S+ S + + G     
Sbjct: 1262 KEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKGGSSVNGSDIQQSMPSAASHTGTSR-- 1319

Query: 1226 IDGDNLKQVEESANKQSEENS-KV-TKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKDED 1399
              G+N + V+ES N+  +E++ KV ++   E E R   KRS  +GSL KQ K D++KD+ 
Sbjct: 1320 -SGENQRPVDESTNRTLDESTVKVSSRASTESELRATGKRSLPSGSLTKQPKLDVAKDDS 1378

Query: 1400 KSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTEIKAEITNSKSSDSRVYGGKDE 1579
            KSGK VGR                +      R S       A   +  S+D R+   KD+
Sbjct: 1379 KSGKGVGRTSGSSTSDRDLPAHQLE-----GRQSGVTNVSSAGTADGSSADLRLSAVKDD 1433

Query: 1580 GTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDI 1759
            G E +D  + P+SR   SPR +N +A  KSGDK  KR SPAEE +R+NKRRKG+ + RD 
Sbjct: 1434 GNEVSD--RAPSSRPIHSPRHDN-SATIKSGDKQQKRTSPAEEPERVNKRRKGDTEVRDF 1490

Query: 1760 DGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXX 1939
            +G EVR S+KE     R          D     ++P                        
Sbjct: 1491 EG-EVRFSDKESERYER----------DHRERLERP-----------------------D 1516

Query: 1940 KSRGDDLLSEKLRDRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEAS 2119
            KSRGD++++EK RDRS+ERHGRERSV+RVQER ++R                        
Sbjct: 1517 KSRGDEMIAEKSRDRSMERHGRERSVERVQERSSER------------------------ 1552

Query: 2120 LEKSHVDDRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXX 2299
             +KSH DDRF                  SV++ RRDEDADRRFG ARH Q+LSP      
Sbjct: 1553 -KKSHADDRFHGQSLPPPPPLPPHMVPQSVTASRRDEDADRRFGTARHAQRLSP---RHE 1608

Query: 2300 XXXXXXNASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASK 2476
                  +    Q                     LSIK+++RER+KA++ KED+D + ASK
Sbjct: 1609 EKERRRSEEISQDDAKRRREDDIRERKREEREGLSIKVEDREREKASLLKEDMDPSAASK 1668

Query: 2477 RRKLKREHMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLR 2653
            RRKLKREHMPS E GEY PA+P PPP +I++ Q++DGR+RGDRKG +VQR GY ++PGLR
Sbjct: 1669 RRKLKREHMPSGEAGEYTPAAPPPPPPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLR 1728

Query: 2654 AHSKEAASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770
             H KE   K  RRDAD MYDREWDD+KRQRAE KRRHRK
Sbjct: 1729 IHGKEVTGKMARRDADQMYDREWDDEKRQRAEQKRRHRK 1767


>ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi|355525030|gb|AET05484.1|
            THO complex subunit [Medicago truncatula]
          Length = 2048

 Score =  931 bits (2406), Expect = 0.0
 Identities = 538/1013 (53%), Positives = 633/1013 (62%), Gaps = 103/1013 (10%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G +  PI+W +LLDTVKTMLPPK WNSLSPDLYATFWGLTLYDL+VP+NRYESEI     
Sbjct: 1063 GSNKNPISWSDLLDTVKTMLPPKTWNSLSPDLYATFWGLTLYDLHVPKNRYESEIAKLHA 1122

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI KRKK+KERIQESLDRL+ E  KHEE+V SV RRL HEKD WLS
Sbjct: 1123 NLKSLEELSDNSSSAITKRKKEKERIQESLDRLSSELHKHEENVVSVSRRLFHEKDKWLS 1182

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+ LHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1183 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHKLHSLGTPFFNTVNHIDVLICKTL 1242

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWK-----------------------SDESIYE 691
            QPMICCCTEYEVGRLGRFL+ETLK AYHWK                       SDESIYE
Sbjct: 1243 QPMICCCTEYEVGRLGRFLYETLKIAYHWKLFRACSIILIFTFIFVSSFYYLQSDESIYE 1302

Query: 692  KECGNMPGFAVYYRYPNSQRVTYGQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIML 871
            +ECGNMPGFAVYYR PN QRVTYGQFIKVHWKWSQRITRLLIQCLES+EYMEIRNALIML
Sbjct: 1303 RECGNMPGFAVYYRNPNGQRVTYGQFIKVHWKWSQRITRLLIQCLESSEYMEIRNALIML 1362

Query: 872  TKISSVFPVTRKSGINLEKRVAKIKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGY 1051
            TKISSVFPVTRKSGINLEKRVAKIKSDEREDLK            RKPSWVTDEEFGMGY
Sbjct: 1363 TKISSVFPVTRKSGINLEKRVAKIKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGY 1422

Query: 1052 LDIKPAPGPASKSLSANAVGLQNGAGLSVSQAEQMGGRTVSSG----------------- 1180
            L++KPAP   +KS + N+  +Q+G GL  SQ E   G+ + SG                 
Sbjct: 1423 LELKPAPS-MTKSAAGNSAAVQSGIGLQFSQTESASGKHLDSGNTVKDQTVKTKTADGKS 1481

Query: 1181 -------SLHSDSGN---------------------LGREPRRIDGDNLKQVEESANKQS 1276
                   +  SDSG+                      G+       +N KQVEES ++  
Sbjct: 1482 ERTESLTATKSDSGHGKLKGSSMVNGVDAQSSLASPAGQSGALKSVENQKQVEESISRAP 1541

Query: 1277 EENSKVTKTVVEPEARPVVK-RSAAAGSLAKQAKQDLSKDEDKSGKAV----GRXXXXXX 1441
            +E+  +T+ V   E+RP VK RS A GSL K +KQD  K++ +SGK V    G       
Sbjct: 1542 DEH--ITRNV---ESRPSVKQRSVATGSLLKPSKQDPLKEDGRSGKTVTRTSGSSSSDKD 1596

Query: 1442 XXXXXXXXXXKLTNSSTRSSDHNTEIKAEITNSKSSDSRVYGG----------------- 1570
                        TN S+  S +   +         + +  + G                 
Sbjct: 1597 LQTHASDGRHTGTNISSSFSANGNSVSGSAKGLAQAATTAFDGSGNESKAEVGAAKFSMV 1656

Query: 1571 KDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDS 1750
            KD+  E+ D  +  +SR   SPR EN TA SKS DK  KR    +E DRL KRRKG+ID 
Sbjct: 1657 KDDVNEFADFTRGSSSRVVHSPRHEN-TATSKSSDKIQKRAGSVDELDRLGKRRKGDIDL 1715

Query: 1751 RDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD--------DKPLDRAKEKTGXXXX 1906
            RD++G EVR SE+E+  D R  D       DK G D        DK L+R KEK      
Sbjct: 1716 RDLEG-EVRFSEREKLMDPRLAD-------DKVGPDELGVYRTGDKTLERPKEKGTDRYE 1767

Query: 1907 XXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQERGADRNFDRL---AKDER 2077
                       KSRGDD + EK RDRS+ER+GRERSV+RVQERG++R+F+RL   AKD+R
Sbjct: 1768 REHRERLDRLDKSRGDDFVVEKPRDRSIERYGRERSVERVQERGSERSFNRLPDKAKDDR 1827

Query: 2078 NKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNA 2257
            +KDDR+K+RY +A++EKSH + RF                  S+ +GRRDEDADRR+G  
Sbjct: 1828 SKDDRNKLRYNDATIEKSHAEGRFHGQSLPPPPPLPPNMVPQSLGAGRRDEDADRRYGAT 1887

Query: 2258 RHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKA 2437
            RH+Q+LSP                LQ                       +K++ERER+KA
Sbjct: 1888 RHSQRLSPRHEEKELRRSEETV-ILQDDPKRRKEDDFRDRKRE-----EMKVEEREREKA 1941

Query: 2438 NMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGV 2611
            ++ KE+ DLN ASKRRKLKREH+P+ EPGEY P   APP   I + Q++DGR   DRKG 
Sbjct: 1942 SILKEE-DLNAASKRRKLKREHLPTMEPGEYSPV--APPLSGIGMSQAYDGR---DRKGP 1995

Query: 2612 IVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770
            ++Q   Y ++P LR H KE ASK  RR++DP+YDREWDD+KRQRA+ KRRHRK
Sbjct: 1996 MIQHASYIDEPSLRIHGKEVASKLNRRESDPLYDREWDDEKRQRADQKRRHRK 2048


>ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi|223533086|gb|EEF34845.1|
            tho2 protein, putative [Ricinus communis]
          Length = 1828

 Score =  926 bits (2392), Expect = 0.0
 Identities = 526/949 (55%), Positives = 624/949 (65%), Gaps = 61/949 (6%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G S KPI W +LL+TVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI     
Sbjct: 893  GSSQKPIMWSDLLETVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRDRYESEIAKQHA 952

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                    SDNSSSAI+KRKKDKERIQESLDRLT E  KHEE+V SVRRRL+ EKD WLS
Sbjct: 953  ALKALEELSDNSSSAISKRKKDKERIQESLDRLTSELHKHEENVASVRRRLSREKDKWLS 1012

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1013 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1072

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTEYE GRLG+FL ETLK AY+WKSDESIYE+ECGNMPGFAVYYR+PNSQRVTY
Sbjct: 1073 QPMICCCTEYEAGRLGKFLHETLKIAYYWKSDESIYERECGNMPGFAVYYRFPNSQRVTY 1132

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQFIKVHWKWSQRI+RLLIQCLESTEYMEIRNALI+LTKIS VFPVT++SGINLEKRVA+
Sbjct: 1133 GQFIKVHWKWSQRISRLLIQCLESTEYMEIRNALILLTKISGVFPVTKRSGINLEKRVAR 1192

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IKSDEREDLK            RKPSWVTDEEFGMGYLDI+P    ASKS+S N    QN
Sbjct: 1193 IKSDEREDLKVLATSVASALAARKPSWVTDEEFGMGYLDIRPP--AASKSVSGNISVGQN 1250

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGRE----------------------PRRIDG 1234
             +GL+ SQ E  GGR VS+ + H D GN  +E                       +++ G
Sbjct: 1251 SSGLNASQGESAGGRAVSTTTQHGDVGNSAKEHISRAKPADKQESVSYVKSDSVNQKVKG 1310

Query: 1235 DNL---KQVEESA--------NKQSEENSK-----------VTKTVVEPEARPVVKRSAA 1348
             +L     ++ SA          +S EN K             K   E E++   KR+  
Sbjct: 1311 GSLVIQSDLQSSAALVTGQAGASRSAENQKQMSESPIIIPDAPKNSAESESKASGKRAMP 1370

Query: 1349 AGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTEIKAE 1528
            AGS+ K  +QD++KD+ KSGK VGR                  ++ S     + T + + 
Sbjct: 1371 AGSV-KTPRQDVAKDDLKSGKTVGRVPVASSSDKDMP------SHLSESRLGNGTNVSST 1423

Query: 1529 ITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQE-NLTAASKSGDKPHKRLSPAE 1705
             T++  +   V   KD+ TE  D  K P SR   SPR + +  ++SKS DK  KR SP +
Sbjct: 1424 GTSNDGAAKSVV--KDDATEVGDVQK-PPSRVVHSPRHDGSFASSSKSSDKLQKRASPGD 1480

Query: 1706 EHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD-------DK 1864
            + DRL+KRRKG+ + RD+DG ++R S++ER  D R +D       DK GSD       DK
Sbjct: 1481 DPDRLSKRRKGDTELRDLDG-DIRFSDRERPMDSRLVD------LDKIGSDERVHRSMDK 1533

Query: 1865 PLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQER-GA 2041
            PLDR+K+K                 KSRGDD+L E+ RDRS+ER+GRERSV+R QER GA
Sbjct: 1534 PLDRSKDKGMERYDRDHRERSERPDKSRGDDILVERPRDRSMERYGRERSVERGQERGGA 1593

Query: 2042 DRNFDRLA---KDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXXSVS 2212
            DR+FDR +   KDERNKD   KVRYG+ S+EK H DDRF                  SV+
Sbjct: 1594 DRSFDRFSDKTKDERNKD---KVRYGDTSVEKLH-DDRFYGQNLPPPPPLPPHVVPQSVT 1649

Query: 2213 SGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXXXXX 2392
            + RRDEDADRR G+ARH+ +LSP            N+   Q                   
Sbjct: 1650 ASRRDEDADRRIGSARHSLRLSPRHDEKERRRSEENSLVSQDDVKRGRDDNFRDRKRDER 1709

Query: 2393 XALSIKMDERERDKANMN---KEDIDLN-ASKRRKLKREHMPS-EPGEYLPASPAPPPVS 2557
              L++K+++RERD+       K+DID+  ASKRRKLKREHMPS E GEY P +P PPP++
Sbjct: 1710 EGLAMKVEDRERDREREKVPLKDDIDVGAASKRRKLKREHMPSGEAGEYSPVAPPPPPLA 1769

Query: 2558 INLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADP 2704
            I++ QS+DGR+RGDR G ++QR GY E+P +R H KE A K TRRDADP
Sbjct: 1770 ISMSQSYDGRERGDR-GALIQRAGYLEEPPMRIHGKEVAGKMTRRDADP 1817


>ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fragaria vesca subsp. vesca]
          Length = 1860

 Score =  898 bits (2320), Expect = 0.0
 Identities = 512/987 (51%), Positives = 619/987 (62%), Gaps = 77/987 (7%)
 Frame = +2

Query: 41   GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220
            G +  PI WL+LLDT KTMLP +AWNSLSPDLYATFWGLTLYDLYVPRN Y SEI     
Sbjct: 897  GSTWNPITWLDLLDTAKTMLPARAWNSLSPDLYATFWGLTLYDLYVPRNCYISEIAKQQA 956

Query: 221  XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400
                     DNS S I +RKK+KERIQE++DRL  E +KHEEHV SVR+RL  EKD WLS
Sbjct: 957  SIKALEEQPDNSISEILRRKKEKERIQETIDRLISESRKHEEHVASVRKRLLREKDKWLS 1016

Query: 401  SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580
            SCPDTLKINMEFLQRCIFPRCTFSMPDAVY A FV+TLH+LGTPFFNTVNH+DVLIC+TL
Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYSAMFVHTLHTLGTPFFNTVNHMDVLICRTL 1076

Query: 581  QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760
            QPMICCCTE EVGRLG+FL ETLK AY+WKSDESIYE+ECGNMPGFAVYYR+P+SQRV Y
Sbjct: 1077 QPMICCCTESEVGRLGKFLCETLKIAYYWKSDESIYERECGNMPGFAVYYRFPDSQRVRY 1136

Query: 761  GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940
            GQF+KVHWKWSQRITRLL QCLESTEYMEIRNALI+L++ISSVFPVTRKS +NLEKRV+K
Sbjct: 1137 GQFVKVHWKWSQRITRLLGQCLESTEYMEIRNALIILSRISSVFPVTRKSALNLEKRVSK 1196

Query: 941  IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120
            IK D REDLK            RKPS V+DEEF MGY+++K A   +SK L++N+  + +
Sbjct: 1197 IKGDGREDLKVLATSVGASLAARKPSLVSDEEFCMGYVELKSAS--SSKPLASNSGAIHS 1254

Query: 1121 GAGLSVSQAEQMGGRTVSSGSLH-------------------------------SDSGNL 1207
            G  ++ SQ E  GG+  +  S H                               SD G+L
Sbjct: 1255 GPAVNNSQTEPAGGKAGTLVSQHAELIDSARDHVSKAKPADGRSERAESVSTAKSDPGHL 1314

Query: 1208 GREPRRIDG---------------------DNLKQVEESANKQSEENSK--VTKTVVEPE 1318
              +   +                       +N  Q+ E++ +++EEN+     K   E E
Sbjct: 1315 KHKGASLVNGSDAQASVPSATLQAGTARPIENQVQLNETSTRRAEENTGKLAAKNTSESE 1374

Query: 1319 ARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRS 1498
             R   KRS  AG  AK  KQDL KDE +SGKA G                  + +    S
Sbjct: 1375 LRAQAKRSVPAG--AKPLKQDLVKDESRSGKAAGATNVSSITANGST-----VPSLGKGS 1427

Query: 1499 SDHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDK 1678
            +    E K E  ++K S++R+   K+EG E +D  + P+SR   SPR ++    SKS DK
Sbjct: 1428 ASLGIESKVEAGSAKISNTRIPSSKEEGAEVSDVARPPSSRFVNSPRHDSSATLSKSSDK 1487

Query: 1679 PHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD 1858
              KR  PAEE DR +KRRKGE + RD +G E RLS++ERS D R LD       DK+GSD
Sbjct: 1488 LQKRTGPAEETDRQSKRRKGEAEMRDSEG-EARLSDRERSVDARLLD------LDKSGSD 1540

Query: 1859 DKPL--------DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERS 2014
            D+ +        DR+K+K                 KSRGDDL+ E+ RDRS+ERHGR+ S
Sbjct: 1541 DRSVYKATEKASDRSKDKGNERHDKDHRERADRPDKSRGDDLV-ERSRDRSMERHGRDHS 1599

Query: 2015 VDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXX 2194
             +++QERG+DR+FDRL   E++KD++ K RY + S EKSHVD+R+               
Sbjct: 1600 AEKLQERGSDRSFDRLP--EKSKDEKGKGRYSDISTEKSHVDERYHGQSLPPPPPLPPHI 1657

Query: 2195 XXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXX 2374
               SVSSGRRDED+DRR    RHTQ+LSP            N+S  Q             
Sbjct: 1658 VPQSVSSGRRDEDSDRRT-TTRHTQRLSPRHDEKERRRSEENSSISQDDSKRRREDDFRE 1716

Query: 2375 XXXXXXXALSIKMDERERD--------------KANMNKEDIDL-NASKRRKLKREHMPS 2509
                    +S+K+DER+RD              KAN++KED D+  ASKRRKLKR+    
Sbjct: 1717 RKRDDREGISVKVDERDRDRDRDREREREKEREKANLSKEDPDMIAASKRRKLKRDLSSV 1776

Query: 2510 EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATR 2689
            E GEY P  P PPP+SINL QS+DGRDRG+RKG IV R GY E+P LR H KE ++K TR
Sbjct: 1777 EAGEYSPVHP-PPPLSINLSQSYDGRDRGERKGPIVARTGYVEEPSLRIHGKEVSNKMTR 1835

Query: 2690 RDADPMYDREWDDDKRQRAEPKRRHRK 2770
            RD DPMY  EWDDDKR R E KRRHRK
Sbjct: 1836 RDTDPMY--EWDDDKR-RGEQKRRHRK 1859


Top