BLASTX nr result

ID: Akebia27_contig00003217 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00003217
         (6552 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citru...  2471   0.0  
ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis...  2441   0.0  
emb|CBI26799.3| unnamed protein product [Vitis vinifera]             2419   0.0  
ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prun...  2411   0.0  
ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma c...  2407   0.0  
ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma c...  2403   0.0  
ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma c...  2383   0.0  
ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma c...  2360   0.0  
ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucum...  2358   0.0  
ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isofor...  2357   0.0  
ref|XP_007160466.1| hypothetical protein PHAVU_002G324500g [Phas...  2332   0.0  
ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isofor...  2328   0.0  
ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi...  2321   0.0  
ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa]...  2311   0.0  
ref|XP_006826587.1| hypothetical protein AMTR_s00138p00044110 [A...  2277   0.0  
ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa]...  2248   0.0  
ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solan...  2220   0.0  
ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solan...  2205   0.0  
ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fraga...  2197   0.0  
ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|50870943...  2196   0.0  

>ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citrus sinensis]
          Length = 1874

 Score = 2471 bits (6405), Expect = 0.0
 Identities = 1307/1900 (68%), Positives = 1471/1900 (77%), Gaps = 46/1900 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLP I+ KYITEE LREWKNGN  F+  + +P+LRFLYELC   VRGELP QKCKAA+D
Sbjct: 1    MSLPQIQCKYITEECLREWKNGNPSFRVPDPVPMLRFLYELCSITVRGELPFQKCKAAVD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF EK S+  + S FADIVTQMAQDLTM  E+R RLIK+AKWLVESALVPL+LFQERC
Sbjct: 61   SVEFVEKPSHRVVASTFADIVTQMAQDLTMPGEHRVRLIKLAKWLVESALVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC +   A
Sbjct: 121  EEEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCHTYENA 180

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             TE+ S AT+ IIKSLIGHFDLDPNRVFDIVLEC+ELQ +N VFL+LIPIFPKSHAS IL
Sbjct: 181  -TESASAATIGIIKSLIGHFDLDPNRVFDIVLECYELQPNNKVFLELIPIFPKSHASHIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN PVPF LY+LTA+LVK +FIDLDSIY HLLPKD++AFEHY+A SAKR
Sbjct: 240  GFKFQYYQRMEVNSPVPFSLYKLTALLVKEEFIDLDSIYTHLLPKDDEAFEHYNAFSAKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+D+E+EAVAE SPELEN+Q LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDLENEAVAERSPELENSQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHAHILF+RL+PLNPVAHIQIC+GL RLIE SIS AYDI+RQ HLQ FG  S
Sbjct: 360  GFLSVDDWYHAHILFERLAPLNPVAHIQICDGLLRLIENSISSAYDIVRQTHLQSFGSFS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G+G DA++  D + + RSF+DLPKELF+MLA+ GPYLYR+ VLLQKVCRVLRGYY SALE
Sbjct: 420  GAGIDAMDTADLT-VHRSFIDLPKELFEMLATLGPYLYRDTVLLQKVCRVLRGYYFSALE 478

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+  G  A NPE     +R PR HLKEA+LRVEEALG CLLPSLQLIPANPAVGQEIWEV
Sbjct: 479  LVNCGDGAPNPEPLMDRNRVPRQHLKEARLRVEEALGACLLPSLQLIPANPAVGQEIWEV 538

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER PMVL ARQT+KLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 539  MNLLPYEVRYRLYGEWEKDDERNPMVLAARQTSKLDTRRILKRLAKENLKQLGRMVAKLA 598

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR+KLKDDG
Sbjct: 599  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGRDKLKDDG 658

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLK+              QMANVQY
Sbjct: 659  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKRGKGIELVLLQELIQQMANVQY 718

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TE+QLDAMAGSETLRYQAT FG+TRNNKAL KSTNRL+DS                
Sbjct: 719  TENLTEDQLDAMAGSETLRYQATSFGVTRNNKALIKSTNRLKDSLLPRDEPKLAIPLLLL 778

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRSVVVIN+DAP+IKMVCE+FDRCHGTLLQYVEFLCSA+T   +YAQLIP L+DLVH
Sbjct: 779  IAQHRSVVVINADAPYIKMVCEEFDRCHGTLLQYVEFLCSAVTPATAYAQLIPSLNDLVH 838

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
            +YHLDPEVAFL++RPVMRL+KC   S V WP DD E  + + I            ++LDL
Sbjct: 839  QYHLDPEVAFLIFRPVMRLFKCQGSSAVFWPLDDGEAANNTTINSESEPSEDIGNVILDL 898

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GS QKP+MWSDLLDTV+TMLPSKAWNSLSPDLY TFWGLTLYDLYVPR RYESEI+KQH+
Sbjct: 899  GS-QKPVMWSDLLDTVKTMLPSKAWNSLSPDLYTTFWGLTLYDLYVPRDRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEELSDNS+SAITKRKKDKERIQE LDRLT EL KHE+NV SVRRRLSREKD WLS
Sbjct: 958  ALKALEELSDNSSSAITKRKKDKERIQESLDRLTNELHKHEENVASVRRRLSREKDRWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLG+FLFETLK+AYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGKFLFETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLESAEYMEIRNALI+LTKIS VFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESAEYMEIRNALILLTKISGVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVDXXXXXXXXXXXXA--IAVPNG 2452
            IK DEREDLK            RKS WVT+EEFGMG ++            +  +    G
Sbjct: 1198 IKNDEREDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQG 1257

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
            SA+N+SQ E                GN +K+  +R KP DG+LERTE+ + +K+D    K
Sbjct: 1258 SAINVSQSEP-------------GTGNSVKDHISRAKPGDGRLERTESISHVKSD--NVK 1302

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDE-------SAKGTAESELRPTIRR 2113
            LK  SL NG D  S + + AVQ+  SR  E QK  DE       + K +AESE + +++R
Sbjct: 1303 LKGSSLTNGSDIHSSVPSTAVQAEMSRVVENQKQVDEDENMAKVAMKNSAESESKASVKR 1362

Query: 2112 SVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTST 1933
            SV + SLTK PKQ ++AK+DNKS K            N+RD S+H A+G+   GGA T +
Sbjct: 1363 SVPSASLTKAPKQ-DLAKDDNKSAKAVGRTSGSSA--NDRDFSSHAAEGKQ--GGATTVS 1417

Query: 1932 AIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI--DDSEKLRPLP-- 1765
            + A  +    S K S++S+R S DMHG+ +  + G  KSSE+R S    D  ++   P  
Sbjct: 1418 SAAAVTANLVSAKGSSSSSRAS-DMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKS 1476

Query: 1764 --SRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SR  HSP  D S   SKS D+LQKRTSP E+  +PSKR KGD E++D + E R  DRE
Sbjct: 1477 SSSRAMHSPRHDSSVATSKSGDRLQKRTSPSEDPDRPSKRYKGDTELRDSDGEVRVPDRE 1536

Query: 1593 RSVDPRLPHSDLERTGIDDQSLHR----SKDKGNGRYDRDHRERLERQDKSTGEDILVEK 1426
            RS DPR   +DL++ G D+QS++R    SKDKGN RY+RDHRERL+R DKS  +DI+ EK
Sbjct: 1537 RSADPRF--ADLDKIGTDEQSMYRTTDRSKDKGNERYERDHRERLDRLDKSRVDDIIPEK 1594

Query: 1425 SRDRSMERYGRERSVERVS-----------FXXXXXXXXXXXRGKPRYNETSGEK--YDD 1285
             RDRSMERYGRERSVER                         R K RYN++S EK   D+
Sbjct: 1595 QRDRSMERYGRERSVERGQERGADRAFDRLADKAKDDRNKDDRSKLRYNDSSSEKSHVDE 1654

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDY 1105
            RFH QS        P++VPQSVNA RRDEDAD+R  +TRH QRLSP           E+ 
Sbjct: 1655 RFHGQSLPPPPPLPPHIVPQSVNAGRRDEDADKRFGSTRHSQRLSPRHDEKERRRSEENS 1714

Query: 1104 LVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD----------KINLLKEEMDANAA-S 958
            LV+Q                   +GLS K+DER+          K NLLKEEMDANAA S
Sbjct: 1715 LVSQDDAKRRREDDFRDRKREDREGLSLKMDERERERDRDRDREKANLLKEEMDANAAAS 1774

Query: 957  KRRKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMP 781
            KRRKLKR+HL SGEAGEYS +APP PPLA+G+ Q YDGRDRGDRKG  +QR GY+EE   
Sbjct: 1775 KRRKLKREHLPSGEAGEYSPVAPPYPPLAIGISQSYDGRDRGDRKGATMQRTGYMEEQSM 1834

Query: 780  RMHGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            R+HGKE A+K+ RRD + IY+REWEDEKRQR EQKRRHRK
Sbjct: 1835 RIHGKEVATKMARRDSELIYEREWEDEKRQRAEQKRRHRK 1874


>ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis vinifera]
          Length = 1849

 Score = 2441 bits (6326), Expect = 0.0
 Identities = 1304/1877 (69%), Positives = 1458/1877 (77%), Gaps = 45/1877 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  ++T++ LREWK+GN  FK S ++P+LRFLYELC T+VRGELP+ KCK ALD
Sbjct: 1    MSLPPIECIHVTDDCLREWKSGNPSFKVSGTVPMLRFLYELCSTLVRGELPLHKCKVALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF++K ++EEL S FADIVTQMA DLTM  E R RLIK+AKWLVES LVPL+LFQERC
Sbjct: 61   SVEFSDKEADEELASNFADIVTQMALDLTMPGENRARLIKLAKWLVESTLVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWESEMIKIKA +LK KEVRVNTRLLYQQTKFNL+REESEGY+KLVTLLCQ G+ +
Sbjct: 121  EEEFLWESEMIKIKAQELKNKEVRVNTRLLYQQTKFNLVREESEGYSKLVTLLCQ-GSES 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             ++N S AT+ IIKSLIGHFDLDPNRVFDIVLECFE Q DN+VFLDLIPIFPKSHASQIL
Sbjct: 180  SSQNASAATIGIIKSLIGHFDLDPNRVFDIVLECFEHQPDNSVFLDLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFK+QYYQRMEVN+ VP GLY+LTA+LVK +FIDLDSIYAHLLPKDE+AFEHY+  SAKR
Sbjct: 240  GFKYQYYQRMEVNNRVPPGLYQLTALLVKEEFIDLDSIYAHLLPKDEEAFEHYNVFSAKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE S ELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERSSELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFL+VDDWYHAHILFDRLSPLNPVAHI+IC GL RLIEKSIS AY I+ Q HL+ FG  S
Sbjct: 360  GFLAVDDWYHAHILFDRLSPLNPVAHIEICNGLLRLIEKSISTAYGIVHQAHLESFGL-S 418

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
             SGSD +E T+ SS+ RSF+DLPKELFQMLA  GPY YR+ +LLQKVCRVLRGYYLSALE
Sbjct: 419  SSGSDLMETTN-SSVNRSFIDLPKELFQMLACVGPYFYRDTILLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ SG  A NPES   G+R PRLHLKEA+ R+EEALGTCLLPSLQLIPANPAV QEIWEV
Sbjct: 478  LVRSGDGAYNPESGVGGNRVPRLHLKEARSRIEEALGTCLLPSLQLIPANPAVCQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDERIP+VL ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERIPVVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGKGIELLLLQELIQQMANVQY 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRYQATSFGITRNNKALIKSTNRLRDSLLPKEEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRSVV+IN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T   +YAQLIPPL++LVH
Sbjct: 778  IAQHRSVVLINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPATAYAQLIPPLEELVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXSA-KLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC S S++ WP DD+E T+MS          S+ +++LDL
Sbjct: 838  MYHLDPEVAFLIYRPVMRLFKCRSSSNIFWPLDDVESTNMSTAEKESEPTDSSGEVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P KPI+WSDLLDT RTMLPSKAWNSLSPDLYATFWGLTLYDLYVPR RYESEI+KQHS
Sbjct: 898  GPPWKPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHS 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEELSDNSNSAITKRKKDKERIQE LDRLT ELQKHE+NV SVRRRL+REKD WLS
Sbjct: 958  ALKALEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ET+K+AY+WKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARK SWVT+EEFGMG ++                +AVPN
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKSLAGNLVAVPN 1257

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
            GS LNI Q E+ G R   SG+Q  D GN +KEQ  R K VDG+LERTE+ +L+K+D    
Sbjct: 1258 GSGLNIFQNESSGGRTVASGTQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHA 1317

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDE-------------SAKGTAESE 2134
            K+K GS  NG D Q  M +AA  +GTSR+ E Q+  DE             S++ + ESE
Sbjct: 1318 KVKGGSSVNGSDIQQSMPSAASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESE 1377

Query: 2133 LRPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSA 1954
            LR T +RS+ +GSLTKQPK  +VAK+D+KSGK           +++RDL AH  +GR S 
Sbjct: 1378 LRATGKRSLPSGSLTKQPK-LDVAKDDSKSGK--GVGRTSGSSTSDRDLPAHQLEGRQS- 1433

Query: 1953 GGANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLR 1774
            G  N S+A              TA         GSV   +   V               R
Sbjct: 1434 GVTNVSSA-------------GTAD--------GSVVKDDGNEVSD-------------R 1459

Query: 1773 PLPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDR 1597
               SRP HSP  D+S T  KS DK QKRTSP EE  + +KRRKGD EV+D E E RFSD+
Sbjct: 1460 APSSRPIHSPRHDNSATI-KSGDKQQKRTSPAEEPERVNKRRKGDTEVRDFEGEVRFSDK 1518

Query: 1596 ERSVDPRLPHS---DLERTGIDDQSLHRS--------KDKGNGRYDRDHRERLERQDKST 1450
            ERS+DPRL  S   DL+++G D+Q + R+        KDKG+ RY+RDHRERLER DKS 
Sbjct: 1519 ERSMDPRLDKSHAVDLDKSGTDEQGISRATDKPSDRLKDKGSERYERDHRERLERPDKSR 1578

Query: 1449 GEDILVEKSRDRSMERYGRERSVERV-------SF----XXXXXXXXXXXRGKPRYNETS 1303
            G++++ EKSRDRSMER+GRERSVERV       SF               RGK RY+ETS
Sbjct: 1579 GDEMIAEKSRDRSMERHGRERSVERVQERSSERSFDRLTDKVKDERNKDDRGKMRYSETS 1638

Query: 1302 GEK--YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXX 1129
             EK   DDRFH QS        P+MVPQSV ASRRDEDADRR    RH QRLSP      
Sbjct: 1639 VEKSHADDRFHGQSLPPPPPLPPHMVPQSVTASRRDEDADRRFGTARHAQRLSPRHEEKE 1698

Query: 1128 XXXXXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD--ERDKINLLKEEMDANAASK 955
                 E   ++Q                   +GLS KV+  ER+K +LLKE+MD +AASK
Sbjct: 1699 RRRSEE---ISQDDAKRRREDDIRERKREEREGLSIKVEDREREKASLLKEDMDPSAASK 1755

Query: 954  RRKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPR 778
            RRKLKR+H+ SGEAGEY+  APP PP A+ M Q YDGR+RGDRKG M+QR GYL+EP  R
Sbjct: 1756 RRKLKREHMPSGEAGEYTPAAPPPPPPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLR 1815

Query: 777  MHGKETASKITRRDGDQ 727
            +HGKE   K+ RRD DQ
Sbjct: 1816 IHGKEVTGKMARRDADQ 1832


>emb|CBI26799.3| unnamed protein product [Vitis vinifera]
          Length = 1767

 Score = 2419 bits (6268), Expect = 0.0
 Identities = 1288/1874 (68%), Positives = 1439/1874 (76%), Gaps = 20/1874 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  ++T++ LREWK+GN  FK S ++P+LRFLYELC T+VRGELP+ KCK ALD
Sbjct: 1    MSLPPIECIHVTDDCLREWKSGNPSFKVSGTVPMLRFLYELCSTLVRGELPLHKCKVALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF++K ++EEL S FADIVTQMA DLTM  E R RLIK+AKWLVES LVPL+LFQERC
Sbjct: 61   SVEFSDKEADEELASNFADIVTQMALDLTMPGENRARLIKLAKWLVESTLVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWESEMIKIKA +LK KEVRVNTRLLYQQTKFNL+REESEGY+KLVTLLCQ G+ +
Sbjct: 121  EEEFLWESEMIKIKAQELKNKEVRVNTRLLYQQTKFNLVREESEGYSKLVTLLCQ-GSES 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             ++N S AT+ IIKSLIGHFDLDPNRVFDIVLECFE Q DN+VFLDLIPIFPKSHASQIL
Sbjct: 180  SSQNASAATIGIIKSLIGHFDLDPNRVFDIVLECFEHQPDNSVFLDLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFK+QYYQRMEVN+ VP GLY+LTA+LVK +FIDLDSIYAHLLPKDE+AFEHY+  SAKR
Sbjct: 240  GFKYQYYQRMEVNNRVPPGLYQLTALLVKEEFIDLDSIYAHLLPKDEEAFEHYNVFSAKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE S ELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERSSELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFL+VDDWYHAHILFDRLSPLNPVAHI+IC GL RLIEKSIS AY I+ Q HL+ FG  S
Sbjct: 360  GFLAVDDWYHAHILFDRLSPLNPVAHIEICNGLLRLIEKSISTAYGIVHQAHLESFGL-S 418

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
             SGSD +E T+ SS+ RSF+DLPKELFQMLA  GPY YR+ +LLQKVCRVLRGYYLSALE
Sbjct: 419  SSGSDLMETTN-SSVNRSFIDLPKELFQMLACVGPYFYRDTILLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ SG  A NPES   G+R PRLHLKEA+ R+EEALGTCLLPSLQLIPANPAV QEIWEV
Sbjct: 478  LVRSGDGAYNPESGVGGNRVPRLHLKEARSRIEEALGTCLLPSLQLIPANPAVCQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDERIP+VL ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERIPVVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGKGIELLLLQELIQQMANVQY 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRYQATSFGITRNNKALIKSTNRLRDSLLPKEEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRSVV+IN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T   +YAQLIPPL++LVH
Sbjct: 778  IAQHRSVVLINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPATAYAQLIPPLEELVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXSA-KLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC S S++ WP DD+E T+MS          S+ +++LDL
Sbjct: 838  MYHLDPEVAFLIYRPVMRLFKCRSSSNIFWPLDDVESTNMSTAEKESEPTDSSGEVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P KPI+WSDLLDT RTMLPSKAWNSLSPDLYATFWGLTLYDLYVPR RYESEI+KQHS
Sbjct: 898  GPPWKPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHS 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEELSDNSNSAITKRKKDKERIQE LDRLT ELQKHE+NV SVRRRL+REKD WLS
Sbjct: 958  ALKALEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ET+K+AY+WKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVDXXXXXXXXXXXXAIAVPNGSA 2446
            IK DEREDLK           ARK SWVT+EEFGMG                        
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGY----------------------- 1234

Query: 2445 LNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTKLK 2266
            L +    ++  +   SG+Q  D GN +KEQ  R K VDG+LERTE+ +L+K+D    K+K
Sbjct: 1235 LELKPAPSLASKTVASGTQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHAKVK 1294

Query: 2265 SGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES-------------AKGTAESELRP 2125
             GS  NG D Q  M +AA  +GTSR+ E Q+  DES             ++ + ESELR 
Sbjct: 1295 GGSSVNGSDIQQSMPSAASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESELRA 1354

Query: 2124 TIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGA 1945
            T +RS+ +GSLTKQPK  +VAK+D+KSGK            ++RDL AH  +GR S G  
Sbjct: 1355 TGKRSLPSGSLTKQPKL-DVAKDDSKSGKGVGRTSGSST--SDRDLPAHQLEGRQS-GVT 1410

Query: 1944 NTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKL--RP 1771
            N S+A              TA         GS A++   AVK        DD  ++  R 
Sbjct: 1411 NVSSA-------------GTAD--------GSSADLRLSAVK--------DDGNEVSDRA 1441

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SRP HSP  D+S T  KS DK QKRTSP EE  + +KRRKGD EV+D E E RFSD+E
Sbjct: 1442 PSSRPIHSPRHDNSATI-KSGDKQQKRTSPAEEPERVNKRRKGDTEVRDFEGEVRFSDKE 1500

Query: 1593 RSVDPRLPHSDLERTGIDDQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEKSRDR 1414
                                           RY+RDHRERLER DKS G++++ EKSRDR
Sbjct: 1501 SE-----------------------------RYERDHRERLERPDKSRGDEMIAEKSRDR 1531

Query: 1413 SMERYGRERSVERVSFXXXXXXXXXXXRGKPRYNETSGEKYDDRFHRQSXXXXXXXXPNM 1234
            SMER+GRERSVERV               + R +E      DDRFH QS        P+M
Sbjct: 1532 SMERHGRERSVERV---------------QERSSERKKSHADDRFHGQSLPPPPPLPPHM 1576

Query: 1233 VPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDYLVAQXXXXXXXXXXXXX 1054
            VPQSV ASRRDEDADRR    RH QRLSP           E   ++Q             
Sbjct: 1577 VPQSVTASRRDEDADRRFGTARHAQRLSPRHEEKERRRSEE---ISQDDAKRRREDDIRE 1633

Query: 1053 XXXXXXDGLSTKVD--ERDKINLLKEEMDANAASKRRKLKRDHLSSGEAGEYS-LAPPQP 883
                  +GLS KV+  ER+K +LLKE+MD +AASKRRKLKR+H+ SGEAGEY+  APP P
Sbjct: 1634 RKREEREGLSIKVEDREREKASLLKEDMDPSAASKRRKLKREHMPSGEAGEYTPAAPPPP 1693

Query: 882  PLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITRRDGDQIYDREWED 703
            P A+ M Q YDGR+RGDRKG M+QR GYL+EP  R+HGKE   K+ RRD DQ+YDREW+D
Sbjct: 1694 PPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLRIHGKEVTGKMARRDADQMYDREWDD 1753

Query: 702  EKRQRPEQKRRHRK 661
            EKRQR EQKRRHRK
Sbjct: 1754 EKRQRAEQKRRHRK 1767


>ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica]
            gi|462413245|gb|EMJ18294.1| hypothetical protein
            PRUPE_ppa000084mg [Prunus persica]
          Length = 1878

 Score = 2411 bits (6248), Expect = 0.0
 Identities = 1276/1907 (66%), Positives = 1475/1907 (77%), Gaps = 53/1907 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPP+E  Y+ E+ +REWKNG ++FK ++ +P+LRFLYELC TMV GELP+QKCKAALD
Sbjct: 1    MSLPPVERAYVREDCVREWKNGTSNFKLADPVPMLRFLYELCSTMVSGELPLQKCKAALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF++K S+EEL S FADIVTQ++QD+ M  E+R RLIK+AKWLVES+LVPL+LFQERC
Sbjct: 61   SVEFSDKVSDEELASSFADIVTQLSQDIRMPGEHRARLIKLAKWLVESSLVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKA +LK+KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ+   +
Sbjct: 121  EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQNSETS 180

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
               + + AT+ IIKSLIGHFDLDPN VFDIVLE FELQ D+NVFL+LIPIFPKSHASQIL
Sbjct: 181  --SHNAAATIGIIKSLIGHFDLDPNHVFDIVLEYFELQPDSNVFLELIPIFPKSHASQIL 238

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQR+EVN PVPFGLY+LTA+LVK +FIDLDSIYAHLLPKD++AFEHY A S+KR
Sbjct: 239  GFKFQYYQRLEVNSPVPFGLYKLTALLVKEEFIDLDSIYAHLLPKDDEAFEHYSAFSSKR 298

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLMDDEKQGDVTIDL+AA+DME+EAV E S E ENNQ LGLL+
Sbjct: 299  LDEANKIGKINLAATGKDLMDDEKQGDVTIDLFAALDMETEAVGERSTECENNQTLGLLT 358

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSV+DWYHAH+LF+RLSPL+PV HIQIC  LFRLIEK+IS AYD +R+ HL  FG  S
Sbjct: 359  GFLSVNDWYHAHLLFERLSPLHPVEHIQICNSLFRLIEKTISSAYDTVRRAHLLSFGSSS 418

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G+  D +  T+ SS   SFVDLPKELFQMLA AGPYLYR+ +LLQKVCRVLRGYY SAL+
Sbjct: 419  GTSVDVIH-TENSSRHGSFVDLPKELFQMLACAGPYLYRDTLLLQKVCRVLRGYYSSALD 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ SG R  +P     G+  PRLHLKEAK R+EEALGTCLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVSSGERVVDPSYVFVGN--PRLHLKEAKSRIEEALGTCLLPSLQLVPANPAVGQEIWEV 535

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEK+DERIPMVL ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 536  MSLLPYEVRYRLYGEWEKEDERIPMVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 595

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR+KLK+DG
Sbjct: 596  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGRDKLKEDG 655

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LN+SDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANV Y
Sbjct: 656  LNVSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIQQMANVHY 715

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TE+QLDAMAGSETLRYQAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 716  TENLTEDQLDAMAGSETLRYQATSFGVTRNNKALIKSTNRLRDSLLPKDESKLAIPLLLL 775

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRSVV+I++DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T T++YAQLIP LDDLVH
Sbjct: 776  LAQHRSVVIIDADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPTSAYAQLIPSLDDLVH 835

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXSA-KLVLDL 3526
            +YHLDPEVAFL+YRPVMRL+KC   SDV WP D+ +   +++          +  LVLD+
Sbjct: 836  QYHLDPEVAFLIYRPVMRLFKCRGSSDVFWPLDNSDTQGITSANSESEAAEHSGNLVLDV 895

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GSP KP+ W DLL+TV+TMLP KAWNSLSPDLYATFWGLTLYDLYVPR+ YESEI+KQH+
Sbjct: 896  GSPSKPVTWLDLLNTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNCYESEIAKQHA 955

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEELSDNS+SAITKRKKDKERIQE LDRLT EL+KHE+NV SVR+RLSREKD WLS
Sbjct: 956  ALKALEELSDNSSSAITKRKKDKERIQESLDRLTSELRKHEENVASVRKRLSREKDKWLS 1015

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKIN+EFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHID+LIC+TL
Sbjct: 1016 SCPDTLKINVEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDILICRTL 1075

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYE GR G+FL ETLK+AY+WK DESIYERECGNMPGFAVYYR+PNSQRV Y
Sbjct: 1076 QPMICCCTEYEVGRFGKFLQETLKIAYYWKKDESIYERECGNMPGFAVYYRHPNSQRVAY 1135

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QF++VHWKWS RIT+LLIQCLES EYMEIRNALI+L+KISSVFPVTRKTG+NLEKRV+K
Sbjct: 1136 FQFMKVHWKWSQRITKLLIQCLESTEYMEIRNALILLSKISSVFPVTRKTGVNLEKRVSK 1195

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNG 2452
            IK DEREDLK           ARKSSW+T+EEFG G ++              + A  +G
Sbjct: 1196 IKADEREDLKVLATGVAAALAARKSSWITDEEFGNGYLELKSAPLASKSSAGNSAATHSG 1255

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
            S +NISQ E +G + G   SQ  +  N +K+Q  + K  DG+LER E+ + +K+D G  K
Sbjct: 1256 STINISQSEPIGGKVGALPSQHPESSNSVKDQILKTKTSDGRLERVESISTVKSDQGHLK 1315

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESA-------------KGTAESEL 2131
            LK GSL +G D QS+M++ A+QSGTSR+ E +K  +ES+             K ++ESEL
Sbjct: 1316 LKVGSLVSGSDGQSLMSSPALQSGTSRSMENKKQVNESSNRTSDENMGKAAPKNSSESEL 1375

Query: 2130 RPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAG 1951
            R   +RS  AGSL K PK Q++AK+D +SGK              RD+  H      SA 
Sbjct: 1376 RAQAKRSGPAGSLAKPPK-QDLAKDDGRSGKGIG-----------RDVLCH-----ASAV 1418

Query: 1950 GANTSTAI-ANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI------D 1792
              N S AI AN +T+  S K S   A+TSV++HG  + ++ GA K+S  R S       +
Sbjct: 1419 STNVSPAIAANGNTVSASAKGS--FAKTSVEIHGIDSKVDVGAAKASNTRVSAPKEDGPE 1476

Query: 1791 DSEKLRPLPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVE 1615
             S+ LRP  SR  HSP  D+S +ASKS+DKLQKRTSP EE  + SKRRKG+ E++D E E
Sbjct: 1477 TSDALRPHSSRLVHSPRHDNSASASKSSDKLQKRTSPAEETDRQSKRRKGETEMRDFEGE 1536

Query: 1614 ARFSDRERSVDPRLPHSDLERTGIDDQSLH--------RSKDKGNGRYDRDHRERLERQD 1459
            AR SDRERSVD RL   DL+++G DDQS++        RSKDKG+ R+D+D+RERL+R D
Sbjct: 1537 ARLSDRERSVDARL--LDLDKSGTDDQSVYKATDKPSDRSKDKGSERHDKDYRERLDRPD 1594

Query: 1458 KSTGEDILVEKSRDRSMERYGRERSVERVS------FXXXXXXXXXXXRGKPRYNETSGE 1297
            KS G+D L E+SRDRSMER+GRE SVE+V                   RGK RYN+ S E
Sbjct: 1595 KSRGDD-LGERSRDRSMERHGREHSVEKVQERGMDRSVDRLSDKSKDDRGKVRYNDISTE 1653

Query: 1296 K--YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXX 1123
            K   D+R+H QS        P+MVP SV++ RRDEDADRR   TRH QRLSP        
Sbjct: 1654 KSHVDERYHGQSLPPPPPLPPHMVPHSVSSGRRDEDADRRFGTTRHTQRLSPRHDEKERR 1713

Query: 1122 XXXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD------ERDKINLLKEEMDANAA 961
               ++ L++Q                   +GLS KV+      ER+K NLLKEE DA AA
Sbjct: 1714 RSEDNSLISQDDSKRRREDDFRDRKREDREGLSIKVEEREREREREKANLLKEETDAIAA 1773

Query: 960  SKRRKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPM 784
            SKRRKLKR+H  SGE GEYS + PP PPL++ + Q YDGRDRGDRKG  +QR GYLEEP 
Sbjct: 1774 SKRRKLKREHPPSGEPGEYSPVPPPPPPLSISLSQSYDGRDRGDRKGPPVQRAGYLEEPS 1833

Query: 783  PRMHGKETASKITRRDGD------QIYDREWEDEKRQRPEQKRRHRK 661
             R+HGKE ASK+TRRD D      ++Y  EWEDEKRQR EQKRRHRK
Sbjct: 1834 VRIHGKEAASKMTRRDPDPYPSCCRMY--EWEDEKRQRAEQKRRHRK 1878


>ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma cacao]
            gi|508709428|gb|EOY01325.1| THO complex subunit 2 isoform
            1 [Theobroma cacao]
          Length = 1853

 Score = 2407 bits (6238), Expect = 0.0
 Identities = 1289/1898 (67%), Positives = 1464/1898 (77%), Gaps = 44/1898 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  YITEE LRE K+GN++F  S+S+P+LRFLYELCWTMVRGELP QKCKA LD
Sbjct: 1    MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            +VEFTE+ S +ELGS FADIVTQMAQDLTM+ EYRTRLIK+AKWLVES++VPL+LF ER 
Sbjct: 61   AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKAPDLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + G+  
Sbjct: 121  EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFR-GSED 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+N S A + +IKSLIGHFDLDPNRVFDIVLEC+ELQ D + FL LIPIFPKSHASQIL
Sbjct: 180  STQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN P PFGLY+LTA+LVK +FIDLDSIY HLLPKD++ FE +++ S KR
Sbjct: 240  GFKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE +PELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHA ILFDRLSPLNPVAH+QIC+GLFRLIEKSIS AYDI+RQ HLQ FG PS
Sbjct: 360  GFLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIVRQTHLQNFGSPS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G D ++ + ++S   SF+DLPKELFQMLA+ GP+LY + +LLQKVCRVLRGYYLSALE
Sbjct: 420  GPGVDNMDTSTSAS--SSFIDLPKELFQMLATVGPHLYSDTLLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ S G  SN E+A  G ++PRLHLKEA+ RVEE LG CLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVASAGGVSNAETAAGGYQNPRLHLKEARSRVEETLGACLLPSLQLVPANPAVGQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER P +L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERNPTILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ+
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIHQMANVQF 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLR+QAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRFQATSFGVTRNNKALIKSTNRLRDSLLPKDEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  A+YAQLIP LDDLVH
Sbjct: 778  LAQHRSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPAAAYAQLIPSLDDLVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC   SDV WP DD E  +++            ++++LDL
Sbjct: 838  VYHLDPEVAFLIYRPVMRLFKCQGSSDVFWPLDDNETGNITMAYSESESKDDLSRVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P+KPI+WS+LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEEL DNS+SAI KRKKDKERIQE LDRLT EL KHE+NV SVRRRL+ EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARKSSWVT+EEFGMG ++                ++V N
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
            GS++N+SQ E  G RA   G+Q +D  N +K+Q  R K  DG+LER EN +L K+DL   
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDV-NLVKDQIPRTKS-DGRLERAENASLGKSDL--- 1312

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES------------AKGTAESEL 2131
            K K G+ ANG D+   +  A  Q+GT ++ E QK  DES            AK +AE E 
Sbjct: 1313 KTKGGTSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELES 1372

Query: 2130 RPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAG 1951
            + + +RS  AGSLTK  K Q+  K+D KSGK             +RD+ +H  +GR   G
Sbjct: 1373 KASAKRSAPAGSLTKTQK-QDPGKDDGKSGKAVGRTSVTCVI--DRDVPSH-TEGR--QG 1426

Query: 1950 GANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRP 1771
            G         ++ +P++V S+               N  S   K  +  + + D+   RP
Sbjct: 1427 G---------TTNVPSAVTSN--------------GNAVSAPPKGKDDGSELPDAS--RP 1461

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SR  HSP  D S T SKS+DKLQKRT+P EE  + +KRRKGD E+KD + E R SDRE
Sbjct: 1462 -SSRIVHSPRHDSSATVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRE 1520

Query: 1593 RSVDPRLPHSDLERTGID--------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDI 1438
            RS DP+L  +D ++ G D        D+ L RSKDKG+ R+DRD+RERLER +KS  +DI
Sbjct: 1521 RSTDPQL--ADFDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDI 1578

Query: 1437 LVEKSRDRSMERYGRERSVERVS-------FXXXXXXXXXXXRGKPRYNETSGEK--YDD 1285
            L EKSRDRS+ERYGRERSVER +                   R K RY +TS EK   DD
Sbjct: 1579 LTEKSRDRSIERYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDD 1638

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNAS-RRDEDADRRISNTRHIQRLSPXXXXXXXXXXXED 1108
            RFH QS        P+MVPQSVNA+ RRD+D DRR  +TRH QRLSP           E+
Sbjct: 1639 RFHGQSLPPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEEN 1698

Query: 1107 YLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD--------KINLLKEEMDANAASKR 952
             LV+Q                   +GLS KV+ERD        K +LLKE++DAN A KR
Sbjct: 1699 SLVSQDDGKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVA-KR 1757

Query: 951  RKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRM 775
            RKLKR+HL S E GEYS +APP PPLA+GM Q YDGRDR DRKG+M+QR GYLEEP  R+
Sbjct: 1758 RKLKREHLPS-EPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRI 1815

Query: 774  HGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            HGKE ASK+ RRD D +YDREW+DEKRQRPE KRRHRK
Sbjct: 1816 HGKEAASKMARRDTDPMYDREWDDEKRQRPEPKRRHRK 1853


>ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma cacao]
            gi|508709429|gb|EOY01326.1| THO complex subunit 2 isoform
            2 [Theobroma cacao]
          Length = 1844

 Score = 2403 bits (6227), Expect = 0.0
 Identities = 1289/1898 (67%), Positives = 1461/1898 (76%), Gaps = 44/1898 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  YITEE LRE K+GN++F  S+S+P+LRFLYELCWTMVRGELP QKCKA LD
Sbjct: 1    MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            +VEFTE+ S +ELGS FADIVTQMAQDLTM+ EYRTRLIK+AKWLVES++VPL+LF ER 
Sbjct: 61   AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKAPDLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + G+  
Sbjct: 121  EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFR-GSED 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+N S A + +IKSLIGHFDLDPNRVFDIVLEC+ELQ D + FL LIPIFPKSHASQIL
Sbjct: 180  STQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN P PFGLY+LTA+LVK +FIDLDSIY HLLPKD++ FE +++ S KR
Sbjct: 240  GFKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE +PELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHA ILFDRLSPLNPVAH+QIC+GLFRLIEKSIS AYDI+RQ HLQ FG PS
Sbjct: 360  GFLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIVRQTHLQNFGSPS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G D ++ + ++S   SF+DLPKELFQMLA+ GP+LY + +LLQKVCRVLRGYYLSALE
Sbjct: 420  GPGVDNMDTSTSAS--SSFIDLPKELFQMLATVGPHLYSDTLLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ S G  SN E+A  G ++PRLHLKEA+ RVEE LG CLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVASAGGVSNAETAAGGYQNPRLHLKEARSRVEETLGACLLPSLQLVPANPAVGQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER P +L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERNPTILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ+
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIHQMANVQF 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLR+QAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRFQATSFGVTRNNKALIKSTNRLRDSLLPKDEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  A+YAQLIP LDDLVH
Sbjct: 778  LAQHRSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPAAAYAQLIPSLDDLVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC   SDV WP DD E  +++            ++++LDL
Sbjct: 838  VYHLDPEVAFLIYRPVMRLFKCQGSSDVFWPLDDNETGNITMAYSESESKDDLSRVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P+KPI+WS+LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEEL DNS+SAI KRKKDKERIQE LDRLT EL KHE+NV SVRRRL+ EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARKSSWVT+EEFGMG ++                ++V N
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
            GS++N+SQ E  G RA   G+Q +D  N +K+Q  R K  DG+LER EN +L K+DL   
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDV-NLVKDQIPRTKS-DGRLERAENASLGKSDL--- 1312

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES------------AKGTAESEL 2131
            K K G+ ANG D+   +  A  Q+GT ++ E QK  DES            AK +AE E 
Sbjct: 1313 KTKGGTSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELES 1372

Query: 2130 RPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAG 1951
            + + +RS  AGSLTK  K Q+  K+D KSGK             +RD+ +H  +GR   G
Sbjct: 1373 KASAKRSAPAGSLTKTQK-QDPGKDDGKSGKAVGRTSVTCVI--DRDVPSH-TEGR--QG 1426

Query: 1950 GANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRP 1771
            G         ++ +P++V S                   +G    SEL       +  RP
Sbjct: 1427 G---------TTNVPSAVTS-------------------NGKDDGSEL------PDASRP 1452

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SR  HSP  D S T SKS+DKLQKRT+P EE  + +KRRKGD E+KD + E R SDRE
Sbjct: 1453 -SSRIVHSPRHDSSATVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRE 1511

Query: 1593 RSVDPRLPHSDLERTGID--------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDI 1438
            RS DP+L  +D ++ G D        D+ L RSKDKG+ R+DRD+RERLER +KS  +DI
Sbjct: 1512 RSTDPQL--ADFDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDI 1569

Query: 1437 LVEKSRDRSMERYGRERSVERVS-------FXXXXXXXXXXXRGKPRYNETSGEK--YDD 1285
            L EKSRDRS+ERYGRERSVER +                   R K RY +TS EK   DD
Sbjct: 1570 LTEKSRDRSIERYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDD 1629

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNAS-RRDEDADRRISNTRHIQRLSPXXXXXXXXXXXED 1108
            RFH QS        P+MVPQSVNA+ RRD+D DRR  +TRH QRLSP           E+
Sbjct: 1630 RFHGQSLPPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEEN 1689

Query: 1107 YLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD--------KINLLKEEMDANAASKR 952
             LV+Q                   +GLS KV+ERD        K +LLKE++DAN A KR
Sbjct: 1690 SLVSQDDGKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVA-KR 1748

Query: 951  RKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRM 775
            RKLKR+HL S E GEYS +APP PPLA+GM Q YDGRDR DRKG+M+QR GYLEEP  R+
Sbjct: 1749 RKLKREHLPS-EPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRI 1806

Query: 774  HGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            HGKE ASK+ RRD D +YDREW+DEKRQRPE KRRHRK
Sbjct: 1807 HGKEAASKMARRDTDPMYDREWDDEKRQRPEPKRRHRK 1844


>ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma cacao]
            gi|508709431|gb|EOY01328.1| THO complex subunit 2 isoform
            4 [Theobroma cacao]
          Length = 1831

 Score = 2383 bits (6175), Expect = 0.0
 Identities = 1276/1898 (67%), Positives = 1445/1898 (76%), Gaps = 44/1898 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  YITEE LRE K+GN++F  S+S+P+LRFLYELCWTMVRGELP QKCKA LD
Sbjct: 1    MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            +VEFTE+ S +ELGS FADIVTQMAQDLTM+ EYRTRLIK+AKWLVES++VPL+LF ER 
Sbjct: 61   AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKAPDLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + G+  
Sbjct: 121  EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFR-GSED 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+N S A + +IKSLIGHFDLDPNRVFDIVLEC+ELQ D + FL LIPIFPKSHASQIL
Sbjct: 180  STQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN P PFGLY+LTA+LVK +FIDLDSIY HLLPKD++ FE +++ S KR
Sbjct: 240  GFKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE +PELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHA ILFDRLSPLNPVAH+QIC+GLFRLIEKSIS AYDI+RQ HLQ FG PS
Sbjct: 360  GFLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIVRQTHLQNFGSPS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G D ++ + ++S   SF+DLPKELFQMLA+ GP+LY + +LLQKVCRVLRGYYLSALE
Sbjct: 420  GPGVDNMDTSTSAS--SSFIDLPKELFQMLATVGPHLYSDTLLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ S G  SN E+A  G ++PRLHLKEA+ RVEE LG CLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVASAGGVSNAETAAGGYQNPRLHLKEARSRVEETLGACLLPSLQLVPANPAVGQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER P +L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERNPTILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ+
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIHQMANVQF 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLR+QAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRFQATSFGVTRNNKALIKSTNRLRDSLLPKDEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  A+YAQLIP LDDLVH
Sbjct: 778  LAQHRSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPAAAYAQLIPSLDDLVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC   SDV WP DD E  +++            ++++LDL
Sbjct: 838  VYHLDPEVAFLIYRPVMRLFKCQGSSDVFWPLDDNETGNITMAYSESESKDDLSRVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P+KPI+WS+LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEEL DNS+SAI KRKKDKERIQE LDRLT EL KHE+NV SVRRRL+ EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARKSSWVT+EEFGMG ++                ++V N
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
            GS++N+SQ E  G RA   G+Q +D  N +K+Q  R K  DG+LER EN +L K+DL   
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDV-NLVKDQIPRTKS-DGRLERAENASLGKSDL--- 1312

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES------------AKGTAESEL 2131
            K K G+ ANG D+   +  A  Q+GT ++ E QK  DES            AK +AE E 
Sbjct: 1313 KTKGGTSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELES 1372

Query: 2130 RPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAG 1951
            + + +RS  AGSLTK  K Q+  K+D KSGK             +RD+ +H    +    
Sbjct: 1373 KASAKRSAPAGSLTKTQK-QDPGKDDGKSGKAVGRTSVTCVI--DRDVPSHTEGRQGKDD 1429

Query: 1950 GANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRP 1771
            G+    A   SS +  S +            H S A +                      
Sbjct: 1430 GSELPDASRPSSRIVHSPR------------HDSSATV---------------------- 1455

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
                             SKS+DKLQKRT+P EE  + +KRRKGD E+KD + E R SDRE
Sbjct: 1456 -----------------SKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRE 1498

Query: 1593 RSVDPRLPHSDLERTGID--------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDI 1438
            RS DP+L  +D ++ G D        D+ L RSKDKG+ R+DRD+RERLER +KS  +DI
Sbjct: 1499 RSTDPQL--ADFDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDI 1556

Query: 1437 LVEKSRDRSMERYGRERSVERVS-------FXXXXXXXXXXXRGKPRYNETSGEK--YDD 1285
            L EKSRDRS+ERYGRERSVER +                   R K RY +TS EK   DD
Sbjct: 1557 LTEKSRDRSIERYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDD 1616

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNAS-RRDEDADRRISNTRHIQRLSPXXXXXXXXXXXED 1108
            RFH QS        P+MVPQSVNA+ RRD+D DRR  +TRH QRLSP           E+
Sbjct: 1617 RFHGQSLPPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEEN 1676

Query: 1107 YLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD--------KINLLKEEMDANAASKR 952
             LV+Q                   +GLS KV+ERD        K +LLKE++DAN A KR
Sbjct: 1677 SLVSQDDGKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVA-KR 1735

Query: 951  RKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRM 775
            RKLKR+HL S E GEYS +APP PPLA+GM Q YDGRDR DRKG+M+QR GYLEEP  R+
Sbjct: 1736 RKLKREHLPS-EPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRI 1793

Query: 774  HGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            HGKE ASK+ RRD D +YDREW+DEKRQRPE KRRHRK
Sbjct: 1794 HGKEAASKMARRDTDPMYDREWDDEKRQRPEPKRRHRK 1831


>ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma cacao]
            gi|508709432|gb|EOY01329.1| THO complex subunit 2 isoform
            5 [Theobroma cacao]
          Length = 1824

 Score = 2360 bits (6116), Expect = 0.0
 Identities = 1270/1875 (67%), Positives = 1440/1875 (76%), Gaps = 44/1875 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  YITEE LRE K+GN++F  S+S+P+LRFLYELCWTMVRGELP QKCKA LD
Sbjct: 1    MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            +VEFTE+ S +ELGS FADIVTQMAQDLTM+ EYRTRLIK+AKWLVES++VPL+LF ER 
Sbjct: 61   AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKAPDLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + G+  
Sbjct: 121  EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFR-GSED 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+N S A + +IKSLIGHFDLDPNRVFDIVLEC+ELQ D + FL LIPIFPKSHASQIL
Sbjct: 180  STQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN P PFGLY+LTA+LVK +FIDLDSIY HLLPKD++ FE +++ S KR
Sbjct: 240  GFKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE +PELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHA ILFDRLSPLNPVAH+QIC+GLFRLIEKSIS AYDI+RQ HLQ FG PS
Sbjct: 360  GFLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIVRQTHLQNFGSPS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G D ++ + ++S   SF+DLPKELFQMLA+ GP+LY + +LLQKVCRVLRGYYLSALE
Sbjct: 420  GPGVDNMDTSTSAS--SSFIDLPKELFQMLATVGPHLYSDTLLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ S G  SN E+A  G ++PRLHLKEA+ RVEE LG CLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVASAGGVSNAETAAGGYQNPRLHLKEARSRVEETLGACLLPSLQLVPANPAVGQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER P +L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERNPTILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ+
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIHQMANVQF 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLR+QAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRFQATSFGVTRNNKALIKSTNRLRDSLLPKDEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  A+YAQLIP LDDLVH
Sbjct: 778  LAQHRSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPAAAYAQLIPSLDDLVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC   SDV WP DD E  +++            ++++LDL
Sbjct: 838  VYHLDPEVAFLIYRPVMRLFKCQGSSDVFWPLDDNETGNITMAYSESESKDDLSRVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P+KPI+WS+LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEEL DNS+SAI KRKKDKERIQE LDRLT EL KHE+NV SVRRRL+ EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARKSSWVT+EEFGMG ++                ++V N
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
            GS++N+SQ E  G RA   G+Q +D  N +K+Q  R K  DG+LER EN +L K+DL   
Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDV-NLVKDQIPRTKS-DGRLERAENASLGKSDL--- 1312

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES------------AKGTAESEL 2131
            K K G+ ANG D+   +  A  Q+GT ++ E QK  DES            AK +AE E 
Sbjct: 1313 KTKGGTSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELES 1372

Query: 2130 RPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAG 1951
            + + +RS  AGSLTK  K Q+  K+D KSGK             +RD+ +H  +GR   G
Sbjct: 1373 KASAKRSAPAGSLTKTQK-QDPGKDDGKSGKAVGRTSVTCVI--DRDVPSH-TEGR--QG 1426

Query: 1950 GANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRP 1771
            G         ++ +P++V S                   +G    SEL       +  RP
Sbjct: 1427 G---------TTNVPSAVTS-------------------NGKDDGSEL------PDASRP 1452

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SR  HSP  D S T SKS+DKLQKRT+P EE  + +KRRKGD E+KD + E R SDRE
Sbjct: 1453 -SSRIVHSPRHDSSATVSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDGEVRLSDRE 1511

Query: 1593 RSVDPRLPHSDLERTGID--------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDI 1438
            RS DP+L  +D ++ G D        D+ L RSKDKG+ R+DRD+RERLER +KS  +DI
Sbjct: 1512 RSTDPQL--ADFDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDI 1569

Query: 1437 LVEKSRDRSMERYGRERSVERVS-------FXXXXXXXXXXXRGKPRYNETSGEK--YDD 1285
            L EKSRDRS+ERYGRERSVER +                   R K RY +TS EK   DD
Sbjct: 1570 LTEKSRDRSIERYGRERSVERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDD 1629

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNAS-RRDEDADRRISNTRHIQRLSPXXXXXXXXXXXED 1108
            RFH QS        P+MVPQSVNA+ RRD+D DRR  +TRH QRLSP           E+
Sbjct: 1630 RFHGQSLPPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEEN 1689

Query: 1107 YLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD--------KINLLKEEMDANAASKR 952
             LV+Q                   +GLS KV+ERD        K +LLKE++DAN A KR
Sbjct: 1690 SLVSQDDGKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVA-KR 1748

Query: 951  RKLKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRM 775
            RKLKR+HL S E GEYS +APP PPLA+GM Q YDGRDR DRKG+M+QR GYLEEP  R+
Sbjct: 1749 RKLKREHLPS-EPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRI 1806

Query: 774  HGKETASKITRRDGD 730
            HGKE ASK+ RRD D
Sbjct: 1807 HGKEAASKMARRDTD 1821


>ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus]
            gi|449506883|ref|XP_004162874.1| PREDICTED: THO complex
            subunit 2-like [Cucumis sativus]
          Length = 1887

 Score = 2358 bits (6112), Expect = 0.0
 Identities = 1235/1896 (65%), Positives = 1451/1896 (76%), Gaps = 42/1896 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            M+LPP+E  Y+ E  +REWK+GN+ F+    +PV+RFLYELCWTMVRG+LP QKCKAALD
Sbjct: 1    MALPPVECMYVVESNIREWKSGNSSFRVPQPVPVVRFLYELCWTMVRGDLPFQKCKAALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF+EK S EELGS FAD++TQ+AQD+T++ EYR RL+K+AKWLVESA VPL+LFQERC
Sbjct: 61   SVEFSEKMSAEELGSTFADVITQLAQDITLAGEYRARLLKLAKWLVESAFVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKA +LK+KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+  T A
Sbjct: 121  EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRV-TDA 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
              ++  G+T+ IIKSLIGHFDLDPNRVFDIVLECFELQ +N+VF++LIPIFPKSHASQIL
Sbjct: 180  SNKSFPGSTIGIIKSLIGHFDLDPNRVFDIVLECFELQPENSVFVELIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQR+EVN PVPFGLY+LTA+LVK  FIDLDSIYAHLLPK+++AFEHY + S+KR
Sbjct: 240  GFKFQYYQRIEVNSPVPFGLYKLTALLVKEKFIDLDSIYAHLLPKEDEAFEHYGSFSSKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEA++IGKINLAATGKDLMDDEKQGDV+IDL+AAIDMESEAV E SPELENNQ LGLL+
Sbjct: 300  LDEASRIGKINLAATGKDLMDDEKQGDVSIDLFAAIDMESEAVNERSPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSV DWYHAH+LFDRLSPLNPV  + IC  LFRLIE+SIS AY I+RQ   Q  G  +
Sbjct: 360  GFLSVGDWYHAHVLFDRLSPLNPVELLPICNSLFRLIEESISSAYSIVRQNPHQSLGASA 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            GS  DA+E T+   +  SF+ LP+ELFQMLA+AGPYLYR+ +LLQKVCRVLRGYY SA+E
Sbjct: 420  GSSIDAIETTNLP-VGGSFIGLPRELFQMLATAGPYLYRDTILLQKVCRVLRGYYTSAIE 478

Query: 4782 LIGSGGRASNPESA-DSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWE 4606
             + S     NPE    +G+R P LHLKEA+LR+EEALGTCLLPSLQLIPANPAVGQ IWE
Sbjct: 479  FVNSVESGQNPELVMPAGNRVPHLHLKEARLRIEEALGTCLLPSLQLIPANPAVGQGIWE 538

Query: 4605 VMSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKL 4426
            VM+LLPYE RYRLYGEWE+DDE+IPMVL ARQTAKLDTRRILKRLAKENLKQLGRMVAKL
Sbjct: 539  VMNLLPYEVRYRLYGEWERDDEKIPMVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKL 598

Query: 4425 AHANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDD 4246
            AHANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDD
Sbjct: 599  AHANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDD 658

Query: 4245 GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQ 4066
            GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ
Sbjct: 659  GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELVQQMANVQ 718

Query: 4065 YTENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXX 3886
            YTEN+TEEQLD+MAGSETLRYQAT FG+TRNNKAL KS+NRLRDS               
Sbjct: 719  YTENLTEEQLDSMAGSETLRYQATSFGVTRNNKALIKSSNRLRDSLLPKDEPKLAVPLLL 778

Query: 3885 XXAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLV 3706
              AQHRS+VVIN++AP+IKMV EQFDRCHGTLLQYVEFL +A+T  ++YAQLIP L++L 
Sbjct: 779  LIAQHRSLVVINANAPYIKMVSEQFDRCHGTLLQYVEFLTTAVTPASAYAQLIPSLNELA 838

Query: 3705 HKYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXSAKLVLDL 3526
            H YHLDPEVAFL+YRP+MRLYKC  GSD+ WP D  +   + N         SA +VLDL
Sbjct: 839  HLYHLDPEVAFLIYRPIMRLYKCQGGSDIFWPLDGNDANVIGNSSDLEPAECSADVVLDL 898

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GS QKP+ WSDLLDTV++MLP KAWNSLSPDLY TFWGLTLYDLYVPRSRYESEI+KQH+
Sbjct: 899  GSLQKPVRWSDLLDTVKSMLPPKAWNSLSPDLYTTFWGLTLYDLYVPRSRYESEIAKQHA 958

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEELSDNS+SAI KRKKDKERIQE LDRL+ EL KHE+NV SVRRRLSREKD WLS
Sbjct: 959  ALKALEELSDNSSSAINKRKKDKERIQESLDRLSNELVKHEENVASVRRRLSREKDKWLS 1018

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1019 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1078

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1079 QPMICCCTEYEAGRLGRFLYETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1138

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKIS+VFPVTRK+GINLEKRVAK
Sbjct: 1139 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISNVFPVTRKSGINLEKRVAK 1198

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD-XXXXXXXXXXXXAIAVPNGS 2449
            IK DEREDLK           ARK SWVT+EEFGMG ++              +A    +
Sbjct: 1199 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKTPSLASKPSASNLASSQNN 1258

Query: 2448 ALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTKL 2269
            ++ +SQ E VG +        +D GN  K+ + R +  D + ++ +  ++ K++LG  K 
Sbjct: 1259 SIFVSQNEPVGGKTSALPIPNSDSGNMAKDHSLRSRTSDVRTDKIDGLSVPKSELGHGKQ 1318

Query: 2268 KSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES------------AKGTAESELRP 2125
            K  SL NG DSQ ++ + +V SG+ +  + QK  D+S            +K ++ESELR 
Sbjct: 1319 KGMSL-NGPDSQPLVPSTSVHSGSLKMVDSQKPGDDSTRTLDEGSSKVVSKTSSESELRG 1377

Query: 2124 TIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGA 1945
            + +RS    SL K PK Q++ K++ +SGK           ++ER+L  H  DG    G +
Sbjct: 1378 STKRSGPVTSLNKAPK-QDITKDEIRSGK--AASKNPGSSTSERELPVHATDGGRHGGPS 1434

Query: 1944 NTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI---DDSEKL- 1777
            N+ + ++N +T  +  K S+ + + S D H   +  ESG  ++S+ R S    D  E L 
Sbjct: 1435 NSPSIMSNGNTQNSLTKGSSLTVKAS-DGHTIESKAESGVGRTSDGRVSSVKDDGPEALD 1493

Query: 1776 --RPLPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARF 1606
              R   SR  HSP  D+S + S+S+DKLQKR SP EE  +  KRRKGD E++D + + R 
Sbjct: 1494 VSRSSSSRLGHSPRHDNSASGSRSSDKLQKRASPAEEPDRQGKRRKGDGEIRDVDGDFRI 1553

Query: 1605 SDRERSVDPRLPHSDLERTGIDDQS--------LHRSKDKGNGRYDRDHRERLERQDKST 1450
            SD++RS+DPR    D ++ G+++QS        L R+KDK N RYDRD+R+R ER +KS 
Sbjct: 1554 SDKDRSMDPR--SIDADKIGMEEQSGYRGLDKPLDRTKDKVNERYDRDYRDRAERPEKSR 1611

Query: 1449 GEDILVEKSRDRSMERYGRERSVERVSF------XXXXXXXXXXXRGKPRYNETSGEK-- 1294
            G+D  VE++RDRS+ERYGRERSVE+V                   R K RY++++ +K  
Sbjct: 1612 GDDPQVERTRDRSIERYGRERSVEKVERVSDRYPEKSKDERNKDDRSKLRYSDSTVDKSH 1671

Query: 1293 YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXX 1114
             DDRFH QS        P++VPQSVN+ RR+EDADRR    RH QRLSP           
Sbjct: 1672 TDDRFHGQSLPPPPPLPPHLVPQSVNSGRREEDADRRFGTARHAQRLSPRHEEKERRRSE 1731

Query: 1113 EDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD----ERDKINLLKEEMDANAASKRRK 946
            E+ +                       G+S KVD    ER+K NLLKE+MDA+AASKRRK
Sbjct: 1732 ENLISQDDAKRRREEEFRERKREERDVGMSLKVDDREREREKANLLKEDMDASAASKRRK 1791

Query: 945  LKRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHG 769
            LKR+HLS  EAGEYS + PP PP+  G+ Q YDGR+RGDRKG M+QRPGYL++P  R+HG
Sbjct: 1792 LKREHLSLVEAGEYSPVGPPPPPMGGGVSQSYDGRERGDRKGVMMQRPGYLDDPGLRIHG 1851

Query: 768  KETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            KE  +K+TRR+ D +Y+REW+DEKR R +QKRRHRK
Sbjct: 1852 KEVVNKMTRREADLMYEREWDDEKRMRADQKRRHRK 1887


>ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isoform X1 [Glycine max]
          Length = 1870

 Score = 2357 bits (6108), Expect = 0.0
 Identities = 1251/1889 (66%), Positives = 1431/1889 (75%), Gaps = 35/1889 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  Y+TEE +REW++GN   K S  +P+LRFLYELCWTMVRGELP QKCK ALD
Sbjct: 1    MSLPPIECAYVTEECIREWRSGNPALKVSQPVPMLRFLYELCWTMVRGELPFQKCKVALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SV F++K+SNE++ S F+DIVTQMAQD TMS E+R+RLIK+A+WLVES +VP++L QERC
Sbjct: 61   SVIFSDKASNEKIASNFSDIVTQMAQDHTMSGEFRSRLIKLARWLVESEMVPVRLLQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFL E E+IKIKA +LK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+    A
Sbjct: 121  EEEFLGEVELIKIKAQELKVKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRDSE-A 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+  S AT+ IIKSLIGHFDLDPNRVFDIVLECFELQ D++VF++LIPIFPKSHASQIL
Sbjct: 180  PTQKSSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDDDVFIELIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN PVPFGLYRLTA+LVK DFIDLDSIYAHLLP+D++AFEHY+  S+KR
Sbjct: 240  GFKFQYYQRMEVNGPVPFGLYRLTALLVKQDFIDLDSIYAHLLPRDDEAFEHYNTFSSKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIG+INLAA GKDLMDDEKQGDVTIDL+AAIDME++AV E + EL+++Q LGLL+
Sbjct: 300  LDEANKIGRINLAAIGKDLMDDEKQGDVTIDLFAAIDMETDAVEERTTELQSSQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHAH+LF+RLSPLN V HIQIC+ LFRLI+KSIS AYD+IRQ HLQ  G  +
Sbjct: 360  GFLSVDDWYHAHLLFERLSPLNAVEHIQICDSLFRLIKKSISSAYDVIRQTHLQNPGLST 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G  +D ++V D SS   SF+DLPKELFQMLA  GPYLYR+ VLLQKVCRVLRGYYLSALE
Sbjct: 420  GGSTDVMDV-DNSSGFSSFIDLPKELFQMLACTGPYLYRDTVLLQKVCRVLRGYYLSALE 478

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+  G    NP+    G+    LHLKEA+LRVE+ALG CLLPSLQLIPANPAVGQEIWE+
Sbjct: 479  LVSHGNGVLNPQLQVPGNL--HLHLKEARLRVEDALGACLLPSLQLIPANPAVGQEIWEL 536

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEKDDERIPM+L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 537  MSLLPYEVRYRLYGEWEKDDERIPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 596

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERL  GGR+KLKDDG
Sbjct: 597  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLVLGGRDKLKDDG 656

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 657  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIQQMANVQY 716

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KST+RLRD+                
Sbjct: 717  TENLTEEQLDAMAGSETLRYQATSFGVTRNNKALIKSTSRLRDALLPNDEPKLAIPLLQL 776

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQH S+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  ++YA L+P L+DLVH
Sbjct: 777  IAQHCSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPASNYAILVPSLNDLVH 836

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIED-THMSNIXXXXXXXXSAKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+K     DVCWP  D +  +  S          SA +VL+L
Sbjct: 837  LYHLDPEVAFLIYRPVMRLFKSPRNPDVCWPLGDKDAASDASTNFESDPLDHSASMVLNL 896

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GS Q PI WS LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVP++RYESEI+K H+
Sbjct: 897  GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
             LK+LEELSDNS+SAI KRKK+KERIQE LDRL  EL KHE+NV SVRRRLS EKD WLS
Sbjct: 957  NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNG 2452
            IK DEREDLK           ARK SWVT+EEFGMG ++              +  V +G
Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPSMTKSSAGNSATVQSG 1256

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
              LN+SQ E+V      SG  + D GN +K+Q  R K VDGK ER E+ T+ K+D G  K
Sbjct: 1257 INLNVSQTESV------SGKHV-DSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIK 1309

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESAK------GTAESELRPTIRRS 2110
            LKS S+ NGLD+QS M  ++VQSG  ++ E  K  +ES        GT  +ELR + +RS
Sbjct: 1310 LKSSSMVNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHGTRSTELRTSAKRS 1369

Query: 2109 VAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTSTA 1930
            V A SL K P +Q+  KED +SGK           S+++DL  H  +GR +     T+  
Sbjct: 1370 VPASSLAK-PSKQDPVKEDGRSGK--PVARTSGSLSSDKDLQTHALEGRHT---GTTNVP 1423

Query: 1929 IANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI--DDSEKLRPLP--- 1765
             +N +T+  S K S    + S+D  G+ +  E G  KSS++RAS+  DD   +   P   
Sbjct: 1424 SSNGNTISGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGS 1483

Query: 1764 -SRPTHSPLPDDSFTASKSTDKLQKR-TSPEEHGKPSKRRKGDNEVKDGEVEARFSDRER 1591
             SR  HSP  +++   SKS D++QKR +S EE  +  KRRKGD E++D E E RFS+RE+
Sbjct: 1484 SSRIVHSPRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFETELRFSEREK 1543

Query: 1590 SVDPRLPHSDL--ERTGI---DDQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEK 1426
             +DPR     L  E  G+    D+ L R+KDKGN RY+RDHRER++R DKS G+D + EK
Sbjct: 1544 MMDPRFADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEK 1603

Query: 1425 SRDRSMERYGRERSVERV-------SF----XXXXXXXXXXXRGKPRYNETSGEKY--DD 1285
             RDRS+ERYGRERSVER+       SF               R K RYN+ S EK   DD
Sbjct: 1604 PRDRSIERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASAEKSHGDD 1663

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDY 1105
            RFH QS        PN+VPQSV A RRDED DRR   TRH QRLSP            + 
Sbjct: 1664 RFHGQSLPPPPPLPPNVVPQSVGAGRRDEDVDRRYGATRHSQRLSP--RHEEKERRWSEE 1721

Query: 1104 LVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERDKINLLKEEMDANAASKRRKLKRDHLS 925
             V                       +  +  ER+K N+LKEE+D NAASKRRKLKR+HL 
Sbjct: 1722 TVVSQDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLP 1781

Query: 924  SGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKI 748
            + E GEYS +A P      GM   YDGRDRGDRKG ++Q P Y++E   R+HGKE ASK+
Sbjct: 1782 TDEPGEYSAVAHPPSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKL 1841

Query: 747  TRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
             RRD D +YDREWEDEKRQR +QKRRHRK
Sbjct: 1842 NRRDSDPLYDREWEDEKRQRADQKRRHRK 1870


>ref|XP_007160466.1| hypothetical protein PHAVU_002G324500g [Phaseolus vulgaris]
            gi|561033881|gb|ESW32460.1| hypothetical protein
            PHAVU_002G324500g [Phaseolus vulgaris]
          Length = 1864

 Score = 2332 bits (6044), Expect = 0.0
 Identities = 1239/1890 (65%), Positives = 1431/1890 (75%), Gaps = 36/1890 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  Y+TE+ +REW+  N   K S ++P+LRFLYELCWTMVRGELP  KCK ALD
Sbjct: 1    MSLPPIECLYVTEDCVREWRTANPALKVSQAVPMLRFLYELCWTMVRGELPFPKCKVALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SV F+E++SN+++ S FADIVTQMAQD TM  + R+RLIK+A+WLVES +VP++L QERC
Sbjct: 61   SVIFSEQASNDQIASNFADIVTQMAQDHTMPGDSRSRLIKLARWLVESEMVPVRLLQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFL E+E+IKIKA +LK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+    A
Sbjct: 121  EEEFLGEAELIKIKAQELKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRDSE-A 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+  S AT+ IIKSLIGHFDLDPNRVFDIVLECFELQ D++VF++LIPIFPKSHASQIL
Sbjct: 180  PTQKSSTATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDDDVFIELIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEV   VPFGLYRLTA+LVK DFIDLDSIYAHLLP+D++AFEHY+  S+KR
Sbjct: 240  GFKFQYYQRMEVGSSVPFGLYRLTALLVKQDFIDLDSIYAHLLPRDDEAFEHYNTFSSKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIG+INLAATGKDLMDDEKQGDV+IDL+AA+D+E+EA  E + EL+++Q LGLL+
Sbjct: 300  LDEANKIGRINLAATGKDLMDDEKQGDVSIDLFAALDLETEATEERTTELQSSQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHAH+LF+RLS LN V HIQIC+ LFRLI+KSIS AYD++R  HLQ  G  S
Sbjct: 360  GFLSVDDWYHAHLLFERLSALNAVEHIQICDSLFRLIQKSISSAYDVVRLSHLQNPGSSS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G+D ++V D SS   SF+DLPKELFQML+  GPYLYR+ VLLQKVCRVLRGYYLSALE
Sbjct: 420  G-GADVMDV-DNSSGCNSFIDLPKELFQMLSCTGPYLYRDTVLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+  G  A NP+    G+  P LHLKEAKLRVE+ALG C+LPSLQLIPANPAVGQEIWE+
Sbjct: 478  LVSRGNGALNPQLHVPGN--PNLHLKEAKLRVEDALGACVLPSLQLIPANPAVGQEIWEL 535

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEKD+ERIPM+L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 536  MSLLPYEVRYRLYGEWEKDEERIPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 595

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HA+P+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLA GGR+KLKDDG
Sbjct: 596  HASPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLALGGRDKLKDDG 655

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 656  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIQQMANVQY 715

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGS+TLRYQAT FG+TRNNKAL KST+RLRD+                
Sbjct: 716  TENLTEEQLDAMAGSDTLRYQATSFGVTRNNKALIKSTSRLRDALLPKDEPKLAIPLLLL 775

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+ V+N+DAP+IKMV EQFDRCHGTLLQYVEFL SA+T +++Y  LIP L+DLVH
Sbjct: 776  LAQHRSLAVVNADAPYIKMVSEQFDRCHGTLLQYVEFLGSAITPSSNYGILIPSLNDLVH 835

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIED-THMSNIXXXXXXXXSAKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+K     DVCWP DD    +  S+         S  +VL+ 
Sbjct: 836  LYHLDPEVAFLIYRPVMRLFKSQRNPDVCWPLDDKHAASDGSSNFESDPLGDSGSMVLNF 895

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GS Q PI WS LLDTV+TMLP KAWNSLSPDLYATFWGLTLYDLYVP++RYESEI+K H+
Sbjct: 896  GSAQNPISWSYLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 955

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
             LK+LEELSDNS+SAITKRKK+KERIQE LDRL  EL KHE+NV SV  RLSREKD WLS
Sbjct: 956  NLKSLEELSDNSSSAITKRKKEKERIQESLDRLISELHKHEENVASVHSRLSREKDRWLS 1015

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1016 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1075

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1076 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1135

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES+EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1136 GQFIKVHWKWSQRITRLLIQCLESSEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1195

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNG 2452
            IK DEREDLK           ARK SWVT+EEFGMG ++                 V +G
Sbjct: 1196 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSGTKSSAGNPSTVHSG 1255

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
              LN+SQ E+       SG  + D GN +K+Q  R K  DGK ERTE+ T  K+D G TK
Sbjct: 1256 MNLNVSQTESA------SGKHV-DSGNTVKDQVIRTKTTDGKSERTESMTATKSDSGHTK 1308

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDE------SAKGTAESELRPTIRRS 2110
            +K+G++ NG D Q+   ++++QSG S++ E  K  +E         GT  +E R + +RS
Sbjct: 1309 VKTGAMVNGFDGQTSSISSSIQSGMSKSMENSKQVEELINRASDDHGTRTAESRASAKRS 1368

Query: 2109 VAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGAN-TST 1933
            V  GSL+K P +Q+  KED++SGK              R   +  +D    +G  N TS+
Sbjct: 1369 VPTGSLSK-PSKQDPLKEDSRSGKPV-----------ARTSGSLSSDKDLHSGTTNVTSS 1416

Query: 1932 AIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI------DDSEKLRP 1771
              AN +T+  S K S A  R S+D  G+ +  E G  KSS++RAS+      D ++  R 
Sbjct: 1417 VSANGNTITGSTKGSNAPVRISLDGPGNESKAEVGVSKSSDIRASVVKDDGNDTADLTRG 1476

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKR-TSPEEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
              SR  HSP  +++  ASKS +K+QKR +S EE  +  KRRKGD E++D E E RFSDR+
Sbjct: 1477 SSSRVVHSPRHENTGVASKSNEKVQKRASSAEEPDRLGKRRKGDVELRDFESEVRFSDRD 1536

Query: 1593 RSVDPRLPHSDL--ERTGI---DDQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVE 1429
            + +DPR     L  E  G+    D+SL R KDKGN RY+RDHRERL+R DKS G+D + E
Sbjct: 1537 KLMDPRFADDKLGPEEHGLYRAGDKSLERPKDKGNERYERDHRERLDRVDKSRGDDSVAE 1596

Query: 1428 KSRDRSMERYGRERSVERV-------SF----XXXXXXXXXXXRGKPRYNETSGEK--YD 1288
            K RDRS+ERYGRERSVER+       SF               R K RY++ S EK   D
Sbjct: 1597 KPRDRSIERYGRERSVERMQERGSERSFNRPPEKAKDERSKDDRNKLRYSDASVEKSHAD 1656

Query: 1287 DRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXED 1108
            DRFH QS        PNMVPQSV A RRDEDADRR   TRH QRLSP            +
Sbjct: 1657 DRFHGQSLPPPPPLPPNMVPQSVGAGRRDEDADRRYGATRHSQRLSP--RHEEKERRRSE 1714

Query: 1107 YLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERDKINLLKEEMDANAASKRRKLKRDHL 928
              V                       +  +  ER+K N+LKE++D NAASKRRKLKR+HL
Sbjct: 1715 ETVVSQDDAKRRKEDDFRERKREEIKVEEREREREKANVLKEDLDLNAASKRRKLKREHL 1774

Query: 927  SSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASK 751
            S+GE GEYS +APP PP  +GM   YDGRDRGDRKG +IQ P Y++EP  R+HGKE ASK
Sbjct: 1775 STGEPGEYSPVAPPPPPTGIGMPLGYDGRDRGDRKGPVIQHPNYIDEPNIRIHGKEVASK 1834

Query: 750  ITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            + RRD D +YDREW+DEKRQR +QKRRHRK
Sbjct: 1835 LNRRDSDPLYDREWDDEKRQRADQKRRHRK 1864


>ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isoform X2 [Glycine max]
          Length = 1845

 Score = 2328 bits (6032), Expect = 0.0
 Identities = 1234/1876 (65%), Positives = 1416/1876 (75%), Gaps = 22/1876 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  Y+TEE +REW++GN   K S  +P+LRFLYELCWTMVRGELP QKCK ALD
Sbjct: 1    MSLPPIECAYVTEECIREWRSGNPALKVSQPVPMLRFLYELCWTMVRGELPFQKCKVALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SV F++K+SNE++ S F+DIVTQMAQD TMS E+R+RLIK+A+WLVES +VP++L QERC
Sbjct: 61   SVIFSDKASNEKIASNFSDIVTQMAQDHTMSGEFRSRLIKLARWLVESEMVPVRLLQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFL E E+IKIKA +LK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+    A
Sbjct: 121  EEEFLGEVELIKIKAQELKVKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRDSE-A 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+  S AT+ IIKSLIGHFDLDPNRVFDIVLECFELQ D++VF++LIPIFPKSHASQIL
Sbjct: 180  PTQKSSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDDDVFIELIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN PVPFGLYRLTA+LVK DFIDLDSIYAHLLP+D++AFEHY+  S+KR
Sbjct: 240  GFKFQYYQRMEVNGPVPFGLYRLTALLVKQDFIDLDSIYAHLLPRDDEAFEHYNTFSSKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIG+INLAA GKDLMDDEKQGDVTIDL+AAIDME++AV E + EL+++Q LGLL+
Sbjct: 300  LDEANKIGRINLAAIGKDLMDDEKQGDVTIDLFAAIDMETDAVEERTTELQSSQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHAH+LF+RLSPLN V HIQIC+ LFRLI+KSIS AYD+IRQ HLQ  G  +
Sbjct: 360  GFLSVDDWYHAHLLFERLSPLNAVEHIQICDSLFRLIKKSISSAYDVIRQTHLQNPGLST 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G  +D ++V D SS   SF+DLPKELFQMLA  GPYLYR+ VLLQKVCRVLRGYYLSALE
Sbjct: 420  GGSTDVMDV-DNSSGFSSFIDLPKELFQMLACTGPYLYRDTVLLQKVCRVLRGYYLSALE 478

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+  G    NP+    G+    LHLKEA+LRVE+ALG CLLPSLQLIPANPAVGQEIWE+
Sbjct: 479  LVSHGNGVLNPQLQVPGNL--HLHLKEARLRVEDALGACLLPSLQLIPANPAVGQEIWEL 536

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEKDDERIPM+L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 537  MSLLPYEVRYRLYGEWEKDDERIPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 596

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERL  GGR+KLKDDG
Sbjct: 597  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLVLGGRDKLKDDG 656

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 657  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIQQMANVQY 716

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KST+RLRD+                
Sbjct: 717  TENLTEEQLDAMAGSETLRYQATSFGVTRNNKALIKSTSRLRDALLPNDEPKLAIPLLQL 776

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQH S+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  ++YA L+P L+DLVH
Sbjct: 777  IAQHCSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPASNYAILVPSLNDLVH 836

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIED-THMSNIXXXXXXXXSAKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+K     DVCWP  D +  +  S          SA +VL+L
Sbjct: 837  LYHLDPEVAFLIYRPVMRLFKSPRNPDVCWPLGDKDAASDASTNFESDPLDHSASMVLNL 896

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GS Q PI WS LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVP++RYESEI+K H+
Sbjct: 897  GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
             LK+LEELSDNS+SAI KRKK+KERIQE LDRL  EL KHE+NV SVRRRLS EKD WLS
Sbjct: 957  NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WKSDESIYERECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNG 2452
            IK DEREDLK           ARK SWVT+EEFGMG ++              +  V +G
Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPSMTKSSAGNSATVQSG 1256

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
              LN+SQ E+V      SG  + D GN +K+Q  R K VDGK ER E+ T+ K+D G  K
Sbjct: 1257 INLNVSQTESV------SGKHV-DSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIK 1309

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESAK------GTAESELRPTIRRS 2110
            LKS S+ NGLD+QS M  ++VQSG  ++ E  K  +ES        GT  +ELR + +RS
Sbjct: 1310 LKSSSMVNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHGTRSTELRTSAKRS 1369

Query: 2109 VAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTSTA 1930
            V A SL K P +Q+  KED +SGK           S+++DL  H  +GR +     T+  
Sbjct: 1370 VPASSLAK-PSKQDPVKEDGRSGK--PVARTSGSLSSDKDLQTHALEGRHT---GTTNVP 1423

Query: 1929 IANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI--DDSEKLRPLP--- 1765
             +N +T+  S K S    + S+D  G+ +  E G  KSS++RAS+  DD   +   P   
Sbjct: 1424 SSNGNTISGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGS 1483

Query: 1764 -SRPTHSPLPDDSFTASKSTDKLQKR-TSPEEHGKPSKRRKGDNEVKDGEVEARFSDRER 1591
             SR  HSP  +++   SKS D++QKR +S EE  +  KRRKGD E++D E E RFS+RE+
Sbjct: 1484 SSRIVHSPRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFETELRFSEREK 1543

Query: 1590 SVDPRLPHSDL--ERTGI---DDQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEK 1426
             +DPR     L  E  G+    D+ L R+KDKGN RY+RDHRER++R DKS G+D + EK
Sbjct: 1544 MMDPRFADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEK 1603

Query: 1425 SRDRSMERYGRERSVERVSFXXXXXXXXXXXRGKPRYNETSGEKYDDRFHRQSXXXXXXX 1246
             RDRS+ERYGRERSVER+             RG  R      EK  D  ++         
Sbjct: 1604 PRDRSIERYGRERSVERMQ-----------ERGSDRSFNRLPEKAKDERNKDDRNKLRYN 1652

Query: 1245 XPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDYLVAQXXXXXXXXX 1066
              +   +S  A RRDED DRR   TRH QRLSP            +  V           
Sbjct: 1653 DAS-AEKSHGAGRRDEDVDRRYGATRHSQRLSP--RHEEKERRWSEETVVSQDDAKRRKE 1709

Query: 1065 XXXXXXXXXXDGLSTKVDERDKINLLKEEMDANAASKRRKLKRDHLSSGEAGEYS-LAPP 889
                        +  +  ER+K N+LKEE+D NAASKRRKLKR+HL + E GEYS +A P
Sbjct: 1710 DDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLPTDEPGEYSAVAHP 1769

Query: 888  QPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITRRDGDQIYDREW 709
                  GM   YDGRDRGDRKG ++Q P Y++E   R+HGKE ASK+ RRD D +YDREW
Sbjct: 1770 PSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKLNRRDSDPLYDREW 1829

Query: 708  EDEKRQRPEQKRRHRK 661
            EDEKRQR +QKRRHRK
Sbjct: 1830 EDEKRQRADQKRRHRK 1845


>ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi|223533086|gb|EEF34845.1|
            tho2 protein, putative [Ricinus communis]
          Length = 1828

 Score = 2321 bits (6014), Expect = 0.0
 Identities = 1243/1864 (66%), Positives = 1414/1864 (75%), Gaps = 33/1864 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPI+  Y+ E+++REWK+G++ F+  + +P+LRFLYELCWTMVRGELP  KCKAAL+
Sbjct: 1    MSLPPIDCIYVREDYIREWKSGSSSFRVPDPVPMLRFLYELCWTMVRGELPYLKCKAALE 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVE+TE  S   L S FADIVTQMAQDLTM  EYR RLIK+AKWLVES+LVPL+ FQERC
Sbjct: 61   SVEYTESVSARVLASTFADIVTQMAQDLTMPGEYRARLIKLAKWLVESSLVPLRFFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNL+REESEGYAKLVTLLCQ G   
Sbjct: 121  EEEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLVREESEGYAKLVTLLCQ-GYDN 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
            +  N S AT+ IIKSLIGHFDLDPNRVFDIVLECFELQ DNN+FLDLIPIFPKSHASQIL
Sbjct: 180  VNSNASAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDNNIFLDLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQR+EVN PVPFGLY+LTA+LVK +FIDLDSIY+HLLP+D++AFEHY A S+KR
Sbjct: 240  GFKFQYYQRLEVNSPVPFGLYKLTALLVKEEFIDLDSIYSHLLPRDDEAFEHYVAFSSKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVT+DL+AA+DME++AVAE   ELEN+Q LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTVDLFAALDMETDAVAERLSELENSQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDW+HAHILFDRLS LNPV H+QIC+GLFRLIEKSIS AYDII Q H+Q     S
Sbjct: 360  GFLSVDDWFHAHILFDRLSLLNPVGHVQICKGLFRLIEKSISAAYDIIHQTHVQNLESSS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G  +++ T  +   RS +DLPKELFQML + GPYLYR+ +LLQKVCRVLRGYYL ALE
Sbjct: 420  GVGCSSMD-TSIALAHRSVIDLPKELFQMLTTVGPYLYRDTILLQKVCRVLRGYYLFALE 478

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            LIG     ++ ES   G+  PR+HL+EAK RVEEALGTCLLPSLQLIPANPAVGQEIWEV
Sbjct: 479  LIGGIDGGTSKESVSMGN--PRVHLREAKSRVEEALGTCLLPSLQLIPANPAVGQEIWEV 536

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEKDDE+ PMVL ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 537  MSLLPYEVRYRLYGEWEKDDEQNPMVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 596

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMI PVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 597  HANPMTVLRTIVHQIEAYRDMIAPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 656

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 657  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELVHQMANVQY 716

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 717  TENLTEEQLDAMAGSETLRYQATSFGVTRNNKALIKSTNRLRDSLLPKDEPRLAIPLLLL 776

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRSVVVI++ AP+IKMV EQFDRCHGTLLQYVEFLCSA+T    YA+LIP LDDLVH
Sbjct: 777  IAQHRSVVVISAGAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPATGYAKLIPSLDDLVH 836

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXSAKLVLDLG 3523
             YHLDPEVAFL+YRPVMRL+KC+  SDV WP DD +     +         S  ++LDLG
Sbjct: 837  LYHLDPEVAFLIYRPVMRLFKCVGNSDVFWPLDDNDAV---STVDSEQTESSGNVILDLG 893

Query: 3522 SPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHSA 3343
            S QKPIMWSDLL+TV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR RYESEI+KQH+A
Sbjct: 894  SSQKPIMWSDLLETVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRDRYESEIAKQHAA 953

Query: 3342 LKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLSS 3163
            LKALEELSDNS+SAI+KRKKDKERIQE LDRLT EL KHE+NV SVRRRLSREKD WLSS
Sbjct: 954  LKALEELSDNSSSAISKRKKDKERIQESLDRLTSELHKHEENVASVRRRLSREKDKWLSS 1013

Query: 3162 CPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLH 2983
            CPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 
Sbjct: 1014 CPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQ 1073

Query: 2982 PMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYS 2803
            PMICCCTEYEAGRLG+FL ETLK+AY+WKSDESIYERECGNMPGFAVYYR+PNSQRVTY 
Sbjct: 1074 PMICCCTEYEAGRLGKFLHETLKIAYYWKSDESIYERECGNMPGFAVYYRFPNSQRVTYG 1133

Query: 2802 QFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAKI 2623
            QFI+VHWKWS RI+RLLIQCLES EYMEIRNALI+LTKIS VFPVT+++GINLEKRVA+I
Sbjct: 1134 QFIKVHWKWSQRISRLLIQCLESTEYMEIRNALILLTKISGVFPVTKRSGINLEKRVARI 1193

Query: 2622 KGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVDXXXXXXXXXXXXAIAV-PNGSA 2446
            K DEREDLK           ARK SWVT+EEFGMG +D             I+V  N S 
Sbjct: 1194 KSDEREDLKVLATSVASALAARKPSWVTDEEFGMGYLDIRPPAASKSVSGNISVGQNSSG 1253

Query: 2445 LNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTKLK 2266
            LN SQ E+ G RA  + +Q  D GN  KE  +R KP D    + E+ + +K+D    K+K
Sbjct: 1254 LNASQGESAGGRAVSTTTQHGDVGNSAKEHISRAKPAD----KQESVSYVKSDSVNQKVK 1309

Query: 2265 SGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES-------AKGTAESELRPTIRRSV 2107
             GSL    D QS       Q+G SR+ E QK   ES        K +AESE + + +R++
Sbjct: 1310 GGSLVIQSDLQSSAALVTGQAGASRSAENQKQMSESPIIIPDAPKNSAESESKASGKRAM 1369

Query: 2106 AAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTSTAI 1927
             AGS+ K P+ Q+VAK+D KSGK           S+++D+ +H ++ R   G   +ST  
Sbjct: 1370 PAGSV-KTPR-QDVAKDDLKSGK--TVGRVPVASSSDKDMPSHLSESRLGNGTNVSSTGT 1425

Query: 1926 ANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRPLPSRPTHS 1747
            +N     + VK                   E G V+                 PSR  HS
Sbjct: 1426 SNDGAAKSVVKDDAT---------------EVGDVQKP---------------PSRVVHS 1455

Query: 1746 PLPDDSF-TASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRERSVDPRL 1573
            P  D SF ++SKS+DKLQKR SP ++  + SKRRKGD E++D + + RFSDRER +D RL
Sbjct: 1456 PRHDGSFASSSKSSDKLQKRASPGDDPDRLSKRRKGDTELRDLDGDIRFSDRERPMDSRL 1515

Query: 1572 PHSDLERTGID-------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEKSRDR 1414
               DL++ G D       D+ L RSKDKG  RYDRDHRER ER DKS G+DILVE+ RDR
Sbjct: 1516 --VDLDKIGSDERVHRSMDKPLDRSKDKGMERYDRDHRERSERPDKSRGDDILVERPRDR 1573

Query: 1413 SMERYGRERSVER--------VSF-XXXXXXXXXXXRGKPRYNETSGEK-YDDRFHRQSX 1264
            SMERYGRERSVER         SF            + K RY +TS EK +DDRF+ Q+ 
Sbjct: 1574 SMERYGRERSVERGQERGGADRSFDRFSDKTKDERNKDKVRYGDTSVEKLHDDRFYGQNL 1633

Query: 1263 XXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDYLVAQXXX 1084
                   P++VPQSV ASRRDEDADRRI + RH  RLSP           E+ LV+Q   
Sbjct: 1634 PPPPPLPPHVVPQSVTASRRDEDADRRIGSARHSLRLSPRHDEKERRRSEENSLVSQDDV 1693

Query: 1083 XXXXXXXXXXXXXXXXDGLSTKVDERDKIN-----LLKEEMDANAASKRRKLKRDHLSSG 919
                            +GL+ KV++R++        LK+++D  AASKRRKLKR+H+ SG
Sbjct: 1694 KRGRDDNFRDRKRDEREGLAMKVEDRERDREREKVPLKDDIDVGAASKRRKLKREHMPSG 1753

Query: 918  EAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITR 742
            EAGEYS +APP PPLA+ M Q YDGR+RGDR G +IQR GYLEEP  R+HGKE A K+TR
Sbjct: 1754 EAGEYSPVAPPPPPLAISMSQSYDGRERGDR-GALIQRAGYLEEPPMRIHGKEVAGKMTR 1812

Query: 741  RDGD 730
            RD D
Sbjct: 1813 RDAD 1816


>ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa]
            gi|222862350|gb|EEE99856.1| F5A9.22 family protein
            [Populus trichocarpa]
          Length = 1836

 Score = 2311 bits (5988), Expect = 0.0
 Identities = 1245/1895 (65%), Positives = 1417/1895 (74%), Gaps = 42/1895 (2%)
 Frame = -1

Query: 6219 SLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALDS 6040
            +LPP+E  Y+TEEFLRE K GN+ F+  + +P+LRFLYEL W +VRGELP QKCKAALDS
Sbjct: 4    TLPPMECLYVTEEFLRELKGGNHSFRLPHPVPILRFLYELSWNLVRGELPFQKCKAALDS 63

Query: 6039 VEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERCE 5860
            VEF +K S   LGS FADI+TQMAQDLTMS EYR+RLIK+AKWLVESALVPL+ FQERCE
Sbjct: 64   VEFVDKVSAVGLGSNFADIITQMAQDLTMSGEYRSRLIKLAKWLVESALVPLRFFQERCE 123

Query: 5859 EEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVAL 5680
            EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLL Q G+   
Sbjct: 124  EEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLYQ-GSEDT 182

Query: 5679 TENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQILG 5500
            TEN S AT+ IIKSLIGHFDLDPNRVFDIVLE FELQ D+NVFL+LIPIFPKSHASQILG
Sbjct: 183  TENTSAATIGIIKSLIGHFDLDPNRVFDIVLEYFELQPDSNVFLELIPIFPKSHASQILG 242

Query: 5499 FKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKRF 5320
            FKFQYYQR+E+N  VPFGLY+LTA+LVK +FIDLDSI AHLLPKD++AFEHY+  S+KR 
Sbjct: 243  FKFQYYQRIELNSHVPFGLYKLTALLVKEEFIDLDSICAHLLPKDDEAFEHYNTFSSKRL 302

Query: 5319 DEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLSG 5140
            DEANKIGKINLAATGKDLMDDEKQGDVT+DL+AA+DME+EAVAE   ELENNQ LGLL+G
Sbjct: 303  DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAALDMEAEAVAERFSELENNQTLGLLTG 362

Query: 5139 FLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPSG 4960
            FLSVDDWYHAH+LF+RLSPLNPVAH QIC GLFRLIEK +S AY+IIRQ H+Q  G P  
Sbjct: 363  FLSVDDWYHAHVLFERLSPLNPVAHTQICNGLFRLIEKLVSSAYNIIRQTHIQSCGSPRI 422

Query: 4959 SGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALEL 4780
            +G DA+ VT +S    SF+DLPKE FQML + GPYLYR+ +LL KVCRVLRGYY+SALEL
Sbjct: 423  AGIDAMGVTSSSG-HVSFIDLPKEFFQMLVTVGPYLYRDTLLLHKVCRVLRGYYMSALEL 481

Query: 4779 IGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEVM 4600
            + SG  A N E    G+R PRLHL+EA+ RVEEALG CLLPSLQL+PANPAVGQEIWEVM
Sbjct: 482  VDSGDGALNGELLIPGNRVPRLHLREARSRVEEALGACLLPSLQLVPANPAVGQEIWEVM 541

Query: 4599 SLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH 4420
            SLLPYE RYRLYGEWEKDDER P++L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH
Sbjct: 542  SLLPYEVRYRLYGEWEKDDERNPVILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH 601

Query: 4419 ANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDGL 4240
            ANP+TVLRTIVHQIE+YRDMI+PVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDGL
Sbjct: 602  ANPMTVLRTIVHQIESYRDMISPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDGL 661

Query: 4239 NLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQYT 4060
            NLSDWLQSLASFWGHLCKKYPSMELRGLFQYL NQLKK              QMANVQYT
Sbjct: 662  NLSDWLQSLASFWGHLCKKYPSMELRGLFQYLGNQLKKGQGIELVLLQELLQQMANVQYT 721

Query: 4059 ENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXXX 3880
            EN+TEEQLDAMAGSETLRYQAT FG+TR NKAL KS NRLRDS                 
Sbjct: 722  ENLTEEQLDAMAGSETLRYQATSFGVTRTNKALFKSANRLRDSLLPKDEPKPAIPLLLLI 781

Query: 3879 AQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVHK 3700
            AQHRSVVVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  ++YAQLIP LDDLVH 
Sbjct: 782  AQHRSVVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPPSAYAQLIPSLDDLVHL 841

Query: 3699 YHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIED-THMSNIXXXXXXXXSAKLVLDLG 3523
            YHLDPEVAFL+YRPVMRL+KC    DV WP ++ +  T+ S I        S  ++LDLG
Sbjct: 842  YHLDPEVAFLIYRPVMRLFKCAGSLDVFWPLENNKTVTNTSAILEPEAIECSGGVILDLG 901

Query: 3522 SPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHSA 3343
            S  K + WSDLL+TV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+A
Sbjct: 902  SSHKSVTWSDLLETVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHAA 961

Query: 3342 LKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLSS 3163
            LKALEELSDNS+SAITKRKK+KERIQE LDRLT EL+KHEDNV SVRRRLS EKD WL+S
Sbjct: 962  LKALEELSDNSSSAITKRKKEKERIQESLDRLTSELRKHEDNVSSVRRRLSCEKDKWLTS 1021

Query: 3162 CPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLH 2983
            CPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNH+DVLICKTL 
Sbjct: 1022 CPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHVDVLICKTLQ 1081

Query: 2982 PMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYS 2803
            PMICCCTEYEAGRLGRFL+ETLK+AY+WKSDESIYE ECGNMPGFAVYYR+PNSQRVTY 
Sbjct: 1082 PMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYEHECGNMPGFAVYYRFPNSQRVTYG 1141

Query: 2802 QFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAKI 2623
            QFI+                     YMEIRNALI+LTKIS VFP       +++  V +I
Sbjct: 1142 QFIK---------------------YMEIRNALILLTKISGVFPFFSFVSFSIQ--VTRI 1178

Query: 2622 KGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNGS 2449
            K DEREDLK           ARK SW+T+EEFGMG ++              A A  N S
Sbjct: 1179 KSDEREDLKVLATGVAAALAARKPSWITDEEFGMGYLEIKPPSAASKSLSGNAAAAQNSS 1238

Query: 2448 ALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTKL 2269
            ALN+SQ E    RA  +GSQ  D GN  +EQ +R K  DG+ +RT+N +  K D G  K 
Sbjct: 1239 ALNVSQGEPAEGRAPHTGSQHGDPGNSTREQISRAKHADGRSDRTDNVSHSKFDQGHQKS 1298

Query: 2268 KSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESA-------------KGTAESELR 2128
            K GS  NG ++QS  + AAV  G SR +E +K  D+S+             K  AESE++
Sbjct: 1299 KGGSSTNGSNAQSAGSAAAVHVGASR-SENRKGVDDSSNRTLEDGTVRAAPKNLAESEMK 1357

Query: 2127 PTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGG 1948
             + +R V     +K PK Q+V K+DNKSGK           ++++D+  H ++GR   G 
Sbjct: 1358 ISTKRLV-----SKTPK-QDVVKDDNKSGK--AVGRTPSSSTSDKDIQVHLSEGR-QGGA 1408

Query: 1947 ANTSTAIA-NSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRP 1771
            AN S+A+  N + + TS K ST S R S      VA+++                     
Sbjct: 1409 ANVSSALTLNGNAVSTSGKISTLSTRAS---DSYVADVQK-------------------- 1445

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRE 1594
             P +  HSP  D+S  ASKS+DKLQKR SP EE  + SKRRKGD E++D E E +FS+RE
Sbjct: 1446 -PPQLVHSPRHDNSVAASKSSDKLQKRASPAEEPDRSSKRRKGDGELRDLEGEVKFSERE 1504

Query: 1593 RSVDPRLPHSDLERTGIDDQSLH--------RSKDKGNGRYDRDHRERLERQDKSTGEDI 1438
            RS D R   +DL++ G D+Q+ H        RSKDKGN RYDRDHRER ER DKS G+D 
Sbjct: 1505 RSTDTR--SADLDKVGNDEQNKHRSTDKPLDRSKDKGNDRYDRDHRERSERPDKSHGDDS 1562

Query: 1437 LVEKSRDRSMERYGRERSVER-------VSFXXXXXXXXXXXRGKPRYNETSGEKY--DD 1285
            L ++SRD+SMERYGRERS ER        SF           R K RYN+TS EK   DD
Sbjct: 1563 LADRSRDKSMERYGRERSDERGMDRGTDRSF-DRLADKAKDDRSKLRYNDTSAEKSQGDD 1621

Query: 1284 RFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDY 1105
            RFH Q+        P+MVPQSV + RRDEDADRR   TRH QRLSP           E+ 
Sbjct: 1622 RFHGQNLPPPPPLPPHMVPQSVTSGRRDEDADRRFGTTRHAQRLSPRHDEKERRRSEENS 1681

Query: 1104 LVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD------ERDKINLLKEEMDANAASKRRKL 943
            LV+Q                   +GLS KV+      ER+K +LLKEEMDA AA+KRRK+
Sbjct: 1682 LVSQDDTKRRKEDDVRERKREEREGLSIKVEEREREREREKTHLLKEEMDAGAAAKRRKI 1741

Query: 942  KRDHLSSGEAGEYS-LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGK 766
            KRDHL +GEAGEYS +APP PPL  GM Q YDGRDRGDRKG  IQR  YLEEP  R+HGK
Sbjct: 1742 KRDHLPTGEAGEYSPVAPPPPPLGSGMSQSYDGRDRGDRKGGTIQRTSYLEEPSIRIHGK 1801

Query: 765  ETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            + A K+ RRD D +YDREW+++KRQR EQKRRHRK
Sbjct: 1802 DVAGKMARRDADPMYDREWDEDKRQRAEQKRRHRK 1836


>ref|XP_006826587.1| hypothetical protein AMTR_s00138p00044110 [Amborella trichopoda]
            gi|548830968|gb|ERM93824.1| hypothetical protein
            AMTR_s00138p00044110 [Amborella trichopoda]
          Length = 2456

 Score = 2277 bits (5901), Expect = 0.0
 Identities = 1224/1885 (64%), Positives = 1409/1885 (74%), Gaps = 32/1885 (1%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MS   +E KY T+E LREWK  +  FK    +P LRFLYELCWTMVRG+LP  KCK ALD
Sbjct: 1    MSPLGVERKYFTDECLREWKAPSTSFKLPVPVPSLRFLYELCWTMVRGDLPFAKCKTALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF++K S +ELGS+FADI+  M QDLT+  +YRTRL+K+AKWL+ES LVPL+LFQERC
Sbjct: 61   SVEFSDKRSKDELGSVFADIIAHMGQDLTLPGDYRTRLVKLAKWLMESGLVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE EMIKIKA DLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ G   
Sbjct: 121  EEEFLWECEMIKIKAQDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVGPET 180

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T   S A +S +KSLIGHFDLDPNRVFD+VLECFELQ DN +F DLIPIFPKSHASQIL
Sbjct: 181  STRKTSDAVISSLKSLIGHFDLDPNRVFDVVLECFELQPDNTIFFDLIPIFPKSHASQIL 240

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVNDPVP GLYRL AVLVK++FIDLDSI AHLLPKDE+AFE Y+  S K+
Sbjct: 241  GFKFQYYQRMEVNDPVPHGLYRLAAVLVKSEFIDLDSICAHLLPKDEEAFELYETFSTKQ 300

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
            F+EANKIGKINLAA GK+LMDDEKQGDVTIDL+ A+DME+EAVAE SPELE NQ LGLL+
Sbjct: 301  FEEANKIGKINLAAIGKELMDDEKQGDVTIDLFTALDMENEAVAERSPELEKNQYLGLLN 360

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFL VDDW+HAHILFDRL+PLNPVAHIQIC GLFR IEKSIS  YDII Q HLQ  G  S
Sbjct: 361  GFLDVDDWFHAHILFDRLAPLNPVAHIQICNGLFRAIEKSISSTYDIINQSHLQILGGAS 420

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            GS SDA+E++     Q + V LPKELFQMLA AGPYL+RNVVLLQKVCRVLR YY SA E
Sbjct: 421  GSASDAMELSGEPPCQSTSVVLPKELFQMLACAGPYLHRNVVLLQKVCRVLRKYYQSAQE 480

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+        P S+    RDPRL LKEA+ RVEEALG+C+LPSLQLIPANPAVGQEIWE+
Sbjct: 481  LVDYLVEII-PRSSHGDHRDPRLQLKEARSRVEEALGSCILPSLQLIPANPAVGQEIWEL 539

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            MSLLPYE RYRLYGEWEKDDE +P++  ARQTA+LDTRRILKRLAKENLKQLGRMVAK+A
Sbjct: 540  MSLLPYEVRYRLYGEWEKDDESLPLLFAARQTARLDTRRILKRLAKENLKQLGRMVAKIA 599

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            H NP+TVLRTIVHQIEAYRDMI PVVDAFKYLTQLEYD+LEYVVIERLAQGGREKLKDDG
Sbjct: 600  HGNPMTVLRTIVHQIEAYRDMIAPVVDAFKYLTQLEYDILEYVVIERLAQGGREKLKDDG 659

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWG LCKKYPSMELRGLFQYLVNQLKK              QMANVQY
Sbjct: 660  LNLSDWLQSLASFWGSLCKKYPSMELRGLFQYLVNQLKKGNGIELILLQELVQQMANVQY 719

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TENM+EEQLDAMAG ETLRYQAT FG+T+NNKAL KSTNRLRDS                
Sbjct: 720  TENMSEEQLDAMAGGETLRYQATSFGITKNNKALVKSTNRLRDSLLAKEEPKLAIPLLLL 779

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHR++VVIN+DAP+IKMV EQFDRCHGTLLQYVEFL +A+T + +YA LIP LDDL+H
Sbjct: 780  IAQHRALVVINADAPYIKMVSEQFDRCHGTLLQYVEFLSNAVTPSTAYAHLIPSLDDLIH 839

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCD-DIEDTHMSNIXXXXXXXXSAKLVLDL 3526
            KY LDPEVAFL+YRPVMRL+KCL  SD  WP +   E T ++          S+++VLDL
Sbjct: 840  KYCLDPEVAFLIYRPVMRLFKCLRSSDTFWPSEFTREATQLNADKESDHSFSSSEMVLDL 899

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            GSP+KPI WSDLL TVR+MLPSKAWNSLSP+LYATFWGLTLYDLYVP++RYESEI+KQH+
Sbjct: 900  GSPRKPITWSDLLGTVRSMLPSKAWNSLSPELYATFWGLTLYDLYVPKNRYESEIAKQHA 959

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALK  EE SDNSNSAI KRKKDKERIQE+LDRLT EL KHE+NV SVR+RL+REKD WL+
Sbjct: 960  ALKNSEEQSDNSNSAIAKRKKDKERIQEILDRLTNELHKHEENVASVRKRLAREKDIWLT 1019

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC+FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVL+CKTL
Sbjct: 1020 SCPDTLKINMEFLQRCIFPRCVFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLVCKTL 1079

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLKMAY+WKSDE+IYERECGNMPGFAVYYR PNSQRVT+
Sbjct: 1080 QPMICCCTEYEAGRLGRFLYETLKMAYYWKSDEAIYERECGNMPGFAVYYRDPNSQRVTF 1139

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
            SQFIRVHWKWSGRITRLLIQCLES EYMEIRNALI+LTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1140 SQFIRVHWKWSGRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVAK 1199

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVPN 2455
            IK DEREDLK           ARKS+WV+EEEFGMG VD               A+   N
Sbjct: 1200 IKLDEREDLKVLATGVAAALAARKSTWVSEEEFGMGYVDLKSAAAPAAKPLTSNAVTSAN 1259

Query: 2454 GSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQT 2275
              +L  SQIE    R   S +Q +D  N  K+  +R KPVDG+LER ++  L K D GQ 
Sbjct: 1260 NQSLVNSQIENGATRNVTSATQASDGLNSSKDHMSRSKPVDGRLERADSVPLNKIDAGQA 1319

Query: 2274 KLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESAKGTA-----------ESELR 2128
            K K  S+ N  ++Q  + +A   SGTSR+  +QK++DE  KG+            ++E R
Sbjct: 1320 KSKGSSVVNTAEAQ--INSAVAFSGTSRSPGLQKNADEPIKGSTDESMSKVVAKLDTESR 1377

Query: 2127 PTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGG 1948
            P  +R   +GSLTKQ K  +V K+D+KSGK                 S    DG  S   
Sbjct: 1378 PLAKRGAHSGSLTKQSK-ADVTKDDSKSGKPSSRVTVLPLS------STGERDGLLS--- 1427

Query: 1947 ANTSTAIANSSTMPTSVKSSTASARTSVDM--HGSVANIESGAVKSSELRASIDDSEKLR 1774
             N S A  N ST    +    A+A T++ M     VA   S  V + +    +D ++ LR
Sbjct: 1428 -NPSVAAGNGSTASAPMHGKAAAA-TNIKMIVDSGVAKQMSQRVGAEKDSDEVDAADGLR 1485

Query: 1773 PLPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGE-VEARFSD 1600
             L SRP+ SP  D+   A+K ++K  +R+SP EE  +  KRRKG+ + KDG+ +EARFSD
Sbjct: 1486 ALSSRPSVSPFSDE---AAKFSEKQLRRSSPSEELDRHMKRRKGEMDAKDGDGLEARFSD 1542

Query: 1599 RERSVDPRLPHSDLERTGIDDQSLHR-SKDKGNGRYDRDHRERLERQDKSTGEDILVEKS 1423
            RER     L   D +RTG D+Q + R +++K + R+DRDHR R         ED+LVEK+
Sbjct: 1543 RERDKSHPL---DYDRTGSDEQVMDRPTREKLSERFDRDHRPR--------SEDVLVEKA 1591

Query: 1422 RDRSMERYGRERSVERVS-------FXXXXXXXXXXXRGKPRYNETSGEKY--DDRFHRQ 1270
            RDRSMER+GRERSV+R S                   RGKPRY+ET  E+   DDRFH Q
Sbjct: 1592 RDRSMERHGRERSVDRGSGRSFDRAGDKSKDERGKEERGKPRYSETPVERSHPDDRFHGQ 1651

Query: 1269 SXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRL-SPXXXXXXXXXXXEDYLVAQ 1093
            S        PN+VPQSV  SRRDE+ D+R+ + RH+QRL SP           ++ +V+ 
Sbjct: 1652 SLPPPPPLPPNIVPQSVAVSRRDEEQDKRVGSARHMQRLSSPRHEEKEKRRSEDNSVVSL 1711

Query: 1092 XXXXXXXXXXXXXXXXXXXDGLSTKVDERD--KINLLKEEMDANAASKRRKLKRDHLSSG 919
                               D LS +VDERD  K N LK++ DA AASKRR++K+DH+   
Sbjct: 1712 DDAKHRREEEFRERKRDDRDTLSLRVDERDREKGNQLKDDSDA-AASKRRRIKKDHIGD- 1769

Query: 918  EAGEYSLAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITRR 739
             AGEY L  P  PL +GM Q YD RDRG+RKG + QR  Y+EEP+PR+H KET SKITRR
Sbjct: 1770 TAGEYPLMAPS-PLPMGMSQSYDNRDRGERKGAVAQRATYMEEPLPRVHAKETPSKITRR 1828

Query: 738  DGDQIYDREWEDEKRQRPEQKRRHR 664
            D +Q+++R+W+DEKRQR + KR+HR
Sbjct: 1829 DNEQMHERDWDDEKRQRVDTKRKHR 1853


>ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa]
            gi|550325266|gb|ERP53839.1| F5A9.22 family protein
            [Populus trichocarpa]
          Length = 1805

 Score = 2248 bits (5826), Expect = 0.0
 Identities = 1211/1881 (64%), Positives = 1386/1881 (73%), Gaps = 28/1881 (1%)
 Frame = -1

Query: 6219 SLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALDS 6040
            +LPP+E  ++TEEFL E K+GN  F+  + +P+LRFLYEL WT+VRGELP QKCKAALDS
Sbjct: 4    TLPPMECLHVTEEFLLELKSGNRSFRLPHPVPILRFLYELSWTLVRGELPFQKCKAALDS 63

Query: 6039 VEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERCE 5860
            VEF +K S   LGS FADI+TQMAQDLTMS EYR+RLIK+AKWLVESALVPL+ FQERCE
Sbjct: 64   VEFVDKMSAVGLGSNFADIITQMAQDLTMSGEYRSRLIKLAKWLVESALVPLRFFQERCE 123

Query: 5859 EEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVAL 5680
            EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+ G+   
Sbjct: 124  EEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCR-GSEDT 182

Query: 5679 TENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQILG 5500
             EN S AT+ IIKSLIGHFDLDPNRVFDIVLECFELQ D+NVFL+LIPIFPKSHASQILG
Sbjct: 183  AENTSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDSNVFLELIPIFPKSHASQILG 242

Query: 5499 FKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKRF 5320
            FKFQYYQRME+N PVPFGL++LTA+LVK +FIDLDSI AHLLPKD++AFEHY+  S+KR 
Sbjct: 243  FKFQYYQRMELNSPVPFGLFKLTALLVKEEFIDLDSICAHLLPKDDEAFEHYNTFSSKRL 302

Query: 5319 DEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLSG 5140
            D A KIGKINLAATGKDLMDDEKQGDVT+DL+AA+DME+EAVAE   +LE NQ LGLL+G
Sbjct: 303  DAAYKIGKINLAATGKDLMDDEKQGDVTVDLFAALDMETEAVAEQFSDLEKNQTLGLLTG 362

Query: 5139 FLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPSG 4960
            FLSVDDWYHAHILF RLSPLNPVAH QIC GLFRLIEK+IS AY+IIRQ H+Q  G P+ 
Sbjct: 363  FLSVDDWYHAHILFKRLSPLNPVAHTQICGGLFRLIEKTISSAYNIIRQPHVQNCGSPAV 422

Query: 4959 SGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALEL 4780
            +G DA++VT +S    S +DLPKE FQML + GPYLYR+ +LLQKVCRVLRGYY+SALEL
Sbjct: 423  AGIDAMDVTSSSG-HVSLIDLPKEFFQMLVTVGPYLYRDTLLLQKVCRVLRGYYMSALEL 481

Query: 4779 IGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEVM 4600
            + SG  A N ES    +R  RLHL+E +  VEEALG CLLPSLQL+PANPA GQEIWEVM
Sbjct: 482  VDSGDGALNGESLIPRNRVLRLHLREGRSMVEEALGACLLPSLQLVPANPAAGQEIWEVM 541

Query: 4599 SLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH 4420
            SLLPYE RYRLYGEWEKDDER P+VL ARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH
Sbjct: 542  SLLPYEVRYRLYGEWEKDDERNPLVLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLAH 601

Query: 4419 ANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDGL 4240
            ANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDGL
Sbjct: 602  ANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDGL 661

Query: 4239 NLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQYT 4060
            NLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQYT
Sbjct: 662  NLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELVQQMANVQYT 721

Query: 4059 ENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXXX 3880
            EN+TEEQLDAMAGSETLRYQAT FG+TRNNKAL KSTNRLRDS                 
Sbjct: 722  ENLTEEQLDAMAGSETLRYQATSFGVTRNNKALFKSTNRLRDSLLPKDEPKLAIPLLLLI 781

Query: 3879 AQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVHK 3700
            AQHRSVVVIN+DAP+IKMV EQFDRCHGTLLQYVEFLC A+T  ++YAQLIP LDDLVH 
Sbjct: 782  AQHRSVVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCGAVTPPSAYAQLIPSLDDLVHL 841

Query: 3699 YHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIED-THMSNIXXXXXXXXSAKLVLDLG 3523
            YHLDPEVAFL+YRPVMRL+KC    +V WP D  E  T  S          S +++LDLG
Sbjct: 842  YHLDPEVAFLIYRPVMRLFKCEGSLEVFWPLDSSETVTTTSANLEPEAREYSGRVILDLG 901

Query: 3522 SPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHSA 3343
            S  KP+MWSDLL+T++TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQ +A
Sbjct: 902  SLHKPVMWSDLLETLKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQQAA 961

Query: 3342 LKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLSS 3163
            LKALEELSDNS+SAITKRKK+KERIQE LDRLT EL KHE+NV SVRRRLSREKD WL+S
Sbjct: 962  LKALEELSDNSSSAITKRKKEKERIQESLDRLTSELHKHEENVSSVRRRLSREKDKWLTS 1021

Query: 3162 CPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLH 2983
            CPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 
Sbjct: 1022 CPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQ 1081

Query: 2982 PMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYS 2803
            PMICCCTEYEAGRLGRFL+ETLK+AY+WKSDE+IYERECGNMPGFAVYYR+PNSQRVTY 
Sbjct: 1082 PMICCCTEYEAGRLGRFLYETLKIAYYWKSDETIYERECGNMPGFAVYYRFPNSQRVTYG 1141

Query: 2802 QFI-RVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
            QFI R +         LLI C+                                   V +
Sbjct: 1142 QFIKRRNGNCYSAFYVLLILCI----------------------------------LVTR 1167

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD--XXXXXXXXXXXXAIAVPNG 2452
            IK DEREDLK           ARK SWVT+EEFGMG +D                A  N 
Sbjct: 1168 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLDIKPPSVASKSLSGNVAAAQNS 1227

Query: 2451 SALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTK 2272
            SALN+SQ E    RA  +GSQ  D GN  ++  +R K  DG+ +RTEN + +K+DLG  K
Sbjct: 1228 SALNVSQGEPADGRALVTGSQHGDPGNSNRDPISRAKHADGRSDRTENISHLKSDLGHQK 1287

Query: 2271 LKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDESAKGTAESELRPTIRRSVAAGSL 2092
             K         S+S      +   T+RT E       +AK  AESEL+ + +R V     
Sbjct: 1288 SKGA-------SRSAENQKGMDDSTNRTLE-DSTVRVAAKNLAESELKVSTKRPV----- 1334

Query: 2091 TKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTSTAIANSST 1912
            +K PK Q+V K+DNKSGK           ++++D+  H ++GR   G +N S+ + ++ +
Sbjct: 1335 SKTPK-QDVVKDDNKSGK--GVGRTLSSSTSDKDIQVHLSEGR-QGGASNVSSVLTSNES 1390

Query: 1911 MPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRPLPSRPTHSPLPDD 1732
             P              D  G+   ++  A + ++++            PSR  HSP  D+
Sbjct: 1391 KP--------------DSGGNKPMLKDEATEVADVQKP----------PSRLVHSPRHDN 1426

Query: 1731 SFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEARFSDRERSVDPRLPHSDLE 1555
            S  ASKS+DKLQKR SP EE  + SKR+KGD E++D E E +FS+RERS D R   +DL+
Sbjct: 1427 SVAASKSSDKLQKRASPAEEPDRLSKRQKGDVELRDLEGEVKFSERERSTDTR--SADLD 1484

Query: 1554 RTGID--------DQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEKSRDRSMERY 1399
            + G D        D+ L RSKDKGN RYDRDHRER ER DKS G+D L ++SRD+SMERY
Sbjct: 1485 KVGNDEHNLYRSVDKPLDRSKDKGNDRYDRDHRERSERPDKSRGDDSLADRSRDKSMERY 1544

Query: 1398 GRERSVERVS------FXXXXXXXXXXXRGKPRYNETSGEK--YDDRFHRQSXXXXXXXX 1243
            GRE SVER                    R K RYN+TS EK   DDRFH Q+        
Sbjct: 1545 GRELSVERGQDRVADRSFDRLADKAKDDRSKLRYNDTSAEKSQVDDRFHGQNLPPPPPLP 1604

Query: 1242 PNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXXXXEDYLVAQXXXXXXXXXX 1063
            P+MVPQSV + RRDEDADRR   TRH+QRLSP           E+ LV+Q          
Sbjct: 1605 PHMVPQSVTSGRRDEDADRRFGTTRHVQRLSPRHDEKERRRSEENSLVSQDDAKRRKEDD 1664

Query: 1062 XXXXXXXXXDGLSTKVD------ERDKINLLKEEMDANAASKRRKLKRDHLSSGEAGEYS 901
                     +GLS KV+      ER+K NL KEEMD++A +KRRKLKRDHL +GEAGEYS
Sbjct: 1665 VRERKREEREGLSIKVEEREREREREKTNLPKEEMDSSATAKRRKLKRDHLPTGEAGEYS 1724

Query: 900  -LAPPQPPLALGMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITRRDGDQI 724
             +APP PPL +G+   YDGR+RGDRKG M QR  YLEEP+ R+HGK+   K+ RRD D +
Sbjct: 1725 PVAPPPPPLGIGISHSYDGRERGDRKGAMNQRASYLEEPLMRIHGKDVVGKMGRRDTDPM 1784

Query: 723  YDREWEDEKRQRPEQKRRHRK 661
            YDREW+++KRQR EQKRRHRK
Sbjct: 1785 YDREWDEDKRQRAEQKRRHRK 1805


>ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solanum tuberosum]
          Length = 1859

 Score = 2220 bits (5753), Expect = 0.0
 Identities = 1213/1899 (63%), Positives = 1413/1899 (74%), Gaps = 45/1899 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSL P+E+ Y TE+ ++E KNGN  FK +  +P LRFLYELCW MVRGELP QKCK AL+
Sbjct: 1    MSLSPLEYLYFTEDSIKELKNGNTSFKFAQPLPTLRFLYELCWVMVRGELPFQKCKMALE 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
             VEF + +S EELGS  ADIVTQ+AQDL++  E R R+ K+AKWLVESALVPL+ FQERC
Sbjct: 61   CVEFVDYASQEELGSSLADIVTQLAQDLSLPGENRQRVNKLAKWLVESALVPLRFFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWESEMIKIKA DLK+KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ    +
Sbjct: 121  EEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQIPEGS 180

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             ++N S ATV IIKSLIGHFDLDPNRVFDIVLECFE Q  N++FLDLIPIFPKSHASQIL
Sbjct: 181  -SQNSSAATVGIIKSLIGHFDLDPNRVFDIVLECFERQPGNSIFLDLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQR+EVNDPVP  LY+LTA+LVK DFID+DSIYAHLLPK+EDAF+HY+A SAKR
Sbjct: 240  GFKFQYYQRLEVNDPVPSELYQLTALLVKRDFIDVDSIYAHLLPKEEDAFDHYNAFSAKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIG+INLAATGKDLMD+EKQGDVT+DLYAA+DME+EAVAE S ELEN+Q LGLL 
Sbjct: 300  LDEANKIGRINLAATGKDLMDEEKQGDVTVDLYAALDMETEAVAERSSELENSQPLGLLM 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFL VDDWYHAH+LF RLS LNP  H+QIC+GLFRLIEKSIS   D++ ++ L   G  S
Sbjct: 360  GFLEVDDWYHAHVLFGRLSHLNPAEHVQICDGLFRLIEKSISGPNDLVCKMQL--LGSLS 417

Query: 4962 GSGSD-ALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSAL 4786
            G  +D ++EV ++SS  RS+++L KELF+ML+S GP+LYR+ +LLQKVCRVLRGYY+ A 
Sbjct: 418  GVVTDNSMEVANSSS-SRSYINLRKELFEMLSSVGPHLYRDTLLLQKVCRVLRGYYICAH 476

Query: 4785 ELIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWE 4606
            EL+ SG      ++   G R P++HLK+A  R+ EALG CLLPSLQLIPANPAVG EIWE
Sbjct: 477  ELVTSGETGFISQTVTIGDRTPQMHLKDATSRIVEALGGCLLPSLQLIPANPAVGLEIWE 536

Query: 4605 VMSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKL 4426
            +MSLLPYE RYRLYGEWEKDDE+ PM+L ARQTAKLDTRRILKRLAKENLKQLGRMVAKL
Sbjct: 537  LMSLLPYELRYRLYGEWEKDDEQFPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKL 596

Query: 4425 AHANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDD 4246
            AHANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQ GREKLKDD
Sbjct: 597  AHANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQSGREKLKDD 656

Query: 4245 GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQ 4066
            GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANV 
Sbjct: 657  GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGNGIELVFMQELIQQMANVH 716

Query: 4065 YTENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXX 3886
            YTENMTEEQLDAMAGS+TLRYQAT FG+TRNNKAL KSTNRLRD+               
Sbjct: 717  YTENMTEEQLDAMAGSDTLRYQATSFGITRNNKALIKSTNRLRDALLPKDEPKLAIPLLL 776

Query: 3885 XXAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLV 3706
              AQHRS+VVIN++ P+IKMV EQFDRCHG LLQYVEFL SA+T TA+YA LIP L++LV
Sbjct: 777  LIAQHRSLVVINAEVPYIKMVSEQFDRCHGALLQYVEFLSSAVTPTAAYAVLIPALEELV 836

Query: 3705 HKYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLD 3529
            H YHLDPEVAFL+YRPVMRL++C   SDV WP D  E  + +N         S A L+LD
Sbjct: 837  HVYHLDPEVAFLIYRPVMRLFRCQRNSDVFWPSDSDEAVNAANAEKESERSDSSAYLLLD 896

Query: 3528 LGSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQH 3349
            LGS +KPI W+DLLDT++TMLPSKAWNSLSPDLYATFWGLTLYDL+VPRSRYESEI KQH
Sbjct: 897  LGSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQH 956

Query: 3348 SALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWL 3169
            +ALKALEELSDNS+SAITKRKKDKERIQE LDRLT ELQ+HE++V SVRRRL+REKDTWL
Sbjct: 957  AALKALEELSDNSSSAITKRKKDKERIQESLDRLTTELQRHEEHVTSVRRRLTREKDTWL 1016

Query: 3168 SSCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKT 2989
            SSCPDTLKINMEFLQRCIFPRC FSMPDAVYCA+FV+TLHSLGTPFFNTVNHIDVLICKT
Sbjct: 1017 SSCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKT 1076

Query: 2988 LHPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVT 2809
            L PMICCCTEYE GRLGRFL+ETLK AY+WK DESIYERECGNMPGFAVYYRYPNSQRVT
Sbjct: 1077 LQPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVT 1136

Query: 2808 YSQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVA 2629
            Y QFI+VHWKWS RITRLLIQCLES EYMEIRNALI+LTKISSVFPVTRK+GINLEKRVA
Sbjct: 1137 YGQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVA 1196

Query: 2628 KIKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVP 2458
            KIK DEREDLK           +RK SWVT+EEFGMG ++               ++A+P
Sbjct: 1197 KIKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAAPASKSSAGNSVAIP 1256

Query: 2457 NGSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQ 2278
            NGS  ++SQ E    R   +G                   VDGKL+R + +++ K DLGQ
Sbjct: 1257 NGSGASVSQGEPSIGRTVVAGI-----------------VVDGKLDRPD-SSMPKPDLGQ 1298

Query: 2277 TKLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES-----AKGTAESELRPTIRR 2113
            TK K     NGLD QS M +A +QS T       +  +ES     +K + E E R T +R
Sbjct: 1299 TKQKGSQSINGLDVQS-MPSATLQSDTPSQNSTCRPLEESTIKAASKMSGEQEGRATGKR 1357

Query: 2112 SVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTST 1933
            +  AGSL+KQ K  ++AK+D KSGK               D+S +P++ R S G  N ST
Sbjct: 1358 ATPAGSLSKQQK-HDIAKDD-KSGKAVGRASGAA----SGDVS-YPSESRAS-GSVNVST 1409

Query: 1932 AIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASI------DDSEKLRP 1771
             ++ + +M ++     AS    +D   + +N E    KS++LR S       + S+  + 
Sbjct: 1410 TVSGNGSMFSAAPKGAASLTRLLD-PSNESNAELTTTKSADLRVSAGKDDVSESSDVHKE 1468

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEV-EARFSDR 1597
               R  HSP  D    ASK+ +K+QKR+ P EE  + +KRRKG+ + +D E  +AR S++
Sbjct: 1469 STLRLVHSPRHD----ASKANEKVQKRSIPAEELDRLNKRRKGEIDGRDIECGDARSSEK 1524

Query: 1596 ERSVDPR----LPHSDLERTGIDDQSLH--------RSKDKGNGRYDRDHRERLERQDKS 1453
            ER +D R    L  +D +R G DDQ L+        RSKDKG  R +RD RER +R D+S
Sbjct: 1525 ERLIDARAADKLHPADYDRHGSDDQILNRASEKPLDRSKDKGGERLERDPRERGDRPDRS 1584

Query: 1452 TGEDILVEKSRDRSMERYGRERSVERVS--------FXXXXXXXXXXXRGKPRYNETSGE 1297
             G+D   EKSRDRS ER+GRERS+ERV                     R K R++E S E
Sbjct: 1585 RGDDAF-EKSRDRSTERHGRERSIERVHERVADRNFDRLSKDERIKDDRSKLRHSEASVE 1643

Query: 1296 K--YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXX 1123
            K   DDR + Q+        P++VPQS+NA RRD+D+DRR    RH QRLSP        
Sbjct: 1644 KSLTDDRLYNQNLPPPPPLPPHLVPQSINAGRRDDDSDRRFGTARHSQRLSPRHDERERR 1703

Query: 1122 XXXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD----ERDKINLLKEEMDANAASK 955
               E+  + Q                   + LS KV+    ER+K  L+KE+MD N ASK
Sbjct: 1704 RSEENNTLLQDDLKRRREDDFRDRKREERE-LSIKVEEREREREKAILVKEDMDPN-ASK 1761

Query: 954  RRKLKRDHLSSGEAGEYSLAPPQPPLALGMVQPYDGRDRGDRKGTMI-QRPGYLEEPMPR 778
            RRKLKR+H++S E GEYS A   PPL++ M QP DGRDRG+RKG ++ QRPGYL+EP  R
Sbjct: 1762 RRKLKREHMAS-EPGEYSPAAHPPPLSINMTQPSDGRDRGERKGVIVQQRPGYLDEPGLR 1820

Query: 777  MHGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            +HGKE+ASK  RRD D +YDREW+D+KRQR E KRRHRK
Sbjct: 1821 IHGKESASKAPRRDADSMYDREWDDDKRQRAEPKRRHRK 1859


>ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solanum lycopersicum]
          Length = 1858

 Score = 2205 bits (5714), Expect = 0.0
 Identities = 1204/1899 (63%), Positives = 1407/1899 (74%), Gaps = 45/1899 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSL P+E+ Y TE  ++E KNGN  FK +  +P LRFLYELCW MVRGELP QKCK AL+
Sbjct: 1    MSLSPLEYLYFTEHSIKELKNGNTSFKFAQPLPTLRFLYELCWVMVRGELPFQKCKLALE 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
             VEF + +S EELGS  ADIVTQ+AQDL++  E R R+ K+AKWLVESALVPL+ FQERC
Sbjct: 61   CVEFVDYASQEELGSSLADIVTQLAQDLSLPGENRQRVNKLAKWLVESALVPLRFFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWESEMIKIKA DLK+KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ    +
Sbjct: 121  EEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQIPEDS 180

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             ++N S ATV IIKSLIGHFDLDPNRVFDIVLECFE Q  N++FLDLIPIFPKSHASQIL
Sbjct: 181  -SQNASAATVGIIKSLIGHFDLDPNRVFDIVLECFERQPGNSIFLDLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQR+EVNDPVP  LY+LTA+LVK DFID+DSIYAHLLPK+EDAF+HY+A SAKR
Sbjct: 240  GFKFQYYQRLEVNDPVPSELYQLTALLVKRDFIDVDSIYAHLLPKEEDAFDHYNAFSAKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIG+INLAATGKDLMD+EKQGDVT+DLYAA+DME+EAVAE S ELEN+Q LGLL 
Sbjct: 300  LDEANKIGRINLAATGKDLMDEEKQGDVTVDLYAALDMETEAVAERSSELENSQPLGLLM 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFL V+DWYHAH+LF RLS LNP  H+QIC+GLFRLIEKSIS   D++ ++ L   G   
Sbjct: 360  GFLEVNDWYHAHVLFGRLSHLNPAEHVQICDGLFRLIEKSISGPNDLVCKMQL--LGSHP 417

Query: 4962 GSGSD-ALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSAL 4786
            G  +D ++EV ++SS  RS+++L KELF+ML+S GP+LYR+ +LLQKVCRVLRGYY+ A 
Sbjct: 418  GVVTDNSMEVANSSS-SRSYINLRKELFEMLSSVGPHLYRDTLLLQKVCRVLRGYYICAH 476

Query: 4785 ELIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWE 4606
            EL+ SG      ++   G R P++HLK+   R+ EALG CLLPSLQLIPANPAVG EIWE
Sbjct: 477  ELVTSGETGFISQTVTIGDRTPQMHLKDVTSRIVEALGGCLLPSLQLIPANPAVGLEIWE 536

Query: 4605 VMSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKL 4426
            +MSLLPYE RYRLYGEWEKDDE+ PM+L ARQTAKLDTRRILKRLAKENLKQLGRMVAKL
Sbjct: 537  LMSLLPYELRYRLYGEWEKDDEQFPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKL 596

Query: 4425 AHANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDD 4246
            AHANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQ GREKLKDD
Sbjct: 597  AHANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQSGREKLKDD 656

Query: 4245 GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQ 4066
            GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANV 
Sbjct: 657  GLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGNGIELVFMQELIQQMANVH 716

Query: 4065 YTENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXX 3886
            YTENMTEEQLDAMAGS+TLRYQAT FG+TRNNKAL KSTNRLRD+               
Sbjct: 717  YTENMTEEQLDAMAGSDTLRYQATSFGITRNNKALIKSTNRLRDALLPKDEPKLAIPLLL 776

Query: 3885 XXAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLV 3706
              AQHRS+VVIN++ P+IKMV EQFDRCHG LLQYVEFL SA+T TA+YA LIP L++LV
Sbjct: 777  LIAQHRSLVVINAEVPYIKMVSEQFDRCHGALLQYVEFLSSAVTPTAAYAVLIPALEELV 836

Query: 3705 HKYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLD 3529
            H YHLDPEVAFL+YRPVMRL++C+  SDV WP D  E  + +N           A L+LD
Sbjct: 837  HMYHLDPEVAFLIYRPVMRLFRCMRYSDVFWPSDSDEAVNAANAEKESERSEVSAYLLLD 896

Query: 3528 LGSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQH 3349
            LGS +KPI W+DLLDT++TMLPSKAWNSLSPDLYATFWGLTLYDL+VPRSRYESEI KQH
Sbjct: 897  LGSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQH 956

Query: 3348 SALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWL 3169
            +ALKALEELSDNS+SAITKRKKDKERIQE LDRLT ELQ+HE++V SVRRRL+REKDTWL
Sbjct: 957  AALKALEELSDNSSSAITKRKKDKERIQESLDRLTAELQRHEEHVTSVRRRLTREKDTWL 1016

Query: 3168 SSCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKT 2989
            SSCPDTLKINMEFLQRCIFPRC FSMPDAVYCA+FV+TLHSLGTPFFNTVNHIDVLICKT
Sbjct: 1017 SSCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKT 1076

Query: 2988 LHPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVT 2809
            L PMICCCTEYE GRLGRFL+ETLK AY+WK DESIYERECGNMPGFAVYYRYPNSQRVT
Sbjct: 1077 LQPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVT 1136

Query: 2808 YSQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVA 2629
            Y QFI+VHWKWS RITRLLIQCLES EYMEIRNALI+LTKISSVFPVTRK+GINLEKRVA
Sbjct: 1137 YGQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVA 1196

Query: 2628 KIKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVD---XXXXXXXXXXXXAIAVP 2458
            KIK DEREDLK           +RK SWVT+EEFGMG ++               ++A+ 
Sbjct: 1197 KIKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAVPASKSSAGNSVAIA 1256

Query: 2457 NGSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQ 2278
            NGS  ++SQ E    R   +G                 + VDGKL+R + +++ K DLGQ
Sbjct: 1257 NGSGASVSQGEPSIGRTVVAG-----------------RVVDGKLDRPD-SSMPKPDLGQ 1298

Query: 2277 TKLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDES-----AKGTAESELRPTIRR 2113
             K K     NGLD QS M +A +QS T     + +  +ES     +K + E E R T +R
Sbjct: 1299 AKHKGSQSINGLDVQS-MPSATLQSDTPSQNSMCRPLEESTIKAASKMSGEQEGRGTGKR 1357

Query: 2112 SVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTST 1933
            S   GSL+KQ K  ++AK D KSGK               D+S +P++ R S G  N ST
Sbjct: 1358 STPVGSLSKQQK-HDIAK-DEKSGKTVGRASGAA----SGDVS-YPSESRAS-GSVNVST 1409

Query: 1932 AIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRAS------IDDSEKLRP 1771
             ++ + +M ++     A     +D   + +N E    KS++LR S       + S+  + 
Sbjct: 1410 TVSGNGSMFSAAPKGAAPLTRLLD-PSNESNAEHTTTKSADLRVSAGKDDVTESSDVHKE 1468

Query: 1770 LPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGE-VEARFSDR 1597
               R  HSP  D    ASK+ +K+QKR+ P EE  + +KRRKG+ + +D E  +AR S++
Sbjct: 1469 STLRLVHSPRQD----ASKANEKVQKRSIPAEELDRLNKRRKGEIDGRDTECADARSSEK 1524

Query: 1596 ERSVDPR----LPHSDLERTGIDDQSLH--------RSKDKGNGRYDRDHRERLERQDKS 1453
            E  +D R    L  +D ++ G DDQ L+        RSK+KG  R +RD RER +R D+S
Sbjct: 1525 EWLIDARAADKLHPADYDKHGSDDQILNRASEKPLDRSKEKGGERPERDPRERGDRPDRS 1584

Query: 1452 TGEDILVEKSRDRSMERYGRERSVERVS--------FXXXXXXXXXXXRGKPRYNETSGE 1297
             G+D   EKSRDRS ER+GRERS+ERV                     R K R+NE S E
Sbjct: 1585 RGDDAF-EKSRDRSTERHGRERSIERVHERVADRNFDRLSKDERIKDDRSKLRHNEASVE 1643

Query: 1296 K--YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXX 1123
            K   DDRFH Q+        P++VPQS++A RR++D+DRR    RH QRLSP        
Sbjct: 1644 KSLTDDRFHNQNLPPPPPLPPHLVPQSISAGRREDDSDRRFGTARHSQRLSPRHDERERR 1703

Query: 1122 XXXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVD----ERDKINLLKEEMDANAASK 955
               E+  + Q                   + LS KV+    ER+K  L+KE+MD N ASK
Sbjct: 1704 RSEENNALLQDDLKRRREDDFRDRKREERE-LSIKVEEREREREKAILVKEDMDPN-ASK 1761

Query: 954  RRKLKRDHLSSGEAGEYSLAPPQPPLALGMVQPYDGRDRGDRKGTMI-QRPGYLEEPMPR 778
            RRKLKR+H++S E GEYS A   PPL++ M QP DGRDRG+RKG ++ QRPGYL+EP  R
Sbjct: 1762 RRKLKREHMAS-EPGEYSPA-AHPPLSINMTQPSDGRDRGERKGVIVQQRPGYLDEPGLR 1819

Query: 777  MHGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
            +HGKE+ASK  RRD D +YDREW+D+KRQR E KRRHRK
Sbjct: 1820 IHGKESASKAPRRDADSMYDREWDDDKRQRAEPKRRHRK 1858


>ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fragaria vesca subsp. vesca]
          Length = 1860

 Score = 2197 bits (5693), Expect = 0.0
 Identities = 1191/1909 (62%), Positives = 1405/1909 (73%), Gaps = 55/1909 (2%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPP+E  +I E+ LREWK GN  FK    +P+LRFLYELC TMVRGELP+QKC+AALD
Sbjct: 1    MSLPPVERAHINEDHLREWKTGNPSFKLPEPVPMLRFLYELCSTMVRGELPVQKCRAALD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            SVEF+EK S +EL S  ADIVTQM+QDLTM  E+R RL K+AKWLVES+LVPL+LFQERC
Sbjct: 61   SVEFSEKVSEQELASSLADIVTQMSQDLTMPGEHRARLTKLAKWLVESSLVPLRLFQERC 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKA +LK+KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ+    
Sbjct: 121  EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQN---- 176

Query: 5682 LTENVS---GATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHAS 5512
             +EN S   GAT+ IIKSLIGHFDLDPN VFDIVLECFEL  DNNVFL+LIPIFPKSHAS
Sbjct: 177  -SENSSHNAGATIGIIKSLIGHFDLDPNHVFDIVLECFELLPDNNVFLELIPIFPKSHAS 235

Query: 5511 QILGFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAIS 5332
            QILGFKFQ+YQR+EVNDPVPFGLY+LTA+LVK +FIDLDSI AHLLPKD++AFEHY + S
Sbjct: 236  QILGFKFQHYQRLEVNDPVPFGLYKLTALLVKENFIDLDSICAHLLPKDDEAFEHYSSFS 295

Query: 5331 AKRFDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLG 5152
            +K+ DEANKIGKINLAATGKDLM+DEKQGDVTIDL+A++DM+S AV E S E ENNQ LG
Sbjct: 296  SKQLDEANKIGKINLAATGKDLMEDEKQGDVTIDLFASLDMDSVAVGERSTEFENNQTLG 355

Query: 5151 LLSGFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFG 4972
            LL+GFL+VDDWYHA++LFDRLSPLNPV H QIC  LFRLIEKSIS AYD++ Q  L   G
Sbjct: 356  LLTGFLAVDDWYHANLLFDRLSPLNPVEHTQICNSLFRLIEKSISSAYDMVHQARLNNLG 415

Query: 4971 FPSGSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLS 4792
              SG+G   +  T+ SS   SF++L KELFQMLA  GPYLYR+ +LLQKVCRVL+GYYLS
Sbjct: 416  -SSGTGVGVM-TTENSSASGSFIELQKELFQMLAIVGPYLYRDTLLLQKVCRVLKGYYLS 473

Query: 4791 ALELIGSGGR----ASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAV 4624
            A EL G  G     ASNP      +R PR   + A  RVEEA  TCLLPSLQL+PANPAV
Sbjct: 474  APEL-GIPGEVAVSASNPGLPLKEARSPR---QVAMSRVEEAFRTCLLPSLQLVPANPAV 529

Query: 4623 GQEIWEVMSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLG 4444
            G EIWEVMSLLPYE RYRLYGEWEK+DERIP+VL ARQTAKLDTRRILKRLAKENLKQ  
Sbjct: 530  GMEIWEVMSLLPYEVRYRLYGEWEKEDERIPIVLAARQTAKLDTRRILKRLAKENLKQQS 589

Query: 4443 RMVAKLAHANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR 4264
            RMVAKLAHANP+TVLRTIVHQIEAY+DMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR
Sbjct: 590  RMVAKLAHANPMTVLRTIVHQIEAYKDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGR 649

Query: 4263 EKLKDDGLNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXX 4084
            +KLK+DGLNLSDWLQSLASFWGHLCKKYPSMELR LFQYLVNQLKK              
Sbjct: 650  DKLKEDGLNLSDWLQSLASFWGHLCKKYPSMELRSLFQYLVNQLKKGQGIELVVLQELIQ 709

Query: 4083 QMANVQYTENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXX 3904
            QMANV +TE++TE+QLDAMAG ETLR+  T FG+TR+NK L KSTNRLR+S         
Sbjct: 710  QMANVHHTEDLTEDQLDAMAGGETLRHLTTGFGVTRHNKQLIKSTNRLRESLLPKDETKL 769

Query: 3903 XXXXXXXXAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIP 3724
                    AQHRS+V+I++DAP+IKMVCEQFDRCHG LLQYVEFLCSA+ S ++YAQLIP
Sbjct: 770  AIPLLLHLAQHRSLVIIDADAPYIKMVCEQFDRCHGALLQYVEFLCSAMPSASAYAQLIP 829

Query: 3723 PLDDLVHKYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSN-IXXXXXXXXS 3547
             LDDLVHKYHL+PEVAFL+YRPVMRL+K    SDV WP D+ +   +++ I        S
Sbjct: 830  SLDDLVHKYHLEPEVAFLIYRPVMRLFKTPPSSDVFWPLDNNDAQSITSAISESEAAQNS 889

Query: 3546 AKLVLDLGSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYES 3367
              +VLDLGS   PI W DLLDT +TMLP++AWNSLSPDLYATFWGLTLYDLYVPR+ Y S
Sbjct: 890  GNVVLDLGSTWNPITWLDLLDTAKTMLPARAWNSLSPDLYATFWGLTLYDLYVPRNCYIS 949

Query: 3366 EISKQHSALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSR 3187
            EI+KQ +++KALEE  DNS S I +RKK+KERIQE +DRL  E +KHE++V SVR+RL R
Sbjct: 950  EIAKQQASIKALEEQPDNSISEILRRKKEKERIQETIDRLISESRKHEEHVASVRKRLLR 1009

Query: 3186 EKDTWLSSCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHID 3007
            EKD WLSSCPDTLKINMEFLQRCIFPRC FSMPDAVY AMFVHTLH+LGTPFFNTVNH+D
Sbjct: 1010 EKDKWLSSCPDTLKINMEFLQRCIFPRCTFSMPDAVYSAMFVHTLHTLGTPFFNTVNHMD 1069

Query: 3006 VLICKTLHPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYP 2827
            VLIC+TL PMICCCTE E GRLG+FL ETLK+AY+WKSDESIYERECGNMPGFAVYYR+P
Sbjct: 1070 VLICRTLQPMICCCTESEVGRLGKFLCETLKIAYYWKSDESIYERECGNMPGFAVYYRFP 1129

Query: 2826 NSQRVTYSQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGIN 2647
            +SQRV Y QF++VHWKWS RITRLL QCLES EYMEIRNALI+L++ISSVFPVTRK+ +N
Sbjct: 1130 DSQRVRYGQFVKVHWKWSQRITRLLGQCLESTEYMEIRNALIILSRISSVFPVTRKSALN 1189

Query: 2646 LEKRVAKIKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVDXXXXXXXXXXXXAI 2467
            LEKRV+KIKGD REDLK           ARK S V++EEF MG V+              
Sbjct: 1190 LEKRVSKIKGDGREDLKVLATSVGASLAARKPSLVSDEEFCMGYVELKSASSSKPLASNS 1249

Query: 2466 -AVPNGSALNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKT 2290
             A+ +G A+N SQ E  G +AG   SQ  +  +  ++  ++ KP DG+ ER E+ +  K+
Sbjct: 1250 GAIHSGPAVNNSQTEPAGGKAGTLVSQHAELIDSARDHVSKAKPADGRSERAESVSTAKS 1309

Query: 2289 DLGQTKLKSGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDE-------------SAKG 2149
            D G  K K  SL NG D+Q+ + +A +Q+GT+R  E Q   +E             +AK 
Sbjct: 1310 DPGHLKHKGASLVNGSDAQASVPSATLQAGTARPIENQVQLNETSTRRAEENTGKLAAKN 1369

Query: 2148 TAESELRPTIRRSVAAGSLTKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPAD 1969
            T+ESELR   +RSV AG+   +P +Q++ K++++SGK                       
Sbjct: 1370 TSESELRAQAKRSVPAGA---KPLKQDLVKDESRSGK----------------------- 1403

Query: 1968 GRPSAGGANTSTAIANSSTMPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDD 1789
               +AG  N S+  AN ST+P+  K S AS      +    A I +  + SS+     + 
Sbjct: 1404 ---AAGATNVSSITANGSTVPSLGKGS-ASLGIESKVEAGSAKISNTRIPSSK-EEGAEV 1458

Query: 1788 SEKLRPLPSRPTHSPLPDDSFTASKSTDKLQKRTSP-EEHGKPSKRRKGDNEVKDGEVEA 1612
            S+  RP  SR  +SP  D S T SKS+DKLQKRT P EE  + SKRRKG+ E++D E EA
Sbjct: 1459 SDVARPPSSRFVNSPRHDSSATLSKSSDKLQKRTGPAEETDRQSKRRKGEAEMRDSEGEA 1518

Query: 1611 RFSDRERSVDPRLPHSDLERTGIDDQSLH--------RSKDKGNGRYDRDHRERLERQDK 1456
            R SDRERSVD RL   DL+++G DD+S++        RSKDKGN R+D+DHRER +R DK
Sbjct: 1519 RLSDRERSVDARL--LDLDKSGSDDRSVYKATEKASDRSKDKGNERHDKDHRERADRPDK 1576

Query: 1455 STGEDILVEKSRDRSMERYGRERSVERVS------FXXXXXXXXXXXRGKPRYNETSGEK 1294
            S G+D LVE+SRDRSMER+GR+ S E++                   +GK RY++ S EK
Sbjct: 1577 SRGDD-LVERSRDRSMERHGRDHSAEKLQERGSDRSFDRLPEKSKDEKGKGRYSDISTEK 1635

Query: 1293 --YDDRFHRQSXXXXXXXXPNMVPQSVNASRRDEDADRRISNTRHIQRLSPXXXXXXXXX 1120
               D+R+H QS        P++VPQSV++ RRDED+DRR + TRH QRLSP         
Sbjct: 1636 SHVDERYHGQSLPPPPPLPPHIVPQSVSSGRRDEDSDRR-TTTRHTQRLSPRHDEKERRR 1694

Query: 1119 XXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLSTKVDERD----------------KINLL 988
              E+  ++Q                   +G+S KVDERD                K NL 
Sbjct: 1695 SEENSSISQDDSKRRREDDFRERKRDDREGISVKVDERDRDRDRDREREREKEREKANLS 1754

Query: 987  KEEMDANAASKRRKLKRDHLSSGEAGEYSLAPPQPPLALGMVQPYDGRDRGDRKGTMIQR 808
            KE+ D  AASKRRKLKRD LSS EAGEYS   P PPL++ + Q YDGRDRG+RKG ++ R
Sbjct: 1755 KEDPDMIAASKRRKLKRD-LSSVEAGEYSPVHPPPPLSINLSQSYDGRDRGERKGPIVAR 1813

Query: 807  PGYLEEPMPRMHGKETASKITRRDGDQIYDREWEDEKRQRPEQKRRHRK 661
             GY+EEP  R+HGKE ++K+TRRD D +Y  EW+D+KR R EQKRRHRK
Sbjct: 1814 TGYVEEPSLRIHGKEVSNKMTRRDTDPMY--EWDDDKR-RGEQKRRHRK 1859


>ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|508709430|gb|EOY01327.1| THO2
            isoform 3 [Theobroma cacao]
          Length = 1762

 Score = 2196 bits (5689), Expect = 0.0
 Identities = 1188/1870 (63%), Positives = 1365/1870 (72%), Gaps = 16/1870 (0%)
 Frame = -1

Query: 6222 MSLPPIEFKYITEEFLREWKNGNNDFKASNSIPVLRFLYELCWTMVRGELPIQKCKAALD 6043
            MSLPPIE  YITEE LRE K+GN++F  S+S+P+LRFLYELCWTMVRGELP QKCKA LD
Sbjct: 1    MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60

Query: 6042 SVEFTEKSSNEELGSLFADIVTQMAQDLTMSVEYRTRLIKVAKWLVESALVPLKLFQERC 5863
            +VEFTE+ S +ELGS FADIVTQMAQDLTM+ EYRTRLIK+AKWLVES++VPL+LF ER 
Sbjct: 61   AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120

Query: 5862 EEEFLWESEMIKIKAPDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQSGTVA 5683
            EEEFLWE+EMIKIKAPDLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + G+  
Sbjct: 121  EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFR-GSED 179

Query: 5682 LTENVSGATVSIIKSLIGHFDLDPNRVFDIVLECFELQADNNVFLDLIPIFPKSHASQIL 5503
             T+N S A + +IKSLIGHFDLDPNRVFDIVLEC+ELQ D + FL LIPIFPKSHASQIL
Sbjct: 180  STQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQIL 239

Query: 5502 GFKFQYYQRMEVNDPVPFGLYRLTAVLVKADFIDLDSIYAHLLPKDEDAFEHYDAISAKR 5323
            GFKFQYYQRMEVN P PFGLY+LTA+LVK +FIDLDSIY HLLPKD++ FE +++ S KR
Sbjct: 240  GFKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKR 299

Query: 5322 FDEANKIGKINLAATGKDLMDDEKQGDVTIDLYAAIDMESEAVAEHSPELENNQNLGLLS 5143
             DEANKIGKINLAATGKDLM+DEKQGDVTIDL+AA+DME+EAVAE +PELENNQ LGLL+
Sbjct: 300  LDEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLT 359

Query: 5142 GFLSVDDWYHAHILFDRLSPLNPVAHIQICEGLFRLIEKSISPAYDIIRQLHLQCFGFPS 4963
            GFLSVDDWYHA ILFDRLSPLNPVAH+QIC+GLFRLIEKSIS AYDI+RQ HLQ FG PS
Sbjct: 360  GFLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIVRQTHLQNFGSPS 419

Query: 4962 GSGSDALEVTDTSSIQRSFVDLPKELFQMLASAGPYLYRNVVLLQKVCRVLRGYYLSALE 4783
            G G D ++ + ++S   SF+DLPKELFQMLA+ GP+LY + +LLQKVCRVLRGYYLSALE
Sbjct: 420  GPGVDNMDTSTSAS--SSFIDLPKELFQMLATVGPHLYSDTLLLQKVCRVLRGYYLSALE 477

Query: 4782 LIGSGGRASNPESADSGSRDPRLHLKEAKLRVEEALGTCLLPSLQLIPANPAVGQEIWEV 4603
            L+ S G  SN E+A  G ++PRLHLKEA+ RVEE LG CLLPSLQL+PANPAVGQEIWEV
Sbjct: 478  LVASAGGVSNAETAAGGYQNPRLHLKEARSRVEETLGACLLPSLQLVPANPAVGQEIWEV 537

Query: 4602 MSLLPYEARYRLYGEWEKDDERIPMVLTARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 4423
            M+LLPYE RYRLYGEWEKDDER P +L ARQTAKLDTRRILKRLAKENLKQLGRMVAKLA
Sbjct: 538  MNLLPYEVRYRLYGEWEKDDERNPTILAARQTAKLDTRRILKRLAKENLKQLGRMVAKLA 597

Query: 4422 HANPLTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDVLEYVVIERLAQGGREKLKDDG 4243
            HANP+TVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYD+LEYVVIERLAQGGR+KLKDDG
Sbjct: 598  HANPMTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQGGRDKLKDDG 657

Query: 4242 LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKXXXXXXXXXXXXXXQMANVQY 4063
            LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKK              QMANVQ+
Sbjct: 658  LNLSDWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKKGQGIELVLLQELIHQMANVQF 717

Query: 4062 TENMTEEQLDAMAGSETLRYQATLFGMTRNNKALTKSTNRLRDSXXXXXXXXXXXXXXXX 3883
            TEN+TEEQLDAMAGSETLR+QAT FG+TRNNKAL KSTNRLRDS                
Sbjct: 718  TENLTEEQLDAMAGSETLRFQATSFGVTRNNKALIKSTNRLRDSLLPKDEPKLAIPLLLL 777

Query: 3882 XAQHRSVVVINSDAPHIKMVCEQFDRCHGTLLQYVEFLCSALTSTASYAQLIPPLDDLVH 3703
             AQHRS+VVIN+DAP+IKMV EQFDRCHGTLLQYVEFLCSA+T  A+YAQLIP LDDLVH
Sbjct: 778  LAQHRSLVVINADAPYIKMVSEQFDRCHGTLLQYVEFLCSAVTPAAAYAQLIPSLDDLVH 837

Query: 3702 KYHLDPEVAFLLYRPVMRLYKCLSGSDVCWPCDDIEDTHMSNIXXXXXXXXS-AKLVLDL 3526
             YHLDPEVAFL+YRPVMRL+KC   SDV WP DD E  +++            ++++LDL
Sbjct: 838  VYHLDPEVAFLIYRPVMRLFKCQGSSDVFWPLDDNETGNITMAYSESESKDDLSRVILDL 897

Query: 3525 GSPQKPIMWSDLLDTVRTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRSRYESEISKQHS 3346
            G P+KPI+WS+LLDTV+TMLPSKAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI+KQH+
Sbjct: 898  GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957

Query: 3345 ALKALEELSDNSNSAITKRKKDKERIQELLDRLTGELQKHEDNVESVRRRLSREKDTWLS 3166
            ALKALEEL DNS+SAI KRKKDKERIQE LDRLT EL KHE+NV SVRRRL+ EKD WLS
Sbjct: 958  ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017

Query: 3165 SCPDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 2986
            SCPDTLKINMEFLQRCIFPRC FSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL
Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077

Query: 2985 HPMICCCTEYEAGRLGRFLFETLKMAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 2806
             PMICCCTEYEAGRLGRFL+ETLK+AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY
Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137

Query: 2805 SQFIRVHWKWSGRITRLLIQCLESAEYMEIRNALIMLTKISSVFPVTRKTGINLEKRVAK 2626
             QFI+VHWKWS RITRLLIQCLES EYMEIRNALIMLTKISSVFPVTRK+GINLEKRVAK
Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197

Query: 2625 IKGDEREDLKXXXXXXXXXXXARKSSWVTEEEFGMGIVDXXXXXXXXXXXXAIAVPNGSA 2446
            IK DEREDLK           ARKSSWVT+EEFGMG ++                P  S 
Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELK--------------PATSL 1243

Query: 2445 LNISQIETVGPRAGPSGSQLTDYGNPLKEQTARPKPVDGKLERTENTTLMKTDLGQTKLK 2266
             + S   T                     Q    K ++ + +  E++  +   L +   K
Sbjct: 1244 ASKSLAAT--------------------SQAGTGKSLENQKQLDESSNKLDEHLAKVPAK 1283

Query: 2265 SGSLANGLDSQSIMTTAAVQSGTSRTTEIQKHSDE--SAKGTAESELRPTIRRSVAAGSL 2092
            + +    L+S++    +A     ++T +     D+  S K    + +   I R V + + 
Sbjct: 1284 NSA---ELESKASAKRSAPAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRDVPSHTE 1340

Query: 2091 TKQPKQQEVAKEDNKSGKXXXXXXXXXXXSNERDLSAHPADGRPSAGGANTSTAIANSST 1912
             +Q     V      +GK               D S  P   RPS+   ++    ++++ 
Sbjct: 1341 GRQGGTTNVPSAVTSNGKD--------------DGSELPDASRPSSRIVHSPRHDSSATV 1386

Query: 1911 MPTSVKSSTASARTSVDMHGSVANIESGAVKSSELRASIDDSEKLRPLPSRPTHSPLP-D 1735
              +S K    +  T V+    +     G V+  +L   +  S++ R    +      P  
Sbjct: 1387 SKSSDKLQKRT--TPVEETDRLTKRRKGDVELKDLDGEVRLSDRERSTDPQLADFDKPGT 1444

Query: 1734 DSFTASKSTDKLQKRTSPEEHGKPSKRRKGDNEVKDGEVEARFSDRERSVDPRLPHSDLE 1555
            D  T+ ++ DK      P +  K     + D +            RER   P       E
Sbjct: 1445 DELTSHRAVDK------PLDRSKDKGSERHDRDY-----------RERLERP-------E 1480

Query: 1554 RTGIDDQSLHRSKDKGNGRYDRDHRERLERQDKSTGEDILVEKSRDRSMERYGRERSVER 1375
            ++  DD    +S+D+   RY R+                 VE+S DR++ER G +   ER
Sbjct: 1481 KSRADDILTEKSRDRSIERYGRERS---------------VERSTDRNLERLGDKAKDER 1525

Query: 1374 VSFXXXXXXXXXXXRGKPRYNETSGEK--YDDRFHRQSXXXXXXXXPNMVPQSVNAS-RR 1204
                          R K RY +TS EK   DDRFH QS        P+MVPQSVNA+ RR
Sbjct: 1526 ----------SKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQSVNATGRR 1575

Query: 1203 DEDADRRISNTRHIQRLSPXXXXXXXXXXXEDYLVAQXXXXXXXXXXXXXXXXXXXDGLS 1024
            D+D DRR  +TRH QRLSP           E+ LV+Q                   +GLS
Sbjct: 1576 DDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERKREEREGLS 1635

Query: 1023 TKVDERD--------KINLLKEEMDANAASKRRKLKRDHLSSGEAGEYS-LAPPQPPLAL 871
             KV+ERD        K +LLKE++DAN A KRRKLKR+HL S E GEYS +APP PPLA+
Sbjct: 1636 MKVEERDRDRERDREKASLLKEDVDANVA-KRRKLKREHLPS-EPGEYSPIAPPPPPLAI 1693

Query: 870  GMVQPYDGRDRGDRKGTMIQRPGYLEEPMPRMHGKETASKITRRDGDQIYDREWEDEKRQ 691
            GM Q YDGRDR DRKG+M+QR GYLEEP  R+HGKE ASK+ RRD D +YDREW+DEKRQ
Sbjct: 1694 GMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREWDDEKRQ 1752

Query: 690  RPEQKRRHRK 661
            RPE KRRHRK
Sbjct: 1753 RPEPKRRHRK 1762


Top