BLASTX nr result
ID: Rehmannia26_contig00003762
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia26_contig00003762 (3101 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solan... 1079 0.0 ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solan... 1072 0.0 gb|EOY01325.1| THO complex subunit 2 isoform 1 [Theobroma cacao] 1036 0.0 gb|EOY01326.1| THO complex subunit 2 isoform 2 [Theobroma cacao] 1032 0.0 gb|EOY01328.1| THO complex subunit 2 isoform 4 [Theobroma cacao] 1031 0.0 gb|EOY01327.1| THO2 isoform 3 [Theobroma cacao] 1009 0.0 ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis... 1008 0.0 ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citr... 1005 0.0 ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citru... 1005 0.0 ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glyci... 994 0.0 gb|EMJ18294.1| hypothetical protein PRUPE_ppa000084mg [Prunus pe... 993 0.0 gb|ESW32460.1| hypothetical protein PHAVU_002G324500g [Phaseolus... 992 0.0 ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isofor... 988 0.0 gb|EOY01329.1| THO complex subunit 2 isoform 5 [Theobroma cacao] 987 0.0 ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucum... 984 0.0 ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isofor... 970 0.0 emb|CBI26799.3| unnamed protein product [Vitis vinifera] 937 0.0 ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi... 931 0.0 ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi... 926 0.0 ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fraga... 898 0.0 >ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solanum tuberosum] Length = 1859 Score = 1079 bits (2791), Expect = 0.0 Identities = 595/972 (61%), Positives = 668/972 (68%), Gaps = 62/972 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G S KPI+W +LLDT+KTMLP KAWNSLSPDLYATFWGLTLYDL+VPR+RYESEI Sbjct: 898 GSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKKDKERIQESLDRLT E Q+HEEHV SVRRRL EKDTWLS Sbjct: 958 ALKALEELSDNSSSAITKRKKDKERIQESLDRLTTELQRHEEHVTSVRRRLTREKDTWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FVNTLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYEVGRLGRFL+ETLKTAY+WK DESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALI+LTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++K A PASKS + N+V + N Sbjct: 1198 IKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAAPASKSSAGNSVAIPN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEE-------------- 1258 G+G SVSQ E GRTV +G + G L R + +L Q ++ Sbjct: 1258 GSGASVSQGEPSIGRTVVAGIV--VDGKLDRPDSSMPKPDLGQTKQKGSQSINGLDVQSM 1315 Query: 1259 -SANKQSEENSK--------------VTKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKD 1393 SA QS+ S+ +K E E R KR+ AGSL+KQ K D++KD Sbjct: 1316 PSATLQSDTPSQNSTCRPLEESTIKAASKMSGEQEGRATGKRATPAGSLSKQQKHDIAKD 1375 Query: 1394 EDKSGKAVGR----------------------XXXXXXXXXXXXXXXXKLTNSSTRSSDH 1507 DKSGKAVGR K S TR D Sbjct: 1376 -DKSGKAVGRASGAASGDVSYPSESRASGSVNVSTTVSGNGSMFSAAPKGAASLTRLLDP 1434 Query: 1508 NTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHK 1687 + E AE+T +KS+D RV GKD+ +E +D HK+ T R SPR + ASK+ +K K Sbjct: 1435 SNESNAELTTTKSADLRVSAGKDDVSESSDVHKESTLRLVHSPRHD----ASKANEKVQK 1490 Query: 1688 RLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDD-- 1861 R PAEE DRLNKRRKGEID RDI+ + R SEKER D RA DKLH A +D+ GSDD Sbjct: 1491 RSIPAEELDRLNKRRKGEIDGRDIECGDARSSEKERLIDARAADKLHPADYDRHGSDDQI 1550 Query: 1862 ------KPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDR 2023 KPLDR+K+K G +SRGDD EK RDRS ERHGRERS++R Sbjct: 1551 LNRASEKPLDRSKDKGGERLERDPRERGDRPDRSRGDDAF-EKSRDRSTERHGRERSIER 1609 Query: 2024 VQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 V ER ADRNFDRL+KDER KDDRSK+R+ EAS+EKS DDR Sbjct: 1610 VHERVADRNFDRLSKDERIKDDRSKLRHSEASVEKSLTDDRLYNQNLPPPPPLPPHLVPQ 1669 Query: 2204 SVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXX 2383 S+++GRRD+D+DRRFG ARH+Q+LSP N + LQ Sbjct: 1670 SINAGRRDDDSDRRFGTARHSQRLSPRHDERERRRSEENNTLLQ-DDLKRRREDDFRDRK 1728 Query: 2384 XXXXALSIKMD--ERERDKANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPAPPPVS 2557 LSIK++ ERER+KA + KED+D NASKRRKLKREHM SEPGEY PA+ PPP+S Sbjct: 1729 REERELSIKVEEREREREKAILVKEDMDPNASKRRKLKREHMASEPGEYSPAA-HPPPLS 1787 Query: 2558 INLLQSHDGRDRGDRKGVIV-QRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDK 2734 IN+ Q DGRDRG+RKGVIV QRPGY ++PGLR H KE+ASKA RRDAD MYDREWDDDK Sbjct: 1788 INMTQPSDGRDRGERKGVIVQQRPGYLDEPGLRIHGKESASKAPRRDADSMYDREWDDDK 1847 Query: 2735 RQRAEPKRRHRK 2770 RQRAEPKRRHRK Sbjct: 1848 RQRAEPKRRHRK 1859 >ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solanum lycopersicum] Length = 1858 Score = 1072 bits (2773), Expect = 0.0 Identities = 596/972 (61%), Positives = 663/972 (68%), Gaps = 62/972 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G S KPI+W +LLDT+KTMLP KAWNSLSPDLYATFWGLTLYDL+VPR+RYESEI Sbjct: 898 GSSRKPISWTDLLDTIKTMLPSKAWNSLSPDLYATFWGLTLYDLHVPRSRYESEIGKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKKDKERIQESLDRLT E Q+HEEHV SVRRRL EKDTWLS Sbjct: 958 ALKALEELSDNSSSAITKRKKDKERIQESLDRLTAELQRHEEHVTSVRRRLTREKDTWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FVNTLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAVFVNTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYEVGRLGRFL+ETLKTAY+WK DESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEVGRLGRFLYETLKTAYYWKGDESIYERECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALI+LTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALILLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++K A PASKS + N+V + N Sbjct: 1198 IKSDEREDLKVLATGVAAALASRKPSWVTDEEFGMGYLELKLAAVPASKSSAGNSVAIAN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEE-------------- 1258 G+G SVSQ E GRTV +G + G L R + +L Q + Sbjct: 1258 GSGASVSQGEPSIGRTVVAGRV--VDGKLDRPDSSMPKPDLGQAKHKGSQSINGLDVQSM 1315 Query: 1259 -SANKQSEENSK--------------VTKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKD 1393 SA QS+ S+ +K E E R KRS GSL+KQ K D++KD Sbjct: 1316 PSATLQSDTPSQNSMCRPLEESTIKAASKMSGEQEGRGTGKRSTPVGSLSKQQKHDIAKD 1375 Query: 1394 EDKSGKAVGR----------------------XXXXXXXXXXXXXXXXKLTNSSTRSSDH 1507 E KSGK VGR K TR D Sbjct: 1376 E-KSGKTVGRASGAASGDVSYPSESRASGSVNVSTTVSGNGSMFSAAPKGAAPLTRLLDP 1434 Query: 1508 NTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHK 1687 + E AE T +KS+D RV GKD+ TE +D HK+ T R SPRQ+ ASK+ +K K Sbjct: 1435 SNESNAEHTTTKSADLRVSAGKDDVTESSDVHKESTLRLVHSPRQD----ASKANEKVQK 1490 Query: 1688 RLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDD-- 1861 R PAEE DRLNKRRKGEID RD + ++ R SEKE D RA DKLH A +DK GSDD Sbjct: 1491 RSIPAEELDRLNKRRKGEIDGRDTECADARSSEKEWLIDARAADKLHPADYDKHGSDDQI 1550 Query: 1862 ------KPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDR 2023 KPLDR+KEK G +SRGDD EK RDRS ERHGRERS++R Sbjct: 1551 LNRASEKPLDRSKEKGGERPERDPRERGDRPDRSRGDDAF-EKSRDRSTERHGRERSIER 1609 Query: 2024 VQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 V ER ADRNFDRL+KDER KDDRSK+R+ EAS+EKS DDRF Sbjct: 1610 VHERVADRNFDRLSKDERIKDDRSKLRHNEASVEKSLTDDRFHNQNLPPPPPLPPHLVPQ 1669 Query: 2204 SVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXX 2383 S+S+GRR++D+DRRFG ARH+Q+LSP N + LQ Sbjct: 1670 SISAGRREDDSDRRFGTARHSQRLSPRHDERERRRSEENNALLQ-DDLKRRREDDFRDRK 1728 Query: 2384 XXXXALSIKMD--ERERDKANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPAPPPVS 2557 LSIK++ ERER+KA + KED+D NASKRRKLKREHM SEPGEY PA A PP+S Sbjct: 1729 REERELSIKVEEREREREKAILVKEDMDPNASKRRKLKREHMASEPGEYSPA--AHPPLS 1786 Query: 2558 INLLQSHDGRDRGDRKGVIV-QRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDK 2734 IN+ Q DGRDRG+RKGVIV QRPGY ++PGLR H KE+ASKA RRDAD MYDREWDDDK Sbjct: 1787 INMTQPSDGRDRGERKGVIVQQRPGYLDEPGLRIHGKESASKAPRRDADSMYDREWDDDK 1846 Query: 2735 RQRAEPKRRHRK 2770 RQRAEPKRRHRK Sbjct: 1847 RQRAEPKRRHRK 1858 >gb|EOY01325.1| THO complex subunit 2 isoform 1 [Theobroma cacao] Length = 1853 Score = 1036 bits (2680), Expect = 0.0 Identities = 564/976 (57%), Positives = 656/976 (67%), Gaps = 66/976 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KPI W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI Sbjct: 898 GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNSSSAI KRKKDKERIQE+LDRLT E KHEE+V SVRRRL +EKD WLS Sbjct: 958 ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RK SWVTDEEFGMGYL++KPA ASKSL+ N V +QN Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234 G+ ++VSQ+E G R V+ G+ SD + +LG+ + G Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317 Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336 +N KQ++ES+NK E +KV K E E++ K Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377 Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516 RSA AGSL K KQD KD+ KSGKAVGR + + + T Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRTSVTCVIDRD-------VPSHTEGRQGGTTN 1430 Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696 + + +T++ ++ S GKD+G+E DA + P+SR SPR ++ SKS DK KR + Sbjct: 1431 VPSAVTSNGNAVSAPPKGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1489 Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858 P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D + D FDK G+D Sbjct: 1490 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1542 Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032 DKPLDR+K+K KSR DD+L+EK RDRS+ER+GRERSV E Sbjct: 1543 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1598 Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 R DRN +RL AKDER+KD+RSKVRY + S EKSHVDDRF Sbjct: 1599 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1658 Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380 SV++ GRRD+D DRRFG+ RH+Q+LSP N+ Q Sbjct: 1659 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1718 Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542 LS+K++ER+RD KA++ KED+D N +KRRKLKREH+PSEPGEY P +P Sbjct: 1719 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1778 Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722 PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK RRD DPMYDREW Sbjct: 1779 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1837 Query: 2723 DDDKRQRAEPKRRHRK 2770 DD+KRQR EPKRRHRK Sbjct: 1838 DDEKRQRPEPKRRHRK 1853 >gb|EOY01326.1| THO complex subunit 2 isoform 2 [Theobroma cacao] Length = 1844 Score = 1032 bits (2669), Expect = 0.0 Identities = 567/976 (58%), Positives = 654/976 (67%), Gaps = 66/976 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KPI W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI Sbjct: 898 GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNSSSAI KRKKDKERIQE+LDRLT E KHEE+V SVRRRL +EKD WLS Sbjct: 958 ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RK SWVTDEEFGMGYL++KPA ASKSL+ N V +QN Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234 G+ ++VSQ+E G R V+ G+ SD + +LG+ + G Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317 Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336 +N KQ++ES+NK E +KV K E E++ K Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377 Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516 RSA AGSL K KQD KD+ KSGKAVGR +T R +TE Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRT---------------SVTCVIDRDVPSHTE 1422 Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696 + T + S + GKD+G+E DA + P+SR SPR ++ SKS DK KR + Sbjct: 1423 GRQGGTTNVPS-AVTSNGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1480 Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858 P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D + D FDK G+D Sbjct: 1481 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1533 Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032 DKPLDR+K+K KSR DD+L+EK RDRS+ER+GRERSV E Sbjct: 1534 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1589 Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 R DRN +RL AKDER+KD+RSKVRY + S EKSHVDDRF Sbjct: 1590 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1649 Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380 SV++ GRRD+D DRRFG+ RH+Q+LSP N+ Q Sbjct: 1650 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1709 Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542 LS+K++ER+RD KA++ KED+D N +KRRKLKREH+PSEPGEY P +P Sbjct: 1710 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1769 Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722 PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK RRD DPMYDREW Sbjct: 1770 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1828 Query: 2723 DDDKRQRAEPKRRHRK 2770 DD+KRQR EPKRRHRK Sbjct: 1829 DDEKRQRPEPKRRHRK 1844 >gb|EOY01328.1| THO complex subunit 2 isoform 4 [Theobroma cacao] Length = 1831 Score = 1031 bits (2666), Expect = 0.0 Identities = 564/976 (57%), Positives = 649/976 (66%), Gaps = 66/976 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KPI W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI Sbjct: 898 GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNSSSAI KRKKDKERIQE+LDRLT E KHEE+V SVRRRL +EKD WLS Sbjct: 958 ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RK SWVTDEEFGMGYL++KPA ASKSL+ N V +QN Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234 G+ ++VSQ+E G R V+ G+ SD + +LG+ + G Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317 Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336 +N KQ++ES+NK E +KV K E E++ K Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377 Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516 RSA AGSL K KQD KD+ KSGKAVGR T Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGR-----------------------------TS 1408 Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696 + I S + GKD+G+E DA +P+SR SPR ++ SKS DK KR + Sbjct: 1409 VTCVIDRDVPSHTEGRQGKDDGSELPDA-SRPSSRIVHSPRHDSSATVSKSSDKLQKRTT 1467 Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858 P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D + +A FDK G+D Sbjct: 1468 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQ------LADFDKPGTDELTSHR 1520 Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032 DKPLDR+K+K KSR DD+L+EK RDRS+ER+GRERSV E Sbjct: 1521 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1576 Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 R DRN +RL AKDER+KD+RSKVRY + S EKSHVDDRF Sbjct: 1577 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1636 Query: 2204 SV-SSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380 SV ++GRRD+D DRRFG+ RH+Q+LSP N+ Q Sbjct: 1637 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1696 Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542 LS+K++ER+RD KA++ KED+D N +KRRKLKREH+PSEPGEY P +P Sbjct: 1697 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1756 Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREW 2722 PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK RRD DPMYDREW Sbjct: 1757 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDPMYDREW 1815 Query: 2723 DDDKRQRAEPKRRHRK 2770 DD+KRQR EPKRRHRK Sbjct: 1816 DDEKRQRPEPKRRHRK 1831 >gb|EOY01327.1| THO2 isoform 3 [Theobroma cacao] Length = 1762 Score = 1009 bits (2609), Expect = 0.0 Identities = 554/929 (59%), Positives = 633/929 (68%), Gaps = 19/929 (2%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KPI W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI Sbjct: 898 GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNSSSAI KRKKDKERIQE+LDRLT E KHEE+V SVRRRL +EKD WLS Sbjct: 958 ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RK SWVTDEEFGMGYL++KPA ASKSL+A + Q Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAATS---QA 1254 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGREPRRIDGDNLKQVEESANKQSEENSKV-T 1297 G G S+ +N KQ++ES+NK E +KV Sbjct: 1255 GTGKSL--------------------------------ENQKQLDESSNKLDEHLAKVPA 1282 Query: 1298 KTVVEPEARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKL 1477 K E E++ KRSA AGSL K KQD KD+ KSGKAVGR + Sbjct: 1283 KNSAELESKASAKRSAPAGSLTKTQKQDPGKDDGKSGKAVGR---------------TSV 1327 Query: 1478 TNSSTRSSDHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTA 1657 T R +TE + T + S + GKD+G+E DA +P+SR SPR ++ Sbjct: 1328 TCVIDRDVPSHTEGRQGGTTNVPS-AVTSNGKDDGSELPDA-SRPSSRIVHSPRHDSSAT 1385 Query: 1658 ASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAP 1837 SKS DK KR +P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D + +A Sbjct: 1386 VSKSSDKLQKRTTPVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQ------LAD 1438 Query: 1838 FDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLE 1993 FDK G+D DKPLDR+K+K KSR DD+L+EK RDRS+E Sbjct: 1439 FDKPGTDELTSHRAVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIE 1498 Query: 1994 RHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXX 2164 R+GRERSV ER DRN +RL AKDER+KD+RSKVRY + S EKSHVDDRF Sbjct: 1499 RYGRERSV----ERSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSL 1554 Query: 2165 XXXXXXXXXXXXXSV-SSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341 SV ++GRRD+D DRRFG+ RH+Q+LSP N+ Q Sbjct: 1555 PPPPPLPPHMVPQSVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDD 1614 Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHM 2503 LS+K++ER+RD KA++ KED+D N +KRRKLKREH+ Sbjct: 1615 GKRRREDDFRERKREEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHL 1674 Query: 2504 PSEPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKA 2683 PSEPGEY P +P PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK Sbjct: 1675 PSEPGEYSPIAPPPPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKM 1733 Query: 2684 TRRDADPMYDREWDDDKRQRAEPKRRHRK 2770 RRD DPMYDREWDD+KRQR EPKRRHRK Sbjct: 1734 ARRDTDPMYDREWDDEKRQRPEPKRRHRK 1762 >ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis vinifera] Length = 1849 Score = 1008 bits (2606), Expect = 0.0 Identities = 555/955 (58%), Positives = 642/955 (67%), Gaps = 72/955 (7%) Frame = +2 Query: 53 KPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXXXX 232 KPI W +LLDT +TMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI Sbjct: 902 KPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHSALKA 961 Query: 233 XXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSCPD 412 SDNS+SAI KRKKDKERIQESLDRLT E QKHEE+V SVRRRLA EKD WLSSCPD Sbjct: 962 LEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLSSCPD 1021 Query: 413 TLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQPMI 592 TLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQPMI Sbjct: 1022 TLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQPMI 1081 Query: 593 CCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQFI 772 CCCTEYE GRLGRFL+ET+K AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQFI Sbjct: 1082 CCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQFI 1141 Query: 773 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 952 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD Sbjct: 1142 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 1201 Query: 953 EREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGAGL 1132 EREDLK RKPSWVTDEEFGMGYL++KPAP ASKSL+ N V + NG+GL Sbjct: 1202 EREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKSLAGNLVAVPNGSGL 1261 Query: 1133 SVSQAEQMGGRTVSSGSLHSDSGNLGRE----PRRIDG---------------------- 1234 ++ Q E GGRTV+SG+ H D+GN +E + +DG Sbjct: 1262 NIFQNESSGGRTVASGTQHLDAGNSVKEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKG 1321 Query: 1235 --------------------------DNLKQVEESANKQSEENS-KV-TKTVVEPEARPV 1330 +N + V+ES N+ +E++ KV ++ E E R Sbjct: 1322 GSSVNGSDIQQSMPSAASHTGTSRSGENQRPVDESTNRTLDESTVKVSSRASTESELRAT 1381 Query: 1331 VKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSD-- 1504 KRS +GSL KQ K D++KD+ KSGK VGR + SST D Sbjct: 1382 GKRSLPSGSLTKQPKLDVAKDDSKSGKGVGR-----------------TSGSSTSDRDLP 1424 Query: 1505 -HNTEIKAEITNSKSSDSRVYGG--KDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGD 1675 H E + + SS G KD+G E +D + P+SR SPR +N +A KSGD Sbjct: 1425 AHQLEGRQSGVTNVSSAGTADGSVVKDDGNEVSD--RAPSSRPIHSPRHDN-SATIKSGD 1481 Query: 1676 KPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGS 1855 K KR SPAEE +R+NKRRKG+ + RD +G EVR S+KERS D R LDK H DK+G+ Sbjct: 1482 KQQKRTSPAEEPERVNKRRKGDTEVRDFEG-EVRFSDKERSMDPR-LDKSHAVDLDKSGT 1539 Query: 1856 D--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRER 2011 D DKP DR K+K KSRGD++++EK RDRS+ERHGRER Sbjct: 1540 DEQGISRATDKPSDRLKDKGSERYERDHRERLERPDKSRGDEMIAEKSRDRSMERHGRER 1599 Query: 2012 SVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXX 2182 SV+RVQER ++R+FDRL KDERNKDDR K+RY E S+EKSH DDRF Sbjct: 1600 SVERVQERSSERSFDRLTDKVKDERNKDDRGKMRYSETSVEKSHADDRFHGQSLPPPPPL 1659 Query: 2183 XXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXX 2362 SV++ RRDEDADRRFG ARH Q+LSP + Q Sbjct: 1660 PPHMVPQSVTASRRDEDADRRFGTARHAQRLSP---RHEEKERRRSEEISQDDAKRRRED 1716 Query: 2363 XXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLPAS 2536 LSIK+++RER+KA++ KED+D + ASKRRKLKREHMPS E GEY PA+ Sbjct: 1717 DIRERKREEREGLSIKVEDREREKASLLKEDMDPSAASKRRKLKREHMPSGEAGEYTPAA 1776 Query: 2537 PAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDAD 2701 P PPP +I++ Q++DGR+RGDRKG +VQR GY ++PGLR H KE K RRDAD Sbjct: 1777 PPPPPPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLRIHGKEVTGKMARRDAD 1831 >ref|XP_006448121.1| hypothetical protein CICLE_v10014076mg [Citrus clementina] gi|557550732|gb|ESR61361.1| hypothetical protein CICLE_v10014076mg [Citrus clementina] Length = 1193 Score = 1005 bits (2599), Expect = 0.0 Identities = 555/984 (56%), Positives = 654/984 (66%), Gaps = 76/984 (7%) Frame = +2 Query: 47 SSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXX 226 S KP+ W +LLDTVKTMLP KAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI Sbjct: 219 SQKPVMWSDLLDTVKTMLPSKAWNSLSPDLYTTFWGLTLYDLYVPRDRYESEIAKQHAAL 278 Query: 227 XXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSC 406 SDNSSSAI KRKKDKERIQESLDRLT E KHEE+V SVRRRL+ EKD WLSSC Sbjct: 279 KALEELSDNSSSAITKRKKDKERIQESLDRLTNELHKHEENVASVRRRLSREKDRWLSSC 338 Query: 407 PDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQP 586 PDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQP Sbjct: 339 PDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQP 398 Query: 587 MICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQ 766 MICCCTEYE GRLG+FLFETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQ Sbjct: 399 MICCCTEYEAGRLGKFLFETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQ 458 Query: 767 FIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIK 946 FIKVHWKWSQRITRLLIQCLES EYMEIRNALI+LTKIS VFPVTRKSGINLEKRVAKIK Sbjct: 459 FIKVHWKWSQRITRLLIQCLESAEYMEIRNALILLTKISGVFPVTRKSGINLEKRVAKIK 518 Query: 947 SDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGA 1126 +DEREDLK RK WVTDEEFGMGYL++KPAP ASKSLS N V +Q G+ Sbjct: 519 NDEREDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQ-GS 577 Query: 1127 GLSVSQAEQMGGRTVSSGSLHSDSGNLGREPR-----RIDGDNLKQVEESANKQSEENSK 1291 ++VSQ+E G +V + G+ GR R + DN+K S S+ +S Sbjct: 578 AINVSQSEPGTGNSVKDHISRAKPGD-GRLERTESISHVKSDNVKLKGSSLTNGSDIHSS 636 Query: 1292 VTKTVV-----------------------------EPEARPVVKRSAAAGSLAKQAKQDL 1384 + T V E E++ VKRS + SL K KQDL Sbjct: 637 MPSTAVQAEMSRVVENQKQVDEDENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQDL 696 Query: 1385 SKDEDKSGKAVGR-----------------------XXXXXXXXXXXXXXXXKLTNSSTR 1495 +KD++KS KAVGR K ++SS+R Sbjct: 697 AKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSSR 756 Query: 1496 SSD-HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSG 1672 +SD H E K + +KSS+ R+ GK +G E +DA K +SR+ SPR ++ AASKSG Sbjct: 757 ASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRTMHSPRHDSSVAASKSG 816 Query: 1673 DKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTG 1852 D+ KR SP+E+ DR +KR KG+ + RD DG EVR+ ++ERS+D R D DK G Sbjct: 817 DRLQKRTSPSEDPDRPSKRYKGDTELRDSDG-EVRVPDRERSADPRFAD------LDKIG 869 Query: 1853 SDDKPL----DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVD 2020 +D++ + DR+K+K KSR DD++ EK RDRS+ER+GRERSV+ Sbjct: 870 TDEQSMYRTTDRSKDKGNERYERDHRERLDRLDKSRVDDIIPEKQRDRSMERYGRERSVE 929 Query: 2021 RVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXX 2191 R QERGADR FDRL AKD+RNKDDRSK+RY ++S EKSHVD+RF Sbjct: 930 RGQERGADRAFDRLAEKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQSLPPPPPLPPH 989 Query: 2192 XXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXX 2371 SV++GRRDEDAD+RFG+ RH+Q+LSP N+ Q Sbjct: 990 IVPQSVNAGRRDEDADKRFGSTRHSQRLSPRHDEKERRRSEENSLVSQDDAKRRREDDFR 1049 Query: 2372 XXXXXXXXALSIKMDERE--------RDKANMNKEDIDLN--ASKRRKLKREHMPS-EPG 2518 LS+KMDERE R+KAN+ KE++D N ASKRRKLKREH+PS E G Sbjct: 1050 DRKREDREGLSLKMDERERERDRDRDREKANLLKEEMDANAAASKRRKLKREHLPSGEAG 1109 Query: 2519 EYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDA 2698 EY P +P PP++I + QS+DGRDRGDRKG +QR GY E+ +R H KE A+K RRD+ Sbjct: 1110 EYSPVAPPYPPLAIGISQSYDGRDRGDRKGAAMQRTGYMEEQSMRIHGKEVATKMARRDS 1169 Query: 2699 DPMYDREWDDDKRQRAEPKRRHRK 2770 + +Y+REW+D+KRQRAE KRRHRK Sbjct: 1170 ELIYEREWEDEKRQRAEQKRRHRK 1193 >ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citrus sinensis] Length = 1874 Score = 1005 bits (2598), Expect = 0.0 Identities = 555/984 (56%), Positives = 653/984 (66%), Gaps = 76/984 (7%) Frame = +2 Query: 47 SSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXX 226 S KP+ W +LLDTVKTMLP KAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI Sbjct: 900 SQKPVMWSDLLDTVKTMLPSKAWNSLSPDLYTTFWGLTLYDLYVPRDRYESEIAKQHAAL 959 Query: 227 XXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSC 406 SDNSSSAI KRKKDKERIQESLDRLT E KHEE+V SVRRRL+ EKD WLSSC Sbjct: 960 KALEELSDNSSSAITKRKKDKERIQESLDRLTNELHKHEENVASVRRRLSREKDRWLSSC 1019 Query: 407 PDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQP 586 PDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQP Sbjct: 1020 PDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQP 1079 Query: 587 MICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQ 766 MICCCTEYE GRLG+FLFETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQ Sbjct: 1080 MICCCTEYEAGRLGKFLFETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQ 1139 Query: 767 FIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIK 946 FIKVHWKWSQRITRLLIQCLES EYMEIRNALI+LTKIS VFPVTRKSGINLEKRVAKIK Sbjct: 1140 FIKVHWKWSQRITRLLIQCLESAEYMEIRNALILLTKISGVFPVTRKSGINLEKRVAKIK 1199 Query: 947 SDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGA 1126 +DEREDLK RK WVTDEEFGMGYL++KPAP ASKSLS N V +Q G+ Sbjct: 1200 NDEREDLKVLATGVAAALANRKSFWVTDEEFGMGYLELKPAPSLASKSLSGNVVAVQ-GS 1258 Query: 1127 GLSVSQAEQMGGRTVSSGSLHSDSGNLGREPR-----RIDGDNLKQVEESANKQSEENSK 1291 ++VSQ+E G +V + G+ GR R + DN+K S S+ +S Sbjct: 1259 AINVSQSEPGTGNSVKDHISRAKPGD-GRLERTESISHVKSDNVKLKGSSLTNGSDIHSS 1317 Query: 1292 VTKTVV-----------------------------EPEARPVVKRSAAAGSLAKQAKQDL 1384 V T V E E++ VKRS + SL K KQDL Sbjct: 1318 VPSTAVQAEMSRVVENQKQVDEDENMAKVAMKNSAESESKASVKRSVPSASLTKAPKQDL 1377 Query: 1385 SKDEDKSGKAVGR-----------------------XXXXXXXXXXXXXXXXKLTNSSTR 1495 +KD++KS KAVGR K ++SS+R Sbjct: 1378 AKDDNKSAKAVGRTSGSSANDRDFSSHAAEGKQGGATTVSSAAAVTANLVSAKGSSSSSR 1437 Query: 1496 SSD-HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSG 1672 +SD H E K + +KSS+ R+ GK +G E +DA K +SR+ SPR ++ A SKSG Sbjct: 1438 ASDMHGNESKTDGGVAKSSEVRLSTGKSDGNEVSDAPKSSSSRAMHSPRHDSSVATSKSG 1497 Query: 1673 DKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTG 1852 D+ KR SP+E+ DR +KR KG+ + RD DG EVR+ ++ERS+D R D DK G Sbjct: 1498 DRLQKRTSPSEDPDRPSKRYKGDTELRDSDG-EVRVPDRERSADPRFAD------LDKIG 1550 Query: 1853 SDDKPL----DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVD 2020 +D++ + DR+K+K KSR DD++ EK RDRS+ER+GRERSV+ Sbjct: 1551 TDEQSMYRTTDRSKDKGNERYERDHRERLDRLDKSRVDDIIPEKQRDRSMERYGRERSVE 1610 Query: 2021 RVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXX 2191 R QERGADR FDRL AKD+RNKDDRSK+RY ++S EKSHVD+RF Sbjct: 1611 RGQERGADRAFDRLADKAKDDRNKDDRSKLRYNDSSSEKSHVDERFHGQSLPPPPPLPPH 1670 Query: 2192 XXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXX 2371 SV++GRRDEDAD+RFG+ RH+Q+LSP N+ Q Sbjct: 1671 IVPQSVNAGRRDEDADKRFGSTRHSQRLSPRHDEKERRRSEENSLVSQDDAKRRREDDFR 1730 Query: 2372 XXXXXXXXALSIKMDERE--------RDKANMNKEDIDLN--ASKRRKLKREHMPS-EPG 2518 LS+KMDERE R+KAN+ KE++D N ASKRRKLKREH+PS E G Sbjct: 1731 DRKREDREGLSLKMDERERERDRDRDREKANLLKEEMDANAAASKRRKLKREHLPSGEAG 1790 Query: 2519 EYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDA 2698 EY P +P PP++I + QS+DGRDRGDRKG +QR GY E+ +R H KE A+K RRD+ Sbjct: 1791 EYSPVAPPYPPLAIGISQSYDGRDRGDRKGATMQRTGYMEEQSMRIHGKEVATKMARRDS 1850 Query: 2699 DPMYDREWDDDKRQRAEPKRRHRK 2770 + +Y+REW+D+KRQRAE KRRHRK Sbjct: 1851 ELIYEREWEDEKRQRAEQKRRHRK 1874 >ref|XP_006586338.1| PREDICTED: THO complex subunit 2-like [Glycine max] Length = 1778 Score = 994 bits (2571), Expect = 0.0 Identities = 553/992 (55%), Positives = 649/992 (65%), Gaps = 82/992 (8%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G + PI+W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI Sbjct: 805 GSAQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 864 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKK+KERIQESLDRL E KHEE+V SVRRRL+HEKD WLS Sbjct: 865 NLKSLEELSDNSSSAITKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 924 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 925 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 984 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 985 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1044 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1045 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1104 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++KPAP +KS + N+ +Q+ Sbjct: 1105 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPS-VTKSSAGNSATVQS 1163 Query: 1121 GAGLSVSQAEQMGGRTVSSGSL------------------------HSDSGNLG-REPRR 1225 G L+VSQ E G+ V SG++ SD+G++ + Sbjct: 1164 GINLNVSQTESASGKHVDSGNIVKDQAMRTKTADGRSERTESITVTKSDTGHIKLKSSSM 1223 Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345 ++G +N KQVEES N+ S+E+ T+T E R KRS Sbjct: 1224 VNGLDAQSSLAPSSVQSGTSKSMENPKQVEESINRASDEHG--TRTT---ELRTSAKRSV 1278 Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513 AGSL+K +KQD K++ +SGK V R + + T SS+ NT Sbjct: 1279 PAGSLSKPSKQDPVKEDGRSGKPVARTSGSSSSDKELQTHALEGRYTGTTNVPSSNGNTI 1338 Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633 E KAE+ +KSSD R KD+G + TD + +SR S Sbjct: 1339 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGASSRVVHS 1398 Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRA 1813 PR EN SKS DK KR S AEE DRL KRRKG+++ RD + +EVR SE+E+ D R Sbjct: 1399 PRYENTGVTSKSNDKVQKRASSAEEPDRLGKRRKGDVELRDFE-TEVRFSEREKMMDPRF 1457 Query: 1814 LDKLHVAPFDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSE 1969 D DK+G + DKPL+RAK+K KSRGDD ++E Sbjct: 1458 AD-------DKSGPEEHGLYRAGDKPLERAKDKGNERYERDHRERMDRLDKSRGDDFVAE 1510 Query: 1970 KLRDRSLERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVD 2140 K RDRS+ER+GRERSV+R+QERG+DR+F+RL AKDERNKDDR+K+RY +AS+EKSH D Sbjct: 1511 KPRDRSIERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASVEKSHGD 1570 Query: 2141 DRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXN 2320 DRF SV +GRRDED DRR+G RH+Q+LSP Sbjct: 1571 DRFHGQSLPPPPPLPPNVVPQSVGAGRRDEDVDRRYGATRHSQRLSP----RHEEKERRR 1626 Query: 2321 ASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKRE 2497 + + ++ ERER+KAN+ KE++DLN ASKRRK KRE Sbjct: 1627 SEETVVSQDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKPKRE 1686 Query: 2498 HMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAA 2674 H+P+ EPGEY P + P I + ++DGRDRGDRKG I+Q P Y ++ LR H KE A Sbjct: 1687 HLPTGEPGEYSPVAHPPSSAGIGMSLAYDGRDRGDRKGPIMQHPSYVDESSLRIHGKEVA 1746 Query: 2675 SKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770 SK RRD+DP+YDREW+D+KRQRA+ KRRHRK Sbjct: 1747 SKLNRRDSDPLYDREWEDEKRQRADQKRRHRK 1778 >gb|EMJ18294.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica] Length = 1878 Score = 993 bits (2568), Expect = 0.0 Identities = 552/996 (55%), Positives = 651/996 (65%), Gaps = 86/996 (8%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G SKP+ WL+LL+TVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRN YESEI Sbjct: 896 GSPSKPVTWLDLLNTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNCYESEIAKQHA 955 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKKDKERIQESLDRLT E +KHEE+V SVR+RL+ EKD WLS Sbjct: 956 ALKALEELSDNSSSAITKRKKDKERIQESLDRLTSELRKHEENVASVRKRLSREKDKWLS 1015 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKIN+EFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHID+LIC+TL Sbjct: 1016 SCPDTLKINVEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDILICRTL 1075 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYEVGR G+FL ETLK AY+WK DESIYE+ECGNMPGFAVYYR+PNSQRV Y Sbjct: 1076 QPMICCCTEYEVGRFGKFLQETLKIAYYWKKDESIYERECGNMPGFAVYYRHPNSQRVAY 1135 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 QF+KVHWKWSQRIT+LLIQCLESTEYMEIRNALI+L+KISSVFPVTRK+G+NLEKRV+K Sbjct: 1136 FQFMKVHWKWSQRITKLLIQCLESTEYMEIRNALILLSKISSVFPVTRKTGVNLEKRVSK 1195 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IK+DEREDLK RK SW+TDEEFG GYL++K AP ASKS + N+ + Sbjct: 1196 IKADEREDLKVLATGVAAALAARKSSWITDEEFGNGYLELKSAP-LASKSSAGNSAATHS 1254 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGN-------------------------------- 1204 G+ +++SQ+E +GG+ + S H +S N Sbjct: 1255 GSTINISQSEPIGGKVGALPSQHPESSNSVKDQILKTKTSDGRLERVESISTVKSDQGHL 1314 Query: 1205 ---LGREPRRIDGDNL-----------------KQVEESANKQSEEN--SKVTKTVVEPE 1318 +G DG +L KQV ES+N+ S+EN K E E Sbjct: 1315 KLKVGSLVSGSDGQSLMSSPALQSGTSRSMENKKQVNESSNRTSDENMGKAAPKNSSESE 1374 Query: 1319 ARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRS 1498 R KRS AGSLAK KQDL+KD+ +SGK +GR N +T S Sbjct: 1375 LRAQAKRSGPAGSLAKPPKQDLAKDDGRSGKGIGRDVLCHASAVSTNVSPAIAANGNTVS 1434 Query: 1499 SD------------HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQ 1642 + H + K ++ +K+S++RV K++G E +DA + +SR SPR Sbjct: 1435 ASAKGSFAKTSVEIHGIDSKVDVGAAKASNTRVSAPKEDGPETSDALRPHSSRLVHSPRH 1494 Query: 1643 ENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDK 1822 +N +ASKS DK KR SPAEE DR +KRRKGE + RD +G E RLS++ERS D R LD Sbjct: 1495 DNSASASKSSDKLQKRTSPAEETDRQSKRRKGETEMRDFEG-EARLSDRERSVDARLLD- 1552 Query: 1823 LHVAPFDKTGSD--------DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLR 1978 DK+G+D DKP DR+K+K KSRGDD L E+ R Sbjct: 1553 -----LDKSGTDDQSVYKATDKPSDRSKDKGSERHDKDYRERLDRPDKSRGDD-LGERSR 1606 Query: 1979 DRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXX 2158 DRS+ERHGRE SV++VQERG DR+ DRL+ +++KDDR KVRY + S EKSHVD+R+ Sbjct: 1607 DRSMERHGREHSVEKVQERGMDRSVDRLS--DKSKDDRGKVRYNDISTEKSHVDERYHGQ 1664 Query: 2159 XXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQX 2338 SVSSGRRDEDADRRFG RHTQ+LSP N+ Q Sbjct: 1665 SLPPPPPLPPHMVPHSVSSGRRDEDADRRFGTTRHTQRLSPRHDEKERRRSEDNSLISQD 1724 Query: 2339 XXXXXXXXXXXXXXXXXXXALSIKMD----ERERDKANMNKEDID-LNASKRRKLKREHM 2503 LSIK++ ERER+KAN+ KE+ D + ASKRRKLKREH Sbjct: 1725 DSKRRREDDFRDRKREDREGLSIKVEEREREREREKANLLKEETDAIAASKRRKLKREHP 1784 Query: 2504 PS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASK 2680 PS EPGEY P P PPP+SI+L QS+DGRDRGDRKG VQR GY E+P +R H KEAASK Sbjct: 1785 PSGEPGEYSPVPPPPPPLSISLSQSYDGRDRGDRKGPPVQRAGYLEEPSVRIHGKEAASK 1844 Query: 2681 ATRRDADP------MYDREWDDDKRQRAEPKRRHRK 2770 TRRD DP MY EW+D+KRQRAE KRRHRK Sbjct: 1845 MTRRDPDPYPSCCRMY--EWEDEKRQRAEQKRRHRK 1878 >gb|ESW32460.1| hypothetical protein PHAVU_002G324500g [Phaseolus vulgaris] Length = 1864 Score = 992 bits (2564), Expect = 0.0 Identities = 548/980 (55%), Positives = 642/980 (65%), Gaps = 70/980 (7%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G + PI+W LLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI Sbjct: 896 GSAQNPISWSYLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 955 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKK+KERIQESLDRL E KHEE+V SV RL+ EKD WLS Sbjct: 956 NLKSLEELSDNSSSAITKRKKEKERIQESLDRLISELHKHEENVASVHSRLSREKDRWLS 1015 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1016 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1075 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1076 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1135 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLES+EYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1136 GQFIKVHWKWSQRITRLLIQCLESSEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1195 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++KPAP +KS + N + + Sbjct: 1196 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPS-GTKSSAGNPSTVHS 1254 Query: 1121 GAGLSVSQAEQMGGRTVSSG------------------------SLHSDSGN----LGRE 1216 G L+VSQ E G+ V SG + SDSG+ G Sbjct: 1255 GMNLNVSQTESASGKHVDSGNTVKDQVIRTKTTDGKSERTESMTATKSDSGHTKVKTGAM 1314 Query: 1217 PRRIDG-----------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345 DG +N KQVEE N+ S+++ T+T E+R KRS Sbjct: 1315 VNGFDGQTSSISSSIQSGMSKSMENSKQVEELINRASDDHG--TRTA---ESRASAKRSV 1369 Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNS----------STR 1495 GSL+K +KQD K++ +SGK V R +T+S ST+ Sbjct: 1370 PTGSLSKPSKQDPLKEDSRSGKPVARTSGSLSSDKDLHSGTTNVTSSVSANGNTITGSTK 1429 Query: 1496 SSD---------HNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQEN 1648 S+ E KAE+ SKSSD R KD+G + D + +SR SPR EN Sbjct: 1430 GSNAPVRISLDGPGNESKAEVGVSKSSDIRASVVKDDGNDTADLTRGSSSRVVHSPRHEN 1489 Query: 1649 LTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR-ALDKL 1825 ASKS +K KR S AEE DRL KRRKG+++ RD + SEVR S++++ D R A DKL Sbjct: 1490 TGVASKSNEKVQKRASSAEEPDRLGKRRKGDVELRDFE-SEVRFSDRDKLMDPRFADDKL 1548 Query: 1826 HVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGR 2005 + DK L+R K+K KSRGDD ++EK RDRS+ER+GR Sbjct: 1549 GPEEHGLYRAGDKSLERPKDKGNERYERDHRERLDRVDKSRGDDSVAEKPRDRSIERYGR 1608 Query: 2006 ERSVDRVQERGADRNFDR---LAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXX 2176 ERSV+R+QERG++R+F+R AKDER+KDDR+K+RY +AS+EKSH DDRF Sbjct: 1609 ERSVERMQERGSERSFNRPPEKAKDERSKDDRNKLRYSDASVEKSHADDRFHGQSLPPPP 1668 Query: 2177 XXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXX 2356 SV +GRRDEDADRR+G RH+Q+LSP + Sbjct: 1669 PLPPNMVPQSVGAGRRDEDADRRYGATRHSQRLSP----RHEEKERRRSEETVVSQDDAK 1724 Query: 2357 XXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLP 2530 + ++ ERER+KAN+ KED+DLN ASKRRKLKREH+ + EPGEY P Sbjct: 1725 RRKEDDFRERKREEIKVEEREREREKANVLKEDLDLNAASKRRKLKREHLSTGEPGEYSP 1784 Query: 2531 ASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADPMY 2710 +P PPP I + +DGRDRGDRKG ++Q P Y ++P +R H KE ASK RRD+DP+Y Sbjct: 1785 VAPPPPPTGIGMPLGYDGRDRGDRKGPVIQHPNYIDEPNIRIHGKEVASKLNRRDSDPLY 1844 Query: 2711 DREWDDDKRQRAEPKRRHRK 2770 DREWDD+KRQRA+ KRRHRK Sbjct: 1845 DREWDDEKRQRADQKRRHRK 1864 >ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isoform X1 [Glycine max] Length = 1870 Score = 988 bits (2553), Expect = 0.0 Identities = 548/985 (55%), Positives = 645/985 (65%), Gaps = 75/985 (7%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G PI+W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI Sbjct: 897 GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAIAKRKK+KERIQESLDRL E KHEE+V SVRRRL+HEKD WLS Sbjct: 957 NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++KP+P +KS + N+ +Q+ Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPS-MTKSSAGNSATVQS 1255 Query: 1121 GAGLSVSQAEQMGGRTVSSGS------------------------LHSDSGNLG-REPRR 1225 G L+VSQ E + G+ V SG+ SD+G++ + Sbjct: 1256 GINLNVSQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSM 1315 Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345 ++G +N KQVEES N+ S+E+ T++ E R KRS Sbjct: 1316 VNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHG--TRST---ELRTSAKRSV 1370 Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513 A SLAK +KQD K++ +SGK V R + ++ T SS+ NT Sbjct: 1371 PASSLAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSSNGNTI 1430 Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633 E KAE+ +KSSD R KD+G + TD + +SR S Sbjct: 1431 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHS 1490 Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR- 1810 PR EN SKS D+ KR S EE DRL KRRKG+++ RD + +E+R SE+E+ D R Sbjct: 1491 PRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFE-TELRFSEREKMMDPRF 1549 Query: 1811 ALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSL 1990 A DKL + DKPL+R K+K KSRGDD ++EK RDRS+ Sbjct: 1550 ADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEKPRDRSI 1609 Query: 1991 ERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXX 2161 ER+GRERSV+R+QERG+DR+F+RL AKDERNKDDR+K+RY +AS EKSH DDRF Sbjct: 1610 ERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASAEKSHGDDRFHGQS 1669 Query: 2162 XXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341 SV +GRRDED DRR+G RH+Q+LSP + Sbjct: 1670 LPPPPPLPPNVVPQSVGAGRRDEDVDRRYGATRHSQRLSPRHEEKERRWSEETVVS---- 1725 Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EP 2515 + ++ ERER+KAN+ KE++DLN ASKRRKLKREH+P+ EP Sbjct: 1726 QDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLPTDEP 1785 Query: 2516 GEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRD 2695 GEY + P + ++DGRDRGDRKG I+Q P Y ++ LR H KEAASK RRD Sbjct: 1786 GEYSAVAHPPSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKLNRRD 1845 Query: 2696 ADPMYDREWDDDKRQRAEPKRRHRK 2770 +DP+YDREW+D+KRQRA+ KRRHRK Sbjct: 1846 SDPLYDREWEDEKRQRADQKRRHRK 1870 >gb|EOY01329.1| THO complex subunit 2 isoform 5 [Theobroma cacao] Length = 1824 Score = 987 bits (2552), Expect = 0.0 Identities = 547/954 (57%), Positives = 633/954 (66%), Gaps = 66/954 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KPI W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPRNRYESEI Sbjct: 898 GPPRKPIVWSELLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIAKQHA 957 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNSSSAI KRKKDKERIQE+LDRLT E KHEE+V SVRRRL +EKD WLS Sbjct: 958 ALKALEELPDNSSSAINKRKKDKERIQEALDRLTSELHKHEENVASVRRRLTYEKDKWLS 1017 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1018 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1077 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WK+DESIYE ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1078 QPMICCCTEYEAGRLGRFLYETLKIAYYWKADESIYEHECGNMPGFAVYYRYPNSQRVTY 1137 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1138 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1197 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RK SWVTDEEFGMGYL++KPA ASKSL+ N V +QN Sbjct: 1198 IKSDEREDLKVLATGVAAALAARKSSWVTDEEFGMGYLELKPATSLASKSLAGNTVSVQN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSD---------------------SGNLGREPRRIDG- 1234 G+ ++VSQ+E G R V+ G+ SD + +LG+ + G Sbjct: 1258 GSSINVSQSEAAGARAVALGTQQSDVNLVKDQIPRTKSDGRLERAENASLGKSDLKTKGG 1317 Query: 1235 -------------------------DNLKQVEESANKQSEENSKV-TKTVVEPEARPVVK 1336 +N KQ++ES+NK E +KV K E E++ K Sbjct: 1318 TSANGSDAVLSVVLATSQAGTGKSLENQKQLDESSNKLDEHLAKVPAKNSAELESKASAK 1377 Query: 1337 RSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTE 1516 RSA AGSL K KQD KD+ KSGKAVGR +T R +TE Sbjct: 1378 RSAPAGSLTKTQKQDPGKDDGKSGKAVGRT---------------SVTCVIDRDVPSHTE 1422 Query: 1517 IKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLS 1696 + T + S + GKD+G+E DA + P+SR SPR ++ SKS DK KR + Sbjct: 1423 GRQGGTTNVPS-AVTSNGKDDGSELPDASR-PSSRIVHSPRHDSSATVSKSSDKLQKRTT 1480 Query: 1697 PAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD------ 1858 P EE DRL KRRKG+++ +D+DG EVRLS++ERS+D + D FDK G+D Sbjct: 1481 PVEETDRLTKRRKGDVELKDLDG-EVRLSDRERSTDPQLAD------FDKPGTDELTSHR 1533 Query: 1859 --DKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQE 2032 DKPLDR+K+K KSR DD+L+EK RDRS+ER+GRERSV E Sbjct: 1534 AVDKPLDRSKDKGSERHDRDYRERLERPEKSRADDILTEKSRDRSIERYGRERSV----E 1589 Query: 2033 RGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXX 2203 R DRN +RL AKDER+KD+RSKVRY + S EKSHVDDRF Sbjct: 1590 RSTDRNLERLGDKAKDERSKDERSKVRYADTSTEKSHVDDRFHGQSLPPPPPLPPHMVPQ 1649 Query: 2204 SVSS-GRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXX 2380 SV++ GRRD+D DRRFG+ RH+Q+LSP N+ Q Sbjct: 1650 SVNATGRRDDDPDRRFGSTRHSQRLSPRHEDKERRRSEENSLVSQDDGKRRREDDFRERK 1709 Query: 2381 XXXXXALSIKMDERERD------KANMNKEDIDLNASKRRKLKREHMPSEPGEYLPASPA 2542 LS+K++ER+RD KA++ KED+D N +KRRKLKREH+PSEPGEY P +P Sbjct: 1710 REEREGLSMKVEERDRDRERDREKASLLKEDVDANVAKRRKLKREHLPSEPGEYSPIAPP 1769 Query: 2543 PPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADP 2704 PPP++I + QS+DGRDR DRKG ++QR GY E+PG+R H KEAASK RRD DP Sbjct: 1770 PPPLAIGMSQSYDGRDR-DRKGSMMQRGGYLEEPGMRIHGKEAASKMARRDTDP 1822 >ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus] gi|449506883|ref|XP_004162874.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus] Length = 1887 Score = 984 bits (2545), Expect = 0.0 Identities = 547/994 (55%), Positives = 646/994 (64%), Gaps = 84/994 (8%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G KP+ W +LLDTVK+MLPPKAWNSLSPDLY TFWGLTLYDLYVPR+RYESEI Sbjct: 899 GSLQKPVRWSDLLDTVKSMLPPKAWNSLSPDLYTTFWGLTLYDLYVPRSRYESEIAKQHA 958 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKKDKERIQESLDRL+ E KHEE+V SVRRRL+ EKD WLS Sbjct: 959 ALKALEELSDNSSSAINKRKKDKERIQESLDRLSNELVKHEENVASVRRRLSREKDKWLS 1018 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1019 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1078 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AYHWKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1079 QPMICCCTEYEAGRLGRFLYETLKIAYHWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1138 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKIS+VFPVTRKSGINLEKRVAK Sbjct: 1139 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISNVFPVTRKSGINLEKRVAK 1198 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++K P ASK ++N QN Sbjct: 1199 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELK-TPSLASKPSASNLASSQN 1257 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGRE-----------PRRIDG----------- 1234 + + VSQ E +GG+T + +SDSGN+ ++ +IDG Sbjct: 1258 NS-IFVSQNEPVGGKTSALPIPNSDSGNMAKDHSLRSRTSDVRTDKIDGLSVPKSELGHG 1316 Query: 1235 -----------------------------DNLKQVEESANKQSEENSK-VTKTVVEPEAR 1324 D+ K ++S E +SK V+KT E E R Sbjct: 1317 KQKGMSLNGPDSQPLVPSTSVHSGSLKMVDSQKPGDDSTRTLDEGSSKVVSKTSSESELR 1376 Query: 1325 PVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXK---------- 1474 KRS SL K KQD++KDE +SGKA + Sbjct: 1377 GSTKRSGPVTSLNKAPKQDITKDEIRSGKAASKNPGSSTSERELPVHATDGGRHGGPSNS 1436 Query: 1475 ---LTNSSTRSS-------------DHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHK 1606 ++N +T++S H E KAE ++SD RV KD+G E D + Sbjct: 1437 PSIMSNGNTQNSLTKGSSLTVKASDGHTIESKAESGVGRTSDGRVSSVKDDGPEALDVSR 1496 Query: 1607 QPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSE 1786 +SR SPR +N + S+S DK KR SPAEE DR KRRKG+ + RD+DG + R+S+ Sbjct: 1497 SSSSRLGHSPRHDNSASGSRSSDKLQKRASPAEEPDRQGKRRKGDGEIRDVDG-DFRISD 1555 Query: 1787 KERSSDVRALDKLHVAPFDKTG--SDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDL 1960 K+RS D R++D + +++G DKPLDR K+K KSRGDD Sbjct: 1556 KDRSMDPRSIDADKIGMEEQSGYRGLDKPLDRTKDKVNERYDRDYRDRAERPEKSRGDDP 1615 Query: 1961 LSEKLRDRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVD 2140 E+ RDRS+ER+GRERSV++V ER +DR + +KDERNKDDRSK+RY +++++KSH D Sbjct: 1616 QVERTRDRSIERYGRERSVEKV-ERVSDR-YPEKSKDERNKDDRSKLRYSDSTVDKSHTD 1673 Query: 2141 DRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXN 2320 DRF SV+SGRR+EDADRRFG ARH Q+LSP N Sbjct: 1674 DRFHGQSLPPPPPLPPHLVPQSVNSGRREEDADRRFGTARHAQRLSPRHEEKERRRSEEN 1733 Query: 2321 ASALQXXXXXXXXXXXXXXXXXXXXALSIKMD--ERERDKANMNKEDIDLN-ASKRRKLK 2491 + +S+K+D ERER+KAN+ KED+D + ASKRRKLK Sbjct: 1734 LISQDDAKRRREEEFRERKREERDVGMSLKVDDREREREKANLLKEDMDASAASKRRKLK 1793 Query: 2492 REHMP-SEPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKE 2668 REH+ E GEY P P PPP+ + QS+DGR+RGDRKGV++QRPGY +DPGLR H KE Sbjct: 1794 REHLSLVEAGEYSPVGPPPPPMGGGVSQSYDGRERGDRKGVMMQRPGYLDDPGLRIHGKE 1853 Query: 2669 AASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770 +K TRR+AD MY+REWDD+KR RA+ KRRHRK Sbjct: 1854 VVNKMTRREADLMYEREWDDEKRMRADQKRRHRK 1887 >ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isoform X2 [Glycine max] Length = 1845 Score = 970 bits (2507), Expect = 0.0 Identities = 542/985 (55%), Positives = 639/985 (64%), Gaps = 75/985 (7%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G PI+W LLDTVKTMLP KAWNSLSPDLYATFWGLTLYDLYVP+NRYESEI Sbjct: 897 GSDQNPISWSYLLDTVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPKNRYESEIAKLHA 956 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAIAKRKK+KERIQESLDRL E KHEE+V SVRRRL+HEKD WLS Sbjct: 957 NLKSLEELSDNSSSAIAKRKKEKERIQESLDRLISELHKHEENVASVRRRLSHEKDKWLS 1016 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1076 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLGRFL+ETLK AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTY Sbjct: 1077 QPMICCCTEYEAGRLGRFLYETLKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTY 1136 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK Sbjct: 1137 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 1196 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYL++KP+P +KS + N+ +Q+ Sbjct: 1197 IKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPSPS-MTKSSAGNSATVQS 1255 Query: 1121 GAGLSVSQAEQMGGRTVSSGS------------------------LHSDSGNLG-REPRR 1225 G L+VSQ E + G+ V SG+ SD+G++ + Sbjct: 1256 GINLNVSQTESVSGKHVDSGNTVKDQAIRTKTVDGKSERIESITVTKSDAGHIKLKSSSM 1315 Query: 1226 IDG--------------------DNLKQVEESANKQSEENSKVTKTVVEPEARPVVKRSA 1345 ++G +N KQVEES N+ S+E+ T++ E R KRS Sbjct: 1316 VNGLDAQSSMAPSSVQSGMPKSMENPKQVEESINRASDEHG--TRST---ELRTSAKRSV 1370 Query: 1346 AAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTR---SSDHNT- 1513 A SLAK +KQD K++ +SGK V R + ++ T SS+ NT Sbjct: 1371 PASSLAKPSKQDPVKEDGRSGKPVARTSGSLSSDKDLQTHALEGRHTGTTNVPSSNGNTI 1430 Query: 1514 --------------------EIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQS 1633 E KAE+ +KSSD R KD+G + TD + +SR S Sbjct: 1431 SGSTKGSNPPVKISLDGPGNESKAEVGVAKSSDIRASMVKDDGNDITDNPRGSSSRIVHS 1490 Query: 1634 PRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVR- 1810 PR EN SKS D+ KR S EE DRL KRRKG+++ RD + +E+R SE+E+ D R Sbjct: 1491 PRHENTVVTSKSNDRVQKRASSVEEPDRLGKRRKGDVELRDFE-TELRFSEREKMMDPRF 1549 Query: 1811 ALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSL 1990 A DKL + DKPL+R K+K KSRGDD ++EK RDRS+ Sbjct: 1550 ADDKLGPEEHGLYRASDKPLERTKDKGNERYERDHRERMDRLDKSRGDDFVAEKPRDRSI 1609 Query: 1991 ERHGRERSVDRVQERGADRNFDRL---AKDERNKDDRSKVRYGEASLEKSHVDDRFXXXX 2161 ER+GRERSV+R+QERG+DR+F+RL AKDERNKDDR+K+RY +AS EKSH Sbjct: 1610 ERYGRERSVERMQERGSDRSFNRLPEKAKDERNKDDRNKLRYNDASAEKSH--------- 1660 Query: 2162 XXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXX 2341 +GRRDED DRR+G RH+Q+LSP + Sbjct: 1661 ----------------GAGRRDEDVDRRYGATRHSQRLSPRHEEKERRWSEETVVS---- 1700 Query: 2342 XXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASKRRKLKREHMPS-EP 2515 + ++ ERER+KAN+ KE++DLN ASKRRKLKREH+P+ EP Sbjct: 1701 QDDAKRRKEDDFRDRKREEIKVEEREREREKANILKEELDLNAASKRRKLKREHLPTDEP 1760 Query: 2516 GEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRD 2695 GEY + P + ++DGRDRGDRKG I+Q P Y ++ LR H KEAASK RRD Sbjct: 1761 GEYSAVAHPPSSAGTGMPLAYDGRDRGDRKGPIMQHPSYIDESSLRIHGKEAASKLNRRD 1820 Query: 2696 ADPMYDREWDDDKRQRAEPKRRHRK 2770 +DP+YDREW+D+KRQRA+ KRRHRK Sbjct: 1821 SDPLYDREWEDEKRQRADQKRRHRK 1845 >emb|CBI26799.3| unnamed protein product [Vitis vinifera] Length = 1767 Score = 937 bits (2422), Expect = 0.0 Identities = 526/939 (56%), Positives = 612/939 (65%), Gaps = 33/939 (3%) Frame = +2 Query: 53 KPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXXXXXX 232 KPI W +LLDT +TMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI Sbjct: 902 KPIVWSDLLDTARTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRHRYESEIAKQHSALKA 961 Query: 233 XXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLSSCPD 412 SDNS+SAI KRKKDKERIQESLDRLT E QKHEE+V SVRRRLA EKD WLSSCPD Sbjct: 962 LEELSDNSNSAITKRKKDKERIQESLDRLTSELQKHEENVASVRRRLAREKDKWLSSCPD 1021 Query: 413 TLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTLQPMI 592 TLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTLQPMI Sbjct: 1022 TLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQPMI 1081 Query: 593 CCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTYGQFI 772 CCCTEYE GRLGRFL+ET+K AY+WKSDESIYE+ECGNMPGFAVYYRYPNSQRVTYGQFI Sbjct: 1082 CCCTEYEAGRLGRFLYETMKIAYYWKSDESIYERECGNMPGFAVYYRYPNSQRVTYGQFI 1141 Query: 773 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 952 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD Sbjct: 1142 KVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAKIKSD 1201 Query: 953 EREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQNGAGL 1132 EREDLK RKPSWVTDEEFGMGYL++KPAP ASK++++ L G + Sbjct: 1202 EREDLKVLATGVAAALAARKPSWVTDEEFGMGYLELKPAPSLASKTVASGTQHLDAGNSV 1261 Query: 1133 ------------------SVS-------QAEQMGGRTVSSG----SLHSDSGNLGREPRR 1225 SVS A+ GG +V+ S+ S + + G Sbjct: 1262 KEQVLRAKTVDGRLERTESVSLVKSDPVHAKVKGGSSVNGSDIQQSMPSAASHTGTSR-- 1319 Query: 1226 IDGDNLKQVEESANKQSEENS-KV-TKTVVEPEARPVVKRSAAAGSLAKQAKQDLSKDED 1399 G+N + V+ES N+ +E++ KV ++ E E R KRS +GSL KQ K D++KD+ Sbjct: 1320 -SGENQRPVDESTNRTLDESTVKVSSRASTESELRATGKRSLPSGSLTKQPKLDVAKDDS 1378 Query: 1400 KSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTEIKAEITNSKSSDSRVYGGKDE 1579 KSGK VGR + R S A + S+D R+ KD+ Sbjct: 1379 KSGKGVGRTSGSSTSDRDLPAHQLE-----GRQSGVTNVSSAGTADGSSADLRLSAVKDD 1433 Query: 1580 GTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDSRDI 1759 G E +D + P+SR SPR +N +A KSGDK KR SPAEE +R+NKRRKG+ + RD Sbjct: 1434 GNEVSD--RAPSSRPIHSPRHDN-SATIKSGDKQQKRTSPAEEPERVNKRRKGDTEVRDF 1490 Query: 1760 DGSEVRLSEKERSSDVRALDKLHVAPFDKTGSDDKPLDRAKEKTGXXXXXXXXXXXXXXX 1939 +G EVR S+KE R D ++P Sbjct: 1491 EG-EVRFSDKESERYER----------DHRERLERP-----------------------D 1516 Query: 1940 KSRGDDLLSEKLRDRSLERHGRERSVDRVQERGADRNFDRLAKDERNKDDRSKVRYGEAS 2119 KSRGD++++EK RDRS+ERHGRERSV+RVQER ++R Sbjct: 1517 KSRGDEMIAEKSRDRSMERHGRERSVERVQERSSER------------------------ 1552 Query: 2120 LEKSHVDDRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXX 2299 +KSH DDRF SV++ RRDEDADRRFG ARH Q+LSP Sbjct: 1553 -KKSHADDRFHGQSLPPPPPLPPHMVPQSVTASRRDEDADRRFGTARHAQRLSP---RHE 1608 Query: 2300 XXXXXXNASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKANMNKEDIDLN-ASK 2476 + Q LSIK+++RER+KA++ KED+D + ASK Sbjct: 1609 EKERRRSEEISQDDAKRRREDDIRERKREEREGLSIKVEDREREKASLLKEDMDPSAASK 1668 Query: 2477 RRKLKREHMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLR 2653 RRKLKREHMPS E GEY PA+P PPP +I++ Q++DGR+RGDRKG +VQR GY ++PGLR Sbjct: 1669 RRKLKREHMPSGEAGEYTPAAPPPPPPAISMSQAYDGRERGDRKGAMVQRAGYLDEPGLR 1728 Query: 2654 AHSKEAASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770 H KE K RRDAD MYDREWDD+KRQRAE KRRHRK Sbjct: 1729 IHGKEVTGKMARRDADQMYDREWDDEKRQRAEQKRRHRK 1767 >ref|XP_003631008.1| THO complex subunit [Medicago truncatula] gi|355525030|gb|AET05484.1| THO complex subunit [Medicago truncatula] Length = 2048 Score = 931 bits (2406), Expect = 0.0 Identities = 538/1013 (53%), Positives = 633/1013 (62%), Gaps = 103/1013 (10%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G + PI+W +LLDTVKTMLPPK WNSLSPDLYATFWGLTLYDL+VP+NRYESEI Sbjct: 1063 GSNKNPISWSDLLDTVKTMLPPKTWNSLSPDLYATFWGLTLYDLHVPKNRYESEIAKLHA 1122 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI KRKK+KERIQESLDRL+ E KHEE+V SV RRL HEKD WLS Sbjct: 1123 NLKSLEELSDNSSSAITKRKKEKERIQESLDRLSSELHKHEENVVSVSRRLFHEKDKWLS 1182 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+ LHSLGTPFFNTVNHIDVLICKTL Sbjct: 1183 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHKLHSLGTPFFNTVNHIDVLICKTL 1242 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWK-----------------------SDESIYE 691 QPMICCCTEYEVGRLGRFL+ETLK AYHWK SDESIYE Sbjct: 1243 QPMICCCTEYEVGRLGRFLYETLKIAYHWKLFRACSIILIFTFIFVSSFYYLQSDESIYE 1302 Query: 692 KECGNMPGFAVYYRYPNSQRVTYGQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIML 871 +ECGNMPGFAVYYR PN QRVTYGQFIKVHWKWSQRITRLLIQCLES+EYMEIRNALIML Sbjct: 1303 RECGNMPGFAVYYRNPNGQRVTYGQFIKVHWKWSQRITRLLIQCLESSEYMEIRNALIML 1362 Query: 872 TKISSVFPVTRKSGINLEKRVAKIKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGY 1051 TKISSVFPVTRKSGINLEKRVAKIKSDEREDLK RKPSWVTDEEFGMGY Sbjct: 1363 TKISSVFPVTRKSGINLEKRVAKIKSDEREDLKVLATGVAAALAARKPSWVTDEEFGMGY 1422 Query: 1052 LDIKPAPGPASKSLSANAVGLQNGAGLSVSQAEQMGGRTVSSG----------------- 1180 L++KPAP +KS + N+ +Q+G GL SQ E G+ + SG Sbjct: 1423 LELKPAPS-MTKSAAGNSAAVQSGIGLQFSQTESASGKHLDSGNTVKDQTVKTKTADGKS 1481 Query: 1181 -------SLHSDSGN---------------------LGREPRRIDGDNLKQVEESANKQS 1276 + SDSG+ G+ +N KQVEES ++ Sbjct: 1482 ERTESLTATKSDSGHGKLKGSSMVNGVDAQSSLASPAGQSGALKSVENQKQVEESISRAP 1541 Query: 1277 EENSKVTKTVVEPEARPVVK-RSAAAGSLAKQAKQDLSKDEDKSGKAV----GRXXXXXX 1441 +E+ +T+ V E+RP VK RS A GSL K +KQD K++ +SGK V G Sbjct: 1542 DEH--ITRNV---ESRPSVKQRSVATGSLLKPSKQDPLKEDGRSGKTVTRTSGSSSSDKD 1596 Query: 1442 XXXXXXXXXXKLTNSSTRSSDHNTEIKAEITNSKSSDSRVYGG----------------- 1570 TN S+ S + + + + + G Sbjct: 1597 LQTHASDGRHTGTNISSSFSANGNSVSGSAKGLAQAATTAFDGSGNESKAEVGAAKFSMV 1656 Query: 1571 KDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDKPHKRLSPAEEHDRLNKRRKGEIDS 1750 KD+ E+ D + +SR SPR EN TA SKS DK KR +E DRL KRRKG+ID Sbjct: 1657 KDDVNEFADFTRGSSSRVVHSPRHEN-TATSKSSDKIQKRAGSVDELDRLGKRRKGDIDL 1715 Query: 1751 RDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD--------DKPLDRAKEKTGXXXX 1906 RD++G EVR SE+E+ D R D DK G D DK L+R KEK Sbjct: 1716 RDLEG-EVRFSEREKLMDPRLAD-------DKVGPDELGVYRTGDKTLERPKEKGTDRYE 1767 Query: 1907 XXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQERGADRNFDRL---AKDER 2077 KSRGDD + EK RDRS+ER+GRERSV+RVQERG++R+F+RL AKD+R Sbjct: 1768 REHRERLDRLDKSRGDDFVVEKPRDRSIERYGRERSVERVQERGSERSFNRLPDKAKDDR 1827 Query: 2078 NKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXXSVSSGRRDEDADRRFGNA 2257 +KDDR+K+RY +A++EKSH + RF S+ +GRRDEDADRR+G Sbjct: 1828 SKDDRNKLRYNDATIEKSHAEGRFHGQSLPPPPPLPPNMVPQSLGAGRRDEDADRRYGAT 1887 Query: 2258 RHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXXXXXXALSIKMDERERDKA 2437 RH+Q+LSP LQ +K++ERER+KA Sbjct: 1888 RHSQRLSPRHEEKELRRSEETV-ILQDDPKRRKEDDFRDRKRE-----EMKVEEREREKA 1941 Query: 2438 NMNKEDIDLN-ASKRRKLKREHMPS-EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGV 2611 ++ KE+ DLN ASKRRKLKREH+P+ EPGEY P APP I + Q++DGR DRKG Sbjct: 1942 SILKEE-DLNAASKRRKLKREHLPTMEPGEYSPV--APPLSGIGMSQAYDGR---DRKGP 1995 Query: 2612 IVQRPGYAEDPGLRAHSKEAASKATRRDADPMYDREWDDDKRQRAEPKRRHRK 2770 ++Q Y ++P LR H KE ASK RR++DP+YDREWDD+KRQRA+ KRRHRK Sbjct: 1996 MIQHASYIDEPSLRIHGKEVASKLNRRESDPLYDREWDDEKRQRADQKRRHRK 2048 >ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi|223533086|gb|EEF34845.1| tho2 protein, putative [Ricinus communis] Length = 1828 Score = 926 bits (2392), Expect = 0.0 Identities = 526/949 (55%), Positives = 624/949 (65%), Gaps = 61/949 (6%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G S KPI W +LL+TVKTMLP KAWNSLSPDLYATFWGLTLYDLYVPR+RYESEI Sbjct: 893 GSSQKPIMWSDLLETVKTMLPSKAWNSLSPDLYATFWGLTLYDLYVPRDRYESEIAKQHA 952 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 SDNSSSAI+KRKKDKERIQESLDRLT E KHEE+V SVRRRL+ EKD WLS Sbjct: 953 ALKALEELSDNSSSAISKRKKDKERIQESLDRLTSELHKHEENVASVRRRLSREKDKWLS 1012 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCA FV+TLHSLGTPFFNTVNHIDVLICKTL Sbjct: 1013 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTL 1072 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTEYE GRLG+FL ETLK AY+WKSDESIYE+ECGNMPGFAVYYR+PNSQRVTY Sbjct: 1073 QPMICCCTEYEAGRLGKFLHETLKIAYYWKSDESIYERECGNMPGFAVYYRFPNSQRVTY 1132 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQFIKVHWKWSQRI+RLLIQCLESTEYMEIRNALI+LTKIS VFPVT++SGINLEKRVA+ Sbjct: 1133 GQFIKVHWKWSQRISRLLIQCLESTEYMEIRNALILLTKISGVFPVTKRSGINLEKRVAR 1192 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IKSDEREDLK RKPSWVTDEEFGMGYLDI+P ASKS+S N QN Sbjct: 1193 IKSDEREDLKVLATSVASALAARKPSWVTDEEFGMGYLDIRPP--AASKSVSGNISVGQN 1250 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLHSDSGNLGRE----------------------PRRIDG 1234 +GL+ SQ E GGR VS+ + H D GN +E +++ G Sbjct: 1251 SSGLNASQGESAGGRAVSTTTQHGDVGNSAKEHISRAKPADKQESVSYVKSDSVNQKVKG 1310 Query: 1235 DNL---KQVEESA--------NKQSEENSK-----------VTKTVVEPEARPVVKRSAA 1348 +L ++ SA +S EN K K E E++ KR+ Sbjct: 1311 GSLVIQSDLQSSAALVTGQAGASRSAENQKQMSESPIIIPDAPKNSAESESKASGKRAMP 1370 Query: 1349 AGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRSSDHNTEIKAE 1528 AGS+ K +QD++KD+ KSGK VGR ++ S + T + + Sbjct: 1371 AGSV-KTPRQDVAKDDLKSGKTVGRVPVASSSDKDMP------SHLSESRLGNGTNVSST 1423 Query: 1529 ITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQE-NLTAASKSGDKPHKRLSPAE 1705 T++ + V KD+ TE D K P SR SPR + + ++SKS DK KR SP + Sbjct: 1424 GTSNDGAAKSVV--KDDATEVGDVQK-PPSRVVHSPRHDGSFASSSKSSDKLQKRASPGD 1480 Query: 1706 EHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD-------DK 1864 + DRL+KRRKG+ + RD+DG ++R S++ER D R +D DK GSD DK Sbjct: 1481 DPDRLSKRRKGDTELRDLDG-DIRFSDRERPMDSRLVD------LDKIGSDERVHRSMDK 1533 Query: 1865 PLDRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERSVDRVQER-GA 2041 PLDR+K+K KSRGDD+L E+ RDRS+ER+GRERSV+R QER GA Sbjct: 1534 PLDRSKDKGMERYDRDHRERSERPDKSRGDDILVERPRDRSMERYGRERSVERGQERGGA 1593 Query: 2042 DRNFDRLA---KDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXXXXXSVS 2212 DR+FDR + KDERNKD KVRYG+ S+EK H DDRF SV+ Sbjct: 1594 DRSFDRFSDKTKDERNKD---KVRYGDTSVEKLH-DDRFYGQNLPPPPPLPPHVVPQSVT 1649 Query: 2213 SGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXXXXXXXX 2392 + RRDEDADRR G+ARH+ +LSP N+ Q Sbjct: 1650 ASRRDEDADRRIGSARHSLRLSPRHDEKERRRSEENSLVSQDDVKRGRDDNFRDRKRDER 1709 Query: 2393 XALSIKMDERERDKANMN---KEDIDLN-ASKRRKLKREHMPS-EPGEYLPASPAPPPVS 2557 L++K+++RERD+ K+DID+ ASKRRKLKREHMPS E GEY P +P PPP++ Sbjct: 1710 EGLAMKVEDRERDREREKVPLKDDIDVGAASKRRKLKREHMPSGEAGEYSPVAPPPPPLA 1769 Query: 2558 INLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATRRDADP 2704 I++ QS+DGR+RGDR G ++QR GY E+P +R H KE A K TRRDADP Sbjct: 1770 ISMSQSYDGRERGDR-GALIQRAGYLEEPPMRIHGKEVAGKMTRRDADP 1817 >ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fragaria vesca subsp. vesca] Length = 1860 Score = 898 bits (2320), Expect = 0.0 Identities = 512/987 (51%), Positives = 619/987 (62%), Gaps = 77/987 (7%) Frame = +2 Query: 41 GYSSKPIAWLNLLDTVKTMLPPKAWNSLSPDLYATFWGLTLYDLYVPRNRYESEIXXXXX 220 G + PI WL+LLDT KTMLP +AWNSLSPDLYATFWGLTLYDLYVPRN Y SEI Sbjct: 897 GSTWNPITWLDLLDTAKTMLPARAWNSLSPDLYATFWGLTLYDLYVPRNCYISEIAKQQA 956 Query: 221 XXXXXXXXSDNSSSAIAKRKKDKERIQESLDRLTLEFQKHEEHVESVRRRLAHEKDTWLS 400 DNS S I +RKK+KERIQE++DRL E +KHEEHV SVR+RL EKD WLS Sbjct: 957 SIKALEEQPDNSISEILRRKKEKERIQETIDRLISESRKHEEHVASVRKRLLREKDKWLS 1016 Query: 401 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYCANFVNTLHSLGTPFFNTVNHIDVLICKTL 580 SCPDTLKINMEFLQRCIFPRCTFSMPDAVY A FV+TLH+LGTPFFNTVNH+DVLIC+TL Sbjct: 1017 SCPDTLKINMEFLQRCIFPRCTFSMPDAVYSAMFVHTLHTLGTPFFNTVNHMDVLICRTL 1076 Query: 581 QPMICCCTEYEVGRLGRFLFETLKTAYHWKSDESIYEKECGNMPGFAVYYRYPNSQRVTY 760 QPMICCCTE EVGRLG+FL ETLK AY+WKSDESIYE+ECGNMPGFAVYYR+P+SQRV Y Sbjct: 1077 QPMICCCTESEVGRLGKFLCETLKIAYYWKSDESIYERECGNMPGFAVYYRFPDSQRVRY 1136 Query: 761 GQFIKVHWKWSQRITRLLIQCLESTEYMEIRNALIMLTKISSVFPVTRKSGINLEKRVAK 940 GQF+KVHWKWSQRITRLL QCLESTEYMEIRNALI+L++ISSVFPVTRKS +NLEKRV+K Sbjct: 1137 GQFVKVHWKWSQRITRLLGQCLESTEYMEIRNALIILSRISSVFPVTRKSALNLEKRVSK 1196 Query: 941 IKSDEREDLKXXXXXXXXXXXXRKPSWVTDEEFGMGYLDIKPAPGPASKSLSANAVGLQN 1120 IK D REDLK RKPS V+DEEF MGY+++K A +SK L++N+ + + Sbjct: 1197 IKGDGREDLKVLATSVGASLAARKPSLVSDEEFCMGYVELKSAS--SSKPLASNSGAIHS 1254 Query: 1121 GAGLSVSQAEQMGGRTVSSGSLH-------------------------------SDSGNL 1207 G ++ SQ E GG+ + S H SD G+L Sbjct: 1255 GPAVNNSQTEPAGGKAGTLVSQHAELIDSARDHVSKAKPADGRSERAESVSTAKSDPGHL 1314 Query: 1208 GREPRRIDG---------------------DNLKQVEESANKQSEENSK--VTKTVVEPE 1318 + + +N Q+ E++ +++EEN+ K E E Sbjct: 1315 KHKGASLVNGSDAQASVPSATLQAGTARPIENQVQLNETSTRRAEENTGKLAAKNTSESE 1374 Query: 1319 ARPVVKRSAAAGSLAKQAKQDLSKDEDKSGKAVGRXXXXXXXXXXXXXXXXKLTNSSTRS 1498 R KRS AG AK KQDL KDE +SGKA G + + S Sbjct: 1375 LRAQAKRSVPAG--AKPLKQDLVKDESRSGKAAGATNVSSITANGST-----VPSLGKGS 1427 Query: 1499 SDHNTEIKAEITNSKSSDSRVYGGKDEGTEYTDAHKQPTSRSTQSPRQENLTAASKSGDK 1678 + E K E ++K S++R+ K+EG E +D + P+SR SPR ++ SKS DK Sbjct: 1428 ASLGIESKVEAGSAKISNTRIPSSKEEGAEVSDVARPPSSRFVNSPRHDSSATLSKSSDK 1487 Query: 1679 PHKRLSPAEEHDRLNKRRKGEIDSRDIDGSEVRLSEKERSSDVRALDKLHVAPFDKTGSD 1858 KR PAEE DR +KRRKGE + RD +G E RLS++ERS D R LD DK+GSD Sbjct: 1488 LQKRTGPAEETDRQSKRRKGEAEMRDSEG-EARLSDRERSVDARLLD------LDKSGSD 1540 Query: 1859 DKPL--------DRAKEKTGXXXXXXXXXXXXXXXKSRGDDLLSEKLRDRSLERHGRERS 2014 D+ + DR+K+K KSRGDDL+ E+ RDRS+ERHGR+ S Sbjct: 1541 DRSVYKATEKASDRSKDKGNERHDKDHRERADRPDKSRGDDLV-ERSRDRSMERHGRDHS 1599 Query: 2015 VDRVQERGADRNFDRLAKDERNKDDRSKVRYGEASLEKSHVDDRFXXXXXXXXXXXXXXX 2194 +++QERG+DR+FDRL E++KD++ K RY + S EKSHVD+R+ Sbjct: 1600 AEKLQERGSDRSFDRLP--EKSKDEKGKGRYSDISTEKSHVDERYHGQSLPPPPPLPPHI 1657 Query: 2195 XXXSVSSGRRDEDADRRFGNARHTQKLSPXXXXXXXXXXXXNASALQXXXXXXXXXXXXX 2374 SVSSGRRDED+DRR RHTQ+LSP N+S Q Sbjct: 1658 VPQSVSSGRRDEDSDRRT-TTRHTQRLSPRHDEKERRRSEENSSISQDDSKRRREDDFRE 1716 Query: 2375 XXXXXXXALSIKMDERERD--------------KANMNKEDIDL-NASKRRKLKREHMPS 2509 +S+K+DER+RD KAN++KED D+ ASKRRKLKR+ Sbjct: 1717 RKRDDREGISVKVDERDRDRDRDREREREKEREKANLSKEDPDMIAASKRRKLKRDLSSV 1776 Query: 2510 EPGEYLPASPAPPPVSINLLQSHDGRDRGDRKGVIVQRPGYAEDPGLRAHSKEAASKATR 2689 E GEY P P PPP+SINL QS+DGRDRG+RKG IV R GY E+P LR H KE ++K TR Sbjct: 1777 EAGEYSPVHP-PPPLSINLSQSYDGRDRGERKGPIVARTGYVEEPSLRIHGKEVSNKMTR 1835 Query: 2690 RDADPMYDREWDDDKRQRAEPKRRHRK 2770 RD DPMY EWDDDKR R E KRRHRK Sbjct: 1836 RDTDPMY--EWDDDKR-RGEQKRRHRK 1859