BLASTX nr result
ID: Mentha23_contig00020442
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha23_contig00020442 (1471 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EPS70916.1| hypothetical protein M569_03841, partial [Genlise... 654 0.0 ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solan... 635 e-179 ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solan... 632 e-178 ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis... 632 e-178 emb|CBI26799.3| unnamed protein product [Vitis vinifera] 632 e-178 ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa]... 622 e-175 ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa]... 622 e-175 ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi... 621 e-175 ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citru... 618 e-174 ref|XP_007045498.1| THO complex subunit 2 isoform 6, partial [Th... 617 e-174 ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma c... 617 e-174 ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma c... 617 e-174 ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|50870943... 617 e-174 ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma c... 617 e-174 ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma c... 617 e-174 ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prun... 613 e-173 ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucum... 612 e-172 ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isofor... 595 e-167 ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isofor... 595 e-167 ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fraga... 594 e-167 >gb|EPS70916.1| hypothetical protein M569_03841, partial [Genlisea aurea] Length = 1222 Score = 654 bits (1687), Expect = 0.0 Identities = 319/415 (76%), Positives = 373/415 (89%) Frame = -2 Query: 1248 LLIQKAETMSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPC 1069 L+++K MSLP L+CVY+TEE++KELKN N++F+FP +LRFLYELC+ VRGDLP Sbjct: 2 LIVRK---MSLPHLQCVYLTEESIKELKNSNTSFRFPRSGHVLRFLYELCFVMVRGDLPY 58 Query: 1068 QKCKVALEAVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVP 889 QKCK A++A+EF DC + D+GSYFADIV+QM QD ++GEYR+RL+KLAKWLVESALVP Sbjct: 59 QKCKAAVDAMEFLDCGPEEDMGSYFADIVAQMGQDHAVVGEYRSRLVKLAKWLVESALVP 118 Query: 888 LRFFQERCDEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTL 709 LR+FQERC+EEFLWE EMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTL Sbjct: 119 LRYFQERCEEEFLWECEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTL 178 Query: 708 LCQVPQVSTENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPK 529 LCQVP+VS E+ S A +GI+KSLIGHFDLDPNRVFDIVLECFELQL NSVFLDLIP+FPK Sbjct: 179 LCQVPEVSNEDLSTAIIGIVKSLIGHFDLDPNRVFDIVLECFELQLHNSVFLDLIPLFPK 238 Query: 528 SHASQILGFKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQY 349 SHASQILGFKFQY+Q++E+N+P P+GLYQLTAL+ KK+FID++SIYSHL+PKDE AFE Y Sbjct: 239 SHASQILGFKFQYYQQLEINSPAPSGLYQLTALLAKKKFIDIESIYSHLVPKDEDAFEHY 298 Query: 348 NAFSAKRLDEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANN 169 NA AKRL+EANKIGKINLAATGKDLMDD+KQGDVTVDLFAA D+E +AV+ERSS+LA++ Sbjct: 299 NALLAKRLEEANKIGKINLAATGKDLMDDDKQGDVTVDLFAALDIEEMAVSERSSELASS 358 Query: 168 QPLGLLMGFLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 Q LGLLMGFL+VDDW HA+QLL+RLSPLNPVE+ IC+GLFRLIEK I AY L+ Sbjct: 359 QSLGLLMGFLSVDDWLHANQLLDRLSPLNPVEHTHICAGLFRLIEKAIFSAYMLL 413 >ref|XP_006343178.1| PREDICTED: THO complex subunit 2-like [Solanum tuberosum] Length = 1859 Score = 635 bits (1638), Expect = e-179 Identities = 318/408 (77%), Positives = 359/408 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSL PLE +Y TE+++KELKN N++FKF P P LRFLYELC+ VRG+LP QKCK+ALE Sbjct: 1 MSLSPLEYLYFTEDSIKELKNGNTSFKFAQPLPTLRFLYELCWVMVRGELPFQKCKMALE 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 VEF D ++GS ADIV+Q+AQDL++ GE R R+ KLAKWLVESALVPLRFFQERC Sbjct: 61 CVEFVDYASQEELGSSLADIVTQLAQDLSLPGENRQRVNKLAKWLVESALVPLRFFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ+P+ S Sbjct: 121 EEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQIPEGS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 ++N SAATVGIIKSLIGHFDLDPNRVFDIVLECFE Q NS+FLDLIPIFPKSHASQILG Sbjct: 181 SQNSSAATVGIIKSLIGHFDLDPNRVFDIVLECFERQPGNSIFLDLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QR+EVN PVP+ LYQLTAL+VK++FIDVDSIY+HLLPK+E AF+ YNAFSAKRL Sbjct: 241 FKFQYYQRLEVNDPVPSELYQLTALLVKRDFIDVDSIYAHLLPKEEDAFDHYNAFSAKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIG+INLAATGKDLMD+EKQGDVTVDL+AA DME AVAERSS+L N+QPLGLLMG Sbjct: 301 DEANKIGRINLAATGKDLMDEEKQGDVTVDLYAALDMETEAVAERSSELENSQPLGLLMG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLVC 1 FL VDDW+HAH L RLS LNP E+++IC GLFRLIEK+I LVC Sbjct: 361 FLEVDDWYHAHVLFGRLSHLNPAEHVQICDGLFRLIEKSISGPNDLVC 408 >ref|XP_004239260.1| PREDICTED: THO complex subunit 2-like [Solanum lycopersicum] Length = 1858 Score = 632 bits (1630), Expect = e-178 Identities = 317/408 (77%), Positives = 358/408 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSL PLE +Y TE ++KELKN N++FKF P P LRFLYELC+ VRG+LP QKCK+ALE Sbjct: 1 MSLSPLEYLYFTEHSIKELKNGNTSFKFAQPLPTLRFLYELCWVMVRGELPFQKCKLALE 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 VEF D ++GS ADIV+Q+AQDL++ GE R R+ KLAKWLVESALVPLRFFQERC Sbjct: 61 CVEFVDYASQEELGSSLADIVTQLAQDLSLPGENRQRVNKLAKWLVESALVPLRFFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ+P+ S Sbjct: 121 EEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQIPEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 ++N SAATVGIIKSLIGHFDLDPNRVFDIVLECFE Q NS+FLDLIPIFPKSHASQILG Sbjct: 181 SQNASAATVGIIKSLIGHFDLDPNRVFDIVLECFERQPGNSIFLDLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QR+EVN PVP+ LYQLTAL+VK++FIDVDSIY+HLLPK+E AF+ YNAFSAKRL Sbjct: 241 FKFQYYQRLEVNDPVPSELYQLTALLVKRDFIDVDSIYAHLLPKEEDAFDHYNAFSAKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIG+INLAATGKDLMD+EKQGDVTVDL+AA DME AVAERSS+L N+QPLGLLMG Sbjct: 301 DEANKIGRINLAATGKDLMDEEKQGDVTVDLYAALDMETEAVAERSSELENSQPLGLLMG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLVC 1 FL V+DW+HAH L RLS LNP E+++IC GLFRLIEK+I LVC Sbjct: 361 FLEVNDWYHAHVLFGRLSHLNPAEHVQICDGLFRLIEKSISGPNDLVC 408 >ref|XP_002281541.2| PREDICTED: THO complex subunit 2-like [Vitis vinifera] Length = 1849 Score = 632 bits (1630), Expect = e-178 Identities = 316/407 (77%), Positives = 357/407 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC++VT++ ++E K+ N +FK P+LRFLYELC T VRG+LP KCKVAL+ Sbjct: 1 MSLPPIECIHVTDDCLREWKSGNPSFKVSGTVPMLRFLYELCSTLVRGELPLHKCKVALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEFSD + D ++ S FADIV+QMA DLTM GE R RLIKLAKWLVES LVPLR FQERC Sbjct: 61 SVEFSDKEADEELASNFADIVTQMALDLTMPGENRARLIKLAKWLVESTLVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWESEMIKIKA +LK+KEVRVNTRLLYQQTKFNL+REESEGY+KLVTLLCQ + S Sbjct: 121 EEEFLWESEMIKIKAQELKNKEVRVNTRLLYQQTKFNLVREESEGYSKLVTLLCQGSESS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 ++N SAAT+GIIKSLIGHFDLDPNRVFDIVLECFE Q DNSVFLDLIPIFPKSHASQILG Sbjct: 181 SQNASAATIGIIKSLIGHFDLDPNRVFDIVLECFEHQPDNSVFLDLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FK+QY+QRMEVN VP GLYQLTAL+VK+EFID+DSIY+HLLPKDE AFE YN FSAKRL Sbjct: 241 FKYQYYQRMEVNNRVPPGLYQLTALLVKEEFIDLDSIYAHLLPKDEEAFEHYNVFSAKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAERSS+L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERSSELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FLAVDDW+HAH L +RLSPLNPV +IEIC+GL RLIEK+I AY +V Sbjct: 361 FLAVDDWYHAHILFDRLSPLNPVAHIEICNGLLRLIEKSISTAYGIV 407 >emb|CBI26799.3| unnamed protein product [Vitis vinifera] Length = 1767 Score = 632 bits (1630), Expect = e-178 Identities = 316/407 (77%), Positives = 357/407 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC++VT++ ++E K+ N +FK P+LRFLYELC T VRG+LP KCKVAL+ Sbjct: 1 MSLPPIECIHVTDDCLREWKSGNPSFKVSGTVPMLRFLYELCSTLVRGELPLHKCKVALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEFSD + D ++ S FADIV+QMA DLTM GE R RLIKLAKWLVES LVPLR FQERC Sbjct: 61 SVEFSDKEADEELASNFADIVTQMALDLTMPGENRARLIKLAKWLVESTLVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWESEMIKIKA +LK+KEVRVNTRLLYQQTKFNL+REESEGY+KLVTLLCQ + S Sbjct: 121 EEEFLWESEMIKIKAQELKNKEVRVNTRLLYQQTKFNLVREESEGYSKLVTLLCQGSESS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 ++N SAAT+GIIKSLIGHFDLDPNRVFDIVLECFE Q DNSVFLDLIPIFPKSHASQILG Sbjct: 181 SQNASAATIGIIKSLIGHFDLDPNRVFDIVLECFEHQPDNSVFLDLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FK+QY+QRMEVN VP GLYQLTAL+VK+EFID+DSIY+HLLPKDE AFE YN FSAKRL Sbjct: 241 FKYQYYQRMEVNNRVPPGLYQLTALLVKEEFIDLDSIYAHLLPKDEEAFEHYNVFSAKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAERSS+L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERSSELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FLAVDDW+HAH L +RLSPLNPV +IEIC+GL RLIEK+I AY +V Sbjct: 361 FLAVDDWYHAHILFDRLSPLNPVAHIEICNGLLRLIEKSISTAYGIV 407 >ref|XP_006376042.1| F5A9.22 family protein [Populus trichocarpa] gi|550325266|gb|ERP53839.1| F5A9.22 family protein [Populus trichocarpa] Length = 1805 Score = 622 bits (1603), Expect = e-175 Identities = 311/406 (76%), Positives = 355/406 (87%) Frame = -2 Query: 1221 SLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALEA 1042 +LPP+EC++VTEE + ELK+ N +F+ P P PILRFLYEL +T VRG+LP QKCK AL++ Sbjct: 4 TLPPMECLHVTEEFLLELKSGNRSFRLPHPVPILRFLYELSWTLVRGELPFQKCKAALDS 63 Query: 1041 VEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERCD 862 VEF D +GS FADI++QMAQDLTM GEYR+RLIKLAKWLVESALVPLRFFQERC+ Sbjct: 64 VEFVDKMSAVGLGSNFADIITQMAQDLTMSGEYRSRLIKLAKWLVESALVPLRFFQERCE 123 Query: 861 EEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVST 682 EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+ + + Sbjct: 124 EEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRGSEDTA 183 Query: 681 ENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILGF 502 EN SAAT+GIIKSLIGHFDLDPNRVFDIVLECFELQ D++VFL+LIPIFPKSHASQILGF Sbjct: 184 ENTSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDSNVFLELIPIFPKSHASQILGF 243 Query: 501 KFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRLD 322 KFQY+QRME+N+PVP GL++LTAL+VK+EFID+DSI +HLLPKD+ AFE YN FS+KRLD Sbjct: 244 KFQYYQRMELNSPVPFGLFKLTALLVKEEFIDLDSICAHLLPKDDEAFEHYNTFSSKRLD 303 Query: 321 EANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMGF 142 A KIGKINLAATGKDLMDDEKQGDVTVDLFAA DME AVAE+ SDL NQ LGLL GF Sbjct: 304 AAYKIGKINLAATGKDLMDDEKQGDVTVDLFAALDMETEAVAEQFSDLEKNQTLGLLTGF 363 Query: 141 LAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 L+VDDW+HAH L +RLSPLNPV + +IC GLFRLIEKTI AY ++ Sbjct: 364 LSVDDWYHAHILFKRLSPLNPVAHTQICGGLFRLIEKTISSAYNII 409 >ref|XP_002325475.1| F5A9.22 family protein [Populus trichocarpa] gi|222862350|gb|EEE99856.1| F5A9.22 family protein [Populus trichocarpa] Length = 1836 Score = 622 bits (1603), Expect = e-175 Identities = 312/406 (76%), Positives = 355/406 (87%) Frame = -2 Query: 1221 SLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALEA 1042 +LPP+EC+YVTEE ++ELK N +F+ P P PILRFLYEL + VRG+LP QKCK AL++ Sbjct: 4 TLPPMECLYVTEEFLRELKGGNHSFRLPHPVPILRFLYELSWNLVRGELPFQKCKAALDS 63 Query: 1041 VEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERCD 862 VEF D +GS FADI++QMAQDLTM GEYR+RLIKLAKWLVESALVPLRFFQERC+ Sbjct: 64 VEFVDKVSAVGLGSNFADIITQMAQDLTMSGEYRSRLIKLAKWLVESALVPLRFFQERCE 123 Query: 861 EEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVST 682 EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLL Q + +T Sbjct: 124 EEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLYQGSEDTT 183 Query: 681 ENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILGF 502 EN SAAT+GIIKSLIGHFDLDPNRVFDIVLE FELQ D++VFL+LIPIFPKSHASQILGF Sbjct: 184 ENTSAATIGIIKSLIGHFDLDPNRVFDIVLEYFELQPDSNVFLELIPIFPKSHASQILGF 243 Query: 501 KFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRLD 322 KFQY+QR+E+N+ VP GLY+LTAL+VK+EFID+DSI +HLLPKD+ AFE YN FS+KRLD Sbjct: 244 KFQYYQRIELNSHVPFGLYKLTALLVKEEFIDLDSICAHLLPKDDEAFEHYNTFSSKRLD 303 Query: 321 EANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMGF 142 EANKIGKINLAATGKDLMDDEKQGDVTVDLFAA DME AVAER S+L NNQ LGLL GF Sbjct: 304 EANKIGKINLAATGKDLMDDEKQGDVTVDLFAALDMEAEAVAERFSELENNQTLGLLTGF 363 Query: 141 LAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 L+VDDW+HAH L ERLSPLNPV + +IC+GLFRLIEK + AY ++ Sbjct: 364 LSVDDWYHAHVLFERLSPLNPVAHTQICNGLFRLIEKLVSSAYNII 409 >ref|XP_002527536.1| tho2 protein, putative [Ricinus communis] gi|223533086|gb|EEF34845.1| tho2 protein, putative [Ricinus communis] Length = 1828 Score = 621 bits (1601), Expect = e-175 Identities = 309/407 (75%), Positives = 356/407 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP++C+YV E+ ++E K+ +S+F+ P P P+LRFLYELC+T VRG+LP KCK ALE Sbjct: 1 MSLPPIDCIYVREDYIREWKSGSSSFRVPDPVPMLRFLYELCWTMVRGELPYLKCKAALE 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VE+++ + S FADIV+QMAQDLTM GEYR RLIKLAKWLVES+LVPLRFFQERC Sbjct: 61 SVEYTESVSARVLASTFADIVTQMAQDLTMPGEYRARLIKLAKWLVESSLVPLRFFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNL+REESEGYAKLVTLLCQ Sbjct: 121 EEEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLVREESEGYAKLVTLLCQGYDNV 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 N SAAT+GIIKSLIGHFDLDPNRVFDIVLECFELQ DN++FLDLIPIFPKSHASQILG Sbjct: 181 NSNASAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDNNIFLDLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QR+EVN+PVP GLY+LTAL+VK+EFID+DSIYSHLLP+D+ AFE Y AFS+KRL Sbjct: 241 FKFQYYQRLEVNSPVPFGLYKLTALLVKEEFIDLDSIYSHLLPRDDEAFEHYVAFSSKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVTVDLFAA DME AVAER S+L N+Q LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTVDLFAALDMETDAVAERLSELENSQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDWFHAH L +RLS LNPV +++IC GLFRLIEK+I AY ++ Sbjct: 361 FLSVDDWFHAHILFDRLSLLNPVGHVQICKGLFRLIEKSISAAYDII 407 >ref|XP_006469280.1| PREDICTED: THO complex subunit 2-like [Citrus sinensis] Length = 1874 Score = 618 bits (1594), Expect = e-174 Identities = 308/407 (75%), Positives = 352/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLP ++C Y+TEE ++E KN N +F+ P P P+LRFLYELC TVRG+LP QKCK A++ Sbjct: 1 MSLPQIQCKYITEECLREWKNGNPSFRVPDPVPMLRFLYELCSITVRGELPFQKCKAAVD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEF + V S FADIV+QMAQDLTM GE+R RLIKLAKWLVESALVPLR FQERC Sbjct: 61 SVEFVEKPSHRVVASTFADIVTQMAQDLTMPGEHRVRLIKLAKWLVESALVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC + + Sbjct: 121 EEEFLWEAEMIKIKAQDLKGKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCHTYENA 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 TE+ SAAT+GIIKSLIGHFDLDPNRVFDIVLEC+ELQ +N VFL+LIPIFPKSHAS ILG Sbjct: 181 TESASAATIGIIKSLIGHFDLDPNRVFDIVLECYELQPNNKVFLELIPIFPKSHASHILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVN+PVP LY+LTAL+VK+EFID+DSIY+HLLPKD+ AFE YNAFSAKRL Sbjct: 241 FKFQYYQRMEVNSPVPFSLYKLTALLVKEEFIDLDSIYTHLLPKDDEAFEHYNAFSAKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA D+E AVAERS +L N+Q LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDLENEAVAERSPELENSQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HAH L ERL+PLNPV +I+IC GL RLIE +I AY +V Sbjct: 361 FLSVDDWYHAHILFERLAPLNPVAHIQICDGLLRLIENSISSAYDIV 407 >ref|XP_007045498.1| THO complex subunit 2 isoform 6, partial [Theobroma cacao] gi|508709433|gb|EOY01330.1| THO complex subunit 2 isoform 6, partial [Theobroma cacao] Length = 1345 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007045497.1| THO complex subunit 2 isoform 5 [Theobroma cacao] gi|508709432|gb|EOY01329.1| THO complex subunit 2 isoform 5 [Theobroma cacao] Length = 1824 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007045496.1| THO complex subunit 2 isoform 4 [Theobroma cacao] gi|508709431|gb|EOY01328.1| THO complex subunit 2 isoform 4 [Theobroma cacao] Length = 1831 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007045495.1| THO2 isoform 3 [Theobroma cacao] gi|508709430|gb|EOY01327.1| THO2 isoform 3 [Theobroma cacao] Length = 1762 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007045494.1| THO complex subunit 2 isoform 2 [Theobroma cacao] gi|508709429|gb|EOY01326.1| THO complex subunit 2 isoform 2 [Theobroma cacao] Length = 1844 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007045493.1| THO complex subunit 2 isoform 1 [Theobroma cacao] gi|508709428|gb|EOY01325.1| THO complex subunit 2 isoform 1 [Theobroma cacao] Length = 1853 Score = 617 bits (1592), Expect = e-174 Identities = 304/407 (74%), Positives = 353/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC+Y+TEE ++E K+ NSNF F + P+LRFLYELC+T VRG+LP QKCK L+ Sbjct: 1 MSLPPIECMYITEEILREGKSGNSNFSFSSSVPMLRFLYELCWTMVRGELPFQKCKAVLD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 AVEF++ + ++GS FADIV+QMAQDLTM GEYRTRLIKLAKWLVES++VPLR F ER Sbjct: 61 AVEFTERVSEDELGSCFADIVTQMAQDLTMAGEYRTRLIKLAKWLVESSVVPLRLFHERS 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA DLK KEVRVNTRLLYQQTKFNLLREESEGYAKL+TLL + + S Sbjct: 121 EEEFLWEAEMIKIKAPDLKVKEVRVNTRLLYQQTKFNLLREESEGYAKLITLLFRGSEDS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+N S A +G+IKSLIGHFDLDPNRVFDIVLEC+ELQ D FL LIPIFPKSHASQILG Sbjct: 181 TQNASTARIGVIKSLIGHFDLDPNRVFDIVLECYELQPDKDAFLQLIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVNTP P GLY+LTAL+VK+EFID+DSIY+HLLPKD+ FEQ+N+FS KRL Sbjct: 241 FKFQYYQRMEVNTPTPFGLYKLTALLVKEEFIDLDSIYTHLLPKDDETFEQFNSFSTKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFAA DME AVAER+ +L NNQ LGLL G Sbjct: 301 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFAALDMETEAVAERTPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HA L +RLSPLNPV +++IC GLFRLIEK+I AY +V Sbjct: 361 FLSVDDWYHARILFDRLSPLNPVAHVQICKGLFRLIEKSISLAYDIV 407 >ref|XP_007217095.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica] gi|462413245|gb|EMJ18294.1| hypothetical protein PRUPE_ppa000084mg [Prunus persica] Length = 1878 Score = 613 bits (1580), Expect = e-173 Identities = 308/407 (75%), Positives = 354/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+E YV E+ V+E KN SNFK P P+LRFLYELC T V G+LP QKCK AL+ Sbjct: 1 MSLPPVERAYVREDCVREWKNGTSNFKLADPVPMLRFLYELCSTMVSGELPLQKCKAALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEFSD D ++ S FADIV+Q++QD+ M GE+R RLIKLAKWLVES+LVPLR FQERC Sbjct: 61 SVEFSDKVSDEELASSFADIVTQLSQDIRMPGEHRARLIKLAKWLVESSLVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA +LKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ + S Sbjct: 121 EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQNSETS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 + N +AAT+GIIKSLIGHFDLDPN VFDIVLE FELQ D++VFL+LIPIFPKSHASQILG Sbjct: 181 SHN-AAATIGIIKSLIGHFDLDPNHVFDIVLEYFELQPDSNVFLELIPIFPKSHASQILG 239 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QR+EVN+PVP GLY+LTAL+VK+EFID+DSIY+HLLPKD+ AFE Y+AFS+KRL Sbjct: 240 FKFQYYQRLEVNSPVPFGLYKLTALLVKEEFIDLDSIYAHLLPKDDEAFEHYSAFSSKRL 299 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLMDDEKQGDVT+DLFAA DME AV ERS++ NNQ LGLL G Sbjct: 300 DEANKIGKINLAATGKDLMDDEKQGDVTIDLFAALDMETEAVGERSTECENNQTLGLLTG 359 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+V+DW+HAH L ERLSPL+PVE+I+IC+ LFRLIEKTI AY V Sbjct: 360 FLSVNDWYHAHLLFERLSPLHPVEHIQICNSLFRLIEKTISSAYDTV 406 >ref|XP_004142861.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus] gi|449506883|ref|XP_004162874.1| PREDICTED: THO complex subunit 2-like [Cucumis sativus] Length = 1887 Score = 612 bits (1578), Expect = e-172 Identities = 296/407 (72%), Positives = 356/407 (87%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 M+LPP+EC+YV E ++E K+ NS+F+ P P P++RFLYELC+T VRGDLP QKCK AL+ Sbjct: 1 MALPPVECMYVVESNIREWKSGNSSFRVPQPVPVVRFLYELCWTMVRGDLPFQKCKAALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEFS+ ++GS FAD+++Q+AQD+T+ GEYR RL+KLAKWLVESA VPLR FQERC Sbjct: 61 SVEFSEKMSAEELGSTFADVITQLAQDITLAGEYRARLLKLAKWLVESAFVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA +LKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+V S Sbjct: 121 EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRVTDAS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 ++ +T+GIIKSLIGHFDLDPNRVFDIVLECFELQ +NSVF++LIPIFPKSHASQILG Sbjct: 181 NKSFPGSTIGIIKSLIGHFDLDPNRVFDIVLECFELQPENSVFVELIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QR+EVN+PVP GLY+LTAL+VK++FID+DSIY+HLLPK++ AFE Y +FS+KRL Sbjct: 241 FKFQYYQRIEVNSPVPFGLYKLTALLVKEKFIDLDSIYAHLLPKEDEAFEHYGSFSSKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEA++IGKINLAATGKDLMDDEKQGDV++DLFAA DME AV ERS +L NNQ LGLL G Sbjct: 301 DEASRIGKINLAATGKDLMDDEKQGDVSIDLFAAIDMESEAVNERSPELENNQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+V DW+HAH L +RLSPLNPVE + IC+ LFRLIE++I AY +V Sbjct: 361 FLSVGDWYHAHVLFDRLSPLNPVELLPICNSLFRLIEESISSAYSIV 407 >ref|XP_006580422.1| PREDICTED: THO complex subunit 2-like isoform X2 [Glycine max] Length = 1845 Score = 595 bits (1535), Expect = e-167 Identities = 294/407 (72%), Positives = 348/407 (85%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC YVTEE ++E ++ N K P P+LRFLYELC+T VRG+LP QKCKVAL+ Sbjct: 1 MSLPPIECAYVTEECIREWRSGNPALKVSQPVPMLRFLYELCWTMVRGELPFQKCKVALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +V FSD + + S F+DIV+QMAQD TM GE+R+RLIKLA+WLVES +VP+R QERC Sbjct: 61 SVIFSDKASNEKIASNFSDIVTQMAQDHTMSGEFRSRLIKLARWLVESEMVPVRLLQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFL E E+IKIKA +LK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+ + Sbjct: 121 EEEFLGEVELIKIKAQELKVKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRDSEAP 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+ SAAT+GIIKSLIGHFDLDPNRVFDIVLECFELQ D+ VF++LIPIFPKSHASQILG Sbjct: 181 TQKSSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDDDVFIELIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVN PVP GLY+LTAL+VK++FID+DSIY+HLLP+D+ AFE YN FS+KRL Sbjct: 241 FKFQYYQRMEVNGPVPFGLYRLTALLVKQDFIDLDSIYAHLLPRDDEAFEHYNTFSSKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIG+INLAA GKDLMDDEKQGDVT+DLFAA DME AV ER+++L ++Q LGLL G Sbjct: 301 DEANKIGRINLAAIGKDLMDDEKQGDVTIDLFAAIDMETDAVEERTTELQSSQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HAH L ERLSPLN VE+I+IC LFRLI+K+I AY ++ Sbjct: 361 FLSVDDWYHAHLLFERLSPLNAVEHIQICDSLFRLIKKSISSAYDVI 407 >ref|XP_006580421.1| PREDICTED: THO complex subunit 2-like isoform X1 [Glycine max] Length = 1870 Score = 595 bits (1535), Expect = e-167 Identities = 294/407 (72%), Positives = 348/407 (85%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+EC YVTEE ++E ++ N K P P+LRFLYELC+T VRG+LP QKCKVAL+ Sbjct: 1 MSLPPIECAYVTEECIREWRSGNPALKVSQPVPMLRFLYELCWTMVRGELPFQKCKVALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +V FSD + + S F+DIV+QMAQD TM GE+R+RLIKLA+WLVES +VP+R QERC Sbjct: 61 SVIFSDKASNEKIASNFSDIVTQMAQDHTMSGEFRSRLIKLARWLVESEMVPVRLLQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFL E E+IKIKA +LK KEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLC+ + Sbjct: 121 EEEFLGEVELIKIKAQELKVKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCRDSEAP 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 T+ SAAT+GIIKSLIGHFDLDPNRVFDIVLECFELQ D+ VF++LIPIFPKSHASQILG Sbjct: 181 TQKSSAATIGIIKSLIGHFDLDPNRVFDIVLECFELQPDDDVFIELIPIFPKSHASQILG 240 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQY+QRMEVN PVP GLY+LTAL+VK++FID+DSIY+HLLP+D+ AFE YN FS+KRL Sbjct: 241 FKFQYYQRMEVNGPVPFGLYRLTALLVKQDFIDLDSIYAHLLPRDDEAFEHYNTFSSKRL 300 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIG+INLAA GKDLMDDEKQGDVT+DLFAA DME AV ER+++L ++Q LGLL G Sbjct: 301 DEANKIGRINLAAIGKDLMDDEKQGDVTIDLFAAIDMETDAVEERTTELQSSQTLGLLTG 360 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FL+VDDW+HAH L ERLSPLN VE+I+IC LFRLI+K+I AY ++ Sbjct: 361 FLSVDDWYHAHLLFERLSPLNAVEHIQICDSLFRLIKKSISSAYDVI 407 >ref|XP_004297411.1| PREDICTED: THO complex subunit 2-like [Fragaria vesca subsp. vesca] Length = 1860 Score = 594 bits (1532), Expect = e-167 Identities = 295/407 (72%), Positives = 351/407 (86%) Frame = -2 Query: 1224 MSLPPLECVYVTEEAVKELKNPNSNFKFPAPAPILRFLYELCYTTVRGDLPCQKCKVALE 1045 MSLPP+E ++ E+ ++E K N +FK P P P+LRFLYELC T VRG+LP QKC+ AL+ Sbjct: 1 MSLPPVERAHINEDHLREWKTGNPSFKLPEPVPMLRFLYELCSTMVRGELPVQKCRAALD 60 Query: 1044 AVEFSDCDLDGDVGSYFADIVSQMAQDLTMLGEYRTRLIKLAKWLVESALVPLRFFQERC 865 +VEFS+ + ++ S ADIV+QM+QDLTM GE+R RL KLAKWLVES+LVPLR FQERC Sbjct: 61 SVEFSEKVSEQELASSLADIVTQMSQDLTMPGEHRARLTKLAKWLVESSLVPLRLFQERC 120 Query: 864 DEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQVPQVS 685 +EEFLWE+EMIKIKA +LKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQ + S Sbjct: 121 EEEFLWEAEMIKIKAQELKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQNSENS 180 Query: 684 TENQSAATVGIIKSLIGHFDLDPNRVFDIVLECFELQLDNSVFLDLIPIFPKSHASQILG 505 + N + AT+GIIKSLIGHFDLDPN VFDIVLECFEL DN+VFL+LIPIFPKSHASQILG Sbjct: 181 SHN-AGATIGIIKSLIGHFDLDPNHVFDIVLECFELLPDNNVFLELIPIFPKSHASQILG 239 Query: 504 FKFQYFQRMEVNTPVPTGLYQLTALMVKKEFIDVDSIYSHLLPKDEVAFEQYNAFSAKRL 325 FKFQ++QR+EVN PVP GLY+LTAL+VK+ FID+DSI +HLLPKD+ AFE Y++FS+K+L Sbjct: 240 FKFQHYQRLEVNDPVPFGLYKLTALLVKENFIDLDSICAHLLPKDDEAFEHYSSFSSKQL 299 Query: 324 DEANKIGKINLAATGKDLMDDEKQGDVTVDLFAAHDMEMVAVAERSSDLANNQPLGLLMG 145 DEANKIGKINLAATGKDLM+DEKQGDVT+DLFA+ DM+ VAV ERS++ NNQ LGLL G Sbjct: 300 DEANKIGKINLAATGKDLMEDEKQGDVTIDLFASLDMDSVAVGERSTEFENNQTLGLLTG 359 Query: 144 FLAVDDWFHAHQLLERLSPLNPVEYIEICSGLFRLIEKTIIQAYKLV 4 FLAVDDW+HA+ L +RLSPLNPVE+ +IC+ LFRLIEK+I AY +V Sbjct: 360 FLAVDDWYHANLLFDRLSPLNPVEHTQICNSLFRLIEKSISSAYDMV 406