BLASTX nr result
ID: Rheum21_contig00006362
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rheum21_contig00006362 (1451 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002277958.2| PREDICTED: thyroid adenoma-associated protei... 508 e-141 gb|EMJ16046.1| hypothetical protein PRUPE_ppa000039mg [Prunus pe... 486 e-134 ref|XP_004234801.1| PREDICTED: uncharacterized protein LOC101261... 480 e-133 ref|XP_006349572.1| PREDICTED: thyroid adenoma-associated protei... 475 e-131 gb|EXC20615.1| hypothetical protein L484_027170 [Morus notabilis] 474 e-131 ref|XP_006482571.1| PREDICTED: thyroid adenoma-associated protei... 474 e-131 ref|XP_006431126.1| hypothetical protein CICLE_v100108891mg, par... 472 e-130 gb|EOY03434.1| Uncharacterized protein TCM_018498 [Theobroma cacao] 459 e-126 ref|XP_004163531.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 459 e-126 ref|XP_004147469.1| PREDICTED: uncharacterized protein LOC101204... 459 e-126 ref|XP_002517489.1| conserved hypothetical protein [Ricinus comm... 448 e-123 ref|NP_191076.2| uncharacterized protein [Arabidopsis thaliana] ... 440 e-121 emb|CAB75750.1| putative protein [Arabidopsis thaliana] 440 e-121 ref|XP_006395331.1| hypothetical protein EUTSA_v10003503mg [Eutr... 435 e-119 ref|XP_002878022.1| hypothetical protein ARALYDRAFT_324042 [Arab... 435 e-119 ref|XP_006290484.1| hypothetical protein CARUB_v10016558mg [Caps... 431 e-118 gb|EPS68931.1| hypothetical protein M569_05834 [Genlisea aurea] 417 e-114 ref|XP_004489387.1| PREDICTED: thyroid adenoma-associated protei... 417 e-114 ref|XP_002305983.2| hypothetical protein POPTR_0004s13360g [Popu... 386 e-104 ref|XP_002445127.1| hypothetical protein SORBIDRAFT_07g004530 [S... 385 e-104 >ref|XP_002277958.2| PREDICTED: thyroid adenoma-associated protein homolog [Vitis vinifera] Length = 2223 Score = 508 bits (1309), Expect = e-141 Identities = 263/449 (58%), Positives = 324/449 (72%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAVVFPQ + +SL+ ST ++ +L SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAVVFPQSYVESLNS-----STSGIVPELNQLISLNSIYAQVDH 55 Query: 1166 AKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIGR 987 AK VAS+F LL DE+LIS AAR YLE+LFL+NS+PLHRT S LAK+RNF S+I Sbjct: 56 AKQVASAFTDLLLNCTDEALISEAARLYLEILFLENSLPLHRTLISVLAKTRNFQSVIRN 115 Query: 986 CFRALCEEYGGL-DKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSS 810 CFR+LC+EY GL +G+G+ FCVSR ALSMMS+PKLGYLV++V+EC +LVA+DIV GL+ Sbjct: 116 CFRSLCDEYCGLRSEGRGKRFCVSRVALSMMSSPKLGYLVEIVEECVVLVALDIVFGLNG 175 Query: 809 VITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXX 630 V++ETN W+RPSP+VMEQCQ+ALSC+YYLLQRFP++F +DS C Sbjct: 176 VVSETNGWSRPSPIVMEQCQEALSCMYYLLQRFPSKF-SDSSGCVGESS-----VLEMIV 229 Query: 629 XXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKDK 450 SRDCFVAAGV+ CAALQACLSPEE+G I+ +F ++ C + K Sbjct: 230 TAILSILKSLAFSRDCFVAAGVAFCAALQACLSPEEVGLFIMEGIFYQTNCYSANSGQSK 289 Query: 449 FIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHIDG 270 F + KV Y+GD+ E+ +F+ LSRLCLIRGILT VSR +L +QF+V + G G Sbjct: 290 FGDVILKVPYKGDVYTEICNFAVLSRLCLIRGILTAVSRTVLTSQFIVSRNDLNGFDPQG 349 Query: 269 GNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVTT 90 ++S VQTILY ILP+LCNYCENP DSHFNFHALTVMQICLQQIKTS+ A+L S Sbjct: 350 ISNSSVQTILYDGILPELCNYCENPTDSHFNFHALTVMQICLQQIKTSMSANLASVSENY 409 Query: 89 DLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 DL PED+ RI+RI+WNNLED L+QTVKQ Sbjct: 410 DLIPEDMGTRILRIIWNNLEDPLSQTVKQ 438 >gb|EMJ16046.1| hypothetical protein PRUPE_ppa000039mg [Prunus persica] Length = 2195 Score = 486 bits (1250), Expect = e-134 Identities = 248/449 (55%), Positives = 320/449 (71%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MS+KWRA+QHRHRYTY+ VVFP +T+SL+ LP ++S+LKFFS +KEL SL STY QL H Sbjct: 1 MSSKWRAIQHRHRYTYNTVVFPSSYTESLNSLPSQLSSLKFFSQLKELVSLNSTYAQLNH 60 Query: 1166 AKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIGR 987 K +A++FG LL T+ DE+ ++ A +YLELLFL+NS+PLH+T S LAK+R F +LIGR Sbjct: 61 TKGLAAAFGDLL-TNGDEATVAQVAPFYLELLFLENSLPLHKTLVSVLAKARTFQALIGR 119 Query: 986 CFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSSV 807 C+R LCE+YGG GKG+ FCVSR+ALS+M PKLG+LV +V+EC +L+A+D VS L+ + Sbjct: 120 CYRKLCEDYGG---GKGKRFCVSRSALSVMGMPKLGFLVQIVEECAVLIALDTVSSLNGL 176 Query: 806 ITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXXX 627 ++ET ARPSP+V+EQCQ+ALSCLYYLLQRFP++F + S + Sbjct: 177 VSETKGSARPSPIVIEQCQEALSCLYYLLQRFPSKF-EEFNSSRSGFDAGHSNVLEMSVT 235 Query: 626 XXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRST-CNLVGDSKDK 450 SRDC+VAAGVS CAALQ CLSPEELG I +F + +L +S+ + Sbjct: 236 VVLSILKSLAFSRDCYVAAGVSFCAALQVCLSPEELGLFIFEGIFHPTDYSSLDANSESE 295 Query: 449 FIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHIDG 270 A++KV Y+GD+ E+ + S LSRLCLIRGILT VSR +LN+ F + G + Sbjct: 296 KRNAIAKVPYKGDIYTEICNLSDLSRLCLIRGILTAVSRVVLNSHFDMSRGYSNGYEVHT 355 Query: 269 GNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVTT 90 + V+TILY ILP+LCNYCENP DSHFNFH LTV+QICLQQIKTS+LA+L S Sbjct: 356 NGGNCVKTILYDGILPELCNYCENPTDSHFNFHTLTVLQICLQQIKTSMLANLTIPSEHY 415 Query: 89 DLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D P ++ RI+RIVWNNLED L+QTVKQ Sbjct: 416 DPIPVEMGTRILRIVWNNLEDPLSQTVKQ 444 >ref|XP_004234801.1| PREDICTED: uncharacterized protein LOC101261303 [Solanum lycopersicum] Length = 2163 Score = 480 bits (1235), Expect = e-133 Identities = 254/450 (56%), Positives = 312/450 (69%), Gaps = 2/450 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP+ F ++L Q P ++ F+S++++ SL STY+QL H Sbjct: 1 MSAKWRALQHRHRYTYSAVIFPKSFIEALQQTP----SVHFYSELQQFVSLNSTYSQLNH 56 Query: 1166 AKSVASSFGALLST--SDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLI 993 AK +ASSF LLS +DDES IS A+R+YLE+L L+NS PLHRT S L K NF +LI Sbjct: 57 AKKLASSFSELLSNVKADDES-ISTASRFYLEILLLENSQPLHRTLLSVLVKCNNFHTLI 115 Query: 992 GRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLS 813 CFR +CEEYG G+ FCVSRAALSMMSTPKLGYLV++VDEC +LV +++V GLS Sbjct: 116 QNCFRQICEEYGE----NGKRFCVSRAALSMMSTPKLGYLVEIVDECAVLVGLNVVLGLS 171 Query: 812 SVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXX 633 SV+ E N W+RPSP+VMEQCQ+ALSC+YYLLQRFP++F G N Sbjct: 172 SVLAEINDWSRPSPVVMEQCQEALSCMYYLLQRFPSKFVN---------AGSNVLERILV 222 Query: 632 XXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKD 453 RDC VAAGVS C ALQ CLSP+E+G I+G +F S+ V SK Sbjct: 223 TVLSILKSESFS--RDCLVAAGVSFCVALQVCLSPQEIGLFIMGGIFNESS---VVCSKL 277 Query: 452 KFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHID 273 F L K+ ++G+L EL FS+LSRLCL+RGILT VSR +LN FVV D G Sbjct: 278 VFKGVLEKIPFKGNLVDELSKFSSLSRLCLVRGILTAVSRTVLNTGFVVSNDSFGSVRDS 337 Query: 272 GGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVT 93 G N ++ ILY +ILP+LCN+CENPIDSHF+FHALTVMQICLQQ+KTS+L V Sbjct: 338 GDNKKSIKMILYDAILPELCNFCENPIDSHFSFHALTVMQICLQQVKTSMLDKNGSLEVN 397 Query: 92 TDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 DL ED+ R+++IVWNNLED LNQTVKQ Sbjct: 398 YDLISEDIGTRLLQIVWNNLEDPLNQTVKQ 427 >ref|XP_006349572.1| PREDICTED: thyroid adenoma-associated protein homolog [Solanum tuberosum] Length = 2187 Score = 475 bits (1223), Expect = e-131 Identities = 249/449 (55%), Positives = 308/449 (68%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP+ F ++L Q P ++ F+S++++ SL STY+QL H Sbjct: 1 MSAKWRALQHRHRYTYSAVIFPKSFVEALQQTP----SVHFYSELRQFVSLNSTYSQLNH 56 Query: 1166 AKSVASSFGALLST-SDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIG 990 AK +ASSF LLS DE IS A+R+YLE+L L+NS PLHRT S L K NF +LI Sbjct: 57 AKKLASSFSELLSNVKADEESISTASRFYLEILLLENSQPLHRTLLSVLVKCNNFHTLIQ 116 Query: 989 RCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSS 810 CFR LCEEYG G+ FCVSR ALSMMSTPKLGYLV++VDEC +LV +++V GLSS Sbjct: 117 NCFRQLCEEYGE----NGKRFCVSRVALSMMSTPKLGYLVEIVDECAVLVGLNVVLGLSS 172 Query: 809 VITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXX 630 V+ + N W+RPSP+VMEQCQ+ALSC+YYLLQRFP++F G N Sbjct: 173 VLADINDWSRPSPVVMEQCQEALSCMYYLLQRFPSKFVN---------AGSNVLERILVI 223 Query: 629 XXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKDK 450 RDC VAAGVS C ALQ CLSP+ELG I+G +F +S+ + SK Sbjct: 224 VLSILKSESFS--RDCLVAAGVSFCVALQVCLSPQELGLFIMGGIFNQSS---IVCSKLA 278 Query: 449 FIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHIDG 270 F L K+ ++G+L EL FS+LSRLC++RGILT VSR +LN FVV D G G Sbjct: 279 FKDVLEKIPFKGNLVDELSKFSSLSRLCVVRGILTAVSRTVLNTGFVVSNDSFGSVRDSG 338 Query: 269 GNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVTT 90 N ++ ILY +ILP+LCN+CENPIDSHF+FHALTVMQICLQQ+KTS+L V Sbjct: 339 DNKKSIKMILYDAILPELCNFCENPIDSHFSFHALTVMQICLQQVKTSMLDKNGSLEVNY 398 Query: 89 DLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D ED+ R+++IVWNNLED LNQTVKQ Sbjct: 399 DPISEDIGTRLLQIVWNNLEDPLNQTVKQ 427 >gb|EXC20615.1| hypothetical protein L484_027170 [Morus notabilis] Length = 2199 Score = 474 bits (1219), Expect = e-131 Identities = 257/451 (56%), Positives = 315/451 (69%), Gaps = 3/451 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRA+QHRHRYTY+AVVFP + DS S + S KFFS +K SLTS + QL H Sbjct: 1 MSAKWRAIQHRHRYTYNAVVFPDSYADSFSTIS---SRNKFFSQLKLFTSLTSLHAQLNH 57 Query: 1166 AKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIGR 987 AKS+A SF LL T+D E L S+AA+ YL +LFLDNS+PLHRT S LAK+R F S+I Sbjct: 58 AKSLACSFSDLLLTAD-EPLASLAAKLYLRILFLDNSLPLHRTLVSDLAKARAFRSVISA 116 Query: 986 CFRALCEEYGG--LDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLS 813 CFR LC EYGG G G+ F VSR ALS+M PK+GYLVDVV+EC +LVA D+V L+ Sbjct: 117 CFRDLCAEYGGGGAGDGGGKRFRVSRTALSVMGMPKVGYLVDVVEECAVLVAWDVVGSLN 176 Query: 812 SVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXX 633 V++ET WARPSP+VMEQCQ+ALSCLYYLLQRFP++F E S++ G + Sbjct: 177 GVVSETERWARPSPIVMEQCQEALSCLYYLLQRFPSKFKDQDSE--SNVLGRS------- 227 Query: 632 XXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRST-CNLVGDSK 456 SRDC+VAAGVS CAALQ CLSPE+LG +II +F ++ CN S+ Sbjct: 228 LSVVLSILTSLSFSRDCYVAAGVSFCAALQVCLSPEDLGLVIIQGIFYQTVFCN----SE 283 Query: 455 DKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHI 276 + F A+ KV Y GDLC+E++SFS+LSRLC+IRGILT V RA+LN F V D Sbjct: 284 NDFENAVLKVPYDGDLCSEIRSFSSLSRLCVIRGILTAVPRAVLNTCFTVSGDSS----- 338 Query: 275 DGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESV 96 +TILY +LP+LCNYCENP DSHFNFHALTV+QICLQQIKTS+LA+L +S Sbjct: 339 --------RTILYDGVLPELCNYCENPTDSHFNFHALTVLQICLQQIKTSMLANLTIQSD 390 Query: 95 TTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D PE++ R++RI+WNNLED L+QTVKQ Sbjct: 391 NYDPIPEEMGTRVLRIIWNNLEDPLSQTVKQ 421 >ref|XP_006482571.1| PREDICTED: thyroid adenoma-associated protein homolog [Citrus sinensis] Length = 2224 Score = 474 bits (1219), Expect = e-131 Identities = 258/461 (55%), Positives = 319/461 (69%), Gaps = 13/461 (2%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPE--ISTLKFFSDMKELASLTSTYTQL 1173 MSAKWRALQHRHRYTYSAVVFP T+SL+Q+P S KF + +EL SL S Y Q+ Sbjct: 1 MSAKWRALQHRHRYTYSAVVFPTSLTESLTQIPSSQNSSFSKFHNAFRELVSLNSIYAQV 60 Query: 1172 THAKSVASSFGALLSTSD---DESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFG 1002 HAK ASSF LLS+++ DE ++S A R YLE++FL+NS+PLHRT SALAK R F Sbjct: 61 NHAKKFASSFIELLSSANAAADEWVLSKATRVYLEVMFLENSLPLHRTLVSALAKERKFQ 120 Query: 1001 SLIGRCFRALCEEYGGLDKG--KGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDI 828 +LI CFR LC+EYGG + + + FCVSR LS+MS PKLGYL+DV+ +C +LVA D+ Sbjct: 121 ALIVSCFRDLCDEYGGGGRASDQNKRFCVSRVVLSVMSLPKLGYLMDVIQDCAVLVAWDV 180 Query: 827 VSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXX 648 V GL+ V+ ET WARPSP+VMEQCQ+ALSCLYYLLQR +F L G Sbjct: 181 VLGLNGVVLETQEWARPSPIVMEQCQEALSCLYYLLQRCLDKF--------KGLSGQKES 232 Query: 647 XXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLV 468 SRDC+VAAGV+LCAALQ CL P+ELG +I +F + TC+ Sbjct: 233 IMEMIFVVLISILKSTAFSRDCYVAAGVALCAALQVCLGPQELGLFLIEGIFYQKTCSFS 292 Query: 467 GD-SKDKFIRALS----KVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVV- 306 + SK +F AL K + GD+C+E+ +FS LSRLCLIRGILT VSR +LNA F V Sbjct: 293 SEKSKSEFEDALQVCFRKTPFNGDVCSEIHNFSVLSRLCLIRGILTAVSRNVLNALFFVS 352 Query: 305 ETDGQGGSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTS 126 + D GS + G+DS +TILY+ ILP+LC+YCENP DSHFNFHALTV+QICLQQIKTS Sbjct: 353 KEDLSNGS--ENGDDS-AKTILYNGILPELCSYCENPTDSHFNFHALTVLQICLQQIKTS 409 Query: 125 LLADLIDESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 +LA+L + S D PED+ RI+RI+WNNLED L+QTVKQ Sbjct: 410 ILANLTNVSFDYDPIPEDMGTRILRIIWNNLEDPLSQTVKQ 450 >ref|XP_006431126.1| hypothetical protein CICLE_v100108891mg, partial [Citrus clementina] gi|557533183|gb|ESR44366.1| hypothetical protein CICLE_v100108891mg, partial [Citrus clementina] Length = 845 Score = 472 bits (1215), Expect = e-130 Identities = 258/464 (55%), Positives = 320/464 (68%), Gaps = 16/464 (3%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPE--ISTLKFFSDMKELASLTSTYTQL 1173 MSAKWRALQHRHRYTYSAVVFP T+SL+Q+P S KF ++ +EL SL S Y Q+ Sbjct: 1 MSAKWRALQHRHRYTYSAVVFPTSLTESLTQIPSSQNSSFSKFHNEFRELVSLNSIYAQV 60 Query: 1172 THAKSVASSFGALLSTSD---DESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFG 1002 HAK ASSF LLS+++ DE ++S A R YLE++FL+NS+P+HRT SALAK R F Sbjct: 61 NHAKKFASSFIELLSSANAAADEWVLSKATRVYLEVMFLENSLPMHRTLVSALAKERKFQ 120 Query: 1001 SLIGRCFRALCEEYGGLDKG-----KGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVA 837 +LI CFR LC+EYGG G + + FCVSR LS+MS PKLGYL+DV+ +C +LVA Sbjct: 121 ALIVSCFRDLCDEYGGGGGGGRASDQNKRFCVSRVVLSVMSLPKLGYLMDVIQDCAVLVA 180 Query: 836 VDIVSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGD 657 D+V GL+ V+ ET ARPSP+VMEQCQ+ALSCLYYLLQR P +F L G Sbjct: 181 WDVVLGLNGVVLETQERARPSPIVMEQCQEALSCLYYLLQRCPDKF--------KGLSGQ 232 Query: 656 NXXXXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTC 477 SRDC+VAAGV+LCAALQ CL P+ELG +I +F + TC Sbjct: 233 KESIMEMIFVVLISTLKSTAFSRDCYVAAGVALCAALQVCLGPQELGLFLIEGIFYQKTC 292 Query: 476 NLVGD-SKDKFIRALS----KVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQF 312 + + SK +F AL K + GD+C+E+ +FS LSRLCLIRGILT VSR +LNA F Sbjct: 293 SFSSEKSKSEFEDALQVCFRKTPFNGDVCSEIHNFSVLSRLCLIRGILTAVSRNVLNAIF 352 Query: 311 VV-ETDGQGGSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQI 135 V + D GS + G+DS +TILY+ ILP+LC+YCENP DSHFNFHALTV+QICLQQI Sbjct: 353 FVSKEDLSNGS--ENGDDS-AKTILYNGILPELCSYCENPTDSHFNFHALTVLQICLQQI 409 Query: 134 KTSLLADLIDESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 KTS+LA+L + S D PED+ RI+RI+WNNLED L+QTVKQ Sbjct: 410 KTSILANLTNVSFDYDPIPEDMGTRILRIIWNNLEDPLSQTVKQ 453 >gb|EOY03434.1| Uncharacterized protein TCM_018498 [Theobroma cacao] Length = 2221 Score = 459 bits (1181), Expect = e-126 Identities = 241/451 (53%), Positives = 306/451 (67%), Gaps = 3/451 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRA+QHRHRYTY+AVVFP F DSL+Q S+ F+++++ L SL STY+Q+ H Sbjct: 1 MSAKWRAIQHRHRYTYNAVVFPPSFIDSLNQSSLSASSPTFYTELQHLISLNSTYSQVNH 60 Query: 1166 AKSVASSFGALLSTSDD--ESLISVAARYYLELLFLDNSVPLHRTFASALAKSRN-FGSL 996 K VASSF LL + E L+S AA +YLE+ FL+NS+PLHRT S ++K+++ F + Sbjct: 61 VKKVASSFNKLLVKEGEKNEGLVSTAAAFYLEVFFLENSMPLHRTLLSVVSKTKDVFQPV 120 Query: 995 IGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGL 816 IG CFR LC EYG + + R F VSR ALS+M PKLG+LVDV++EC +LV DIV GL Sbjct: 121 IGECFRVLCNEYGRMTNKRNR-FSVSRVALSVMGMPKLGFLVDVIEECAVLVCWDIVLGL 179 Query: 815 SSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXX 636 SV+ ET WARPSP+V+EQCQ+ALSCLYYL Q+FP +F L ++ Sbjct: 180 KSVVLETEEWARPSPIVLEQCQEALSCLYYLFQKFPGKF--------KDLDTEDSNVMEM 231 Query: 635 XXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSK 456 SRDCFVAAGVS AALQ CLS +ELG II +F + N +S+ Sbjct: 232 ALGVLISVLKSVAFSRDCFVAAGVSFFAALQVCLSDQELGLFIIEGIFDQIVSNSGTNSE 291 Query: 455 DKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHI 276 D F +SKV Y+GD+C ++++ L+RLCLIRGILT V R +LN FVV + Sbjct: 292 DSFSNVISKVPYKGDVCLDIRNLLVLNRLCLIRGILTAVPRMVLNTNFVVSREIFNDFES 351 Query: 275 DGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESV 96 G S ++TILY ILP+LCNYCENP DSHFNFHALTVMQICLQQIKTS+LA+L + S Sbjct: 352 VGNIVSSLKTILYDGILPELCNYCENPTDSHFNFHALTVMQICLQQIKTSMLANLTNASE 411 Query: 95 TTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 + PED+ R++RI+WNNLED L+QTVKQ Sbjct: 412 EYNPLPEDMGTRMLRIIWNNLEDPLSQTVKQ 442 >ref|XP_004163531.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101204483 [Cucumis sativus] Length = 2186 Score = 459 bits (1181), Expect = e-126 Identities = 244/450 (54%), Positives = 310/450 (68%), Gaps = 2/450 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSA+VFP + DSL+ S+ KFF+++ +L SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSYVDSLNSFQ---SSSKFFTELLQLVSLNSVYAQVNH 57 Query: 1166 AKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIGR 987 AK VAS+F LL+ D++S +S AAR+YLE+LF +NS PLHRT S LAKSR F +G Sbjct: 58 AKKVASAFSELLANGDEDS-VSKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFHDPLGE 116 Query: 986 CFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSSV 807 CFR LCEE+ G+ +G + FCVSR ALS+M PKLGYLVDV+ +C +LVA DIVS L V Sbjct: 117 CFRDLCEEHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYV 176 Query: 806 ITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXXX 627 + ETN ARPSP++MEQCQ+ALSCLYYLLQRFP++F D G + Sbjct: 177 VKETNESARPSPIIMEQCQEALSCLYYLLQRFPSKFQEDFGVLGMIVSS----------- 225 Query: 626 XXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKDKF 447 SRDC+VAAGVS CA+LQ CL+ EELG LI + ++ +F Sbjct: 226 -ILSILKSLAFSRDCYVAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHISFLKYDSEF 284 Query: 446 IRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQF-VVETDGQGGSH-ID 273 + KV +Q ++CAE+++FS LSRLCLIRGILT + R +LN F +VE D G ++ Sbjct: 285 RNTVGKVPHQANVCAEIRTFSVLSRLCLIRGILTAIPRPVLNIPFSMVEGDSNGHPGCLN 344 Query: 272 GGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVT 93 GN V+TILY ILP+LCNYCENP DSHFNFH+LTV+QICLQQIKTSL+++L D S + Sbjct: 345 SGNS--VKTILYDGILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCS 402 Query: 92 TDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D PE++ RI+ I+W NL+D L+QTVKQ Sbjct: 403 YDPLPEEMGSRILSIMWTNLDDPLSQTVKQ 432 >ref|XP_004147469.1| PREDICTED: uncharacterized protein LOC101204483 [Cucumis sativus] Length = 2184 Score = 459 bits (1180), Expect = e-126 Identities = 244/450 (54%), Positives = 310/450 (68%), Gaps = 2/450 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSA+VFP + DSL+ S+ KFF+++ +L SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSYVDSLNSFQ---SSSKFFTELLQLVSLNSVYAQVNH 57 Query: 1166 AKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIGR 987 AK VAS+F LL+ D++S +S AAR+YLE+LF +NS PLHRT S LAKSR F +G Sbjct: 58 AKKVASAFSELLANGDEDS-VSKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFHDPLGE 116 Query: 986 CFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSSV 807 CFR LCEE+ G+ +G + FCVSR ALS+M PKLGYLVDV+ +C +LVA DIVS L V Sbjct: 117 CFRDLCEEHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYV 176 Query: 806 ITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXXX 627 + ETN ARPSP++MEQCQ+ALSCLYYLLQRFP++F D G + Sbjct: 177 VKETNESARPSPIIMEQCQEALSCLYYLLQRFPSKFQEDFGVLGMIVSS----------- 225 Query: 626 XXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKDKF 447 SRDC+VAAGVS CA+LQ CL+ EELG LI + ++ +F Sbjct: 226 -ILSILKSLAFSRDCYVAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHIPFLKYDSEF 284 Query: 446 IRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQF-VVETDGQGGSH-ID 273 + KV +Q ++CAE+++FS LSRLCLIRGILT + R +LN F +VE D G ++ Sbjct: 285 RNTVGKVPHQANVCAEIRTFSVLSRLCLIRGILTAIPRPVLNIPFSMVEGDSNGHPGCLN 344 Query: 272 GGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVT 93 GN V+TILY ILP+LCNYCENP DSHFNFH+LTV+QICLQQIKTSL+++L D S + Sbjct: 345 SGNS--VKTILYDGILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCS 402 Query: 92 TDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D PE++ RI+ I+W NL+D L+QTVKQ Sbjct: 403 YDPLPEEMGSRILSIMWTNLDDPLSQTVKQ 432 >ref|XP_002517489.1| conserved hypothetical protein [Ricinus communis] gi|223543500|gb|EEF45031.1| conserved hypothetical protein [Ricinus communis] Length = 2190 Score = 448 bits (1153), Expect = e-123 Identities = 248/456 (54%), Positives = 301/456 (66%), Gaps = 8/456 (1%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQ--LPPEISTLKFFSDMKELASLTSTYTQL 1173 MSAKWRA+QHRHRYTYSAV+FP FTDSLSQ LP +L FF+ + L SLTS Y+QL Sbjct: 1 MSAKWRAIQHRHRYTYSAVIFPSSFTDSLSQSLLPLNPKSLPFFNQLNNLVSLTSIYSQL 60 Query: 1172 THAKSVASSFGALLSTSDDESLISVAARYYLELLFLDNSVPLHRTFASALAK--SRNFGS 999 LFL+NS+PLHRT SAL+K ++++ S Sbjct: 61 ---------------------------------LFLENSLPLHRTLVSALSKVSNKDYQS 87 Query: 998 LIGRCFRALCEEYGGLD--KGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIV 825 L+ CFR +CEEYG D + K + FC+SR ALS++ PKL YLVDV+++C +LVA D+V Sbjct: 88 LVCGCFREICEEYGSGDGKEYKSKRFCLSRVALSILGMPKLVYLVDVIEDCAVLVAWDVV 147 Query: 824 SGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXX 645 GL SV+ E WARPSP+VMEQCQ+ALSC YYLLQRFP +F D G Sbjct: 148 LGLDSVLLEIQDWARPSPIVMEQCQEALSCSYYLLQRFPDKFKEDL----EGFDGVEFNI 203 Query: 644 XXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVG 465 SRDCFVAAGVSLCAALQ CLS +ELG II +F ++TCN+ G Sbjct: 204 MERILLVLISLLKSMAFSRDCFVAAGVSLCAALQVCLSAQELGLFIIQGIFSQTTCNVYG 263 Query: 464 DSKD--KFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQ 291 ++ D +F AL KV ++GDL +E+ SFS LSRLCLIRGILT VSR +LN QFV + Sbjct: 264 NNCDGGEFRDALLKVPFKGDLISEVGSFSVLSRLCLIRGILTAVSRTVLNLQFVESSSKL 323 Query: 290 GGSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADL 111 G +G S V+TILY ILP+LCNYCENPIDSHFNFH LTVMQICLQQ+KTSLLA+L Sbjct: 324 NGHEGNGTCASSVKTILYDGILPELCNYCENPIDSHFNFHTLTVMQICLQQMKTSLLANL 383 Query: 110 IDESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 ID S D PE++ RI+RI+WNNLED L+QTVKQ Sbjct: 384 IDLSDNYDPMPEEMGSRILRIIWNNLEDPLSQTVKQ 419 >ref|NP_191076.2| uncharacterized protein [Arabidopsis thaliana] gi|332645826|gb|AEE79347.1| uncharacterized protein AT3G55160 [Arabidopsis thaliana] Length = 2130 Score = 440 bits (1131), Expect = e-121 Identities = 237/455 (52%), Positives = 301/455 (66%), Gaps = 7/455 (1%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP FT SLSQ S KF+S+++EL SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAVLFPSSFTASLSQSSLSQSCPKFYSNIEELVSLNSIYAQVNH 60 Query: 1166 AKSVASSFGALLSTSDDES-----LISV--AARYYLELLFLDNSVPLHRTFASALAKSRN 1008 AK V +SFG L+ +++ +SV A R+YLE+LF++NS+PLH+T SALAK+ Sbjct: 61 AKKVVASFGEFLAKANENEGGERETVSVREAIRFYLEILFMENSLPLHKTLVSALAKTTK 120 Query: 1007 FGSLIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDI 828 F S+I CF+ LC+EYGG + G G FCVSR ALS+M PKLGYLVD++++C +LV DI Sbjct: 121 FHSVISSCFKELCDEYGGFEDG-GNRFCVSRVALSVMGMPKLGYLVDIIEDCALLVGYDI 179 Query: 827 VSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXX 648 VSGL+ ++ +T + RP P VMEQCQ+ALSC YYL QRFP +F L G++ Sbjct: 180 VSGLNGIVLDTEACDRPPPTVMEQCQEALSCSYYLFQRFPLKF--------KGLVGEDAS 231 Query: 647 XXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLV 468 SRDC+VAAGVS CAALQ CL EELG I C+FC S+ + Sbjct: 232 FMESVLAVQVSILKSLAFSRDCYVAAGVSFCAALQVCLKDEELGLFIAQCIFCWSSVVRL 291 Query: 467 GDSKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQG 288 D +SK+ + GD+C+E+ SFS+LSRLCLIRGILT VSR IL + F ++ Sbjct: 292 AD-------IVSKIPFAGDICSEICSFSSLSRLCLIRGILTTVSRGILVSSFARLSN--- 341 Query: 287 GSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLI 108 +D +TILY IL +LC+ CENPIDSH NFH LTVMQIC+QQIKTS+L DL Sbjct: 342 -------SDCDHKTILYDGILLELCDLCENPIDSHLNFHVLTVMQICMQQIKTSMLTDL- 393 Query: 107 DESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 S D P+ + R++RI+WNNLED L+QTVKQ Sbjct: 394 --SEGYDPMPDSMAARVLRIIWNNLEDPLSQTVKQ 426 >emb|CAB75750.1| putative protein [Arabidopsis thaliana] Length = 2149 Score = 440 bits (1131), Expect = e-121 Identities = 237/455 (52%), Positives = 301/455 (66%), Gaps = 7/455 (1%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP FT SLSQ S KF+S+++EL SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAVLFPSSFTASLSQSSLSQSCPKFYSNIEELVSLNSIYAQVNH 60 Query: 1166 AKSVASSFGALLSTSDDES-----LISV--AARYYLELLFLDNSVPLHRTFASALAKSRN 1008 AK V +SFG L+ +++ +SV A R+YLE+LF++NS+PLH+T SALAK+ Sbjct: 61 AKKVVASFGEFLAKANENEGGERETVSVREAIRFYLEILFMENSLPLHKTLVSALAKTTK 120 Query: 1007 FGSLIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDI 828 F S+I CF+ LC+EYGG + G G FCVSR ALS+M PKLGYLVD++++C +LV DI Sbjct: 121 FHSVISSCFKELCDEYGGFEDG-GNRFCVSRVALSVMGMPKLGYLVDIIEDCALLVGYDI 179 Query: 827 VSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXX 648 VSGL+ ++ +T + RP P VMEQCQ+ALSC YYL QRFP +F L G++ Sbjct: 180 VSGLNGIVLDTEACDRPPPTVMEQCQEALSCSYYLFQRFPLKF--------KGLVGEDAS 231 Query: 647 XXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLV 468 SRDC+VAAGVS CAALQ CL EELG I C+FC S+ + Sbjct: 232 FMESVLAVQVSILKSLAFSRDCYVAAGVSFCAALQVCLKDEELGLFIAQCIFCWSSVVRL 291 Query: 467 GDSKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQG 288 D +SK+ + GD+C+E+ SFS+LSRLCLIRGILT VSR IL + F ++ Sbjct: 292 AD-------IVSKIPFAGDICSEICSFSSLSRLCLIRGILTTVSRGILVSSFARLSN--- 341 Query: 287 GSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLI 108 +D +TILY IL +LC+ CENPIDSH NFH LTVMQIC+QQIKTS+L DL Sbjct: 342 -------SDCDHKTILYDGILLELCDLCENPIDSHLNFHVLTVMQICMQQIKTSMLTDL- 393 Query: 107 DESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 S D P+ + R++RI+WNNLED L+QTVKQ Sbjct: 394 --SEGYDPMPDSMAARVLRIIWNNLEDPLSQTVKQ 426 >ref|XP_006395331.1| hypothetical protein EUTSA_v10003503mg [Eutrema salsugineum] gi|567141372|ref|XP_006395332.1| hypothetical protein EUTSA_v10003503mg [Eutrema salsugineum] gi|557091970|gb|ESQ32617.1| hypothetical protein EUTSA_v10003503mg [Eutrema salsugineum] gi|557091971|gb|ESQ32618.1| hypothetical protein EUTSA_v10003503mg [Eutrema salsugineum] Length = 2122 Score = 435 bits (1119), Expect = e-119 Identities = 237/453 (52%), Positives = 296/453 (65%), Gaps = 6/453 (1%) Frame = -1 Query: 1343 SAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTHA 1164 SAKWRALQHRHRYTYSA+VFP FT SLSQ KF+SD+ EL SL S Y Q+ HA Sbjct: 3 SAKWRALQHRHRYTYSAIVFPSSFTVSLSQSSLSQCCPKFYSDIAELVSLNSIYAQVNHA 62 Query: 1163 KSVASSFGALLSTSDDES------LISVAARYYLELLFLDNSVPLHRTFASALAKSRNFG 1002 K V +SFG +L+ + + + A R+YLE+LF++NS+PLH+T SALAK+R F Sbjct: 63 KKVVASFGEILAKTHENEGEREAVFVREAIRFYLEVLFMENSLPLHKTLVSALAKTRKFH 122 Query: 1001 SLIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVS 822 S+I CFR LC+ YG L+ G G+ FCVSR ALS+M PKLGYLVD++++C ILV D+VS Sbjct: 123 SVISSCFRELCDGYGDLEDG-GKRFCVSRVALSVMGMPKLGYLVDIIEDCAILVGRDVVS 181 Query: 821 GLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXX 642 GL+ +I ET + ARP P VMEQCQ+ALSC YYL QRFP++F L G++ Sbjct: 182 GLNGIILETEACARPPPTVMEQCQEALSCSYYLFQRFPSKF--------KGLVGEDASFM 233 Query: 641 XXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGD 462 SRDC VAAGVSLCAALQ CL+ EELG I +FC S Sbjct: 234 ESIFAVQISILKSAAFSRDCCVAAGVSLCAALQVCLNDEELGLFIAQGIFCWSNVG---- 289 Query: 461 SKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGS 282 +F + K+ + GD+ E+ SFSALSRLCLIRGILT VSR +L + F ++ Sbjct: 290 ---RFTDIVGKIPFAGDIWLEICSFSALSRLCLIRGILTAVSRGVLVSSFARLSN----- 341 Query: 281 HIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDE 102 +D +TILY ILP+LC+ CENPIDSH NFHALTVMQICLQQIKTS L DL ++ Sbjct: 342 -----SDCDHKTILYDGILPELCDLCENPIDSHLNFHALTVMQICLQQIKTSTLNDLSED 396 Query: 101 SVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 D P+ R+++I+WNNLED L+QTVKQ Sbjct: 397 ---YDPMPDSKVTRVLKIIWNNLEDPLSQTVKQ 426 >ref|XP_002878022.1| hypothetical protein ARALYDRAFT_324042 [Arabidopsis lyrata subsp. lyrata] gi|297323860|gb|EFH54281.1| hypothetical protein ARALYDRAFT_324042 [Arabidopsis lyrata subsp. lyrata] Length = 2128 Score = 435 bits (1119), Expect = e-119 Identities = 235/455 (51%), Positives = 299/455 (65%), Gaps = 7/455 (1%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP FT SLS+ S KF+SD+ EL SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAVLFPSSFTASLSKSSLSQSCPKFYSDIGELVSLNSIYAQVNH 60 Query: 1166 AKSVASSFGALLSTSDDES-------LISVAARYYLELLFLDNSVPLHRTFASALAKSRN 1008 AK +SFG +L+ + + + A R+YLE+LF++NS+PLH+T SALAK+ Sbjct: 61 AKKFVASFGEVLAKTHENEGGEREAVFVREAIRFYLEVLFMENSLPLHKTLVSALAKTSK 120 Query: 1007 FGSLIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDI 828 F S+I CF+ LC+EYGGL+ G G FCVSR ALS+M PKLGYLVD++++C +LV DI Sbjct: 121 FHSVISSCFKELCDEYGGLEDG-GNRFCVSRVALSVMGMPKLGYLVDIIEDCALLVGHDI 179 Query: 827 VSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXX 648 VSGL+ +I +T + RP P VMEQCQ+ALSC YYL QRFP +F L G++ Sbjct: 180 VSGLNGIILDTEACDRPPPTVMEQCQEALSCSYYLFQRFPLKF--------KGLIGEDAC 231 Query: 647 XXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLV 468 SRDC+VAAGVS CAALQ CL EELG I +FC S+ + Sbjct: 232 FMESVLAVQVSILKSPAFSRDCYVAAGVSFCAALQVCLKDEELGLFIAQGIFCWSSVVRL 291 Query: 467 GDSKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQG 288 D +SK+ + GD+C+E+ SFS+LSRLCLIRGILT VSR IL + F ++ Sbjct: 292 TD-------IISKIPFAGDICSEICSFSSLSRLCLIRGILTTVSRGILVSSFARLSN--- 341 Query: 287 GSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLI 108 +D +TILY IL +LC+ CENPIDSH NFHALTVMQIC+QQIKTS+L DL Sbjct: 342 -------SDCDHKTILYDGILLELCDLCENPIDSHLNFHALTVMQICMQQIKTSMLTDLS 394 Query: 107 DESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 ++ D P+ + R++RI+WNNLED L+QTVKQ Sbjct: 395 ED---CDPMPDSMAARVLRIIWNNLEDPLSQTVKQ 426 >ref|XP_006290484.1| hypothetical protein CARUB_v10016558mg [Capsella rubella] gi|565465042|ref|XP_006290485.1| hypothetical protein CARUB_v10016558mg [Capsella rubella] gi|482559191|gb|EOA23382.1| hypothetical protein CARUB_v10016558mg [Capsella rubella] gi|482559192|gb|EOA23383.1| hypothetical protein CARUB_v10016558mg [Capsella rubella] Length = 1949 Score = 431 bits (1107), Expect = e-118 Identities = 236/455 (51%), Positives = 301/455 (66%), Gaps = 7/455 (1%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWRALQHRHRYTYSAV+FP F SLSQ S KF+S+++EL SL S Y Q+ H Sbjct: 1 MSAKWRALQHRHRYTYSAVLFPSSFAASLSQSSLFQSCPKFYSNIEELVSLNSIYAQVNH 60 Query: 1166 AKSVASSFGALLSTSDDES-----LISV--AARYYLELLFLDNSVPLHRTFASALAKSRN 1008 AK + +SFG +L+ + + +SV A R+YLE+LF++NS+PLH+T SALAK+ Sbjct: 61 AKKIVASFGEVLAKNHENEGGEREAVSVREAIRFYLEILFMENSLPLHKTLVSALAKTSK 120 Query: 1007 FGSLIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDI 828 F S+I CF+ LC+EYGGL+ G G FCVSR ALS+M PKLGYLVD++++C +LV DI Sbjct: 121 FHSVISSCFKELCDEYGGLEDG-GNRFCVSRVALSVMGMPKLGYLVDIIEDCALLVGHDI 179 Query: 827 VSGLSSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXX 648 VSGL+ +I ++ + RP P VMEQCQ+ALSC YYL QRFP +F L G++ Sbjct: 180 VSGLNGIILDSEACDRPPPTVMEQCQEALSCSYYLFQRFPLKF--------KGLIGEDAN 231 Query: 647 XXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLV 468 SRDC+VAAGVS CAALQ CL EELG I +FC S+ + Sbjct: 232 FMESVLAVQLSILKSSAFSRDCYVAAGVSFCAALQVCLKDEELGLFIAQGIFCWSSVVTL 291 Query: 467 GDSKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQG 288 D +SK+ + GD+C+E+ SFS+LSRLCLIRGILT VSR IL + F ++ Sbjct: 292 TD-------IVSKIPFAGDICSEICSFSSLSRLCLIRGILTTVSRGILVSSFGRLSN--- 341 Query: 287 GSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLI 108 +D +TILY IL +LC+ CENPIDSH NFHALTVMQICLQQIKTS+L DL Sbjct: 342 -------SDCDHKTILYDGILLELCDLCENPIDSHLNFHALTVMQICLQQIKTSMLTDLS 394 Query: 107 DESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 ++ D P+ + RI+R +WNNLED L+QTVKQ Sbjct: 395 ED---YDPMPDSMTARILRTIWNNLEDPLSQTVKQ 426 >gb|EPS68931.1| hypothetical protein M569_05834 [Genlisea aurea] Length = 2127 Score = 417 bits (1073), Expect = e-114 Identities = 238/451 (52%), Positives = 291/451 (64%), Gaps = 3/451 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 M AKWRALQHRHRYTY AVVFP F ++L+ FF +++ LA L STY+QL + Sbjct: 1 MPAKWRALQHRHRYTYGAVVFPPSFIEALNGAS---YGFHFFEELRHLADLNSTYSQLEN 57 Query: 1166 AKSVASSFGALLS--TSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLI 993 K +A +F +LLS SD E ++ A R YLE+ FL+NS+PLHRT ASALAK RNF S+I Sbjct: 58 VKKLALAFSSLLSDPNSDGEPVVC-AVRLYLEIFFLENSLPLHRTLASALAKCRNFRSVI 116 Query: 992 GRCFRALCEEY-GGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGL 816 CFR LCEEY GG G G+ FCVSRAALSMM TPKLGYLV+VV++C LV D+V GL Sbjct: 117 EGCFRKLCEEYCGGGCWGNGKRFCVSRAALSMMCTPKLGYLVEVVEQCAPLVGSDVVWGL 176 Query: 815 SSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXX 636 SVI ETN +RPSP+VMEQCQ+ALSC+YYL QRFP++F + + L DN Sbjct: 177 QSVIDETNELSRPSPIVMEQCQEALSCMYYLFQRFPSKFLNIDVQY-NGLCFDNSSVLEM 235 Query: 635 XXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSK 456 RDC VAAGVSL AAL CLS +E+G II +F + L +S Sbjct: 236 AILSVLSILKSQFFPRDCLVAAGVSLFAALHVCLSNDEIGLFIIRGIF--NQTELGSNSI 293 Query: 455 DKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHI 276 D+F + ++ Y+GDL E+ LSRLCLIRGILT VSR +L+ +VV + S Sbjct: 294 DEFSAVVRRIPYKGDLVREILDVLPLSRLCLIRGILTAVSREVLDTHYVVSCEYLSDS-- 351 Query: 275 DGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESV 96 S +TI+Y +ILP+LC Y ENP D+H NFHALTVMQICLQQIKT L Sbjct: 352 ----KSTTKTIIYDAILPELCVYAENPCDTHSNFHALTVMQICLQQIKTLLQGSACSFPD 407 Query: 95 TTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 E++ RIIRIVWNNLED L+QTVKQ Sbjct: 408 GYTPISEEMETRIIRIVWNNLEDPLSQTVKQ 438 >ref|XP_004489387.1| PREDICTED: thyroid adenoma-associated protein homolog [Cicer arietinum] Length = 2209 Score = 417 bits (1071), Expect = e-114 Identities = 232/458 (50%), Positives = 302/458 (65%), Gaps = 10/458 (2%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MS+KWRALQHRHRYTY+AVVFP F +SL P S+ F ++ +LTSTY+QL+H Sbjct: 1 MSSKWRALQHRHRYTYNAVVFPSSFLNSLHHHNPNPSS-PFILNLLHFTTLTSTYSQLSH 59 Query: 1166 AKSVASSFGALLSTS---DDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSL 996 +K++ASSF LL++ + E I++A++ YLE+LFL+NS PLHRT S L K +NF + Sbjct: 60 SKTLASSFLNLLNSEPSPNSEPEITIASKLYLEILFLENSSPLHRTLLSILIKVKNFHEI 119 Query: 995 IGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGL 816 + CF+ L E+Y GKGR F VSR ALS+M KLGYL DVV+ C +LVA D+V GL Sbjct: 120 LSGCFQKLMEDYSF---GKGRQFTVSRVALSVMGMSKLGYLNDVVEVCAVLVAGDVVRGL 176 Query: 815 SSVITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXX 636 + V+ ET+S RPSP VMEQCQ+ LSCLYYLLQ+FP +F +GE + G D Sbjct: 177 NGVVLETDS--RPSPTVMEQCQEGLSCLYYLLQKFPLKFGCQNGEIENGFGIDGFSSVME 234 Query: 635 XXXXXXXXXXXXXXS-RDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCN----- 474 RDCFVAAGV+LCAA Q C++ +ELG +++ +F N Sbjct: 235 GIVSVVLSLMGSDGFSRDCFVAAGVALCAAFQVCVTSQELGLVLMQGIFNLKVSNSISVG 294 Query: 473 LVGDSKDKFIRALSKVSYQGD-LCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETD 297 +V +F+ A+ K+ GD + + S LSR+CLIRGILT VSR +LN QF V Sbjct: 295 IVDCCDSEFMNAVRKIPCIGDDVYCRICRLSVLSRICLIRGILTAVSRNLLNTQFSVVNG 354 Query: 296 GQGGSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLA 117 + G DG S+ +TILY ILP+LC +CENP+DSHFNFHALTVMQICLQQIK S++ Sbjct: 355 CEDGD--DGVVGSVNKTILYDGILPELCMHCENPVDSHFNFHALTVMQICLQQIKASMIL 412 Query: 116 DLIDESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 +L D SV D PE++ +RI+RI+WNNLED L+QTVKQ Sbjct: 413 NLTDLSVDYDPIPEEMGMRILRIIWNNLEDPLSQTVKQ 450 >ref|XP_002305983.2| hypothetical protein POPTR_0004s13360g [Populus trichocarpa] gi|550340925|gb|EEE86494.2| hypothetical protein POPTR_0004s13360g [Populus trichocarpa] Length = 2004 Score = 386 bits (992), Expect = e-104 Identities = 222/459 (48%), Positives = 284/459 (61%), Gaps = 11/459 (2%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSL---SQLPPEISTLKFFSDMKELASLTSTYTQ 1176 MSAKWRALQHRHRYTYSAV+FP FTD+L S LP + FF+ +K L SL S Y+Q Sbjct: 1 MSAKWRALQHRHRYTYSAVIFPSSFTDTLLSQSLLPLNPNFSLFFTQLKTLISLNSIYSQ 60 Query: 1175 LTHAKSVASSFGALLS---TSDDESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNF 1005 + H+K++ASSF LLS T +D ++ A R Y+E+LFL+NSVPLHRT S L+K N Sbjct: 61 VNHSKNLASSFTNLLSLIHTENDTPILQTACRLYVEVLFLENSVPLHRTLISGLSKVSNK 120 Query: 1004 GS--LIGRCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVD 831 LI CFR LCEEY K FC+SR ALS+M PKLG+L+ VV +C +L+ D Sbjct: 121 DRQVLIVECFRDLCEEYKKWSNRK--RFCLSRVALSIMGMPKLGFLISVVGDCAVLIGWD 178 Query: 830 IVSGLSSVITETNS-WARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDN 654 +V GL SV +E RPSP+VMEQCQ++LSCLYYL+QRFP F Sbjct: 179 VVLGLDSVFSEIEDLGGRPSPVVMEQCQESLSCLYYLIQRFPGTF--------------- 223 Query: 653 XXXXXXXXXXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYL--IIGCVFCRST 480 CF EE+G++ ++G + S Sbjct: 224 ----------------------KCF-----------------EEVGFMERVLGVLV--SV 242 Query: 479 CNLVGDSKDKFIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVET 300 N + +F + KV ++GDLC E+ FS LSRLCLIRGILT VSRA+LN+QFVV + Sbjct: 243 LNGTNCFESEFRDVILKVPFKGDLCFEINGFSGLSRLCLIRGILTAVSRAVLNSQFVVSS 302 Query: 299 DGQGGSHIDGGNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLL 120 G + +G V+TILY ILP+LCNYCENPIDSHFNFHALTV+QICLQQ+KTS+L Sbjct: 303 GGLNVNEENGNCCGSVKTILYDGILPELCNYCENPIDSHFNFHALTVLQICLQQMKTSML 362 Query: 119 ADLIDESVTTDLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 ++L D S + P ++ RI++I+WN+LED L+QTVKQ Sbjct: 363 SNLTDISNNYEPIPVEMGTRILKIIWNSLEDPLSQTVKQ 401 >ref|XP_002445127.1| hypothetical protein SORBIDRAFT_07g004530 [Sorghum bicolor] gi|241941477|gb|EES14622.1| hypothetical protein SORBIDRAFT_07g004530 [Sorghum bicolor] Length = 2121 Score = 385 bits (988), Expect = e-104 Identities = 206/449 (45%), Positives = 287/449 (63%), Gaps = 1/449 (0%) Frame = -1 Query: 1346 MSAKWRALQHRHRYTYSAVVFPQHFTDSLSQLPPEISTLKFFSDMKELASLTSTYTQLTH 1167 MSAKWR+LQHRHRYTY+++VFP+H+ D+L+ +P E+S+ FF + +L SLTSTY+Q+ Sbjct: 1 MSAKWRSLQHRHRYTYTSIVFPKHYHDALAVVPAEVSSSDFFVRLNDLISLTSTYSQVVA 60 Query: 1166 AKSVASSFGALLSTSD-DESLISVAARYYLELLFLDNSVPLHRTFASALAKSRNFGSLIG 990 K +AS++ LST+ + + A + YLE+LFL+NS+PLHRT S LAK + F + I Sbjct: 61 VKDLASAYVQFLSTTGIPDDAVLAATKLYLEILFLENSLPLHRTLISVLAKCKKFSTAIS 120 Query: 989 RCFRALCEEYGGLDKGKGRSFCVSRAALSMMSTPKLGYLVDVVDECCILVAVDIVSGLSS 810 CF LCEEYGG + F VSRAALS++ PKLG+L + V+ C ++A+D+V GL Sbjct: 121 GCFALLCEEYGGSGSKAKKRFMVSRAALSLIGYPKLGFLDEAVERCAEIMALDVVDGLDG 180 Query: 809 VITETNSWARPSPLVMEQCQDALSCLYYLLQRFPTRFFADSGECPSSLGGDNXXXXXXXX 630 V + + +RPSP+VMEQCQ+A+SC+YYLLQR+P++F +G +S Sbjct: 181 VTNDIDEGSRPSPVVMEQCQEAMSCMYYLLQRYPSKF---TGLDKAS------SVFKSAV 231 Query: 629 XXXXXXXXXXXXSRDCFVAAGVSLCAALQACLSPEELGYLIIGCVFCRSTCNLVGDSKDK 450 SRDC VA+GVS CAA+Q +S EE+ + I +F ++ + +D+ Sbjct: 232 RTILSVLKSSAFSRDCLVASGVSFCAAIQVFMSSEEISWFIYQGLF-----DICANHEDR 286 Query: 449 FIRALSKVSYQGDLCAELQSFSALSRLCLIRGILTGVSRAILNAQFVVETDGQGGSHIDG 270 +++ V DLC +++ S LSRLCL+RGILT + R +LN + + H G Sbjct: 287 KNQSVHNVFSDFDLCEQIRDLSVLSRLCLLRGILTSIPRTVLNMRQL---------HSSG 337 Query: 269 GNDSLVQTILYHSILPDLCNYCENPIDSHFNFHALTVMQICLQQIKTSLLADLIDESVTT 90 + TILY ILP+LC +CENP+DSHFNFHALTV QICLQQIKTS+ +D D S Sbjct: 338 P----LWTILYDGILPELCKHCENPVDSHFNFHALTVTQICLQQIKTSISSDFTDFSGDY 393 Query: 89 DLFPEDLRVRIIRIVWNNLEDSLNQTVKQ 3 F D+ RI+ I+W NLED L+QTVKQ Sbjct: 394 KPFSRDVVNRILGIIWRNLEDPLSQTVKQ 422