BLASTX nr result
ID: Mentha24_contig00023279
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00023279
         (1350 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU39621.1| hypothetical protein MIMGU_mgv1a0000302mg, partia...   494   e-137
gb|EYU23596.1| hypothetical protein MIMGU_mgv1a025361mg [Mimulus...   493   e-136
ref|XP_003634725.1| PREDICTED: uncharacterized protein LOC100264...   459   e-126
ref|XP_006344824.1| PREDICTED: uncharacterized protein LOC102599...   457   e-126
ref|XP_006475162.1| PREDICTED: uncharacterized protein LOC102613...   449   e-123
ref|XP_006475161.1| PREDICTED: uncharacterized protein LOC102613...   449   e-123
ref|XP_004233937.1| PREDICTED: uncharacterized protein LOC101258...   447   e-123
ref|XP_006370696.1| hypothetical protein POPTR_0001s44980g [Popu...   439   e-120
emb|CAN83957.1| hypothetical protein VITISV_039906 [Vitis vinifera]   429   e-117
ref|XP_007022465.1| Uncharacterized protein isoform 1 [Theobroma...   427   e-117
gb|EXB50294.1| hypothetical protein L484_017832 [Morus notabilis]     425   e-116
ref|XP_002533083.1| conserved hypothetical protein [Ricinus comm...   399   e-108
ref|XP_004163080.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   390   e-106
ref|XP_004149328.1| PREDICTED: uncharacterized protein LOC101215...   390   e-106
ref|XP_006282527.1| hypothetical protein CARUB_v10003963mg [Caps...   379   e-102
ref|XP_002869583.1| hypothetical protein ARALYDRAFT_354097 [Arab...   375   e-101
ref|XP_004295819.1| PREDICTED: uncharacterized protein LOC101298...   367   5e-99
ref|XP_006413117.1| hypothetical protein EUTSA_v10024185mg [Eutr...   366   1e-98
ref|XP_002893970.1| hypothetical protein ARALYDRAFT_336756 [Arab...
358 2e-96 ref|XP_002891275.1| hypothetical protein ARALYDRAFT_314107 [Arab... 358 4e-96 >gb|EYU39621.1| hypothetical protein MIMGU_mgv1a0000302mg, partial [Mimulus guttatus] Length = 1540 Score = 494 bits (1271), Expect = e-137 Identities = 266/451 (58%), Positives = 322/451 (71%), Gaps = 1/451 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +ELV+LLLSSYGATCSE+D EIYNL+L+IESND+S AG VA+TDY+WG +S K+RK Sbjct: 894 GINCRELVYLLLSSYGATCSEVDKEIYNLMLEIESNDKSSAGIVAQTDYIWGPSSLKMRK 953 Query: 1170 DLEQNNDMQSVDQNSMEVYER-RKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 D SVD + E +E +KVKFRENIP+DP +CAQT L FPY+ V GG Sbjct: 954 D--------SVDLKNTESFEELQKVKFRENIPVDPNMCAQTVLHFPYNEFVNGGT----- 1000 Query: 993 KPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPD 814 SS VM E +TTDKLQIYDP+FILRFSIHCL+ +YIEPIEFASLGLL++TF S+SS D Sbjct: 1001 --SSTVMTEACSTTDKLQIYDPIFILRFSIHCLSRNYIEPIEFASLGLLAITFVSMSSND 1058 Query: 813 EDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEAS 634 E RKLGYEAL+KF SALEKCQKKKD +T LQNGIE W+RIPS+IA+F AEAS Sbjct: 1059 EVTRKLGYEALSKFNSALEKCQKKKDVKRLGLLMTSLQNGIEGQWRRIPSIIAIFCAEAS 1118 Query: 633 LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454 L+LLD S N+++I ++ + S VNMK IPLF T FWS+S FK DR+WMLRLL VGLNT Sbjct: 1119 LVLLDESYANHSSIYEYFNKSRCVNMKDIPLFSTLFWSSSDKFKMDRLWMLRLLYVGLNT 1178 Query: 453 EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274 EDDAQ Y+ N IF+TL+SFY SPLSDN+SKELIIQ+++KA H+AV LVEH G+I WL Sbjct: 1179 EDDAQIYLGNHIFKTLMSFYCSPLSDNDSKELIIQIVEKACQFHRAVRVLVEHGGLILWL 1238 Query: 273 SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXX 94 SSI VV+ ITSPRN +EWL KHA+EQ Sbjct: 1239 SSI-------------------------VVSYITSPRNNIEWLPKHAMEQLSELSSNLFK 1273 Query: 93 XXXSGVELITNQSSICASILQILTLVQKISQ 1 S +LI +S++C SIL+ LTL+ K+SQ Sbjct: 1274 LLVSSFDLIKEESTLCYSILETLTLLLKVSQ 1304 >gb|EYU23596.1| hypothetical protein MIMGU_mgv1a025361mg [Mimulus guttatus] Length = 2371 Score = 493 bits (1268), Expect = e-136 
Identities = 265/451 (58%), Positives = 322/451 (71%), Gaps = 1/451 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +ELV+LLLSSYGATCSE+D EIYNL+L+IESND+S AG VA+TDY+WG +S K+RK Sbjct: 1703 GINCRELVYLLLSSYGATCSEVDKEIYNLMLEIESNDKSSAGIVAQTDYIWGPSSLKMRK 1762 Query: 1170 DLEQNNDMQSVDQNSMEVYER-RKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 D SVD + E +E +KVKFRENIP+DP +CAQT L FPY+ V GG Sbjct: 1763 D--------SVDLKNTESFEELQKVKFRENIPVDPNMCAQTVLHFPYNEFVNGGT----- 1809 Query: 993 KPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPD 814 SS VM E +TTDKLQIYDP+FILRFSIHC++ +YIEPIEFASLGLL++TF S+SS D Sbjct: 1810 --SSTVMTEACSTTDKLQIYDPIFILRFSIHCISRNYIEPIEFASLGLLAITFVSMSSND 1867 Query: 813 EDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEAS 634 E RKLGYEAL+KF SALEKCQKKKD +T LQNGIE W+RIPS+IA+F AEAS Sbjct: 1868 EVTRKLGYEALSKFNSALEKCQKKKDVKRLGLLMTSLQNGIEGQWRRIPSIIAIFCAEAS 1927 Query: 633 LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454 L+LLD S N+++I ++ + S VNMK IPLF T FWS+S FK DR+WMLRLL VGLNT Sbjct: 1928 LVLLDESYANHSSIYEYFNKSRCVNMKDIPLFSTLFWSSSDKFKMDRLWMLRLLYVGLNT 1987 Query: 453 EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274 EDDAQ Y+ N IF+TL+SFY SPLSDN+SKELIIQ+++KA H+AV LVEH G+I WL Sbjct: 1988 EDDAQIYLGNHIFKTLMSFYCSPLSDNDSKELIIQIVEKACQFHRAVRVLVEHGGLILWL 2047 Query: 273 SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXX 94 SSI VV+ ITSPRN +EWL KHA+EQ Sbjct: 2048 SSI-------------------------VVSYITSPRNNIEWLPKHAMEQLSELSSNLFK 2082 Query: 93 XXXSGVELITNQSSICASILQILTLVQKISQ 1 S +LI +S++C SIL+ LTL+ K+SQ Sbjct: 2083 LLVSSFDLIKEESTLCNSILKTLTLLLKVSQ 2113 >ref|XP_003634725.1| PREDICTED: uncharacterized protein LOC100264016 [Vitis vinifera] Length = 2563 Score = 459 bits (1182), Expect = e-126 Identities = 242/451 (53%), Positives = 310/451 (68%), Gaps = 2/451 (0%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 INA+EL+ LLLSSYGA +E+DLEIY+L+ +IESND +G++A DYLWG ++ 
++RK+ Sbjct: 1826 INARELISLLLSSYGAMLNEVDLEIYSLMHEIESNDRLKSGSIADMDYLWGSSALRIRKE 1885 Query: 1167 LEQNNDMQSVDQNSME-VYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGG-NLPKLQ 994 Q ++ + + E V ER++ +FREN+PIDPK+C T L+FPY+R G N+P+ Sbjct: 1886 RVQELEISANNILDAEAVEERQRSQFRENLPIDPKLCVNTVLYFPYNRTASDGENVPR-- 1943 Query: 993 KPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPD 814 YDPVFIL FSIH L+M YIEP+EF++LGLL+V F S+SSPD Sbjct: 1944 -------------------YDPVFILHFSIHSLSMRYIEPVEFSALGLLAVAFVSLSSPD 1984 Query: 813 EDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEAS 634 + +RKLGYE L +FK+ALE CQK+KD LTY+QNGIEEPWQRIPSV A+FAAEAS Sbjct: 1985 DMIRKLGYETLGRFKNALEMCQKRKDVMQLRLLLTYMQNGIEEPWQRIPSVTAIFAAEAS 2044 Query: 633 LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454 ILLDPS+ +Y+TISK L S VNMK IPLF F WS+S+ FK++R+W+LRL GLN Sbjct: 2045 FILLDPSHEHYSTISKLLMRSTGVNMKCIPLFNNFIWSSSINFKSERLWILRLSYAGLNL 2104 Query: 453 EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274 EDDAQ YIRNSI ET++SFY+SP SDNESKELI+Q++KK+ LHK LVEHCG+ISWL Sbjct: 2105 EDDAQIYIRNSILETILSFYASPFSDNESKELILQIVKKSVKLHKMARYLVEHCGLISWL 2164 Query: 273 SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXX 94 SS L DQ+ F L QL + EV+N + S RNI+ WLQK ALEQ Sbjct: 2165 SSALSFFSERLSGDQRSFWLKQLTIVTEVINNVISSRNIIGWLQKDALEQLSEVALHLYK 2224 Query: 93 XXXSGVELITNQSSICASILQILTLVQKISQ 1 V+L+ + ++ SILQIL K SQ Sbjct: 2225 LLIGAVQLMKDNVTLVNSILQILISTLKFSQ 2255 >ref|XP_006344824.1| PREDICTED: uncharacterized protein LOC102599460 [Solanum tuberosum] Length = 2550 Score = 457 bits (1176), Expect = e-126 Identities = 237/452 (52%), Positives = 313/452 (69%), Gaps = 2/452 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL+ LLLSSYGA+ S IDLEIY+L+ +I S ++ G++A+ DYLWG A KVRK Sbjct: 1807 GINLKELLFLLLSSYGASMSVIDLEIYSLMDEINSTNDLGEGSMAKLDYLWGSALLKVRK 1866 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNL--PKL 997 + E + S + V + R++ FRENIPIDPK+CA T L+FPYDR V G L PK Sbjct: 
1867 ENELEQTISSNLSEAEAVDDYRRICFRENIPIDPKVCATTVLYFPYDRTVGSGILKEPKK 1926 Query: 996 QKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSP 817 P ++ + +KL++YDP+FIL FS+HCL+M +IEP+EFASLGLL++ SISSP Sbjct: 1927 DYPDFGYEVQYADA-EKLRVYDPIFILHFSVHCLSMGFIEPLEFASLGLLAIAVVSISSP 1985 Query: 816 DEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEA 637 D+DMRKLGYE L +FKS LE+CQK+KD ++YLQNGIEEPWQ+I SV A+F AEA Sbjct: 1986 DDDMRKLGYEVLGRFKSVLERCQKRKDVMRLRLLMSYLQNGIEEPWQKISSVTAIFVAEA 2045 Query: 636 SLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLN 457 S +LLDPS+++Y+ ISK+L SP+ NMK IPLF+TFFWS S F +R+WMLRLLC GLN Sbjct: 2046 SYVLLDPSHDHYSAISKYLIRSPNANMKGIPLFQTFFWSISTNFITERLWMLRLLCSGLN 2105 Query: 456 TEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISW 277 +DDAQ YIRN+IFETL SFY SP+SD+ESKELI+Q+++K+ + K LVE CG+ISW Sbjct: 2106 VDDDAQIYIRNAIFETLFSFYVSPISDHESKELIVQIVRKSVRIPKMARYLVEQCGLISW 2165 Query: 276 LSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXX 97 S ++ SL ++ +L + ILE +N + R+ VEW+QK+ALEQ Sbjct: 2166 SSCVVSSL---SWSQCRRNSLVEFTVILEALNEVVLSRHTVEWMQKYALEQLVELSCNLY 2222 Query: 96 XXXXSGVELITNQSSICASILQILTLVQKISQ 1 GVE + + + ILQIL +ISQ Sbjct: 2223 KMLIEGVERLKVNTQLVKLILQILRSALRISQ 2254 >ref|XP_006475162.1| PREDICTED: uncharacterized protein LOC102613555 isoform X2 [Citrus sinensis] Length = 2578 Score = 449 bits (1156), Expect = e-123 Identities = 246/461 (53%), Positives = 317/461 (68%), Gaps = 12/461 (2%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 IN +EL LLL+SYGAT S+ID+EIY+++ +IE + S +A+ DYLWG A+AKVRK+ Sbjct: 1871 INLRELCLLLLASYGATLSDIDMEIYDVMHEIERIENS-DNEIAQLDYLWGRAAAKVRKE 1929 Query: 1167 --LEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 LEQ+ ++ ++ E+++ +FREN+ IDPKICA T L+FPYDR G Sbjct: 1930 WILEQDTSC-NIMTDAEAAKEQKRSQFRENLAIDPKICAMTVLYFPYDRTTDG------- 1981 Query: 993 KPSSAVMLEG----------STTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLS 844 PSS+ L+ S LQ YDPVFILRF+IH L++ +IEP+EFA LGLL+ Sbjct: 1982 
-PSSSNKLKADNLWNTHEIHSPDLQDLQRYDPVFILRFAIHSLSVGFIEPVEFAGLGLLA 2040 Query: 843 VTFASISSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPS 664 V F SISSPD MRKLGYE L +FK+ LEKC KKKD LTY+QNGIEEPWQRIPS Sbjct: 2041 VAFVSISSPDVGMRKLGYETLGRFKNELEKCSKKKDVMRLRLLLTYVQNGIEEPWQRIPS 2100 Query: 663 VIAVFAAEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWM 484 VIA+FAAEASL+LLDPS+++YT++SK L S VN+K+IPLF FF S+SV F+ +R+WM Sbjct: 2101 VIAIFAAEASLLLLDPSHDHYTSVSKLLMRSSRVNLKSIPLFHDFFSSSSVNFRKERLWM 2160 Query: 483 LRLLCVGLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWAL 304 LRLL GLN +DDAQ YIRNS+ E L+SFY+SPLSD+ESKELI+ ++KK+ LHK L Sbjct: 2161 LRLLYAGLNLDDDAQVYIRNSVLEILMSFYASPLSDSESKELILLILKKSIKLHKMACYL 2220 Query: 303 VEHCGVISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQ 124 VEHCG+ SWLSS+L S G + ++ F + QL ++EVVN + S RNI EWLQ+HALEQ Sbjct: 2221 VEHCGLFSWLSSLLSSFSGMLLGGEKMFLMAQLIVVVEVVNDVISSRNINEWLQRHALEQ 2280 Query: 123 XXXXXXXXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1 G++L+ + SIL IL KISQ Sbjct: 2281 LVDFSSHLYKLLVGGMKLMRENVPLVNSILLILISTVKISQ 2321 >ref|XP_006475161.1| PREDICTED: uncharacterized protein LOC102613555 isoform X1 [Citrus sinensis] Length = 2618 Score = 449 bits (1156), Expect = e-123 Identities = 246/461 (53%), Positives = 317/461 (68%), Gaps = 12/461 (2%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 IN +EL LLL+SYGAT S+ID+EIY+++ +IE + S +A+ DYLWG A+AKVRK+ Sbjct: 1871 INLRELCLLLLASYGATLSDIDMEIYDVMHEIERIENS-DNEIAQLDYLWGRAAAKVRKE 1929 Query: 1167 --LEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 LEQ+ ++ ++ E+++ +FREN+ IDPKICA T L+FPYDR G Sbjct: 1930 WILEQDTSC-NIMTDAEAAKEQKRSQFRENLAIDPKICAMTVLYFPYDRTTDG------- 1981 Query: 993 KPSSAVMLEG----------STTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLS 844 PSS+ L+ S LQ YDPVFILRF+IH L++ +IEP+EFA LGLL+ Sbjct: 1982 -PSSSNKLKADNLWNTHEIHSPDLQDLQRYDPVFILRFAIHSLSVGFIEPVEFAGLGLLA 2040 Query: 843 VTFASISSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPS 664 V F SISSPD MRKLGYE L +FK+ LEKC KKKD 
LTY+QNGIEEPWQRIPS Sbjct: 2041 VAFVSISSPDVGMRKLGYETLGRFKNELEKCSKKKDVMRLRLLLTYVQNGIEEPWQRIPS 2100 Query: 663 VIAVFAAEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWM 484 VIA+FAAEASL+LLDPS+++YT++SK L S VN+K+IPLF FF S+SV F+ +R+WM Sbjct: 2101 VIAIFAAEASLLLLDPSHDHYTSVSKLLMRSSRVNLKSIPLFHDFFSSSSVNFRKERLWM 2160 Query: 483 LRLLCVGLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWAL 304 LRLL GLN +DDAQ YIRNS+ E L+SFY+SPLSD+ESKELI+ ++KK+ LHK L Sbjct: 2161 LRLLYAGLNLDDDAQVYIRNSVLEILMSFYASPLSDSESKELILLILKKSIKLHKMACYL 2220 Query: 303 VEHCGVISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQ 124 VEHCG+ SWLSS+L S G + ++ F + QL ++EVVN + S RNI EWLQ+HALEQ Sbjct: 2221 VEHCGLFSWLSSLLSSFSGMLLGGEKMFLMAQLIVVVEVVNDVISSRNINEWLQRHALEQ 2280 Query: 123 XXXXXXXXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1 G++L+ + SIL IL KISQ Sbjct: 2281 LVDFSSHLYKLLVGGMKLMRENVPLVNSILLILISTVKISQ 2321 >ref|XP_004233937.1| PREDICTED: uncharacterized protein LOC101258227 [Solanum lycopersicum] Length = 2434 Score = 447 bits (1150), Expect = e-123 Identities = 234/452 (51%), Positives = 308/452 (68%), Gaps = 2/452 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL+ LLLSSYGA+ S IDLEIY+L+ +I S + ++A+ DYLWG A KVRK Sbjct: 1691 GINLRELLFLLLSSYGASMSVIDLEIYSLMDEISSANNLGEVSMAKLDYLWGSALLKVRK 1750 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNL--PKL 997 + EQ + + V + R+++FRENIPIDPK+CA T L+FPY+R V L PK Sbjct: 1751 ENEQEQTISCNLSEAEAVDDYRRIRFRENIPIDPKVCATTVLYFPYERTVGPRILKEPKK 1810 Query: 996 QKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSP 817 P + + +KL +YDP+FIL FS+HCL+M ++EP+EFASLGLL++ SISSP Sbjct: 1811 DYPDFGYEVHYADA-EKLHVYDPIFILHFSVHCLSMGFVEPLEFASLGLLAIAVVSISSP 1869 Query: 816 DEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEA 637 D+DMRKLGYE L +FKS LE+CQK+KD ++YLQNGIEEPWQ+I SV A+F AEA Sbjct: 1870 DDDMRKLGYEVLGRFKSVLERCQKRKDVVRLRLLMSYLQNGIEEPWQKISSVTAIFVAEA 1929 Query: 636 SLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLN 457 S 
+LLDPS+++Y+ ISK+L SPS NMK IPLF+TFFWS S + +R+WMLRLLC GLN Sbjct: 1930 SYVLLDPSHDHYSAISKYLIRSPSANMKGIPLFQTFFWSISTNYITERLWMLRLLCSGLN 1989 Query: 456 TEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISW 277 +DDAQ YIRN+IFETL SFY SP+SD+ESKELI+Q+++K+ + K LVE CG+ISW Sbjct: 1990 LDDDAQIYIRNAIFETLFSFYVSPISDHESKELIVQIVRKSVRIPKMARYLVEQCGLISW 2049 Query: 276 LSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXX 97 S + SL ++ + +L ILE +N + R+ VEW+QK+ALEQ Sbjct: 2050 SSCAVSSL---SWSQCRRNSFVELTVILEALNEVVLSRHTVEWMQKYALEQLVELSCNLY 2106 Query: 96 XXXXSGVELITNQSSICASILQILTLVQKISQ 1 GVE + S + ILQIL +ISQ Sbjct: 2107 KMLIEGVERLKVNSQLVKLILQILRSALRISQ 2138 >ref|XP_006370696.1| hypothetical protein POPTR_0001s44980g [Populus trichocarpa] gi|550349902|gb|ERP67265.1| hypothetical protein POPTR_0001s44980g [Populus trichocarpa] Length = 2573 Score = 439 bits (1128), Expect = e-120 Identities = 240/454 (52%), Positives = 303/454 (66%), Gaps = 4/454 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL LLLSSYGAT SE D EIYNL+L+IES D S VA DYLWG A K+ K Sbjct: 1849 GINLKELHLLLLSSYGATLSETDFEIYNLMLEIESIDNSVVDVVADMDYLWGTAVLKISK 1908 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGN--LPKL 997 + + + V N+ V E R+ +FREN+P+DPK+C TAL FPYDR V G+ L +L Sbjct: 1909 ERVLDQETYDVVTNTEAVKEHRRSQFRENLPVDPKMCVTTALHFPYDRTVTDGSFSLDRL 1968 Query: 996 QKPSSAVMLEGSTT-TDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISS 820 Q + + E + +Q+YDPVFILRFSIH L+M YIE +EFA LGLL+V F S+SS Sbjct: 1969 QLDNLKDIYERHVPGVENIQLYDPVFILRFSIHALSMGYIEAVEFAGLGLLAVAFVSMSS 2028 Query: 819 PDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAE 640 PD MRKLGYE + K+K+ LE CQK KD LTYLQNGI EPWQRIPSV+A+FAAE Sbjct: 2029 PDVGMRKLGYELIGKYKNVLENCQKTKDVMRLRLLLTYLQNGISEPWQRIPSVLALFAAE 2088 Query: 639 ASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGL 460 +SLILLDPS+++YTT+SKHL +S VNMK R+WMLRL C GL Sbjct: 2089 SSLILLDPSHDHYTTLSKHLMHSSKVNMK-------------------RLWMLRLACGGL 2129 Query: 
459 NTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVIS 280 N +DD Q +IRNS ETL+SFYSSPLSDNESKE+I++++KKAA L + V LVEHCG+ Sbjct: 2130 NLDDDTQIFIRNSTIETLLSFYSSPLSDNESKEIILEIVKKAAKLPRMVRYLVEHCGLFP 2189 Query: 279 WLSSILPSLYGGGVEDQQKFALTQ-LPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXX 103 WLSS+L S+Y G + + ++ +Q L ++EVVN + S RNIVEWLQ +ALEQ Sbjct: 2190 WLSSVL-SVYKGMLHENERIFFSQLLVVVIEVVNDVVSSRNIVEWLQNYALEQLMELATY 2248 Query: 102 XXXXXXSGVELITNQSSICASILQILTLVQKISQ 1 +G +LI ++ S+L I+ KISQ Sbjct: 2249 LYKLLVAGSKLIKENVTLVNSVLHIMLTTLKISQ 2282 >emb|CAN83957.1| hypothetical protein VITISV_039906 [Vitis vinifera] Length = 2715 Score = 429 bits (1103), Expect = e-117 Identities = 221/400 (55%), Positives = 289/400 (72%), Gaps = 6/400 (1%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 INA+EL+ LLLSSYGA +E+DLEIY+L+ +IESND +G++A DYLWG ++ ++RK+ Sbjct: 1871 INARELISLLLSSYGAMXNEVDLEIYSLMHEIESNDRLKSGSIADMDYLWGSSALRIRKE 1930 Query: 1167 LEQNNDMQSVDQNSME-VYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGG--NLPKL 997 Q ++ + + E V ER++ +FREN+PIDPK+C T L+FPY+R G +L K+ Sbjct: 1931 RVQELEISANNIXDAEAVEERQRSQFRENLPIDPKLCVNTVLYFPYNRTASDGPISLNKV 1990 Query: 996 QKPSSAVMLEGSTT-TDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISS 820 + M++G + + YDPVFIL FSIH L+M YIEP+EF++LGLL+V F S+SS Sbjct: 1991 HPDNVKDMIQGYPPHVENVPRYDPVFILHFSIHSLSMRYIEPVEFSALGLLAVAFVSLSS 2050 Query: 819 PDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAE 640 PD+ +RKLGYE L +FK+ALE CQK+KD LTY+QNGIEEPWQRIPSV A+FAAE Sbjct: 2051 PDDMIRKLGYETLGRFKNALEMCQKRKDVMQLRLLLTYMQNGIEEPWQRIPSVTAIFAAE 2110 Query: 639 ASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGL 460 AS ILLDPS+ +Y+TISK L S VNMK IPLF F WS+S+ FK++R+W+LRL GL Sbjct: 2111 ASFILLDPSHEHYSTISKLLMRSTGVNMKCIPLFNNFIWSSSINFKSERLWILRLSYAGL 2170 Query: 459 NTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVIS 280 N EDDAQ YIRNSI ET++SFY+SP SDNESKELI+Q++KK+ LHK LVEHCG+IS Sbjct: 2171 NLEDDAQIYIRNSILETILSFYASPFSDNESKELILQIVKKSVKLHKMARYLVEHCGLIS 2230 Query: 
279 WLSSILPSLYGGGVEDQQKFALTQLPTILEVVN--CITSP 166 WLSS L DQ+ F L QL + E + C+ +P Sbjct: 2231 WLSSALSFFSERLSGDQRSFWLKQLTIVTEPLTWACVVAP 2270 >ref|XP_007022465.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508722093|gb|EOY13990.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 2614 Score = 427 bits (1098), Expect = e-117 Identities = 233/453 (51%), Positives = 306/453 (67%), Gaps = 4/453 (0%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 IN +EL LLLSSYGAT SEIDLE+Y+L+ +IE+ D S + +A DYLWG A+ KVRK+ Sbjct: 1865 INLKELHLLLLSSYGATLSEIDLEMYSLINEIETIDSSDSKYIAEIDYLWGSAAMKVRKE 1924 Query: 1167 LE-QNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGG--NLPKL 997 ++ +++ + ER K+K+R+N+P+DPK+CA T L FPYDR +L KL Sbjct: 1925 HGLEHGASRNIMTDIEAAQERLKIKYRDNLPVDPKVCAATVLHFPYDRTASDRPLSLNKL 1984 Query: 996 QKPSSAVMLE-GSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISS 820 Q + M++ S +Q YDPVFI+RFSIH L+ YIEP+EFA LGLL+V F S+SS Sbjct: 1985 QSDNIKDMIKLHSPGAGNIQRYDPVFIMRFSIHSLSAGYIEPVEFAGLGLLAVAFVSMSS 2044 Query: 819 PDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAE 640 D MRKL YE L++FK +LE+CQ+KKD L Y+QNGIEEPWQRIPSVIA+FAAE Sbjct: 2045 LDVGMRKLAYEVLSRFKISLERCQRKKDVTRLHLLLMYMQNGIEEPWQRIPSVIALFAAE 2104 Query: 639 ASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGL 460 SL+LLDP + +Y+T +K L NS VNMK IPLF FF S++V F+A R+W+LRL GL Sbjct: 2105 TSLVLLDPLHEHYSTFNKLLMNSSRVNMKQIPLFHDFFQSSAVNFRAQRLWILRLANAGL 2164 Query: 459 NTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVIS 280 N EDDA YIR+SI ETL+SFY SPLSDNESK+LI+Q++KK+ LHK V LVE C + S Sbjct: 2165 NLEDDAWLYIRSSILETLMSFYVSPLSDNESKKLILQILKKSVQLHKMVRYLVEQCSLFS 2224 Query: 279 WLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXX 100 WLSSIL + + D+ + LT+L ++EVV + S ++I EWLQ ALEQ Sbjct: 2225 WLSSILSNYSRVLLGDENRIFLTELVMVIEVVTEVISSKDITEWLQSCALEQLMELASHL 2284 Query: 99 XXXXXSGVELITNQSSICASILQILTLVQKISQ 1 G++LI ++ LQI+ K+SQ Sbjct: 2285 YKLLVGGMKLINEHAAFVNPTLQIIISTLKMSQ 2317 
>gb|EXB50294.1| hypothetical protein L484_017832 [Morus notabilis] Length = 2615 Score = 425 bits (1093), Expect = e-116 Identities = 227/457 (49%), Positives = 306/457 (66%), Gaps = 7/457 (1%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GI ++L LLLSSYGA +E+D+EIYNL+ IES D A +A D+LWG A++KV K Sbjct: 1862 GIKFRKLHLLLLSSYGAKLNEMDMEIYNLMSTIESFDGLEAENIAGLDHLWGTAASKVEK 1921 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGG--NLPKL 997 + D+ + ++ V ERR+ +FREN+P+DPKICA T L+FPYDR +L K Sbjct: 1922 EQALEQDIMN---DAEAVKERRRSQFRENLPVDPKICASTVLYFPYDRTASHEPVSLDKF 1978 Query: 996 QKPSSAVMLEGSTTT-----DKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFA 832 + + A M+ T T + L+ YDPVFILRFS++ LT+ YIEP+EFA LGLL++ F Sbjct: 1979 RADNFACMIVNYTQTRPSDVENLERYDPVFILRFSLYSLTVGYIEPMEFAGLGLLAIAFV 2038 Query: 831 SISSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAV 652 S+SSPDE +RKL Y L KFK LE+C+K+K+ L+ LQNGIEEPWQRIPSV+++ Sbjct: 2039 SMSSPDEGIRKLAYSTLGKFKDTLEQCKKRKEVTRIRLLLSSLQNGIEEPWQRIPSVVSI 2098 Query: 651 FAAEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLL 472 FAAEAS ILLDPS++ Y+T+S+ L NS +N+K +P+F FFWSTSV ++ADR+W+LRL+ Sbjct: 2099 FAAEASFILLDPSHDQYSTLSRLLMNSSKLNLKNVPVFSDFFWSTSVNYRADRLWILRLV 2158 Query: 471 CVGLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHC 292 GLN+ DDAQ YIRNSI ET +SFY SPLSD ESK+LI+QV+K++ +K LVE C Sbjct: 2159 YAGLNSSDDAQIYIRNSIPETFMSFYFSPLSDTESKDLILQVVKRSVKFYKLTRHLVESC 2218 Query: 291 GVISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXX 112 G++ WLSS+L + D+ + QL +LEVVN + S RNI EWLQK ALEQ Sbjct: 2219 GLLLWLSSVLTANTRNS-RDETNIFIMQLTVVLEVVNGVISSRNITEWLQKEALEQLMEL 2277 Query: 111 XXXXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1 G+ + +++ +L+ L KISQ Sbjct: 2278 VSHLYRFLVDGMVSVKEHATLVNLLLETLISTLKISQ 2314 >ref|XP_002533083.1| conserved hypothetical protein [Ricinus communis] gi|223527122|gb|EEF29298.1| conserved hypothetical protein [Ricinus communis] Length = 2587 Score = 399 bits (1026), Expect = e-108 Identities = 
221/453 (48%), Positives = 303/453 (66%), Gaps = 3/453 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL LLLSSYGAT +ID+EI++L+ +IES D S + +A+ DYLWG A+ ++RK Sbjct: 1847 GINLKELYFLLLSSYGATLGDIDVEIFSLMREIESIDTSVSEDLAKLDYLWGTAALRIRK 1906 Query: 1170 DLEQNNDMQSVDQNSMEVYER-RKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 + + D S + EV+E R+ +FRE +PI+P ICA T +FPYDR + +L+ Sbjct: 1907 ERALDWDTSSSVITNKEVFEEHRRSQFREVLPINPNICATTVNYFPYDRIMS----IELE 1962 Query: 993 KPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPD 814 P + + + YDP+FIL FS H L+M +IEP+EFA LGLL+++F S+SSPD Sbjct: 1963 NPKNMRVAHFPG-----ERYDPIFILNFSNHNLSMGHIEPLEFACLGLLAISFISMSSPD 2017 Query: 813 EDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEAS 634 ++RKL +L KFK ALE+ QKKKD LTY+QNGI+E QRIPS+IA+FAAE+S Sbjct: 2018 IEIRKLSDASLGKFKDALERFQKKKDVLRLHLLLTYIQNGIKERLQRIPSIIALFAAESS 2077 Query: 633 LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454 ILLDPSN+++TT++KHL +S +V+MK IPLF TFF S SV F+A+R+WMLRL+C GLN Sbjct: 2078 FILLDPSNDHFTTLNKHLMHSSAVDMKHIPLFHTFFHSNSVNFRAERLWMLRLVCAGLNL 2137 Query: 453 EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274 +DDAQ YI NSI ETL+SFY++PL+DNESKELI+QV+KK+ L + LVE CG+ WL Sbjct: 2138 DDDAQIYISNSILETLLSFYTTPLADNESKELILQVVKKSVKLDRMTRHLVESCGLFPWL 2197 Query: 273 SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIV--EWLQKHALEQXXXXXXXX 100 S++L E++ F+ QL +EV+ I S NI+ W K++ EQ Sbjct: 2198 STVLSISSAMLDENKDSFSSLQLVLAIEVIFDIISSGNIIGSAWFGKYSFEQCIELASHL 2257 Query: 99 XXXXXSGVELITNQSSICASILQILTLVQKISQ 1 G++LI ++ SILQI+ KISQ Sbjct: 2258 YKILVGGLKLIKENVALIESILQIVISTLKISQ 2290 >ref|XP_004163080.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224336 [Cucumis sativus] Length = 2375 Score = 390 bits (1001), Expect = e-106 Identities = 218/452 (48%), Positives = 292/452 (64%), Gaps = 3/452 (0%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 IN +EL LLLSSYGAT SE D I L DIE+ S A + D+LWG A V 
K+ Sbjct: 1622 INFRELYALLLSSYGATVSETDSTILMTLNDIETIIGSDAKNQVQMDFLWGNAVLGVSKE 1681 Query: 1167 -LEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYG--GNLPKL 997 L + ++ ++ V ER + +FREN+P+DP+IC T L+FPYDR L K Sbjct: 1682 RLLEQEPSSNISNDAEAVKERHRNQFRENLPVDPRICVSTVLWFPYDRTESDEESRLKKY 1741 Query: 996 QKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSP 817 + + +G + + YDP+++LRFSIH L+M YIE +EFA+LGLL+V F S+SS Sbjct: 1742 RVKDLDDLFKGHYHGTEPERYDPIYVLRFSIHALSMGYIEALEFATLGLLAVAFVSLSSA 1801 Query: 816 DEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEA 637 ++ +RKLGY L K+ +E +++K LTY+QNGIEEPWQRIPS+IA+FAAEA Sbjct: 1802 NDKLRKLGYGTLGALKNTVENGKRRKGTTRLRLLLTYVQNGIEEPWQRIPSIIALFAAEA 1861 Query: 636 SLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLN 457 S ILL+PS+++Y ISK L S +N K+IPLF+ F WS+SV FK++R+WMLRL+ VG+N Sbjct: 1862 SFILLEPSHHHYAAISKFLVRSTRLNSKSIPLFKNFLWSSSVNFKSERLWMLRLVYVGIN 1921 Query: 456 TEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISW 277 +DDA+ YI+NSI E L SFY S LSDNESKELI+QVMKK+ L + + LVE+ G+ SW Sbjct: 1922 VDDDARLYIKNSIHEDLQSFYVSSLSDNESKELILQVMKKSVKLQRMAFYLVEN-GLFSW 1980 Query: 276 LSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXX 97 L SI+ + EDQ+ QL +LEVVN + S RNI EWLQK ALEQ Sbjct: 1981 LCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIF 2040 Query: 96 XXXXSGVELITNQSSICASILQILTLVQKISQ 1 G +L+ + ++ ILQI+T V +ISQ Sbjct: 2041 KILVGGEQLLLIEGALVNQILQIITSVLRISQ 2072 >ref|XP_004149328.1| PREDICTED: uncharacterized protein LOC101215477 [Cucumis sativus] Length = 2446 Score = 390 bits (1001), Expect = e-106 Identities = 218/452 (48%), Positives = 292/452 (64%), Gaps = 3/452 (0%) Frame = -1 Query: 1347 INAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRKD 1168 IN +EL LLLSSYGAT SE D I L DIE+ S A + D+LWG A V K+ Sbjct: 1693 INFRELYALLLSSYGATVSETDSTILMTLNDIETIIGSDAKNQVQMDFLWGNAVLGVSKE 1752 Query: 1167 -LEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYG--GNLPKL 997 L + ++ ++ V ER + +FREN+P+DP+IC T L+FPYDR L K Sbjct: 1753 
RLLEQEPSSNISNDAEAVKERHRNQFRENLPVDPRICVSTVLWFPYDRTESDEESRLKKY 1812 Query: 996 QKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSP 817 + + +G + + YDP+++LRFSIH L+M YIE +EFA+LGLL+V F S+SS Sbjct: 1813 RVKDLDDLFKGHYHGTEPERYDPIYVLRFSIHALSMGYIEALEFATLGLLAVAFVSLSSA 1872 Query: 816 DEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEA 637 ++ +RKLGY L K+ +E +++K LTY+QNGIEEPWQRIPS+IA+FAAEA Sbjct: 1873 NDKLRKLGYGTLGALKNTVENGKRRKGTTRLRLLLTYVQNGIEEPWQRIPSIIALFAAEA 1932 Query: 636 SLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLN 457 S ILL+PS+++Y ISK L S +N K+IPLF+ F WS+SV FK++R+WMLRL+ VG+N Sbjct: 1933 SFILLEPSHHHYAAISKFLVRSTRLNSKSIPLFKNFLWSSSVNFKSERLWMLRLVYVGIN 1992 Query: 456 TEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISW 277 +DDA+ YI+NSI E L SFY S LSDNESKELI+QVMKK+ L + + LVE+ G+ SW Sbjct: 1993 VDDDARLYIKNSIHEDLQSFYVSSLSDNESKELILQVMKKSVKLQRMAFYLVEN-GLFSW 2051 Query: 276 LSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXX 97 L SI+ + EDQ+ QL +LEVVN + S RNI EWLQK ALEQ Sbjct: 2052 LCSIISTSSRRLTEDQKSIFPKQLALVLEVVNNVISFRNICEWLQKDALEQLMEFSSNIF 2111 Query: 96 XXXXSGVELITNQSSICASILQILTLVQKISQ 1 G +L+ + ++ ILQI+T V +ISQ Sbjct: 2112 KILVGGEQLLLIEGALVNQILQIITSVLRISQ 2143 >ref|XP_006282527.1| hypothetical protein CARUB_v10003963mg [Capsella rubella] gi|482551232|gb|EOA15425.1| hypothetical protein CARUB_v10003963mg [Capsella rubella] Length = 2547 Score = 379 bits (972), Expect = e-102 Identities = 210/451 (46%), Positives = 284/451 (62%), Gaps = 1/451 (0%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL LLL SYGAT SEIDLEI+ L+ DI+ D V+ TD LWG A+ K+R+ Sbjct: 1811 GINLKELHFLLLCSYGATLSEIDLEIFKLMHDIKLVDAEHTLNVSETDCLWGKAALKIRE 1870 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAV-YGGNLPKLQ 994 L + D V ++ + + R+ F+EN+ +DPK+CA T LFFPY R NL Sbjct: 1871 GLRFSQDASYVGESDF-LEDVRQSLFKENLCVDPKMCALTVLFFPYQRTTEVSDNLYLYD 1929 Query: 993 KPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPD 
814 P V + S + ++ YDPVFILR SI L+M +IEP+EFASLGLL+V F S+SS D Sbjct: 1930 DP---VNEKCSPVMEDIERYDPVFILRISIDSLSMGFIEPVEFASLGLLAVAFVSMSSAD 1986 Query: 813 EDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEAS 634 MRKLGYE L + ALE C+K K L Y+QNG+EEPWQRIP+V A+FAAE S Sbjct: 1987 LGMRKLGYETLEIYLDALESCRKNKHVTALRLLLMYVQNGVEEPWQRIPTVSAIFAAETS 2046 Query: 633 LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454 LI LDPS+ +Y I+K L +S ++ ++ IPLF FFWS++V F++ R W+LRL+C GL + Sbjct: 2047 LIFLDPSHEHYVPINKLLKSSSTLKLRGIPLFHDFFWSSAVNFRSQRFWVLRLVCAGLKS 2106 Query: 453 EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274 +DDAQ YIRNSI ET++SF SSPL+D+E+K LI+QV++K+ HK LVE+CG+ SW Sbjct: 2107 DDDAQIYIRNSILETVMSFSSSPLTDDETKGLILQVVRKSVKFHKMSRHLVENCGLFSWC 2166 Query: 273 SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXX 94 SS + + + D+ F L +LEV+ + + RN+ EWLQ+ LE Sbjct: 2167 SSFISTFTTNPIGDED-FCLV---AVLEVITDVLASRNVTEWLQRCGLEGLMEFSSRLYR 2222 Query: 93 XXXSGVELITNQSSICASILQILTLVQKISQ 1 G+ + + ILQIL+ KISQ Sbjct: 2223 ILGGGLVSVQENDTSVDLILQILSATLKISQ 2253 >ref|XP_002869583.1| hypothetical protein ARALYDRAFT_354097 [Arabidopsis lyrata subsp. lyrata] gi|297315419|gb|EFH45842.1| hypothetical protein ARALYDRAFT_354097 [Arabidopsis lyrata subsp. 
lyrata] Length = 2550 Score = 375 bits (963), Expect = e-101 Identities = 208/450 (46%), Positives = 280/450 (62%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GIN +EL LL SYGAT SEIDLEIY L+ DI+ D V+ TD LWG A+ K+R+ Sbjct: 1815 GINLKELHFFLLCSYGATLSEIDLEIYKLMHDIKLIDAEQTLNVSETD-LWGKAALKLRE 1873 Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQK 991 L D +V Q + V + ++ F+EN+ +DPKICA T LFFPY R + L Sbjct: 1874 GLRFKQDASNVGQAEL-VEDVQQSLFKENLCVDPKICASTVLFFPYQRTTEKSDNFYLY- 1931 Query: 990 PSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPDE 811 + + S + ++ YDP FIL FSI L++ YIEP+EFASLGLL+V F S+SS D Sbjct: 1932 -DDPINEKCSPVIEDIERYDPAFILHFSIDSLSVGYIEPVEFASLGLLAVAFVSMSSADL 1990 Query: 810 DMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFAAEASL 631 MRKLGYE L F ALE C+K K L Y+QNG+EEPWQRIP+V A+FAAE SL Sbjct: 1991 GMRKLGYETLQIFLDALENCRKNKHVTGLRLLLMYVQNGVEEPWQRIPTVSAIFAAETSL 2050 Query: 630 ILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNTE 451 ILLDPS+ +Y I+K L +S ++ ++ IPLF FFWS++V F++ R W LRL+C+GL ++ Sbjct: 2051 ILLDPSHEHYVPINKLLQSSSTLKLRGIPLFHDFFWSSAVNFRSQRFWELRLVCLGLKSD 2110 Query: 450 DDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWLS 271 DD Q YI+NSI ET+ISF SSPL+D+E+K LI+QV++K+ HK LVE+CG+ SW S Sbjct: 2111 DDVQIYIKNSILETVISFSSSPLADDETKRLILQVVRKSVKFHKMARHLVENCGLFSWCS 2170 Query: 270 SILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXXX 91 S + + + D+ L +LE++ + + RNI EWLQ+ LE Sbjct: 2171 SFISNFTTKPIGDKD----LHLVVVLEIITDVLASRNITEWLQRFGLEGLMEISSRLYKL 2226 Query: 90 XXSGVELITNQSSICASILQILTLVQKISQ 1 G+ + + ILQIL+ KISQ Sbjct: 2227 LGGGLVSVQANGTSVDLILQILSATLKISQ 2256 >ref|XP_004295819.1| PREDICTED: uncharacterized protein LOC101298301 [Fragaria vesca subsp. 
vesca] Length = 2542 Score = 367 bits (943), Expect = 5e-99 Identities = 209/455 (45%), Positives = 281/455 (61%), Gaps = 5/455 (1%) Frame = -1 Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171 GI+ +E+ LLLSS+GAT +E D+EIYNL+ IE D A DYLWG A+ K+ K Sbjct: 1794 GIDLREVHLLLLSSFGATLNETDVEIYNLMRTIECIDGLEHVKFAGMDYLWGSAALKIEK 1853 Query: 1170 DLEQNNDMQSVDQNSME-VYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQ 994 + + N E V E + + REN+ IDPKICA T L+FPY A L L Sbjct: 1854 ERNLEQSLSYDTMNDAEAVKEYHRNQLRENLSIDPKICASTVLYFPYQLAA-SDELLSLN 1912 Query: 993 KPSSAVMLE----GSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASI 826 K + ++ + D Y+P+FILRFS+HCL+ +IEP+EFA LGLL++ F SI Sbjct: 1913 KFQTDLVDDLPVLNCPDVDTKARYNPIFILRFSMHCLSEGFIEPLEFAGLGLLAIAFMSI 1972 Query: 825 SSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVFA 646 SSP + +R LGYE L + L+ CQK+K L +++NGI++ QRI SV A+FA Sbjct: 1973 SSPSDKIRSLGYETLGTLQDVLKTCQKRKGITEIKLLLLFVENGIQQIGQRISSVNAIFA 2032 Query: 645 AEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCV 466 AE SLILLD S+ +Y T+ L S ++N K +P F FFWS+SV F+++R+W+LR+L V Sbjct: 2033 AETSLILLDTSHEHYATLLTLLKRSSALNTKIVPFFSNFFWSSSVNFRSERLWILRILYV 2092 Query: 465 GLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGV 286 GLN +DDA YI+NSI ETL+SFY SPLSD ESKELI+QV+KK+ LHK LVE CG+ Sbjct: 2093 GLNFDDDAHVYIKNSILETLLSFYGSPLSDKESKELILQVVKKSIKLHKLARHLVEKCGL 2152 Query: 285 ISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXX 106 I WLSS+L G +ED + QL + EVVN ++S RNI EWLQ +ALEQ Sbjct: 2153 IPWLSSLLSISSGSRLED-ETLCFLQLGVVSEVVNDVSS-RNITEWLQNNALEQLMELTS 2210 Query: 105 XXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1 + V L+T+ + IL+ + K+SQ Sbjct: 2211 HLYKFLATDVTLMTDNVTAINRILETIISTFKLSQ 2245 >ref|XP_006413117.1| hypothetical protein EUTSA_v10024185mg [Eutrema salsugineum] gi|557114287|gb|ESQ54570.1| hypothetical protein EUTSA_v10024185mg [Eutrema salsugineum] Length = 2382 Score = 366 bits (940), Expect = 1e-98 Identities = 206/451 (45%), Positives = 281/451 (62%), Gaps = 1/451 (0%) 
 Frame = -1

Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171
            GIN +EL LLL SYGAT SEIDLE+Y L+ DIE D+ V+ T +LWG A+ K+R+
Sbjct: 1645 GINLKELHFLLLCSYGATLSEIDLELYKLMHDIELIDDEHRLNVSETGHLWGKAALKIRE 1704

Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAVYGGNLPKLQK 991
            L + D + +V R F+EN+ +DPK CA T L+FP R + L
Sbjct: 1705 GLRFSQDASDGGEAD-KVENLRHSLFKENLCVDPKRCALTVLYFPNQRTPEVSDNSCLYD 1763

Query: 990  PSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFASISSPDE 811
            P S + ST + +++YDP FIL FS+H L+M YIEP+EFASLGLL+V F S+SS D
Sbjct: 1764 PISK---KCSTVIEDIELYDPAFILPFSVHSLSMRYIEPVEFASLGLLAVAFVSMSSADI 1820

Query: 810  DMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLT-YLQNGIEEPWQRIPSVIAVFAAEAS 634
            MRKLGYE L F ALE C+ K L ++QNG+EE WQRIP+V AVFA+E S
Sbjct: 1821 GMRKLGYETLEIFLDALECCKMNKHVKDGIRLLLLHVQNGVEEQWQRIPTVSAVFASETS 1880

Query: 633  LILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLCVGLNT 454
            LILLD S+ +Y I K L +S ++ ++ IPLF FFWS++ ++ R+W LRLLCVGL +
Sbjct: 1881 LILLDSSHEHYVPIVKFLKSSSTMKLRGIPLFLDFFWSSAFNSRSQRLWELRLLCVGLKS 1940

Query: 453  EDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCGVISWL 274
            +DDA YIRNSI E L+S +SSPL+D+E+K LI+QV++K+ HK V LVE CG+ SWL
Sbjct: 1941 DDDAHIYIRNSILEELMSVFSSPLADDETKGLILQVVRKSVKFHKMVRHLVEKCGLFSWL 2000

Query: 273  SSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXXXXXXX 94
            SS++ + + D+ +L +LEV+ + + RN+ EWLQ+ ALE+
Sbjct: 2001 SSLISTFTTKPIGDED----LRLVVVLEVMTDVLASRNVTEWLQRFALEELMEISSRLYR 2056

Query: 93   XXXSGVELITNQSSICASILQILTLVQKISQ 1
            G+ + ++ ILQIL+ KISQ
Sbjct: 2057 LLGGGLVSVQENGTLVDLILQILSATLKISQ 2087

>ref|XP_002893970.1| hypothetical protein ARALYDRAFT_336756 [Arabidopsis lyrata subsp. lyrata]
 gi|297339812|gb|EFH70229.1| hypothetical protein ARALYDRAFT_336756 [Arabidopsis lyrata subsp.
lyrata]
          Length = 2496

 Score = 358 bits (920), Expect = 2e-96
 Identities = 200/456 (43%), Positives = 282/456 (61%), Gaps = 6/456 (1%)
 Frame = -1

Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171
            GIN +EL L L SYGAT SEIDLE+Y L+ DIE ++ V+ TDYLWG A+ K+R+
Sbjct: 1807 GINLKELRFLFLCSYGATMSEIDLELYKLMHDIELIEDEQRLNVSETDYLWGKAALKIRE 1866

Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAV------YGGN 1009
            L + D + + V +++ F+EN+ IDPKICAQT L+FPY R Y +
Sbjct: 1867 GLRFSQDAYYGGEAGL-VENLQQILFKENLWIDPKICAQTLLYFPYQRTAEVSDNSYISD 1925

Query: 1008 LPKLQKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFAS 829
            P +K S + + YDP +IL FSIH L+M IEP++FAS GLL+V AS
Sbjct: 1926 DPVSEKCSPVI-----------ERYDPAYILPFSIHSLSMGCIEPVKFASSGLLAVALAS 1974

Query: 828  ISSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVF 649
            SS D MRKLGYE L F AL++C+K ++ L +++NG+++ W+RIP+V A F
Sbjct: 1975 TSSADLGMRKLGYETLGIFVHALKRCEKNENVMGLMLLLMHVENGVDKRWKRIPTVCAYF 2034

Query: 648  AAEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLC 469
            AA SLILLD S+ Y I+K L +S ++N+K IPLF FFWS++V ++ R+W LRL+C
Sbjct: 2035 AAVTSLILLDSSHELYAPINKLLKSSSTLNLKGIPLFYDFFWSSTVVLRSQRLWELRLVC 2094

Query: 468  VGLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCG 289
            VGL +EDDAQ YIRNS+ +TL+SF SSPL+D+E+K LI+QV++K+ HK LVE+CG
Sbjct: 2095 VGLESEDDAQLYIRNSVLDTLMSFSSSPLADDETKGLILQVVRKSVKFHKIARHLVENCG 2154

Query: 288  VISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXX 109
            ++ W SS + + D+ ++L +LEV+ + RN+ EWLQ+ ALE+
Sbjct: 2155 LLLWCSSFISMFATKPIGDED----SRLVAVLEVITDTLASRNVTEWLQRSALEELMEIS 2210

Query: 108  XXXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1
            G+ + ++ ILQIL+ KISQ
Sbjct: 2211 SRLYRFLGGGLVSMKENGTLVDLILQILSATLKISQ 2246

>ref|XP_002891275.1| hypothetical protein ARALYDRAFT_314107 [Arabidopsis lyrata subsp. lyrata]
 gi|297337117|gb|EFH67534.1| hypothetical protein ARALYDRAFT_314107 [Arabidopsis lyrata subsp.
lyrata]
          Length = 2475

 Score = 358 bits (918), Expect = 4e-96
 Identities = 200/456 (43%), Positives = 281/456 (61%), Gaps = 6/456 (1%)
 Frame = -1

Query: 1350 GINAQELVHLLLSSYGATCSEIDLEIYNLLLDIESNDESCAGTVARTDYLWGVASAKVRK 1171
            GIN +EL L L SYGAT SEIDLE+Y L+ DIE ++ V+ TDYLWG A+ K+R+
Sbjct: 1760 GINLKELRFLFLCSYGATMSEIDLELYKLMHDIELIEDEQRLNVSETDYLWGKAALKIRE 1819

Query: 1170 DLEQNNDMQSVDQNSMEVYERRKVKFRENIPIDPKICAQTALFFPYDRAV------YGGN 1009
            L + D + + V +++ F+EN+ IDPKICAQT L+FPY R Y +
Sbjct: 1820 GLRFSQDAYYGGEAGL-VENLQQILFKENLWIDPKICAQTLLYFPYQRTAEVSDNSYISD 1878

Query: 1008 LPKLQKPSSAVMLEGSTTTDKLQIYDPVFILRFSIHCLTMSYIEPIEFASLGLLSVTFAS 829
            P +K S + + YDP +IL FSIH L+M IEP++FAS GLL+V AS
Sbjct: 1879 DPVSEKCSPVI-----------ERYDPAYILPFSIHSLSMGCIEPVKFASSGLLAVALAS 1927

Query: 828  ISSPDEDMRKLGYEALAKFKSALEKCQKKKDXXXXXXXLTYLQNGIEEPWQRIPSVIAVF 649
            SS D MRKLGYE L F AL++C+K ++ L +++NG+++ W+RIP+V A F
Sbjct: 1928 TSSADLGMRKLGYETLGIFVHALKRCEKNENVMGLMLLLMHVENGVDKRWKRIPTVCAYF 1987

Query: 648  AAEASLILLDPSNNNYTTISKHLSNSPSVNMKAIPLFRTFFWSTSVTFKADRIWMLRLLC 469
            AA SLILLD S+ Y I+K L +S ++N+K IPLF FFWS++V ++ R+W LRL+C
Sbjct: 1988 AAVTSLILLDSSHELYAPINKLLKSSSTLNLKGIPLFYDFFWSSTVVLRSQRLWELRLVC 2047

Query: 468  VGLNTEDDAQTYIRNSIFETLISFYSSPLSDNESKELIIQVMKKAAVLHKAVWALVEHCG 289
            VGL +EDDAQ YIRNS+ ETL+SF SSPL+D+E+K LI+QV++K+ HK LVE+CG
Sbjct: 2048 VGLESEDDAQLYIRNSVLETLMSFSSSPLADDETKGLILQVVRKSVKFHKIARHLVENCG 2107

Query: 288  VISWLSSILPSLYGGGVEDQQKFALTQLPTILEVVNCITSPRNIVEWLQKHALEQXXXXX 109
            ++ W SS + + D+ ++L +LEV+ + RN+ WLQ+ ALE+
Sbjct: 2108 LLLWCSSFISMFATKPIGDED----SRLVAVLEVITDTLASRNVTVWLQRSALEELMEIS 2163

Query: 108  XXXXXXXXSGVELITNQSSICASILQILTLVQKISQ 1
            G+ + ++ ILQIL+ KISQ
Sbjct: 2164 SRLYRFLGGGLVSVKENGTLVDLILQILSATLKISQ 2199