BLASTX nr result
ID: Akebia26_contig00009702
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00009702
         (1408 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002271655.2| PREDICTED: uncharacterized protein LOC100258...  475  e-131
ref|XP_002515683.1| conserved hypothetical protein [Ricinus comm...  470  e-130
ref|XP_006468170.1| PREDICTED: uncharacterized protein LOC102607...  461  e-127
ref|XP_006431995.1| hypothetical protein CICLE_v100000061mg, par...  461  e-127
ref|XP_002321979.2| hypothetical protein POPTR_0015s01090g [Popu...  458  e-126
ref|XP_002317800.1| hypothetical protein POPTR_0012s02690g [Popu...  456  e-125
gb|ACC64519.1| neuroblastoma-amplified gene [Nicotiana benthamiana]  434  e-119
ref|XP_007220568.1| hypothetical protein PRUPE_ppa000029mg [Prun...  433  e-119
ref|XP_007039145.1| Uncharacterized protein isoform 3 [Theobroma...  432  e-118
ref|XP_007039143.1| Uncharacterized protein isoform 1 [Theobroma...  432  e-118
gb|EXC21398.1| hypothetical protein L484_011840 [Morus notabilis]    426  e-116
gb|AFP55540.1| hypothetical protein [Rosa rugosa]                    426  e-116
ref|XP_004235116.1| PREDICTED: uncharacterized protein LOC101256...  425  e-116
ref|XP_006350502.1| PREDICTED: uncharacterized protein LOC102589...  423  e-115
ref|XP_003602296.1| Neuroblastoma-amplified sequence [Medicago t...  423  e-115
ref|XP_004309107.1| PREDICTED: uncharacterized protein LOC101306...  420  e-115
gb|EYU45288.1| hypothetical protein MIMGU_mgv1a000026mg [Mimulus...  415  e-113
ref|XP_006581664.1| PREDICTED: uncharacterized protein LOC100818...  412  e-112
ref|XP_007136472.1| hypothetical protein PHAVU_009G048100g [Phas... 
411 e-112 ref|XP_006578887.1| PREDICTED: neuroblastoma-amplified sequence-... 410 e-112 >ref|XP_002271655.2| PREDICTED: uncharacterized protein LOC100258836 [Vitis vinifera] Length = 2390 Score = 475 bits (1222), Expect = e-131 Identities = 248/450 (55%), Positives = 317/450 (70%), Gaps = 5/450 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N++L SHVRVYALELMQ I+G N++G AEL SN+ PWE W EL TS SE Sbjct: 1947 RIVMFSDNLELPSHVRVYALELMQFISGGNIKGFSAELKSNILPWEDWHELHFTSKSSET 2006 Query: 182 A-NQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 NQ P+ D SS FTSTLVAL+S++L + IS SIEITPDDL+T+D+AVS F L GAA Sbjct: 2007 TTNQGLPDHADTSSRFTSTLVALKSSQLVAAISSSIEITPDDLLTVDAAVSRFSRLCGAA 2066 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 ++ H D+L A+L EW+GLF R+ E +A D GNNW S+DWDEGWESFQEE+P EK Sbjct: 2067 TTDPHIDALLAVLGEWEGLFVIERDFETSPEAHDTGNNWSSEDWDEGWESFQEEEPAEKE 2126 Query: 539 ----GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLV 706 S S+HPLH CWMEI KKLI SRF+DLLKLIDRSL+KSN +LLDE+DA+SL+Q V Sbjct: 2127 KNKESSFSVHPLHACWMEIFKKLIMQSRFSDLLKLIDRSLTKSNGMLLDEDDAQSLTQTV 2186 Query: 707 IGIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXX 886 +G+DC ALKM+LLLPY+ +QLQC ++VE KLKQGG+S +I D EL Sbjct: 2187 LGVDCFVALKMVLLLPYEAMQLQCANSVEEKLKQGGISDTIGRDHELLLLILSSGIISNI 2246 Query: 887 XXKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFI 1066 +++Y TTFS CY VG+FS QE QL +LK + + ++FR LFPCFI Sbjct: 2247 ITQSSYGTTFSYLCYLVGNFSRQYQEAQLSKLK------HQESNNPILLLFRRTLFPCFI 2300 Query: 1067 SELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMC 1246 SELV A Q +LAG +++FMHT+A+LSLIN+A++SL+RYLE ++ QG E + G C Sbjct: 2301 SELVKADQSILAGLFLTKFMHTNAALSLINIADSSLSRYLERELLALQGKEFDPQETGSC 2360 Query: 1247 TYLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L NTV LR KL + ++SAL++LS+NV+ Sbjct: 2361 DTLGNTVSSLRGKLRNSIESALASLSSNVR 2390 >ref|XP_002515683.1| conserved hypothetical protein [Ricinus communis] gi|223545226|gb|EEF46735.1| conserved 
hypothetical protein [Ricinus communis] Length = 2429 Score = 470 bits (1209), Expect = e-130 Identities = 244/449 (54%), Positives = 312/449 (69%), Gaps = 4/449 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R++ FS N QL SHVRVY LELMQ I GRN++G EL S V PWEGWDEL TS SEI Sbjct: 1981 RMAQFSDNSQLPSHVRVYVLELMQLIRGRNIKGFSTELQSKVLPWEGWDELLSTSIKSEI 2040 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 AN + TDASS TSTLVAL+S++L + ISPSIEITPD+L+ +++AVSCFL L + Sbjct: 2041 NANHLLLHHTDASSQLTSTLVALKSSQLVAAISPSIEITPDNLLNVETAVSCFLKLCDVS 2100 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 +S+ H + L AI+EEW+G F GR+E P + ++A N+W +DDWDEGWESFQE +EK Sbjct: 2101 NSDTHVEVLLAIVEEWEGFFVVGRDEIKPSETTEAVNDWNNDDWDEGWESFQEVDSLEKE 2160 Query: 539 ---GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 S+SI PLH CWMEI KKLIA+SRF D+L+LID SL+KSN +LLDE+ A++LS++++ Sbjct: 2161 KIENSLSIDPLHVCWMEIFKKLIAISRFNDVLRLIDHSLTKSNRILLDEDGAKTLSEVLL 2220 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 +DC ALK++LLLPY+ +Q QCL VE K KQGG+S ++ D E F Sbjct: 2221 EMDCFVALKLVLLLPYEALQFQCLAVVEDKFKQGGISETVGRDHEFFILVLSSKIISVII 2280 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y T FS CY G+ S CQE QL+ + + K ES E F +FR ILFP FIS Sbjct: 2281 TKSSYGTIFSFLCYLAGNLSRQCQESQLFRIMEKEKTESVDTEKDFLFLFRRILFPSFIS 2340 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFLV++FMHT+ASLSL+NVAEASL RYLE Q+ Q E + + C Sbjct: 2341 ELVKADQHILAGFLVTKFMHTNASLSLVNVAEASLARYLERQLHALQHDEFAVDDISSCK 2400 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L+NTV LR KLG+ +QSAL+ L NV+ Sbjct: 2401 LLKNTVSKLRGKLGTGIQSALALLPANVR 2429 >ref|XP_006468170.1| PREDICTED: uncharacterized protein LOC102607684 isoform X1 [Citrus sinensis] gi|568827667|ref|XP_006468171.1| PREDICTED: uncharacterized protein LOC102607684 isoform X2 [Citrus sinensis] gi|568827669|ref|XP_006468172.1| 
PREDICTED: uncharacterized protein LOC102607684 isoform X3 [Citrus sinensis] Length = 2429 Score = 461 bits (1185), Expect = e-127 Identities = 246/449 (54%), Positives = 312/449 (69%), Gaps = 4/449 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N+QL SH+RVY LELMQ I+G N++G ++L SNV PWEGWDE +S SE Sbjct: 1984 RMVKFSENLQLPSHIRVYTLELMQFISGGNIKGFSSDLQSNVLPWEGWDEFLNSSKKSEA 2043 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 A Q + Q D S FT+TLVAL+ST+L + ISPSIEITPDDL +++AVSCFL L GAA Sbjct: 2044 SAIQGSSEQMDTCSRFTNTLVALKSTQLVAAISPSIEITPDDLNNVEAAVSCFLKLCGAA 2103 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 + HFD L AILEEW+GLF R+E ASD N W +DDWDEGWESFQE +P EK Sbjct: 2104 SKDPHFDVLVAILEEWEGLFII-RDEVTSVAASDPENTWNTDDWDEGWESFQEVEPPEKE 2162 Query: 539 G---SVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 S+++HPLH CWMEI KK I +SR D+L++IDRSLSKSN +LLDE+D RSL+++ + Sbjct: 2163 QKDISLAVHPLHICWMEIFKKFITMSRIRDVLRMIDRSLSKSNGILLDEDDVRSLNKIAL 2222 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 G+DC ALKM+LLLPY+ +QL+ L+AVE KLKQGG+S +I D E Sbjct: 2223 GMDCFLALKMVLLLPYKGVQLESLNAVEEKLKQGGISDTIGRDHEFLLLVLSSGIVSTII 2282 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y T FS FC+ VG+ S QE Q L G++E E + FR ILFP FIS Sbjct: 2283 TKSSYGTVFSYFCFLVGNLSRQLQETQFSRLAKGGRDECGNSETDLHL-FRRILFPRFIS 2341 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFL+++FMHT+ASLSLIN+AEASL RYLE Q+Q+ Q +E +F Sbjct: 2342 ELVKADQQILAGFLITKFMHTNASLSLINIAEASLNRYLEKQLQQLQ-HEEAFLYESCSE 2400 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L+NTV LR K+G+L++SALS LS NV+ Sbjct: 2401 TLKNTVSRLRSKMGNLIESALSFLSRNVR 2429 >ref|XP_006431995.1| hypothetical protein CICLE_v100000061mg, partial [Citrus clementina] gi|557534117|gb|ESR45235.1| hypothetical protein CICLE_v100000061mg, partial [Citrus clementina] Length = 1789 Score = 461 bits 
(1185), Expect = e-127 Identities = 246/449 (54%), Positives = 312/449 (69%), Gaps = 4/449 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N+QL SH+RVY LELMQ I+G N++G ++L SNV PWEGWDE +S SE Sbjct: 1344 RMVKFSENLQLPSHIRVYTLELMQFISGGNIKGFSSDLQSNVLPWEGWDEFLNSSKKSEA 1403 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 A Q + Q D S FT+TLVAL+ST+L + ISPSIEITPDDL +++AVSCFL L GAA Sbjct: 1404 SAIQGSSEQMDTCSRFTNTLVALKSTQLVAAISPSIEITPDDLNNVEAAVSCFLKLCGAA 1463 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 + HFD L AILEEW+GLF R+E ASD N W +DDWDEGWESFQE +P EK Sbjct: 1464 SKDPHFDVLVAILEEWEGLFII-RDEVTSVAASDPENTWNTDDWDEGWESFQEVEPPEKE 1522 Query: 539 G---SVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 S+++HPLH CWMEI KK I +SR D+L++IDRSLSKSN +LLDE+D RSL+++ + Sbjct: 1523 QKDISLAVHPLHICWMEIFKKFITMSRIRDVLRMIDRSLSKSNGILLDEDDVRSLNKIAL 1582 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 G+DC ALKM+LLLPY+ +QL+ L+AVE KLKQGG+S +I D E Sbjct: 1583 GMDCFLALKMVLLLPYKGVQLESLNAVEEKLKQGGISDTIGRDHEFLLLVLSSGIVSTII 1642 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y T FS FC+ VG+ S QE Q L G++E E + FR ILFP FIS Sbjct: 1643 TKSSYGTVFSYFCFLVGNLSRQLQETQFSRLAKGGRDECGNSETDLHL-FRRILFPRFIS 1701 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFL+++FMHT+ASLSLIN+AEASL RYLE Q+Q+ Q +E +F Sbjct: 1702 ELVKADQQILAGFLITKFMHTNASLSLINIAEASLNRYLEKQLQQLQ-HEEAFLYESCSE 1760 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L+NTV LR K+G+L++SALS LS NV+ Sbjct: 1761 TLKNTVSRLRSKMGNLIESALSFLSRNVR 1789 >ref|XP_002321979.2| hypothetical protein POPTR_0015s01090g [Populus trichocarpa] gi|550321714|gb|EEF06106.2| hypothetical protein POPTR_0015s01090g [Populus trichocarpa] Length = 2421 Score = 458 bits (1178), Expect = e-126 Identities = 243/449 (54%), Positives = 307/449 (68%), Gaps = 4/449 (0%) Frame = +2 Query: 2 
RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R++ FS+N++L SHVRVY LE+MQ ITGRN++G P EL SN+ WEGWD L TS SE Sbjct: 1976 RMAQFSNNLELPSHVRVYVLEIMQFITGRNIKGFPTELESNLLSWEGWDGLISTSKKSET 2035 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ANQ P+ D SS FTSTLVAL+S++LAS ISP IEITPDDL+ I++AVSCFL L ++ Sbjct: 2036 SANQGLPDHIDTSSRFTSTLVALKSSQLASSISPRIEITPDDLVNIETAVSCFLKLCASS 2095 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 +E HFD+L ILEEW+G F + ++E D ++A N W +D WDEGWESFQ+E+ EK Sbjct: 2096 CTEPHFDALIGILEEWEGFFVTAKDEVD---TTEAENCWSNDGWDEGWESFQDEEAPEKE 2152 Query: 539 ---GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 S +HPLH CWMEIIKKLI LS+F D+ +LIDRSLSK+ +LLDE+DARSLSQ V+ Sbjct: 2153 KTENSNHVHPLHVCWMEIIKKLIGLSQFKDVSRLIDRSLSKTYGILLDEDDARSLSQAVL 2212 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 D ALKM+LLLPY+ IQLQCLD VE KLKQGG+S D E Sbjct: 2213 EKDSFMALKMVLLLPYEAIQLQCLDVVEDKLKQGGISDLAGRDHEFLMLVLSSGVISTII 2272 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K +YSTTFS CY VG+FS QE Q + ++G E E ++FR I+FPCFIS Sbjct: 2273 AKPSYSTTFSYLCYLVGNFSRQSQEAQSSTIMNKGTNEHVNTEKDVLLLFRRIMFPCFIS 2332 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV Q +LAGFL+++FMHT+ SLSLIN+ EASL+RYLE Q+ Q + S ++ C Sbjct: 2333 ELVKGDQQILAGFLITKFMHTNPSLSLINITEASLSRYLERQLHALQQADFSAEEIISCE 2392 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 +NTV L KL L+QSAL +S+N + Sbjct: 2393 MFKNTVSRLTIKLQDLIQSALPLISSNAR 2421 >ref|XP_002317800.1| hypothetical protein POPTR_0012s02690g [Populus trichocarpa] gi|222858473|gb|EEE96020.1| hypothetical protein POPTR_0012s02690g [Populus trichocarpa] Length = 2414 Score = 456 bits (1172), Expect = e-125 Identities = 240/450 (53%), Positives = 306/450 (68%), Gaps = 5/450 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSE- 178 R++ FS N++L SHVRVY LE+MQ ITGR+++G EL SN+ PWEGWD L T S Sbjct: 1965 
RMAQFSDNLELPSHVRVYVLEIMQFITGRSIKGFSTELNSNLLPWEGWDGLLSTGKKSNP 2024 Query: 179 IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ANQ +P+ TD SS FTSTLVALRS++LAS ISPSI ITPDDL+ ++AVSCFL L ++ Sbjct: 2025 SANQGSPDHTDNSSRFTSTLVALRSSQLASAISPSIAITPDDLLNAETAVSCFLKLCESS 2084 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 +E HFD+L ILEEW+G F + ++E D +A++ GN+W +DDWDEGWESFQE + +EK Sbjct: 2085 STEPHFDALIGILEEWEGFFVTAKDEVDTTEATETGNDWNNDDWDEGWESFQEVEALEKE 2144 Query: 539 ---GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 S +HPLH CWMEI KKLI LS+F D+L+LID SLSKS +LLDE+DARSLS V+ Sbjct: 2145 KPENSNHVHPLHVCWMEIFKKLITLSKFKDVLRLIDCSLSKSYGILLDEDDARSLSHTVL 2204 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 D ALKM LLLPY+ IQLQCL+ VE KLKQGG+S + D E+ Sbjct: 2205 EKDSFMALKMGLLLPYEAIQLQCLNVVEDKLKQGGISGVLGRDHEVLMLVLSSGVISNII 2264 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K +Y TTFS CY VG+FS QE QL + ++G E E ++F I+FPCFIS Sbjct: 2265 TKPSYGTTFSYLCYVVGNFSRQSQEAQLSTITNKGANERVNIEKDVLLLFIRIMFPCFIS 2324 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQK-QQGYEPSFGKMGMC 1246 ELV Q +LAGFL+++FMHT+ S SLIN E+SL+RYLE Q+ QQG S ++ C Sbjct: 2325 ELVKTDQQILAGFLITKFMHTNPSFSLINTTESSLSRYLERQLHALQQGDYFSLEEISSC 2384 Query: 1247 TYLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 NTV L +KLG ++SAL LS+N + Sbjct: 2385 EMFRNTVSRLTNKLGDEIRSALPLLSSNAR 2414 >gb|ACC64519.1| neuroblastoma-amplified gene [Nicotiana benthamiana] Length = 2409 Score = 434 bits (1116), Expect = e-119 Identities = 226/446 (50%), Positives = 294/446 (65%), Gaps = 5/446 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSI--TGRNLRGLPAELLSNVQPWEGWDELDCTSAGS 175 RL FS N L +HVRVY LELMQ I T +N +G + L V WEGW+ L +A Sbjct: 1960 RLEEFSENFHLSNHVRVYMLELMQLIAATDKNSKGFSSGLEVEVHSWEGWENLHSATANR 2019 Query: 176 E-IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSG 352 E A + DAS+ FT+TL+AL+ST+L S ISPSIEITP+DL T++S VSCFL +S Sbjct: 2020 
ENTAADGISKKLDASNKFTNTLIALKSTQLVSTISPSIEITPEDLSTVESTVSCFLGVSK 2079 Query: 353 AAHSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQE--EQP 526 A SE H ++L A+L EW+G FT G E+D G+ SD GN+W +DDWDEGWESFQE E+ Sbjct: 2080 FAESESHVETLLAMLREWEGQFTRGETEKDSGEISDGGNSWSNDDWDEGWESFQEPIERE 2139 Query: 527 VEKGGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLV 706 +K +S+HPLH CWMEI +KL+ S++ +LKL+D+SL+K VLLDE +A+ LSQ+ Sbjct: 2140 PKKDAELSVHPLHVCWMEIFRKLLTTSQYNKMLKLLDKSLAKPGEVLLDEENAQGLSQIA 2199 Query: 707 IGIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXX 886 +G+DC ALK+MLLLPY+ +QL CLD VE KLKQ G+S IS D E Sbjct: 2200 LGVDCFLALKLMLLLPYEVVQLHCLDIVEQKLKQEGISDKISMDLEFLVLVLSSGVISTI 2259 Query: 887 XXKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFI 1066 K +Y T FS CY VG+FS CQ+ QL ++ G ES +F ++FPCF+ Sbjct: 2260 ITKPSYGTIFSYLCYMVGNFSRWCQDSQLSDVGCGGSVESENIPKDHIDLFTRLVFPCFV 2319 Query: 1067 SELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMC 1246 SELV + Q +LAGFLV++FMHT+ SLSLIN+A A LT+YLE QIQ Q PS+ + Sbjct: 2320 SELVRSGQQILAGFLVAKFMHTNPSLSLINIAGACLTKYLERQIQILQEGNPSWDSVKFS 2379 Query: 1247 TYLENTVFCLRDKLGSLVQSALSTLS 1324 L NTV LRD++ +L+QS+LS LS Sbjct: 2380 NPLLNTVSSLRDRMENLIQSSLSLLS 2405 >ref|XP_007220568.1| hypothetical protein PRUPE_ppa000029mg [Prunus persica] gi|462417030|gb|EMJ21767.1| hypothetical protein PRUPE_ppa000029mg [Prunus persica] Length = 2361 Score = 433 bits (1114), Expect = e-119 Identities = 230/449 (51%), Positives = 301/449 (67%), Gaps = 4/449 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R++ FS N+QL VRV LELMQ +TG++ +GL A + S+V PWEGWDE+ S SE Sbjct: 1917 RMAKFSDNLQLPGSVRVCTLELMQFLTGKSTKGLSASIQSSVMPWEGWDEVHFMSNKSET 1976 Query: 182 ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAAH 361 +Q + D + FTSTLVAL+S++L + ISP++EIT DDL ++ AVSCFL L A Sbjct: 1977 TDQGLVDHNDTPNRFTSTLVALKSSQLVATISPTLEITSDDLSNLEKAVSCFLKLCDVAQ 2036 Query: 362 SEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPV--EK 535 S H SL 
A+L EW+G F +++ +ASDAGN+W +++WDEGWESFQE +P EK Sbjct: 2037 SYSHVGSLLAMLGEWEGFFLVREDKKPSVEASDAGNDW-NENWDEGWESFQELEPPVKEK 2095 Query: 536 GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVIGI 715 S SIHPLH CW+EI KKL+ LS+F D+L+LID+SL KSN +LLDE+ ARSLSQ+V+ Sbjct: 2096 ESSFSIHPLHACWLEIFKKLVMLSQFKDVLRLIDQSLLKSNGILLDEDGARSLSQIVLER 2155 Query: 716 DCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXXXK 895 DC TALK++LLLP++T+QLQCL AVE KLKQGG+S SI GD EL Sbjct: 2156 DCFTALKLVLLLPFETLQLQCLAAVEDKLKQGGISDSIGGDHELLMLVLFSGVLPTIISN 2215 Query: 896 TTYSTTFSCFCYSVGHFSHLCQ--EIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 ++Y T SC CY VG+ SH Q +Q L +GK + + + + +VFR +LFPCFIS Sbjct: 2216 SSYGNTLSCICYLVGNLSHKFQAARLQNERLVQKGKGGCKEENESWLLVFRRMLFPCFIS 2275 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q LLAG +V++FMHT+ASL L+NVAEASL R+LE Q+ G + Sbjct: 2276 ELVKADQQLLAGLIVTKFMHTNASLGLVNVAEASLGRFLEVQL---HGLHDPLDETRSQE 2332 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L+N V LR KL +L+Q ALS LS N + Sbjct: 2333 TLKNVVSSLRGKLENLIQGALSLLSTNAR 2361 >ref|XP_007039145.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508776390|gb|EOY23646.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 1979 Score = 432 bits (1112), Expect = e-118 Identities = 231/447 (51%), Positives = 299/447 (66%), Gaps = 4/447 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSE- 178 R+++FS ++QL SHVRVYALELMQ ITG ++GL +EL NV PW GWD+ C S ++ Sbjct: 1529 RIASFSEDLQLASHVRVYALELMQFITGTTMKGLSSELQLNVHPWVGWDDSLCGSNKTQS 1588 Query: 179 IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 +N+ P QTD SS FTSTLVAL+S++L + ISP IEIT DDL+ +++AVSCFL L A Sbjct: 1589 TSNEGLPEQTDTSSRFTSTLVALKSSQLMAAISPGIEITSDDLLNVETAVSCFLKLCEVA 1648 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 ++ HF+ L AILEEW+GLF EE SDA N W +DDWDEGWESFQE +P EK Sbjct: 1649 NAAPHFNVLVAILEEWEGLFVIKTEEVASAVFSDAENIWSNDDWDEGWESFQEVEPSEKE 1708 Query: 539 
GS---VSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 + +HPLH CW+EI++ L+ S+F D+LKLID+S +KS VLLDE ARSL+ V+ Sbjct: 1709 KKEDLLLVHPLHECWIEILRSLVKASQFRDVLKLIDQSTTKSGGVLLDEGGARSLNDSVL 1768 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 G+DC ALKMMLLLPY+ +QL+ L A+E+KLKQ G S+ I D E Sbjct: 1769 GVDCFVALKMMLLLPYKGLQLESLSALENKLKQEGTSNMIGSDHEFLMLVLSSGVLSTVI 1828 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y T FS CY VG+FS QE QL +L + E +E +F ILFP FIS Sbjct: 1829 NKSSYVTVFSYVCYLVGNFSRQFQEAQLSKLGKKRSNERGNNEGDTLFLFARILFPMFIS 1888 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV ++Q +LAGFLV++FMHT+ SL LIN+AEASL RYL Q+ + + + +MG C Sbjct: 1889 ELVKSEQQVLAGFLVTKFMHTNVSLGLINIAEASLRRYLARQLHVLEHDKFAPEEMGSCE 1948 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNN 1330 L+ TV LR KLG+ +QSALS L N Sbjct: 1949 TLKYTVSSLRGKLGNSLQSALSLLPRN 1975 >ref|XP_007039143.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590674353|ref|XP_007039144.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776388|gb|EOY23644.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508776389|gb|EOY23645.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 2432 Score = 432 bits (1112), Expect = e-118 Identities = 231/447 (51%), Positives = 299/447 (66%), Gaps = 4/447 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSE- 178 R+++FS ++QL SHVRVYALELMQ ITG ++GL +EL NV PW GWD+ C S ++ Sbjct: 1982 RIASFSEDLQLASHVRVYALELMQFITGTTMKGLSSELQLNVHPWVGWDDSLCGSNKTQS 2041 Query: 179 IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 +N+ P QTD SS FTSTLVAL+S++L + ISP IEIT DDL+ +++AVSCFL L A Sbjct: 2042 TSNEGLPEQTDTSSRFTSTLVALKSSQLMAAISPGIEITSDDLLNVETAVSCFLKLCEVA 2101 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 ++ HF+ L AILEEW+GLF EE SDA N W +DDWDEGWESFQE +P EK Sbjct: 2102 NAAPHFNVLVAILEEWEGLFVIKTEEVASAVFSDAENIWSNDDWDEGWESFQEVEPSEKE 
2161 Query: 539 GS---VSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 + +HPLH CW+EI++ L+ S+F D+LKLID+S +KS VLLDE ARSL+ V+ Sbjct: 2162 KKEDLLLVHPLHECWIEILRSLVKASQFRDVLKLIDQSTTKSGGVLLDEGGARSLNDSVL 2221 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 G+DC ALKMMLLLPY+ +QL+ L A+E+KLKQ G S+ I D E Sbjct: 2222 GVDCFVALKMMLLLPYKGLQLESLSALENKLKQEGTSNMIGSDHEFLMLVLSSGVLSTVI 2281 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y T FS CY VG+FS QE QL +L + E +E +F ILFP FIS Sbjct: 2282 NKSSYVTVFSYVCYLVGNFSRQFQEAQLSKLGKKRSNERGNNEGDTLFLFARILFPMFIS 2341 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV ++Q +LAGFLV++FMHT+ SL LIN+AEASL RYL Q+ + + + +MG C Sbjct: 2342 ELVKSEQQVLAGFLVTKFMHTNVSLGLINIAEASLRRYLARQLHVLEHDKFAPEEMGSCE 2401 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNN 1330 L+ TV LR KLG+ +QSALS L N Sbjct: 2402 TLKYTVSSLRGKLGNSLQSALSLLPRN 2428 >gb|EXC21398.1| hypothetical protein L484_011840 [Morus notabilis] Length = 2817 Score = 426 bits (1095), Expect = e-116 Identities = 222/440 (50%), Positives = 300/440 (68%), Gaps = 4/440 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 RL+ FS ++Q+ VRVY LELMQ +TGRN++G E+ SNV PWEGWDE+ TS SE Sbjct: 1982 RLAKFSDDLQIPGSVRVYVLELMQFLTGRNMKGFSTEIHSNVVPWEGWDEVHFTSEQSET 2041 Query: 182 A-NQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 + NQ + D S TSTL+AL+S++LA+ ISP+IEITPDDL T+++AVSCF LS + Sbjct: 2042 SGNQGLADHNDTSCRVTSTLIALKSSQLAASISPTIEITPDDLSTVETAVSCFSKLSDVS 2101 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEK- 535 H++ H SL A+L EW+GLF + +EE +ASDAGN W DDWDEGWESFQ+ +P EK Sbjct: 2102 HTDSHIYSLVAVLGEWEGLFMAKHDEEASLEASDAGNAWNGDDWDEGWESFQDIEPPEKE 2161 Query: 536 --GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 G S+HPLH CW+EI KKL+ LSRF D+L+L+D +SN +LLDE+ ARSL+++V+ Sbjct: 2162 KTGSVPSLHPLHICWLEIFKKLVTLSRFRDVLRLLD----QSNGILLDEDGARSLTEVVL 2217 Query: 710 
GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 +DC+ ALK++LLLPY+ ++L+CL AVE KL++GG S I D + Sbjct: 2218 QMDCLMALKLVLLLPYEALRLRCLAAVEDKLRRGGFSDPIGQDHDFLVLISSSGLLSSII 2277 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 K++Y TTFS CY VG+FSH CQ QL L G ES D ++FR I+FP FIS Sbjct: 2278 SKSSYGTTFSYICYLVGNFSHKCQAAQLSGLVPEGSAESERD----LLLFRRIVFPSFIS 2333 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q LLAG +V++FMHT+ASLSL+N+AE+SL R+LE Q+ + + + + Sbjct: 2334 ELVKADQQLLAGLVVTKFMHTNASLSLVNIAESSLIRFLERQLHQLRHDKLALFDASSHE 2393 Query: 1250 YLENTVFCLRDKLGSLVQSA 1309 L+NTV L D+L ++V+ A Sbjct: 2394 TLKNTVSGLMDRLETVVEGA 2413 >gb|AFP55540.1| hypothetical protein [Rosa rugosa] Length = 2445 Score = 426 bits (1094), Expect = e-116 Identities = 227/452 (50%), Positives = 303/452 (67%), Gaps = 3/452 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R++ FS N QL +RV+ALELMQ +TG+N++G A + S+V PWEGWDE+ T+ SE Sbjct: 1969 RMAEFSDNPQLPGSIRVFALELMQYLTGKNIKGFSAGIQSSVIPWEGWDEVHFTNKKSET 2028 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ANQ + + D S+ FTSTLVAL+S++L + ISP++EITPDDL+ +++AVSCFL L A Sbjct: 2029 TANQGSADHNDRSNRFTSTLVALKSSQLVANISPTMEITPDDLLNLETAVSCFLKLCDVA 2088 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 + H +SL A+L EW+G F ++E + SDAGN+W D+WDEGWESFQE P EK Sbjct: 2089 QNYSHVESLLAVLGEWEGFFLVRDDKEASVEVSDAGNDWTEDNWDEGWESFQEVGPSEKE 2148 Query: 539 --GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVIG 712 S+SI+PLH CW+ I KKLI LS F +L+LIDRSL KS +LLDE A+SLSQ+V+ Sbjct: 2149 KESSISINPLHVCWLAIFKKLITLSHFKVVLRLIDRSLIKSGGILLDEEGAKSLSQIVLE 2208 Query: 713 IDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXXX 892 IDC ALK++LLLP++ +QLQCL AVE KLKQGG+S +I GD E Sbjct: 2209 IDCFMALKLVLLLPFKPLQLQCLAAVEDKLKQGGISDTIGGDIEFLMLVLFSGVVSSIIS 2268 Query: 893 KTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFISE 1072 ++Y TFS CY 
VG+ SH CQ QL + +G +E ++FR +LFPCFISE Sbjct: 2269 NSSYGNTFSYICYLVGNLSHKCQAAQLQNQRQKGNSALGENERS-LLLFRRVLFPCFISE 2327 Query: 1073 LVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCTY 1252 LV Q LLAG +V++FMHT+ASLSL+N+AEASL R+LE Q+ + + + Sbjct: 2328 LVKGDQQLLAGLVVTKFMHTNASLSLVNIAEASLGRFLEVQLNVLHD-KSTPDETHSQDA 2386 Query: 1253 LENTVFCLRDKLGSLVQSALSTLSNNVK*VVV 1348 L+NT+ LR K+ +L++ ALS LS NV V V Sbjct: 2387 LQNTISSLRGKMENLIRHALSLLSTNVDIVFV 2418 >ref|XP_004235116.1| PREDICTED: uncharacterized protein LOC101256264 [Solanum lycopersicum] Length = 2425 Score = 425 bits (1092), Expect = e-116 Identities = 220/449 (48%), Positives = 299/449 (66%), Gaps = 6/449 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSI--TGRNLRGLPAELLSNVQPWEGWDELDCTSAGS 175 RL FS N QL +HVRVY LELMQ I T ++ + ++L V WEGWD +A Sbjct: 1975 RLEEFSENFQLPNHVRVYILELMQLIAATDKSSKRFSSKLQVEVHSWEGWDNTHNVTANC 2034 Query: 176 E-IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSG 352 E A N+ D S+ FT+TL+AL+ST+L S ISP+IEI P+DL T++S VSCFL +S Sbjct: 2035 ENTATDGISNKIDTSNKFTNTLIALKSTQLVSTISPNIEIRPEDLSTVESTVSCFLGVSK 2094 Query: 353 AAHSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQE--EQP 526 A SE H D+L A+L EW+G F+ E+D G+ SD GN+WG+DDWDEGWESFQE E+ Sbjct: 2095 FAESESHVDALLAMLREWEGHFSREEMEKDSGEVSDGGNSWGNDDWDEGWESFQEPNEEE 2154 Query: 527 VEKGGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLV 706 +KG +S+HPLH CWMEI +KL+ +S++ +LKL+D+S++K VLLDE A+ LSQ+ Sbjct: 2155 PKKGAKLSVHPLHVCWMEIFRKLLTISQYNKMLKLLDKSVAKPGEVLLDEESAQGLSQIA 2214 Query: 707 IGIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXX 886 + IDC ALK+MLLLPY+ +QLQCL++VE KLKQ G+S I D E Sbjct: 2215 VEIDCFLALKLMLLLPYEVMQLQCLESVEQKLKQEGISDKIGVDLEFLLLILSSGVISTI 2274 Query: 887 XXKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFI 1066 K++Y TTFS C+ VG+FS CQE QL ES + + +F ++FPCF+ Sbjct: 2275 ITKSSYGTTFSYICFMVGNFSRQCQESQLSSSGCGESAESESISKYYIDLFPRLIFPCFV 2334 Query: 1067 SELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSF-GKMGM 1243 SELV + Q +LAGFLV++ 
MH++ SLSLIN+A A LT+YLE QIQ+Q PSF +G Sbjct: 2335 SELVRSGQQVLAGFLVTKLMHSNPSLSLINIAGACLTKYLERQIQQQHDSNPSFRDGVGS 2394 Query: 1244 CTYLENTVFCLRDKLGSLVQSALSTLSNN 1330 L NT+ LRD++ +L+QS+L++LS++ Sbjct: 2395 SEPLVNTISSLRDRMQNLIQSSLASLSHD 2423 >ref|XP_006350502.1| PREDICTED: uncharacterized protein LOC102589454 [Solanum tuberosum] Length = 2409 Score = 423 bits (1087), Expect = e-115 Identities = 222/449 (49%), Positives = 297/449 (66%), Gaps = 6/449 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSI--TGRNLRGLPAELLSNVQPWEGWDELDCTSAGS 175 RL FS N QL +HVRVY LELMQ I T ++ + ++L V WEGW+ L +A Sbjct: 1959 RLEEFSENFQLPNHVRVYILELMQLIAATDKSSKRFSSKLQVEVHSWEGWENLHNATANC 2018 Query: 176 E-IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSG 352 E A N+ D S+ FT+TL+AL+ST+L S ISP+IEITP+DL T++S VSCFL +S Sbjct: 2019 ENTATDGISNKIDTSNKFTNTLIALKSTQLVSTISPNIEITPEDLSTVESTVSCFLGVSK 2078 Query: 353 AAHSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQE--EQP 526 A SE H D+L A+L EW+G F+ E+D G+ SD GN WG+DDWDEGWESFQE E+ Sbjct: 2079 FAESESHVDALLAMLREWEGHFSREEIEKDSGEVSDGGNCWGNDDWDEGWESFQEPIEEE 2138 Query: 527 VEKGGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLV 706 +KG +S+HPLH CWMEI +KL+ +S++ +LKL+D+S++K VLLD+ +A+ LSQ Sbjct: 2139 PKKGAKLSVHPLHVCWMEIFRKLLTISQYNKMLKLLDKSVAKPGEVLLDKENAQGLSQTA 2198 Query: 707 IGIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXX 886 + IDC ALK+MLLLPY+ IQLQCL++VE KLKQ G+S I D E Sbjct: 2199 VEIDCFLALKLMLLLPYEVIQLQCLESVEQKLKQEGISDKIGVDLEFLLLVLSSGVISTI 2258 Query: 887 XXKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFI 1066 K +Y TTFS C+ VG+FS CQE QL ES + + +F ++FPCF+ Sbjct: 2259 ITKPSYGTTFSYICFMVGNFSRQCQESQLSSSGRGESAESESISKDYIDLFPRLIFPCFV 2318 Query: 1067 SELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSF-GKMGM 1243 SELV + Q +LAGFLV++ MHT+ SLSLIN+A A LT+YLE QIQ PSF +G Sbjct: 2319 SELVRSGQQVLAGFLVTKLMHTNPSLSLINIAGACLTKYLERQIQILHDSNPSFRDGVGS 2378 Query: 1244 CTYLENTVFCLRDKLGSLVQSALSTLSNN 1330 L NT+ LRD++ +L+QS+LS+LS++ Sbjct: 2379 
SEPLVNTISSLRDRMQNLIQSSLSSLSHD 2407 >ref|XP_003602296.1| Neuroblastoma-amplified sequence [Medicago truncatula] gi|355491344|gb|AES72547.1| Neuroblastoma-amplified sequence [Medicago truncatula] Length = 2401 Score = 423 bits (1087), Expect = e-115 Identities = 227/449 (50%), Positives = 304/449 (67%), Gaps = 4/449 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDE-LDCTSAGSE 178 ++ FS N+QL S +RVY LELMQ I+G+N++G E+L+NVQPWE WDE L + G Sbjct: 1950 KMVKFSDNLQLPSSIRVYVLELMQFISGKNIKGFSTEILANVQPWEDWDESLYASRKGET 2009 Query: 179 IANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ++ +P+ D+SS FT+TLVAL+S++L + ISPSIEITPDDL+ +D+AVSCFL L G A Sbjct: 2010 GVDKESPDHKDSSSRFTNTLVALKSSQLLTSISPSIEITPDDLLNVDTAVSCFLRLCGEA 2069 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 + HFD+L +ILEEW+GLFT G++ E +ASD GN+W +DDWDEGWES +E EK Sbjct: 2070 IEDPHFDALVSILEEWEGLFTMGKDGEITTEASDGGNDWNNDDWDEGWESLEEVDKPEKE 2129 Query: 539 ---GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 SVS+HPLH CW EI++K ++LSRF+D+L+LID+S SK N +LLDE+DA L+++ + Sbjct: 2130 KIVDSVSVHPLHVCWAEILRKFMSLSRFSDVLRLIDQSSSKPNGMLLDEDDATRLNEIAL 2189 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 +DC ALKM L+LPY+T+QLQCL AVE ++Q G+ + S D EL Sbjct: 2190 SMDCFLALKMSLMLPYKTLQLQCLGAVEDSVRQ-GIPQTRSKDCELLILILSSGILTSIA 2248 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 +TY TTFS CY VG+ S+ CQ+ RG S E+ F FR ILFP FI+ Sbjct: 2249 TGSTYGTTFSYLCYMVGNLSNRCQQAL---ASGRGFTNSEDSENQF---FRRILFPNFIT 2302 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGF+V++FMHT SL+LI++A ASL RYLE Q+ Q E +M C Sbjct: 2303 ELVKADQHVLAGFIVTKFMHTSESLNLISIANASLNRYLERQLHMLQANEFQV-EMECCK 2361 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L NTV LR +L +L+QS L LS ++K Sbjct: 2362 TLRNTVSRLRGRLINLIQSTLPLLSCSLK 2390 >ref|XP_004309107.1| PREDICTED: uncharacterized protein LOC101306190 [Fragaria vesca subsp. 
vesca] Length = 2397 Score = 420 bits (1080), Expect = e-115 Identities = 220/448 (49%), Positives = 297/448 (66%), Gaps = 3/448 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R++ FS N+QL RVYALELMQ +TG+N +G A + SN+ PWEGWDE+ T+ SE Sbjct: 1961 RMAEFSDNLQLPGSTRVYALELMQYLTGKNSKGFSAAIQSNIIPWEGWDEMRLTNKKSET 2020 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 AN+ + +D S+ FTSTLVAL+S++L + ISP++EITPDD+ +++AVSCF + A Sbjct: 2021 TANEGLADNSDKSNRFTSTLVALKSSQLVANISPTMEITPDDIQNLETAVSCFQKMCDVA 2080 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEKG 538 + H +SL A+L EW+G F ++E + SDAGN W D+WDEGWESFQE Sbjct: 2081 QNYSHVESLLAVLGEWEGFFLVREDKEASVQVSDAGNEWTGDNWDEGWESFQES------ 2134 Query: 539 GSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVIGID 718 S+SI+PLH CW+ I KKL+ LS F D+L+LID+SL K + +LLDE ARSLSQ+ + ID Sbjct: 2135 -SISINPLHVCWLAIFKKLVMLSHFKDVLRLIDQSLLKDSGILLDEEGARSLSQIFLEID 2193 Query: 719 CITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXXXKT 898 C ALK++LLLP++ +Q QCL AVE KLKQ G+S ++ GD EL + Sbjct: 2194 CFMALKLVLLLPFKPLQEQCLAAVEDKLKQAGISDTMGGDLELLMLVLFSGVLSSIISDS 2253 Query: 899 TYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFISELV 1078 +Y FS CY VG+ SH CQ QL + +G +E ++FRT+LFPCFISELV Sbjct: 2254 SYGNMFSYICYLVGNLSHKCQAAQLQNQRRKGNSALGENERA-LLLFRTVLFPCFISELV 2312 Query: 1079 NAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFG--KMGMCTY 1252 Q LLAG +V++FMHT+ASLSL+N+AEASL R+LE Q+ G +F + Sbjct: 2313 KGDQQLLAGLVVTKFMHTNASLSLVNIAEASLGRFLEVQL---NGLHDNFNLDETHSQDA 2369 Query: 1253 LENTVFCLRDKLGSLVQSALSTLSNNVK 1336 L+NT+ LRDK+ +L+Q ALSTLS NV+ Sbjct: 2370 LQNTISSLRDKMENLIQDALSTLSTNVR 2397 >gb|EYU45288.1| hypothetical protein MIMGU_mgv1a000026mg [Mimulus guttatus] Length = 2381 Score = 415 bits (1066), Expect = e-113 Identities = 218/453 (48%), Positives = 298/453 (65%), Gaps = 8/453 (1%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGR--NLRGLPAELLSNVQPWEGWDELDCTSAGS 175 R+STFS N+QL 
SH+RVYALELMQ I+GR NL+ +E + + PWE WD+L + Sbjct: 1929 RMSTFSDNLQLPSHLRVYALELMQFISGRKRNLKVFSSEGPTYLLPWEAWDDLQDRTIDH 1988 Query: 176 EIANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGA 355 E + D+SS F+STLVAL+S++L ISP +EITP+D++++DSAVSCFL +S + Sbjct: 1989 ENTSDDPTVVKDSSSRFSSTLVALKSSQLLLSISPGLEITPEDILSVDSAVSCFLRVSES 2048 Query: 356 AHSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEK 535 A + H SL A+L EW+GLFT+ ++ D +A DA NNW SDDWDEGWESFQEE +EK Sbjct: 2049 ATTPFHISSLLAVLAEWEGLFTARVDDGDSAEAPDAVNNWSSDDWDEGWESFQEESSIEK 2108 Query: 536 ------GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLS 697 ++SIHPLH CWM ++KK++ S D+LKL+D++ K+ VLLD+ND R L+ Sbjct: 2109 ETKESNNNTLSIHPLHICWMTVLKKMVKFSSQTDILKLLDQNAGKNCGVLLDDNDTRILT 2168 Query: 698 QLVIGIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXX 877 Q + +DC ALKM LLLPY+ IQLQCLDAVE+KLK+GG+S I+ D F Sbjct: 2169 QNALEMDCFLALKMTLLLPYEAIQLQCLDAVENKLKEGGISEDIAHDHFFFVLVLSSGIL 2228 Query: 878 XXXXXKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFP 1057 + +Y TTFS C+ VG+F QE + +K +ED +F ++FP Sbjct: 2229 PNIITEASYGTTFSYLCFMVGNFCRQFQEARASTIKHGPSIGGERNEDKLDFLFVKLVFP 2288 Query: 1058 CFISELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKM 1237 CFI+ELV A Q + AGFLV++FMH +ASLSLIN+AE++L +YLE Q ++ Q + S+ Sbjct: 2289 CFIAELVKANQHISAGFLVTKFMHMNASLSLINIAESTLRKYLERQFEEVQERKSSWENS 2348 Query: 1238 GMCTYLENTVFCLRDKLGSLVQSALSTLSNNVK 1336 C L NTV LR K +L+QSALS+L +V+ Sbjct: 2349 SFCEPLVNTVANLRGKFENLIQSALSSLPTDVR 2381 >ref|XP_006581664.1| PREDICTED: uncharacterized protein LOC100818814 [Glycine max] Length = 2393 Score = 412 bits (1060), Expect = e-112 Identities = 228/448 (50%), Positives = 297/448 (66%), Gaps = 4/448 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N+QL S VRV+ LELMQ I+G+N++G AE+L+NVQPWE W+EL S SE Sbjct: 1952 RMVQFSDNLQLPSSVRVFVLELMQFISGKNIKGFSAEILANVQPWEEWNELIYASRKSET 2011 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ++ P+ D+SS 
T+TLVAL+S++L + ISPSIEITPDDL+ D+AVSCF+ L G A Sbjct: 2012 DVDKHLPDHKDSSSRVTNTLVALKSSQLVASISPSIEITPDDLLNADTAVSCFMRLCGEA 2071 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEK- 535 + HFD+L ILEEW LFT+G++ E +ASD GN+W +DDWDEGWE+ E EK Sbjct: 2072 SEDLHFDALLTILEEWDELFTAGKDGETTAEASDGGNDWNNDDWDEGWENLVEVDNPEKE 2131 Query: 536 --GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 SV +HPLH CW EI++K I+LSRF D+L+LID+S K N++LLDE+DA SL+++ + Sbjct: 2132 KIEDSVFVHPLHLCWAEILRKFISLSRFTDVLRLIDQSSLKPNAMLLDEDDASSLTRIAL 2191 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 GIDC ALKM LLLPY+T+QLQCL AVE +Q G+ + S D EL Sbjct: 2192 GIDCFLALKMTLLLPYKTLQLQCLGAVEDSTRQ-GIPQTRSKDYELLILILSSGILTSIM 2250 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 +TY T FS CY VG +LC + Q + RG + D + ++F ILFP FIS Sbjct: 2251 IDSTYGTIFSYICYLVG---NLCNQCQQALVSGRGTNNNE-DNENQLLLFTRILFPNFIS 2306 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFLV++FMH++ SLSL N+A ASL RYL+ Q+ Q E F C Sbjct: 2307 ELVKADQHILAGFLVTKFMHSNESLSLFNIAGASLNRYLKMQLHMLQVNE--FPVEKTCK 2364 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNV 1333 L+NTV LR KL SL+QS L LS +V Sbjct: 2365 TLKNTVGRLRGKLSSLIQSILPMLSASV 2392 >ref|XP_007136472.1| hypothetical protein PHAVU_009G048100g [Phaseolus vulgaris] gi|561009559|gb|ESW08466.1| hypothetical protein PHAVU_009G048100g [Phaseolus vulgaris] Length = 2399 Score = 411 bits (1057), Expect = e-112 Identities = 230/448 (51%), Positives = 293/448 (65%), Gaps = 4/448 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N+QL S VRV+ LELMQ I+G+N+RG E+L+NVQPWE W+EL SE Sbjct: 1958 RMVQFSDNLQLPSSVRVFVLELMQFISGKNIRGFSTEILANVQPWEEWNELIYAGRKSET 2017 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 +++ P D+SS T+TL+AL+S++LA+ ISPSIEITPDDL+ D+AVSCF+ L G A Sbjct: 2018 DVDKSLPAHKDSSSRVTNTLIALKSSQLAAPISPSIEITPDDLLNADTAVSCFMGLCGEA 2077 Query: 359 
HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEK- 535 + HFD+L AILEEW GLFT+G++ E +A+D GN+W +DDWDEGWES + EK Sbjct: 2078 SEDIHFDALLAILEEWDGLFTAGKDGEPVAEATDGGNDWNNDDWDEGWESLEGVDNPEKE 2137 Query: 536 --GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 SV +HPLH CW EI +K I+LSRF D+L+LID+S K N++LLDE+DA SL Q+ Sbjct: 2138 KIEDSVFVHPLHVCWAEIFRKFISLSRFTDVLRLIDQSSLKPNAMLLDEDDACSLIQMAF 2197 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 IDC ALKM LLLPY+ +QLQCL AVE +Q G+ S S D EL Sbjct: 2198 SIDCFLALKMALLLPYKKLQLQCLGAVEDSTRQ-GIPQSRSKDYELLILILSSGILSSII 2256 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 +TY T FS CY VG+ S+ Q+ L S + D + ++F ILFP FIS Sbjct: 2257 TDSTYGTIFSYICYLVGNLSNQYQQ----ALVSGRGIHNNEDHENQLLLFTRILFPNFIS 2312 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFLV++FMH++ SLSLIN+AEASL RYLE Q+Q Q E F C Sbjct: 2313 ELVRADQHILAGFLVTKFMHSNESLSLINIAEASLNRYLEMQLQMLQISE--FPVEKTCK 2370 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNV 1333 L+NTV LR KL S +QS L LS V Sbjct: 2371 TLKNTVGRLRGKLSSFIQSILPLLSARV 2398 >ref|XP_006578887.1| PREDICTED: neuroblastoma-amplified sequence-like [Glycine max] Length = 2392 Score = 410 bits (1054), Expect = e-112 Identities = 226/448 (50%), Positives = 296/448 (66%), Gaps = 4/448 (0%) Frame = +2 Query: 2 RLSTFSHNMQLQSHVRVYALELMQSITGRNLRGLPAELLSNVQPWEGWDELDCTSAGSEI 181 R+ FS N+QL S VRV+ LELMQ I+G+N++G E+L+NVQPWE W+EL S SE Sbjct: 1951 RMVQFSDNLQLPSSVRVFVLELMQFISGKNIKGFSTEILANVQPWEEWNELIYASRKSET 2010 Query: 182 -ANQATPNQTDASSGFTSTLVALRSTRLASVISPSIEITPDDLMTIDSAVSCFLNLSGAA 358 ++ P+ D+SS T+TLVAL+S++L + ISPSIEIT DDL+ D+AVSCF+ L G A Sbjct: 2011 DVDKQLPDHKDSSSRVTNTLVALKSSQLVASISPSIEITLDDLLNADTAVSCFMRLCGEA 2070 Query: 359 HSEQHFDSLQAILEEWQGLFTSGREEEDPGKASDAGNNWGSDDWDEGWESFQEEQPVEK- 535 + H D+L AILEEW GLFT+G++EE + SD GN+W +DDWDEGWES +E EK Sbjct: 2071 TEDLHLDALLAILEEWDGLFTAGKDEETTVETSDGGNDWNNDDWDEGWESLEEVDNPEKE 2130 Query: 536 
--GGSVSIHPLHTCWMEIIKKLIALSRFADLLKLIDRSLSKSNSVLLDENDARSLSQLVI 709 V +HPLH CW EI +K I+LSRF D+L+LID+S K N++LLDENDA SL+++ + Sbjct: 2131 KIEDPVFVHPLHLCWAEIFRKFISLSRFTDVLRLIDQSSLKPNAMLLDENDAISLTRIAL 2190 Query: 710 GIDCITALKMMLLLPYQTIQLQCLDAVESKLKQGGLSSSISGDRELFXXXXXXXXXXXXX 889 GIDC ALKM LLLPY+T++LQCL AVE +Q G+ + S D EL Sbjct: 2191 GIDCFLALKMALLLPYKTLRLQCLGAVEDSTRQ-GIPQTRSKDYELLILILSSGILTSII 2249 Query: 890 XKTTYSTTFSCFCYSVGHFSHLCQEIQLYELKSRGKEESRTDEDGFFIVFRTILFPCFIS 1069 +TY T FS CY VG+ S+ CQ+ L S + D + ++F ILFP FIS Sbjct: 2250 TDSTYGTIFSYICYLVGNLSNQCQQ----ALVSGRGTNNNEDHENQLLLFTRILFPNFIS 2305 Query: 1070 ELVNAKQLLLAGFLVSRFMHTHASLSLINVAEASLTRYLEGQIQKQQGYEPSFGKMGMCT 1249 ELV A Q +LAGFLV++FMH++ SLSL+N+A ASL RYLE Q+ Q E F C Sbjct: 2306 ELVKADQHILAGFLVTKFMHSNESLSLVNIAGASLNRYLEMQLHILQVKE--FPVEKTCK 2363 Query: 1250 YLENTVFCLRDKLGSLVQSALSTLSNNV 1333 L+NTV +R +L SL+QS L LS +V Sbjct: 2364 TLKNTVGRMRGQLSSLIQSILPLLSASV 2391