BLASTX nr result
ID: Akebia23_contig00014502
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00014502 (1865 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits) E Value

ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260... 452 e-124
emb|CBI16185.3| unnamed protein product [Vitis vinifera] 451 e-124
ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302... 446 e-122
ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Popu... 413 e-112
gb|EYU36256.1| hypothetical protein MIMGU_mgv1a006660mg [Mimulus... 399 e-108
ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma... 367 8e-99
ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma... 365 3e-98
ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802... 365 5e-98
ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819... 362 2e-97
ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma... 351 7e-94
ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819... 347 1e-92
ref|XP_006350905.1| PREDICTED: uncharacterized protein LOC102583... 343 1e-91
ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251... 341 6e-91
ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana] ... 337 1e-89
gb|AAV66096.1| At5g59830 [Arabidopsis thaliana] 337 1e-89
dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana] 335 3e-89
ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Caps... 335 5e-89
ref|XP_007205145.1| hypothetical protein PRUPE_ppa005281mg [Prun...
330 2e-87 dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana] 324 1e-85 ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arab... 322 5e-85 >ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera] Length = 486 Score = 452 bits (1162), Expect = e-124 Identities = 243/454 (53%), Positives = 301/454 (66%), Gaps = 58/454 (12%) Frame = +1 Query: 256 SFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 435 SFQ+KGFWM KG +G L+DG+ DN SRIEPKR+HQWF D EP LFPNKKQAV +++S Sbjct: 37 SFQNKGFWMPKG-AGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSS 95 Query: 436 RQISGVPNVN-LPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 612 + SG+ N + PWEN S+F SV QF DRLFG E +R ++F RN + T R Sbjct: 96 KSTSGISNAHGSPWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SR 153 Query: 613 GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQV-------------------- 732 I+EQFGND+S+ LS+S+ +ED +CL+YGGIRKVKVNQV Sbjct: 154 DIDEQFGNDSSVGLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIH 213 Query: 733 -------------------------KDSDNAMSM-----------PMSHSFNKRDGTTIS 804 K+ +N M PM H +NK D TIS Sbjct: 214 SNIPTVQDYDRGSDTSFMSIGAAYYKEDENDKLMGHTYNTGDHDIPMGHPYNKGDANTIS 273 Query: 805 FGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNAN-VLENATPLAIVTN 981 FG + + + P R +++Y L QSS+Q S++ E+EL NAN L +A + Sbjct: 274 FGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESERELDASNANGTLSSAQLAKLRPE 331 Query: 982 KTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCS 1161 K+K E KMS K PN+FPSNVR+L+STG+LDGVPVKY+S S EE G+IKGSGYLC Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREELHGIIKGSGYLCG 391 Query: 1162 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTV 1341 CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LLF+AIQTV Sbjct: 392 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 451 Query: 1342 TGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 TG PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 452 TGSPINQKSFRIWKESFQAATRELKRIYGKEELN 485 >emb|CBI16185.3| unnamed protein product [Vitis vinifera] Length = 416 Score = 
451 bits (1159), Expect = e-124 Identities = 233/416 (56%), Positives = 294/416 (70%), Gaps = 32/416 (7%) Frame = +1 Query: 292 GSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVN-L 468 G+G L+DG+ DN SRIEPKR+HQWF D EP LFPNKKQAV +++S+ SG+ N + Sbjct: 4 GAGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSSKSTSGISNAHGS 63 Query: 469 PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASI 648 PWEN S+F SV QF DRLFG E +R ++F RN + T R I+EQFGND+S+ Sbjct: 64 PWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SRDIDEQFGNDSSV 121 Query: 649 ALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMS------------------------ 756 LS+S+ +ED +CL+YGGIRKVKVNQV++SD++ + Sbjct: 122 GLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIHSNIPTVQDYDRG 181 Query: 757 ------MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEK 918 +PM H +NK D TISFG + + + P R +++Y L QSS+Q S++ E+ Sbjct: 182 SDTNHDIPMGHPYNKGDANTISFGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESER 239 Query: 919 ELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPV 1095 EL NAN L +A + K+K E KMS K PN+FPSNVR+L+STG+LDGVPV Sbjct: 240 ELDASNANGTLSSAQLAKLRPESASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPV 299 Query: 1096 KYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTI 1275 KY+S S EE G+IKGSGYLC CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTI Sbjct: 300 KYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTI 359 Query: 1276 YGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 Y IVQEL+STP++LLF+AIQTVTG PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 360 YQIVQELRSTPESLLFDAIQTVTGSPINQKSFRIWKESFQAATRELKRIYGKEELN 415 >ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302631 [Fragaria vesca subsp. 
vesca] Length = 469 Score = 446 bits (1148), Expect = e-122 Identities = 240/469 (51%), Positives = 300/469 (63%), Gaps = 72/469 (15%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MSFQ+KGFWMAKG +G DG+ N SRIEPKR+HQWF+D+ EP+LFPNKKQAV N Sbjct: 1 MSFQNKGFWMAKG-AGHDNDGDATFGNPSRIEPKRSHQWFVDSAEPQLFPNKKQAVHIPN 59 Query: 433 SRQISGVPNVNLPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 612 S+ +PN N+ WENPS+FQSV QF DRLFGS+ + + +F RN + + + I + Sbjct: 60 SKLSVEMPNENVSWENPSSFQSVPHQFIDRLFGSDTASSTNFSDRNVSPVGSDDWSIRTK 119 Query: 613 GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAM------------- 753 GI++QFG+DA + LS+SH +E+ CL Y GIRK+KVNQVKDSD M Sbjct: 120 GIDDQFGSDAPVNLSISHAIENPEVCLGYAGIRKIKVNQVKDSDIDMHASREHGSSREYN 179 Query: 754 -SMP-----------------------------MSHSFN--------------KRDGTTI 801 ++P M H++N KR+ I Sbjct: 180 INLPTSQAFDRTHETGFISAGQAYDKEHDNVTLMGHAYNKGAAHVRPLGASYGKREENVI 239 Query: 802 SFGD--------------FQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 939 S D F + +MN GR + NY+ L QSS+Q SE+ EKEL NA Sbjct: 240 SMSDGYSKGNANMISFGGFPDEQDMNTMGRAVANYDQLYHQSSVQTSETAHEKELDTTNA 299 Query: 940 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 1116 N ++N +A KSK E K + K PN+FPSNVRSL+STGILDGVPVKY+S + Sbjct: 300 NAVDNTASVAKSKPESASKSKPESKPTKKQAPNSFPSNVRSLISTGILDGVPVKYVSMAR 359 Query: 1117 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 1296 EE RG+IKG+ YLC CQSCN++K LNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL Sbjct: 360 EELRGIIKGASYLCGCQSCNFTKGLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 419 Query: 1297 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 +STP++LLF+ +QTV G PINQKAF WKES+QAATREL+RIYGK+ELN Sbjct: 420 RSTPESLLFDTMQTVFGAPINQKAFLSWKESFQAATRELQRIYGKEELN 468 >ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Populus trichocarpa] gi|550348073|gb|EEE84695.2| hypothetical protein POPTR_0001s24280g [Populus trichocarpa] Length = 400 Score = 413 bits (1062), Expect = e-112 Identities = 221/428 (51%), Positives = 271/428 (63%), Gaps = 
35/428 (8%) Frame = +1 Query: 265 SKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQI 444 +KGFWM+KG DG+ +N R+E KR+HQWF+D TEPELFPNKKQAV+ NS Sbjct: 2 NKGFWMSKG-----TDGDPAFENPPRLESKRSHQWFIDDTEPELFPNKKQAVQTPNSTTT 56 Query: 445 SGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIE 621 SG+P+ N P W N S FQSV QF RLFG+E +R+++F RN T Sbjct: 57 SGIPSANSPSWHNTSGFQSVPNQFIHRLFGAETARSVNFAERNLYPAGTVE--------- 107 Query: 622 EQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSF-------- 777 ++AS A CLNYGGIRKVK+NQVKD D+ + P H F Sbjct: 108 ----SNASEA------------CLNYGGIRKVKINQVKDFDSGVHAPKGHGFTIESDSNN 151 Query: 778 -------------------------NKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQ 882 N D +SFG F + ++ P R L++Y+ Q Sbjct: 152 STGQAFQRESQSSFISTGHAFDKEDNSEDTNLLSFGGFDDAHDIIPVDRPLSSYDHSYDQ 211 Query: 883 SSIQPSESLKEKELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRS 1059 SS++ E++ EKEL A V N T K++ E K + K PN+FPSNVRS Sbjct: 212 SSVRTREAVDEKELRTTTAKAVASNTQATKSRTEPVSKNRPELKTTRKEAPNSFPSNVRS 271 Query: 1060 LLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHP 1239 L+STG+LDGVPVKY+S S EE RG+IKGSGYLC CQSCNYSKVLNAYEFERHAGCKTKHP Sbjct: 272 LISTGMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHP 331 Query: 1240 NNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELER 1419 NNHI+F+NGKTIY IVQEL+STP+++LF+ IQTV G PINQK+FRIWKES+QAATREL+R Sbjct: 332 NNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSFRIWKESFQAATRELQR 391 Query: 1420 IYGKDELN 1443 IYGK+ELN Sbjct: 392 IYGKEELN 399 >gb|EYU36256.1| hypothetical protein MIMGU_mgv1a006660mg [Mimulus guttatus] Length = 436 Score = 399 bits (1024), Expect = e-108 Identities = 229/442 (51%), Positives = 282/442 (63%), Gaps = 50/442 (11%) Frame = +1 Query: 268 KGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQIS 447 K FWM KGG G ++DG+ DNSSRIEPKRA QW LDA+EPELFP+KKQ +EA ++Q S Sbjct: 3 KEFWMLKGG-GHVSDGDAVFDNSSRIEPKRARQWLLDASEPELFPSKKQVLEAPITKQES 61 Query: 448 GV-PNVNLPWENPSNFQSVTG---QFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRG 615 + +L WE+ S FQSV QF DRLFGSE G T 
+ + Sbjct: 62 EILMQSSLSWESSSGFQSVPSAPNQFMDRLFGSETIIPAIAG--------TDGSGVREKV 113 Query: 616 IEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKD------------------- 738 I E+F +++S+ LS+S+ ME+ + ++YGG+RKVKVNQVKD Sbjct: 114 IGEEFEDNSSVGLSISYAMEEQENGVSYGGLRKVKVNQVKDPIEHDIGVSMEQTYHRGGE 173 Query: 739 -------------SDNAMSMPMSHS------------FNKRDGTTI-SFGDFQEGSEMNP 840 NA M S++ F K D I SFG +QE S M Sbjct: 174 ITFESIGQHYGKEGGNATLMGQSYNTGESNITCTGSTFGKGDNNNIISFGGYQEESVMEA 233 Query: 841 SGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIV-TNKTPKSKVEQKMS 1017 R +++Y LL QSS Q SE+ +KE+ PN+ T + T K K + K S Sbjct: 234 LARPVSSYSLLYEQSSAQTSETPTKKEVGAPNSGATVGTTQAPKPKVDSTSKIKSDTKPS 293 Query: 1018 NKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNA 1197 K PN+FPSNVRSL++TG+LDGVPVKY+S S EE RG+IKGSGYLC CQSCNYSK LNA Sbjct: 294 RKEAPNSFPSNVRSLIATGMLDGVPVKYVSVSREELRGIIKGSGYLCGCQSCNYSKALNA 353 Query: 1198 YEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRI 1377 YEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+ST +++LF+AIQTVTG PINQKAFR Sbjct: 354 YEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTSESMLFDAIQTVTGSPINQKAFRT 413 Query: 1378 WKESYQAATRELERIYGKDELN 1443 WKES+QAATREL+RIYGK+ELN Sbjct: 414 WKESFQAATRELQRIYGKEELN 435 >ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508786875|gb|EOY34131.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 367 bits (943), Expect = 8e-99 Identities = 217/470 (46%), Positives = 275/470 (58%), Gaps = 73/470 (15%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MSFQ+K FWMAKG + ++DG+ DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N Sbjct: 1 MSFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPN 58 Query: 433 SRQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL------------- 525 ++ SG+ N+N+ PWEN S+FQSV Q FT+R Sbjct: 59 NKSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKA 118 Query: 526 ----FGSE-------------PSRTIDFGGRNFQSINT-------------------GNL 597 FG + P ++GG +N N Sbjct: 119 
IEDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNS 178 Query: 598 DIGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMS 756 D+ IE + S +SM H+ + +G N G + + Sbjct: 179 DMTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIP 236 Query: 757 MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPN 936 + M ++ K D +SFG F E E+ P GR L+++E SS SE EK+L Sbjct: 237 ISMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDAST 296 Query: 937 ANVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWS 1113 A V+ + T + ++K E K S K PN+FPSNVRSL+STG+LDGVPVKYIS S Sbjct: 297 AVVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLS 356 Query: 1114 HEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQE 1293 EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQE Sbjct: 357 REELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQE 416 Query: 1294 LKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 L+STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 417 LRSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 466 >ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590589665|ref|XP_007016515.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786876|gb|EOY34132.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786878|gb|EOY34134.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 489 Score = 365 bits (938), Expect = 3e-98 Identities = 216/469 (46%), Positives = 274/469 (58%), Gaps = 73/469 (15%) Frame = +1 Query: 256 SFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 435 SFQ+K FWMAKG + ++DG+ DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N+ Sbjct: 24 SFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNN 81 Query: 436 RQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL-------------- 525 + SG+ N+N+ PWEN S+FQSV Q FT+R Sbjct: 82 KSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAI 141 Query: 526 ---FGSE-------------PSRTIDFGGRNFQSINT-------------------GNLD 600 FG + P ++GG +N N D Sbjct: 142 
EDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSD 201 Query: 601 IGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSM 759 + IE + S +SM H+ + +G N G + + + Sbjct: 202 MTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPI 259 Query: 760 PMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 939 M ++ K D +SFG F E E+ P GR L+++E SS SE EK+L A Sbjct: 260 SMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTA 319 Query: 940 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 1116 V+ + T + ++K E K S K PN+FPSNVRSL+STG+LDGVPVKYIS S Sbjct: 320 VVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSR 379 Query: 1117 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 1296 EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL Sbjct: 380 EELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 439 Query: 1297 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 +STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 440 RSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 488 >ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max] Length = 463 Score = 365 bits (936), Expect = 5e-98 Identities = 216/465 (46%), Positives = 273/465 (58%), Gaps = 68/465 (14%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS Q+KGFWM KG SG + D + DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ Sbjct: 1 MSLQNKGFWMVKG-SGHINDRDTVFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59 Query: 433 SRQISGVPNVNLP-WENPSNFQSVTGQ--------------FTDR--------------- 522 + G NVN+P WEN NF SV Q FT++ Sbjct: 60 EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTYVLADDSNVRSKM 119 Query: 523 ---LFGSEPS-------------RTIDFGGRNFQSINT-----------------GNLDI 603 +G E S ++FGG +N N D+ Sbjct: 120 VTNQYGDEASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEVDVQALEGHNFGRQSNGDL 179 Query: 604 GRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGG----IRKVKVNQVKDSDNAMSMPMSH 771 + E ASI + + L Y +R + VK D+ +S+ S Sbjct: 180 
HQAYNREVETRSASIGQAFDKDRDATLMGLTYSRGDAHVRSFGASFVKGDDSIVSI--SE 237 Query: 772 SFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLE 951 S+NK D ISFG F + ++ GR Y+ L QSS+ S + EKEL +++ + Sbjct: 238 SYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHVSTTAHEKELDVSSSDAVA 297 Query: 952 NATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFR 1128 + +A V ++T K+K E K + K PN+FPSNVRSL+STGILDGVPVKY+S S EE R Sbjct: 298 STLQVAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSREELR 357 Query: 1129 GVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTP 1308 G+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP Sbjct: 358 GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTP 417 Query: 1309 QNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 ++LLF+ IQTV G PINQKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 418 ESLLFDTIQTVFGAPINQKAFRNWKESFQAATRELQRIYGKEELN 462 >ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819317 isoform X1 [Glycine max] Length = 464 Score = 362 bits (930), Expect = 2e-97 Identities = 219/473 (46%), Positives = 274/473 (57%), Gaps = 76/473 (16%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS Q+KGFWM KG SG + D E DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ Sbjct: 1 MSLQNKGFWMVKG-SGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59 Query: 433 SRQISGVPNVNLP-WENPSNFQSV------------------------------------ 501 + G NVN+P WEN NF SV Sbjct: 60 EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSK 119 Query: 502 --TGQF-TDRLFGSEPSRTID-------FGG-------------------RNFQSINTGN 594 T Q+ D FG S +I+ FGG NF N GN Sbjct: 120 MITNQYGDDASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGN 179 Query: 595 L--------DIGRRGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDN 747 L + I + F D +L ++++ D +R VK D+ Sbjct: 180 LHQAYNREVETRSASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDS 232 Query: 748 AMSMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELV 927 +S+ S S+NK D ISFG F + ++ GR Y+ L QSS+ S + EKEL Sbjct: 233 
IVSI--SESYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELD 290 Query: 928 DPNANVLENATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYI 1104 +++ + + +A V ++T K+K E K + PN+FPSNVRSL+STGILDGVPVKYI Sbjct: 291 VSSSDAVASTLQVAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYI 350 Query: 1105 SWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGI 1284 S S EE RG+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY I Sbjct: 351 SVSREELRGIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQI 410 Query: 1285 VQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 VQEL+STP++LLF+ IQTV G PI+QKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 411 VQELRSTPESLLFDTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGKEELN 463 >ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508786879|gb|EOY34135.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 458 Score = 351 bits (900), Expect = 7e-94 Identities = 210/461 (45%), Positives = 267/461 (57%), Gaps = 73/461 (15%) Frame = +1 Query: 280 MAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPN 459 MAKG + ++DG+ DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N++ SG+ N Sbjct: 1 MAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNNKSSSGISN 58 Query: 460 VNL-PWENPSNFQSVTGQ---------------FTDRL-----------------FGSE- 537 +N+ PWEN S+FQSV Q FT+R FG + Sbjct: 59 LNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAIEDHFGEDA 118 Query: 538 ------------PSRTIDFGGRNFQSINT-------------------GNLDIGRRGIEE 624 P ++GG +N N D+ IE Sbjct: 119 SVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSDMTT--IEA 176 Query: 625 QFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNK 783 + S +SM H+ + +G N G + + + M ++ K Sbjct: 177 YDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPISMGDTYGK 236 Query: 784 RDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATP 963 D +SFG F E E+ P GR L+++E SS SE EK+L A V+ + T Sbjct: 237 EDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTAVVVASTTR 296 Query: 964 LA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIK 1140 + ++K E K S K 
PN+FPSNVRSL+STG+LDGVPVKYIS S EE RGVIK Sbjct: 297 TPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSREELRGVIK 356 Query: 1141 GSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLL 1320 GSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LL Sbjct: 357 GSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLL 416 Query: 1321 FEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 F+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 417 FDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 457 >ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819317 isoform X2 [Glycine max] Length = 455 Score = 347 bits (889), Expect = 1e-92 Identities = 210/460 (45%), Positives = 264/460 (57%), Gaps = 76/460 (16%) Frame = +1 Query: 292 GSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP 471 GSG + D E DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ + G NVN+P Sbjct: 4 GSGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDADEKSSPGFSNVNIP 63 Query: 472 -WENPSNFQSV--------------------------------------TGQF-TDRLFG 531 WEN NF SV T Q+ D FG Sbjct: 64 PWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSKMITNQYGDDASFG 123 Query: 532 SEPSRTID-------FGG-------------------RNFQSINTGNL--------DIGR 609 S +I+ FGG NF N GNL + Sbjct: 124 LSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGNLHQAYNREVETRS 183 Query: 610 RGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 786 I + F D +L ++++ D +R VK D+ +S+ S S+NK Sbjct: 184 ASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDSIVSI--SESYNKE 234 Query: 787 DGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPL 966 D ISFG F + ++ GR Y+ L QSS+ S + EKEL +++ + + + Sbjct: 235 DTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELDVSSSDAVASTLQV 294 Query: 967 AIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKG 1143 A V ++T K+K E K + PN+FPSNVRSL+STGILDGVPVKYIS S EE RG+IKG Sbjct: 295 AKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSREELRGIIKG 354 Query: 1144 SGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLF 1323 SGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY 
IVQEL+STP++LLF Sbjct: 355 SGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLF 414 Query: 1324 EAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 + IQTV G PI+QKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 415 DTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGKEELN 454 >ref|XP_006350905.1| PREDICTED: uncharacterized protein LOC102583417 [Solanum tuberosum] Length = 560 Score = 343 bits (881), Expect = 1e-91 Identities = 204/453 (45%), Positives = 267/453 (58%), Gaps = 57/453 (12%) Frame = +1 Query: 256 SFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 435 SF K FW+ K G G L+DGE D+SSRI+ KRAHQ F E ELFPNKKQAV S Sbjct: 113 SFHDKDFWIPKCG-GHLSDGEAVFDSSSRIDVKRAHQLFSSTAEAELFPNKKQAVHTSLG 171 Query: 436 RQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRN-------------- 570 + S + N WE S+ S QF DRLF + +R ++ R+ Sbjct: 172 KSTSEIAVTNSTCWETTSDLPSGANQFIDRLFRVDTTRPVNLTERSTGNSTIRKKVIDDQ 231 Query: 571 --------------------------FQSINTGNLDIGRRGIEEQFGNDASIALSMSHTM 672 +++N ++ N+ ++++S H Sbjct: 232 IGDDPLVGLSMSYTIEEQQICISDSRIRNLNVNQVEDSENAFHSPIENNINMSISQVHNR 291 Query: 673 ---------------EDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKRDGTTISF 807 ED N G I + + V+ S + + P++ S+ + D TI F Sbjct: 292 ASETSFLSMGQAYGKEDESQTYNPGDISRSIRSNVEKSHS--TTPIADSYTRGDSDTI-F 348 Query: 808 GDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIV-TNK 984 G F+ S+++ R ++ Y+ L QSS+ SE +K+L NA ++ ++ + T+ Sbjct: 349 G-FELVSDIDALARPISGYDYLHYQSSVDTSEPHCDKQLDGSNAKAVDISSQTSKPRTDS 407 Query: 985 TPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSC 1164 PK+K E K ++K PN+FPSNVRSLL+TGILDGVPVKY+ S +E RG+IKGSGYLC C Sbjct: 408 LPKTKSESKPAHKGAPNSFPSNVRSLLATGILDGVPVKYVL-SRQELRGIIKGSGYLCGC 466 Query: 1165 QSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVT 1344 Q CNYSKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY I QEL+STPQ+LLFEAIQTVT Sbjct: 467 QPCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQITQELRSTPQSLLFEAIQTVT 526 Query: 1345 GHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 G PINQKAF+IWKES+QAATREL+RIYGK+ELN Sbjct: 527 GSPINQKAFQIWKESFQAATRELQRIYGKEELN 559 
>ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera] Length = 599 Score = 341 bits (875), Expect = 6e-91 Identities = 183/331 (55%), Positives = 228/331 (68%), Gaps = 22/331 (6%) Frame = +1 Query: 526 FGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTME-------DLG 684 +G E + I G Q+ N G+ +I + G D +I SM HT +G Sbjct: 277 YGREDNNFISMG----QAYNKGDENIAMSHTYK--GGDNTI--SMGHTFSKGDNNIISMG 328 Query: 685 SCLNYGGIRKVKVNQV--KDSDNAMSM-----------PMSHSFNKRDGTTISFGDFQEG 825 N G + + + K +N +SM + HS+NK + ISFG F + Sbjct: 329 QTYNKGDDNTISMGHIYNKGDENTISMGHTYKGDNSNLSIGHSYNKGESNIISFGGFHDD 388 Query: 826 SE-MNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIVTNKT-PKSK 999 + NPSGRL+ +Y+LLMGQ S+Q SE+L EK+LV+ NA+ L + + ++T K K Sbjct: 389 DDDTNPSGRLVCSYDLLMGQPSVQRSEALNEKKLVESNADALISTAQITASGSETVSKKK 448 Query: 1000 VEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNY 1179 EQK+S KVPPNNFPSNVRSLLSTG+LDGVPVKYI+WS EE RG+IKGSGYLC CQSCN+ Sbjct: 449 EEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNF 508 Query: 1180 SKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPIN 1359 SKV+NAYEFERHAGCKTKHPNNHI+F+NGKTIYGIVQELKSTPQN LF+ IQT+TG PIN Sbjct: 509 SKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPIN 568 Query: 1360 QKAFRIWKESYQAATRELERIYGKDELNQLS 1452 QK+FR+WKES+ AATREL+RIYGK+E QLS Sbjct: 569 QKSFRLWKESFLAATRELQRIYGKEEGKQLS 599 Score = 224 bits (571), Expect = 1e-55 Identities = 110/186 (59%), Positives = 139/186 (74%), Gaps = 1/186 (0%) Frame = +1 Query: 250 KMSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEAS 429 +MSFQ+KGFWMAKG GC+ DGEM DN SRIEPKR+HQWF+D TE ELFPNKKQAVE Sbjct: 61 RMSFQNKGFWMAKG-VGCVTDGEMAYDNPSRIEPKRSHQWFMDGTE-ELFPNKKQAVEVP 118 Query: 430 NSRQISGVPNVNL-PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIG 606 NS G+ N N+ PW N S F SV+G FT+RLF E +RT++F RN S+ GN+++ Sbjct: 119 NSNLFPGLSNPNVSPWANASGFHSVSGHFTERLFDPEAARTVNFDDRNIPSVGAGNMNMA 178 Query: 607 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 786 R+ IE+ FGN++ 
LSMSH++ED S LNYGGIRKVKV+QVKDS+N MS+ M H++ + Sbjct: 179 RKVIEDPFGNESLFGLSMSHSLEDPRSGLNYGGIRKVKVSQVKDSENIMSVSMGHTYTRA 238 Query: 787 DGTTIS 804 D T+S Sbjct: 239 DNNTMS 244 >ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana] gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana] gi|332009855|gb|AED97238.1| uncharacterized protein AT5G59830 [Arabidopsis thaliana] gi|332009856|gb|AED97239.1| uncharacterized protein AT5G59830 [Arabidopsis thaliana] Length = 425 Score = 337 bits (863), Expect = 1e-89 Identities = 192/428 (44%), Positives = 262/428 (61%), Gaps = 33/428 (7%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS++SKGFW+ K + + D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHTSEEDSV-YDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 433 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 606 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 607 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 741 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 742 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 879 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234 Query: 880 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1053 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 1054 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1233 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 1234 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1413 HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 
HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 1414 ERIYGKDE 1437 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >gb|AAV66096.1| At5g59830 [Arabidopsis thaliana] Length = 425 Score = 337 bits (863), Expect = 1e-89 Identities = 192/428 (44%), Positives = 262/428 (61%), Gaps = 33/428 (7%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS++SKGFW+ K + + D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHTSEEDSV-YDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 433 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 606 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 607 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 741 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 742 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 879 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSWENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSASNVVGNYQSYV- 234 Query: 880 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1053 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 1054 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1233 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 1234 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1413 HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 1414 ERIYGKDE 1437 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana] Length = 425 Score = 335 bits (860), Expect = 3e-89 Identities = 191/428 (44%), Positives = 262/428 (61%), Gaps = 33/428 (7%) Frame = +1 Query: 253 
MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS++SKGFW+ K + + D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHTSEEDSV-YDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 433 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 606 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 607 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 741 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 742 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 879 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234 Query: 880 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1053 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 1054 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1233 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 1234 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1413 HPNNHI+F+NG+TIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 HPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 1414 ERIYGKDE 1437 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Capsella rubella] gi|482549222|gb|EOA13416.1| hypothetical protein CARUB_v10026471mg [Capsella rubella] Length = 422 Score = 335 bits (858), Expect = 5e-89 Identities = 194/428 (45%), Positives = 260/428 (60%), Gaps = 33/428 (7%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 432 MS++SKGFW+ K + + D+S+R + KR H WF D++ ++FPNKKQAV+ Sbjct: 1 MSYESKGFWVLKNNEHTSEEDSV-YDHSTRDDSKRPHPWFADSSRSDMFPNKKQAVQDPV 59 Query: 433 
SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 606 G ++ LP WE+ S FQSV+ QF DRL G+E PSR + FG R+ G Sbjct: 60 GGL--GKSSLGLPLWESSSVFQSVSNQFMDRLLGAEMPSRPLLFGDRDRTE---GCSHHQ 114 Query: 607 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHS---- 774 + I E F + S+ LS+S+ +E GSC GIRK+ V++VK++ + + HS Sbjct: 115 NKSIAESFMENTSVELSISNGVEVAGSCFGGDGIRKLPVSRVKETMSTHAALDGHSQRKI 174 Query: 775 -------------------------FNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 879 + D I+FG+ + + S NY+ + Sbjct: 175 ESSSIQACSRENESSFINFALAGHPYGNEDSHGITFGEINDEHGVGSSSN--GNYQSYV- 231 Query: 880 QSSIQPSESL--KEKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 1053 Q I+ S+ + +E ++ V+ PK+K E K S K +FPSNV Sbjct: 232 QDPIETSDMVYGQETGCSQTSSRVVSEQQMAKPSLETPPKNKAEAKTSKKEASTSFPSNV 291 Query: 1054 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 1233 RSL+STG+LDGVPVKYIS S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 292 RSLISTGMLDGVPVKYISLSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 351 Query: 1234 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 1413 HPNNHI+F+NGKTIY IVQEL++T +++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 352 HPNNHIYFENGKTIYQIVQELRNTQESMLFDVIQTVFGSPINQKAFRIWKESFQAATREL 411 Query: 1414 ERIYGKDE 1437 +RIYGK+E Sbjct: 412 QRIYGKEE 419 >ref|XP_007205145.1| hypothetical protein PRUPE_ppa005281mg [Prunus persica] gi|462400787|gb|EMJ06344.1| hypothetical protein PRUPE_ppa005281mg [Prunus persica] Length = 469 Score = 330 bits (845), Expect = 2e-87 Identities = 206/471 (43%), Positives = 261/471 (55%), Gaps = 74/471 (15%) Frame = +1 Query: 253 MSFQSKGFWMAKGGSGCLADGEMGCDNSSRIEPKRAHQWFLDATEPELF---------PN 405 MSFQ+KGFWM KG +G + DG+ N SRIEPKR HQWF+DA EPELF PN Sbjct: 1 MSFQNKGFWMPKG-AGLVNDGDATYGNPSRIEPKRPHQWFVDAAEPELFPNKKQAVHIPN 59 Query: 406 KKQAVEAS--------NSRQISGVPN--------------VNLPWENPSNFQSVTGQFT- 516 K S N+ VP+ VN N S S Sbjct: 60 SKLGSGMSNENVSSWENASSFQSVPHQFIDRLFGSDTASSVNFAERNISPVGSDNWNIRK 119 Query: 517 --DRLFGSE-------------PSRTIDFGGRNFQSIN-TGNLDIGRRGIEEQFGNDASI 648 D FG + P +++ G 
+N + D G E N S Sbjct: 120 GIDDQFGEDSPVSLSVSHAMEDPETCLNYAGIRKVKVNQVRDSDNGMHASREHGSNRGSN 179 Query: 649 A-LSMSHTMED----------------------LGSCLNYGG--IRKVKVNQVKDSDNAM 753 + LS S + +G N+G +R + N K +NA+ Sbjct: 180 SNLSSSQAFDRVNETAFLSVGQAYDKEHGSVTLIGHPYNHGDAHVRPIDTNYGKGDENAI 239 Query: 754 SMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDP 933 S+ + +K + ISFG F + ++ P GR + NY+ L S+Q E+ EK+L Sbjct: 240 SV--GDNCSKGNANMISFGGFPDEQDIIPIGRPVGNYDQLYHPDSVQTLETSYEKDLDAS 297 Query: 934 NANVLENATPLAIVT-NKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISW 1110 NA+ ++N LA K+K E K S K PN+FPSNVRSL+STG+LDGVPVKY+S Sbjct: 298 NASAVDNTASLAKPRLESVSKNKPEIKPSRKPAPNSFPSNVRSLISTGMLDGVPVKYVSL 357 Query: 1111 SHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQ 1290 + EE RG+IKG GYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQ Sbjct: 358 AREELRGIIKGVGYLCGCQSCNYAKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQ 417 Query: 1291 ELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 1443 EL+STP++LLF+ +QTV G PINQK+F WKES+QAATREL+RIYGK+ELN Sbjct: 418 ELRSTPESLLFDTLQTVFGAPINQKSFHSWKESFQAATRELQRIYGKEELN 468 >dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana] Length = 415 Score = 324 bits (830), Expect = 1e-85 Identities = 184/403 (45%), Positives = 249/403 (61%), Gaps = 33/403 (8%) Frame = +1 Query: 328 DNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENPSNFQSVT 504 D+S+R + KR H WF+D++ E+FPNKKQAV+ + G NV LP WE+ S FQSV+ Sbjct: 15 DHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--DPVVGLGKSNVGLPLWESSSVFQSVS 72 Query: 505 GQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTMEDL 681 QF DRL G+E P R + FG R+ + + + I E + D S+ LS+S+ +E Sbjct: 73 NQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ--NKSIAESYMEDTSVELSISNGVEVA 130 Query: 682 GSCLNYGGIRKVKVNQVKDS-------------------------DNAMSMP----MSHS 774 G C G RK+ V++VK++ +N S H Sbjct: 131 GGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKIESSSIQACSRENESSYINFALAGHP 190 Query: 775 FNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKEL--VDPNANVL 948 + D I+FG+ + + + ++ NY+ + Q I + + ++E ++ V+ Sbjct: 191 
YGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV-QDPIGTLDIVYDQETGSSQTSSGVV 249 Query: 949 ENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFR 1128 PK+K E K S K +FPSNVRSL+STG+LDGVPVKY+S S EE R Sbjct: 250 SEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSREELR 309 Query: 1129 GVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTP 1308 GVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTKHPNNHI+F+NGKTIY IVQEL++TP Sbjct: 310 GVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIVQELRNTP 369 Query: 1309 QNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDE 1437 +++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK+E Sbjct: 370 ESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGKEE 412 >ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp. lyrata] gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp. lyrata] Length = 415 Score = 322 bits (824), Expect = 5e-85 Identities = 188/415 (45%), Positives = 248/415 (59%), Gaps = 45/415 (10%) Frame = +1 Query: 328 DNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENPSNFQSVT 504 D S+R + KR H WF+D++ E+FPNKKQAV+ G NV LP WE+ S FQSV+ Sbjct: 15 DQSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQDPVGGL--GKSNVGLPLWESSSVFQSVS 72 Query: 505 GQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTMEDL 681 QF DRL G+E P R + FG R+ + + + I E + D S+ LS+S+ +E Sbjct: 73 NQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQT--KSIAESYMEDTSVELSISNGVEVA 130 Query: 682 GSCLNYGGIRKVKVNQVK---------DSDNAMSMPMS--------------------HS 774 GS GIRK+ V++VK D N + S H Sbjct: 131 GSSFGGDGIRKLPVSRVKETMSTHVALDGHNQRKIESSSIQACSRENESSFINFALAGHP 190 Query: 775 FNKRDGTTISFGDFQEGSEMNPSGRLLTNYE-----------LLMGQ---SSIQPSESLK 912 + D I+FG+ + + + ++ NY+ ++ GQ SS S + Sbjct: 191 YGNEDSHGITFGEINDEHGVGSTSNVVGNYQSYVQDPIGTLDIVYGQETGSSQTSSGVVS 250 Query: 913 EKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVP 1092 E+++ P+ + PK+K E K S K +FPSNVRSL+STG+LDGVP Sbjct: 251 EQQVAKPSLEPV-------------PKNKAETKSSKKEASTSFPSNVRSLISTGMLDGVP 297 Query: 1093 
VKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKT 1272 V Y+S S EE RGVIKGSGYLC CQ+C ++KVLNAY FERHAGCKTKHPNNHI+F+NGKT Sbjct: 298 VTYVSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKHPNNHIYFENGKT 357 Query: 1273 IYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDE 1437 IY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK+E Sbjct: 358 IYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGKEE 412