BLASTX nr result
ID: Akebia25_contig00004151
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00004151 (1553 letters)

Database: ./nr
          38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260...   453   e-125
emb|CBI16185.3| unnamed protein product [Vitis vinifera]              452   e-124
ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302...   448   e-123
ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Popu...   415   e-113
gb|EYU36256.1| hypothetical protein MIMGU_mgv1a006660mg [Mimulus...   400   e-109
ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma...   369   2e-99
ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma...   367   8e-99
ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802...   366   1e-98
ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819...   361   4e-97
ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma...   352   2e-94
ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819...   345   2e-92
ref|XP_006350905.1| PREDICTED: uncharacterized protein LOC102583...   342   2e-91
ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251...   341   5e-91
ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana] ...   338   4e-90
gb|AAV66096.1| At5g59830 [Arabidopsis thaliana]                       338   4e-90
dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana]           337   9e-90
ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Caps...   336   1e-89
ref|XP_007205145.1| hypothetical protein PRUPE_ppa005281mg [Prun...
331 5e-88 dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana] 324 6e-86 ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arab... 322 2e-85 >ref|XP_002281403.1| PREDICTED: uncharacterized protein LOC100260456 [Vitis vinifera] Length = 486 Score = 453 bits (1166), Expect = e-125 Identities = 244/454 (53%), Positives = 301/454 (66%), Gaps = 58/454 (12%) Frame = -1 Query: 1523 SFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 1344 SFQ+KGFWM KG +G L+DGD DN SRIEPKR+HQWF D EP LFPNKKQAV +++S Sbjct: 37 SFQNKGFWMPKG-AGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSS 95 Query: 1343 RQISGVPNVN-LPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 1167 + SG+ N + PWEN S+F SV QF DRLFG E +R ++F RN + T R Sbjct: 96 KSTSGISNAHGSPWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SR 153 Query: 1166 GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQV-------------------- 1047 I+EQFGND+S+ LS+S+ +ED +CL+YGGIRKVKVNQV Sbjct: 154 DIDEQFGNDSSVGLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIH 213 Query: 1046 -------------------------KDSDNAMSM-----------PMSHSFNKRDGTTIS 975 K+ +N M PM H +NK D TIS Sbjct: 214 SNIPTVQDYDRGSDTSFMSIGAAYYKEDENDKLMGHTYNTGDHDIPMGHPYNKGDANTIS 273 Query: 974 FGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNAN-VLENATPLAIVTN 798 FG + + + P R +++Y L QSS+Q S++ E+EL NAN L +A + Sbjct: 274 FGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESERELDASNANGTLSSAQLAKLRPE 331 Query: 797 KTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCS 618 K+K E KMS K PN+FPSNVR+L+STG+LDGVPVKY+S S EE G+IKGSGYLC Sbjct: 332 SASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPVKYVSLSREELHGIIKGSGYLCG 391 Query: 617 CQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTV 438 CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LLF+AIQTV Sbjct: 392 CQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLFDAIQTV 451 Query: 437 TGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 TG PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 452 TGSPINQKSFRIWKESFQAATRELKRIYGKEELN 485 >emb|CBI16185.3| unnamed protein product [Vitis vinifera] Length = 416 Score = 
452 bits (1163), Expect = e-124 Identities = 234/416 (56%), Positives = 294/416 (70%), Gaps = 32/416 (7%) Frame = -1 Query: 1487 GSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVN-L 1311 G+G L+DGD DN SRIEPKR+HQWF D EP LFPNKKQAV +++S+ SG+ N + Sbjct: 4 GAGHLSDGDTTFDNPSRIEPKRSHQWFADIAEPGLFPNKKQAVHSTSSKSTSGISNAHGS 63 Query: 1310 PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASI 1131 PWEN S+F SV QF DRLFG E +R ++F RN + T R I+EQFGND+S+ Sbjct: 64 PWENTSSFHSVPNQFIDRLFGPETARPVNFTERNISPVGTDGSR--SRDIDEQFGNDSSV 121 Query: 1130 ALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMS------------------------ 1023 LS+S+ +ED +CL+YGGIRKVKVNQV++SD++ + Sbjct: 122 GLSISNAIEDPETCLSYGGIRKVKVNQVRESDSSENASKGHSYDREIHSNIPTVQDYDRG 181 Query: 1022 ------MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEK 861 +PM H +NK D TISFG + + + P R +++Y L QSS+Q S++ E+ Sbjct: 182 SDTNHDIPMGHPYNKGDANTISFGSYHDEPDNIPFARPISSYGLY--QSSVQISDTESER 239 Query: 860 ELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPV 684 EL NAN L +A + K+K E KMS K PN+FPSNVR+L+STG+LDGVPV Sbjct: 240 ELDASNANGTLSSAQLAKLRPESASKNKSEFKMSKKEAPNSFPSNVRTLISTGMLDGVPV 299 Query: 683 KYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTI 504 KY+S S EE G+IKGSGYLC CQSCN++KVLNAYEFERHAGCKTKHPNNHI+F+NGKTI Sbjct: 300 KYVSLSREELHGIIKGSGYLCGCQSCNFNKVLNAYEFERHAGCKTKHPNNHIYFENGKTI 359 Query: 503 YGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 Y IVQEL+STP++LLF+AIQTVTG PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 360 YQIVQELRSTPESLLFDAIQTVTGSPINQKSFRIWKESFQAATRELKRIYGKEELN 415 >ref|XP_004294602.1| PREDICTED: uncharacterized protein LOC101302631 [Fragaria vesca subsp. 
vesca] Length = 469 Score = 448 bits (1152), Expect = e-123 Identities = 241/469 (51%), Positives = 300/469 (63%), Gaps = 72/469 (15%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MSFQ+KGFWMAKG +G DGD N SRIEPKR+HQWF+D+ EP+LFPNKKQAV N Sbjct: 1 MSFQNKGFWMAKG-AGHDNDGDATFGNPSRIEPKRSHQWFVDSAEPQLFPNKKQAVHIPN 59 Query: 1346 SRQISGVPNVNLPWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRR 1167 S+ +PN N+ WENPS+FQSV QF DRLFGS+ + + +F RN + + + I + Sbjct: 60 SKLSVEMPNENVSWENPSSFQSVPHQFIDRLFGSDTASSTNFSDRNVSPVGSDDWSIRTK 119 Query: 1166 GIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAM------------- 1026 GI++QFG+DA + LS+SH +E+ CL Y GIRK+KVNQVKDSD M Sbjct: 120 GIDDQFGSDAPVNLSISHAIENPEVCLGYAGIRKIKVNQVKDSDIDMHASREHGSSREYN 179 Query: 1025 -SMP-----------------------------MSHSFN--------------KRDGTTI 978 ++P M H++N KR+ I Sbjct: 180 INLPTSQAFDRTHETGFISAGQAYDKEHDNVTLMGHAYNKGAAHVRPLGASYGKREENVI 239 Query: 977 SFGD--------------FQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 840 S D F + +MN GR + NY+ L QSS+Q SE+ EKEL NA Sbjct: 240 SMSDGYSKGNANMISFGGFPDEQDMNTMGRAVANYDQLYHQSSVQTSETAHEKELDTTNA 299 Query: 839 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 663 N ++N +A KSK E K + K PN+FPSNVRSL+STGILDGVPVKY+S + Sbjct: 300 NAVDNTASVAKSKPESASKSKPESKPTKKQAPNSFPSNVRSLISTGILDGVPVKYVSMAR 359 Query: 662 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 483 EE RG+IKG+ YLC CQSCN++K LNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL Sbjct: 360 EELRGIIKGASYLCGCQSCNFTKGLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 419 Query: 482 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 +STP++LLF+ +QTV G PINQKAF WKES+QAATREL+RIYGK+ELN Sbjct: 420 RSTPESLLFDTMQTVFGAPINQKAFLSWKESFQAATRELQRIYGKEELN 468 >ref|XP_002299890.2| hypothetical protein POPTR_0001s24280g [Populus trichocarpa] gi|550348073|gb|EEE84695.2| hypothetical protein POPTR_0001s24280g [Populus trichocarpa] Length = 400 Score = 415 bits (1066), Expect = e-113 Identities = 222/428 (51%), Positives = 271/428 (63%), Gaps = 
35/428 (8%) Frame = -1 Query: 1514 SKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQI 1335 +KGFWM+KG DGD +N R+E KR+HQWF+D TEPELFPNKKQAV+ NS Sbjct: 2 NKGFWMSKG-----TDGDPAFENPPRLESKRSHQWFIDDTEPELFPNKKQAVQTPNSTTT 56 Query: 1334 SGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRGIE 1158 SG+P+ N P W N S FQSV QF RLFG+E +R+++F RN T Sbjct: 57 SGIPSANSPSWHNTSGFQSVPNQFIHRLFGAETARSVNFAERNLYPAGTVE--------- 107 Query: 1157 EQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSF-------- 1002 ++AS A CLNYGGIRKVK+NQVKD D+ + P H F Sbjct: 108 ----SNASEA------------CLNYGGIRKVKINQVKDFDSGVHAPKGHGFTIESDSNN 151 Query: 1001 -------------------------NKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQ 897 N D +SFG F + ++ P R L++Y+ Q Sbjct: 152 STGQAFQRESQSSFISTGHAFDKEDNSEDTNLLSFGGFDDAHDIIPVDRPLSSYDHSYDQ 211 Query: 896 SSIQPSESLKEKELVDPNAN-VLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRS 720 SS++ E++ EKEL A V N T K++ E K + K PN+FPSNVRS Sbjct: 212 SSVRTREAVDEKELRTTTAKAVASNTQATKSRTEPVSKNRPELKTTRKEAPNSFPSNVRS 271 Query: 719 LLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHP 540 L+STG+LDGVPVKY+S S EE RG+IKGSGYLC CQSCNYSKVLNAYEFERHAGCKTKHP Sbjct: 272 LISTGMLDGVPVKYVSLSREELRGIIKGSGYLCGCQSCNYSKVLNAYEFERHAGCKTKHP 331 Query: 539 NNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELER 360 NNHI+F+NGKTIY IVQEL+STP+++LF+ IQTV G PINQK+FRIWKES+QAATREL+R Sbjct: 332 NNHIYFENGKTIYQIVQELRSTPESMLFDVIQTVFGAPINQKSFRIWKESFQAATRELQR 391 Query: 359 IYGKDELN 336 IYGK+ELN Sbjct: 392 IYGKEELN 399 >gb|EYU36256.1| hypothetical protein MIMGU_mgv1a006660mg [Mimulus guttatus] Length = 436 Score = 400 bits (1028), Expect = e-109 Identities = 230/442 (52%), Positives = 282/442 (63%), Gaps = 50/442 (11%) Frame = -1 Query: 1511 KGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQIS 1332 K FWM KGG G ++DGD DNSSRIEPKRA QW LDA+EPELFP+KKQ +EA ++Q S Sbjct: 3 KEFWMLKGG-GHVSDGDAVFDNSSRIEPKRARQWLLDASEPELFPSKKQVLEAPITKQES 61 Query: 1331 GV-PNVNLPWENPSNFQSVTG---QFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIGRRG 1164 + +L WE+ S FQSV QF DRLFGSE 
G T + + Sbjct: 62 EILMQSSLSWESSSGFQSVPSAPNQFMDRLFGSETIIPAIAG--------TDGSGVREKV 113 Query: 1163 IEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKD------------------- 1041 I E+F +++S+ LS+S+ ME+ + ++YGG+RKVKVNQVKD Sbjct: 114 IGEEFEDNSSVGLSISYAMEEQENGVSYGGLRKVKVNQVKDPIEHDIGVSMEQTYHRGGE 173 Query: 1040 -------------SDNAMSMPMSHS------------FNKRDGTTI-SFGDFQEGSEMNP 939 NA M S++ F K D I SFG +QE S M Sbjct: 174 ITFESIGQHYGKEGGNATLMGQSYNTGESNITCTGSTFGKGDNNNIISFGGYQEESVMEA 233 Query: 938 SGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIV-TNKTPKSKVEQKMS 762 R +++Y LL QSS Q SE+ +KE+ PN+ T + T K K + K S Sbjct: 234 LARPVSSYSLLYEQSSAQTSETPTKKEVGAPNSGATVGTTQAPKPKVDSTSKIKSDTKPS 293 Query: 761 NKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNA 582 K PN+FPSNVRSL++TG+LDGVPVKY+S S EE RG+IKGSGYLC CQSCNYSK LNA Sbjct: 294 RKEAPNSFPSNVRSLIATGMLDGVPVKYVSVSREELRGIIKGSGYLCGCQSCNYSKALNA 353 Query: 581 YEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRI 402 YEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+ST +++LF+AIQTVTG PINQKAFR Sbjct: 354 YEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTSESMLFDAIQTVTGSPINQKAFRT 413 Query: 401 WKESYQAATRELERIYGKDELN 336 WKES+QAATREL+RIYGK+ELN Sbjct: 414 WKESFQAATRELQRIYGKEELN 435 >ref|XP_007016512.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508786875|gb|EOY34131.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 369 bits (947), Expect = 2e-99 Identities = 218/470 (46%), Positives = 275/470 (58%), Gaps = 73/470 (15%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MSFQ+K FWMAKG + ++DGD DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N Sbjct: 1 MSFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPN 58 Query: 1346 SRQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL------------- 1254 ++ SG+ N+N+ PWEN S+FQSV Q FT+R Sbjct: 59 NKSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKA 118 Query: 1253 ----FGSE-------------PSRTIDFGGRNFQSINT-------------------GNL 1182 FG + P ++GG +N N Sbjct: 119 
IEDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNS 178 Query: 1181 DIGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMS 1023 D+ IE + S +SM H+ + +G N G + + Sbjct: 179 DMTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIP 236 Query: 1022 MPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPN 843 + M ++ K D +SFG F E E+ P GR L+++E SS SE EK+L Sbjct: 237 ISMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDAST 296 Query: 842 ANVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWS 666 A V+ + T + ++K E K S K PN+FPSNVRSL+STG+LDGVPVKYIS S Sbjct: 297 AVVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLS 356 Query: 665 HEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQE 486 EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQE Sbjct: 357 REELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQE 416 Query: 485 LKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 L+STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 417 LRSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 466 >ref|XP_007016513.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|590589665|ref|XP_007016515.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786876|gb|EOY34132.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508786878|gb|EOY34134.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 489 Score = 367 bits (942), Expect = 8e-99 Identities = 217/469 (46%), Positives = 274/469 (58%), Gaps = 73/469 (15%) Frame = -1 Query: 1523 SFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 1344 SFQ+K FWMAKG + ++DGD DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N+ Sbjct: 24 SFQNKSFWMAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNN 81 Query: 1343 RQISGVPNVNL-PWENPSNFQSVTGQ---------------FTDRL-------------- 1254 + SG+ N+N+ PWEN S+FQSV Q FT+R Sbjct: 82 KSSSGISNLNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAI 141 Query: 1253 ---FGSE-------------PSRTIDFGGRNFQSINT-------------------GNLD 1179 FG + P ++GG +N N D Sbjct: 
142 EDHFGEDASVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSD 201 Query: 1178 IGRRGIEEQFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSM 1020 + IE + S +SM H+ + +G N G + + + Sbjct: 202 MTT--IEAYDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPI 259 Query: 1019 PMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNA 840 M ++ K D +SFG F E E+ P GR L+++E SS SE EK+L A Sbjct: 260 SMGDTYGKEDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTA 319 Query: 839 NVLENATPLA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSH 663 V+ + T + ++K E K S K PN+FPSNVRSL+STG+LDGVPVKYIS S Sbjct: 320 VVVASTTRTPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSR 379 Query: 662 EEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQEL 483 EE RGVIKGSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL Sbjct: 380 EELRGVIKGSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQEL 439 Query: 482 KSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 +STP++LLF+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 440 RSTPESLLFDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 488 >ref|XP_003539090.1| PREDICTED: uncharacterized protein LOC100802229 [Glycine max] Length = 463 Score = 366 bits (940), Expect = 1e-98 Identities = 217/465 (46%), Positives = 273/465 (58%), Gaps = 68/465 (14%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS Q+KGFWM KG SG + D D DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ Sbjct: 1 MSLQNKGFWMVKG-SGHINDRDTVFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59 Query: 1346 SRQISGVPNVNLP-WENPSNFQSVTGQ--------------FTDR--------------- 1257 + G NVN+P WEN NF SV Q FT++ Sbjct: 60 EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTYVLADDSNVRSKM 119 Query: 1256 ---LFGSEPS-------------RTIDFGGRNFQSINT-----------------GNLDI 1176 +G E S ++FGG +N N D+ Sbjct: 120 VTNQYGDEASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEVDVQALEGHNFGRQSNGDL 179 Query: 1175 GRRGIEEQFGNDASIALSMSHTMEDLGSCLNYGG----IRKVKVNQVKDSDNAMSMPMSH 1008 + E ASI + + L Y +R + VK D+ +S+ S Sbjct: 180 
HQAYNREVETRSASIGQAFDKDRDATLMGLTYSRGDAHVRSFGASFVKGDDSIVSI--SE 237 Query: 1007 SFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLE 828 S+NK D ISFG F + ++ GR Y+ L QSS+ S + EKEL +++ + Sbjct: 238 SYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHVSTTAHEKELDVSSSDAVA 297 Query: 827 NATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFR 651 + +A V ++T K+K E K + K PN+FPSNVRSL+STGILDGVPVKY+S S EE R Sbjct: 298 STLQVAKVKSETVSKNKQELKTAKKEAPNSFPSNVRSLISTGILDGVPVKYVSVSREELR 357 Query: 650 GVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTP 471 G+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP Sbjct: 358 GIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTP 417 Query: 470 QNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 ++LLF+ IQTV G PINQKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 418 ESLLFDTIQTVFGAPINQKAFRNWKESFQAATRELQRIYGKEELN 462 >ref|XP_006592206.1| PREDICTED: uncharacterized protein LOC100819317 isoform X1 [Glycine max] Length = 464 Score = 361 bits (927), Expect = 4e-97 Identities = 218/473 (46%), Positives = 274/473 (57%), Gaps = 76/473 (16%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS Q+KGFWM KG SG + D + DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ Sbjct: 1 MSLQNKGFWMVKG-SGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDAD 59 Query: 1346 SRQISGVPNVNLP-WENPSNFQSV------------------------------------ 1278 + G NVN+P WEN NF SV Sbjct: 60 EKSSPGFSNVNIPPWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSK 119 Query: 1277 --TGQF-TDRLFGSEPSRTID-------FGG-------------------RNFQSINTGN 1185 T Q+ D FG S +I+ FGG NF N GN Sbjct: 120 MITNQYGDDASFGLSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGN 179 Query: 1184 L--------DIGRRGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDN 1032 L + I + F D +L ++++ D +R VK D+ Sbjct: 180 LHQAYNREVETRSASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDS 232 Query: 1031 AMSMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELV 852 +S+ S S+NK D ISFG F + ++ GR Y+ L QSS+ S + EKEL Sbjct: 233 
IVSI--SESYNKEDTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELD 290 Query: 851 DPNANVLENATPLAIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYI 675 +++ + + +A V ++T K+K E K + PN+FPSNVRSL+STGILDGVPVKYI Sbjct: 291 VSSSDAVASTLQVAKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYI 350 Query: 674 SWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGI 495 S S EE RG+IKGSGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY I Sbjct: 351 SVSREELRGIIKGSGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQI 410 Query: 494 VQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 VQEL+STP++LLF+ IQTV G PI+QKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 411 VQELRSTPESLLFDTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGKEELN 463 >ref|XP_007016516.1| Uncharacterized protein isoform 5 [Theobroma cacao] gi|508786879|gb|EOY34135.1| Uncharacterized protein isoform 5 [Theobroma cacao] Length = 458 Score = 352 bits (904), Expect = 2e-94 Identities = 211/461 (45%), Positives = 267/461 (57%), Gaps = 73/461 (15%) Frame = -1 Query: 1499 MAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPN 1320 MAKG + ++DGD DN SRIEPKR+H WF+DA EP+LFP+KKQA++A N++ SG+ N Sbjct: 1 MAKGPAH-ISDGDAAFDNPSRIEPKRSHNWFVDA-EPQLFPSKKQAIQAPNNKSSSGISN 58 Query: 1319 VNL-PWENPSNFQSVTGQ---------------FTDRL-----------------FGSE- 1242 +N+ PWEN S+FQSV Q FT+R FG + Sbjct: 59 LNVSPWENVSSFQSVPSQFIDRLFGSDSERPENFTERNISPVEVDNIRRKAIEDHFGEDA 118 Query: 1241 ------------PSRTIDFGGRNFQSINT-------------------GNLDIGRRGIEE 1155 P ++GG +N N D+ IE Sbjct: 119 SVGSSISHTMEDPETCFNYGGIRKVKVNQVKDSANSMHAPKEHSFSRENNSDMTT--IEA 176 Query: 1154 QFGNDASIALSMSHTMED-------LGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNK 996 + S +SM H+ + +G N G + + + M ++ K Sbjct: 177 YDRENESSFISMGHSYDKEYDNVALMGHTYNRGDTHIRTATPAYGKGDEIPISMGDTYGK 236 Query: 995 RDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATP 816 D +SFG F E E+ P GR L+++E SS SE EK+L A V+ + T Sbjct: 237 EDANILSFGGFHEEHEIIPVGRPLSSFEPSYTPSSNPSSEGASEKQLDASTAVVVASTTR 296 Query: 815 LA-IVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIK 639 + ++K E K S K 
PN+FPSNVRSL+STG+LDGVPVKYIS S EE RGVIK Sbjct: 297 TPKLRPESASRTKPELKSSKKEAPNSFPSNVRSLISTGMLDGVPVKYISLSREELRGVIK 356 Query: 638 GSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLL 459 GSGYLC CQSCN+SKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQEL+STP++LL Sbjct: 357 GSGYLCGCQSCNFSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLL 416 Query: 458 FEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 F+ IQTV G PINQK+FRIWKES+QAATREL+RIYGK+ELN Sbjct: 417 FDTIQTVFGAPINQKSFRIWKESFQAATRELQRIYGKEELN 457 >ref|XP_006592207.1| PREDICTED: uncharacterized protein LOC100819317 isoform X2 [Glycine max] Length = 455 Score = 345 bits (886), Expect = 2e-92 Identities = 209/460 (45%), Positives = 264/460 (57%), Gaps = 76/460 (16%) Frame = -1 Query: 1487 GSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP 1308 GSG + D + DN ++IEPKR HQWF+DA E + FPNKKQAVE ++ + G NVN+P Sbjct: 4 GSGQINDRETIFDNPTKIEPKRPHQWFVDAAEVDFFPNKKQAVEDADEKSSPGFSNVNIP 63 Query: 1307 -WENPSNFQSV--------------------------------------TGQF-TDRLFG 1248 WEN NF SV T Q+ D FG Sbjct: 64 PWENNPNFHSVPNQFIGRLFGSETRPVNFTEKNTSYVLADDSNVRSKMITNQYGDDASFG 123 Query: 1247 SEPSRTID-------FGG-------------------RNFQSINTGNL--------DIGR 1170 S +I+ FGG NF N GNL + Sbjct: 124 LSISHSIEDSEACVNFGGIKKVKVNQVKEDDIQALEGHNFGRPNNGNLHQAYNREVETRS 183 Query: 1169 RGIEEQFGNDASIAL-SMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 993 I + F D +L ++++ D +R VK D+ +S+ S S+NK Sbjct: 184 ASIGQAFDRDGDASLMGLTYSKGD-------AHVRSFSAPFVKGDDSIVSI--SESYNKE 234 Query: 992 DGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPL 813 D ISFG F + ++ GR Y+ L QSS+ S + EKEL +++ + + + Sbjct: 235 DTNIISFGGFPDERDIISVGRPAAEYDQLYNQSSVHGSTTAHEKELDVSSSDAVASTLQV 294 Query: 812 AIVTNKT-PKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKG 636 A V ++T K+K E K + PN+FPSNVRSL+STGILDGVPVKYIS S EE RG+IKG Sbjct: 295 AKVKSETVSKNKQELKTAKNEAPNSFPSNVRSLISTGILDGVPVKYISVSREELRGIIKG 354 Query: 635 SGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLF 456 SGYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY 
IVQEL+STP++LLF Sbjct: 355 SGYLCGCQSCNYTKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQELRSTPESLLF 414 Query: 455 EAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 + IQTV G PI+QKAFR WKES+QAATREL+RIYGK+ELN Sbjct: 415 DTIQTVFGAPIHQKAFRNWKESFQAATRELQRIYGKEELN 454 >ref|XP_006350905.1| PREDICTED: uncharacterized protein LOC102583417 [Solanum tuberosum] Length = 560 Score = 342 bits (878), Expect = 2e-91 Identities = 203/453 (44%), Positives = 267/453 (58%), Gaps = 57/453 (12%) Frame = -1 Query: 1523 SFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNS 1344 SF K FW+ K G G L+DG+ D+SSRI+ KRAHQ F E ELFPNKKQAV S Sbjct: 113 SFHDKDFWIPKCG-GHLSDGEAVFDSSSRIDVKRAHQLFSSTAEAELFPNKKQAVHTSLG 171 Query: 1343 RQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRN-------------- 1209 + S + N WE S+ S QF DRLF + +R ++ R+ Sbjct: 172 KSTSEIAVTNSTCWETTSDLPSGANQFIDRLFRVDTTRPVNLTERSTGNSTIRKKVIDDQ 231 Query: 1208 --------------------------FQSINTGNLDIGRRGIEEQFGNDASIALSMSHTM 1107 +++N ++ N+ ++++S H Sbjct: 232 IGDDPLVGLSMSYTIEEQQICISDSRIRNLNVNQVEDSENAFHSPIENNINMSISQVHNR 291 Query: 1106 ---------------EDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKRDGTTISF 972 ED N G I + + V+ S + + P++ S+ + D TI F Sbjct: 292 ASETSFLSMGQAYGKEDESQTYNPGDISRSIRSNVEKSHS--TTPIADSYTRGDSDTI-F 348 Query: 971 GDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIV-TNK 795 G F+ S+++ R ++ Y+ L QSS+ SE +K+L NA ++ ++ + T+ Sbjct: 349 G-FELVSDIDALARPISGYDYLHYQSSVDTSEPHCDKQLDGSNAKAVDISSQTSKPRTDS 407 Query: 794 TPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSC 615 PK+K E K ++K PN+FPSNVRSLL+TGILDGVPVKY+ S +E RG+IKGSGYLC C Sbjct: 408 LPKTKSESKPAHKGAPNSFPSNVRSLLATGILDGVPVKYVL-SRQELRGIIKGSGYLCGC 466 Query: 614 QSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVT 435 Q CNYSKVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY I QEL+STPQ+LLFEAIQTVT Sbjct: 467 QPCNYSKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQITQELRSTPQSLLFEAIQTVT 526 Query: 434 GHPINQKAFRIWKESYQAATRELERIYGKDELN 336 G PINQKAF+IWKES+QAATREL+RIYGK+ELN Sbjct: 527 GSPINQKAFQIWKESFQAATRELQRIYGKEELN 559 
>ref|XP_002274877.2| PREDICTED: uncharacterized protein LOC100251629 [Vitis vinifera] Length = 599 Score = 341 bits (875), Expect = 5e-91 Identities = 183/331 (55%), Positives = 228/331 (68%), Gaps = 22/331 (6%) Frame = -1 Query: 1253 FGSEPSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSMSHTME-------DLG 1095 +G E + I G Q+ N G+ +I + G D +I SM HT +G Sbjct: 277 YGREDNNFISMG----QAYNKGDENIAMSHTYK--GGDNTI--SMGHTFSKGDNNIISMG 328 Query: 1094 SCLNYGGIRKVKVNQV--KDSDNAMSM-----------PMSHSFNKRDGTTISFGDFQEG 954 N G + + + K +N +SM + HS+NK + ISFG F + Sbjct: 329 QTYNKGDDNTISMGHIYNKGDENTISMGHTYKGDNSNLSIGHSYNKGESNIISFGGFHDD 388 Query: 953 SE-MNPSGRLLTNYELLMGQSSIQPSESLKEKELVDPNANVLENATPLAIVTNKT-PKSK 780 + NPSGRL+ +Y+LLMGQ S+Q SE+L EK+LV+ NA+ L + + ++T K K Sbjct: 389 DDDTNPSGRLVCSYDLLMGQPSVQRSEALNEKKLVESNADALISTAQITASGSETVSKKK 448 Query: 779 VEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNY 600 EQK+S KVPPNNFPSNVRSLLSTG+LDGVPVKYI+WS EE RG+IKGSGYLC CQSCN+ Sbjct: 449 EEQKLSKKVPPNNFPSNVRSLLSTGMLDGVPVKYIAWSREELRGIIKGSGYLCGCQSCNF 508 Query: 599 SKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPIN 420 SKV+NAYEFERHAGCKTKHPNNHI+F+NGKTIYGIVQELKSTPQN LF+ IQT+TG PIN Sbjct: 509 SKVINAYEFERHAGCKTKHPNNHIYFENGKTIYGIVQELKSTPQNSLFDVIQTITGSPIN 568 Query: 419 QKAFRIWKESYQAATRELERIYGKDELNQLS 327 QK+FR+WKES+ AATREL+RIYGK+E QLS Sbjct: 569 QKSFRLWKESFLAATRELQRIYGKEEGKQLS 599 Score = 223 bits (568), Expect = 2e-55 Identities = 109/186 (58%), Positives = 139/186 (74%), Gaps = 1/186 (0%) Frame = -1 Query: 1529 KMSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEAS 1350 +MSFQ+KGFWMAKG GC+ DG+M DN SRIEPKR+HQWF+D TE ELFPNKKQAVE Sbjct: 61 RMSFQNKGFWMAKG-VGCVTDGEMAYDNPSRIEPKRSHQWFMDGTE-ELFPNKKQAVEVP 118 Query: 1349 NSRQISGVPNVNL-PWENPSNFQSVTGQFTDRLFGSEPSRTIDFGGRNFQSINTGNLDIG 1173 NS G+ N N+ PW N S F SV+G FT+RLF E +RT++F RN S+ GN+++ Sbjct: 119 NSNLFPGLSNPNVSPWANASGFHSVSGHFTERLFDPEAARTVNFDDRNIPSVGAGNMNMA 178 Query: 1172 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHSFNKR 993 R+ IE+ FGN++ 
LSMSH++ED S LNYGGIRKVKV+QVKDS+N MS+ M H++ + Sbjct: 179 RKVIEDPFGNESLFGLSMSHSLEDPRSGLNYGGIRKVKVSQVKDSENIMSVSMGHTYTRA 238 Query: 992 DGTTIS 975 D T+S Sbjct: 239 DNNTMS 244 >ref|NP_200791.2| uncharacterized protein [Arabidopsis thaliana] gi|42573736|ref|NP_974964.1| uncharacterized protein [Arabidopsis thaliana] gi|332009855|gb|AED97238.1| uncharacterized protein AT5G59830 [Arabidopsis thaliana] gi|332009856|gb|AED97239.1| uncharacterized protein AT5G59830 [Arabidopsis thaliana] Length = 425 Score = 338 bits (867), Expect = 4e-90 Identities = 193/428 (45%), Positives = 263/428 (61%), Gaps = 33/428 (7%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS++SKGFW+ K ++ D D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 1346 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 1173 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 1172 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 1038 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 1037 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 900 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234 Query: 899 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 726 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 725 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 546 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 545 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 366 HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 
HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 365 ERIYGKDE 342 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >gb|AAV66096.1| At5g59830 [Arabidopsis thaliana] Length = 425 Score = 338 bits (867), Expect = 4e-90 Identities = 193/428 (45%), Positives = 263/428 (61%), Gaps = 33/428 (7%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS++SKGFW+ K ++ D D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 1346 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 1173 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 1172 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 1038 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 1037 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 900 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSWENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSASNVVGNYQSYV- 234 Query: 899 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 726 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 725 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 546 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 545 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 366 HPNNHI+F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 HPNNHIYFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 365 ERIYGKDE 342 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >dbj|BAF00943.1| hypothetical protein [Arabidopsis thaliana] Length = 425 Score = 337 bits (864), Expect = 9e-90 Identities = 192/428 (44%), Positives = 263/428 (61%), Gaps = 33/428 (7%) Frame = -1 Query: 1526 
MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS++SKGFW+ K ++ D D+S+R + KR H WF+D++ E+FPNKKQAV+ + Sbjct: 1 MSYESKGFWVMKNNEHT-SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--D 57 Query: 1346 SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 1173 G NV LP WE+ S FQSV+ QF DRL G+E P R + FG R+ + + Sbjct: 58 PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ-- 115 Query: 1172 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDS--------------- 1038 + I E + D S+ LS+S+ +E G C G RK+ V++VK++ Sbjct: 116 NKSIAESYMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKI 175 Query: 1037 ----------DNAMSMP----MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 900 +N S H + D I+FG+ + + + ++ NY+ + Sbjct: 176 ESSSIQACSRENESSYINFALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV- 234 Query: 899 QSSIQPSESLKEKEL--VDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 726 Q I + + ++E ++ V+ PK+K E K S K +FPSNV Sbjct: 235 QDPIGTLDIVYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNV 294 Query: 725 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 546 RSL+STG+LDGVPVKY+S S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 295 RSLISTGMLDGVPVKYVSVSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 354 Query: 545 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 366 HPNNHI+F+NG+TIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 355 HPNNHIYFENGRTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATREL 414 Query: 365 ERIYGKDE 342 +RIYGK+E Sbjct: 415 QRIYGKEE 422 >ref|XP_006280518.1| hypothetical protein CARUB_v10026471mg [Capsella rubella] gi|482549222|gb|EOA13416.1| hypothetical protein CARUB_v10026471mg [Capsella rubella] Length = 422 Score = 336 bits (862), Expect = 1e-89 Identities = 195/428 (45%), Positives = 261/428 (60%), Gaps = 33/428 (7%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASN 1347 MS++SKGFW+ K ++ D D+S+R + KR H WF D++ ++FPNKKQAV+ Sbjct: 1 MSYESKGFWVLKNNEHT-SEEDSVYDHSTRDDSKRPHPWFADSSRSDMFPNKKQAVQDPV 59 Query: 1346 
SRQISGVPNVNLP-WENPSNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIG 1173 G ++ LP WE+ S FQSV+ QF DRL G+E PSR + FG R+ G Sbjct: 60 GGL--GKSSLGLPLWESSSVFQSVSNQFMDRLLGAEMPSRPLLFGDRDRTE---GCSHHQ 114 Query: 1172 RRGIEEQFGNDASIALSMSHTMEDLGSCLNYGGIRKVKVNQVKDSDNAMSMPMSHS---- 1005 + I E F + S+ LS+S+ +E GSC GIRK+ V++VK++ + + HS Sbjct: 115 NKSIAESFMENTSVELSISNGVEVAGSCFGGDGIRKLPVSRVKETMSTHAALDGHSQRKI 174 Query: 1004 -------------------------FNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMG 900 + D I+FG+ + + S NY+ + Sbjct: 175 ESSSIQACSRENESSFINFALAGHPYGNEDSHGITFGEINDEHGVGSSSN--GNYQSYV- 231 Query: 899 QSSIQPSESL--KEKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNV 726 Q I+ S+ + +E ++ V+ PK+K E K S K +FPSNV Sbjct: 232 QDPIETSDMVYGQETGCSQTSSRVVSEQQMAKPSLETPPKNKAEAKTSKKEASTSFPSNV 291 Query: 725 RSLLSTGILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTK 546 RSL+STG+LDGVPVKYIS S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTK Sbjct: 292 RSLISTGMLDGVPVKYISLSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTK 351 Query: 545 HPNNHIFFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATREL 366 HPNNHI+F+NGKTIY IVQEL++T +++LF+ IQTV G PINQKAFRIWKES+QAATREL Sbjct: 352 HPNNHIYFENGKTIYQIVQELRNTQESMLFDVIQTVFGSPINQKAFRIWKESFQAATREL 411 Query: 365 ERIYGKDE 342 +RIYGK+E Sbjct: 412 QRIYGKEE 419 >ref|XP_007205145.1| hypothetical protein PRUPE_ppa005281mg [Prunus persica] gi|462400787|gb|EMJ06344.1| hypothetical protein PRUPE_ppa005281mg [Prunus persica] Length = 469 Score = 331 bits (849), Expect = 5e-88 Identities = 207/471 (43%), Positives = 261/471 (55%), Gaps = 74/471 (15%) Frame = -1 Query: 1526 MSFQSKGFWMAKGGSGCLADGDMGCDNSSRIEPKRAHQWFLDATEPELF---------PN 1374 MSFQ+KGFWM KG +G + DGD N SRIEPKR HQWF+DA EPELF PN Sbjct: 1 MSFQNKGFWMPKG-AGLVNDGDATYGNPSRIEPKRPHQWFVDAAEPELFPNKKQAVHIPN 59 Query: 1373 KKQAVEAS--------NSRQISGVPN--------------VNLPWENPSNFQSVTGQFT- 1263 K S N+ VP+ VN N S S Sbjct: 60 SKLGSGMSNENVSSWENASSFQSVPHQFIDRLFGSDTASSVNFAERNISPVGSDNWNIRK 119 Query: 1262 --DRLFGSE-------------PSRTIDFGGRNFQSIN-TGNLDIGRRGIEEQFGNDASI 1131 D FG + P +++ 
G +N + D G E N S Sbjct: 120 GIDDQFGEDSPVSLSVSHAMEDPETCLNYAGIRKVKVNQVRDSDNGMHASREHGSNRGSN 179 Query: 1130 A-LSMSHTMED----------------------LGSCLNYGG--IRKVKVNQVKDSDNAM 1026 + LS S + +G N+G +R + N K +NA+ Sbjct: 180 SNLSSSQAFDRVNETAFLSVGQAYDKEHGSVTLIGHPYNHGDAHVRPIDTNYGKGDENAI 239 Query: 1025 SMPMSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKELVDP 846 S+ + +K + ISFG F + ++ P GR + NY+ L S+Q E+ EK+L Sbjct: 240 SV--GDNCSKGNANMISFGGFPDEQDIIPIGRPVGNYDQLYHPDSVQTLETSYEKDLDAS 297 Query: 845 NANVLENATPLAIVT-NKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYISW 669 NA+ ++N LA K+K E K S K PN+FPSNVRSL+STG+LDGVPVKY+S Sbjct: 298 NASAVDNTASLAKPRLESVSKNKPEIKPSRKPAPNSFPSNVRSLISTGMLDGVPVKYVSL 357 Query: 668 SHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIVQ 489 + EE RG+IKG GYLC CQSCNY+KVLNAYEFERHAGCKTKHPNNHI+F+NGKTIY IVQ Sbjct: 358 AREELRGIIKGVGYLCGCQSCNYAKVLNAYEFERHAGCKTKHPNNHIYFENGKTIYQIVQ 417 Query: 488 ELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDELN 336 EL+STP++LLF+ +QTV G PINQK+F WKES+QAATREL+RIYGK+ELN Sbjct: 418 ELRSTPESLLFDTLQTVFGAPINQKSFHSWKESFQAATRELQRIYGKEELN 468 >dbj|BAB08350.1| unnamed protein product [Arabidopsis thaliana] Length = 415 Score = 324 bits (831), Expect = 6e-86 Identities = 185/410 (45%), Positives = 252/410 (61%), Gaps = 33/410 (8%) Frame = -1 Query: 1472 ADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENP 1296 ++ D D+S+R + KR H WF+D++ E+FPNKKQAV+ + G NV LP WE+ Sbjct: 8 SEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQ--DPVVGLGKSNVGLPLWESS 65 Query: 1295 SNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSM 1119 S FQSV+ QF DRL G+E P R + FG R+ + + + I E + D S+ LS+ Sbjct: 66 SVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQ--NKSIAESYMEDTSVELSI 123 Query: 1118 SHTMEDLGSCLNYGGIRKVKVNQVKDS-------------------------DNAMSMP- 1017 S+ +E G C G RK+ V++VK++ +N S Sbjct: 124 SNGVEVAGGCFGGDGNRKLPVSRVKETMSTHVALEGHSQRKIESSSIQACSRENESSYIN 183 Query: 1016 ---MSHSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYELLMGQSSIQPSESLKEKEL--V 852 H + D I+FG+ + + + ++ NY+ + Q I + + ++E Sbjct: 184 
FALAGHPYGNEDSQGITFGEINDEHGVGSTSNVVGNYQSYV-QDPIGTLDIVYDQETGSS 242 Query: 851 DPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLSTGILDGVPVKYIS 672 ++ V+ PK+K E K S K +FPSNVRSL+STG+LDGVPVKY+S Sbjct: 243 QTSSGVVSEQQVAKPSLGSLPKTKAEAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVS 302 Query: 671 WSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHIFFDNGKTIYGIV 492 S EE RGVIKGSGYLC CQ+C+++KVLNAY FERHAGCKTKHPNNHI+F+NGKTIY IV Sbjct: 303 VSREELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTIYQIV 362 Query: 491 QELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGKDE 342 QEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK+E Sbjct: 363 QELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGKEE 412 >ref|XP_002864653.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp. lyrata] gi|297310488|gb|EFH40912.1| hypothetical protein ARALYDRAFT_332253 [Arabidopsis lyrata subsp. lyrata] Length = 415 Score = 322 bits (826), Expect = 2e-85 Identities = 189/422 (44%), Positives = 251/422 (59%), Gaps = 45/422 (10%) Frame = -1 Query: 1472 ADGDMGCDNSSRIEPKRAHQWFLDATEPELFPNKKQAVEASNSRQISGVPNVNLP-WENP 1296 ++ D D S+R + KR H WF+D++ E+FPNKKQAV+ G NV LP WE+ Sbjct: 8 SEDDSVYDQSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQDPVGGL--GKSNVGLPLWESS 65 Query: 1295 SNFQSVTGQFTDRLFGSE-PSRTIDFGGRNFQSINTGNLDIGRRGIEEQFGNDASIALSM 1119 S FQSV+ QF DRL G+E P R + FG R+ + + + I E + D S+ LS+ Sbjct: 66 SVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQT--KSIAESYMEDTSVELSI 123 Query: 1118 SHTMEDLGSCLNYGGIRKVKVNQVK---------DSDNAMSMPMS--------------- 1011 S+ +E GS GIRK+ V++VK D N + S Sbjct: 124 SNGVEVAGSSFGGDGIRKLPVSRVKETMSTHVALDGHNQRKIESSSIQACSRENESSFIN 183 Query: 1010 -----HSFNKRDGTTISFGDFQEGSEMNPSGRLLTNYE-----------LLMGQ---SSI 888 H + D I+FG+ + + + ++ NY+ ++ GQ SS Sbjct: 184 FALAGHPYGNEDSHGITFGEINDEHGVGSTSNVVGNYQSYVQDPIGTLDIVYGQETGSSQ 243 Query: 887 QPSESLKEKELVDPNANVLENATPLAIVTNKTPKSKVEQKMSNKVPPNNFPSNVRSLLST 708 S + E+++ P+ + PK+K E K S K +FPSNVRSL+ST Sbjct: 244 TSSGVVSEQQVAKPSLEPV-------------PKNKAETKSSKKEASTSFPSNVRSLIST 290 Query: 707 
GILDGVPVKYISWSHEEFRGVIKGSGYLCSCQSCNYSKVLNAYEFERHAGCKTKHPNNHI 528 G+LDGVPV Y+S S EE RGVIKGSGYLC CQ+C ++KVLNAY FERHAGCKTKHPNNHI Sbjct: 291 GMLDGVPVTYVSISREELRGVIKGSGYLCGCQTCEFTKVLNAYAFERHAGCKTKHPNNHI 350 Query: 527 FFDNGKTIYGIVQELKSTPQNLLFEAIQTVTGHPINQKAFRIWKESYQAATRELERIYGK 348 +F+NGKTIY IVQEL++TP+++LF+ IQTV G PINQKAFRIWKES+QAATREL+RIYGK Sbjct: 351 YFENGKTIYQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGK 410 Query: 347 DE 342 +E Sbjct: 411 EE 412