BLASTX nr result
ID: Paeonia22_contig00015116
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia22_contig00015116 (1728 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu... 477 e-132 ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593... 456 e-125 ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247... 443 e-121 gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis] 439 e-120 ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629... 434 e-119 ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr... 433 e-118 ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma... 430 e-118 ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm... 422 e-115 ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma... 414 e-113 ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781... 412 e-112 ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phas... 410 e-112 ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma... 367 9e-99 ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629... 365 3e-98 ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma... 342 1e-91 gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi... 335 3e-89 ref|XP_006470788.1| PREDICTED: uncharacterized protein LOC102629... 323 2e-85 ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766... 317 8e-84 gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo... 317 8e-84 ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A... 310 1e-81 dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou... 309 2e-81 >ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] gi|550342350|gb|EEE79091.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] Length = 489 Score = 477 bits (1227), Expect = e-132 Identities = 268/494 (54%), Positives = 329/494 (66%), Gaps = 18/494 (3%) Frame = -1 Query: 1632 MEGVKENTRSGEDWCTV--EVALGEAAATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPX 1459 ME K + + E+ +V E+ LG+AA TF+LEKAVCSHGLFMM PNHWDP + T RP Sbjct: 1 MENTKTDGKEEEEEESVVFEIPLGDAAETFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPL 60 Query: 1458 XXXXXXXXXXXXXXXXXXXXXXHPPTH-PNSLHLRIFGTRFLSPQYQQALQAQVIRMLRL 1282 P H P SL +R++GTR LSP++Q++L AQV+RMLRL Sbjct: 61 RLSLSDSDPQVSTPTTSLFVSISHPPHLPRSLSVRVYGTRCLSPKHQESLVAQVVRMLRL 120 Query: 1281 SETDERNVREFQKIH----GKDERS----FSGRVFRSPTLFEDMVKCILLCNCQWPRTLS 1126 SETDERN REF+KI ++ S F GRVFRSPTLFEDMVKCILLCNCQWPRTLS Sbjct: 121 SETDERNAREFRKIAEAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWPRTLS 180 Query: 1125 MARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESEH-FTPKTPAGKESKRKLGVQKVS 949 MARA + N ++ H F P T AGKESKR + KV+ Sbjct: 181 MARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRASKVT 240 Query: 948 ANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDS 769 NL ++ ETE +L+ D + ++ L S + +D C+S R + + DS Sbjct: 241 KNLASKIVETET--LLEADANLKTDSAHIGRETLES-VENDSCARCSS-RHGSDSWAPDS 296 Query: 768 CLPFENQISI-----KSLGNFPTPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRI 604 ++Q I K + NFP+PRELA++DE+ LAKRC LGYRA RI+KLAQS+VEGRI Sbjct: 297 ---LQSQHGIQPGVNKMICNFPSPRELANLDESFLAKRCNLGYRAIRIIKLAQSIVEGRI 353 Query: 603 QLEELEEVC-KVASLASYDMLADKLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLE 427 L E+EE C AS + Y+ LAD+ R+I+GFGPFTCANVL+CMGFYHIIPTDSET RHL+ Sbjct: 354 PLREVEEDCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCMGFYHIIPTDSETVRHLK 413 Query: 426 QVHARESNMKTVQRDVEEIYGKYAPFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITAS 247 QVHA++S ++TVQRDVEEIYGKYAPFQFLAYW+ELWHFYE RFGKLSE+P SDYKLITAS Sbjct: 414 QVHAKKSTIQTVQRDVEEIYGKYAPFQFLAYWAELWHFYEKRFGKLSEIPTSDYKLITAS 473 Query: 246 NMRTKGTSKNKKAK 205 NMR+KG KNK+ K Sbjct: 474 NMRSKGGQKNKRTK 487 >ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED: uncharacterized protein LOC102593287 isoform X2 [Solanum tuberosum] Length = 485 Score = 456 bits (1173), Expect = e-125 Identities = 247/475 (52%), Positives = 306/475 (64%), Gaps = 13/475 (2%) Frame = -1 Query: 1584 VEVALGEA-----AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXX 1420 VE+ LG+ ATFDLEKAVCSHGLFMM PN WD +KTL RP Sbjct: 15 VELPLGDGDGDGGCATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQ 74 Query: 1419 XXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKI 1240 P P+SL LR+FGT LS +Q++L QV RM+RLS + + V++FQ+I Sbjct: 75 SVLVQINQ--PSDSPHSLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEI 132 Query: 1239 HGKDERSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXX 1060 G+ + GRVFRSPTLFEDMVKC+LLCNCQW RTLSMA A Sbjct: 133 CGEAKDRGLGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFP 192 Query: 1059 XXXXXXXXNPVA-ESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQ 883 V +SEHFTP+TPAGKES+++ G S L R TE E + + + Sbjct: 193 DPDNQNQLKGVTFKSEHFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGV- 251 Query: 882 TINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFE-------NQISIKSLGN 724 ++P F +G+++ + N CR + EV V + PF S LGN Sbjct: 252 -----TVTPAF---SVGEEVLKKSNLCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGN 303 Query: 723 FPTPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDML 544 FP+P+ELAS+DE+ LAKRC LGYRA RI+KLA+ +VEG IQL+ELEE C SL+ YD + Sbjct: 304 FPSPKELASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKM 363 Query: 543 ADKLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYG 364 A++LREI+GFGPFTCANVL+C+G+YH+IPTDSET RHL+QVHAR S ++ VQRDVE IYG Sbjct: 364 AEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQNVQRDVENIYG 423 Query: 363 KYAPFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 KYAPFQFLAYWSE+WHFYE RFGKLSEMP S+YKLITA+NMR K K KK K++ Sbjct: 424 KYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRRKRNGKCKKLKIT 478 >ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum lycopersicum] Length = 483 Score = 443 bits (1140), Expect = e-121 Identities = 236/461 (51%), Positives = 297/461 (64%), Gaps = 8/461 (1%) Frame = -1 Query: 1557 ATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTH 1378 A+FDLEKAVCSHGLFMM PN WD +KTL RP P + Sbjct: 27 ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDDDHEQSVLVQITQ--PSDY 84 Query: 1377 PNSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIHGKDERSFSGRVFR 1198 P+SL LR+ T LS +Q++L QV RM+RLS + + V+ FQ+I G+ + GRVFR Sbjct: 85 PHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQEICGEAKERGFGRVFR 144 Query: 1197 SPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPV-AE 1021 SPTLFEDMVKC+LLCNCQW RTLSMA A V ++ Sbjct: 145 SPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTSK 204 Query: 1020 SEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSS 841 SEHFTP+TPAGKE +++ G S NL R E E + + + ++P F Sbjct: 205 SEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPGV------TVTPAF--- 255 Query: 840 DIGDDLRENCNSCRTSKEVYRVDSCLPFENQ-------ISIKSLGNFPTPRELASVDENI 682 +G+++ + N C+ + EV+ V P S LGNFP+P++LAS+DE+ Sbjct: 256 SVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQLASLDESF 315 Query: 681 LAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFT 502 LAKRC LGYRA RI+KLA+ +VEG IQL ELEE C SL++YD +A++LREI+GFGPFT Sbjct: 316 LAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREIDGFGPFT 375 Query: 501 CANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAYWSEL 322 CANVL+C+G+YH+IPTDSET RHL+QVHAR S ++ VQRDVE IYGKYAPFQFLAYWSE+ Sbjct: 376 CANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQNVQRDVENIYGKYAPFQFLAYWSEV 435 Query: 321 WHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 WHFYE RFGKLSEMP S+YKLITA+NMR K K KK K++ Sbjct: 436 WHFYEERFGKLSEMPHSEYKLITAANMRPKRNGKCKKLKIA 476 >gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis] Length = 472 Score = 439 bits (1128), Expect = e-120 Identities = 250/477 (52%), Positives = 301/477 (63%), Gaps = 14/477 (2%) Frame = -1 Query: 1587 TVEVALGEAAATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXX 1408 ++E+ LG+AAATF LE AVCSHGLFMM PN WDP +KTLLRP Sbjct: 4 SLELPLGDAAATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDD 63 Query: 1407 XXXXXHPPTHPNSLHLRIF---GTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIH 1237 H LR+ GTR L+ +QAL AQV RMLRLS+T+ER REF +++ Sbjct: 64 SVMARISQPHDRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVY 123 Query: 1236 GKDERSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXX 1057 G S GRVFRSPTLFEDMVKCILLCNCQWPRTLSMA+A Q Sbjct: 124 GCG--SGLGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSVP----- 176 Query: 1056 XXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAI------LKI 895 +++ F PKTPAGKE KRK+ K S LT++F + L I Sbjct: 177 ----------SKTVDFVPKTPAGKEPKRKVEKLKASTCLTSQFDAQSNEGLESHSNDLSI 226 Query: 894 DCIQ-TINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQI----SIKSL 730 D Q T + +NLSP+ L S +++ T +E Y VDS QI + Sbjct: 227 DISQPTPSAQNLSPSSLLSVPMENV--------TCEESYGVDSASLCNPQILRDREFEGT 278 Query: 729 GNFPTPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYD 550 G+FPTP ELA +DE LAKRCKLGYRA RILKLA+ +VEGRIQL ELEE C SL SY Sbjct: 279 GDFPTPTELAKLDEKFLAKRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYS 338 Query: 549 MLADKLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEI 370 LA +LR+I+GFGPFTCANVL+CMGFYH+IP+DSET RHL+QVH R S ++T++RDV++I Sbjct: 339 KLAVQLRQIDGFGPFTCANVLMCMGFYHVIPSDSETIRHLQQVHGRNSTVRTIERDVQQI 398 Query: 369 YGKYAPFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 Y KY PFQFLAYWSELWHFYE +FGK+SEMP S YKL TASNM+TK N + K + Sbjct: 399 YAKYEPFQFLAYWSELWHFYEKKFGKISEMPCSAYKLFTASNMKTKAERPNNRKKTA 455 >ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus sinensis] Length = 454 Score = 434 bits (1115), Expect = e-119 Identities = 247/473 (52%), Positives = 298/473 (63%), Gaps = 19/473 (4%) Frame = -1 Query: 1560 AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPT 1381 A TF+LE AVCSHGLFMM PN WDP +++L RP P Sbjct: 13 AETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDVTICQ-PQQ 71 Query: 1380 HPNSLHLRIFGTRF-----LSPQYQQALQAQVIRMLRLSETDERNVREFQKI-------H 1237 P+SL + + + LS + Q AL AQV RMLRLSE DERNVREF++I Sbjct: 72 DPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIVRQVAQEE 131 Query: 1236 GKDER---SFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 G++ + FSGRVFRSPTLFEDMVKC+LLCNCQWPRTLSMARA Sbjct: 132 GEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWELQHCSPSI- 190 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAI----LK 898 SE F P+TPAGKESKR+ V KV++ LT+R E++A + LK Sbjct: 191 ---------------SEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235 Query: 897 IDCIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLGNFP 718 +DC + +N+ P+F +DI DL N T+ D +GNFP Sbjct: 236 LDCAGVLE-ENVQPSFPQNDIESDLH-GLNELSTTDPPSARDR------------IGNFP 281 Query: 717 TPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLAD 538 +PRELA++DE+ LAKRC LGYRA RILKLA+ +V+G+IQL ELE++C ASL +Y LA+ Sbjct: 282 SPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAE 341 Query: 537 KLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKY 358 +L +INGFGPFT NVLVC+GFYH+IPTDSET RHL+QVHAR KTVQ E IYGKY Sbjct: 342 QLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNCTSKTVQMIAESIYGKY 401 Query: 357 APFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 APFQFLAYWSELWHFYE RFGKLSEMP SDYKLITASNM K + K+ K+S Sbjct: 402 APFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNMGIKNIRQVKRTKIS 454 >ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] gi|557533482|gb|ESR44600.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] Length = 454 Score = 433 bits (1113), Expect = e-118 Identities = 245/472 (51%), Positives = 297/472 (62%), Gaps = 19/472 (4%) Frame = -1 Query: 1560 AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPT 1381 A TF+LE AVCSHGLFMM PN WDP +++L RP P Sbjct: 13 AETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDVTICQ-PQQ 71 Query: 1380 HPNSLHLRIFGTRF-----LSPQYQQALQAQVIRMLRLSETDERNVREFQKI-------H 1237 P+SL + + + LS + Q AL AQV RMLRLSE DERNVR+F++I Sbjct: 72 DPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIVRQVAQEE 131 Query: 1236 GKDER---SFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 G++ + FSGRVFRSPTLFEDMVKC+LLCNCQWPRTL+MARA Sbjct: 132 GEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQWELQHCSPSI- 190 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAI----LK 898 SE F P+TPAGKESKR+ V KV++ LT+R E++A + LK Sbjct: 191 ---------------SEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDDMNLK 235 Query: 897 IDCIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLGNFP 718 +DC + +N+ P+F +DI DL N T+ D +GNFP Sbjct: 236 LDCTGALE-ENVQPSFPRNDIESDLH-GLNELSTTDPPSACDR------------IGNFP 281 Query: 717 TPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLAD 538 +PRELA++DE+ LAKRC LGYRA RILKLAQ +V+G+IQL ELE+ C ASL +Y+ LA+ Sbjct: 282 SPRELANLDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAE 341 Query: 537 KLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKY 358 +L +INGFGPFT NVLVC+GFYH+IPTDSET RHL+QVHAR KTVQ E IYGKY Sbjct: 342 QLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNCTSKTVQIIAESIYGKY 401 Query: 357 APFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKV 202 +PFQFLAYWSELWHFYE RFGKLSEMP SDYKLITASNM K K K+ K+ Sbjct: 402 SPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNMGIKNIRKVKRTKI 453 >ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508778582|gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 430 bits (1106), Expect = e-118 Identities = 252/502 (50%), Positives = 309/502 (61%), Gaps = 17/502 (3%) Frame = -1 Query: 1665 CNKSEEKETMDMEGVKENTRSGEDWCTV--EVALGEAAAT-----FDLEKAVCSHGLFMM 1507 C K+ KE +E + C+V E+ +GEAAA F+LEKAVCSHGLFMM Sbjct: 27 CQKTMVKE-------QEENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMM 79 Query: 1506 PPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQ 1327 PN WDP +++L RP PT ++LHLR++GTR LSPQ Sbjct: 80 APNQWDPISRSLSRPLRLLDHHSPPLTVQVRISQ------PT-ASTLHLRVYGTRCLSPQ 132 Query: 1326 YQQALQAQVIRMLRLSETDERNVREFQKI----HGKDE------RSFSGRVFRSPTLFED 1177 ++ +L QV RMLRLSE +E VREF+KI HG++E RSFSGRVFRSPTLFED Sbjct: 133 HRHSLLNQVSRMLRLSEEEESKVREFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFED 192 Query: 1176 MVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESEHFTPKT 997 MVKCILLCNCQ+ RTLSMA+A A + F PKT Sbjct: 193 MVKCILLCNCQFSRTLSMAKALCELQFETQRPFSGVR-------------AAEDDFIPKT 239 Query: 996 PAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSDIGDDLRE 817 PAG E KRKL V KVS L +F E A + SD+ Sbjct: 240 PAGNELKRKLRVSKVSMRLEGKFAEPRA-------------------DHSKSDL------ 274 Query: 816 NCNSCRTSKEVYRVDSCLPFENQISIKSLGNFPTPRELASVDENILAKRCKLGYRASRIL 637 + S+E+ + + K +G+FP+P ELA++DE+ LAKRC LGYRASRIL Sbjct: 275 -----QPSQEL---------DEPHAYKGMGSFPSPEELANLDESFLAKRCNLGYRASRIL 320 Query: 636 KLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFTCANVLVCMGFYHIIP 457 KLA+ +V+G IQL +LEE CK SL+SY+ LA++LR+I+GFGPFTCANVL+CMGFYH+IP Sbjct: 321 KLAKGIVQGIIQLMQLEEGCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIP 380 Query: 456 TDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAYWSELWHFYETRFGKLSEMP 277 DSET RHL+QVH++ S M+TV RDVE IY KYAPFQFLAYW+ELWH+YE RFGKLSEMP Sbjct: 381 ADSETIRHLKQVHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMP 440 Query: 276 QSDYKLITASNMRTKGTSKNKK 211 YKLITASNM+ K TSK K Sbjct: 441 FCGYKLITASNMKMKATSKRTK 462 >ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis] gi|223541451|gb|EEF43001.1| conserved hypothetical protein [Ricinus communis] Length = 458 Score = 422 bits (1085), Expect = e-115 Identities = 240/477 (50%), Positives = 291/477 (61%), Gaps = 16/477 (3%) Frame = -1 Query: 1581 EVALG-EAAATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXX 1405 EVA+G EAA TFDLEK VCSHGLFM+ PNHWDP ++T RP Sbjct: 11 EVAVGGEAADTFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHL 70 Query: 1404 XXXXHPPTHPNSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIHGKDE 1225 SL +R++G R LSP++Q++L Q++RMLRLS+ DE N REF+KI E Sbjct: 71 S---------KSLLVRVYGNRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFE 121 Query: 1224 RS-------FSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 F GRV RSPTLFEDMVKCILLCNCQW RTLSMA A Q Sbjct: 122 GEECPLIGDFGGRVLRSPTLFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQSPQQK 181 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAILKID-C 889 HF P TP KE KRK+ + KV TE++ + D C Sbjct: 182 HAF-------------NHFIPNTPVKKEPKRKIRLSKVP---------TESMDLEAADTC 219 Query: 888 IQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISI-------KSL 730 + T + + N L+ + D +N SC+ S Y + Q + K+ Sbjct: 220 LTTDDSQMKISNSLNC-VDDGSFDNLKSCQGSNTFYSTGPYATSDIQSHLVTQHCAKKTT 278 Query: 729 GNFPTPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYD 550 GNFP+PRELA++DE LAKRC LGYRA RI+KLAQ +VEGRI L E E+V SL++Y Sbjct: 279 GNFPSPRELANLDERFLAKRCGLGYRAGRIIKLAQGIVEGRIPLREFEQVSNGGSLSTYS 338 Query: 549 MLADKLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEI 370 L D+LREI GFGPFT ANVL+CMGFYH+IPTDSET RH +QVHA+ S +KTVQ + EEI Sbjct: 339 KLTDQLREIEGFGPFTRANVLMCMGFYHVIPTDSETVRHFKQVHAKNSTIKTVQSEAEEI 398 Query: 369 YGKYAPFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 Y K+APFQFL YW+ELWHFYE RFGKLSEMP S+YKLITASN+R KG K K+AK+S Sbjct: 399 YRKFAPFQFLVYWAELWHFYEQRFGKLSEMPCSNYKLITASNLRNKGHHKAKRAKIS 455 >ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508778583|gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 426 Score = 414 bits (1063), Expect = e-113 Identities = 245/502 (48%), Positives = 300/502 (59%), Gaps = 17/502 (3%) Frame = -1 Query: 1665 CNKSEEKETMDMEGVKENTRSGEDWCTV--EVALGEAAAT-----FDLEKAVCSHGLFMM 1507 C K+ KE +E + C+V E+ +GEAAA F+LEKAVCSHGLFMM Sbjct: 12 CQKTMVKE-------QEENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMM 64 Query: 1506 PPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQ 1327 PN WDP +++L RP PT ++LHLR++GTR LSPQ Sbjct: 65 APNQWDPISRSLSRPLRLLDHHSPPLTVQVRISQ------PT-ASTLHLRVYGTRCLSPQ 117 Query: 1326 YQQALQAQVIRMLRLSETDERNVREFQKI----HGKDE------RSFSGRVFRSPTLFED 1177 ++ +L QV RMLRLSE +E VREF+KI HG++E RSFSGRVFRSPTLFED Sbjct: 118 HRHSLLNQVSRMLRLSEEEESKVREFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFED 177 Query: 1176 MVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESEHFTPKT 997 MVKCILLCNCQ A + F PKT Sbjct: 178 MVKCILLCNCQ---------------------------------------AAEDDFIPKT 198 Query: 996 PAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSDIGDDLRE 817 PAG E KRKL V KVS L +F E A + SD+ Sbjct: 199 PAGNELKRKLRVSKVSMRLEGKFAEPRA-------------------DHSKSDL------ 233 Query: 816 NCNSCRTSKEVYRVDSCLPFENQISIKSLGNFPTPRELASVDENILAKRCKLGYRASRIL 637 + S+E+ + + K +G+FP+P ELA++DE+ LAKRC LGYRASRIL Sbjct: 234 -----QPSQEL---------DEPHAYKGMGSFPSPEELANLDESFLAKRCNLGYRASRIL 279 Query: 636 KLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFTCANVLVCMGFYHIIP 457 KLA+ +V+G IQL +LEE CK SL+SY+ LA++LR+I+GFGPFTCANVL+CMGFYH+IP Sbjct: 280 KLAKGIVQGIIQLMQLEEGCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIP 339 Query: 456 TDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAYWSELWHFYETRFGKLSEMP 277 DSET RHL+QVH++ S M+TV RDVE IY KYAPFQFLAYW+ELWH+YE RFGKLSEMP Sbjct: 340 ADSETIRHLKQVHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMP 399 Query: 276 QSDYKLITASNMRTKGTSKNKK 211 YKLITASNM+ K TSK K Sbjct: 400 FCGYKLITASNMKMKATSKRTK 421 >ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max] Length = 443 Score = 412 bits (1060), Expect = e-112 Identities = 231/455 (50%), Positives = 291/455 (63%), Gaps = 3/455 (0%) Frame = -1 Query: 1566 EAAATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHP 1387 E + F LE+AVCSHGLFMMPPNHWDP +KTL+RP Sbjct: 19 ELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSFLVSLSQ----------- 67 Query: 1386 PTHPNSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIHGKDE--RSFS 1213 H SL +R+ T LSPQ Q + AQV RMLR SE +E+ VREF+ +H D RSFS Sbjct: 68 --HSQSLAVRVHATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFS 125 Query: 1212 GRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXN 1033 GRVFRSPTLFEDMVKCILLCNCQWPRTLSMA+A Sbjct: 126 GRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSK---- 181 Query: 1032 PVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPN 853 ESE F PKTPA KE++R KVS + E L+ID + + + + Sbjct: 182 --GESEGFIPKTPASKETRRN----KVSTKGMFCKKKLELDGNLQIDHV--VASSSTATT 233 Query: 852 FLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKS-LGNFPTPRELASVDENILA 676 L++D GD S+E+ DSC F N S GNFP+P ELA++DE+ LA Sbjct: 234 LLTTDNGD-----------SEELRSHDSCHEFSNGNEYFSRTGNFPSPSELANLDESFLA 282 Query: 675 KRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFTCA 496 KRC LGYRA I++LA+++VEG+IQL +LEE+ K ASL++Y L D+L++I G+GPFT A Sbjct: 283 KRCGLGYRAGYIIELARAIVEGKIQLGQLEELSKDASLSNYKQLDDQLKQIRGYGPFTRA 342 Query: 495 NVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAYWSELWH 316 NVL+C+G+YH+IPTDSET RHL+QVH+R + KT++R++EEIYGKY P+QFLA+WSE+W Sbjct: 343 NVLMCLGYYHVIPTDSETVRHLKQVHSRYTTSKTIERELEEIYGKYEPYQFLAFWSEVWD 402 Query: 315 FYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKK 211 FYETRFGKL+EM SDYKLITA NMR+ T+K K+ Sbjct: 403 FYETRFGKLNEMHSSDYKLITACNMRST-TNKRKR 436 >ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris] gi|561020766|gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris] Length = 474 Score = 410 bits (1055), Expect = e-112 Identities = 226/472 (47%), Positives = 290/472 (61%), Gaps = 4/472 (0%) Frame = -1 Query: 1602 GEDWCTVEVALGEAAATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXX 1423 G W + L F L++AVCSHG FMM PNHWDP +KTL RP Sbjct: 29 GTAWFEFHMELPSETEPFQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSS 88 Query: 1422 XXXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQK 1243 P SL +R+ F+SPQ Q+ ++AQ+ RMLRLSE +E+ VREF+ Sbjct: 89 LLVSLSQ-------RPQSLAVRVHSVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRS 141 Query: 1242 IHGKDE--RSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXX 1069 +H D RSF GRVFRSPTLFEDMVKCILLCNCQWPRTLSMA+A Sbjct: 142 VHAADHPNRSFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCA 201 Query: 1068 XXXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDC 889 P E+E F PKTPA KE++RK K L + E E L+++ Sbjct: 202 VEGSGN------PKVEAEEFVPKTPASKENRRKKAPTK--GVLLKKKLELE----LEMEV 249 Query: 888 IQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVD-SCLPFENQIS-IKSLGNFPT 715 + ++ + + + DL EV R D SC F N+ GNFP+ Sbjct: 250 DGNLQMDHMFASSSDTTLLGDL-----------EVLRSDDSCCQFPNEGEYFDHTGNFPS 298 Query: 714 PRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLADK 535 P ELA++ E+ LAKRCKLGYRA IL+LAQ +VEG+IQLE+LEE+ K ASL+ Y L D+ Sbjct: 299 PIELANLSESFLAKRCKLGYRAGYILELAQGIVEGKIQLEQLEELSKDASLSCYKQLGDQ 358 Query: 534 LREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYA 355 L+ I GFGPFT ANVL+C+G+YH+IP DSET RHL+QVH++ ++ KT++RD+EEIYGKY Sbjct: 359 LKPIKGFGPFTRANVLMCLGYYHVIPWDSETVRHLKQVHSKNTSSKTIERDLEEIYGKYE 418 Query: 354 PFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAKVS 199 P+QFLA+WSE+W FYETRFGK++EM S+YK ITASNMR+ + NK+ + S Sbjct: 419 PYQFLAFWSEIWDFYETRFGKMNEMHSSEYKRITASNMRSTRKATNKRKRPS 470 >ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508778584|gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 421 Score = 367 bits (942), Expect = 9e-99 Identities = 222/461 (48%), Positives = 276/461 (59%), Gaps = 17/461 (3%) Frame = -1 Query: 1665 CNKSEEKETMDMEGVKENTRSGEDWCTV--EVALGEAAAT-----FDLEKAVCSHGLFMM 1507 C K+ KE +E + C+V E+ +GEAAA F+LEKAVCSHGLFMM Sbjct: 27 CQKTMVKE-------QEENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMM 79 Query: 1506 PPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQ 1327 PN WDP +++L RP PT ++LHLR++GTR LSPQ Sbjct: 80 APNQWDPISRSLSRPLRLLDHHSPPLTVQVRISQ------PT-ASTLHLRVYGTRCLSPQ 132 Query: 1326 YQQALQAQVIRMLRLSETDERNVREFQKI----HGKDE------RSFSGRVFRSPTLFED 1177 ++ +L QV RMLRLSE +E VREF+KI HG++E RSFSGRVFRSPTLFED Sbjct: 133 HRHSLLNQVSRMLRLSEEEESKVREFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFED 192 Query: 1176 MVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESEHFTPKT 997 MVKCILLCNCQ+ RTLSMA+A A + F PKT Sbjct: 193 MVKCILLCNCQFSRTLSMAKALCELQFETQRPFSGVR-------------AAEDDFIPKT 239 Query: 996 PAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSDIGDDLRE 817 PAG E KRKL V KVS L +F E A + SD+ Sbjct: 240 PAGNELKRKLRVSKVSMRLEGKFAEPRA-------------------DHSKSDL------ 274 Query: 816 NCNSCRTSKEVYRVDSCLPFENQISIKSLGNFPTPRELASVDENILAKRCKLGYRASRIL 637 + S+E+ + + K +G+FP+P ELA++DE+ LAKRC LGYRASRIL Sbjct: 275 -----QPSQEL---------DEPHAYKGMGSFPSPEELANLDESFLAKRCNLGYRASRIL 320 Query: 636 KLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFTCANVLVCMGFYHIIP 457 KLA+ +V+G IQL +LEE CK SL+SY+ LA++LR+I+GFGPFTCANVL+CMGFYH+IP Sbjct: 321 KLAKGIVQGIIQLMQLEEGCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIP 380 Query: 456 TDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAY 334 DSET RHL+QVH++ S M+TV RDVE IY KYAPFQFLAY Sbjct: 381 ADSETIRHLKQVHSKSSTMQTVGRDVEGIYAKYAPFQFLAY 421 >ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus sinensis] Length = 409 Score = 365 bits (938), Expect = 3e-98 Identities = 214/428 (50%), Positives = 262/428 (61%), Gaps = 19/428 (4%) Frame = -1 Query: 1560 AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPT 1381 A TF+LE AVCSHGLFMM PN WDP +++L RP P Sbjct: 13 AETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDVTICQ-PQQ 71 Query: 1380 HPNSLHLRIFGTRF-----LSPQYQQALQAQVIRMLRLSETDERNVREFQKI-------H 1237 P+SL + + + LS + Q AL AQV RMLRLSE DERNVREF++I Sbjct: 72 DPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIVRQVAQEE 131 Query: 1236 GKDER---SFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 G++ + FSGRVFRSPTLFEDMVKC+LLCNCQWPRTLSMARA Sbjct: 132 GEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWELQHCSPSI- 190 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAI----LK 898 SE F P+TPAGKESKR+ V KV++ LT+R E++A + LK Sbjct: 191 ---------------SEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235 Query: 897 IDCIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLGNFP 718 +DC + +N+ P+F +DI DL N T+ D +GNFP Sbjct: 236 LDCAGVLE-ENVQPSFPQNDIESDLH-GLNELSTTDPPSARDR------------IGNFP 281 Query: 717 TPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLAD 538 +PRELA++DE+ LAKRC LGYRA RILKLA+ +V+G+IQL ELE++C ASL +Y LA+ Sbjct: 282 SPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAE 341 Query: 537 KLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKY 358 +L +INGFGPFT NVLVC+GFYH+IPTDSET RHL+QVHAR KTVQ E IYGKY Sbjct: 342 QLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNCTSKTVQMIAESIYGKY 401 Query: 357 APFQFLAY 334 APFQFLAY Sbjct: 402 APFQFLAY 409 >ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508778585|gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 406 Score = 342 bits (878), Expect(2) = 1e-91 Identities = 210/447 (46%), Positives = 264/447 (59%), Gaps = 17/447 (3%) Frame = -1 Query: 1665 CNKSEEKETMDMEGVKENTRSGEDWCTV--EVALGEAAAT-----FDLEKAVCSHGLFMM 1507 C K+ KE +E + C+V E+ +GEAAA F+LEKAVCSHGLFMM Sbjct: 12 CQKTMVKE-------QEENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMM 64 Query: 1506 PPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTHPNSLHLRIFGTRFLSPQ 1327 PN WDP +++L RP PT ++LHLR++GTR LSPQ Sbjct: 65 APNQWDPISRSLSRPLRLLDHHSPPLTVQVRISQ------PT-ASTLHLRVYGTRCLSPQ 117 Query: 1326 YQQALQAQVIRMLRLSETDERNVREFQKI----HGKDE------RSFSGRVFRSPTLFED 1177 ++ +L QV RMLRLSE +E VREF+KI HG++E RSFSGRVFRSPTLFED Sbjct: 118 HRHSLLNQVSRMLRLSEEEESKVREFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFED 177 Query: 1176 MVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESEHFTPKT 997 MVKCILLCNCQ+ RTLSMA+A A + F PKT Sbjct: 178 MVKCILLCNCQFSRTLSMAKALCELQFETQRPFSGVR-------------AAEDDFIPKT 224 Query: 996 PAGKESKRKLGVQKVSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSDIGDDLRE 817 PAG E KRKL V KVS L +F E A + SD+ Sbjct: 225 PAGNELKRKLRVSKVSMRLEGKFAEPRA-------------------DHSKSDL------ 259 Query: 816 NCNSCRTSKEVYRVDSCLPFENQISIKSLGNFPTPRELASVDENILAKRCKLGYRASRIL 637 + S+E+ + + K +G+FP+P ELA++DE+ LAKRC LGYRASRIL Sbjct: 260 -----QPSQEL---------DEPHAYKGMGSFPSPEELANLDESFLAKRCNLGYRASRIL 305 Query: 636 KLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPFTCANVLVCMGFYHIIP 457 KLA+ +V+G IQL +LEE CK SL+SY+ LA++LR+I+GFGPFTCANVL+CMGFYH+IP Sbjct: 306 KLAKGIVQGIIQLMQLEEGCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIP 365 Query: 456 TDSETTRHLEQVHARESNMKTVQRDVE 376 DSET RHL+QVH++ S M+TV RDVE Sbjct: 366 ADSETIRHLKQVHSKSSTMQTVGRDVE 392 Score = 23.5 bits (49), Expect(2) = 1e-91 Identities = 8/14 (57%), Positives = 12/14 (85%) Frame = -2 Query: 332 GQNCGTSTRQGLGS 291 GQ+CGT+ +GLG+ Sbjct: 393 GQSCGTTMSKGLGN 406 >gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group] Length = 463 Score = 335 bits (860), Expect = 3e-89 Identities = 197/467 (42%), Positives = 269/467 (57%), Gaps = 18/467 (3%) Frame = -1 Query: 1584 VEVALGEA-----AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXX 1420 +E+ LG A AA FDLE AVCSHGLFMM PN WDP ++ L+RP Sbjct: 21 LELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVR 80 Query: 1419 XXXXXXXXXHPPTHPNSLHLRIFGTR--FLSPQYQQALQAQVIRMLRLSETDERNVREFQ 1246 P ++L + + G LSP Q ++ QV RMLRL E D R EFQ Sbjct: 81 VSRH------PARPSDALLVSVLGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQ 134 Query: 1245 KIHGKDERSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 +H + GR+FRSPTLFEDMVKCILLCNCQW RTLSM+ A Sbjct: 135 AMHAVAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMSTALCELQLELRSS----- 189 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQK-VSANLTNRFTETEAVAILKIDC 889 + +E+F +TP +E KRK ++ V L +F E + V C Sbjct: 190 -------------SSTENFQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKLV------C 230 Query: 888 IQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLG-NFPTP 712 ++ N + N + + +L + + EV S L N+ ++ G +FPTP Sbjct: 231 LEDPNLATDTANLQTYENSFNLPSAASGTGNTSEVSLDHSELKLRNEPCLEDCGGDFPTP 290 Query: 711 RELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVA---------SLA 559 ELA++DE+ LAKRC LGYRA RI+ LA+S+VEG+I L++LEE+ K++ + + Sbjct: 291 EELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPS 350 Query: 558 SYDMLADKLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDV 379 +YD L ++L I+GFGPFT ANVL+CMGF+H+IP D+ET RHL+Q H R S + +VQ+++ Sbjct: 351 TYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKEL 410 Query: 378 EEIYGKYAPFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMR 238 + IYGKYAPFQFLAYW ELW FY +FGK+S+M +Y+L TAS ++ Sbjct: 411 DNIYGKYAPFQFLAYWCELWGFYNKQFGKISDMEPINYRLFTASKLK 457 >ref|XP_006470788.1| PREDICTED: uncharacterized protein LOC102629917 isoform X3 [Citrus sinensis] Length = 382 Score = 323 bits (827), Expect = 2e-85 Identities = 192/398 (48%), Positives = 240/398 (60%), Gaps = 19/398 (4%) Frame = -1 Query: 1560 AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPT 1381 A TF+LE AVCSHGLFMM PN WDP +++L RP P Sbjct: 13 AETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDVTICQ-PQQ 71 Query: 1380 HPNSLHLRIFGTRF-----LSPQYQQALQAQVIRMLRLSETDERNVREFQKI-------H 1237 P+SL + + + LS + Q AL AQV RMLRLSE DERNVREF++I Sbjct: 72 DPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIVRQVAQEE 131 Query: 1236 GKDER---SFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXX 1066 G++ + FSGRVFRSPTLFEDMVKC+LLCNCQWPRTLSMARA Sbjct: 132 GEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWELQHCSPSI- 190 Query: 1065 XXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQKVSANLTNRFTETEAVAI----LK 898 SE F P+TPAGKESKR+ V KV++ LT+R E++A + LK Sbjct: 191 ---------------SEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235 Query: 897 IDCIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLGNFP 718 +DC + +N+ P+F +DI DL N T+ D +GNFP Sbjct: 236 LDCAGVLE-ENVQPSFPQNDIESDLH-GLNELSTTDPPSARDR------------IGNFP 281 Query: 717 TPRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLAD 538 +PRELA++DE+ LAKRC LGYRA RILKLA+ +V+G+IQL ELE++C ASL +Y LA+ Sbjct: 282 SPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAE 341 Query: 537 KLREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQ 424 +L +INGFGPFT NVLVC+GFYH+IPTDSET RHL+Q Sbjct: 342 QLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQ 379 >ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica] Length = 461 Score = 317 bits (813), Expect = 8e-84 Identities = 194/456 (42%), Positives = 255/456 (55%), Gaps = 15/456 (3%) Frame = -1 Query: 1560 AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPT 1381 AA FDL AVCSHGLFMM PN WDP + L+RP P Sbjct: 33 AAPFDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAH-------PA 85 Query: 1380 HPNS-LHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIHGKDERSFSGRV 1204 P + L + + G LS + + QV RMLRLSE D V EFQ +H GR+ Sbjct: 86 RPGTALLVAVEGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRI 145 Query: 1203 FRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVA 1024 FRSPTLFEDMVKCILLCNCQW RTLSMA A + Sbjct: 146 FRSPTLFEDMVKCILLCNCQWTRTLSMATALCEIQLELKCS------------------S 187 Query: 1023 ESEHFTPKTPAGKESKRKLGV-QKVSANLTNRFTETEAVAILKIDCIQTINCKNLS---P 856 E F +TP +E KRK Q V L RF E + L+ I + +L+ Sbjct: 188 SVEDFQSRTPPIRERKRKRSKRQSVRIKLETRFAEDK----LEGPTIASGTSNDLTHPET 243 Query: 855 NFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKS-LGNFPTPRELASVDENIL 679 N S + E ++C + + +S L N ++ +G+FPTP ELA++DE L Sbjct: 244 NEYLSSLASVASETGSACDSLPSLD--NSELSLNNAPGLEDCIGDFPTPEELANLDEGFL 301 Query: 678 AKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDM---------LADKLRE 526 AKRC LGYRA RI+ LA+ VVEG++ L++LEE+C+++ A+ ++ L +L Sbjct: 302 AKRCNLGYRAKRIVMLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSA 361 Query: 525 INGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQ 346 I+GFGPFT ANVL+CMGF H IP D+ET RHL+QVH R S + +V +++++IYGKYAPFQ Sbjct: 362 ISGFGPFTRANVLMCMGFNHTIPADTETIRHLKQVHKRASTISSVHQELDKIYGKYAPFQ 421 Query: 345 FLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMR 238 FLAYW ELW FY +FGK+ EM S+Y+L TAS+++ Sbjct: 422 FLAYWFELWGFYNKQFGKICEMEPSNYRLFTASHLK 457 >gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group] Length = 442 Score = 317 bits (813), Expect = 8e-84 Identities = 191/459 (41%), Positives = 260/459 (56%), Gaps = 10/459 (2%) Frame = -1 Query: 1584 VEVALGEA-----AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXX 1420 +E+ LG A AA FDLE AVCSHGLFMM PN WDP ++ L+RP Sbjct: 21 LELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVR 80 Query: 1419 XXXXXXXXXHPPTHPNSLHLRIFGTR---FLSPQYQQALQAQVIRMLRLSETDERNVREF 1249 P ++L + + G LSP Q ++ QV RMLRL E D R V EF Sbjct: 81 VSRH------PARPSDALLVSVLGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEF 134 Query: 1248 QKIHGKDERSFSGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXX 1069 Q +H GR+FRSPTLFEDM+KCILLCNCQW RTLSM+ A Sbjct: 135 QAMHAVAREVGFGRIFRSPTLFEDMIKCILLCNCQWTRTLSMSTALCELQLELRSS---- 190 Query: 1068 XXXXXXXXXXXNPVAESEHFTPKTPAGKESKRKLGVQK-VSANLTNRFTETEAVAILKID 892 + +E+F +TP +E KRK ++ V L +F E + V Sbjct: 191 --------------SSTENFQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKMV------ 230 Query: 891 CIQTINCKNLSPNFLSSDIGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLG-NFPT 715 C++ NL+ N + ++ L + N + EV S L ++ ++ G +FPT Sbjct: 231 CLED---PNLATNTANENLFS-LPSSANETGNTSEVSLDHSELKLRYELCLEDCGGDFPT 286 Query: 714 PRELASVDENILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLADK 535 P ELA++DE+ LAKRC LGYRA RI+ LA+S+VEG+I L++LEE+ K+ L ++ Sbjct: 287 PEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKI--------LIEE 338 Query: 534 LREINGFGPFTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYA 355 L I+G PF NVL+CMGF+H+IP D+ET RHL+Q H R S + +VQ++++ IYGKYA Sbjct: 339 LSTISGIWPFHSCNVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYA 398 Query: 354 PFQFLAYWSELWHFYETRFGKLSEMPQSDYKLITASNMR 238 PFQFLAYW ELW FY +FG +S+M +Y+L TAS ++ Sbjct: 399 PFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 437 >ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] gi|548856677|gb|ERN14505.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] Length = 458 Score = 310 bits (795), Expect = 1e-81 Identities = 196/461 (42%), Positives = 259/461 (56%), Gaps = 11/461 (2%) Frame = -1 Query: 1554 TFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXXXXXXXXXXXHPPTHP 1375 +F+LEKAVCSHG FMM PN W ++TL RP + Sbjct: 17 SFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQLSL--------SSQ 68 Query: 1374 NSLHLRIFGTRFLSPQYQQALQAQVIRMLRLSETDERNVREFQKIHGKDERSFSGRVFRS 1195 SL + + G L QQ L AQV RMLR+SE D+ V +F +++ + + GRVFRS Sbjct: 69 KSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFGRVFRS 128 Query: 1194 PTLFEDMVKCILLCNCQWPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESE 1015 PTLFEDMVK ILLCNCQW RTLSMARA ++S Sbjct: 129 PTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDF-------SKSV 181 Query: 1014 HFTPKTPAGKESK--RKLGVQKVSANLTNRFTETEA-----VAILKIDCIQTINCKNLSP 856 + +P TP E K RK Q + NL +F+E E ++ ID + + KN SP Sbjct: 182 NLSPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFS-KN-SP 239 Query: 855 NFLSSDIGDDLRENCNSCRTSK--EVYRVDSCLPFENQISI-KSLGNFPTPRELASVDEN 685 SS+ G + + N + K + +D+ L +S GNFP P ELA++DE Sbjct: 240 TMFSSEEGRNGKLNYDQVSEEKLGDGAILDNQLLENKTLSFFLEAGNFPCPEELANLDEK 299 Query: 684 ILAKRCKLGYRASRILKLAQSVVEGRIQLEELEEVCKVASLASYDMLADKLREINGFGPF 505 IL KRCK+G+R+ RI+KLAQS+VEG + L ++E + + + D L +L I G GP+ Sbjct: 300 ILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPI-HLDGLMRQLLSIYGVGPY 358 Query: 504 TCANVLVCMGFYHIIPTDSETTRHLEQVHARES-NMKTVQRDVEEIYGKYAPFQFLAYWS 328 C NVL+ MG Y IP D+ET RHL+Q HAR+ + T+Q+D+EEIYGK+ PFQFL YWS Sbjct: 359 VCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFLVYWS 418 Query: 327 ELWHFYETRFGKLSEMPQSDYKLITASNMRTKGTSKNKKAK 205 E+W FYE RFGKLS+MP SDY+LITA NM+ K K+ K Sbjct: 419 EMWEFYEKRFGKLSQMPPSDYELITAHNMK-NNIPKRKRYK 458 >dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group] gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza sativa Japonica Group] Length = 501 Score = 309 bits (792), Expect = 2e-81 Identities = 197/510 (38%), Positives = 270/510 (52%), Gaps = 61/510 (11%) Frame = -1 Query: 1584 VEVALGEA-----AATFDLEKAVCSHGLFMMPPNHWDPHTKTLLRPXXXXXXXXXXXXXX 1420 +E+ LG A AA FDLE AVCSHGLFMM PN WDP ++ L+RP Sbjct: 21 LELPLGGAPPYPGAAPFDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVR 80 Query: 1419 XXXXXXXXXHPPTHPNSLHLRIFGTR---FLSPQYQQALQAQVIRMLRLSETDERNVREF 1249 P ++L + + G LSP Q ++ QV RMLRL E D R V EF Sbjct: 81 VSRH------PARPSDALLVSVLGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEF 134 Query: 1248 QKIHGKDERSFSGRVFRSPTLFEDMVKCILLCNCQ------------------------- 1144 Q +H GR+FRSPTLFEDM+KCILLCNCQ Sbjct: 135 QAMHAVAREVGFGRIFRSPTLFEDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRY 194 Query: 1143 -----------------WPRTLSMARAXXXXXXXXXXQXXXXXXXXXXXXXXXNPVAESE 1015 W RTLSM+ A + +E Sbjct: 195 LGIAIFHLHSTVLFNCRWTRTLSMSTALCELQLELRSS------------------SSTE 236 Query: 1014 HFTPKTPAGKESKRKLGVQK-VSANLTNRFTETEAVAILKIDCIQTINCKNLSPNFLSSD 838 +F +TP +E KRK ++ V L +F E + V C++ NL+ N + + Sbjct: 237 NFQSRTPPIRECKRKRSNKRNVRVKLETKFNEDKMV------CLED---PNLATNTANEN 287 Query: 837 IGDDLRENCNSCRTSKEVYRVDSCLPFENQISIKSLG-NFPTPRELASVDENILAKRCKL 661 + L + N + EV S L ++ ++ G +FPTP ELA++DE+ LAKRC L Sbjct: 288 LFS-LPSSANETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNL 346 Query: 660 GYRASRILKLAQSVVEGRIQLEELEEVCKVA---------SLASYDMLADKLREINGFGP 508 GYRA RI+ LA+S+VEG+I L++LEE+ K++ + ++YD L ++L I+GFGP Sbjct: 347 GYRARRIVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGP 406 Query: 507 FTCANVLVCMGFYHIIPTDSETTRHLEQVHARESNMKTVQRDVEEIYGKYAPFQFLAYWS 328 FT ANVL+CMGF+H+IP D+ET RHL+Q H R S + +VQ++++ IYGKYAPFQFLAYW Sbjct: 407 FTRANVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFLAYWC 466 Query: 327 ELWHFYETRFGKLSEMPQSDYKLITASNMR 238 ELW FY +FG +S+M +Y+L TAS ++ Sbjct: 467 ELWGFYNKQFGIISDMEPINYRLFTASKLK 496