BLASTX nr result
ID: Akebia25_contig00021889
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00021889 (1745 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   395   e-107
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   384   e-104
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   381   e-103
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     374   e-101
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   365   5e-98
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   361   7e-97
ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   360   1e-96
ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma...   354   8e-95
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   351   7e-94
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   347   1e-92
ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phas...   346   2e-92
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   343   1e-91
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   338   3e-90
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   330   1e-87
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   326   2e-86
ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma...   322   3e-85
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   315   5e-83
ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma...   308   4e-81
ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma...   289   3e-75
gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]    287   9e-75

>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum lycopersicum] Length = 483 Score = 395 bits (1014), Expect = e-107 Identities = 234/467 (50%), Positives = 286/467 (61%), Gaps = 16/467 (3%) Frame = -1 Query: 1679 LKLELGDSY-SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503 L LE G+ Y +SFDLEKAVCSHGLFMMAPN WD +KTL+RP Sbjct: 17 LPLEDGNGYCASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDDDHEQSVLV 76 Query: 1502 XXXXXXXXXXXXXXSPLD--------QQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKN 1347 LD Q+ LLGQV RM+RLS + +K F +I EAK Sbjct: 77 QITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQEICGEAKE 136 Query: 1346 RGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQD 1167 RGFGRVFRSPTLFEDMVKCMLLCNCQW RTL+MA ALCELQL L S + + D Sbjct: 137 RGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPS-----SAASFPD 191 Query: 1166 P---NCLKPNT---EGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQ 1005 P N LK T E F P TP G+EL+++ NL + +E E ++ + Sbjct: 192 PDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPGV-- 249 Query: 1004 QTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSC- 828 +P+F + E K N CQ + +V + + SE R SS Sbjct: 250 ---------TVTPAFSVGEEVLQ---KSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFN 297 Query: 827 RIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPS 648 ++G+FPSPK+LASLD FLAKRC LGYRA RII+LA+ I EG Q+ +LEE C+ S Sbjct: 298 QLGNFPSPKQLASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEA-CSNPSLS 356 Query: 647 LYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDV 468 YDK+A+QL EIDGFGPFTCANVLMC+G+Y VIP DSET+RHLK++H +ST + VQRDV Sbjct: 357 NYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQNVQRDV 416 Query: 467 EKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 E +YGKYA F+FLAYWSE+W+FYE+ FGK SEMP Y LITA+NMR Sbjct: 417 ENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMR 463
>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] gi|550342350|gb|EEE79091.2| hypothetical
protein POPTR_0003s03710g [Populus trichocarpa] Length = 489 Score = 384 bits (985), Expect = e-104 Identities = 228/492 (46%), Positives = 284/492 (57%), Gaps = 24/492 (4%) Frame = -1 Query: 1730 GRMDEEHHNPXXXXSCLLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXX 1551 G+ +EE + + ++ LGD+ +F+LEKAVCSHGLFMM+PN WDP + T RP Sbjct: 8 GKEEEEEES------VVFEIPLGDAAETFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLR 61 Query: 1550 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS-----------PLDQQFLLGQVARMLRLS 1404 P Q+ L+ QV RMLRLS Sbjct: 62 LSLSDSDPQVSTPTTSLFVSISHPPHLPRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLS 121 Query: 1403 ESDEMCIKEFHKIHPEAKNR-------GFG-RVFRSPTLFEDMVKCMLLCNCQWPRTLTM 1248 E+DE +EF KI A GFG RVFRSPTLFEDMVKC+LLCNCQWPRTL+M Sbjct: 122 ETDERNAREFRKIAEAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSM 181 Query: 1247 ARALCELQLNLK-SDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPA 1071 ARALCELQ L+ S ++ V + N F+P T G+E KR K+ Sbjct: 182 ARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRASKVTK 241 Query: 1070 NLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNN 891 NL K E ET LEA+ + + + +E L S E D SC + + Sbjct: 242 NLASKIVETETLLEADANL--KTDSAHIGRET-----LESVEND-------SCARCSSRH 287 Query: 890 KVDACS----MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIEL 723 D+ + S + G C +FPSP+ELA+LD FLAKRC LGYRA RII+L Sbjct: 288 GSDSWAPDSLQSQHGIQPGVNKMIC---NFPSPRELANLDESFLAKRCNLGYRAIRIIKL 344 Query: 722 ARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPI 543 A+SI EGR + ++EE N S Y+KLA Q +IDGFGPFTCANVLMCMGFY +IP Sbjct: 345 AQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCMGFYHIIPT 404 Query: 542 DSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPP 363 DSET+RHLK++H ST +TVQRDVE++YGKYA F+FLAYW+ELW+FYEK FGK SE+P Sbjct: 405 DSETVRHLKQVHAKKSTIQTVQRDVEEIYGKYAPFQFLAYWAELWHFYEKRFGKLSEIPT 464 Query: 362 PNYHLITASNMR 327 +Y LITASNMR Sbjct: 465 SDYKLITASNMR 476 >ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED: uncharacterized protein 
LOC102593287 isoform X2 [Solanum tuberosum] Length = 485 Score = 381 bits (978), Expect = e-103 Identities = 228/472 (48%), Positives = 286/472 (60%), Gaps = 20/472 (4%) Frame = -1 Query: 1682 LLKLELGDS-----YSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXX 1518 +++L LGD ++FDLEKAVCSHGLFMMAPN WD +KTL+RP Sbjct: 14 VVELPLGDGDGDGGCATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHE 73 Query: 1517 XXXXXXXXXXXXXXXXXXXS--------PLDQQFLLGQVARMLRLSESDEMCIKEFHKIH 1362 + Q+ LLGQV RM+RLS + +K+F +I Sbjct: 74 QSVLVQINQPSDSPHSLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEIC 133 Query: 1361 PEAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTE 1182 EAK+RG GRVFRSPTLFEDMVKCMLLCNCQW RTL+MA ALCELQL L S + Sbjct: 134 GEAKDRGLGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPS-----SA 188 Query: 1181 VASQDP---NCLKPNT---EGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAET 1020 + DP N LK T E F P TP G+E +++ L + +E E ++ Sbjct: 189 ASFPDPDNQNQLKGVTFKSEHFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIID--- 245 Query: 1019 TNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRT 840 + K + + S E+ K K N C+ + V + + SE R Sbjct: 246 ----------IGKPGVTVTPAFSVGEEVLK-KSNLCRDTTEVCDVGTSAPFNLDPSEDRK 294 Query: 839 DSSC-RIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCN 663 SS ++G+FPSPKELASLD FLAKRC LGYRA RII+LA+ I EG Q+++LEE C+ Sbjct: 295 LSSFNQLGNFPSPKELASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLKELEEA-CS 353 Query: 662 REIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRT 483 S YDK+A+QL EIDGFGPFTCANVLMC+G+Y VIP DSET+RHLK++H +ST + Sbjct: 354 NPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQN 413 Query: 482 VQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 VQRDVE +YGKYA F+FLAYWSE+W+FYE+ FGK SEMP Y LITA+NMR Sbjct: 414 VQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMR 465 >gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis] Length = 472 Score = 374 bits (960), Expect = e-101 Identities = 220/469 (46%), Positives = 282/469 (60%), Gaps = 18/469 (3%) Frame = -1 Query: 1679 
LKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXX 1500 L+L LGD+ ++F LE AVCSHGLFMMAPN WDP +KTL RP Sbjct: 5 LELPLGDAAATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDS 64 Query: 1499 XXXXXXXXXXXXXSPL-------------DQQFLLGQVARMLRLSESDEMCIKEFHKIHP 1359 ++Q LL QV+RMLRLS+++E +EF +++ Sbjct: 65 VMARISQPHDRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVY- 123 Query: 1358 EAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEV 1179 G GRVFRSPTLFEDMVKC+LLCNCQWPRTL+MA+ALC+LQ L+ S Sbjct: 124 -GCGSGLGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQS-------- 174 Query: 1178 ASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF-SENETKLEAETTNCH-- 1008 + T F+P TP G+E KRK K L +F +++ LE+ + + Sbjct: 175 -------VPSKTVDFVPKTPAGKEPKRKVEKLKASTCLTSQFDAQSNEGLESHSNDLSID 227 Query: 1007 --QQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDS 834 Q T S + SPS L+S ++ +C+ ++ VD+ S+ + + R Sbjct: 228 ISQPTP---SAQNLSPSSLLSVPMENV-----TCE---ESYGVDSASLCNPQILRDREFE 276 Query: 833 SCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREI 654 GDFP+P ELA LD FLAKRCKLGYRA RI++LAR I EGR Q+ +LEE R + Sbjct: 277 GT--GDFPTPTELAKLDEKFLAKRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSL 334 Query: 653 PSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQR 474 S Y KLA QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHL+++HG +ST RT++R Sbjct: 335 CS-YSKLAVQLRQIDGFGPFTCANVLMCMGFYHVIPSDSETIRHLQQVHGRNSTVRTIER 393 Query: 473 DVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 DV+++Y KY F+FLAYWSELW+FYEK FGK SEMP Y L TASNM+ Sbjct: 394 DVQQIYAKYEPFQFLAYWSELWHFYEKKFGKISEMPCSAYKLFTASNMK 442 >ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus sinensis] Length = 454 Score = 365 bits (936), Expect = 5e-98 Identities = 217/475 (45%), Positives = 278/475 (58%), Gaps = 24/475 (5%) Frame = -1 Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503 LLKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP Sbjct: 7 LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63 Query: 1502 
XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365 + Q LL QV RMLRLSE+DE ++EF +I Sbjct: 64 VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123 Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215 + A+ G GRVFRSPTLFEDMVKCMLLCNCQWPRTL+MARALCELQ L Sbjct: 124 VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183 Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035 + +C +E F+P TP G+E KR++ + K+ + L + +E++ Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227 Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855 E + N L +E PSF + E D G LN+ + D S D Sbjct: 228 SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275 Query: 854 SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675 RIG+FPSP+ELA+LD FLAKRC LGYRA RI++LAR I +G+ Q+ +LE+ Sbjct: 276 ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELED 326 Query: 674 LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495 + CN + Y KLA+QL +I+GFGPFT NVL+C+GFY VIP DSET+RHLK++H + Sbjct: 327 M-CNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385 Query: 494 TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNM 330 T++TVQ E +YGKYA F+FLAYWSELW+FYEK FGK SEMP +Y LITASNM Sbjct: 386 TSKTVQMIAESIYGKYAPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440 >ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] gi|557533482|gb|ESR44600.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] Length = 454 Score = 361 bits (926), Expect = 7e-97 Identities = 214/475 (45%), Positives = 278/475 (58%), Gaps = 24/475 (5%) Frame = -1 Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503 +LKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP Sbjct: 7 VLKLPLAET---FNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63 Query: 1502 XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365 + Q LL QV RMLRLSE+DE +++F +I Sbjct: 64 
VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRI 123 Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215 + A+ G GRVFRSPTLFEDMVKCMLLCNCQWPRTL MARALCELQ L Sbjct: 124 VRQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQWEL 183 Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035 + +C +E F+P TP G+E KR++ + K+ + L + +E++ Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227 Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855 E + N T L +E PSF + E D G LN+ + D S D Sbjct: 228 SE-DDMNLKLDCTGAL-EENVQPSFPRNDIESDLHG-------LNELSTTDPPSACD--- 275 Query: 854 SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675 RIG+FPSP+ELA+LD FLAKRC LGYRA RI++LA+ I +G+ Q+ +LE+ Sbjct: 276 ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELED 326 Query: 674 LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495 CN + Y+KLA+QL +I+GFGPFT NVL+C+GFY VIP DSET+RHLK++H + Sbjct: 327 T-CNEASLTTYNKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385 Query: 494 TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNM 330 T++TVQ E +YGKY+ F+FLAYWSELW+FYEK FGK SEMP +Y LITASNM Sbjct: 386 TSKTVQIIAESIYGKYSPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440 >ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] gi|548856677|gb|ERN14505.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] Length = 458 Score = 360 bits (924), Expect = 1e-96 Identities = 208/447 (46%), Positives = 274/447 (61%), Gaps = 6/447 (1%) Frame = -1 Query: 1649 SFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1470 SF+LEKAVCSHG FMMAPNLW S++TLQRP Sbjct: 17 SFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQLSLSSQKSLQILVL 76 Query: 1469 XXXSPL--DQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDMV 1296 DQQ+LL QVARMLR+SE D++ + +FH+++P AK GFGRVFRSPTLFEDMV Sbjct: 77 GASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFGRVFRSPTLFEDMV 136 Query: 1295 
KCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPI 1116 K +LLCNCQW RTL+MARALCELQL L +S + +++D + K + P+TP+ Sbjct: 137 KSILLCNCQWTRTLSMARALCELQLELNGNSLRQ-----SNKDTDFSK--SVNLSPVTPM 189 Query: 1115 GRELK--RKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEE 942 E K RK + I NL KFSENET L A+ + SK P+ + + E Sbjct: 190 QLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPT----MFSSE 245 Query: 941 DDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDS-SCRIGDFPSPKELASLDVDFLAK 765 + GK N Q+ K+ ++ D L E +T S G+FP P+ELA+LD L K Sbjct: 246 EGRNGKLNYDQV--SEEKLGDGAILDNQLLENKTLSFFLEAGNFPCPEELANLDEKILEK 303 Query: 764 RCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCA 585 RCK+G+R+ RI++LA+SI EG + ++E L +++ P D L +QL+ I G GP+ C Sbjct: 304 RCKVGFRSKRIVKLAQSIVEGALDLGKIEVL--SQQDPIHLDGLMRQLLSIYGVGPYVCN 361 Query: 584 NVLMCMGFYQVIPIDSETLRHLKKIHGISS-TNRTVQRDVEKVYGKYAQFKFLAYWSELW 408 NVLM MG YQ IP D+ETLRHLK+ H T T+Q+D+E++YGK+ F+FL YWSE+W Sbjct: 362 NVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFLVYWSEMW 421 Query: 407 NFYEKSFGKASEMPPPNYHLITASNMR 327 FYEK FGK S+MPP +Y LITA NM+ Sbjct: 422 EFYEKRFGKLSQMPPSDYELITAHNMK 448 >ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508778582|gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 354 bits (908), Expect = 8e-95 Identities = 220/482 (45%), Positives = 273/482 (56%), Gaps = 18/482 (3%) Frame = -1 Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554 EE+ N S L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP Sbjct: 36 EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 95 Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383 L Q LL QV+RMLRLSE +E + Sbjct: 96 RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 155 Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233 +EF KI H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC Sbjct: 156 REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 215 Query: 1232 
ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053 ELQ + + G A D F+P TP G ELKRK + K+ L+ KF Sbjct: 216 ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 262 Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873 AE H ++ S+E P KG Sbjct: 263 --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 288 Query: 872 MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693 +G FPSP+ELA+LD FLAKRC LGYRA+RI++LA+ I +G Q Sbjct: 289 ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 332 Query: 692 IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513 + QLEE C S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+ Sbjct: 333 LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 391 Query: 512 IHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASN 333 +H SST +TV RDVE +Y KYA F+FLAYW+ELW++YE+ FGK SEMP Y LITASN Sbjct: 392 VHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMPFCGYKLITASN 451 Query: 332 MR 327 M+ Sbjct: 452 MK 453 >gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group] Length = 463 Score = 351 bits (900), Expect = 7e-94 Identities = 206/461 (44%), Positives = 265/461 (57%), Gaps = 21/461 (4%) Frame = -1 Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467 FDLE AVCSHGLFMMAPN WDP+++ L RP Sbjct: 37 FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 1466 XXSP------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFE 1305 +P DQ +L QV RMLRL E D EF +H A+ GFGR+FRSPTLFE Sbjct: 97 LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156 Query: 1304 DMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPI 1125 DMVKC+LLCNCQW RTL+M+ ALCELQL L+S S +TE F Sbjct: 157 DMVKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQSR 198 Query: 1124 TPIGRELKRKRSMKK-IPANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPSP 966 TP RE KRKRS K+ + L+ KF+E+ + L +T N F PS Sbjct: 199 TPPIRECKRKRSNKRNVRVKLETKFNEDKLVCLEDPNLATDTANLQTYENSF---NLPSA 255 Query: 965 
SFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASL 786 + G N+ ++ D++++ + +E E C GDFP+P+ELA+L Sbjct: 256 A----------SGTGNTSEVSLDHSEL---KLRNEPCLE-----DCG-GDFPTPEELANL 296 Query: 785 DVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEEL--------DCNREIPSLYDKLA 630 D DFLAKRC LGYRA RI+ LARSI EG+ +++LEE+ + PS YD+L Sbjct: 297 DEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLN 356 Query: 629 KQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGK 450 ++L I GFGPFT ANVLMCMGF+ +IP D+ET+RHLK+ H +ST +VQ++++ +YGK Sbjct: 357 EELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGK 416 Query: 449 YAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 YA F+FLAYW ELW FY K FGK S+M P NY L TAS ++ Sbjct: 417 YAPFQFLAYWCELWGFYNKQFGKISDMEPINYRLFTASKLK 457 >ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max] Length = 443 Score = 347 bits (890), Expect = 1e-92 Identities = 199/446 (44%), Positives = 267/446 (59%), Gaps = 4/446 (0%) Frame = -1 Query: 1652 SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1473 S F LE+AVCSHGLFMM PN WDP +KTL RP Sbjct: 22 SPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSFLVSLSQHSQSLAVRVHATHA 81 Query: 1472 XXXXSPLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGF-GRVFRSPTLFEDM 1299 P Q + QV+RMLR SE++E ++EF +H + NR F GRVFRSPTLFEDM Sbjct: 82 LS---PQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFEDM 138 Query: 1298 VKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITP 1119 VKC+LLCNCQWPRTL+MA+ALCELQL L++ S + S K +EGF+P TP Sbjct: 139 VKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNS------KGESEGFIPKTP 192 Query: 1118 IGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEED 939 +E +R + K F + + +L+ H + + + L++ + Sbjct: 193 ASKETRRNKVSTK------GMFCKKKLELDGNLQIDH------VVASSSTATTLLTTDNG 240 Query: 938 DSKGKRN--SCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAK 765 DS+ R+ SC ++ N+ + R G+FPSP ELA+LD FLAK Sbjct: 241 DSEELRSHDSCHEFSNGNEYFS-----------------RTGNFPSPSELANLDESFLAK 283 Query: 764 
RCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCA 585 RC LGYRA IIELAR+I EG+ Q+ QLEEL + + S Y +L QL +I G+GPFT A Sbjct: 284 RCGLGYRAGYIIELARAIVEGKIQLGQLEELSKDASL-SNYKQLDDQLKQIRGYGPFTRA 342 Query: 584 NVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWN 405 NVLMC+G+Y VIP DSET+RHLK++H +T++T++R++E++YGKY ++FLA+WSE+W+ Sbjct: 343 NVLMCLGYYHVIPTDSETVRHLKQVHSRYTTSKTIERELEEIYGKYEPYQFLAFWSEVWD 402 Query: 404 FYEKSFGKASEMPPPNYHLITASNMR 327 FYE FGK +EM +Y LITA NMR Sbjct: 403 FYETRFGKLNEMHSSDYKLITACNMR 428 >ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris] gi|561020766|gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris] Length = 474 Score = 346 bits (888), Expect = 2e-92 Identities = 200/454 (44%), Positives = 260/454 (57%), Gaps = 5/454 (1%) Frame = -1 Query: 1673 LELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXX 1494 +EL F L++AVCSHG FMMAPN WDP +KTL RP Sbjct: 37 MELPSETEPFQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQR 96 Query: 1493 XXXXXXXXXXXS---PLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGFG-RV 1329 P Q+ + Q+ RMLRLSE++E ++EF +H + NR FG RV Sbjct: 97 PQSLAVRVHSVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRV 156 Query: 1328 FRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKP 1149 FRSPTLFEDMVKC+LLCNCQWPRTL+MA+ALCELQ L++ G A + K Sbjct: 157 FRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQSGLQN------GLPCAVEGSGNPKV 210 Query: 1148 NTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPS 969 E F+P TP +E +RK K P + E +LE E Q F S S Sbjct: 211 EAEEFVPKTPASKENRRK----KAPTKGVLLKKKLELELEMEVDGNLQMDHMFAS----S 262 Query: 968 PSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELAS 789 + + + + + CQ N+ D G+FPSP ELA+ Sbjct: 263 SDTTLLGDLEVLRSDDSCCQFPNEGEYFD------------------HTGNFPSPIELAN 304 Query: 788 LDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEID 609 L FLAKRCKLGYRA I+ELA+ I EG+ Q+EQLEEL + + S Y +L QL I Sbjct: 305 LSESFLAKRCKLGYRAGYILELAQGIVEGKIQLEQLEELSKDASL-SCYKQLGDQLKPIK 363 Query: 608 
GFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFL 429 GFGPFT ANVLMC+G+Y VIP DSET+RHLK++H +++++T++RD+E++YGKY ++FL Sbjct: 364 GFGPFTRANVLMCLGYYHVIPWDSETVRHLKQVHSKNTSSKTIERDLEEIYGKYEPYQFL 423 Query: 428 AYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 A+WSE+W+FYE FGK +EM Y ITASNMR Sbjct: 424 AFWSEIWDFYETRFGKMNEMHSSEYKRITASNMR 457 >ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica] Length = 461 Score = 343 bits (881), Expect = 1e-91 Identities = 203/459 (44%), Positives = 263/459 (57%), Gaps = 19/459 (4%) Frame = -1 Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467 FDL AVCSHGLFMMAPN WDP+ + L RP Sbjct: 36 FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95 Query: 1466 XXSP----LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDM 1299 + LD+ ++L QV RMLRLSE D + EF +H A+ GFGR+FRSPTLFEDM Sbjct: 96 EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155 Query: 1298 VKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITP 1119 VKC+LLCNCQW RTL+MA ALCE+QL LK S + E F TP Sbjct: 156 VKCILLCNCQWTRTLSMATALCEIQLELKCSS------------------SVEDFQSRTP 197 Query: 1118 IGRELKRKRSMKK-IPANLDCKFSENETK---LEAETTN--CHQQTTCFLSKEKPSPSFL 957 RE KRKRS ++ + L+ +F+E++ + + + T+N H +T +LS S Sbjct: 198 PIRERKRKRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASET 257 Query: 956 ISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVD 777 SA D NS LN+ ++ C IGDFP+P+ELA+LD Sbjct: 258 GSAC-DSLPSLDNSELSLNNAPGLEDC-----------------IGDFPTPEELANLDEG 299 Query: 776 FLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIP---------SLYDKLAKQ 624 FLAKRC LGYRA RI+ LAR + EG+ +++LEE+ C +P S ++L K+ Sbjct: 300 FLAKRCNLGYRAKRIVMLARGVVEGKVCLQKLEEM-CRISVPAAEEVSTIESACERLNKE 358 Query: 623 LMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYA 444 L I GFGPFT ANVLMCMGF IP D+ET+RHLK++H +ST +V ++++K+YGKYA Sbjct: 359 LSAISGFGPFTRANVLMCMGFNHTIPADTETIRHLKQVHKRASTISSVHQELDKIYGKYA 418 Query: 443 QFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 F+FLAYW ELW FY K 
FGK EM P NY L TAS+++ Sbjct: 419 PFQFLAYWFELWGFYNKQFGKICEMEPSNYRLFTASHLK 457 >ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis] gi|223541451|gb|EEF43001.1| conserved hypothetical protein [Ricinus communis] Length = 458 Score = 338 bits (868), Expect = 3e-90 Identities = 199/458 (43%), Positives = 255/458 (55%), Gaps = 12/458 (2%) Frame = -1 Query: 1664 GDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXX 1485 G++ +FDLEK VCSHGLFM++PN WDP ++T RP Sbjct: 16 GEAADTFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLL 75 Query: 1484 XXXXXXXXS-PLDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGF-------GRV 1329 P Q+ LL Q+ RMLRLS+ DE +EF KI + GRV Sbjct: 76 VRVYGNRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRV 135 Query: 1328 FRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKP 1149 RSPTLFEDMVKC+LLCNCQW RTL+MA ALC+ Q+ L S S + K Sbjct: 136 LRSPTLFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQSPQQ-------------KH 182 Query: 1148 NTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPS 969 F+P TP+ +E KRK + K+P E+ + TC + + Sbjct: 183 AFNHFIPNTPVKKEPKRKIRLSKVPT---------------ESMDLEAADTCLTTDDSQM 227 Query: 968 P-SFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCR---IGDFPSPK 801 S ++ +D S SCQ N + SD + C G+FPSP+ Sbjct: 228 KISNSLNCVDDGSFDNLKSCQGSNTFYSTGPYATSD--IQSHLVTQHCAKKTTGNFPSPR 285 Query: 800 ELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQL 621 ELA+LD FLAKRC LGYRA RII+LA+ I EGR + + E++ + S Y KL QL Sbjct: 286 ELANLDERFLAKRCGLGYRAGRIIKLAQGIVEGRIPLREFEQVSNGGSL-STYSKLTDQL 344 Query: 620 MEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQ 441 EI+GFGPFT ANVLMCMGFY VIP DSET+RH K++H +ST +TVQ + E++Y K+A Sbjct: 345 REIEGFGPFTRANVLMCMGFYHVIPTDSETVRHFKQVHAKNSTIKTVQSEAEEIYRKFAP 404 Query: 440 FKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 F+FL YW+ELW+FYE+ FGK SEMP NY LITASN+R Sbjct: 405 FQFLVYWAELWHFYEQRFGKLSEMPCSNYKLITASNLR 442 >gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group] Length = 442 Score = 330 bits (846), Expect = 1e-87 
Identities = 196/454 (43%), Positives = 255/454 (56%), Gaps = 14/454 (3%) Frame = -1 Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467 FDLE AVCSHGLFMMAPN WDP+++ L RP Sbjct: 37 FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 1466 XXSP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 1308 +P LDQ +L QV RMLRL E D + EF +H A+ GFGR+FRSPTLF Sbjct: 97 LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156 Query: 1307 EDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLP 1128 EDM+KC+LLCNCQW RTL+M+ ALCELQL L+S S +TE F Sbjct: 157 EDMIKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQS 198 Query: 1127 ITPIGRELKRKRSMKK-IPANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPS 969 TP RE KRKRS K+ + L+ KF+E+ + L T N + + + E + Sbjct: 199 RTPPIRECKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANETGN 258 Query: 968 PSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELAS 789 S +S + + K + C ++ C GDFP+P+ELA+ Sbjct: 259 TSE-VSLDHSELKLRYELC--------LEDCG-----------------GDFPTPEELAN 292 Query: 788 LDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEID 609 LD DFLAKRC LGYRA RI+ LARSI EG+ +++LEE+ R+I L ++L I Sbjct: 293 LDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEI---RKI------LIEELSTIS 343 Query: 608 GFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFL 429 G PF NVLMCMGF+ +IP D+ET+RHLK+ H +ST +VQ++++ +YGKYA F+FL Sbjct: 344 GIWPFHSCNVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFL 403 Query: 428 AYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 AYW ELW FY K FG S+M P NY L TAS ++ Sbjct: 404 AYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 437 >dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group] gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza sativa Japonica Group] Length = 501 Score = 326 bits (835), Expect = 2e-86 Identities = 202/504 (40%), Positives = 262/504 (51%), Gaps = 64/504 (12%) Frame = -1 Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467 FDLE AVCSHGLFMMAPN WDP+++ L RP Sbjct: 37 
FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 1466 XXSP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 1308 +P LDQ +L QV RMLRL E D + EF +H A+ GFGR+FRSPTLF Sbjct: 97 LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156 Query: 1307 EDMVKCMLLCNCQ------------------------------------------WPRTL 1254 EDM+KC+LLCNCQ W RTL Sbjct: 157 EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216 Query: 1253 TMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKK-I 1077 +M+ ALCELQL L+S S +TE F TP RE KRKRS K+ + Sbjct: 217 SMSTALCELQLELRSSS------------------STENFQSRTPPIRECKRKRSNKRNV 258 Query: 1076 PANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNS 915 L+ KF+E+ + L T N + + + E + S +S + + K + Sbjct: 259 RVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANETGNTSE-VSLDHSELKLRYEL 317 Query: 914 CQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANR 735 C ++ C GDFP+P+ELA+LD DFLAKRC LGYRA R Sbjct: 318 C--------LEDCG-----------------GDFPTPEELANLDEDFLAKRCNLGYRARR 352 Query: 734 IIELARSITEGRFQIEQLEEL--------DCNREIPSLYDKLAKQLMEIDGFGPFTCANV 579 I+ LARSI EG+ +++LEE+ + PS YD+L ++L I GFGPFT ANV Sbjct: 353 IVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANV 412 Query: 578 LMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFY 399 LMCMGF+ +IP D+ET+RHLK+ H +ST +VQ++++ +YGKYA F+FLAYW ELW FY Sbjct: 413 LMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFLAYWCELWGFY 472 Query: 398 EKSFGKASEMPPPNYHLITASNMR 327 K FG S+M P NY L TAS ++ Sbjct: 473 NKQFGIISDMEPINYRLFTASKLK 496 >ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508778583|gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 426 Score = 322 bits (826), Expect = 3e-85 Identities = 206/482 (42%), Positives = 257/482 (53%), Gaps = 18/482 (3%) Frame = -1 Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554 EE+ N S L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP Sbjct: 21 
EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 80 Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383 L Q LL QV+RMLRLSE +E + Sbjct: 81 RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 140 Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233 +EF KI H E + R F GRVFRSPTLFEDMVKC+LLCNCQ Sbjct: 141 REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ------------ 188 Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053 A++D F+P TP G ELKRK + K+ L+ KF Sbjct: 189 ------------------AAEDD---------FIPKTPAGNELKRKLRVSKVSMRLEGKF 221 Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873 +E D SK Q L++ + Sbjct: 222 AEPRA--------------------------------DHSKSDLQPSQELDEPHAYKG-- 247 Query: 872 MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693 +G FPSP+ELA+LD FLAKRC LGYRA+RI++LA+ I +G Q Sbjct: 248 ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 291 Query: 692 IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513 + QLEE C S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+ Sbjct: 292 LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 350 Query: 512 IHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASN 333 +H SST +TV RDVE +Y KYA F+FLAYW+ELW++YE+ FGK SEMP Y LITASN Sbjct: 351 VHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMPFCGYKLITASN 410 Query: 332 MR 327 M+ Sbjct: 411 MK 412 >ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus sinensis] Length = 409 Score = 315 bits (806), Expect = 5e-83 Identities = 193/444 (43%), Positives = 252/444 (56%), Gaps = 24/444 (5%) Frame = -1 Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503 LLKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP Sbjct: 7 LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63 Query: 1502 XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365 + Q LL QV RMLRLSE+DE ++EF +I Sbjct: 64 
VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123 Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215 + A+ G GRVFRSPTLFEDMVKCMLLCNCQWPRTL+MARALCELQ L Sbjct: 124 VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183 Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035 + +C +E F+P TP G+E KR++ + K+ + L + +E++ Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227 Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855 E + N L +E PSF + E D G LN+ + D S D Sbjct: 228 SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275 Query: 854 SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675 RIG+FPSP+ELA+LD FLAKRC LGYRA RI++LAR I +G+ Q+ +LE+ Sbjct: 276 ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELED 326 Query: 674 LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495 + CN + Y KLA+QL +I+GFGPFT NVL+C+GFY VIP DSET+RHLK++H + Sbjct: 327 M-CNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385 Query: 494 TNRTVQRDVEKVYGKYAQFKFLAY 423 T++TVQ E +YGKYA F+FLAY Sbjct: 386 TSKTVQMIAESIYGKYAPFQFLAY 409 >ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508778584|gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 421 Score = 308 bits (790), Expect = 4e-81 Identities = 199/450 (44%), Positives = 247/450 (54%), Gaps = 18/450 (4%) Frame = -1 Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554 EE+ N S L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP Sbjct: 36 EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 95 Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383 L Q LL QV+RMLRLSE +E + Sbjct: 96 RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 155 Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233 +EF KI H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC Sbjct: 156 
REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 215 Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053 ELQ + + G A D F+P TP G ELKRK + K+ L+ KF Sbjct: 216 ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 262 Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873 AE H ++ S+E P KG Sbjct: 263 --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 288 Query: 872 MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693 +G FPSP+ELA+LD FLAKRC LGYRA+RI++LA+ I +G Q Sbjct: 289 ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 332 Query: 692 IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513 + QLEE C S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+ Sbjct: 333 LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 391 Query: 512 IHGISSTNRTVQRDVEKVYGKYAQFKFLAY 423 +H SST +TV RDVE +Y KYA F+FLAY Sbjct: 392 VHSKSSTMQTVGRDVEGIYAKYAPFQFLAY 421 >ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma cacao] gi|508778585|gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 406 Score = 289 bits (739), Expect = 3e-75 Identities = 190/436 (43%), Positives = 236/436 (54%), Gaps = 18/436 (4%) Frame = -1 Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554 EE+ N S L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP Sbjct: 21 EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 80 Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383 L Q LL QV+RMLRLSE +E + Sbjct: 81 RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 140 Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233 +EF KI H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC Sbjct: 141 REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 200 Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053 ELQ + + G A D F+P TP G ELKRK + K+ L+ KF Sbjct: 201 
ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 247 Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873 AE H ++ S+E P KG Sbjct: 248 --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 273 Query: 872 MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693 +G FPSP+ELA+LD FLAKRC LGYRA+RI++LA+ I +G Q Sbjct: 274 ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 317 Query: 692 IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513 + QLEE C S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+ Sbjct: 318 LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 376 Query: 512 IHGISSTNRTVQRDVE 465 +H SST +TV RDVE Sbjct: 377 VHSKSSTMQTVGRDVE 392 >gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii] Length = 333 Score = 287 bits (735), Expect = 9e-75 Identities = 173/356 (48%), Positives = 216/356 (60%), Gaps = 9/356 (2%) Frame = -1 Query: 1367 IHPEAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLG 1188 +H A+ GFGR+FRSPTLFEDMVKC+LLCNCQW RTL+MA ALCELQL LK + Sbjct: 1 MHAAAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMATALCELQLELKCSA----- 55 Query: 1187 TEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKK-IPANLDCKFSENETKLEAETTNC 1011 TE TP RE KRKRS + + L+ KF+E E LE Sbjct: 56 -------------GTEDLQLRTPPIREHKRKRSKNQNVRVKLEKKFTELEC-LEDPRVET 101 Query: 1010 HQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSS 831 Q T + S +I+ E D K + Q+ + V S E EG Sbjct: 102 AQDT-----RVATGTSDVITHLEADEK-LASLPQVAPETGSVCQSFDSSELSLEG----- 150 Query: 830 CRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE-----LDC 666 C IGDFP+P+ELA+LD DFLAKRC LGYRA RI+ LARSI EG+ + LEE L Sbjct: 151 C-IGDFPTPEELANLDEDFLAKRCGLGYRAERIVLLARSIVEGKVCPQNLEEMQKMSLPA 209 Query: 665 NRE---IPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495 E IPS Y++L +L I GFGPFT ANVLMCMGF+ +IP D+ET+RHLK+ H I+S Sbjct: 210 TEELSTIPSTYERLNNELTTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQCHEIAS 269 Query: 494 TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327 T ++V +++K+YG+YA 
F+FLAYW ELW FY+K FGK +EM P Y L TAS ++
Sbjct: 270 TIKSVHMELDKIYGEYAPFQFLAYWFELWGFYDKQFGKITEMDPSTYRLFTASALK 325