BLASTX nr result
ID: Ephedra28_contig00016437
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00016437
         (1769 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   341   7e-91
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   318   6e-84
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   318   6e-84
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   287   1e-74
gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]    285   3e-74
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   285   4e-74
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     284   1e-73
gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus...   282   4e-73
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   281   5e-73
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   279   3e-72
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   278   5e-72
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   276   2e-71
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   273   1e-70
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   268   4e-69
gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]    256   2e-65
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   252   4e-64
gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]    249   3e-63
gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]   244   1e-61
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...
238 6e-60 gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao] 225 5e-56 >ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] gi|548856677|gb|ERN14505.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda] Length = 458 Score = 341 bits (874), Expect = 7e-91 Identities = 195/454 (42%), Positives = 254/454 (55%), Gaps = 30/454 (6%) Frame = +3 Query: 63 GGECT-LRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXX 239 G E T L + V SF+LE+AVCS+GFFMM+PN W S+ +TL RPLRL D Sbjct: 4 GAERTVLTLPVNESFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQL 63 Query: 240 XXXXXXR----VFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFG 407 V G S+L D+ ++ AQV RMLR+SE +D ++ FH ++ AK GFG Sbjct: 64 SLSSQKSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFG 123 Query: 408 RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXF 587 RVFRSPTLFED+VK+ LLCNC+W RTLSMA +LC+LQ EL G L Sbjct: 124 RVFRSPTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDFSKSVNL 183 Query: 588 YPKTPTKTRLKRRE-----------------------CSEILRPAKLR--FDETSQCKVM 692 P TP + K+R E LRP L F + S Sbjct: 184 SPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPTMFS 243 Query: 693 SGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELA 872 S + G ++S K LG + L + ++ L L AGNFP P+ELA Sbjct: 244 SEEGRNGKLNYDQVSEEK---------LGDGAILDNQLLENKTLSFFLEAGNFPCPEELA 294 Query: 873 SLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDG 1052 +L E L KRC VG+R++RI+ LA+ I G++DL +E + L+ L L + G Sbjct: 295 NLDEKILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPIHLDGLMRQLLSIYG 354 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 VG + C+ VLM MGIYQ +P DTET+RHLKQ R CTI ++ D+EE+Y K+ PFQFL Sbjct: 355 VGPYVCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFL 414 Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334 YW E+W+ YEK+FG+LS MPPSDY LI+ HNMK Sbjct: 415 VYWSEMWEFYEKRFGKLSQMPPSDYELITAHNMK 448 >ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum 
tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED: uncharacterized protein LOC102593287 isoform X2 [Solanum tuberosum] Length = 485 Score = 318 bits (814), Expect = 6e-84 Identities = 185/441 (41%), Positives = 246/441 (55%), Gaps = 25/441 (5%) Frame = +3 Query: 96 SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------ 257 ++FDLE+AVCS+G FMM+PNRW S KTL RPL L + Sbjct: 29 ATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQSVLVQINQPSDSPH 88 Query: 258 ----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSP 425 RVFG + L+ + + QV RM+RLS E+ + F + +AK +G GRVFRSP Sbjct: 89 SLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSP 148 Query: 426 TLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK--------GKPLGWXXXXXXXXXXX 581 TLFED+VK LLCNC+W RTLSMA +LC+LQ EL P Sbjct: 149 TLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSE 208 Query: 582 XFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEI--SIT 743 F P+TP ++R CS L +E ++ S+ E+ Sbjct: 209 HFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSN 268 Query: 744 KCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRA 923 C V ++G+S+ + + +L S GNFP+PKELASL E++L KRCG+GYRA Sbjct: 269 LCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRA 328 Query: 924 RRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKLDGVGKFTCDVVLMCMGIY 1100 RI+ LAK I GSI L LE + + D A L+++DG G FTC VLMC+G Y Sbjct: 329 GRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYY 388 Query: 1101 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1280 +PTD+ET+RHLKQV R+ TI++V DVE +Y KY PFQFLAYW E+W YE++FG+ Sbjct: 389 HVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGK 447 Query: 1281 LSHMPPSDYGLISGHNMKEER 1343 LS MP S+Y LI+ NM+ +R Sbjct: 448 LSEMPHSEYKLITAANMRRKR 468 >ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum lycopersicum] Length = 483 Score = 318 bits (814), Expect = 6e-84 Identities = 189/459 (41%), Positives = 251/459 (54%), Gaps = 25/459 (5%) Frame = +3 Query: 42 
ERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXX 221 E +E+ G C +SFDLE+AVCS+G FMM+PNRW + KTL RPLRL + Sbjct: 16 ELPLEDGNGYC-------ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDD 68 Query: 222 XXXXXXXXXXXX----------RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371 RV L+ + + QV RM+RLS E+ + F Sbjct: 69 DHEQSVLVQITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQ 128 Query: 372 RVHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK------- 530 + +AK +GFGRVFRSPTLFED+VK LLCNC+W RTLSMA +LC+LQ EL Sbjct: 129 EICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAAS 188 Query: 531 -GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMS 695 P F P+TP L++R CS L +E Sbjct: 189 FPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPG 248 Query: 696 GKISEGCSIVSEI--SITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKEL 869 ++ S+ E+ C V + S+ L + + +L S GNFP+PK+L Sbjct: 249 VTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQL 308 Query: 870 ASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKL 1046 ASL E++L KRCG+GYRA RI+ LAK I GSI L+ LE + + D A L+++ Sbjct: 309 ASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREI 368 Query: 1047 DGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQ 1226 DG G FTC VLMC+G Y +PTD+ET+RHLKQV R+ TI++V DVE +Y KY PFQ Sbjct: 369 DGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQ 427 Query: 1227 FLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEER 1343 FLAYW E+W YE++FG+LS MP S+Y LI+ NM+ +R Sbjct: 428 FLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRPKR 466 >ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max] Length = 443 Score = 287 bits (734), Expect = 1e-74 Identities = 179/429 (41%), Positives = 233/429 (54%), Gaps = 12/429 (2%) Frame = +3 Query: 84 VSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXRV 263 + + S F LE+AVCS+G FMM PN W KTL RPLR RV Sbjct: 18 MELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLR-SSPSSFLVSLSQHSQSLAVRV 76 Query: 264 FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH--SQAKSQGFGRVFRSPTLFE 437 L+ Q +NHI AQV RMLR 
SE E+ A+ F +H GRVFRSPTLFE Sbjct: 77 HATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFE 136 Query: 438 DIVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPTKTR 614 D+VK LLCNC+W RTLSMA +LC+LQ EL+ G P F PKTP Sbjct: 137 DMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIPKTPASKE 196 Query: 615 LKRRECSE--ILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 788 +R + S + KL D Q + S ++ + + G S Sbjct: 197 TRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTATTL-------------LTTDNGDSE 243 Query: 789 KLTSNTMDTSELPSN-----LMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQI 953 +L S+ D+ SN GNFP+P ELA+L E++L KRCG+GYRA I+ LA+ I Sbjct: 244 ELRSH--DSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARAI 301 Query: 954 CNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1127 G I L LE + D S+ + L L+++ G G FT VLMC+G Y +PTD+ET Sbjct: 302 VEGKIQLGQLEELSKDASLS-NYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 360 Query: 1128 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDY 1307 VRHLKQV R T K++ ++EE+Y KY P+QFLA+W E+WD YE +FG+L+ M SDY Sbjct: 361 VRHLKQVHSRY-TTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 419 Query: 1308 GLISGHNMK 1334 LI+ NM+ Sbjct: 420 KLITACNMR 428 >gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 467 Score = 285 bits (730), Expect = 3e-74 Identities = 182/454 (40%), Positives = 246/454 (54%), Gaps = 18/454 (3%) Frame = +3 Query: 27 AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206 ++S C ++E GE F+LE+AVCS+G FMM+PN+W ++L RPLRL D Sbjct: 41 SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99 Query: 207 EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371 RV+G L+ Q + + QV RMLRLSE E+ + F Sbjct: 100 HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159 Query: 372 RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521 ++ H + ++ GRVFRSPTLFED+VK LLCNC++ RTLSMA +LC+LQ Sbjct: 160 KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219 Query: 522 
ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701 E + G F PKTP LKR KLR Sbjct: 220 ETQRPFSG------VRAAEDDFIPKTPAGNELKR----------KLR------------- 250 Query: 702 ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881 VS++S+ + +G++ P S + + E + G+FP+P+ELA+L Sbjct: 251 -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052 E++L KRC +GYRA RIL LAK I G I L LE +G ++ L L L+++DG Sbjct: 303 ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 G FTC VLMCMG Y +P D+ET+RHLKQV +S T+++V DVE +YAKY PFQFL Sbjct: 361 FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419 Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334 AYW E+W YE++FG+LS MP Y LI+ NMK Sbjct: 420 AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 453 >ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica] Length = 461 Score = 285 bits (729), Expect = 4e-74 Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 32/446 (7%) Frame = +3 Query: 102 FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------RV 263 FDL AVCS+G FMM+PNRW A + L RPLRL + V Sbjct: 36 FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95 Query: 264 FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFEDI 443 G L+ D ++I QV RMLRLSE + A+ F +H+ A+ +GFGR+FRSPTLFED+ Sbjct: 96 EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155 Query: 444 VKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTP----TKT 611 VK LLCNC+W RTLSMA +LC++Q ELK F +TP K Sbjct: 156 VKCILLCNCQWTRTLSMATALCEIQLELK-----------CSSSVEDFQSRTPPIRERKR 204 Query: 612 RLKRRECSEILRPAKLRFDETSQCKVMSGK------------ISEGCSIVSEISITKCDG 755 + +R+ I + D+ + SG +S S+ SE + CD Sbjct: 205 KRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASETG-SACDS 263 Query: 756 EYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRIL 935 +P+L +S +N + G+FPTP+ELA+L E +L KRC 
+GYRA+RI+ Sbjct: 264 ---LPSLDNSELSLNNAPGLED-----CIGDFPTPEELANLDEGFLAKRCNLGYRAKRIV 315 Query: 936 NLAKQICNGSIDLDSLE----------NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLM 1085 LA+ + G + L LE +++ E L L + G G FT VLM Sbjct: 316 MLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSAISGFGPFTRANVLM 375 Query: 1086 CMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYE 1265 CMG +P DTET+RHLKQV R+ TI SV +++++Y KY PFQFLAYW+E+W Y Sbjct: 376 CMGFNHTIPADTETIRHLKQVHKRAS-TISSVHQELDKIYGKYAPFQFLAYWFELWGFYN 434 Query: 1266 KQFGRLSHMPPSDYGLISGHNMKEER 1343 KQFG++ M PS+Y L + ++K+ + Sbjct: 435 KQFGKICEMEPSNYRLFTASHLKKAK 460 >gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis] Length = 472 Score = 284 bits (726), Expect = 1e-73 Identities = 186/447 (41%), Positives = 240/447 (53%), Gaps = 34/447 (7%) Frame = +3 Query: 96 SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC------------DEXXXXXXXXXX 239 ++F LE AVCS+G FMM+PN+W KTL RPLRL D+ Sbjct: 14 ATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDSVMARISQPH 73 Query: 240 XXXXXXRVF---GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 410 RV G LT ++ + AQV RMLRLS+ E+ F V+ G GR Sbjct: 74 DRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVYGCGS--GLGR 131 Query: 411 VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 590 VFRSPTLFED+VK LLCNC+W RTLSMA +LCDLQ EL+ + + F Sbjct: 132 VFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSV--------PSKTVDFV 183 Query: 591 PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 770 PKTP KR+ + K TSQ S + E S +++SI Sbjct: 184 PKTPAGKEPKRK-----VEKLKASTCLTSQFDAQSNEGLESHS--NDLSIDISQPTPSAQ 236 Query: 771 NLGSSSKLT----------SNTMDTSEL--PSNLM------AGNFPTPKELASLSENYLT 896 NL SS L+ S +D++ L P L G+FPTP ELA L E +L Sbjct: 237 NLSPSSLLSVPMENVTCEESYGVDSASLCNPQILRDREFEGTGDFPTPTELAKLDEKFLA 296 Query: 897 KRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVGKFTCD 1073 KRC +GYRA RIL LA+ I G I L LE + L L+++DG G FTC Sbjct: 297 KRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYSKLAVQLRQIDGFGPFTCA 356 Query: 1074 
VVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIW 1253 VLMCMG Y +P+D+ET+RHL+QV GR+ T++++ DV+++YAKY PFQFLAYW E+W Sbjct: 357 NVLMCMGFYHVIPSDSETIRHLQQVHGRNS-TVRTIERDVQQIYAKYEPFQFLAYWSELW 415 Query: 1254 DSYEKQFGRLSHMPPSDYGLISGHNMK 1334 YEK+FG++S MP S Y L + NMK Sbjct: 416 HFYEKKFGKISEMPCSAYKLFTASNMK 442 >gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris] Length = 474 Score = 282 bits (721), Expect = 4e-73 Identities = 180/445 (40%), Positives = 243/445 (54%), Gaps = 19/445 (4%) Frame = +3 Query: 102 FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRL-----CDEXXXXXXXXXXXXXXXXRVF 266 F L++AVCS+GFFMM+PN W KTL RPL L RV Sbjct: 46 FQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQRPQSLAVRVH 105 Query: 267 GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHS-QAKSQGFG-RVFRSPTLFED 440 + ++ Q + HIKAQ+ RMLRLSE E+ A+ F VH+ ++ FG RVFRSPTLFED Sbjct: 106 SVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRVFRSPTLFED 165 Query: 441 IVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPT--KT 611 +VK LLCNC+W RTLSMA +LC+LQS L+ G P F PKTP + Sbjct: 166 MVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCAVEGSGNPKVEAEEFVPKTPASKEN 225 Query: 612 RLKRRECSEILRPAKLRF------DETSQCKVMSGKISEGCSIVSEISITKCDGEYI-VP 770 R K+ +L KL D Q M S+ +++ ++ + + D P Sbjct: 226 RRKKAPTKGVLLKKKLELELEMEVDGNLQMDHMFASSSD-TTLLGDLEVLRSDDSCCQFP 284 Query: 771 NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950 N G T GNFP+P ELA+LSE++L KRC +GYRA IL LA+ Sbjct: 285 NEGEYFDHT---------------GNFPSPIELANLSESFLAKRCKLGYRAGYILELAQG 329 Query: 951 ICNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTE 1124 I G I L+ LE + D S+ + L L+ + G G FT VLMC+G Y +P D+E Sbjct: 330 IVEGKIQLEQLEELSKDASLSC-YKQLGDQLKPIKGFGPFTRANVLMCLGYYHVIPWDSE 388 Query: 1125 TVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSD 1304 TVRHLKQV ++ + K++ D+EE+Y KY P+QFLA+W EIWD YE +FG+++ M S+ Sbjct: 389 TVRHLKQVHSKNTSS-KTIERDLEEIYGKYEPYQFLAFWSEIWDFYETRFGKMNEMHSSE 447 Query: 1305 YGLISGHNMKEERAVTSIDPDKSQE 1379 Y I+ NM+ R T+ SQ+ Sbjct: 448 
YKRITASNMRSTRKATNKRKRPSQK 472 >ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] gi|550342350|gb|EEE79091.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa] Length = 489 Score = 281 bits (720), Expect = 5e-73 Identities = 182/456 (39%), Positives = 238/456 (52%), Gaps = 42/456 (9%) Frame = +3 Query: 99 SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC-------------DEXXXXXXXXXX 239 +F+LE+AVCS+G FMMSPN W T RPLRL Sbjct: 29 TFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPTTSLFVSISHPPHL 88 Query: 240 XXXXXXRVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQ------- 398 RV+G L+ + + + AQVVRMLRLSE ++ F ++ A ++ Sbjct: 89 PRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLT 148 Query: 399 GFG-RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXX 575 GFG RVFRSPTLFED+VK LLCNC+W RTLSMA +LC+LQ EL+ K G Sbjct: 149 GFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCKSSGVFVAQAVNAT 208 Query: 576 XXX--------FYPKTPTKTRLKRR-ECSEILRPAKLRFDET-------SQCKVMSGKIS 707 F P T KR S++ + + ET + K S I Sbjct: 209 VKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLEADANLKTDSAHIG 268 Query: 708 -EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPS--NLMAGNFPTPKELASL 878 E V S +C + GS S + + N M NFP+P+ELA+L Sbjct: 269 RETLESVENDSCARCSSRH-----GSDSWAPDSLQSQHGIQPGVNKMICNFPSPRELANL 323 Query: 879 SENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENP--DGSVQMKLEDLKAYLQKLDG 1052 E++L KRC +GYRA RI+ LA+ I G I L +E +G+ L +++DG Sbjct: 324 DESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDG 383 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 G FTC VLMCMG Y +PTD+ETVRHLKQV + TI++V DVEE+Y KY PFQFL Sbjct: 384 FGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS-TIQTVQRDVEEIYGKYAPFQFL 442 Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEE 1340 AYW E+W YEK+FG+LS +P SDY LI+ NM+ + Sbjct: 443 AYWAELWHFYEKRFGKLSEIPTSDYKLITASNMRSK 478 >ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus sinensis] Length = 454 Score = 279 bits (713), Expect = 3e-72 Identities = 177/451 
(39%), Positives = 240/451 (53%), Gaps = 30/451 (6%) Frame = +3 Query: 69 ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248 E L++ + +F+LE AVCS+G FMMSPNRW ++L RPL L + Sbjct: 5 ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64 Query: 249 XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 380 + L+ + ++ + AQV RMLRLSE ++ + F R+ Sbjct: 65 TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124 Query: 381 SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530 Q A+ +G GRVFRSPTLFED+VK LLCNC+W RTLSMA +LC+LQ Sbjct: 125 RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180 Query: 531 GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 707 W F P+TP KRR+ S++ R E+ + Sbjct: 181 -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235 Query: 708 EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPSNL-MAGNFPTPKELASLS 881 C+ V E ++ P S L N + T++ PS GNFP+P+ELA+L Sbjct: 236 LDCAGVLEENVQPS-----FPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLE-DLKAYLQKLDGVG 1058 E++L KRC +GYRA RIL LA+ I +G I L LE+ + L L +++G G Sbjct: 291 ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350 Query: 1059 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1238 FT + VL+C+G Y +PTD+ET+RHLKQV R+ CT K+V M E +Y KY PFQFLAY Sbjct: 351 PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQMIAESIYGKYAPFQFLAY 409 Query: 1239 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1331 W E+W YEK+FG+LS MP SDY LI+ NM Sbjct: 410 WSELWHFYEKRFGKLSEMPYSDYKLITASNM 440 >ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] gi|557533482|gb|ESR44600.1| hypothetical protein CICLE_v10001110mg [Citrus clementina] Length = 454 Score = 278 bits (711), Expect = 5e-72 Identities = 173/453 (38%), Positives = 243/453 (53%), Gaps = 32/453 (7%) Frame = +3 Query: 69 ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248 E L++ + +F+LE AVCS+G FMMSPNRW ++L RPL L + Sbjct: 5 
ESVLKLPLAETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64 Query: 249 XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV- 377 + L+ + ++ + AQV RMLRLSE ++ + F R+ Sbjct: 65 TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIV 124 Query: 378 ---------HSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530 SQ + GRVFRSPTLFED+VK LLCNC+W RTL+MA +LC+LQ Sbjct: 125 RQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQ---- 180 Query: 531 GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISE 710 W F P+TP KRR+ + +K+ TS ++ K S Sbjct: 181 -----WELQHCSPSISEDFIPQTPAGKESKRRQ-----KVSKVASKLTS--RIAESKASS 228 Query: 711 GCSIVSEISITKCDGEYIVPNLGSSSKLTS----NTMDTSELPSNL-MAGNFPTPKELAS 875 + ++ T E + P+ + + N + T++ PS GNFP+P+ELA+ Sbjct: 229 EDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNELSTTDPPSACDRIGNFPSPRELAN 288 Query: 876 LSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKAYLQKLDG 1052 L E++L KRC +GYRA RIL LA+ I +G I L LE+ + + L L +++G Sbjct: 289 LDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAEQLSQING 348 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 G FT + VL+C+G Y +PTD+ET+RHLKQV R+ CT K+V + E +Y KY PFQFL Sbjct: 349 FGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQIIAESIYGKYSPFQFL 407 Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1331 AYW E+W YEK+FG+LS MP SDY LI+ NM Sbjct: 408 AYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440 >ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis] gi|223541451|gb|EEF43001.1| conserved hypothetical protein [Ricinus communis] Length = 458 Score = 276 bits (706), Expect = 2e-71 Identities = 167/436 (38%), Positives = 235/436 (53%), Gaps = 22/436 (5%) Frame = +3 Query: 99 SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX---RVFG 269 +FDLE+ VCS+G FM+SPN W +T RPLRL D+ RV+G Sbjct: 21 TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLLVRVYG 80 Query: 270 ISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGF-------GRVFRSPT 428 L+ + + + Q+VRMLRLS+ ++ F ++ S + + GRV RSPT Sbjct: 81 
NRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSPT 140 Query: 429 LFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTK 608 LFED+VK LLCNC+W RTLSMA +LC Q EL + F P TP K Sbjct: 141 LFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQS------PQQKHAFNHFIPNTPVK 194 Query: 609 TRLKRRECSEILRPAKLRFDETSQCKVMSG---KISEGCSIVSEISITKCDGEYIVPNLG 779 KR+ + + + C KIS + V + S + + G Sbjct: 195 KEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSF-----DNLKSCQG 249 Query: 780 SSSKLTSNTMDTSELPSNLMA--------GNFPTPKELASLSENYLTKRCGVGYRARRIL 935 S++ ++ TS++ S+L+ GNFP+P+ELA+L E +L KRCG+GYRA RI+ Sbjct: 250 SNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRII 309 Query: 936 NLAKQICNGSIDLDSLEN-PDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVP 1112 LA+ I G I L E +G L L++++G G FT VLMCMG Y +P Sbjct: 310 KLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIP 369 Query: 1113 TDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHM 1292 TD+ETVRH KQV ++ TIK+V + EE+Y K+ PFQFL YW E+W YE++FG+LS M Sbjct: 370 TDSETVRHFKQVHAKNS-TIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEM 428 Query: 1293 PPSDYGLISGHNMKEE 1340 P S+Y LI+ N++ + Sbjct: 429 PCSNYKLITASNLRNK 444 >gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group] Length = 463 Score = 273 bits (699), Expect = 1e-70 Identities = 169/439 (38%), Positives = 230/439 (52%), Gaps = 27/439 (6%) Frame = +3 Query: 102 FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 263 FDLE AVCS+G FMM+PNRW A + L RPLRL + V Sbjct: 37 FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 264 FGI--SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFE 437 G L+ D+ I QV RMLRL E + A F +H+ A+ GFGR+FRSPTLFE Sbjct: 97 LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156 Query: 438 DIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRL 617 D+VK LLCNC+W RTLSM+ +LC+LQ EL+ F +TP Sbjct: 157 DMVKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIREC 205 Query: 618 KRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSK 791 KR+ 
++ KL +F+E + ++ ++ + + +P+ S + Sbjct: 206 KRKRSNKRNVRVKLETKFNEDKLVCLEDPNLA-----TDTANLQTYENSFNLPSAASGTG 260 Query: 792 LTSN-TMDTSELPSNL------MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950 TS ++D SEL G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ Sbjct: 261 NTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARS 320 Query: 951 ICNGSIDLDSLENPD----------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIY 1100 I G I L LE + + L L + G G FT VLMCMG + Sbjct: 321 IVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFF 380 Query: 1101 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1280 +P DTET+RHLKQ R+ TI SV +++ +Y KY PFQFLAYW E+W Y KQFG+ Sbjct: 381 HMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGK 439 Query: 1281 LSHMPPSDYGLISGHNMKE 1337 +S M P +Y L + +K+ Sbjct: 440 ISDMEPINYRLFTASKLKK 458 >gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group] Length = 442 Score = 268 bits (686), Expect = 4e-69 Identities = 169/433 (39%), Positives = 229/433 (52%), Gaps = 21/433 (4%) Frame = +3 Query: 102 FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDE------XXXXXXXXXXXXXXXXRV 263 FDLE AVCS+G FMM+PNRW A + L RPLRL + V Sbjct: 37 FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 264 FGI---SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 434 G L+ D+ I QV RMLRL E + A+ F +H+ A+ GFGR+FRSPTLF Sbjct: 97 LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156 Query: 435 EDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTR 614 ED++K LLCNC+W RTLSM+ +LC+LQ EL+ F +TP Sbjct: 157 EDMIKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIRE 205 Query: 615 LKRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 788 KR+ ++ KL +F+E + ++ T E + +L SS+ Sbjct: 206 CKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLA-----------TNTANENLF-SLPSSA 253 Query: 789 KLTSNTMDTS----------ELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILN 938 T NT + S EL G+FPTP+ELA+L E++L KRC +GYRARRI+ Sbjct: 254 NETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313 Query: 939 
LAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTD 1118 LA+ I G I L LE ++ +E+L + G+ F VLMCMG + +P D Sbjct: 314 LARSIVEGKICLQKLEE---IRKILIEELST----ISGIWPFHSCNVLMCMGFFHMIPAD 366 Query: 1119 TETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPP 1298 TET+RHLKQ R+ TI SV +++ +Y KY PFQFLAYW E+W Y KQFG +S M P Sbjct: 367 TETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425 Query: 1299 SDYGLISGHNMKE 1337 +Y L + +K+ Sbjct: 426 INYRLFTASKLKK 438 >gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 426 Score = 256 bits (654), Expect = 2e-65 Identities = 170/454 (37%), Positives = 230/454 (50%), Gaps = 18/454 (3%) Frame = +3 Query: 27 AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206 ++S C ++E GE F+LE+AVCS+G FMM+PN+W ++L RPLRL D Sbjct: 26 SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 84 Query: 207 EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371 RV+G L+ Q + + QV RMLRLSE E+ + F Sbjct: 85 HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 144 Query: 372 RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521 ++ H + ++ GRVFRSPTLFED+VK LLCNC+ Sbjct: 145 KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ---------------- 188 Query: 522 ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701 F PKTP LKR KLR Sbjct: 189 ----------------AAEDDFIPKTPAGNELKR----------KLR------------- 209 Query: 702 ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881 VS++S+ + +G++ P S + + E + G+FP+P+ELA+L Sbjct: 210 -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 261 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052 E++L KRC +GYRA RIL LAK I G I L LE +G ++ L L L+++DG Sbjct: 262 ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 319 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 G FTC VLMCMG Y +P D+ET+RHLKQV +S T+++V DVE +YAKY PFQFL Sbjct: 320 
FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 378 Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334 AYW E+W YE++FG+LS MP Y LI+ NMK Sbjct: 379 AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 412 >dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group] gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza sativa Japonica Group] Length = 501 Score = 252 bits (643), Expect = 4e-64 Identities = 169/481 (35%), Positives = 230/481 (47%), Gaps = 69/481 (14%) Frame = +3 Query: 102 FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 263 FDLE AVCS+G FMM+PNRW A + L RPLRL + V Sbjct: 37 FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96 Query: 264 FGISQ---LTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 434 G L+ D+ I QV RMLRL E + A+ F +H+ A+ GFGR+FRSPTLF Sbjct: 97 LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156 Query: 435 EDIVKAFLLCNC------------------------------------------RWQRTL 488 ED++K LLCNC RW RTL Sbjct: 157 EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216 Query: 489 SMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKL--R 662 SM+ +LC+LQ EL+ F +TP KR+ ++ KL + Sbjct: 217 SMSTALCELQLELRSSS-----------STENFQSRTPPIRECKRKRSNKRNVRVKLETK 265 Query: 663 FDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNL-- 836 F+E + ++ + + S+ E G++S+++ +D SEL Sbjct: 266 FNEDKMVCLEDPNLATNTANENLFSLPSSANE-----TGNTSEVS---LDHSELKLRYEL 317 Query: 837 ----MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD--- 995 G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ I G I L LE Sbjct: 318 CLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMS 377 Query: 996 -------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVG 1154 + + L L + G G FT VLMCMG + +P DTET+RHLKQ Sbjct: 378 VPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHK 437 Query: 1155 RSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334 R+ TI SV +++ +Y KY PFQFLAYW E+W Y KQFG +S M P +Y L + +K Sbjct: 438 
RAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 496 Query: 1335 E 1337 + Sbjct: 497 K 497 >gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 421 Score = 249 bits (636), Expect = 3e-63 Identities = 165/422 (39%), Positives = 225/422 (53%), Gaps = 18/422 (4%) Frame = +3 Query: 27 AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206 ++S C ++E GE F+LE+AVCS+G FMM+PN+W ++L RPLRL D Sbjct: 41 SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99 Query: 207 EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371 RV+G L+ Q + + QV RMLRLSE E+ + F Sbjct: 100 HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159 Query: 372 RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521 ++ H + ++ GRVFRSPTLFED+VK LLCNC++ RTLSMA +LC+LQ Sbjct: 160 KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219 Query: 522 ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701 E + +P F PKTP LKR KLR Sbjct: 220 ETQ-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR------------- 250 Query: 702 ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881 VS++S+ + +G++ P S + + E + G+FP+P+ELA+L Sbjct: 251 -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052 E++L KRC +GYRA RIL LAK I G I L LE +G ++ L L L+++DG Sbjct: 303 ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232 G FTC VLMCMG Y +P D+ET+RHLKQV +S T+++V DVE +YAKY PFQFL Sbjct: 361 FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419 Query: 1233 AY 1238 AY Sbjct: 420 AY 421 >gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao] Length = 374 Score = 244 bits (622), Expect = 1e-61 Identities = 153/424 (36%), Positives = 215/424 (50%), Gaps = 5/424 (1%) Frame = +3 Query: 66 GECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXX 245 GEC+ 
SSF++E+AVC++G FMMSPN W+ + K+L RPLRL D Sbjct: 12 GECS------SSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPA 65 Query: 246 XXXX----RVFGI-SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 410 +V G+ + ++ D+ I QV RMLR+S ++ + F +H AK +GFGR Sbjct: 66 PNHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGR 125 Query: 411 VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 590 +FRSP+ FED VK+ LLCNC GW Sbjct: 126 IFRSPSFFEDAVKSILLCNC------------------------GWK------------- 148 Query: 591 PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 770 +T T + R C+ L+ A + KIS TK Sbjct: 149 -RTLT---MARALCALQLQLASAHLQHKRVASNSNVKIS-----------TKRLKHKKYT 193 Query: 771 NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950 S+S+L+ + D S GNFPT ELA L E YL +RC +GYRAR IL LA++ Sbjct: 194 KASSTSELSMSGFDQS-------IGNFPTSTELACLDEKYLNERCNLGYRARCILQLARK 246 Query: 951 ICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETV 1130 + NG ++L+ LE + S E L K+ G G F C ++MC+G Y+ +P D+ET+ Sbjct: 247 VENGELELNKLE--ESSDTTSYERFYQKLMKIKGFGPFVCSNIMMCIGFYERIPFDSETI 304 Query: 1131 RHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYG 1310 RHLK V G+ C+ K++ D+EE+Y KY PFQ +AYW E+ D YE +FG+LS + S Y Sbjct: 305 RHLKMVHGKGKCSRKTIEKDIEEIYGKYAPFQCMAYWLELLDEYENKFGKLSELESSSYH 364 Query: 1311 LISG 1322 L +G Sbjct: 365 LATG 368 >ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus sinensis] Length = 409 Score = 238 bits (607), Expect = 6e-60 Identities = 158/420 (37%), Positives = 218/420 (51%), Gaps = 30/420 (7%) Frame = +3 Query: 69 ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248 E L++ + +F+LE AVCS+G FMMSPNRW ++L RPL L + Sbjct: 5 ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64 Query: 249 XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 380 + L+ + ++ + AQV RMLRLSE ++ + F R+ Sbjct: 65 TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124 Query: 381 
SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530 Q A+ +G GRVFRSPTLFED+VK LLCNC+W RTLSMA +LC+LQ Sbjct: 125 RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180 Query: 531 GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 707 W F P+TP KRR+ S++ R E+ + Sbjct: 181 -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235 Query: 708 EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPS-NLMAGNFPTPKELASLS 881 C+ V E ++ + P S L N + T++ PS GNFP+P+ELA+L Sbjct: 236 LDCAGVLEENV-----QPSFPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVG 1058 E++L KRC +GYRA RIL LA+ I +G I L LE+ + L L +++G G Sbjct: 291 ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350 Query: 1059 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1238 FT + VL+C+G Y +PTD+ET+RHLKQV R +CT K+V M E +Y KY PFQFLAY Sbjct: 351 PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHAR-NCTSKTVQMIAESIYGKYAPFQFLAY 409 >gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 406 Score = 225 bits (573), Expect = 5e-56 Identities = 154/408 (37%), Positives = 213/408 (52%), Gaps = 18/408 (4%) Frame = +3 Query: 27 AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206 ++S C ++E GE F+LE+AVCS+G FMM+PN+W ++L RPLRL D Sbjct: 26 SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 84 Query: 207 EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371 RV+G L+ Q + + QV RMLRLSE E+ + F Sbjct: 85 HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 144 Query: 372 RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521 ++ H + ++ GRVFRSPTLFED+VK LLCNC++ RTLSMA +LC+LQ Sbjct: 145 KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 204 Query: 522 ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701 E + +P F PKTP LKR KLR Sbjct: 205 ETQ-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR------------- 235 Query: 702 ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 
881 VS++S+ + +G++ P S + + E + G+FP+P+ELA+L Sbjct: 236 -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 287 Query: 882 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052 E++L KRC +GYRA RIL LAK I G I L LE +G ++ L L L+++DG Sbjct: 288 ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 345 Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVE 1196 G FTC VLMCMG Y +P D+ET+RHLKQV +S T+++V DVE Sbjct: 346 FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVE 392