BLASTX nr result
ID: Rehmannia22_contig00024891
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00024891
         (2376 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...   333   2e-88
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...   327   1e-86
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...   315   6e-83
ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyp...   310   2e-81
ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267...   305   6e-80
gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus pe...   305   8e-80
ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyp...   303   2e-79
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]     302   4e-79
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...   302   4e-79
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...   298   6e-78
gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]    297   1e-77
gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]    294   1e-76
gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus...   290   2e-75
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...   287   1e-74
ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp...   283   3e-73
gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma caca...   280   2e-72
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...   278   6e-72
emb|CBI40233.3| unnamed protein product [Vitis vinifera]              278   8e-72
ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like i...
272 4e-70 ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251... 266 3e-68 >ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|568878417|ref|XP_006492190.1| PREDICTED: uncharacterized protein LOC102610545 [Citrus sinensis] gi|557538863|gb|ESR49907.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 732 Score = 333 bits (854), Expect = 2e-88 Identities = 273/739 (36%), Positives = 362/739 (48%), Gaps = 108/739 (14%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 E ++QR + M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQ Sbjct: 7 EMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RKKAEKATA+VLAILEN+GIS++S+ FDS SDQ E+P + + N K R+ Sbjct: 67 RKKAEKATADVLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRR 125 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RR 1700 N R LSW + ++ +LEK Y DS +RRR+SF S S+ R Sbjct: 126 NASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNR 183 Query: 1699 VGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTL 1523 VGKSCR+IR R+++S + TE G V + E G E L Sbjct: 184 VGKSCRQIRRRESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYL 241 Query: 1522 RSNS-----ETQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367 S E +K+ G F+ D DME AL+ Q QLIG+Y E+FR Sbjct: 242 GEGSDSGCFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFR 301 Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE 1202 ENNS T DSCDPGN SDVTEER E K ++ R AGT NS QE K E Q+ S Sbjct: 302 ENNSSTPDSCDPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSN 360 Query: 1201 ---KPETSKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRS-QQFPPMV 1034 P++ + + + E A +F+F MS EK NQE LG H + S + P Sbjct: 361 GFLPPQSGDQKCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHG 418 Query: 1033 QTTTQSSTKISPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLN 869 QSS +S +T S+ ++ S + A+VP VLEALK+A+ SL 
Sbjct: 419 SPENQSSQTVS----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLR 474 Query: 868 QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------N 716 QK+++ P T R+ G V +PS + + D +IPV GLFR+PTDY E + Sbjct: 475 QKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSD 534 Query: 715 ARPGFANFPPENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTP 587 +RP AN+ P + +G + D+RS F++ DLFLT P ++ Sbjct: 535 SRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSA 594 Query: 586 ERPFSQPRLSEGPSSNRMNR--LDSYTNPVLPSVK-------DSYP-------------- 476 E + S+ S M R DS + LPS + SYP Sbjct: 595 ENRLLTRQYSDTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLST 654 Query: 475 FL---------------------------------PDVTLRVPLNEGGASRNFPSSERGL 395 FL PD+ ++P +E G S PS G+ Sbjct: 655 FLPGRSVEMSVEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGM 713 Query: 394 PPVMRLSSYDEHVRPDMYR 338 PP L +++H RP MYR Sbjct: 714 PPANHLPFHNDHTRPYMYR 732 >ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|557538862|gb|ESR49906.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 716 Score = 327 bits (839), Expect = 1e-86 Identities = 270/729 (37%), Positives = 356/729 (48%), Gaps = 108/729 (14%) Frame = -3 Query: 2200 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2021 M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQRKKAEKATA+ Sbjct: 1 MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60 Query: 2020 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1841 VLAILEN+GIS++S+ FDS SDQ E+P + + N K R+N Sbjct: 61 VLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119 Query: 1840 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RRVGKSCRRIRH 1670 R LSW + ++ +LEK Y DS +RRR+SF S S+ RVGKSCR+IR Sbjct: 120 NDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRR 177 Query: 1669 RDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTLRSNS-----E 1508 R+++S + TE G V + E G E L S E Sbjct: 178 RESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYLGEGSDSGCFE 235 
Query: 1507 TQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337 +K+ G F+ D DME AL+ Q QLIG+Y E+FRENNS T DSC Sbjct: 236 NEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFRENNSSTPDSC 295 Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE---KPETSKR 1181 DPGN SDVTEER E K ++ R AGT NS QE K E Q+ S P++ + Sbjct: 296 DPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSNGFLPPQSGDQ 354 Query: 1180 SLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRS-QQFPPMVQTTTQSSTKI 1004 + + E A +F+F MS EK NQE LG H + S + P QSS + Sbjct: 355 KCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTV 412 Query: 1003 SPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLNQKLNNSPPTA 839 S +T S+ ++ S + A+VP VLEALK+A+ SL QK+++ P T Sbjct: 413 S----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLPSTE 468 Query: 838 GRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------NARPGFANFPP 686 R+ G V +PS + + D +IPV GLFR+PTDY E ++RP AN+ P Sbjct: 469 SRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSDSRPSLANYNP 528 Query: 685 ENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTPERPFSQPRLS 557 + +G + D+RS F++ DLFLT P ++ E + S Sbjct: 529 TSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLTRQYS 588 Query: 556 EGPSSNRMNR--LDSYTNPVLPSVK-------DSYP--------------FL-------- 470 + S M R DS + LPS + SYP FL Sbjct: 589 DTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLSTFLPGRSVEMS 648 Query: 469 -------------------------PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 365 PD+ ++P +E G S PS G+PP L ++ Sbjct: 649 VEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGMPPANHLPFHN 707 Query: 364 EHVRPDMYR 338 +H RP MYR Sbjct: 708 DHTRPYMYR 716 >ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis] gi|223526443|gb|EEF28720.1| hypothetical protein RCOM_0152200 [Ricinus communis] Length = 665 Score = 315 bits (807), Expect = 6e-83 Identities = 250/649 (38%), Positives = 337/649 (51%), Gaps = 50/649 (7%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 E ++QR + M++S 
AMTIEFLRARLLSERSVS+TARQRADELA +VAELEEQL+ VSLQ Sbjct: 7 EKQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 R KAEKATA++LAILE +GISD+SE FDSCSD+ ++P + K N K+R Sbjct: 67 RMKAEKATADILAILEGNGISDISETFDSCSDR-DTPCESKVGN-RSSKEENSINSKVRN 124 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS-NSLSARRVG 1694 N GRSLSW+ K+S +LEK S+RRR+SF S S +R G Sbjct: 125 NDSEELSGSDFDFSSVPGRSLSWKGRKNSPRSLEKSK-DSSMRRRSSFSSVGSSPKQRPG 183 Query: 1693 KSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE------S 1532 KSCR+IR +++R K C D + + ++E + +++ Sbjct: 184 KSCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDFEPKRGEVKPLLEDSH 240 Query: 1531 STLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSG 1352 S N +G ++V+ D DME AL+HQ QLIGQY EKFRENNS Sbjct: 241 SDCLGNERNASDNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKFRENNSS 300 Query: 1351 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQ 1172 T DSCD GN SD+TEERYE++ P ++ T+N+ E V+ + +P S Sbjct: 301 TPDSCDHGNRSDITEERYEIREP--AKGPATTNAIQTEGLLSVVEGVSNTQPHGFLPSSH 358 Query: 1171 NENIISCESSAS-----EFS-----FPMSREKNNQEFLG--------IQHNASQYRSQQF 1046 + + E +S EFS FPM++ K NQ+ G I H+ S Q+ Sbjct: 359 VDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASFGSQY 418 Query: 1045 PPMVQTTTQ--SSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSL 872 Q+ S+T S + K+T+ S + +L A LG VLEAL+ A+ SL Sbjct: 419 SSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVPHKA---SGGLGGVLEALEEARQSL 475 Query: 871 NQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFAN 695 Q++N P A SV + S + T D QIPV GLFRLPTD+ E N R + Sbjct: 476 QQRINRLPSVATTVRKSV-ESSVSTTISRDEVQIPVGCVGLFRLPTDFSVEGNTRANLLS 534 Query: 694 FPPENSLG--------------RFLSEPF-DSRSAFSS-DLFLTDPY-----RPFTPERP 578 + SLG +F++ P+ RS+ S+ D FL+ Y R TP +P Sbjct: 535 SSAQLSLGNHYSDRGVPAAASNQFVASPYLQGRSSSSTEDQFLSSQYVGGGSRIPTP-KP 593 Query: 577 FSQPRLSEG-PSSNRMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEG 434 + P L G PSS+R YT P P + SY PD+ R+P EG Sbjct: 594 
YFDPYLDTGLPSSSR------YTYPNYP-INTSY---PDLMPRIPSREG 632 >ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1 [Solanum tuberosum] Length = 643 Score = 310 bits (793), Expect = 2e-81 Identities = 254/678 (37%), Positives = 330/678 (48%), Gaps = 47/678 (6%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 +D++QRK M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877 RKKAEKATA VL+ILEN GISD SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1703 + STGRSLSW+S K S + E+ Y DS RR SF S S S + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185 Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNG-SDGEHVALREYENGKNQLESST 1526 R GKSCRRIR T++ D C + ++ H +L + G N ++ Sbjct: 186 RAGKSCRRIRRNTTKTATDE---------CPPEHLPSFANNGHQSLMD-SAGNNDVKD-- 233 Query: 1525 LRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1346 + + T +M E D+ ME ALQH+ QLIGQY EK+RENN+ Q Sbjct: 234 -QRHLPTSEMSENQRKSDESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQ 292 Query: 1345 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE---- 1193 DSCDPGN+SDVTEER +MK+ E +A N N K ++VD P Sbjct: 293 DSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHI 352 Query: 1192 -TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQ 1019 TS R QN II+ ES ASEF+ K+N Y Q P Sbjct: 353 GTSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP-------- 400 Query: 1018 SSTKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSP 848 S SP ++S+ SL A+V + DN+GS+L AL++AK S++Q++N SP Sbjct: 401 -SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSP 459 Query: 847 PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLG 671 G +S P T + D I PGLFRLPTD+Q E + FP S Sbjct: 460 
IAEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSA 515 Query: 670 RFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPV-LPS 494 EP D F T PY +P + + G + +N + +P S Sbjct: 516 NHFHEP-------GYDQFSTTPYME-SPSNAITGLPYTTG--FDYLNPPSGFGHPFSSKS 565 Query: 493 VKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLP 392 +YPF P+ T V PL E + R+ P +E G P Sbjct: 566 TYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKP 625 Query: 391 PVMRLSSYDEHVRPDMYR 338 P +S YD H+RP+MYR Sbjct: 626 PSFPVSHYDAHLRPNMYR 643 >ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267607 [Solanum lycopersicum] Length = 617 Score = 305 bits (781), Expect = 6e-80 Identities = 250/682 (36%), Positives = 328/682 (48%), Gaps = 51/682 (7%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 +D++QRKT M E+++MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 KDQDQRKTVGM-ENSSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877 RKKAEKATA VL+ILEN GI+D SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGITDASEEFDSGSDQEAIFSNSKGADSTDNRNEYKPDPSNVK 125 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSNSLSA-R 1703 + STGRSLSW+S K S + E+ Y DS RR SF S S+ + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGTSSPK 185 Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1523 R GKSCRRIR +T + + ND QL T Sbjct: 186 RAGKSCRRIRRSNTNAGNNDVND------------------------------QLHLPTS 215 Query: 1522 RSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1343 ++ +K D E D+ ME ALQH+ LIG+Y EK+RENN QD Sbjct: 216 ETSENQRKAD-------ESDEGMERALQHKALLIGKYEAEEKAQREWEEKYRENNYA-QD 267 Query: 1342 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFS--------EKPETS 1187 SCDPGN+SDVTEER +MK+ E +A N N K ++VD + P S Sbjct: 268 SCDPGNYSDVTEERDDMKAFEQPYSAEMINLQNHANKFQEVDIPSTNGVTDNVPSNPHIS 327 Query: 1186 KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016 +++N 
II+ ES ASEF+ P K+N Y Q P Sbjct: 328 TSCRKDQNCSRIINSESPASEFALP----KSNGSCPENDGPTPAYCHHQLP--------- 374 Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845 S+ SP + ++S+ SL A+V DN+GS+L AL++AK S++Q++N S P Sbjct: 375 SSNGSPIQPLENSISSSGGSSLQAGQALVSGDASDNIGSILGALEQAKFSISQQINVS-P 433 Query: 844 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 668 GR+S + S D IP PGLFRLPTD+Q E + FP S Sbjct: 434 VEGRSS---IEHSIPTAKIEDRLDIPPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 490 Query: 667 FLSEPFDSRSAFSSDLFLTDPYR-----PFTPERPFSQPRLSEG-PSSNRMNRLDSYTNP 506 EP + FS+ ++ P P+T + P S G P S++ Sbjct: 491 HFHEP--GYNQFSATPYMESPSNAITGLPYTTGFDYLNPPSSFGHPFSSK---------- 538 Query: 505 VLPSVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSE 404 S +YPF P+ T V PL E + R+ P +E Sbjct: 539 ---STYPTYPFRPNTTTTVSQSQASWSPLYESSLTKSSPVVVPNLSSGEDVFLRSLPRNE 595 Query: 403 RGLPPVMRLSSYDEHVRPDMYR 338 G PP +S YD H+RP+MYR Sbjct: 596 TGKPPSFPVSHYDAHMRPNMYR 617 >gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica] Length = 690 Score = 305 bits (780), Expect = 8e-80 Identities = 259/700 (37%), Positives = 336/700 (48%), Gaps = 69/700 (9%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 + ++QR M++S AMTIEFLRARLL+ERSVS++ARQR DEL + V ELEEQLK VSLQ Sbjct: 7 DTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQLKIVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVS-EEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLR 1874 RK AEKAT +VLAILE+ GISD+S EEFDS SDQ E+ K N K+R Sbjct: 67 RKMAEKATEDVLAILESQGISDISEEEFDSSSDQ-ETHQGSKVGNSLANEEESFVISKVR 125 Query: 1873 KNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-- 1700 + GRSLSW+ DS + EK + SVRRR+SF S S+ R Sbjct: 126 RKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSREKCKDL-SVRRRSSFSSIGFSSPRHH 184 Query: 1699 VGKSCRRIRHRDTRSME-DSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESS 1529 +GKSCR+I+H++TRS + DS +G A S N S+G LRE + L + Sbjct: 185 LGKSCRQIKHKETRSDKFDSHENGV--GASSEGLPNFSNGGPEKLREGSEFPEEKVLSND 242 Query: 1528 
TLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGT 1349 +L E Q+ F+ H RD DME AL+HQ +LI + EKFRENN+ T Sbjct: 243 SLSRTKENQRDSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQREWEEKFRENNTST 302 Query: 1348 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC------------FS 1205 DSCDPGNHSD+TEER E+K+ + +AG + QETK E+ D C F Sbjct: 303 PDSCDPGNHSDITEERDEIKA-QTPCSAGVVVAQAQETKSEEGDVCLPKETFKIQQNGFL 361 Query: 1204 EKPETSKRSLQNE--NIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQ 1031 LQ++ S EF+FP K N E L + S P + Sbjct: 362 PASHVDMGGLQDQLNKSTVAPSQVEEFAFPTENGKQNHESLENFARHPSHGSHPNPLVHG 421 Query: 1030 TTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEALKRAKSSLNQKL 860 + S+ S S S A+VP QD LG VL+ALK+AK SL Q + Sbjct: 422 SAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHDSQDRLGGVLDALKQAKLSLQQNM 481 Query: 859 NNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA----------- 713 P G + +PS D +IPV GLFRLPTD+ E A Sbjct: 482 TRLPLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEEAATQSSFLGSSW 541 Query: 712 -----------------RPGFANFPPENSLGRFLSEPF-DSRSAFS---SDLFLTDPY-- 602 RP F+ N+ R++ P+ ++R FS +D F+ + Y Sbjct: 542 SGRYCPETLVTSSFVETRPTFS----MNAADRYVPSPYIETRQTFSTNATDRFIPNAYVE 597 Query: 601 -RPFTP---ERPF----SQPRLSEGPSSNRM----NRLDSYTNPVLPSVKDSYPFLPDVT 458 RP P PF S S P+ NR Y P P +YP +PD T Sbjct: 598 SRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPPYP----NYPSVPDRT 653 Query: 457 LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 + +E +R P G P R S YD+ RP+MYR Sbjct: 654 PWITSDE-ALTRALPRKPVG-APTDRFSFYDQ-FRPNMYR 690 >ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2 [Solanum tuberosum] Length = 618 Score = 303 bits (776), Expect = 2e-79 Identities = 252/677 (37%), Positives = 322/677 (47%), Gaps = 46/677 (6%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 +D++QRK M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1877 RKKAEKATA 
VL+ILEN GISD SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1703 + STGRSLSW+S K S + E+ Y DS RR SF S S S + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185 Query: 1702 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1523 R GKSCRRIR T + + D + L +S + Sbjct: 186 RAGKSCRRIRRNTTNAGNNDVKD----------------------------QRHLPTSEM 217 Query: 1522 RSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1343 N +K D E D+ ME ALQH+ QLIGQY EK+RENN+ QD Sbjct: 218 SENQ--RKSD-------ESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQD 268 Query: 1342 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE----- 1193 SCDPGN+SDVTEER +MK+ E +A N N K ++VD P Sbjct: 269 SCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHIG 328 Query: 1192 TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016 TS R QN II+ ES ASEF+ K+N Y Q P Sbjct: 329 TSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP--------- 375 Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSPP 845 S SP ++S+ SL A+V + DN+GS+L AL++AK S++Q++N SP Sbjct: 376 SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSPI 435 Query: 844 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 668 G +S P T + D I PGLFRLPTD+Q E + FP S Sbjct: 436 AEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 491 Query: 667 FLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPV-LPSV 491 EP D F T PY +P + + G + +N + +P S Sbjct: 492 HFHEP-------GYDQFSTTPYME-SPSNAITGLPYTTG--FDYLNPPSGFGHPFSSKST 541 Query: 490 KDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLPP 389 +YPF P+ T V PL E + R+ P +E G PP Sbjct: 542 YPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKPP 601 Query: 388 VMRLSSYDEHVRPDMYR 338 +S YD H+RP+MYR Sbjct: 602 SFPVSHYDAHLRPNMYR 618 >gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis] Length = 654 Score = 302 bits (774), 
Expect = 4e-79 Identities = 257/689 (37%), Positives = 343/689 (49%), Gaps = 58/689 (8%) Frame = -3 Query: 2230 EDREQRKTTSMQESN--AMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057 E ++QR ++SM++S AMTIEFLRARLLSERSVS++ARQRADEL K+V ELEEQL+ VS Sbjct: 7 EKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEEQLRIVS 66 Query: 2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877 LQRK AEKAT +VL+ILENHGISD SE +DS SDQE NG Sbjct: 67 LQRKMAEKATVDVLSILENHGISDASETYDSGSDQETHQVANNYANGEERSVVSK----- 121 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-----ASFGSNSL 1712 R++ GRSLSW+ DS + EK Y DS RR +SFGS+S Sbjct: 122 RRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREK--YKDSSVRRQNALSSSFGSSS- 178 Query: 1711 SARRVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLES 1532 VGKSCR+IR R+TR++ + E + ENG Sbjct: 179 PKHYVGKSCRQIRCRETRTVVEDHKT-----------------EPLKFDSQENGAATPPE 221 Query: 1531 STLRSNSETQKMDGRYFDV--HERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENN 1358 +++++ + DV H ++ DM+ AL+H+ QLIGQY EK+RENN Sbjct: 222 GSVKNDRRIP----NHLDVNGHGQEKDMKKALEHRAQLIGQYEEMEKAQREWEEKYRENN 277 Query: 1357 SGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-ACFSEKPETS-- 1187 + T DS DPGNHSDVTE+R E+K+ L G + + K +VD + S KP+++ Sbjct: 278 TSTPDSYDPGNHSDVTEDRDEVKAQTLYN-VGIDIAQAVDAKSNKVDLSKESSKPQSNGF 336 Query: 1186 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQ 1049 ++ N + ++ A EF+FP ++EK QE L +R + Sbjct: 337 LHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQESL----ENRDFRPSE 392 Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLEL--------AVVPQDN---LGSVL 902 P Q +S P++ ALS S + A+VP + LG VL Sbjct: 393 SPHHGQLLHRSLPN-QPFDR--GALSDAGSSSHKRDFSGSQNDLYALVPHNPPVVLGGVL 449 Query: 901 EALKRAKSSLNQKLNNSP----PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPT 734 +ALK+AK SL QK+N P T A +P+ T D +IPV GLFRLPT Sbjct: 450 DALKQAKLSLQQKINRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPT 509 Query: 733 DYQPENARPGFANFPPENSLGRFLSEPF--DSRSAFSS-DLFLTDPY----RPFTPERPF 575 D+ A ANF S R EP+ D++ A ++ D FLT PY F P+ F Sbjct: 510 
DFATVEASTQ-ANFLSSGS--RLSLEPYYPDNKVALTAPDRFLTSPYIESRSEFPPDVRF 566 Query: 574 --SQPRLSEGPSSNRMNRLDSYTNPVLPSVK--------DSYPFLPDVTLRVPLNEGGAS 425 S +S +S +R DS+ + SV SYP PD R+P +E G Sbjct: 567 LTSSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYPPFPDSMPRIPSDE-GLR 625 Query: 424 RNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 R F SS P R S YD+H RP+MYR Sbjct: 626 RPFRSSRSFGLPEDRFSFYDDHGRPNMYR 654 >ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca subsp. vesca] Length = 807 Score = 302 bits (774), Expect = 4e-79 Identities = 220/577 (38%), Positives = 297/577 (51%), Gaps = 34/577 (5%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 + ++ R + M +S +TIEFLRARLLSERSVS++ARQRADEL K V ELEEQLK VSLQ Sbjct: 7 DTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQLKIVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQE---ESPHDFKARNGXXXXXXXXXXXK 1880 RK AEKATA+VLAILEN G SD+SEEFDS SD E ES K+R Sbjct: 67 RKMAEKATADVLAILENQGASDISEEFDSSSDHETFQESKMGNKSRKEEENFLISE---- 122 Query: 1879 LRKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSA 1706 R+N GR+LSW+ DS + EK S+RRR++F + +S S Sbjct: 123 -RRNEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSREKYK-EPSIRRRSTFSAVGSSSSR 180 Query: 1705 RRVGKSCRRIRHRDTRSM-----------EDSQNDGTEKAACSGDAFNGSDGEHVALREY 1559 +GKSCR+I+HR+TRS+ +DS+ +G ++ F+ D E + Sbjct: 181 HNLGKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDGPE 240 Query: 1558 ENGKNQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXX 1379 + L L + E Q+ F+ H R+ DME AL+HQ QLIGQ Sbjct: 241 SQKEKFLSKDALTRSKEHQRNGDPNFNGHGRNKDMERALEHQAQLIGQNEEMEMAQREWE 300 Query: 1378 EKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC-FSE 1202 EKFRENN+ T DSCDPGNHSD+TEER EMK+P A + S+ QE K E D+C F E Sbjct: 301 EKFRENNTSTPDSCDPGNHSDITEERDEMKTP---FPAEINASEAQEAKSEARDSCLFEE 357 Query: 1201 KPET--------------SKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQ 1064 K +T + N + ++ S EF+FP + E+ QE L + Sbjct: 358 KMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPS 417 Query: 1063 
YRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEAL 893 S P +++++ S+ +S S ++ + L A+VP Q+ LG VL+AL Sbjct: 418 PGSHHDPLLLESSHNRSSVVSSDGGSSFHNASGSRNDL---YALVPHDSQERLGGVLDAL 474 Query: 892 KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA 713 K+AK SL QK+ P + +P + IPV GLFRLPTD+ E A Sbjct: 475 KQAKLSLQQKIIRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCAGLFRLPTDFAVEEA 534 Query: 712 RPGFANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPY 602 + +SL P +A S+D F+T Y Sbjct: 535 ATKHSYLGLGSSLPSARYCPDKGLAASSTDQFVTSTY 571 >ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus] Length = 671 Score = 298 bits (764), Expect = 6e-78 Identities = 248/697 (35%), Positives = 335/697 (48%), Gaps = 66/697 (9%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 + ++ R ++++ AMTIEFLRARLLSERSVSK+ARQRADELAK+VAELEEQLK VSLQ Sbjct: 7 DQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RK AEKATA+VLAILE++G SD+SE DS SD E P K +G + R+ Sbjct: 67 RKMAEKATADVLAILEDNGASDISETLDSNSDHETEP---KVEDGLAREDVSSGTVR-RR 122 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSARRV 1697 N G SLSW+ DS H EK S+R R+SF S +S ++ Sbjct: 123 NEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYK-KHSIRSRSSFTSIGSSSPKHQL 181 Query: 1696 GKSCRRIRHRDTRSMEDSQN-------DGTEKAACSG--DAFNGSDGEHVALRE-YENGK 1547 G+SCR+I+ RDTR ++ Q D +E+ + D+ N S H LR+ YE + Sbjct: 182 GRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVRE 241 Query: 1546 NQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367 SS+ NS D +E+ DDME AL+ Q QLI QY EKFR Sbjct: 242 KTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 301 Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMK--SPELSRAAGTS-------NSDNQETKQEQVDA 1214 ENN+ T DSCDPGNHSD+TEER EM+ +P LS + D ++ Q Q + Sbjct: 302 ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNPANEAKPQVAFDCDTRDLSQAQTNG 361 Query: 1213 CFSEKPETSKRSL--QNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQ---YRSQQ 1049 L QN N IS 
S EF+FPM+ K QE Q N++Q S Sbjct: 362 LGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQE---SQENSAQEPSCTSHL 418 Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSLN 869 + + S I+ Y++++ + +P E L VLEALK+AK SL Sbjct: 419 NHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHE-----PPALDGVLEALKQAKLSLT 473 Query: 868 QKLNNSPPTAG------RASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NAR 710 +K+ P G ++ G + P D +IPV GLFRLPTD+ E +++ Sbjct: 474 KKIIKLPSVDGESESIDKSIGPLSIPKMG-----DRLEIPVGCAGLFRLPTDFAAEASSQ 528 Query: 709 PGF----------ANFPPENSL----------------------GRFLSEPFDSRSAFSS 626 F ++P E + R S + + S F+ Sbjct: 529 ANFLASSSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAGSGFTR 588 Query: 625 DLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPVLP-SVKDSYPFLPDVTLRV 449 D FLTD PE + P + + D Y + V P S +YP P V+ + Sbjct: 589 DGFLTD----HIPENRWKNP--------GQKHHFDQYFDAVQPSSYVHNYPPRP-VSSNI 635 Query: 448 PLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 N+ R FP +PP + S YD+ RP+MYR Sbjct: 636 HPND-TFLRTFPGRSTEMPPTNQYSFYDDQFRPNMYR 671 >gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 709 Score = 297 bits (761), Expect = 1e-77 Identities = 238/717 (33%), Positives = 353/717 (49%), Gaps = 84/717 (11%) Frame = -3 Query: 2236 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057 S + ++ ++TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS Sbjct: 4 SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63 Query: 2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877 +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+ Sbjct: 64 VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1703 R+ ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R Sbjct: 123 RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180 Query: 1702 -RVGKSCRRIRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYE 1556 R GKSCR+IR R++RS M D Q G E ++ +A + + G H+ E Sbjct: 181 HRQGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSE 239 
Query: 1555 NGKNQLESSTLRSNSETQKMDGRYFDV----HERDDDMESALQHQVQLIGQYXXXXXXXX 1388 +N+ L S++ + + FD+ +E + DME AL+HQ QLI Y Sbjct: 240 IHENKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQR 299 Query: 1387 XXXEKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACF 1208 EKFRE NS + DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + Sbjct: 300 EWEEKFREKNSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFS 357 Query: 1207 SEKPETS--------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFL 1088 +E P+ RSL E+ ++ S + +F M++E ++Q Sbjct: 358 AELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ 416 Query: 1087 GIQHNASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG- 911 +N+ S F + + + + S + P+ L A+VP + G Sbjct: 417 --SNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGR 473 Query: 910 --SVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLP 737 VL++LK+A+ SL QK++ G + G + S + + +IP+ GLFR+P Sbjct: 474 FTGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVP 533 Query: 736 TDYQPENARPGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPE 584 TD E + F AN P+ + S + S ++ + Y+P + + Sbjct: 534 TDISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSD 593 Query: 583 R----PFSQPRLSEGP-----------------------SSNRMN----RLDSYTNPVLP 497 R P+ PR S P + +R++ D PVLP Sbjct: 594 RFFSGPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLP 653 Query: 496 SVK----DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 S ++P PD+ ++ EG + + S P S YD H RPD++R Sbjct: 654 SSSLQNYPTFPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 708 >gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 749 Score = 294 bits (752), Expect = 1e-76 Identities = 237/708 (33%), Positives = 347/708 (49%), Gaps = 84/708 (11%) Frame = -3 Query: 2209 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2030 TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS+QR++AEKA Sbjct: 53 TTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKA 112 Query: 2029 
TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1850 TA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+R+ Sbjct: 113 TADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKVRQKESEELS 171 Query: 1849 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR--RVGKSCRR 1679 ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R R GKSCR+ Sbjct: 172 GSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRKHRQGKSCRQ 229 Query: 1678 IRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1529 IR R++RS M D Q G E ++ +A + + G H+ E +N+ Sbjct: 230 IRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSEIHENKSTVD 288 Query: 1528 TLRSNSETQKMDGRYFDV----HERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFREN 1361 L S++ + + FD+ +E + DME AL+HQ QLI Y EKFRE Sbjct: 289 NLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREK 348 Query: 1360 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS-- 1187 NS + DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + +E P+ Sbjct: 349 NSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSN 406 Query: 1186 ------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQY 1061 RSL E+ ++ S + +F M++E ++Q +N+ Sbjct: 407 DLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSN 463 Query: 1060 RSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALK 890 S F + + + + S + P+ L A+VP + G VL++LK Sbjct: 464 SSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLK 522 Query: 889 RAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENAR 710 +A+ SL QK++ G + G + S + + +IP+ GLFR+PTD E + Sbjct: 523 QARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPK 582 Query: 709 PGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQ 569 F AN P+ + S + S ++ + Y+P + +R P+ Sbjct: 583 ANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMY 642 Query: 568 PRLSEGP-----------------------SSNRMN----RLDSYTNPVLPSVK----DS 482 PR S P + +R++ D PVLPS + Sbjct: 643 PRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPT 702 Query: 481 YPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 +P PD+ ++ EG + + S P S 
YD H RPD++R Sbjct: 703 FPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 748 >gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris] Length = 652 Score = 290 bits (743), Expect = 2e-75 Identities = 238/666 (35%), Positives = 337/666 (50%), Gaps = 40/666 (6%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 + ++QR +S ++S AMTIEFLRARLLSERS+SK+ARQRADELA+KV ELEEQL+ V LQ Sbjct: 7 DPQDQRIASSTEDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQLRMVILQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RK AEKATA+VLAILE+ GIS VS+EFDS SD E+P D N K R+ Sbjct: 67 RKMAEKATADVLAILESQGISGVSDEFDSGSDL-ENPFDSSMSNECAKEDEGPMKSKGRQ 125 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEK-KNYMDSVRRRASFGSNSLSAR-RV 1697 + + +SLSW+ D H+LEK K +VRR++SF S S S + R+ Sbjct: 126 HGSDEMSGSNEDSSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSSSPKHRL 185 Query: 1696 GKSCRRIRHRDTRS-MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLR 1520 GKSCR+IRHR RS ME+S+ + + S+G + +G S+ L+ Sbjct: 186 GKSCRKIRHRQPRSVMEESRGKFVHVNCQVNELVSSSEG----FPNFRDG----GSNILK 237 Query: 1519 SNSETQKMDG---------RYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFR 1367 S+ Q+ DG + D + R+++ME AL+HQ +LI QY EKFR Sbjct: 238 IESKIQEEDGSEANLLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQREWEEKFR 297 Query: 1366 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS 1187 ENNS T DSCDPGNHSD+TE++ E K ++ AA S +E+K E C SE+ Sbjct: 298 ENNSTTPDSCDPGNHSDMTEDKDEGK-VQIPYAAKVVTSKAEESKGEPGGVCLSEE---- 352 Query: 1186 KRSLQNENIISCESSASE-FSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSST 1010 K + I+ + ++ + S + +FLG +++ S + Q +V +QSS Sbjct: 353 KLKAEGREIMPKKHDDTDVYRNQKSTTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSD 412 Query: 1009 KISPYEEKSTALST----------PPKISLPLELAVVPQDN--LGSVLEALKRAKSSLNQ 866 + + ++ T K L V + + VLE+LK+A+ SL Q Sbjct: 413 MNHLDQGRHSSFPTDIHGVQHQHDASKNQKDLYALVTREQSHQFDGVLESLKQARISLQQ 472 Query: 865 KLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPP 686 +LN P G G +P + + D F+IP GLFRLPTD+ E A P F P Sbjct: 473 
ELNRLPVVEG---GYTAKPLPSVSKNEDRFEIPFGFSGLFRLPTDFSDE-ATPRFNVRDP 528 Query: 685 ENSLGRFLSEPFDSRSAFSSDLFLTDP-------YRPFTPERPFSQPRLSEGPS-SNRMN 530 G + S S F T+P P ++ + L G S+ + Sbjct: 529 TTGFGSNY-HLNGTMSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGSRFSSSQS 587 Query: 529 RLDSYTN-PVLPSVKDSYPFLP------DVTLRVPLNEGGASRNFPSSERGLPPVMRLSS 371 D ++N L S K SYP P + T ++P + SR + +S G+P R S Sbjct: 588 PFDPFSNGGPLSSSKYSYPTFPINPSYQNATPQMPFGD-EVSRPYSNSTVGVPLANRFSF 646 Query: 370 YDEHVR 353 D+H+R Sbjct: 647 NDDHLR 652 >ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4 [Glycine max] Length = 641 Score = 287 bits (735), Expect = 1e-74 Identities = 232/662 (35%), Positives = 323/662 (48%), Gaps = 36/662 (5%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 + ++QR T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQ Sbjct: 7 DPQDQRVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RK AEKATA+VLAILE+ GISDVSEEFDS SD E+P D N K R+ Sbjct: 67 RKMAEKATADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQ 125 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVG 1694 + + +SLSW+ DS H+LEK ++RR++SF S S S + R G Sbjct: 126 HGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQG 184 Query: 1693 KSCRRIRHRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESST 1526 KSCR+IRHR R +E+S+N + ++ A F G + + E+ + S Sbjct: 185 KSCRKIRHRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSG 244 Query: 1525 LRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1346 ++ +DG + R+ DME AL+HQ QLI QY EKFRENNS T Sbjct: 245 ANPLNKNHHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTP 299 Query: 1345 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNE 1166 DSCDPGN+SD+TE++ E K + AA SD QE+K E C SE+ K + Sbjct: 300 DSCDPGNYSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEAR 354 Query: 1165 NII-SCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSSTKISPYEE 
989 +I+ +S + + + LG Q++ + Q V Q S Sbjct: 355 DIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPG 414 Query: 988 KSTALSTPPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLN 869 + + P S P ++ V N VLE+LK+A+ SL Sbjct: 415 RHGYHDSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQ 474 Query: 868 QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPG 704 Q+L P SG +PS + + D F++PV GLFR+PTD+ N + Sbjct: 475 QELKRLPLV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDP 531 Query: 703 FANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSN 539 A F L R +S D + F + PY P + L+ GP+ Sbjct: 532 TAGFGSNFHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGG 585 Query: 538 RMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEH 359 ++ YT P P + SY + T ++P SR + SS G+P R S +H Sbjct: 586 SLSS-SKYTYPTFP-INPSY---QNATPQMPFG-NEVSRPYSSSTVGVPLANRFSFNSDH 639 Query: 358 VR 353 +R Sbjct: 640 LR 641 >ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1 [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2 [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X3 [Glycine max] Length = 664 Score = 283 bits (723), Expect = 3e-73 Identities = 230/655 (35%), Positives = 318/655 (48%), Gaps = 36/655 (5%) Frame = -3 Query: 2209 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2030 T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQRK AEKA Sbjct: 37 TSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAEKA 96 Query: 2029 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1850 TA+VLAILE+ GISDVSEEFDS SD E+P D N K R++ Sbjct: 97 TADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQHGSDKMP 155 Query: 1849 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVGKSCRRIR 1673 + +SLSW+ DS H+LEK ++RR++SF S S S + R GKSCR+IR Sbjct: 156 GSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQGKSCRKIR 214 Query: 1672 
HRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRSNSET 1505 HR R +E+S+N + ++ A F G + + E+ + S ++ Sbjct: 215 HRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSGANPLNKN 274 Query: 1504 QKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSCDPGN 1325 +DG + R+ DME AL+HQ QLI QY EKFRENNS T DSCDPGN Sbjct: 275 HHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTPDSCDPGN 329 Query: 1324 HSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENII-SCE 1148 +SD+TE++ E K + AA SD QE+K E C SE+ K + +I+ Sbjct: 330 YSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEARDIMPKTH 384 Query: 1147 SSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALST 968 +S + + + LG Q++ + Q V Q S + + Sbjct: 385 DDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDS 444 Query: 967 PPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLNQKLNNSP 848 P S P ++ V N VLE+LK+A+ SL Q+L P Sbjct: 445 KPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLP 504 Query: 847 PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPGFANFPPE 683 SG +PS + + D F++PV GLFR+PTD+ N + A F Sbjct: 505 LV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGFGSN 561 Query: 682 NSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSNRMNRLDS 518 L R +S D + F + PY P + L+ GP+ ++ Sbjct: 562 FHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGGSLSS-SK 614 Query: 517 YTNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVR 353 YT P P + SY + T ++P SR + SS G+P R S +H+R Sbjct: 615 YTYPTFP-INPSY---QNATPQMPFG-NEVSRPYSSSTVGVPLANRFSFNSDHLR 664 >gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508727307|gb|EOY19204.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 665 Score = 280 bits (717), Expect = 2e-72 Identities = 230/704 (32%), Positives = 338/704 (48%), Gaps = 71/704 (10%) Frame = -3 Query: 2236 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2057 S + ++ ++TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS Sbjct: 4 SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63 Query: 
2056 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1877 +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+ Sbjct: 64 VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122 Query: 1876 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1703 R+ ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R Sbjct: 123 RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180 Query: 1702 -RVGKSCRRIRHRDTRSM-EDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1529 R GKSCR+IR R++RS+ E+ ++D + K SS Sbjct: 181 HRQGKSCRQIRRRESRSVAEELKSDN--------------------IMVDPQVKGLENSS 220 Query: 1528 TLRSNSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGT 1349 + +N T + DME AL+HQ QLI Y EKFRE NS + Sbjct: 221 EVNANHST------------GEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNSSS 268 Query: 1348 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS------ 1187 DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + +E P+ Sbjct: 269 PDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSNDLVP 326 Query: 1186 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQ 1049 RSL E+ ++ S + +F M++E ++Q +N+ S Sbjct: 327 PSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSNSSHH 383 Query: 1048 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALKRAKS 878 F + + + + S + P+ L A+VP + G VL++LK+A+ Sbjct: 384 FAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLKQARL 442 Query: 877 SLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGF- 701 SL QK++ G + G + S + + +IP+ GLFR+PTD E + F Sbjct: 443 SLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFL 502 Query: 700 --------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQPRLS 557 AN P+ + S + S ++ + Y+P + +R P+ PR S Sbjct: 503 GSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMYPRTS 562 Query: 556 EGP-----------------------SSNRMN----RLDSYTNPVLPSVK----DSYPFL 470 P + +R++ D PVLPS ++P Sbjct: 563 SSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPTFPSY 622 Query: 469 PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 PD+ ++ EG + + S P S YD H RPD++R Sbjct: 
623 PDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 664 >ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] gi|222850857|gb|EEE88404.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] Length = 684 Score = 278 bits (712), Expect = 6e-72 Identities = 239/691 (34%), Positives = 339/691 (49%), Gaps = 60/691 (8%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 E ++QR +SM++S A+TIEFLRARLL+ERSVS+TARQRADELA++VAELEEQL+ VSLQ Sbjct: 7 EKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSLQ 66 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 R KAEKAT +VLAILE++GISD SE F S SDQ+ +P + K K+ K Sbjct: 67 RMKAEKATVDVLAILESNGISDDSEIFGSSSDQD-TPCESKVGK-KTKQEESSVISKVTK 124 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-VG 1694 S GR+LSW+ K S +LEK S+RRR+SF S S S + G Sbjct: 125 YKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCKD-PSLRRRSSFASTSSSPKHHQG 183 Query: 1693 KSCRRIRHRDTR-------SMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE 1535 KSCR++R++++R + D + A + + F V ENG+ + Sbjct: 184 KSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEPEVG--RIENGEEKTL 241 Query: 1534 SSTLRSNSETQKMDGRYFD--VHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFREN 1361 Q+ D + V+ D DME AL+HQ QLI +Y EKFREN Sbjct: 242 PPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQLIDRYKAMEKVQREWEEKFREN 301 Query: 1360 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKR 1181 N T DS D GN SDVTEE YE+K+ ++ + GT + + K E V+ + +P R Sbjct: 302 NGSTPDSYDAGNRSDVTEEGYEIKA-QVQQHTGTVAAQSNRAKSE-VEKASNIQPNGILR 359 Query: 1180 ----------SLQNENIISCESSASEFSFPMSREK--NNQEFLGIQHNASQYRSQQFPPM 1037 ++ + + ES A +F+F ++K N+E LG ++ S + S P Sbjct: 360 PSHVNIGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHSSHDHP-- 417 Query: 1036 VQTTTQSSTKISPYEEKSTALSTPPKISLPL--------EL-AVVP---QDNLGSVLEAL 893 S+ SP + +T+ + EL A+VP + LG VL+AL Sbjct: 418 ----QSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRASNELGGVLDAL 473 Query: 892 KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-- 
719 K A+ SL QK++ P G + + PS D IP+ + GLFRLP D+ E Sbjct: 474 KLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAEGS 533 Query: 718 --------NARPGFANFPPEN-----SLGRFLSE-PFDSRSAF-SSDLFLTDPYRPFTPE 584 NA N+ P+ ++ RF+S P + S F ++D FL T Sbjct: 534 TRKNLDSTNAGLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSATGS 593 Query: 583 RPFSQPRLSEGPSSNRMNRLDS---YTNPVL-----PSVKDSYPFLPDVTLRVPLNEGGA 428 R ++ + +R+ S + P L PS + SYP P +P Sbjct: 594 RFPTEDQFLASQDVEAGSRISSQRPFFYPYLDTVSPPSARYSYPTNPSYPGPMPQLPSRE 653 Query: 427 SRNF-PSSERGLPPVMRLSSYDEHVRPDMYR 338 +F PS+ G+PP S D H+RP+MYR Sbjct: 654 PPSFLPSTTAGVPPADHFSFPDYHIRPNMYR 684 >emb|CBI40233.3| unnamed protein product [Vitis vinifera] Length = 682 Score = 278 bits (711), Expect = 8e-72 Identities = 187/418 (44%), Positives = 237/418 (56%), Gaps = 32/418 (7%) Frame = -3 Query: 2200 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2021 M++S AMTIEFLRARLLSERSVS+TARQRADELA++V +LEEQLK VS+QR KAEKATA+ Sbjct: 1 MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60 Query: 2020 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1841 VLAILENH ISDVS EFDS SDQE + D G Sbjct: 61 VLAILENHAISDVSWEFDSSSDQEVALCDSHVGGG------------------------- 95 Query: 1840 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR--VGKSCRRIRHR 1667 R LSW+SSKDS H++EK+ S+RRR SF S+ S+ + +GKSCR+IR R Sbjct: 96 --------RRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRR 147 Query: 1666 DTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESSTL 1523 +TRS M DSQN+G + S NG D LRE + + L + Sbjct: 148 ETRSAVDELKVGRVMVDSQNNGI--ISSSEGLPNGFDSGQEILREGSENQEEEALMDGQV 205 Query: 1522 RSNSETQKM---DGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSG 1352 + E+Q+ + + + RD DME AL+HQ QLIGQY EKFRENNS Sbjct: 206 SDSLESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSS 265 Query: 1351 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS----- 1187 T DSC+PGNHSDVTEER E+K P+ AAG S +Q TK + D F+E+ + Sbjct: 266 TPDSCEPGNHSDVTEERDEVK-PQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTIS 324 Query: 
1186 -------KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFP 1043 LQ +N +++ ES A +F FPM++E +QEFL Q + S +P Sbjct: 325 TTHLHGDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSHSSHHYP 382 Score = 103 bits (257), Expect = 3e-19 Identities = 77/220 (35%), Positives = 109/220 (49%), Gaps = 20/220 (9%) Frame = -3 Query: 937 AVVPQDN---LGSVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIP 767 A+VP++ LG VLEAL++A+ SL KLN P G + G +PS T + +IP Sbjct: 472 ALVPRETSNELGGVLEALQQARLSLQHKLNRLPLIEGGSIGRAIEPSFPSTRAWERVEIP 531 Query: 766 VISPGLFRLPTDYQ----------PENARPGFANFPPE-----NSLGRFLSEPF--DSRS 638 V GLFR+P DYQ +++ N+ P+ N RFL+ P+ S Sbjct: 532 VGCAGLFRVPADYQLGTATEANFLGSDSQSSLKNYYPDTGFVANPGDRFLTSPYLKTGSS 591 Query: 637 AFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSNRMNRLDSYTNPVLPSVKDSYPFLPDVT 458 + D FLT PYR P +P + S ++ YT+P +Y PD+ Sbjct: 592 VPTDDSFLTSPYRETGSRIPPLRPSF-DYYSDAGLSASTRYTHP-------TYSSHPDLL 643 Query: 457 LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 338 R+P NEG A R +SE G+P S YD+H+RP+MYR Sbjct: 644 YRMPFNEGFA-RPPRNSEVGIPSTDHFSFYDDHIRPNMYR 682 >ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Solanum tuberosum] gi|565389467|ref|XP_006360477.1| PREDICTED: flocculation protein FLO11-like isoform X2 [Solanum tuberosum] gi|565389469|ref|XP_006360478.1| PREDICTED: flocculation protein FLO11-like isoform X3 [Solanum tuberosum] gi|565389471|ref|XP_006360479.1| PREDICTED: flocculation protein FLO11-like isoform X4 [Solanum tuberosum] gi|565389473|ref|XP_006360480.1| PREDICTED: flocculation protein FLO11-like isoform X5 [Solanum tuberosum] Length = 678 Score = 272 bits (696), Expect = 4e-70 Identities = 225/656 (34%), Positives = 320/656 (48%), Gaps = 29/656 (4%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 ED++Q K +++S TIEFLR RLL+ERS S+TA+QRADELA++V+ELEEQLK VSLQ Sbjct: 7 EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQRVSELEEQLKAVSLQ 65 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RKKAE+ATA VL+ILENH I DVSEEF 
S SD+E D K + ++ Sbjct: 66 RKKAERATAAVLSILENHSIDDVSEEFSSGSDKEAILSDQKDAENKTGGDISSSVKE-KE 124 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1697 + ST RSLSW+S K S H+L+++ Y DS RRR ++F S +S+ +RV Sbjct: 125 DDVDTLSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSSTDISSPKRV 183 Query: 1696 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1517 G SCRRIR RDTRS D + + A C+ + S G N + S Sbjct: 184 GNSCRRIRRRDTRSASDKLQNSS--AECASEPLPSSANNEPHPLTAGAGINDVNDQVHVS 241 Query: 1516 NSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337 + + G + + D+D + AL Q QLIGQY EK+RE+N T DSC Sbjct: 242 AID---VSGNGKEADKSDEDSQRALHQQAQLIGQYEAEEKAQREWEEKYRESNICTPDSC 298 Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEK----------PETS 1187 D N+SDVTEER ++K+ + AG ++ N + D +E+ P + Sbjct: 299 DRENYSDVTEERDDLKASQEPCLAGNTSMQNHANQSGAADVSRTEQNGNIDNSPSTPHVN 358 Query: 1186 KRSLQNE---NIISCESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016 L+++ + +S ASE + PMS N +L S Y QQ P+ + Sbjct: 359 MSCLEDKKGSRTVESDSPASELARPMS----NGNYLENHGQTSAYSHQQSLPVTR----- 409 Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845 SP +S++L ELA+V +++ SVL L++AK SL +++N+S P Sbjct: 410 ----SPMHPRSSSLQAGQAPQTGYELALVSHNTSNSVNSVLGELEQAKLSLTKQINSSLP 465 Query: 844 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPPENSLGRF 665 TA S N++++ P+ Y+ Sbjct: 466 TASYPGMPSRFSSVNQSSE----------------PSTYETS------------------ 491 Query: 664 LSEPFDSRSAF-SSDLFLTDPYRPFTPE--------RPFSQPRLSEG-PSSNRMNRLDSY 515 LS +SRS + + +T P++ PE RP S+ G PSS R N S Sbjct: 492 LSPYMESRSKYVTQGNRVTYPFQRAFPEVSSSAPSYRPISETNFDAGQPSSMRFNPNSSS 551 Query: 514 TNPVLPS-VKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRP 350 P+ SYP PD+ ++P NE SRN+P +E LPP S++ V P Sbjct: 552 RLPLSSKFTYPSYPKFPDMVPKLPPNE-VFSRNYPRNETDLPPSFSFSTWSPEVVP 606 >ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251943 [Solanum lycopersicum] Length = 729 Score = 266 bits (680), Expect = 3e-68 Identities = 231/673 (34%), Positives = 334/673 
(49%), Gaps = 46/673 (6%) Frame = -3 Query: 2230 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2051 ED++Q K +++S TIEFLR RLL+ERS S+TA+QRADELA+ V+ELEEQLK VSLQ Sbjct: 7 EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQMVSELEEQLKVVSLQ 65 Query: 2050 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1871 RK+AEKATA VL+ILE+H I DVSEEF S SD+E D K G K ++ Sbjct: 66 RKRAEKATAAVLSILEDHSIDDVSEEFSSGSDKETILSDQKDA-GNKTGGDISSSAKEKE 124 Query: 1870 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1697 + ST RSLSW+S K S H+L+++ Y DS RRR ++F +S+ +RV Sbjct: 125 DDVDILSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSYTDISSPKRV 183 Query: 1696 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1517 G SCR+IR RDTRS D + + A C+ + + S G + + Sbjct: 184 GNSCRQIRRRDTRSASDKLRNSS--AECASEPLSSSANNEPHSLTAGAGISDVNDQV--- 238 Query: 1516 NSETQKMDGRYFDVHERDDDMESALQHQVQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1337 + + G + + D+D + AL QVQ IGQY EK+RE+NS T DSC Sbjct: 239 HVPALDVPGNGREADKSDEDSQRALHQQVQPIGQYEAEEKAQREWEEKYRESNSCTPDSC 298 Query: 1336 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENI- 1160 D N+SDVTEER ++K+ + AG ++ N + D +++ S N+ Sbjct: 299 DRENYSDVTEERDDLKASQEPCLAGRTSMQNHANQCGAADVSRTKQNGNIDNSPSTPNVN 358 Query: 1159 ISC------------ESSASEFSFPMSREKNNQEFLGIQHNASQYRSQQFPPMVQTTTQS 1016 +SC +SSASE + PMS +L S + QQ P+ + Sbjct: 359 MSCLEDKKGSRTVGSDSSASELARPMS----TGNYLENHGQTSAFSHQQSFPVTR----- 409 Query: 1015 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 845 S +S++L + ELA+V + + SVL L++AK SL +++N+S P Sbjct: 410 ----SSMHPRSSSLQAGQALQTGYELALVSHNTSNGVDSVLGKLEQAKLSLTKQINSSLP 465 Query: 844 TAGRASGSVFQPSNNETNKTDSFQIPVISPGL----------FRLPTDYQ---PE--NAR 710 TA S N + + +++I + P + R+ +Q PE ++ Sbjct: 466 TASYPGTPSRFSSLNHSPELSTYEISLTPPYVESRSKYVTQSNRVTYPFQRAFPEVSSSA 525 Query: 709 PGFANFPPEN-SLGRFLSEPF-DSRSAF-SSDLFLTDPY-RPFT-------PERPFSQPR 563 P + N G+ S P+ +SRS + + +T P+ R FT RP S+ Sbjct: 526 PSYRPISETNFEAGQPSSTPYVESRSKYVTQSNRVTYPFQRAFTEVSSSAPSYRPISETN 585 Query: 562 
LSEG-PSSNRMNRLDSYTNPVLPSVK-DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPP 389 G PSS R N S P + SYP PD+ ++P NE SRNFP++E LPP Sbjct: 586 FDAGQPSSVRFNPNSSSRLPFSSKLTYPSYPKFPDMVPKLPPNE-VFSRNFPTNETDLPP 644 Query: 388 VMRLSSYDEHVRP 350 S+ + V P Sbjct: 645 SFSFSTLSQEVVP 657