BLASTX nr result
ID: Rehmannia24_contig00012103
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia24_contig00012103 (2379 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr... 334 1e-88 ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr... 328 7e-87 ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c... 317 1e-83 ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyp... 311 6e-82 ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267... 308 9e-81 gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis] 305 5e-80 ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyp... 305 6e-80 gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus pe... 305 8e-80 ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309... 303 2e-79 ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207... 300 3e-78 gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao] 297 1e-77 gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus... 295 5e-77 gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao] 294 1e-76 ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp... 288 6e-75 ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyp... 284 1e-73 ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu... 281 9e-73 gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma caca... 280 2e-72 emb|CBI40233.3| unnamed protein product [Vitis vinifera] 279 3e-72 ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like i... 276 4e-71 ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251... 266 4e-68 >ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|568878417|ref|XP_006492190.1| PREDICTED: uncharacterized protein LOC102610545 [Citrus sinensis] gi|557538863|gb|ESR49907.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 732 Score = 334 bits (856), Expect = 1e-88 Identities = 273/739 (36%), Positives = 363/739 (49%), Gaps = 107/739 (14%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 E ++QR + M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQ Sbjct: 7 EMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RKKAEKATA+VLAILEN+GIS++S+ FDS SDQ E+P + + N K R+ Sbjct: 67 RKKAEKATADVLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRR 125 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RR 1722 N R LSW + ++ +LEK Y DS +RRR+SF S S+ R Sbjct: 126 NASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNR 183 Query: 1721 VGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTL 1545 VGKSCR+IR R+++S + TE G V + E G E L Sbjct: 184 VGKSCRQIRRRESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYL 241 Query: 1544 RSNS-----ETQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389 S E +K+ G F+ D DME AL+ QAQLIG+Y E+FR Sbjct: 242 GEGSDSGCFENEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFR 301 Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE 1224 ENNS T DSCDPGN SDVTEER E K ++ R AGT NS QE K E Q+ S Sbjct: 302 ENNSSTPDSCDPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSN 360 Query: 1223 ---KPETSKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRS-QQFPPMV 1056 P++ + + + E A +F+F MS EK NQE LG H + S + P Sbjct: 361 GFLPPQSGDQKCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHG 418 Query: 1055 QTTTQSSTKISPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLN 891 QSS +S +T S+ ++ S + A+VP VLEALK+A+ SL Sbjct: 419 SPENQSSQTVS----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLR 474 Query: 890 QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------N 738 QK+++ P T R+ G V +PS + + D +IPV GLFR+PTDY E + Sbjct: 475 QKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSD 534 Query: 737 ARPGFANFPPENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTP 609 +RP AN+ P + +G + D+RS F++ DLFLT P ++ Sbjct: 535 SRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSA 594 Query: 608 ERPFSQPRLSEGPSSSNRMN-RLDSYTNPVLPSVK-------DSYP-------------- 495 E + S+ S + M DS + LPS + SYP Sbjct: 595 ENRLLTRQYSDTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLST 654 Query: 494 FL---------------------------------PDVTLRVPLNEGGASRNFPSSERGL 414 FL PD+ ++P +E G S PS G+ Sbjct: 655 FLPGRSVEMSVEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGM 713 Query: 413 PPVMRLSSYDEHVRPDMYR 357 PP L +++H RP MYR Sbjct: 714 PPANHLPFHNDHTRPYMYR 732 >ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|557538862|gb|ESR49906.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 716 Score = 328 bits (841), Expect = 7e-87 Identities = 270/729 (37%), Positives = 357/729 (48%), Gaps = 107/729 (14%) Frame = -2 Query: 2222 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2043 M++SN MTIEFLRARLLSERSVSK+ARQRADELA++V ELEEQLK VSLQRKKAEKATA+ Sbjct: 1 MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60 Query: 2042 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1863 VLAILEN+GIS++S+ FDS SDQ E+P + + N K R+N Sbjct: 61 VLAILENNGISEISDSFDSGSDQ-ETPCESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119 Query: 1862 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSA--RRVGKSCRRIRH 1692 R LSW + ++ +LEK Y DS +RRR+SF S S+ RVGKSCR+IR Sbjct: 120 NDFSPVPHRGLSWNGRRGTKQSLEK--YKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRR 177 Query: 1691 RDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALR-EYENGKNQLESSTLRSNS-----E 1530 R+++S + TE G V + E G E L S E Sbjct: 178 RESKSAVEELK--TEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYLGEGSDSGCFE 235 Query: 1529 TQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359 +K+ G F+ D DME AL+ QAQLIG+Y E+FRENNS T DSC Sbjct: 236 NEKLVTGGGIDFNGCGGDKDMEKALEDQAQLIGRYEEMEKAQREWEERFRENNSSTPDSC 295 Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQE-----QVDACFSE---KPETSKR 1203 DPGN SDVTEER E K ++ R AGT NS QE K E Q+ S P++ + Sbjct: 296 DPGNQSDVTEEREESK-VQVQRVAGTVNSQVQEAKTEVHLSNQLSNTKSNGFLPPQSGDQ 354 Query: 1202 SLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRS-QQFPPMVQTTTQSSTKI 1026 + + E A +F+F MS EK NQE LG H + S + P QSS + Sbjct: 355 KCSSTP--ASEPLAQDFAFTMSNEKQNQESLGNNHYVPSHSSHHRLHPHGSPENQSSQTV 412 Query: 1025 SPYEEKSTALSTPPKI--SLPLELAVVP---QDNLGSVLEALKRAKSSLNQKLNNSPPTA 861 S +T S+ ++ S + A+VP VLEALK+A+ SL QK+++ P T Sbjct: 413 S----SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQARLSLRQKMSSLPSTE 468 Query: 860 GRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE---------NARPGFANFPP 708 R+ G V +PS + + D +IPV GLFR+PTDY E ++RP AN+ P Sbjct: 469 SRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKANFLVSDSRPSLANYNP 528 Query: 707 ENSLGRFLSEP------FDSRSAFSS-------DLFLTDP----YRPFTPERPFSQPRLS 579 + +G + D+RS F++ DLFLT P ++ E + S Sbjct: 529 TSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPSTDTRSSYSAENRLLTRQYS 588 Query: 578 EGPSSSNRMN-RLDSYTNPVLPSVK-------DSYP--------------FL-------- 489 + S + M DS + LPS + SYP FL Sbjct: 589 DTRSRVSMMRPSFDSNLDAGLPSFRQYMYPNFSSYPDQVPQVPRNERLSTFLPGRSVEMS 648 Query: 488 -------------------------PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 384 PD+ ++P +E G S PS G+PP L ++ Sbjct: 649 VEISPMLDAGLSSSSQSANPYFSSYPDLMPQIPAHE-GLSTLRPSRSAGMPPANHLPFHN 707 Query: 383 EHVRPDMYR 357 +H RP MYR Sbjct: 708 DHTRPYMYR 716 >ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis] gi|223526443|gb|EEF28720.1| hypothetical protein RCOM_0152200 [Ricinus communis] Length = 665 Score = 317 bits (813), Expect = 1e-83 Identities = 249/649 (38%), Positives = 337/649 (51%), Gaps = 49/649 (7%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 E ++QR + M++S AMTIEFLRARLLSERSVS+TARQRADELA +VAELEEQL+ VSLQ Sbjct: 7 EKQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQLRIVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 R KAEKATA++LAILE +GISD+SE FDSCSD+ ++P + K N K+R Sbjct: 67 RMKAEKATADILAILEGNGISDISETFDSCSDR-DTPCESKVGN-RSSKEENSINSKVRN 124 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS-NSLSARRVG 1716 N GRSLSW+ K+S +LEK S+RRR+SF S S +R G Sbjct: 125 NDSEELSGSDFDFSSVPGRSLSWKGRKNSPRSLEKSK-DSSMRRRSSFSSVGSSPKQRPG 183 Query: 1715 KSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE------S 1554 KSCR+IR +++R K C D + + ++E + +++ Sbjct: 184 KSCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDFEPKRGEVKPLLEDSH 240 Query: 1553 STLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSG 1374 S N +G ++V+ D DME AL+HQAQLIGQY EKFRENNS Sbjct: 241 SDCLGNERNASDNGLDYNVYRGDRDMEKALEHQAQLIGQYEAMEKVQREWEEKFRENNSS 300 Query: 1373 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQ 1194 T DSCD GN SD+TEERYE++ P ++ T+N+ E V+ + +P S Sbjct: 301 TPDSCDHGNRSDITEERYEIREP--AKGPATTNAIQTEGLLSVVEGVSNTQPHGFLPSSH 358 Query: 1193 NENIISCESSAS-----EFS-----FPMSREKNNQE---------FLGIQHDASQYRSQQ 1071 + + E +S EFS FPM++ K NQ+ L HD++ + SQ Sbjct: 359 VDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQKNPGNNDHSPLLIAHHDSASFGSQY 418 Query: 1070 FPPMVQTTT-QSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRAKSSL 894 + S+T S + K+T+ S + +L A LG VLEAL+ A+ SL Sbjct: 419 SSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVPHKA---SGGLGGVLEALEEARQSL 475 Query: 893 NQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFAN 717 Q++N P A SV + S + T D QIPV GLFRLPTD+ E N R + Sbjct: 476 QQRINRLPSVATTVRKSV-ESSVSTTISRDEVQIPVGCVGLFRLPTDFSVEGNTRANLLS 534 Query: 716 FPPENSLG--------------RFLSEPF-DSRSAFSS-DLFLTDPY-----RPFTPERP 600 + SLG +F++ P+ RS+ S+ D FL+ Y R TP +P Sbjct: 535 SSAQLSLGNHYSDRGVPAAASNQFVASPYLQGRSSSSTEDQFLSSQYVGGGSRIPTP-KP 593 Query: 599 FSQPRLSEGPSSSNRMNRLDSYTNPVLPSVKDSYPFLPDVTLRVPLNEG 453 + P L G SS+R YT P P + SY PD+ R+P EG Sbjct: 594 YFDPYLDTGLPSSSR------YTYPNYP-INTSY---PDLMPRIPSREG 632 >ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1 [Solanum tuberosum] Length = 643 Score = 311 bits (798), Expect = 6e-82 Identities = 255/679 (37%), Positives = 330/679 (48%), Gaps = 47/679 (6%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 +D++QRK M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899 RKKAEKATA VL+ILEN GISD SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1725 + STGRSLSW+S K S + E+ Y DS RR SF S S S + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185 Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNG-SDGEHVALREYENGKNQLESST 1548 R GKSCRRIR T++ D C + ++ H +L + G N ++ Sbjct: 186 RAGKSCRRIRRNTTKTATDE---------CPPEHLPSFANNGHQSLMD-SAGNNDVKD-- 233 Query: 1547 LRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1368 + + T +M E D+ ME ALQH+AQLIGQY EK+RENN+ Q Sbjct: 234 -QRHLPTSEMSENQRKSDESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQ 292 Query: 1367 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE---- 1215 DSCDPGN+SDVTEER +MK+ E +A N N K ++VD P Sbjct: 293 DSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHI 352 Query: 1214 -TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQ 1041 TS R QN II+ ES ASEF+ K+N Y Q P Sbjct: 353 GTSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP-------- 400 Query: 1040 SSTKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSP 870 S SP ++S+ SL A+V + DN+GS+L AL++AK S++Q++N SP Sbjct: 401 -SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSP 459 Query: 869 PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLG 693 G +S P T + D I PGLFRLPTD+Q E + FP S Sbjct: 460 IAEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSA 515 Query: 692 RFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPV-LP 516 EP D F T PY E P + + + +N + +P Sbjct: 516 NHFHEP-------GYDQFSTTPYM----ESPSNAITGLPYTTGFDYLNPPSGFGHPFSSK 564 Query: 515 SVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGL 414 S +YPF P+ T V PL E + R+ P +E G Sbjct: 565 STYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGK 624 Query: 413 PPVMRLSSYDEHVRPDMYR 357 PP +S YD H+RP+MYR Sbjct: 625 PPSFPVSHYDAHLRPNMYR 643 >ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267607 [Solanum lycopersicum] Length = 617 Score = 308 bits (788), Expect = 9e-81 Identities = 250/682 (36%), Positives = 328/682 (48%), Gaps = 50/682 (7%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 +D++QRKT M E+++MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 KDQDQRKTVGM-ENSSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899 RKKAEKATA VL+ILEN GI+D SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGITDASEEFDSGSDQEAIFSNSKGADSTDNRNEYKPDPSNVK 125 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSNSLSA-R 1725 + STGRSLSW+S K S + E+ Y DS RR SF S S+ + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGTSSPK 185 Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1545 R GKSCRRIR +T + + ND QL T Sbjct: 186 RAGKSCRRIRRSNTNAGNNDVND------------------------------QLHLPTS 215 Query: 1544 RSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1365 ++ +K D E D+ ME ALQH+A LIG+Y EK+RENN QD Sbjct: 216 ETSENQRKAD-------ESDEGMERALQHKALLIGKYEAEEKAQREWEEKYRENNYA-QD 267 Query: 1364 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFS--------EKPETS 1209 SCDPGN+SDVTEER +MK+ E +A N N K ++VD + P S Sbjct: 268 SCDPGNYSDVTEERDDMKAFEQPYSAEMINLQNHANKFQEVDIPSTNGVTDNVPSNPHIS 327 Query: 1208 KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038 +++N II+ ES ASEF+ P K+N Y Q P Sbjct: 328 TSCRKDQNCSRIINSESPASEFALP----KSNGSCPENDGPTPAYCHHQLP--------- 374 Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867 S+ SP + ++S+ SL A+V DN+GS+L AL++AK S++Q++N S P Sbjct: 375 SSNGSPIQPLENSISSSGGSSLQAGQALVSGDASDNIGSILGALEQAKFSISQQINVS-P 433 Query: 866 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 690 GR+S + S D IP PGLFRLPTD+Q E + FP S Sbjct: 434 VEGRSS---IEHSIPTAKIEDRLDIPPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 490 Query: 689 FLSEPFDSRSAFSSDLFLTDPYR-----PFTPERPFSQPRLSEGPSSSNRMNRLDSYTNP 525 EP + FS+ ++ P P+T + P S G S++ Sbjct: 491 HFHEP--GYNQFSATPYMESPSNAITGLPYTTGFDYLNPPSSFGHPFSSK---------- 538 Query: 524 VLPSVKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSE 423 S +YPF P+ T V PL E + R+ P +E Sbjct: 539 ---STYPTYPFRPNTTTTVSQSQASWSPLYESSLTKSSPVVVPNLSSGEDVFLRSLPRNE 595 Query: 422 RGLPPVMRLSSYDEHVRPDMYR 357 G PP +S YD H+RP+MYR Sbjct: 596 TGKPPSFPVSHYDAHMRPNMYR 617 >gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis] Length = 654 Score = 305 bits (782), Expect = 5e-80 Identities = 257/689 (37%), Positives = 345/689 (50%), Gaps = 57/689 (8%) Frame = -2 Query: 2252 EDREQRKTTSMQESN--AMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079 E ++QR ++SM++S AMTIEFLRARLLSERSVS++ARQRADEL K+V ELEEQL+ VS Sbjct: 7 EKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEEQLRIVS 66 Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899 LQRK AEKAT +VL+ILENHGISD SE +DS SDQE NG Sbjct: 67 LQRKMAEKATVDVLSILENHGISDASETYDSGSDQETHQVANNYANGEERSVVSK----- 121 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-----ASFGSNSL 1734 R++ GRSLSW+ DS + EK Y DS RR +SFGS+S Sbjct: 122 RRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREK--YKDSSVRRQNALSSSFGSSS- 178 Query: 1733 SARRVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLES 1554 VGKSCR+IR R+TR++ + E + ENG Sbjct: 179 PKHYVGKSCRQIRCRETRTVVEDHKT-----------------EPLKFDSQENGAATPPE 221 Query: 1553 STLRSNSETQKMDGRYFDV--HERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENN 1380 +++++ + DV H ++ DM+ AL+H+AQLIGQY EK+RENN Sbjct: 222 GSVKNDRRIP----NHLDVNGHGQEKDMKKALEHRAQLIGQYEEMEKAQREWEEKYRENN 277 Query: 1379 SGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-ACFSEKPETS-- 1209 + T DS DPGNHSDVTE+R E+K+ L G + + K +VD + S KP+++ Sbjct: 278 TSTPDSYDPGNHSDVTEDRDEVKAQTLYN-VGIDIAQAVDAKSNKVDLSKESSKPQSNGF 336 Query: 1208 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQ 1071 ++ N + ++ A EF+FP ++EK QE L + +R + Sbjct: 337 LHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQESL----ENRDFRPSE 392 Query: 1070 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLEL--------AVVPQDN---LGSVL 924 P Q +S P++ ALS S + A+VP + LG VL Sbjct: 393 SPHHGQLLHRSLPN-QPFDR--GALSDAGSSSHKRDFSGSQNDLYALVPHNPPVVLGGVL 449 Query: 923 EALKRAKSSLNQKLNNSP----PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPT 756 +ALK+AK SL QK+N P T A +P+ T D +IPV GLFRLPT Sbjct: 450 DALKQAKLSLQQKINRLPLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVGCTGLFRLPT 509 Query: 755 DYQPENARPGFANFPPENSLGRFLSEPF--DSRSAFSS-DLFLTDPY----RPFTPE-RP 600 D+ A ANF S R EP+ D++ A ++ D FLT PY F P+ R Sbjct: 510 DFATVEASTQ-ANFLSSGS--RLSLEPYYPDNKVALTAPDRFLTSPYIESRSEFPPDVRF 566 Query: 599 FSQPRLSEGPSSSNRMNRLDSYTNPVLPSVK--------DSYPFLPDVTLRVPLNEGGAS 444 + + G +S +R DS+ + SV SYP PD R+P +E G Sbjct: 567 LTSSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYPPFPDSMPRIPSDE-GLR 625 Query: 443 RNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 R F SS P R S YD+H RP+MYR Sbjct: 626 RPFRSSRSFGLPEDRFSFYDDHGRPNMYR 654 >ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2 [Solanum tuberosum] Length = 618 Score = 305 bits (781), Expect = 6e-80 Identities = 253/678 (37%), Positives = 322/678 (47%), Gaps = 46/678 (6%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 +D++QRK M++S+ MTIEFLRARLL+ERSVS+TARQRADELA++V ELE+QLK VSLQ Sbjct: 7 QDQDQRKIVGMEDSS-MTIEFLRARLLAERSVSQTARQRADELAERVLELEDQLKIVSLQ 65 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXK--L 1899 RKKAEKATA VL+ILEN GISD SEEFDS SDQE + K + Sbjct: 66 RKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKPNPSNVK 125 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSV-RRRASFGSN-SLSAR 1725 + STGRSLSW+S K S + E+ Y DS RR SF S S S + Sbjct: 126 ERENDADISSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFASTGSSSPK 185 Query: 1724 RVGKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTL 1545 R GKSCRRIR T + + D + L +S + Sbjct: 186 RAGKSCRRIRRNTTNAGNNDVKD----------------------------QRHLPTSEM 217 Query: 1544 RSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQD 1365 N +K D E D+ ME ALQH+AQLIGQY EK+RENN+ QD Sbjct: 218 SENQ--RKSD-------ESDEGMERALQHKAQLIGQYEAEEKAQREWEEKYRENNNYAQD 268 Query: 1364 SCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVD-----ACFSEKPE----- 1215 SCDPGN+SDVTEER +MK+ E +A N N K ++VD P Sbjct: 269 SCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDIPSTNGVTDNVPSTPHIG 328 Query: 1214 TSKRSLQN-ENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038 TS R QN II+ ES ASEF+ K+N Y Q P Sbjct: 329 TSCRKDQNCSRIINSESPASEFAL----SKSNGSCPENDGPTPAYSRHQLP--------- 375 Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVVPQ---DNLGSVLEALKRAKSSLNQKLNNSPP 867 S SP ++S+ SL A+V + DN+GS+L AL++AK S++Q++N SP Sbjct: 376 SANGSPIHPLENSISSSGGSSLQAGQALVSRDASDNIGSILGALEQAKFSISQQINVSPI 435 Query: 866 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-NARPGFANFPPENSLGR 690 G +S P T + D I PGLFRLPTD+Q E + FP S Sbjct: 436 AEGGSSIEHSIP----TARIDRLDILPGFPGLFRLPTDFQLEATTTASYQGFPSRFSSAN 491 Query: 689 FLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPV-LPS 513 EP D F T PY E P + + + +N + +P S Sbjct: 492 HFHEP-------GYDQFSTTPYM----ESPSNAITGLPYTTGFDYLNPPSGFGHPFSSKS 540 Query: 512 VKDSYPFLPDVTLRV--------PLNEGGAS------------------RNFPSSERGLP 411 +YPF P+ T V PL E + R+ P +E G P Sbjct: 541 TYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVPNLSSGEEVFLRSLPRNETGKP 600 Query: 410 PVMRLSSYDEHVRPDMYR 357 P +S YD H+RP+MYR Sbjct: 601 PSFPVSHYDAHLRPNMYR 618 >gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica] Length = 690 Score = 305 bits (780), Expect = 8e-80 Identities = 256/700 (36%), Positives = 333/700 (47%), Gaps = 68/700 (9%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 + ++QR M++S AMTIEFLRARLL+ERSVS++ARQR DEL + V ELEEQLK VSLQ Sbjct: 7 DTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQLKIVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVS-EEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLR 1896 RK AEKAT +VLAILE+ GISD+S EEFDS SDQ E+ K N K+R Sbjct: 67 RKMAEKATEDVLAILESQGISDISEEEFDSSSDQ-ETHQGSKVGNSLANEEESFVISKVR 125 Query: 1895 KNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-- 1722 + GRSLSW+ DS + EK + SVRRR+SF S S+ R Sbjct: 126 RKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSREKCKDL-SVRRRSSFSSIGFSSPRHH 184 Query: 1721 VGKSCRRIRHRDTRSME-DSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESS 1551 +GKSCR+I+H++TRS + DS +G A S N S+G LRE + L + Sbjct: 185 LGKSCRQIKHKETRSDKFDSHENGV--GASSEGLPNFSNGGPEKLREGSEFPEEKVLSND 242 Query: 1550 TLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGT 1371 +L E Q+ F+ H RD DME AL+HQA+LI + EKFRENN+ T Sbjct: 243 SLSRTKENQRDSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQREWEEKFRENNTST 302 Query: 1370 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC------------FS 1227 DSCDPGNHSD+TEER E+K+ + +AG + QETK E+ D C F Sbjct: 303 PDSCDPGNHSDITEERDEIKA-QTPCSAGVVVAQAQETKSEEGDVCLPKETFKIQQNGFL 361 Query: 1226 EKPETSKRSLQNE--NIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQ 1053 LQ++ S EF+FP K N E L + S P + Sbjct: 362 PASHVDMGGLQDQLNKSTVAPSQVEEFAFPTENGKQNHESLENFARHPSHGSHPNPLVHG 421 Query: 1052 TTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEALKRAKSSLNQKL 882 + S+ S S S A+VP QD LG VL+ALK+AK SL Q + Sbjct: 422 SAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHDSQDRLGGVLDALKQAKLSLQQNM 481 Query: 881 NNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA----------- 735 P G + +PS D +IPV GLFRLPTD+ E A Sbjct: 482 TRLPLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFAVEEAATQSSFLGSSW 541 Query: 734 -----------------RPGFANFPPENSLGRFLSEPF-DSRSAFS---SDLFLTDPYRP 618 RP F+ N+ R++ P+ ++R FS +D F+ + Y Sbjct: 542 SGRYCPETLVTSSFVETRPTFS----MNAADRYVPSPYIETRQTFSTNATDRFIPNAYVE 597 Query: 617 FTPERPFSQPR-LSEGPSSSNRMN------------RLDSYTNPVLPSVKDSYPFLPDVT 477 P P + PS R N Y P P +YP +PD T Sbjct: 598 SRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPPYP----NYPSVPDRT 653 Query: 476 LRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 + +E +R P G P R S YD+ RP+MYR Sbjct: 654 PWITSDE-ALTRALPRKPVG-APTDRFSFYDQ-FRPNMYR 690 >ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca subsp. vesca] Length = 807 Score = 303 bits (776), Expect = 2e-79 Identities = 221/577 (38%), Positives = 297/577 (51%), Gaps = 34/577 (5%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 + ++ R + M +S +TIEFLRARLLSERSVS++ARQRADEL K V ELEEQLK VSLQ Sbjct: 7 DTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQLKIVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQE---ESPHDFKARNGXXXXXXXXXXXK 1902 RK AEKATA+VLAILEN G SD+SEEFDS SD E ES K+R Sbjct: 67 RKMAEKATADVLAILENQGASDISEEFDSSSDHETFQESKMGNKSRKEEENFLISE---- 122 Query: 1901 LRKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSA 1728 R+N GR+LSW+ DS + EK S+RRR++F + +S S Sbjct: 123 -RRNEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSREKYK-EPSIRRRSTFSAVGSSSSR 180 Query: 1727 RRVGKSCRRIRHRDTRSM-----------EDSQNDGTEKAACSGDAFNGSDGEHVALREY 1581 +GKSCR+I+HR+TRS+ +DS+ +G ++ F+ D E + Sbjct: 181 HNLGKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDGPE 240 Query: 1580 ENGKNQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXX 1401 + L L + E Q+ F+ H R+ DME AL+HQAQLIGQ Sbjct: 241 SQKEKFLSKDALTRSKEHQRNGDPNFNGHGRNKDMERALEHQAQLIGQNEEMEMAQREWE 300 Query: 1400 EKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDAC-FSE 1224 EKFRENN+ T DSCDPGNHSD+TEER EMK+P A + S+ QE K E D+C F E Sbjct: 301 EKFRENNTSTPDSCDPGNHSDITEERDEMKTP---FPAEINASEAQEAKSEARDSCLFEE 357 Query: 1223 KPET--------------SKRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQ 1086 K +T + N + ++ S EF+FP + E+ QE L Sbjct: 358 KMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLENNAHQPS 417 Query: 1085 YRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVP---QDNLGSVLEAL 915 S P +++++ S+ +S S ++ + L A+VP Q+ LG VL+AL Sbjct: 418 PGSHHDPLLLESSHNRSSVVSSDGGSSFHNASGSRNDL---YALVPHDSQERLGGVLDAL 474 Query: 914 KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENA 735 K+AK SL QK+ P + +P + IPV GLFRLPTD+ E A Sbjct: 475 KQAKLSLQQKIIRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCAGLFRLPTDFAVEEA 534 Query: 734 RPGFANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPY 624 + +SL P +A S+D F+T Y Sbjct: 535 ATKHSYLGLGSSLPSARYCPDKGLAASSTDQFVTSTY 571 >ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus] Length = 671 Score = 300 bits (767), Expect = 3e-78 Identities = 250/703 (35%), Positives = 335/703 (47%), Gaps = 71/703 (10%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 + ++ R ++++ AMTIEFLRARLLSERSVSK+ARQRADELAK+VAELEEQLK VSLQ Sbjct: 7 DQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQLKIVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RK AEKATA+VLAILE++G SD+SE DS SD E P K +G + R+ Sbjct: 67 RKMAEKATADVLAILEDNGASDISETLDSNSDHETEP---KVEDGLAREDVSSGTVR-RR 122 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGS--NSLSARRV 1719 N G SLSW+ DS H EK S+R R+SF S +S ++ Sbjct: 123 NEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYK-KHSIRSRSSFTSIGSSSPKHQL 181 Query: 1718 GKSCRRIRHRDTRSMEDSQN-------DGTEKAACSG--DAFNGSDGEHVALRE-YENGK 1569 G+SCR+I+ RDTR ++ Q D +E+ + D+ N S H LR+ YE + Sbjct: 182 GRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDGYEVRE 241 Query: 1568 NQLESSTLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389 SS+ NS D +E+ DDME AL+ QAQLI QY EKFR Sbjct: 242 KTRSSSSGVHNSVGNSDQDNDIDGYEKVDDMEKALKCQAQLIDQYEAMEKAQREWEEKFR 301 Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMK--SPELSRAAGTS-------NSDNQETKQEQVDA 1236 ENN+ T DSCDPGNHSD+TEER EM+ +P LS + D ++ Q Q + Sbjct: 302 ENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNPANEAKPQVAFDCDTRDLSQAQTNG 361 Query: 1235 CFSEKPETSKRSL--QNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPP 1062 L QN N IS S EF+FPM+ K QE SQ S Q P Sbjct: 362 LGPSMCAVDVEDLQDQNTNSISTSKSLEEFTFPMANVKQCQE--------SQENSAQEPS 413 Query: 1061 --------MVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLGSVLEALKRA 906 + + S I+ Y++++ + +P E L VLEALK+A Sbjct: 414 CTSHLNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHE-----PPALDGVLEALKQA 468 Query: 905 KSSLNQKLNNSPPTAG------RASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQP 744 K SL +K+ P G ++ G + P D +IPV GLFRLPTD+ Sbjct: 469 KLSLTKKIIKLPSVDGESESIDKSIGPLSIPKMG-----DRLEIPVGCAGLFRLPTDFAA 523 Query: 743 E-NARPGF----------ANFPPENSL----------------------GRFLSEPFDSR 663 E +++ F ++P E + R S + + Sbjct: 524 EASSQANFLASSSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSGYRAG 583 Query: 662 SAFSSDLFLTDPYRPFTPERPFSQPRLSEGPSSSNRMNRLDSYTNPVLP-SVKDSYPFLP 486 S F+ D FLTD PE + P + + D Y + V P S +YP P Sbjct: 584 SGFTRDGFLTD----HIPENRWKNP---------GQKHHFDQYFDAVQPSSYVHNYPPRP 630 Query: 485 DVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 V+ + N+ R FP +PP + S YD+ RP+MYR Sbjct: 631 -VSSNIHPND-TFLRTFPGRSTEMPPTNQYSFYDDQFRPNMYR 671 >gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 709 Score = 297 bits (761), Expect = 1e-77 Identities = 238/717 (33%), Positives = 354/717 (49%), Gaps = 83/717 (11%) Frame = -2 Query: 2258 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079 S + ++ ++TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS Sbjct: 4 SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63 Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899 +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+ Sbjct: 64 VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1725 R+ ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R Sbjct: 123 RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180 Query: 1724 -RVGKSCRRIRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYE 1578 R GKSCR+IR R++RS M D Q G E ++ +A + + G H+ E Sbjct: 181 HRQGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSE 239 Query: 1577 NGKNQLESSTLRSNSETQKMDGRYFDV----HERDDDMESALQHQAQLIGQYXXXXXXXX 1410 +N+ L S++ + + FD+ +E + DME AL+HQAQLI Y Sbjct: 240 IHENKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQR 299 Query: 1409 XXXEKFRENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACF 1230 EKFRE NS + DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + Sbjct: 300 EWEEKFREKNSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFS 357 Query: 1229 SEKPETS--------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFL 1110 +E P+ RSL E+ ++ S + +F M++E ++Q Sbjct: 358 AELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ 416 Query: 1109 GIQHDASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG- 933 +++ S F + + + + S + P+ L A+VP + G Sbjct: 417 --SNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGR 473 Query: 932 --SVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLP 759 VL++LK+A+ SL QK++ G + G + S + + +IP+ GLFR+P Sbjct: 474 FTGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVP 533 Query: 758 TDYQPENARPGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPE 606 TD E + F AN P+ + S + S ++ + Y+P + + Sbjct: 534 TDISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSD 593 Query: 605 R----PFSQPRLSEGP----------------------SSSNRMN----RLDSYTNPVLP 516 R P+ PR S P + +R++ D PVLP Sbjct: 594 RFFSGPYMYPRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLP 653 Query: 515 SVK----DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 S ++P PD+ ++ EG + + S P S YD H RPD++R Sbjct: 654 SSSLQNYPTFPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 708 >gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris] Length = 652 Score = 295 bits (756), Expect = 5e-77 Identities = 239/666 (35%), Positives = 337/666 (50%), Gaps = 39/666 (5%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 + ++QR +S ++S AMTIEFLRARLLSERS+SK+ARQRADELA+KV ELEEQL+ V LQ Sbjct: 7 DPQDQRIASSTEDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQLRMVILQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RK AEKATA+VLAILE+ GIS VS+EFDS SD E+P D N K R+ Sbjct: 67 RKMAEKATADVLAILESQGISGVSDEFDSGSDL-ENPFDSSMSNECAKEDEGPMKSKGRQ 125 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEK-KNYMDSVRRRASFGSNSLSAR-RV 1719 + + +SLSW+ D H+LEK K +VRR++SF S S S + R+ Sbjct: 126 HGSDEMSGSNEDSSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSSSPKHRL 185 Query: 1718 GKSCRRIRHRDTRS-MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLR 1542 GKSCR+IRHR RS ME+S+ + + S+G + +G S+ L+ Sbjct: 186 GKSCRKIRHRQPRSVMEESRGKFVHVNCQVNELVSSSEG----FPNFRDG----GSNILK 237 Query: 1541 SNSETQKMDG---------RYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFR 1389 S+ Q+ DG + D + R+++ME AL+HQA+LI QY EKFR Sbjct: 238 IESKIQEEDGSEANLLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQREWEEKFR 297 Query: 1388 ENNSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS 1209 ENNS T DSCDPGNHSD+TE++ E K ++ AA S +E+K E C SE+ Sbjct: 298 ENNSTTPDSCDPGNHSDMTEDKDEGK-VQIPYAAKVVTSKAEESKGEPGGVCLSEE---- 352 Query: 1208 KRSLQNENIISCESSASE-FSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSST 1032 K + I+ + ++ + S + +FLG ++ S + Q +V +QSS Sbjct: 353 KLKAEGREIMPKKHDDTDVYRNQKSTTFSTSDFLGQENSHSPLKGNQNEILVNGHSQSSD 412 Query: 1031 KISPYEEKSTALST----------PPKISLPLELAVVPQDN--LGSVLEALKRAKSSLNQ 888 + + ++ T K L V + + VLE+LK+A+ SL Q Sbjct: 413 MNHLDQGRHSSFPTDIHGVQHQHDASKNQKDLYALVTREQSHQFDGVLESLKQARISLQQ 472 Query: 887 KLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPP 708 +LN P G G +P + + D F+IP GLFRLPTD+ E A P F P Sbjct: 473 ELNRLPVVEG---GYTAKPLPSVSKNEDRFEIPFGFSGLFRLPTDFSDE-ATPRFNVRDP 528 Query: 707 ENSLGRFLSEPFDSRSAFSSDLFLTDP-------YRPFTPERPFSQPRLSEGPSSSNRMN 549 G + S S F T+P P ++ + L G S+ + Sbjct: 529 TTGFGSNY-HLNGTMSRTSVGQFFTNPPHSGKMLMSPSANDQALATRYLENGSRFSSSQS 587 Query: 548 RLDSYTN-PVLPSVKDSYPFLP------DVTLRVPLNEGGASRNFPSSERGLPPVMRLSS 390 D ++N L S K SYP P + T ++P + SR + +S G+P R S Sbjct: 588 PFDPFSNGGPLSSSKYSYPTFPINPSYQNATPQMPFGD-EVSRPYSNSTVGVPLANRFSF 646 Query: 389 YDEHVR 372 D+H+R Sbjct: 647 NDDHLR 652 >gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 749 Score = 294 bits (752), Expect = 1e-76 Identities = 237/708 (33%), Positives = 348/708 (49%), Gaps = 83/708 (11%) Frame = -2 Query: 2231 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2052 TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS+QR++AEKA Sbjct: 53 TTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKA 112 Query: 2051 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1872 TA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+R+ Sbjct: 113 TADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKVRQKESEELS 171 Query: 1871 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR--RVGKSCRR 1701 ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R R GKSCR+ Sbjct: 172 GSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRKHRQGKSCRQ 229 Query: 1700 IRHRDTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1551 IR R++RS M D Q G E ++ +A + + G H+ E +N+ Sbjct: 230 IRRRESRSVAEELKSDNIMVDPQVKGLENSS-EVNANHSTGGPHILPMGSEIHENKSTVD 288 Query: 1550 TLRSNSETQKMDGRYFDV----HERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFREN 1383 L S++ + + FD+ +E + DME AL+HQAQLI Y EKFRE Sbjct: 289 NLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREK 348 Query: 1382 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS-- 1209 NS + DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + +E P+ Sbjct: 349 NSSSPDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSN 406 Query: 1208 ------------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQY 1083 RSL E+ ++ S + +F M++E ++Q +++ Sbjct: 407 DLVPPSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSN 463 Query: 1082 RSQQFPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALK 912 S F + + + + S + P+ L A+VP + G VL++LK Sbjct: 464 SSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLK 522 Query: 911 RAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENAR 732 +A+ SL QK++ G + G + S + + +IP+ GLFR+PTD E + Sbjct: 523 QARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPK 582 Query: 731 PGF---------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQ 591 F AN P+ + S + S ++ + Y+P + +R P+ Sbjct: 583 ANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMY 642 Query: 590 PRLSEGP----------------------SSSNRMN----RLDSYTNPVLPSVK----DS 501 PR S P + +R++ D PVLPS + Sbjct: 643 PRTSSSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPT 702 Query: 500 YPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 +P PD+ ++ EG + + S P S YD H RPD++R Sbjct: 703 FPSYPDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 748 >ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4 [Glycine max] Length = 641 Score = 288 bits (738), Expect = 6e-75 Identities = 234/664 (35%), Positives = 326/664 (49%), Gaps = 37/664 (5%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 + ++QR T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQ Sbjct: 7 DPQDQRVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RK AEKATA+VLAILE+ GISDVSEEFDS SD E+P D N K R+ Sbjct: 67 RKMAEKATADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQ 125 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVG 1716 + + +SLSW+ DS H+LEK ++RR++SF S S S + R G Sbjct: 126 HGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQG 184 Query: 1715 KSCRRIRHRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESST 1548 KSCR+IRHR R +E+S+N + ++ A F G + + E+ + S Sbjct: 185 KSCRKIRHRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSG 244 Query: 1547 LRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQ 1368 ++ +DG + R+ DME AL+HQAQLI QY EKFRENNS T Sbjct: 245 ANPLNKNHHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTP 299 Query: 1367 DSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNE 1188 DSCDPGN+SD+TE++ E K + AA SD QE+K E C SE+ K + Sbjct: 300 DSCDPGNYSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEAR 354 Query: 1187 NII-SCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSSTKISPYEE 1011 +I+ +S + + + LG Q+ + Q V Q S Sbjct: 355 DIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPG 414 Query: 1010 KSTALSTPPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLN 891 + + P S P ++ V N VLE+LK+A+ SL Sbjct: 415 RHGYHDSKPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQ 474 Query: 890 QKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPG 726 Q+L P SG +PS + + D F++PV GLFR+PTD+ N + Sbjct: 475 QELKRLPLV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDP 531 Query: 725 FANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSS 561 A F L R +S D + F + PY P + L+ GP+ Sbjct: 532 TAGFGSNFHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGG 585 Query: 560 NRMNRLDSY-TNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYD 384 + + +Y T P+ PS +++ P +P NE SR + SS G+P R S Sbjct: 586 SLSSSKYTYPTFPINPSYQNATPQMPFG------NE--VSRPYSSSTVGVPLANRFSFNS 637 Query: 383 EHVR 372 +H+R Sbjct: 638 DHLR 641 >ref|XP_006606284.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1 [Glycine max] gi|571568788|ref|XP_006606285.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2 [Glycine max] gi|571568792|ref|XP_006606286.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X3 [Glycine max] Length = 664 Score = 284 bits (726), Expect = 1e-73 Identities = 232/657 (35%), Positives = 321/657 (48%), Gaps = 37/657 (5%) Frame = -2 Query: 2231 TTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKA 2052 T+ M++S AMTIEFLRARLLSERS+S++A+QRADELAKKV +LEEQLK V LQRK AEKA Sbjct: 37 TSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQLKTVILQRKMAEKA 96 Query: 2051 TANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXX 1872 TA+VLAILE+ GISDVSEEFDS SD E+P D N K R++ Sbjct: 97 TADVLAILESEGISDVSEEFDSGSDL-ENPCDSSVSNECAKEGEEPMSSKGRQHGSDKMP 155 Query: 1871 XXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSAR-RVGKSCRRIR 1695 + +SLSW+ DS H+LEK ++RR++SF S S S + R GKSCR+IR Sbjct: 156 GSNVDSSPVSSKSLSWKGRHDSSHSLEKYK-TSNLRRQSSFSSISSSPKHRQGKSCRKIR 214 Query: 1694 HRDTR-SMEDSQN---DGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRSNSET 1527 HR R +E+S+N + ++ A F G + + E+ + S ++ Sbjct: 215 HRQIRLVVEESRNKFANHEKELASLSKGFPNFSGGGSNIPKIESEIQEEGGSGANPLNKN 274 Query: 1526 QKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSCDPGN 1347 +DG + R+ DME AL+HQAQLI QY EKFRENNS T DSCDPGN Sbjct: 275 HHVDG-----YGREKDMEKALEHQAQLIDQYEAMEKVQREWEEKFRENNSTTPDSCDPGN 329 Query: 1346 HSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENII-SCE 1170 +SD+TE++ E K + AA SD QE+K E C SE+ K + +I+ Sbjct: 330 YSDMTEDKDESK-VHIPFAAKVVTSDAQESKGEPRGVCLSEE----KFKAEARDIMPKTH 384 Query: 1169 SSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQSSTKISPYEEKSTALST 990 +S + + + LG Q+ + Q V Q S + + Sbjct: 385 DDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGHFQPSVMNHQDPGRHGYHDS 444 Query: 989 PPKISLPLELAVVPQDN--------------------LGSVLEALKRAKSSLNQKLNNSP 870 P S P ++ V N VLE+LK+A+ SL Q+L P Sbjct: 445 KPTYSFPTDIHGVQHQNDASRNKTDLFALVTHEQPHKFNGVLESLKQARISLQQELKRLP 504 Query: 869 PTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-----NARPGFANFPPE 705 SG +PS + + D F++PV GLFR+PTD+ N + A F Sbjct: 505 LV---ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIPTDFSDGATARFNVKDPTAGFGSN 561 Query: 704 NSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLS-----EGPSSSNRMNRLD 540 L R +S D + F + PY P + L+ GP+ + + Sbjct: 562 FHLNRAMSRTSDGQ------FFPSLPYPDTQLSLPANDQSLAIRYVENGPNGGSLSSSKY 615 Query: 539 SY-TNPVLPSVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVR 372 +Y T P+ PS +++ P +P NE SR + SS G+P R S +H+R Sbjct: 616 TYPTFPINPSYQNATPQMPFG------NE--VSRPYSSSTVGVPLANRFSFNSDHLR 664 >ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] gi|222850857|gb|EEE88404.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] Length = 684 Score = 281 bits (719), Expect = 9e-73 Identities = 240/691 (34%), Positives = 342/691 (49%), Gaps = 59/691 (8%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 E ++QR +SM++S A+TIEFLRARLL+ERSVS+TARQRADELA++VAELEEQL+ VSLQ Sbjct: 7 EKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQLRIVSLQ 66 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 R KAEKAT +VLAILE++GISD SE F S SDQ+ +P + K K+ K Sbjct: 67 RMKAEKATVDVLAILESNGISDDSEIFGSSSDQD-TPCESKVGK-KTKQEESSVISKVTK 124 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR-VG 1716 S GR+LSW+ K S +LEK S+RRR+SF S S S + G Sbjct: 125 YKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCKD-PSLRRRSSFASTSSSPKHHQG 183 Query: 1715 KSCRRIRHRDTR-------SMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLE 1557 KSCR++R++++R + D + A + + F V ENG+ + Sbjct: 184 KSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNCSEPEVG--RIENGEEKTL 241 Query: 1556 SSTLRSNSETQKMDGRYFD--VHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFREN 1383 Q+ D + V+ D DME AL+HQAQLI +Y EKFREN Sbjct: 242 PPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQLIDRYKAMEKVQREWEEKFREN 301 Query: 1382 NSGTQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKR 1203 N T DS D GN SDVTEE YE+K+ ++ + GT + + K E V+ + +P R Sbjct: 302 NGSTPDSYDAGNRSDVTEEGYEIKA-QVQQHTGTVAAQSNRAKSE-VEKASNIQPNGILR 359 Query: 1202 ----------SLQNENIISCESSASEFSFPMSREK--NNQEFLGIQHDASQYRSQQFPPM 1059 ++ + + ES A +F+F ++K N+E LG + S + S P Sbjct: 360 PSHVNIGQLQEWKSSSAPTSESPAQDFAFRAEKQKQNENEESLGNNYHPSPHSSHDHP-- 417 Query: 1058 VQTTTQSSTKISPYEEKSTALSTPPKISLPL--------EL-AVVP---QDNLGSVLEAL 915 S+ SP + +T+ + EL A+VP + LG VL+AL Sbjct: 418 ----QSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGRQNELYALVPHRASNELGGVLDAL 473 Query: 914 KRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPE-- 741 K A+ SL QK++ P G + + PS D IP+ + GLFRLP D+ E Sbjct: 474 KLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGDKVDIPLGNAGLFRLPFDFLAEGS 533 Query: 740 --------NARPGFANFPPEN-----SLGRFLSE-PFDSRSAF-SSDLFLTDPYRPFTPE 606 NA N+ P+ ++ RF+S P + S F ++D FL T Sbjct: 534 TRKNLDSTNAGLSLRNYYPDTGVPAAAINRFVSRFPTATGSRFPTADQFLASQSYSATGS 593 Query: 605 RPFSQPRL--SEGPSSSNRMNRLDSYTNPVL-----PSVKDSYPFLPDVTLRVPLNEGGA 447 R ++ + S+ + +R++ + P L PS + SYP P +P Sbjct: 594 RFPTEDQFLASQDVEAGSRISSQRPFFYPYLDTVSPPSARYSYPTNPSYPGPMPQLPSRE 653 Query: 446 SRNF-PSSERGLPPVMRLSSYDEHVRPDMYR 357 +F PS+ G+PP S D H+RP+MYR Sbjct: 654 PPSFLPSTTAGVPPADHFSFPDYHIRPNMYR 684 >gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508727307|gb|EOY19204.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 665 Score = 280 bits (717), Expect = 2e-72 Identities = 230/704 (32%), Positives = 339/704 (48%), Gaps = 70/704 (9%) Frame = -2 Query: 2258 SGEDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVS 2079 S + ++ ++TT E + MTIEFLRARLLSERSVSK+ARQR DELAK+VAELE+QLKFVS Sbjct: 4 SDQVKQDQRTTCNVEDSTMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVS 63 Query: 2078 LQRKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKL 1899 +QR++AEKATA+VLAILEN+G+SD+SEE DS SDQ ++P + NG K+ Sbjct: 64 VQRRRAEKATADVLAILENNGVSDISEELDSSSDQ-DAPFESNINNGSTKEEESSVTSKV 122 Query: 1898 RKNXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDS-VRRRASFGSNSLSAR- 1725 R+ ++GRSLSW+ K + H+ E+ Y D VR R SF S S S+R Sbjct: 123 RQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPER--YKDKLVRSRNSFASISFSSRK 180 Query: 1724 -RVGKSCRRIRHRDTRSM-EDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESS 1551 R GKSCR+IR R++RS+ E+ ++D + K SS Sbjct: 181 HRQGKSCRQIRRRESRSVAEELKSDN--------------------IMVDPQVKGLENSS 220 Query: 1550 TLRSNSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGT 1371 + +N T + DME AL+HQAQLI Y EKFRE NS + Sbjct: 221 EVNANHST------------GEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNSSS 268 Query: 1370 QDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS------ 1209 DSCDPGNHSDVTEER E+K+ + +GT+ S Q ++E + + +E P+ Sbjct: 269 PDSCDPGNHSDVTEERDEIKA-QAQYVSGTATSQVQGAEEEHI-SFSAELPKIHSNDLVP 326 Query: 1208 --------------KRSLQNENIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQ 1071 RSL E+ ++ S + +F M++E ++Q +++ S Sbjct: 327 PSQADMDRLQDWRYSRSLSPES-LNPNSPGQKLTFLMAKENHHQSMQ--SNNSPSNSSHH 383 Query: 1070 FPPMVQTTTQSSTKISPYEEKSTALSTPPKISLPLELAVVPQDNLG---SVLEALKRAKS 900 F + + + + S + P+ L A+VP + G VL++LK+A+ Sbjct: 384 FAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNEL-YALVPHETSGRFTGVLDSLKQARL 442 Query: 899 SLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGF- 723 SL QK++ G + G + S + + +IP+ GLFR+PTD E + F Sbjct: 443 SLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFL 502 Query: 722 --------ANFPPENSLGRFLSEPFDSRSAFSSDLFLTDPYRPFTPER----PFSQPRLS 579 AN P+ + S + S ++ + Y+P + +R P+ PR S Sbjct: 503 GSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFSGPYMYPRTS 562 Query: 578 EGP----------------------SSSNRMN----RLDSYTNPVLPSVK----DSYPFL 489 P + +R++ D PVLPS ++P Sbjct: 563 SSPFPTAFASSGYIKDDQILTGQCEETGSRLSTPKPSFDPSLEPVLPSSSLQNYPTFPSY 622 Query: 488 PDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 PD+ ++ EG + + S P S YD H RPD++R Sbjct: 623 PDLVPQIHAKEGFPAFHTTRSVGATPD--WFSFYDSHFRPDIHR 664 >emb|CBI40233.3| unnamed protein product [Vitis vinifera] Length = 682 Score = 279 bits (714), Expect = 3e-72 Identities = 188/418 (44%), Positives = 238/418 (56%), Gaps = 32/418 (7%) Frame = -2 Query: 2222 MQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQRKKAEKATAN 2043 M++S AMTIEFLRARLLSERSVS+TARQRADELA++V +LEEQLK VS+QR KAEKATA+ Sbjct: 1 MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60 Query: 2042 VLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRKNXXXXXXXXX 1863 VLAILENH ISDVS EFDS SDQE + D G Sbjct: 61 VLAILENHAISDVSWEFDSSSDQEVALCDSHVGGG------------------------- 95 Query: 1862 XXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRRASFGSNSLSARR--VGKSCRRIRHR 1689 R LSW+SSKDS H++EK+ S+RRR SF S+ S+ + +GKSCR+IR R Sbjct: 96 --------RRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRR 147 Query: 1688 DTRS----------MEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQ--LESSTL 1545 +TRS M DSQN+G + S NG D LRE + + L + Sbjct: 148 ETRSAVDELKVGRVMVDSQNNGI--ISSSEGLPNGFDSGQEILREGSENQEEEALMDGQV 205 Query: 1544 RSNSETQKM---DGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSG 1374 + E+Q+ + + + RD DME AL+HQAQLIGQY EKFRENNS Sbjct: 206 SDSLESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSS 265 Query: 1373 TQDSCDPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETS----- 1209 T DSC+PGNHSDVTEER E+K P+ AAG S +Q TK + D F+E+ + Sbjct: 266 TPDSCEPGNHSDVTEERDEVK-PQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTIS 324 Query: 1208 -------KRSLQNEN---IISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFP 1065 LQ +N +++ ES A +F FPM++E +QEFL Q + S +P Sbjct: 325 TTHLHGDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSHSSHHYP 382 Score = 105 bits (263), Expect = 7e-20 Identities = 81/225 (36%), Positives = 110/225 (48%), Gaps = 24/225 (10%) Frame = -2 Query: 959 AVVPQDN---LGSVLEALKRAKSSLNQKLNNSPPTAGRASGSVFQPSNNETNKTDSFQIP 789 A+VP++ LG VLEAL++A+ SL KLN P G + G +PS T + +IP Sbjct: 472 ALVPRETSNELGGVLEALQQARLSLQHKLNRLPLIEGGSIGRAIEPSFPSTRAWERVEIP 531 Query: 788 VISPGLFRLPTDYQ----------PENARPGFANFPPE-----NSLGRFLSEPF--DSRS 660 V GLFR+P DYQ +++ N+ P+ N RFL+ P+ S Sbjct: 532 VGCAGLFRVPADYQLGTATEANFLGSDSQSSLKNYYPDTGFVANPGDRFLTSPYLKTGSS 591 Query: 659 AFSSDLFLTDPYRP----FTPERPFSQPRLSEGPSSSNRMNRLDSYTNPVLPSVKDSYPF 492 + D FLT PYR P RP G S+S R YT+P +Y Sbjct: 592 VPTDDSFLTSPYRETGSRIPPLRPSFDYYSDAGLSASTR------YTHP-------TYSS 638 Query: 491 LPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRPDMYR 357 PD+ R+P NEG A R +SE G+P S YD+H+RP+MYR Sbjct: 639 HPDLLYRMPFNEGFA-RPPRNSEVGIPSTDHFSFYDDHIRPNMYR 682 >ref|XP_006360476.1| PREDICTED: flocculation protein FLO11-like isoform X1 [Solanum tuberosum] gi|565389467|ref|XP_006360477.1| PREDICTED: flocculation protein FLO11-like isoform X2 [Solanum tuberosum] gi|565389469|ref|XP_006360478.1| PREDICTED: flocculation protein FLO11-like isoform X3 [Solanum tuberosum] gi|565389471|ref|XP_006360479.1| PREDICTED: flocculation protein FLO11-like isoform X4 [Solanum tuberosum] gi|565389473|ref|XP_006360480.1| PREDICTED: flocculation protein FLO11-like isoform X5 [Solanum tuberosum] Length = 678 Score = 276 bits (705), Expect = 4e-71 Identities = 224/649 (34%), Positives = 326/649 (50%), Gaps = 21/649 (3%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 ED++Q K +++S TIEFLR RLL+ERS S+TA+QRADELA++V+ELEEQLK VSLQ Sbjct: 7 EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQRVSELEEQLKAVSLQ 65 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RKKAE+ATA VL+ILENH I DVSEEF S SD+E D K + ++ Sbjct: 66 RKKAERATAAVLSILENHSIDDVSEEFSSGSDKEAILSDQKDAENKTGGDISSSVKE-KE 124 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1719 + ST RSLSW+S K S H+L+++ Y DS RRR ++F S +S+ +RV Sbjct: 125 DDVDTLSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSSTDISSPKRV 183 Query: 1718 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1539 G SCRRIR RDTRS D + + A C+ + S G N + S Sbjct: 184 GNSCRRIRRRDTRSASDKLQNSS--AECASEPLPSSANNEPHPLTAGAGINDVNDQVHVS 241 Query: 1538 NSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359 + + G + + D+D + AL QAQLIGQY EK+RE+N T DSC Sbjct: 242 AID---VSGNGKEADKSDEDSQRALHQQAQLIGQYEAEEKAQREWEEKYRESNICTPDSC 298 Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEK----------PETS 1209 D N+SDVTEER ++K+ + AG ++ N + D +E+ P + Sbjct: 299 DRENYSDVTEERDDLKASQEPCLAGNTSMQNHANQSGAADVSRTEQNGNIDNSPSTPHVN 358 Query: 1208 KRSLQNE---NIISCESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038 L+++ + +S ASE + PMS N +L S Y QQ P+ + Sbjct: 359 MSCLEDKKGSRTVESDSPASELARPMS----NGNYLENHGQTSAYSHQQSLPVTR----- 409 Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867 SP +S++L ELA+V +++ SVL L++AK SL +++N+S P Sbjct: 410 ----SPMHPRSSSLQAGQAPQTGYELALVSHNTSNSVNSVLGELEQAKLSLTKQINSSLP 465 Query: 866 TAGRASGSVFQPSNNETNKTDSFQIPVISPGLFRLPTDYQPENARPGFANFPPENSLGRF 687 TA S N++++ +++ +SP + +R + + G Sbjct: 466 TASYPGMPSRFSSVNQSSEPSTYETS-LSPYM----------ESRSKYV------TQGNR 508 Query: 686 LSEPFDSRSAFSSDLFLTDPYRPFTPERPFSQPRLSE---GPSSSNRMNRLDSYTNPVLP 516 ++ PF + AF YRP + E F + S P+SS+R+ +T P Sbjct: 509 VTYPF--QRAFPEVSSSAPSYRPIS-ETNFDAGQPSSMRFNPNSSSRLPLSSKFTYP--- 562 Query: 515 SVKDSYPFLPDVTLRVPLNEGGASRNFPSSERGLPPVMRLSSYDEHVRP 369 SYP PD+ ++P NE SRN+P +E LPP S++ V P Sbjct: 563 ----SYPKFPDMVPKLPPNE-VFSRNYPRNETDLPPSFSFSTWSPEVVP 606 >ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251943 [Solanum lycopersicum] Length = 729 Score = 266 bits (679), Expect = 4e-68 Identities = 229/673 (34%), Positives = 332/673 (49%), Gaps = 45/673 (6%) Frame = -2 Query: 2252 EDREQRKTTSMQESNAMTIEFLRARLLSERSVSKTARQRADELAKKVAELEEQLKFVSLQ 2073 ED++Q K +++S TIEFLR RLL+ERS S+TA+QRADELA+ V+ELEEQLK VSLQ Sbjct: 7 EDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQMVSELEEQLKVVSLQ 65 Query: 2072 RKKAEKATANVLAILENHGISDVSEEFDSCSDQEESPHDFKARNGXXXXXXXXXXXKLRK 1893 RK+AEKATA VL+ILE+H I DVSEEF S SD+E D K G K ++ Sbjct: 66 RKRAEKATAAVLSILEDHSIDDVSEEFSSGSDKETILSDQKDA-GNKTGGDISSSAKEKE 124 Query: 1892 NXXXXXXXXXXXXXXSTGRSLSWRSSKDSQHALEKKNYMDSVRRR-ASFGSNSLSA-RRV 1719 + ST RSLSW+S K S H+L+++ Y DS RRR ++F +S+ +RV Sbjct: 125 DDVDILSSSGTVSSSSTARSLSWKSGK-SSHSLDRRKYTDSNRRRYSNFSYTDISSPKRV 183 Query: 1718 GKSCRRIRHRDTRSMEDSQNDGTEKAACSGDAFNGSDGEHVALREYENGKNQLESSTLRS 1539 G SCR+IR RDTRS D + + A C+ + + S G + + Sbjct: 184 GNSCRQIRRRDTRSASDKLRNSS--AECASEPLSSSANNEPHSLTAGAGISDVNDQV--- 238 Query: 1538 NSETQKMDGRYFDVHERDDDMESALQHQAQLIGQYXXXXXXXXXXXEKFRENNSGTQDSC 1359 + + G + + D+D + AL Q Q IGQY EK+RE+NS T DSC Sbjct: 239 HVPALDVPGNGREADKSDEDSQRALHQQVQPIGQYEAEEKAQREWEEKYRESNSCTPDSC 298 Query: 1358 DPGNHSDVTEERYEMKSPELSRAAGTSNSDNQETKQEQVDACFSEKPETSKRSLQNENI- 1182 D N+SDVTEER ++K+ + AG ++ N + D +++ S N+ Sbjct: 299 DRENYSDVTEERDDLKASQEPCLAGRTSMQNHANQCGAADVSRTKQNGNIDNSPSTPNVN 358 Query: 1181 ISC------------ESSASEFSFPMSREKNNQEFLGIQHDASQYRSQQFPPMVQTTTQS 1038 +SC +SSASE + PMS +L S + QQ P+ + Sbjct: 359 MSCLEDKKGSRTVGSDSSASELARPMS----TGNYLENHGQTSAFSHQQSFPVTR----- 409 Query: 1037 STKISPYEEKSTALSTPPKISLPLELAVV---PQDNLGSVLEALKRAKSSLNQKLNNSPP 867 S +S++L + ELA+V + + SVL L++AK SL +++N+S P Sbjct: 410 ----SSMHPRSSSLQAGQALQTGYELALVSHNTSNGVDSVLGKLEQAKLSLTKQINSSLP 465 Query: 866 TAGRASGSVFQPSNNETNKTDSFQIPVISPGL----------FRLPTDYQ---PE--NAR 732 TA S N + + +++I + P + R+ +Q PE ++ Sbjct: 466 TASYPGTPSRFSSLNHSPELSTYEISLTPPYVESRSKYVTQSNRVTYPFQRAFPEVSSSA 525 Query: 731 PGFANFPPEN-SLGRFLSEPF-DSRSAF-SSDLFLTDPY-RPFT-------PERPFSQPR 585 P + N G+ S P+ +SRS + + +T P+ R FT RP S+ Sbjct: 526 PSYRPISETNFEAGQPSSTPYVESRSKYVTQSNRVTYPFQRAFTEVSSSAPSYRPISETN 585 Query: 584 LSEGPSSSNRMNRLDSYTNPVLPSVK-DSYPFLPDVTLRVPLNEGGASRNFPSSERGLPP 408 G SS R N S P + SYP PD+ ++P NE SRNFP++E LPP Sbjct: 586 FDAGQPSSVRFNPNSSSRLPFSSKLTYPSYPKFPDMVPKLPPNE-VFSRNFPTNETDLPP 644 Query: 407 VMRLSSYDEHVRP 369 S+ + V P Sbjct: 645 SFSFSTLSQEVVP 657