BLASTX nr result
ID: Catharanthus22_contig00010650
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00010650 (2562 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyp...  310   3e-81
emb|CBI40233.3| unnamed protein product [Vitis vinifera]             303   2e-79
ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citr...  301   9e-79
ref|XP_004239716.1| PREDICTED: uncharacterized protein LOC101267...  298   1e-77
ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyp...  297   1e-77
gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus pe...  286   4e-74
ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citr...  284   2e-73
ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus c...  277   1e-71
ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309...  276   4e-71
gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao]   272   5e-70
gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma caca...  267   2e-68
gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis]    266   3e-68
gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao]   264   2e-67
ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207...  261   8e-67
gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus...  235   8e-59
ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Popu...  233   2e-58
ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyp...  222   6e-55
ref|XP_004496182.1| PREDICTED: uncharacterized protein LOC101514...  220   2e-54
ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251...
216 3e-53 ref|XP_004496183.1| PREDICTED: uncharacterized protein LOC101514... 213 3e-52 >ref|XP_006345859.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1 [Solanum tuberosum] Length = 643 Score = 310 bits (793), Expect = 3e-81 Identities = 245/723 (33%), Positives = 363/723 (50%), Gaps = 15/723 (2%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M+S+ +DQDQR G+ED ++MTIE LRARLL+ER++S+TARQRADEL +RVLELEDQL Sbjct: 1 MTSNGKQDQDQRKIVGMED-SSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSK---ISNNSVKMKG 387 K+V LQ+K+AEKATA VL+ILE+ G++DASEEFDS SD E S+SK ++N + K Sbjct: 60 KIVSLQRKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKP 119 Query: 388 ASTDFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMD-XXXXXXXXXX 564 ++ R ++ ++ GR ER ++ D Sbjct: 120 NPSNVKERENDADI-SSSEIISSPSTGRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFAS 178 Query: 565 XXXXLPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETV 744 PKR GKSC + +E + LP + G +++ Sbjct: 179 TGSSSPKRAGKSCRRIRRNTTKTATDECP----------------PEHLPSFANNGHQSL 222 Query: 745 RDDPESHEVENPREAPSSRFCGTVIPEVSVSSR--TEQDISMGRALNDQAQK-KAHEEEK 915 D +++V++ R P+S E+S + R E D M RAL +AQ +E E+ Sbjct: 223 MDSAGNNDVKDQRHLPTS--------EMSENQRKSDESDEGMERALQHKAQLIGQYEAEE 274 Query: 916 TAQREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYS---VGGTNYLNQGTEV 1086 AQREWEE RENN+ A DSCDPGN SDVTEERD+MKA Q YS + N+ N+ EV Sbjct: 275 KAQREWEEKYRENNNYAQDSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEV 334 Query: 1087 EVANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKN 1266 ++ +T + D+ P P+ + T + N S ++ E+ SEFA SNG+ +N Sbjct: 335 DIPSTNGVTDNVPSTPH-------IGTSCRKDQNCSRIINSESPASEFALSKSNGSCPEN 387 Query: 1267 LSGDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKELALMPQNSSSNLDT 1446 D + +Y R ++G P+ + +++G SS + AL+ +++S N+ + Sbjct: 388 -------DGPTPAYSRHQLPSANGSPIHPLENSISSSGGSSLQAGQ-ALVSRDASDNIGS 439 Query: 1447 VLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDF 1626 +L AL+QAK S+ ++ NV+P + G IEH 
+P AR D L+I G PG+FRLPTDF Sbjct: 440 ILGALEQAKFSISQQI-NVSP--IAEGGSSIEHSIPTARI-DRLDILPGFPGLFRLPTDF 495 Query: 1627 QFE-RTKANYLGSDPALSLSYQGSETTCNKLMPNPYIDARSSVLVNDRFWSSSGPSFMEI 1803 Q E T A+Y G S + E ++ PY+++ S+ + + ++G ++ Sbjct: 496 QLEATTTASYQGFPSRFSSANHFHEPGYDQFSTTPYMESPSNAITGLPY--TTGFDYLNP 553 Query: 1804 RSGIGMGTSPLTERVSETRTHP-IPFTEN-LPGIPARRPLFEPVMGASFSGRNVYIDPRX 1977 SG G S T+P PF N + + + P+ +S + + + P Sbjct: 554 PSGFG-------HPFSSKSTYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVP-- 604 Query: 1978 XXXXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNS--TMEFGMPSTARFSLYDDHIRPN 2151 L SGE S E G P + S YD H+RPN Sbjct: 605 ------------------------NLSSGEEVFLRSLPRNETGKPPSFPVSHYDAHLRPN 640 Query: 2152 MYK 2160 MY+ Sbjct: 641 MYR 643 >emb|CBI40233.3| unnamed protein product [Vitis vinifera] Length = 682 Score = 303 bits (776), Expect = 2e-79 Identities = 248/764 (32%), Positives = 346/764 (45%), Gaps = 72/764 (9%) Frame = +1 Query: 85 LEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQLKMVFLQKKRAEKATAD 264 +ED T MTIE LRARLLSER++S+TARQRADEL +RV +LE+QLK+V +Q+ +AEKATAD Sbjct: 1 MEDSTAMTIEFLRARLLSERSVSRTARQRADELAQRVWKLEEQLKIVSIQRNKAEKATAD 60 Query: 265 VLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGASTDFDARRHNREVXXXXX 444 VLAILE++ ++D S EFDS+SD E + DS + Sbjct: 61 VLAILENHAISDVSWEFDSSSDQEVALCDSHVGG-------------------------- 94 Query: 445 XXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXXLPKR-VGKSCXXXXXX 621 GR + +E++ PK +GKSC Sbjct: 95 -------GRRLSWKSSKDSSHSIEKRYLDCSIRRRHSFASSGSSSPKHNLGKSCRQIRRR 147 Query: 622 XXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDDPESHEVENPREAPSSR 801 S +E++ + ++ + + + S+ LP+ D G E +R+ E+ E E + S Sbjct: 148 ETRSAVDELKVGRVMVDSQNNGIISSSEGLPNGFDSGQEILREGSENQEEEALMDGQVSD 207 Query: 802 FCGTVIPEVSVS---SRTEQDISMGRALNDQAQKKA-HEEEKTAQREWEENVRENNSSAL 969 + + +R +D M RAL QAQ +E E+ AQREWEE RENNSS Sbjct: 208 SLESQRDATGSNHHLNRNGRDRDMERALEHQAQLIGQYEAEEKAQREWEEKFRENNSSTP 267 Query: 970 DSCDPGNRSDVTEERDEMKAPPQMYSVGGT-NYLNQGTEVEVANTTFIADHKPKAPNSFL 1146 DSC+PGN SDVTEERDE+K PQ S G +QGT+++ + F + P Sbjct: 268 
DSCEPGNHSDVTEERDEVK--PQAPSAAGILTSQDQGTKLDDEDVHFNEESSQTLPTIST 325 Query: 1147 AAQQVDTRNLQGPNSSSMVPHETQLSEFAFPM--------------------SNGTPCKN 1266 D LQ N SM+ +E+ +F FPM S+ P + Sbjct: 326 THLHGDMECLQEQNRCSMLAYESLAPDFVFPMAKENLHQEFLENQSYPLSHSSHHYPWSH 385 Query: 1267 LSGDSHC------------DPAS----TSYIRSLANGSS--------------------G 1338 +S H PA + ++R + S+ Sbjct: 386 VSPGDHSANVTDHSLHVADHPADVRDHSEHVRDHSGHSTDHSADATDHSGHITDHSEHVA 445 Query: 1339 DPLGYVPLP--YANNGESSENR-KELALMPQNSSSNLDTVLEALQQAKLSLRDKLNNVAP 1509 D VPLP + GESS ++ K AL+P+ +S+ L VLEALQQA+LSL+ KLN + Sbjct: 446 DHSADVPLPSYVGSKGESSRSQDKHYALVPRETSNELGGVLEALQQARLSLQHKLNRLPL 505 Query: 1510 PVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQF-ERTKANYLGSDPALSLSY 1686 GS GR IE P+ R+ + +EIPVGC G+FR+P D+Q T+AN+LGSD SL Sbjct: 506 IEGGSIGRAIEPSFPSTRAWERVEIPVGCAGLFRVPADYQLGTATEANFLGSDSQSSLKN 565 Query: 1687 QGSETTC-----NKLMPNPYIDARSSVLVNDRFWSSSGPSFMEIRSGIGMGTSPLTERVS 1851 +T ++ + +PY+ SSV +D F +S Sbjct: 566 YYPDTGFVANPGDRFLTSPYLKTGSSVPTDDSFLTS------------------------ 601 Query: 1852 ETRTHPIPFTENLPGIPARRPLFEPVMGASFSGRNVYIDPRXXXXXXXXXXXXXXXXXXX 2031 P+ E IP RP F+ A S Y P Sbjct: 602 -------PYRETGSRIPPLRPSFDYYSDAGLSASTRYTHPTYSSHPDLLY---------- 644 Query: 2032 XXXXRLQLPSGERFSRNS-TMEFGMPSTARFSLYDDHIRPNMYK 2160 ++P E F+R E G+PST FS YDDHIRPNMY+ Sbjct: 645 ------RMPFNEGFARPPRNSEVGIPSTDHFSFYDDHIRPNMYR 682 >ref|XP_006436667.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|568878417|ref|XP_006492190.1| PREDICTED: uncharacterized protein LOC102610545 [Citrus sinensis] gi|557538863|gb|ESR49907.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 732 Score = 301 bits (771), Expect = 9e-79 Identities = 241/672 (35%), Positives = 333/672 (49%), Gaps = 27/672 (4%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M SS E QDQR+N G+ED TMTIE LRARLLSER++SK+ARQRADEL +RV+ELE+QL Sbjct: 1 MPSSGQEMQDQRTNSGMEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQL 60 Query: 217 
KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K+V LQ+K+AEKATADVLAILE+NG+++ S+ FDS SD ET +S++ NN K + S Sbjct: 61 KLVSLQRKKAEKATADVLAILENNGISEISDSFDSGSDQETPC-ESEVGNNFNKEEENSV 119 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 D RR+ R +E+ K Sbjct: 120 DSKFRRNASVEHSGSGNDFSPVPHRGLSWNGRRGTKQSLEKYKDSYLRRRSSFASTGSSS 179 Query: 577 LPKRVGKSCXXXXXXXXXSVAEE-----VQDDSNLNNNHVD-RVDTCSQDLPDSHDIGTE 738 RVGKSC S EE V+ DS N VD + L S + Sbjct: 180 PKNRVGKSCRQIRRRESKSAVEELKTEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQ 239 Query: 739 TVRDDPESHEVENPREAPSSRF----CGTVIPEVSVSSRTEQDISMGRALNDQAQKKA-H 903 + + +S EN + CG D M +AL DQAQ + Sbjct: 240 YLGEGSDSGCFENEKLVTGGGIDFNGCGG-------------DKDMEKALEDQAQLIGRY 286 Query: 904 EEEKTAQREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGT-NYLNQGT 1080 EE + AQREWEE RENNSS DSCDPGN+SDVTEER+E K Q+ V GT N Q Sbjct: 287 EEMEKAQREWEERFRENNSSTPDSCDPGNQSDVTEEREESKV--QVQRVAGTVNSQVQEA 344 Query: 1081 EVEVANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPC 1260 + EV + +++ K N FL Q D + P S + +FAF MSN Sbjct: 345 KTEVHLSNQLSNTKS---NGFLPPQSGDQKCSSTPASEPLA------QDFAFTMSNEKQN 395 Query: 1261 KNLSGDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENR------KELALMPQ 1422 + G++H P+ +S+ R +GS + +N G SS ++ AL+P Sbjct: 396 QESLGNNHYVPSHSSHHRLHPHGSPENQSSQTVS--SNTGSSSRREVSGSQSEQYALVPH 453 Query: 1423 NSSSNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPG 1602 +SS + VLEAL+QA+LSLR K++++ S G+ IE + A+ D +EIPVGC G Sbjct: 454 QTSSGFNEVLEALKQARLSLRQKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSG 513 Query: 1603 IFRLPTDFQFERTKANYLGSDPALSLSYQGSET-----TCNKLMPNPYIDARSSVLVND- 1764 +FR+PTD+ E +KAN+L SD SL+ + + ++ + N +D RS+ ++ Sbjct: 514 LFRVPTDYAVETSKANFLVSDSRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNF 573 Query: 1765 ---RFWSSSGPSFMEIRSGIGMGTSPLTERVSETRTHPIPFTENLPGIPARRPLFEPVMG 1935 R +GPS + RS LT + S+TR+ + RP F+ + Sbjct: 574 RPTRDLFLTGPS-TDTRSSYSAENRLLTRQYSDTRSR----------VSMMRPSFDSNLD 622 Query: 1936 ASFSGRNVYIDP 1971 A Y+ P Sbjct: 623 AGLPSFRQYMYP 634 >ref|XP_004239716.1| 
PREDICTED: uncharacterized protein LOC101267607 [Solanum lycopersicum] Length = 617 Score = 298 bits (762), Expect = 1e-77 Identities = 246/721 (34%), Positives = 349/721 (48%), Gaps = 13/721 (1%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 MSS+ +DQDQR G+E+ ++MTIE LRARLL+ER++S+TARQRADEL +RVLELEDQL Sbjct: 1 MSSNGKKDQDQRKTVGMEN-SSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSK---ISNNSVKMKG 387 K+V LQ+K+AEKATA VL+ILE+ G+TDASEEFDS SD E S+SK ++N + K Sbjct: 60 KIVSLQRKKAEKATAAVLSILENEGITDASEEFDSGSDQEAIFSNSKGADSTDNRNEYKP 119 Query: 388 ASTDFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXX 567 ++ R ++ ++ GR ER ++ D Sbjct: 120 DPSNVKERENDADISSSEIISSPST-GRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFAS 178 Query: 568 XXXL-PKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETV 744 PKR GKSC + ++N NN D+ D + Sbjct: 179 TGTSSPKRAGKSCRRIR-----------RSNTNAGNN----------DVNDQLHL----- 212 Query: 745 RDDPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKA-HEEEKTA 921 P S EN R+A E D M RAL +A +E E+ A Sbjct: 213 ---PTSETSENQRKAD------------------ESDEGMERALQHKALLIGKYEAEEKA 251 Query: 922 QREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYS---VGGTNYLNQGTEVEV 1092 QREWEE RENN A DSCDPGN SDVTEERD+MKA Q YS + N+ N+ EV++ Sbjct: 252 QREWEEKYRENNY-AQDSCDPGNYSDVTEERDDMKAFEQPYSAEMINLQNHANKFQEVDI 310 Query: 1093 ANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLS 1272 +T + D+ P P+ + T + N S ++ E+ SEFA P SNG+ +N Sbjct: 311 PSTNGVTDNVPSNPH-------ISTSCRKDQNCSRIINSESPASEFALPKSNGSCPEN-- 361 Query: 1273 GDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKELALMPQNSSSNLDTVL 1452 D + +Y S+G P+ + +++G SS + AL+ ++S N+ ++L Sbjct: 362 -----DGPTPAYCHHQLPSSNGSPIQPLENSISSSGGSSLQAGQ-ALVSGDASDNIGSIL 415 Query: 1453 EALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQF 1632 AL+QAK S+ ++N PV G S IEH +P A+ D L+IP G PG+FRLPTDFQ Sbjct: 416 GALEQAKFSISQQIN--VSPVEGRS--SIEHSIPTAKIEDRLDIPPGFPGLFRLPTDFQL 471 Query: 1633 
E-RTKANYLGSDPALSLSYQGSETTCNKLMPNPYIDARSSVLVNDRFWSSSGPSFMEIRS 1809 E T A+Y G S + E N+ PY+++ S+ + + ++G ++ S Sbjct: 472 EATTTASYQGFPSRFSSANHFHEPGYNQFSATPYMESPSNAITGLPY--TTGFDYLNPPS 529 Query: 1810 GIGMGTSPLTERVSETRTHP-IPFTEN-LPGIPARRPLFEPVMGASFSGRNVYIDPRXXX 1983 G S T+P PF N + + + P+ +S + + + P Sbjct: 530 SFG-------HPFSSKSTYPTYPFRPNTTTTVSQSQASWSPLYESSLTKSSPVVVP---- 578 Query: 1984 XXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNS--TMEFGMPSTARFSLYDDHIRPNMY 2157 L SGE S E G P + S YD H+RPNMY Sbjct: 579 ----------------------NLSSGEDVFLRSLPRNETGKPPSFPVSHYDAHMRPNMY 616 Query: 2158 K 2160 + Sbjct: 617 R 617 >ref|XP_006345860.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X2 [Solanum tuberosum] Length = 618 Score = 297 bits (761), Expect = 1e-77 Identities = 243/721 (33%), Positives = 353/721 (48%), Gaps = 13/721 (1%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M+S+ +DQDQR G+ED ++MTIE LRARLL+ER++S+TARQRADEL +RVLELEDQL Sbjct: 1 MTSNGKQDQDQRKIVGMED-SSMTIEFLRARLLAERSVSQTARQRADELAERVLELEDQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSK---ISNNSVKMKG 387 K+V LQ+K+AEKATA VL+ILE+ G++DASEEFDS SD E S+SK ++N + K Sbjct: 60 KIVSLQRKKAEKATAAVLSILENEGISDASEEFDSGSDQEAIFSNSKGADSTDNRNERKP 119 Query: 388 ASTDFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXX 567 ++ R ++ ++ GR ER ++ D Sbjct: 120 NPSNVKERENDADISSSEIISSPST-GRSLSWKSGKHSLPSFERNRYTDSAWRRSGSFAS 178 Query: 568 XXXL-PKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETV 744 PKR GKSC ++ +N NN D+ D + Sbjct: 179 TGSSSPKRAGKSCRRIR-----------RNTTNAGNN----------DVKDQRHL----- 212 Query: 745 RDDPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKA-HEEEKTA 921 P S EN R++ E D M RAL +AQ +E E+ A Sbjct: 213 ---PTSEMSENQRKSD------------------ESDEGMERALQHKAQLIGQYEAEEKA 251 Query: 922 QREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYS---VGGTNYLNQGTEVEV 1092 QREWEE RENN+ A DSCDPGN SDVTEERD+MKA Q YS + N+ N+ EV++ Sbjct: 252 QREWEEKYRENNNYAQDSCDPGNYSDVTEERDDMKAFEQPYSAEMINLHNHANKFQEVDI 311 Query: 
1093 ANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLS 1272 +T + D+ P P+ + T + N S ++ E+ SEFA SNG+ +N Sbjct: 312 PSTNGVTDNVPSTPH-------IGTSCRKDQNCSRIINSESPASEFALSKSNGSCPEN-- 362 Query: 1273 GDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKELALMPQNSSSNLDTVL 1452 D + +Y R ++G P+ + +++G SS + AL+ +++S N+ ++L Sbjct: 363 -----DGPTPAYSRHQLPSANGSPIHPLENSISSSGGSSLQAGQ-ALVSRDASDNIGSIL 416 Query: 1453 EALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQF 1632 AL+QAK S+ ++ NV+P + G IEH +P AR D L+I G PG+FRLPTDFQ Sbjct: 417 GALEQAKFSISQQI-NVSP--IAEGGSSIEHSIPTARI-DRLDILPGFPGLFRLPTDFQL 472 Query: 1633 E-RTKANYLGSDPALSLSYQGSETTCNKLMPNPYIDARSSVLVNDRFWSSSGPSFMEIRS 1809 E T A+Y G S + E ++ PY+++ S+ + + ++G ++ S Sbjct: 473 EATTTASYQGFPSRFSSANHFHEPGYDQFSTTPYMESPSNAITGLPY--TTGFDYLNPPS 530 Query: 1810 GIGMGTSPLTERVSETRTHP-IPFTEN-LPGIPARRPLFEPVMGASFSGRNVYIDPRXXX 1983 G G S T+P PF N + + + P+ +S + + + P Sbjct: 531 GFG-------HPFSSKSTYPTYPFRPNTTTTVSQSQASWSPLYESSLTTLSPVVVP---- 579 Query: 1984 XXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNS--TMEFGMPSTARFSLYDDHIRPNMY 2157 L SGE S E G P + S YD H+RPNMY Sbjct: 580 ----------------------NLSSGEEVFLRSLPRNETGKPPSFPVSHYDAHLRPNMY 617 Query: 2158 K 2160 + Sbjct: 618 R 618 >gb|EMJ20137.1| hypothetical protein PRUPE_ppa002306mg [Prunus persica] Length = 690 Score = 286 bits (731), Expect = 4e-74 Identities = 236/669 (35%), Positives = 329/669 (49%), Gaps = 43/669 (6%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M++S + QDQRSN G+ED T MTIE LRARLL+ER++S++ARQR DEL + V ELE+QL Sbjct: 1 MNNSNQDTQDQRSNLGMEDSTAMTIEFLRARLLAERSVSRSARQRVDELERMVEELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDAS-EEFDSNSDGETTISDSKISNNSVKMKGAS 393 K+V LQ+K AEKAT DVLAILES G++D S EEFDS+SD ET SK+ N+ + + Sbjct: 61 KIVSLQRKMAEKATEDVLAILESQGISDISEEEFDSSSDQETH-QGSKVGNSLANEEESF 119 Query: 394 TDFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXX 573 RR +E GR E+ K + Sbjct: 120 VISKVRRKEQEEHSGSDADSSLIPGRSLSWKGRIDSPRSREKCKDLSVRRRSSFSSIGFS 179 Query: 574 
XLPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDD 753 +GKSC + + S+ ++H + V S+ LP+ + G E +R+ Sbjct: 180 SPRHHLGKSCRQ---------IKHKETRSDKFDSHENGVGASSEGLPNFSNGGPEKLREG 230 Query: 754 PESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKAHEEE-KTAQRE 930 E E + SR + +D M +AL QA+ EE + AQRE Sbjct: 231 SEFPEEKVLSNDSLSRTKENQRDSDLDFNGHGRDKDMEKALEHQAKLICENEEMEKAQRE 290 Query: 931 WEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVANTTFI 1110 WEE RENN+S DSCDPGN SD+TEERDE+KA S G Q T+ E + Sbjct: 291 WEEKFRENNTSTPDSCDPGNHSDITEERDEIKAQTPC-SAGVVVAQAQETKSEEGDVCLP 349 Query: 1111 ADHKPKAPNSFLAAQQVDTRNLQGP-NSSSMVPHETQLSEFAFPMSNGTPCKNLSGDSHC 1287 + N FL A VD LQ N S++ P +Q+ EFAFP NG + Sbjct: 350 KETFKIQQNGFLPASHVDMGGLQDQLNKSTVAP--SQVEEFAFPTENGKQNHESLENFAR 407 Query: 1288 DPASTSYIRSLANGS----SGDPLGYVPLPYANNGESSENRKEL-ALMPQNSSSNLDTVL 1452 P+ S+ L +GS S D V + G +S +R +L AL+P +S L VL Sbjct: 408 HPSHGSHPNPLVHGSAHNRSSDASSSVAGSGFHKGNASGSRSDLYALVPHDSQDRLGGVL 467 Query: 1453 EALQQAKLSLRDKLNNVAPPVVGSS-GRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQ 1629 +AL+QAKLSL+ + + P V G+S + IE +P ++ D +EIPVGC G+FRLPTDF Sbjct: 468 DALKQAKLSLQQNMTRL-PLVDGTSVHKSIEPSIPVMKTGDRVEIPVGCAGLFRLPTDFA 526 Query: 1630 FER--TKANYLGSD------PALSLSYQGSET-------TCNKLMPNPYIDARSSVLVN- 1761 E T++++LGS P ++ ET ++ +P+PYI+ R + N Sbjct: 527 VEEAATQSSFLGSSWSGRYCPETLVTSSFVETRPTFSMNAADRYVPSPYIETRQTFSTNA 586 Query: 1762 -DRF----WSSSGPSF-------------MEIRSGIGMGTSPLTERVSETRTHPIPFTEN 1887 DRF + S P+F ++ RS L+ SE+ P+ N Sbjct: 587 TDRFIPNAYVESRPNFPANAAEPFVTSPSVDTRSNFPADNRFLSGPYSESGYAQPPY-PN 645 Query: 1888 LPGIPARRP 1914 P +P R P Sbjct: 646 YPSVPDRTP 654 >ref|XP_006436666.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] gi|557538862|gb|ESR49906.1| hypothetical protein CICLE_v10030805mg [Citrus clementina] Length = 716 Score = 284 bits (726), Expect = 2e-73 Identities = 231/656 (35%), Positives = 322/656 (49%), Gaps = 27/656 (4%) Frame = +1 Query: 85 LEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQLKMVFLQKKRAEKATAD 264 +ED TMTIE 
LRARLLSER++SK+ARQRADEL +RV+ELE+QLK+V LQ+K+AEKATAD Sbjct: 1 MEDSNTMTIEFLRARLLSERSVSKSARQRADELARRVVELEEQLKLVSLQRKKAEKATAD 60 Query: 265 VLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGASTDFDARRHNREVXXXXX 444 VLAILE+NG+++ S+ FDS SD ET +S++ NN K + S D RR+ Sbjct: 61 VLAILENNGISEISDSFDSGSDQETPC-ESEVGNNFNKEEENSVDSKFRRNASVEHSGSG 119 Query: 445 XXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXXLPKRVGKSCXXXXXXX 624 R +E+ K RVGKSC Sbjct: 120 NDFSPVPHRGLSWNGRRGTKQSLEKYKDSYLRRRSSFASTGSSSPKNRVGKSCRQIRRRE 179 Query: 625 XXSVAEE-----VQDDSNLNNNHVD-RVDTCSQDLPDSHDIGTETVRDDPESHEVENPRE 786 S EE V+ DS N VD + L S + + + +S EN + Sbjct: 180 SKSAVEELKTEPVKVDSQENGGGTSLEVDRKPEVLRGSEAQEEQYLGEGSDSGCFENEKL 239 Query: 787 APSSRF----CGTVIPEVSVSSRTEQDISMGRALNDQAQKKA-HEEEKTAQREWEENVRE 951 CG D M +AL DQAQ +EE + AQREWEE RE Sbjct: 240 VTGGGIDFNGCGG-------------DKDMEKALEDQAQLIGRYEEMEKAQREWEERFRE 286 Query: 952 NNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGT-NYLNQGTEVEVANTTFIADHKPK 1128 NNSS DSCDPGN+SDVTEER+E K Q+ V GT N Q + EV + +++ K Sbjct: 287 NNSSTPDSCDPGNQSDVTEEREESKV--QVQRVAGTVNSQVQEAKTEVHLSNQLSNTKS- 343 Query: 1129 APNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLSGDSHCDPASTSY 1308 N FL Q D + P S + +FAF MSN + G++H P+ +S+ Sbjct: 344 --NGFLPPQSGDQKCSSTPASEPLA------QDFAFTMSNEKQNQESLGNNHYVPSHSSH 395 Query: 1309 IRSLANGSSGDPLGYVPLPYANNGESSENR------KELALMPQNSSSNLDTVLEALQQA 1470 R +GS + +N G SS ++ AL+P +SS + VLEAL+QA Sbjct: 396 HRLHPHGSPENQSSQTVS--SNTGSSSRREVSGSQSEQYALVPHQTSSGFNEVLEALKQA 453 Query: 1471 KLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQFERTKAN 1650 +LSLR K++++ S G+ IE + A+ D +EIPVGC G+FR+PTD+ E +KAN Sbjct: 454 RLSLRQKMSSLPSTESRSVGKVIEPSLSASTVWDRVEIPVGCSGLFRVPTDYAVETSKAN 513 Query: 1651 YLGSDPALSLSYQGSET-----TCNKLMPNPYIDARSSVLVND----RFWSSSGPSFMEI 1803 +L SD SL+ + + ++ + N +D RS+ ++ R +GPS + Sbjct: 514 FLVSDSRPSLANYNPTSGIGLVSDDQTVSNSLMDTRSTFAADNFRPTRDLFLTGPS-TDT 572 Query: 1804 RSGIGMGTSPLTERVSETRTHPIPFTENLPGIPARRPLFEPVMGASFSGRNVYIDP 1971 RS LT + S+TR+ + RP F+ + A Y+ P Sbjct: 573 
RSSYSAENRLLTRQYSDTRSR----------VSMMRPSFDSNLDAGLPSFRQYMYP 618 >ref|XP_002533661.1| hypothetical protein RCOM_0152200 [Ricinus communis] gi|223526443|gb|EEF28720.1| hypothetical protein RCOM_0152200 [Ricinus communis] Length = 665 Score = 277 bits (709), Expect = 1e-71 Identities = 219/606 (36%), Positives = 296/606 (48%), Gaps = 25/606 (4%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M++S E QDQR+N G+ED T MTIE LRARLLSER++S+TARQRADEL RV ELE+QL Sbjct: 1 MNNSDKEKQDQRTNSGMEDSTAMTIEFLRARLLSERSVSRTARQRADELATRVAELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 ++V LQ+ +AEKATAD+LAILE NG++D SE FDS SD +T +SK+ N S K + S Sbjct: 61 RIVSLQRMKAEKATADILAILEGNGISDISETFDSCSDRDTP-CESKVGNRSSKEEN-SI 118 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 + R ++ E GR +E+ K D Sbjct: 119 NSKVRNNDSEELSGSDFDFSSVPGRSLSWKGRKNSPRSLEKSK--DSSMRRRSSFSSVGS 176 Query: 577 LPK-RVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDI-----GTE 738 PK R GKSC E + + D V S + P D + Sbjct: 177 SPKQRPGKSCRQIRRKESRF---EYKASPVKRDCPEDEVAATSANFPSCSDFEPKRGEVK 233 Query: 739 TVRDDPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQK-KAHEEEK 915 + +D S + N R A + V D M +AL QAQ +E + Sbjct: 234 PLLEDSHSDCLGNERNASDNGLDYNVY---------RGDRDMEKALEHQAQLIGQYEAME 284 Query: 916 TAQREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAP---PQMYSVGGTNYLNQGTEV 1086 QREWEE RENNSS DSCD GNRSD+TEER E++ P P + T L E Sbjct: 285 KVQREWEEKFRENNSSTPDSCDHGNRSDITEERYEIREPAKGPATTNAIQTEGLLSVVE- 343 Query: 1087 EVANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKN 1266 V+NT P+ FL + VD L+ SS E + AFPM+ + Sbjct: 344 GVSNT---------QPHGFLPSSHVDAVCLEERKSSIAPVPEFSTQDSAFPMAKAKQNQK 394 Query: 1267 LSGDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENR---------KELALMP 1419 G++ P ++ S + GS L + +N SS N+ + AL+P Sbjct: 395 NPGNNDHSPLLIAHHDSASFGSQYSSGSQSVLSFPSNTGSSFNKGKATSGSENERCALVP 454 Query: 1420 QNSSSNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCP 1599 +S L VLEAL++A+ SL+ ++N + P V + + 
+E V SRD ++IPVGC Sbjct: 455 HKASGGLGGVLEALEEARQSLQQRINRL-PSVATTVRKSVESSVSTTISRDEVQIPVGCV 513 Query: 1600 GIFRLPTDFQFE-RTKANYLGSDPALSLSYQGSE-----TTCNKLMPNPYIDARSSVLVN 1761 G+FRLPTDF E T+AN L S LSL S+ N+ + +PY+ RSS Sbjct: 514 GLFRLPTDFSVEGNTRANLLSSSAQLSLGNHYSDRGVPAAASNQFVASPYLQGRSSSSTE 573 Query: 1762 DRFWSS 1779 D+F SS Sbjct: 574 DQFLSS 579 >ref|XP_004307047.1| PREDICTED: uncharacterized protein LOC101309582 [Fragaria vesca subsp. vesca] Length = 807 Score = 276 bits (705), Expect = 4e-71 Identities = 238/681 (34%), Positives = 331/681 (48%), Gaps = 35/681 (5%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M +S + QD R N G++D +TIE LRARLLSER++S++ARQRADEL K V ELE+QL Sbjct: 1 MHNSNQDTQDLRINSGMDDSPGITIEFLRARLLSERSVSRSARQRADELEKMVEELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K+V LQ+K AEKATADVLAILE+ G +D SEEFDS+SD E T +SK+ N S K + + Sbjct: 61 KIVSLQRKMAEKATADVLAILENQGASDISEEFDSSSDHE-TFQESKMGNKSRK-EEENF 118 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 RR+ E GR E+ K Sbjct: 119 LISERRNEHEEYSGSDLDSSSIPGRNLSWKGRIDSPRSREKYKEPSIRRRSTFSAVGSSS 178 Query: 577 LPKRVGKSCXXXXXXXXXSVAEEVQDD-SNLNNNHVDRVDTCSQDLPDSHDIGTETVRDD 753 +GKSC SV E +D+ + +++ + V S+ L + E +RD Sbjct: 179 SRHNLGKSCRQIKHRETRSVVERSKDEPAKFDDSEENGVAASSEGLSNFSYCDPERLRDG 238 Query: 754 PESHE---VENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKAHEEE-KTA 921 PES + + S P + R + M RAL QAQ EE + A Sbjct: 239 PESQKEKFLSKDALTRSKEHQRNGDPNFNGHGRNK---DMERALEHQAQLIGQNEEMEMA 295 Query: 922 QREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAP-PQMYSVGGTNYLNQGTEVEVAN 1098 QREWEE RENN+S DSCDPGN SD+TEERDEMK P P + Q + E + Sbjct: 296 QREWEEKFRENNTSTPDSCDPGNHSDITEERDEMKTPFPAEINASEA----QEAKSEARD 351 Query: 1099 TTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPM--------SNGT 1254 + + N +L V+ +Q + S V + + EFAFP S Sbjct: 352 SCLFEEKMKTQLNGYLPPSDVEMGGMQDQMNRSSVASASPIQEFAFPTAYERQTQESLEN 411 Query: 1255 PCKNLSGDSHCDP--ASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKEL-ALMPQN 
1425 S SH DP +S+ RS S G ++ +S +R +L AL+P + Sbjct: 412 NAHQPSPGSHHDPLLLESSHNRSSVVSSDGG---------SSFHNASGSRNDLYALVPHD 462 Query: 1426 SSSNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSG--RGIEHFVPAARSRDGLEIPVGCP 1599 S L VL+AL+QAKLSL+ K+ + P+V + IE +PA + + L+IPVGC Sbjct: 463 SQERLGGVLDALKQAKLSLQQKI--IRLPLVDDTSVQESIEPPIPAVTTGNRLDIPVGCA 520 Query: 1600 GIFRLPTDFQFER--TKANYLGSDPAL-SLSY---QG-SETTCNKLMPNPYIDARSSVLV 1758 G+FRLPTDF E TK +YLG +L S Y +G + ++ ++ + + Y++ R V Sbjct: 521 GLFRLPTDFAVEEAATKHSYLGLGSSLPSARYCPDKGLAASSTDQFVTSTYVETRPPYHV 580 Query: 1759 NDRFWSSSGPSFMEIRSGIGMGTSPL--TERVSETRTHPIPFTENL-------PGIPARR 1911 DRF +S ++E R + G L +ETR F+ N+ P I AR Sbjct: 581 GDRFVAS---PYVENRRTVSTGAGDLVVANPYAETRR---SFSSNVAGQFVTSPSIEARP 634 Query: 1912 PLFEPVMGASFSGRNVYIDPR 1974 P F G F R+ Y++ R Sbjct: 635 P-FSNNFGDRFVQRH-YVESR 653 >gb|EOY19205.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 709 Score = 272 bits (696), Expect = 5e-70 Identities = 216/609 (35%), Positives = 307/609 (50%), Gaps = 29/609 (4%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M +S QDQR+ +ED +TMTIE LRARLLSER++SK+ARQR DEL KRV ELE QL Sbjct: 1 MHNSDQVKQDQRTTCNVED-STMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K V +Q++RAEKATADVLAILE+NGV+D SEE DS+SD + +S I+N S K + +S Sbjct: 60 KFVSVQRRRAEKATADVLAILENNGVSDISEELDSSSDQDAPF-ESNINNGSTKEEESSV 118 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 R+ E GR + ER K Sbjct: 119 TSKVRQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPERYKDKLVRSRNSFASISFSS 178 Query: 577 LPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDDP 756 R GKSC SVAEE++ D+ + + V ++ S ++ +H G + P Sbjct: 179 RKHRQGKSCRQIRRRESRSVAEELKSDNIMVDPQVKGLEN-SSEVNANHSTGGPHIL--P 235 Query: 757 ESHEVENPREAPSSRFCGTVIPEVSVSS------RTEQDISMGRALNDQAQKKAH-EEEK 915 E+ + + + E +V+ E + M +AL QAQ H E + Sbjct: 236 MGSEIHENKSTVDNLHSDALKNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAME 295 Query: 916 
TAQREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVA 1095 AQREWEE RE NSS+ DSCDPGN SDVTEERDE+KA Q S T+ + QG E E Sbjct: 296 RAQREWEEKFREKNSSSPDSCDPGNHSDVTEERDEIKAQAQYVSGTATSQV-QGAEEE-- 352 Query: 1096 NTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHET-------QLSEFAFPMSNGT 1254 + +F A+ N + Q D LQ S + E+ Q F N Sbjct: 353 HISFSAELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKEN-- 410 Query: 1255 PCKNLSGDSHCDPASTSYIRSLANGSSGD-PLGYVPLPYANNG--ESSENRKEL-ALMPQ 1422 + S S+ P+++S+ + + S G+ + ++ ++ E N+ EL AL+P Sbjct: 411 --HHQSMQSNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPH 468 Query: 1423 NSSSNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPG 1602 +S VL++L+QA+LSL+ K++ ++ S G+ IE + + +EIP+GC G Sbjct: 469 ETSGRFTGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSG 528 Query: 1603 IFRLPTDFQFERTKANYLGSDPALSLSYQGSE-----TTCNKLMPNPYIDARSSVLVN-- 1761 +FR+PTD E KAN+LGS LSL+ + T N L+ Y++ +SS N Sbjct: 529 LFRVPTDISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQ 588 Query: 1762 ----DRFWS 1776 DRF+S Sbjct: 589 PVSSDRFFS 597 >gb|EOY19203.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508727307|gb|EOY19204.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 665 Score = 267 bits (683), Expect = 2e-68 Identities = 215/603 (35%), Positives = 297/603 (49%), Gaps = 23/603 (3%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M +S QDQR+ +ED +TMTIE LRARLLSER++SK+ARQR DEL KRV ELE QL Sbjct: 1 MHNSDQVKQDQRTTCNVED-STMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K V +Q++RAEKATADVLAILE+NGV+D SEE DS+SD + +S I+N S K + +S Sbjct: 60 KFVSVQRRRAEKATADVLAILENNGVSDISEELDSSSDQDAPF-ESNINNGSTKEEESSV 118 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 R+ E GR + ER K Sbjct: 119 TSKVRQKESEELSGSEFDCSSASGRSLSWKGRKSASHSPERYKDKLVRSRNSFASISFSS 178 Query: 577 LPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDDP 756 R GKSC SVAEE++ D+ + DP 
Sbjct: 179 RKHRQGKSCRQIRRRESRSVAEELKSDN---------------------------IMVDP 211 Query: 757 ESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKAHEEE-KTAQREW 933 + +EN E ++ G E+D M +AL QAQ H E + AQREW Sbjct: 212 QVKGLENSSEVNANHSTG------------EKD--MEKALEHQAQLIVHYEAMERAQREW 257 Query: 934 EENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVANTTFIA 1113 EE RE NSS+ DSCDPGN SDVTEERDE+KA Q S T+ + QG E E + +F A Sbjct: 258 EEKFREKNSSSPDSCDPGNHSDVTEERDEIKAQAQYVSGTATSQV-QGAEEE--HISFSA 314 Query: 1114 DHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHET-------QLSEFAFPMSNGTPCKNLS 1272 + N + Q D LQ S + E+ Q F N + S Sbjct: 315 ELPKIHSNDLVPPSQADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKEN----HHQS 370 Query: 1273 GDSHCDPASTSYIRSLANGSSGDP-LGYVPLPYANNG--ESSENRKEL-ALMPQNSSSNL 1440 S+ P+++S+ + + S G+ + ++ ++ E N+ EL AL+P +S Sbjct: 371 MQSNNSPSNSSHHFAHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPHETSGRF 430 Query: 1441 DTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPT 1620 VL++L+QA+LSL+ K++ ++ S G+ IE + + +EIP+GC G+FR+PT Sbjct: 431 TGVLDSLKQARLSLQQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPT 490 Query: 1621 DFQFERTKANYLGSDPALSLSYQGSE-----TTCNKLMPNPYIDARSSVLVN------DR 1767 D E KAN+LGS LSL+ + T N L+ Y++ +SS N DR Sbjct: 491 DISVEAPKANFLGSSSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDR 550 Query: 1768 FWS 1776 F+S Sbjct: 551 FFS 553 >gb|EXC25400.1| hypothetical protein L484_016782 [Morus notabilis] Length = 654 Score = 266 bits (680), Expect = 3e-68 Identities = 246/745 (33%), Positives = 333/745 (44%), Gaps = 37/745 (4%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDL--TTMTIESLRARLLSERAISKTARQRADELTKRVLELED 210 M+ S E QDQRS+ +ED T MTIE LRARLLSER++S++ARQRADEL KRV ELE+ Sbjct: 1 MADSNQEKQDQRSSSSMEDSQSTAMTIEFLRARLLSERSVSRSARQRADELEKRVEELEE 60 Query: 211 QLKMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGA 390 QL++V LQ+K AEKAT DVL+ILE++G++DASE +DS SD ET +++NN + Sbjct: 61 QLRIVSLQRKMAEKATVDVLSILENHGISDASETYDSGSDQET----HQVANNYANGEER 116 Query: 391 STDFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXX 570 S RR E GR E+ K 
Sbjct: 117 SV-VSKRRSVLEELSGSDLDSSPINGRSLSWKGRSDSSRSREKYKDSSVRRQNALSSSFG 175 Query: 571 XXLPKR-VGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLP-DSHDIGTETV 744 PK VGKSC +V E D ++ L DS + G T Sbjct: 176 SSSPKHYVGKSCRQIRCRETRTVVE----------------DHKTEPLKFDSQENGAAT- 218 Query: 745 RDDPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQKKA-HEEEKTA 921 P V+N R P+ + V+ Q+ M +AL +AQ +EE + A Sbjct: 219 ---PPEGSVKNDRRIPN---------HLDVNGHG-QEKDMKKALEHRAQLIGQYEEMEKA 265 Query: 922 QREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVANT 1101 QREWEE RENN+S DS DPGN SDVTE+RDE+KA +Y+VG + Q + + Sbjct: 266 QREWEEKYRENNTSTPDSYDPGNHSDVTEDRDEVKA-QTLYNVGID--IAQAVDAKSNKV 322 Query: 1102 TFIADHKPKAPNSFL----AAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNL 1269 + N FL + +Q ++ V Q EFAFP + + Sbjct: 323 DLSKESSKPQSNGFLHPTRTRAAMGDLKVQASSNIDPVASRFQAQEFAFPTAKEKEAQES 382 Query: 1270 SGDSHCDPASTSYIRSLANGS-SGDPLGYVPLPYANNGE-----SSENRKELALMPQNSS 1431 + P+ + + L + S P L A + S AL+P N Sbjct: 383 LENRDFRPSESPHHGQLLHRSLPNQPFDRGALSDAGSSSHKRDFSGSQNDLYALVPHNPP 442 Query: 1432 SNLDTVLEALQQAKLSLRDKLNNVAPPVVGSS------GRGIEHFVPAARSRDGLEIPVG 1593 L VL+AL+QAKLSL+ K+N + P+ G++ R IE P R D LEIPVG Sbjct: 443 VVLGGVLDALKQAKLSLQQKINRL--PLEGTTTQTVAVNRSIEPTQPGTRVGDRLEIPVG 500 Query: 1594 CPGIFRLPTDFQF--ERTKANYLGSDPALSLS--YQGSE---TTCNKLMPNPYIDARSSV 1752 C G+FRLPTDF T+AN+L S LSL Y ++ T ++ + +PYI++RS Sbjct: 501 CTGLFRLPTDFATVEASTQANFLSSGSRLSLEPYYPDNKVALTAPDRFLTSPYIESRSEF 560 Query: 1753 LVNDRFWSSSG------PSFMEIRSGIGMGTSPLT-ERVSETRTHPI--PFTENLPGIPA 1905 + RF +SS S + R T P + R S HP PF +++P IP+ Sbjct: 561 PPDVRFLTSSSVVSGSRASTLNSRFDSHFDTGPSSVNRYSNYPPHPSYPPFPDSMPRIPS 620 Query: 1906 RRPLFEPVMGASFSGRNVYIDPRXXXXXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNS 2085 L P S Sbjct: 621 DEGLRRPF--------------------------------------------------RS 630 Query: 2086 TMEFGMPSTARFSLYDDHIRPNMYK 2160 + FG+P RFS YDDH RPNMY+ Sbjct: 631 SRSFGLPED-RFSFYDDHGRPNMYR 654 >gb|EOY19202.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 749 Score = 264 bits (674), Expect = 2e-67 
Identities = 208/589 (35%), Positives = 296/589 (50%), Gaps = 29/589 (4%) Frame = +1 Query: 97 TTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQLKMVFLQKKRAEKATADVLAI 276 +TMTIE LRARLLSER++SK+ARQR DEL KRV ELE QLK V +Q++RAEKATADVLAI Sbjct: 60 STMTIEFLRARLLSERSVSKSARQRVDELAKRVAELEKQLKFVSVQRRRAEKATADVLAI 119 Query: 277 LESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGASTDFDARRHNREVXXXXXXXXX 456 LE+NGV+D SEE DS+SD + +S I+N S K + +S R+ E Sbjct: 120 LENNGVSDISEELDSSSDQDAPF-ESNINNGSTKEEESSVTSKVRQKESEELSGSEFDCS 178 Query: 457 XXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXXLPKRVGKSCXXXXXXXXXSV 636 GR + ER K R GKSC SV Sbjct: 179 SASGRSLSWKGRKSASHSPERYKDKLVRSRNSFASISFSSRKHRQGKSCRQIRRRESRSV 238 Query: 637 AEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDDPESHEVENPREAPSSRFCGTV 816 AEE++ D+ + + V ++ S ++ +H G + P E+ + + + Sbjct: 239 AEELKSDNIMVDPQVKGLEN-SSEVNANHSTGGPHIL--PMGSEIHENKSTVDNLHSDAL 295 Query: 817 IPEVSVSS------RTEQDISMGRALNDQAQKKAH-EEEKTAQREWEENVRENNSSALDS 975 E +V+ E + M +AL QAQ H E + AQREWEE RE NSS+ DS Sbjct: 296 KNERNVTGFDLDFHGYEGEKDMEKALEHQAQLIVHYEAMERAQREWEEKFREKNSSSPDS 355 Query: 976 CDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVANTTFIADHKPKAPNSFLAAQ 1155 CDPGN SDVTEERDE+KA Q S T+ + QG E E + +F A+ N + Sbjct: 356 CDPGNHSDVTEERDEIKAQAQYVSGTATSQV-QGAEEE--HISFSAELPKIHSNDLVPPS 412 Query: 1156 QVDTRNLQGPNSSSMVPHET-------QLSEFAFPMSNGTPCKNLSGDSHCDPASTSYIR 1314 Q D LQ S + E+ Q F N + S S+ P+++S+ Sbjct: 413 QADMDRLQDWRYSRSLSPESLNPNSPGQKLTFLMAKEN----HHQSMQSNNSPSNSSHHF 468 Query: 1315 SLANGSSGD-PLGYVPLPYANNG--ESSENRKEL-ALMPQNSSSNLDTVLEALQQAKLSL 1482 + + S G+ + ++ ++ E N+ EL AL+P +S VL++L+QA+LSL Sbjct: 469 AHPHDSPGNQAVQHISSDLGSHSCRELPRNKNELYALVPHETSGRFTGVLDSLKQARLSL 528 Query: 1483 RDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQFERTKANYLGS 1662 + K++ ++ S G+ IE + + +EIP+GC G+FR+PTD E KAN+LGS Sbjct: 529 QQKISTLSLVEGASVGKAIETSGSGRKVGERVEIPLGCSGLFRVPTDISVEAPKANFLGS 588 Query: 1663 DPALSLSYQGSE-----TTCNKLMPNPYIDARSSVLVN------DRFWS 1776 LSL+ + T N L+ Y++ +SS N DRF+S Sbjct: 589 SSQLSLANHYPDRGVAPTASNHLLTTSYMNTQSSSSSNYQPVSSDRFFS 
637 >ref|XP_004140985.1| PREDICTED: uncharacterized protein LOC101207733 [Cucumis sativus] Length = 671 Score = 261 bits (668), Expect = 8e-67 Identities = 233/719 (32%), Positives = 329/719 (45%), Gaps = 11/719 (1%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M + + QD RS G+ED T MTIE LRARLLSER++SK+ARQRADEL KRV ELE+QL Sbjct: 1 MENPDQDQQDPRSVPGVEDTTAMTIEFLRARLLSERSVSKSARQRADELAKRVAELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K+V LQ+K AEKATADVLAILE NG +D SE DSNSD ET + K+ + + + S+ Sbjct: 61 KIVSLQRKMAEKATADVLAILEDNGASDISETLDSNSDHET---EPKVEDGLAR-EDVSS 116 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 RR+ E G + E+ K Sbjct: 117 GTVRRRNEHEEYSGSNIDTSPVLGGSLSWKGRNDSPHTREKYKKHSIRSRSSFTSIGSSS 176 Query: 577 LPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDI-GTETVRDD 753 ++G+SC + E + S+ + + + + S + ++ + G +RD Sbjct: 177 PKHQLGRSCRQIKRRDTRPLDGEQELKSDALVDSSEEIPSTSLEDSQNYSVNGHSILRDG 236 Query: 754 PESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQK-KAHEEEKTAQRE 930 E E + G + + + D M +AL QAQ +E + AQRE Sbjct: 237 YEVREKTRSSSSGVHNSVGNSDQDNDIDGYEKVD-DMEKALKCQAQLIDQYEAMEKAQRE 295 Query: 931 WEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTEVEVANTTFI 1110 WEE RENN+S DSCDPGN SD+TEERDEM+A S N + +T + Sbjct: 296 WEEKFRENNNSTPDSCDPGNHSDITEERDEMRAQAPNLSNNPANEAKPQVAFD-CDTRDL 354 Query: 1111 ADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLSGDSHCD 1290 + + + A VD +LQ N++S + L EF FPM+N C+ +S + Sbjct: 355 SQAQTNGLGPSMCA--VDVEDLQDQNTNS-ISTSKSLEEFTFPMANVKQCQESQENSAQE 411 Query: 1291 PASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKELALMPQNSSSNLDTVLEALQQA 1470 P+ TS+ L +G PL + + E+ + +L + + LD VLEAL+QA Sbjct: 412 PSCTSH---LNHGLPERPLSSHGGINSYDQETPCSNNDLYALVPHEPPALDGVLEALKQA 468 Query: 1471 KLSLRDKLNNVAPPVVGSSGRGIEHFVPAA--RSRDGLEIPVGCPGIFRLPTDFQFE-RT 1641 KLSL K+ + P V G S + P + + D LEIPVGC G+FRLPTDF E + Sbjct: 469 KLSLTKKIIKL-PSVDGESESIDKSIGPLSIPKMGDRLEIPVGCAGLFRLPTDFAAEASS 527 Query: 1642 
KANYLGSDPAL----SLSYQGSETTCN-KLMPNPYIDARSSVLVNDRFWSSSGPSFMEIR 1806 +AN+L S L +G+ + N ++ P ++ RSS L + R SS R Sbjct: 528 QANFLASSSQLRSPTHYPGEGAALSANHQIFPGHEMEDRSSFLRDSRLRSSG------YR 581 Query: 1807 SGIGMGTSP-LTERVSETRTHPIPFTENLPGIPARRPLFEPVMGASFSGRNVYIDPRXXX 1983 +G G LT+ + E R P ++ F+ A V+ P Sbjct: 582 AGSGFTRDGFLTDHIPENRWKN----------PGQKHHFDQYFDAVQPSSYVHNYPPRPV 631 Query: 1984 XXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNSTMEFGMPSTARFSLYDDHIRPNMYK 2160 F ST MP T ++S YDD RPNMY+ Sbjct: 632 SSNIHPNDTFL----------------RTFPGRST---EMPPTNQYSFYDDQFRPNMYR 671 >gb|ESW15816.1| hypothetical protein PHAVU_007G104500g [Phaseolus vulgaris] Length = 652 Score = 235 bits (599), Expect = 8e-59 Identities = 220/715 (30%), Positives = 329/715 (46%), Gaps = 12/715 (1%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M +SV + QDQR ED T MTIE LRARLLSER+ISK+ARQRADEL ++V+ELE+QL Sbjct: 1 MQNSVHDPQDQRIASSTEDSTAMTIEFLRARLLSERSISKSARQRADELAEKVMELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 +MV LQ+K AEKATADVLAILES G++ S+EFDS SD E DS +SN K Sbjct: 61 RMVILQRKMAEKATADVLAILESQGISGVSDEFDSGSDLENPF-DSSMSNECAKEDEGPM 119 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 R+H + + + +E+ K Sbjct: 120 KSKGRQHGSDEMSGSNEDSSLVSSKSLSWKGRHDLSHSLEKYKTKSTNVRRQSSFSSFSS 179 Query: 577 LPK-RVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDD 753 PK R+GKSC SV EE + N V+ + + S+ P+ D G+ ++ + Sbjct: 180 SPKHRLGKSCRKIRHRQPRSVMEESRGKFVHVNCQVNELVSSSEGFPNFRDGGSNILKIE 239 Query: 754 PESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQK-KAHEEEKTAQRE 930 + E E+ EA ++ + ++ M +AL QA+ +E + AQRE Sbjct: 240 SKIQE-EDGSEA-------NLLSKNHHIDGYGRENEMEKALEHQAELIDQYEAMEKAQRE 291 Query: 931 WEENVRENNSSALDSCDPGNRSDVTEERDEMKA--PPQMYSVGGTNYLNQGTEVEVANTT 1104 WEE RENNS+ DSCDPGN SD+TE++DE K P V ++G V Sbjct: 292 WEEKFRENNSTTPDSCDPGNHSDMTEDKDEGKVQIPYAAKVVTSKAEESKGEPGGVC--- 348 Query: 1105 FIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLSGD-- 1278 +++ K KA + 
++ D ++ S+ S+F ++ +P K + Sbjct: 349 -LSEEKLKAEGREIMPKKHDDTDVYRNQKSTTF----STSDFLGQENSHSPLKGNQNEIL 403 Query: 1279 --SHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKEL-ALMPQNSSSNLDTV 1449 H + +++ + S + V + ++S+N+K+L AL+ + S D V Sbjct: 404 VNGHSQSSDMNHLDQGRHSSFPTDIHGV----QHQHDASKNQKDLYALVTREQSHQFDGV 459 Query: 1450 LEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTDFQ 1629 LE+L+QA++SL+ +LN + PVV G + +++ D EIP G G+FRLPTDF Sbjct: 460 LESLKQARISLQQELNRL--PVV-EGGYTAKPLPSVSKNEDRFEIPFGFSGLFRLPTDFS 516 Query: 1630 FERTKANYLGSDPALSLSYQGSETTCNKLMPNPYIDARSSVLVNDRFWSSSGPSFMEIRS 1809 E T + DP GS N M +R+SV +F+++ S Sbjct: 517 DEAT-PRFNVRDPTTGF---GSNYHLNGTM------SRTSV---GQFFTNPP------HS 557 Query: 1810 GIGMGTSPLTERVSETRTHPIPFTENLPGIPARRPLFEPVMGASFSGRNVYIDPRXXXXX 1989 G + + ++ TR + EN + + F+P + Y P Sbjct: 558 GKMLMSPSANDQALATR-----YLENGSRFSSSQSPFDPFSNGGPLSSSKYSYPTFPINP 612 Query: 1990 XXXXXXXXXXXXXXXXXXRLQLPSGERFSR---NSTMEFGMPSTARFSLYDDHIR 2145 Q+P G+ SR NST+ G+P RFS DDH+R Sbjct: 613 SYQNATP-------------QMPFGDEVSRPYSNSTV--GVPLANRFSFNDDHLR 652 >ref|XP_002311037.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] gi|222850857|gb|EEE88404.1| hypothetical protein POPTR_0008s02540g [Populus trichocarpa] Length = 684 Score = 233 bits (595), Expect = 2e-58 Identities = 226/738 (30%), Positives = 326/738 (44%), Gaps = 30/738 (4%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M++S E QDQR+ +ED T +TIE LRARLL+ER++S+TARQRADEL +RV ELE+QL Sbjct: 1 MNNSDQEKQDQRTRSSMEDSTAITIEFLRARLLAERSVSRTARQRADELAERVAELEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 ++V LQ+ +AEKAT DVLAILESNG++D SE F S+SD +T +SK+ + K + +S Sbjct: 61 RIVSLQRMKAEKATVDVLAILESNGISDDSEIFGSSSDQDTP-CESKVGKKT-KQEESSV 118 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 ++ E GR +E+ K D Sbjct: 119 ISKVTKYKLEEHSGSGHDFSSSQGRNLSWKGRKHSPRSLEKCK--DPSLRRRSSFASTSS 176 Query: 577 
LPK-RVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDD 753 PK GKSC + + + ++ + V T S+ P+ + Sbjct: 177 SPKHHQGKSCRQVRNKESRLTIGAFRTNPDKVDSPENGVATTSEVFPNC---------SE 227 Query: 754 PESHEVENPREAPSSRFCGTVIPEVSVSSRTEQ--------------DISMGRALNDQAQ 891 PE +EN E +P +SV Q D M +AL QAQ Sbjct: 228 PEVGRIENGEE--------KTLPPISVGLENGQRADSNELEDNVYGSDRDMEKALEHQAQ 279 Query: 892 K-KAHEEEKTAQREWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYL 1068 ++ + QREWEE RENN S DS D GNRSDVTEE E+KA Q ++ Sbjct: 280 LIDRYKAMEKVQREWEEKFRENNGSTPDSYDAGNRSDVTEEGYEIKAQVQQHTGTVAAQS 339 Query: 1069 NQG-TEVEVANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMS 1245 N+ +EVE A+ PN L V+ LQ SSS E+ +FAF Sbjct: 340 NRAKSEVEKASNI--------QPNGILRPSHVNIGQLQEWKSSSAPTSESPAQDFAFRAE 391 Query: 1246 NGTPCKN---LSGDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANN-------GESSEN 1395 +N L + H P S S+ ++ S P + +N G+ S Sbjct: 392 KQKQNENEESLGNNYHPSPHS-SHDHPQSHSSHDSPGSQSATSFPSNTDSGFSKGQFSGR 450 Query: 1396 RKEL-ALMPQNSSSNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRD 1572 + EL AL+P +S+ L VL+AL+ A+ SL+ K++ + GS ++ +P D Sbjct: 451 QNELYALVPHRASNELGGVLDALKLARQSLQQKISTLPLIEGGSIRNSVDPSLPPPIPGD 510 Query: 1573 GLEIPVGCPGIFRLPTDFQFE-RTKANYLGSDPALSLSYQGSETTCNKLMPNPYIDARSS 1749 ++IP+G G+FRLP DF E T+ N ++ LSL +T N ++ +R Sbjct: 511 KVDIPLGNAGLFRLPFDFLAEGSTRKNLDSTNAGLSLRNYYPDTGVPAAAINRFV-SRFP 569 Query: 1750 VLVNDRFWSSSGPSFMEIRSGIGMGTS-PLTERVSETRTHPIPFTENLPGIPARRPLFEP 1926 RF + F+ +S G+ P ++ ++ E I ++RP F P Sbjct: 570 TATGSRF--PTADQFLASQSYSATGSRFPTEDQFLASQD-----VEAGSRISSQRPFFYP 622 Query: 1927 VMGASFSGRNVYIDPRXXXXXXXXXXXXXXXXXXXXXXXRLQLPSGERFSRNSTMEFGMP 2106 + Y P QLPS E S + G+P Sbjct: 623 YLDTVSPPSARYSYPTNPSYPGPMP----------------QLPSREPPSFLPSTTAGVP 666 Query: 2107 STARFSLYDDHIRPNMYK 2160 FS D HIRPNMY+ Sbjct: 667 PADHFSFPDYHIRPNMYR 684 >ref|XP_006606287.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X4 [Glycine max] Length = 641 Score = 222 bits (566), Expect = 6e-55 Identities = 196/597 (32%), Positives = 280/597 (46%), Gaps = 20/597 (3%) 
Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M +SV + QDQR +ED T MTIE LRARLLSER+IS++A+QRADEL K+V++LE+QL Sbjct: 1 MQNSVLDPQDQRVTSCMEDSTAMTIEFLRARLLSERSISRSAKQRADELAKKVMDLEEQL 60 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K V LQ+K AEKATADVLAILES G++D SEEFDS SD E DS +SN K Sbjct: 61 KTVILQRKMAEKATADVLAILESEGISDVSEEFDSGSDLENP-CDSSVSNECAKEGEEPM 119 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXX 576 R+H + + + +E K+ Sbjct: 120 SSKGRQHGSDKMPGSNVDSSPVSSKSLSWKGRHDSSHSLE--KYKTSNLRRQSSFSSISS 177 Query: 577 LPK-RVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDD 753 PK R GKSC V EE N NH + + S+ P+ G+ + + Sbjct: 178 SPKHRQGKSCRKIRHRQIRLVVEE---SRNKFANHEKELASLSKGFPNFSGGGSNIPKIE 234 Query: 754 PESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQK-KAHEEEKTAQRE 930 E E P ++ V R E+D M +AL QAQ +E + QRE Sbjct: 235 SEIQEEGGSGANPLNK-----NHHVDGYGR-EKD--MEKALEHQAQLIDQYEAMEKVQRE 286 Query: 931 WEENVRENNSSALDSCDPGNRSDVTEERDEMKA--PPQMYSVGGTNYLNQGTEVEVANTT 1104 WEE RENNS+ DSCDPGN SD+TE++DE K P V ++G V Sbjct: 287 WEEKFRENNSTTPDSCDPGNYSDMTEDKDESKVHIPFAAKVVTSDAQESKGEPRGVC--- 343 Query: 1105 FIADHKPKA-PNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNL---- 1269 +++ K KA + DT +++ + + + P G ++ Sbjct: 344 -LSEEKFKAEARDIMPKTHDDTGGYSDQKNTTFSTSDLLGQQNSCPPLKGNQNESSVNGH 402 Query: 1270 ---SGDSHCDPASTSYIRSLANGSSGDPLGYVPLPYANNGESSENRKEL-ALMPQNSSSN 1437 S +H DP Y S S + V + ++S N+ +L AL+ Sbjct: 403 FQPSVMNHQDPGRHGYHDSKPTYSFPTDIHGV----QHQNDASRNKTDLFALVTHEQPHK 458 Query: 1438 LDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLP 1617 + VLE+L+QA++SL+ +L + P+V SG + ++S D E+PVGC G+FR+P Sbjct: 459 FNGVLESLKQARISLQQELKRL--PLV-ESGYTAKPSASFSKSEDRFEVPVGCSGLFRIP 515 Query: 1618 TDFQFERTKANYLGSDPA------LSLSYQGSETTCNKLMPN-PYIDARSSVLVNDR 1767 TDF + A + DP L+ S T+ + P+ PY D + S+ ND+ Sbjct: 516 TDFS-DGATARFNVKDPTAGFGSNFHLNRAMSRTSDGQFFPSLPYPDTQLSLPANDQ 571 >ref|XP_004496182.1| PREDICTED: uncharacterized protein LOC101514253 isoform 
X1 [Cicer arietinum] Length = 663 Score = 220 bits (561), Expect = 2e-54 Identities = 196/600 (32%), Positives = 287/600 (47%), Gaps = 23/600 (3%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M + + QDQR +ED T+MTIE LRARLL+ER+IS++ARQR EL K+V ELE+QL Sbjct: 3 MQTPTLDPQDQRVTSCMEDSTSMTIEFLRARLLAERSISRSARQRTAELEKKVAELEEQL 62 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 + V LQ+K AEKATADVLAILE G++D SEE DS SD + +S +SN S K Sbjct: 63 RTVTLQRKMAEKATADVLAILEDQGISDLSEELDSGSDIDIPY-ESGVSNESSKEGERYR 121 Query: 397 DFDARRHNR-EVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXX 573 RRH E+ R +E+ K + Sbjct: 122 SSKERRHESDELYDSHVVDSSPVSNRSLSWKGRHDSPRSLEKYKTSNIRRRNSFSSVSSS 181 Query: 574 XLPKR-VGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRD 750 PK GKSC SV EE +D S +N + + S+ P+ G+ +R Sbjct: 182 --PKHHQGKSCRKIRHRQNRSVVEESRDKSVKDNFQENDFVSSSEGYPNRSVDGSNILRI 239 Query: 751 DPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQ--KKAHEEEKTAQ 924 + + E + ++ + R + M +AL QAQ + EK AQ Sbjct: 240 ESKILEGDESEV--------NLVNKNHHVDRCGRKEDMEKALEHQAQLIDRFGAMEK-AQ 290 Query: 925 REWEENVRENNSSAL-DSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTE----VE 1089 REWEE RENN+S DSCDPGN SD+TE+++E KA S T+ + V Sbjct: 291 REWEEKFRENNNSTTPDSCDPGNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGVR 350 Query: 1090 VANTTFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNL 1269 + F ++ + P S+ + +N +S+++ E S NG ++ Sbjct: 351 SSEEIFKSEARDVMPKSYDDTSDYNNQNSPTFRTSNLLGQENLHSPL-----NGNQTES- 404 Query: 1270 SGDSHCDPASTSYIRSLANG--SSGDPLG---YVPLPYANNGESSENRKEL-ALMPQNSS 1431 S +SH + +Y G S L Y+ + +SS N+ +L AL+ + S Sbjct: 405 SVNSHPQSSEVNYHDPHGRGYPDSKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFREQS 464 Query: 1432 SNLDTVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFR 1611 + +LE+L+QA+LSL+ +LN + P+V SS +GI+ +S +IPVG G+FR Sbjct: 465 HEFNGILESLKQARLSLQQELNRL--PLVESSHKGIKPSAFVGKSEGRFDIPVGFSGLFR 522 Query: 1612 LPTDFQFE-------RTKANYLGSDPALSLSYQGSETTCN-KLMPNPYIDARSSVLVNDR 1767 LPTDF E R A GS+ + +G+ T + + + 
NPY R S+ ND+ Sbjct: 523 LPTDFSDEATSRFGVRDSAGGFGSN--FYHNNRGTSRTSDVQFVANPYYGTRMSLSANDQ 580 >ref|XP_004249997.1| PREDICTED: uncharacterized protein LOC101251943 [Solanum lycopersicum] Length = 729 Score = 216 bits (551), Expect = 3e-53 Identities = 231/769 (30%), Positives = 328/769 (42%), Gaps = 61/769 (7%) Frame = +1 Query: 37 MSSSVSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQL 216 M+S EDQDQ G+ED T TIE LR RLL+ER+ S+TA+QRADEL + V ELE+QL Sbjct: 1 MASFGKEDQDQSKIDGVEDSKT-TIEFLRGRLLAERSASRTAKQRADELAQMVSELEEQL 59 Query: 217 KMVFLQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGAST 396 K+V LQ+KRAEKATA VL+ILE + + D SEEF S SD ET +SD K + N G Sbjct: 60 KVVSLQRKRAEKATAAVLSILEDHSIDDVSEEFSSGSDKETILSDQKDAGNKT---GGDI 116 Query: 397 DFDARRHNREVXXXXXXXXXXXXGRXXXXXXXXXXXNY-VERKKFMD-XXXXXXXXXXXX 570 A+ +V ++ ++R+K+ D Sbjct: 117 SSSAKEKEDDVDILSSSGTVSSSSTARSLSWKSGKSSHSLDRRKYTDSNRRRYSNFSYTD 176 Query: 571 XXLPKRVGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRD 750 PKRVG SC S ++++++ S + S+ L S + ++ Sbjct: 177 ISSPKRVGNSCRQIRRRDTRSASDKLRNSS---------AECASEPLSSSANNEPHSLTA 227 Query: 751 DPESHEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQK-KAHEEEKTAQR 927 +V + P+ G + + D RAL+ Q Q +E E+ AQR Sbjct: 228 GAGISDVNDQVHVPALDVPG------NGREADKSDEDSQRALHQQVQPIGQYEAEEKAQR 281 Query: 928 EWEENVRENNSSALDSCDPGNRSDVTEERDEMKAPPQMYSVGGT---NYLNQGTEVEVAN 1098 EWEE RE+NS DSCD N SDVTEERD++KA + G T N+ NQ +V+ Sbjct: 282 EWEEKYRESNSCTPDSCDRENYSDVTEERDDLKASQEPCLAGRTSMQNHANQCGAADVSR 341 Query: 1099 T--TFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLS 1272 T D+ P PN V+ L+ S V ++ SE A PMS G +N Sbjct: 342 TKQNGNIDNSPSTPN-------VNMSCLEDKKGSRTVGSDSSASELARPMSTGNYLEN-- 392 Query: 1273 GDSHCDPASTSYIRSL-ANGSSGDPLGYVPLPYANNGESSENRKELALMPQNSSSNLDTV 1449 H ++ S+ +S SS P G++ + ELAL+ N+S+ +D+V Sbjct: 393 ---HGQTSAFSHQQSFPVTRSSMHPRS----SSLQAGQALQTGYELALVSHNTSNGVDSV 445 Query: 1450 LEALQQAKLSLRDKLNNVAP----PVVGSSGRGIEHFVPAARSRDGLEIPVGCPGI-FRL 1614 L L+QAKLSL ++N+ P P S + H + EI + P + R Sbjct: 446 
LGKLEQAKLSLTKQINSSLPTASYPGTPSRFSSLNH----SPELSTYEISLTPPYVESRS 501 Query: 1615 PTDFQFERTKANYLGSDPALSLSYQG----SETTCNKLMPN--PYIDARSSVLVNDR--- 1767 Q R + + P +S S SET P+ PY+++RS + Sbjct: 502 KYVTQSNRVTYPFQRAFPEVSSSAPSYRPISETNFEAGQPSSTPYVESRSKYVTQSNRVT 561 Query: 1768 --------FWSSSGPSFMEIRSGIGMGTSPLTERVSETRTHPIPFTENL-----PGIPAR 1908 SSS PS+ I P + R + + +PF+ L P P Sbjct: 562 YPFQRAFTEVSSSAPSYRPISETNFDAGQPSSVRFNPNSSSRLPFSSKLTYPSYPKFPDM 621 Query: 1909 RPLFEP---------------VMGASFSGRNVYIDPRXXXXXXXXXXXXXXXXXXXXXXX 2043 P P SFS + + PR Sbjct: 622 VPKLPPNEVFSRNFPTNETDLPPSFSFSTLSQEVVPRLPSTEKVSRIFPTNETNPPPSFS 681 Query: 2044 RL--------QLPSGERFSRN-STMEFGMPSTARFSLYDD-HIRPNMYK 2160 +LPS E F RN T E G+P + F+L +D HIRPNMY+ Sbjct: 682 FSTLSPEVVPRLPSTEVFPRNIPTNEAGIPPS--FALRNDPHIRPNMYR 728 >ref|XP_004496183.1| PREDICTED: uncharacterized protein LOC101514253 isoform X2 [Cicer arietinum] gi|502118270|ref|XP_004496184.1| PREDICTED: uncharacterized protein LOC101514253 isoform X3 [Cicer arietinum] gi|502118272|ref|XP_004496185.1| PREDICTED: uncharacterized protein LOC101514253 isoform X4 [Cicer arietinum] Length = 660 Score = 213 bits (543), Expect = 3e-52 Identities = 193/596 (32%), Positives = 284/596 (47%), Gaps = 23/596 (3%) Frame = +1 Query: 49 VSEDQDQRSNGGLEDLTTMTIESLRARLLSERAISKTARQRADELTKRVLELEDQLKMVF 228 +S+ R +ED T+MTIE LRARLL+ER+IS++ARQR EL K+V ELE+QL+ V Sbjct: 4 ISDFSVTRVTSCMEDSTSMTIEFLRARLLAERSISRSARQRTAELEKKVAELEEQLRTVT 63 Query: 229 LQKKRAEKATADVLAILESNGVTDASEEFDSNSDGETTISDSKISNNSVKMKGASTDFDA 408 LQ+K AEKATADVLAILE G++D SEE DS SD + +S +SN S K Sbjct: 64 LQRKMAEKATADVLAILEDQGISDLSEELDSGSDIDIPY-ESGVSNESSKEGERYRSSKE 122 Query: 409 RRHNR-EVXXXXXXXXXXXXGRXXXXXXXXXXXNYVERKKFMDXXXXXXXXXXXXXXLPK 585 RRH E+ R +E+ K + PK Sbjct: 123 RRHESDELYDSHVVDSSPVSNRSLSWKGRHDSPRSLEKYKTSNIRRRNSFSSVSSS--PK 180 Query: 586 R-VGKSCXXXXXXXXXSVAEEVQDDSNLNNNHVDRVDTCSQDLPDSHDIGTETVRDDPES 762 GKSC SV EE +D S +N + + S+ P+ G+ +R + + Sbjct: 181 HHQGKSCRKIRHRQNRSVVEESRDKSVKDNFQENDFVSSSEGYPNRSVDGSNILRIESKI 240 Query: 
763 HEVENPREAPSSRFCGTVIPEVSVSSRTEQDISMGRALNDQAQ--KKAHEEEKTAQREWE 936 E + ++ + R + M +AL QAQ + EK AQREWE Sbjct: 241 LEGDESEV--------NLVNKNHHVDRCGRKEDMEKALEHQAQLIDRFGAMEK-AQREWE 291 Query: 937 ENVRENNSSAL-DSCDPGNRSDVTEERDEMKAPPQMYSVGGTNYLNQGTE----VEVANT 1101 E RENN+S DSCDPGN SD+TE+++E KA S T+ + V + Sbjct: 292 EKFRENNNSTTPDSCDPGNHSDMTEDKEESKAQIPYSSKAVTSNAQEDKAEPGGVRSSEE 351 Query: 1102 TFIADHKPKAPNSFLAAQQVDTRNLQGPNSSSMVPHETQLSEFAFPMSNGTPCKNLSGDS 1281 F ++ + P S+ + +N +S+++ E S NG ++ S +S Sbjct: 352 IFKSEARDVMPKSYDDTSDYNNQNSPTFRTSNLLGQENLHSPL-----NGNQTES-SVNS 405 Query: 1282 HCDPASTSYIRSLANG--SSGDPLG---YVPLPYANNGESSENRKEL-ALMPQNSSSNLD 1443 H + +Y G S L Y+ + +SS N+ +L AL+ + S + Sbjct: 406 HPQSSEVNYHDPHGRGYPDSKPTLSFPKYIQHGSLHQNDSSRNKNDLYALVFREQSHEFN 465 Query: 1444 TVLEALQQAKLSLRDKLNNVAPPVVGSSGRGIEHFVPAARSRDGLEIPVGCPGIFRLPTD 1623 +LE+L+QA+LSL+ +LN + P+V SS +GI+ +S +IPVG G+FRLPTD Sbjct: 466 GILESLKQARLSLQQELNRL--PLVESSHKGIKPSAFVGKSEGRFDIPVGFSGLFRLPTD 523 Query: 1624 FQFE-------RTKANYLGSDPALSLSYQGSETTCN-KLMPNPYIDARSSVLVNDR 1767 F E R A GS+ + +G+ T + + + NPY R S+ ND+ Sbjct: 524 FSDEATSRFGVRDSAGGFGSN--FYHNNRGTSRTSDVQFVANPYYGTRMSLSANDQ 577