BLASTX nr result
ID: Catharanthus23_contig00005348
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005348 (1987 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261...  465  e-128
ref|XP_004245198.1| PREDICTED: uncharacterized protein LOC101264...  456  e-125
ref|XP_006364287.1| PREDICTED: CRC domain-containing protein TSO...  453  e-124
emb|CBI21012.3| unnamed protein product [Vitis vinifera]  439  e-120
ref|XP_006450481.1| hypothetical protein CICLE_v10007369mg [Citr...  419  e-114
ref|XP_006483326.1| PREDICTED: CRC domain-containing protein TSO...  417  e-114
gb|EOY31444.1| Tesmin/TSO1-like CXC domain-containing protein, p...  396  e-107
gb|EOY31443.1| Tesmin/TSO1-like CXC domain-containing protein, p...  396  e-107
ref|XP_002515547.1| tso1, putative [Ricinus communis] gi|2235454...  376  e-101
gb|EXC26038.1| hypothetical protein L484_005620 [Morus notabilis]  360  1e-96
ref|XP_006483327.1| PREDICTED: CRC domain-containing protein TSO...  352  4e-94
ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine...  319  2e-84
ref|XP_002309611.2| cysteine-rich polycomb-like family protein [...  315  5e-83
gb|EMJ26483.1| hypothetical protein PRUPE_ppa001375mg [Prunus pe...  315  5e-83
gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japo...  313  1e-82
emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japon...  313  1e-82
ref|XP_006596108.1| PREDICTED: LOW QUALITY PROTEIN: CRC domain-c...  302  4e-79
ref|XP_004498661.1| PREDICTED: protein tesmin/TSO1-like CXC 2-li...  296  3e-77
ref|XP_004498660.1| PREDICTED: protein tesmin/TSO1-like CXC 2-li...
296 3e-77 gb|EOY31447.1| Tesmin/TSO1-like CXC domain-containing protein, p... 255 4e-65 >ref|XP_002282493.2| PREDICTED: uncharacterized protein LOC100261127 [Vitis vinifera] Length = 1001 Score = 465 bits (1196), Expect = e-128 Identities = 244/518 (47%), Positives = 315/518 (60%), Gaps = 16/518 (3%) Frame = -3 Query: 1880 SVSSDDSRHDGXXXXXXXXXXXXXPYGVKHFNDSLEPKPIELQPSPGDKRKSVSEIADSV 1701 S ++D R++ +G++ D L KPIEL P DKRK SE DS+ Sbjct: 483 SAGTEDRRNENKASIAMSSATSQSSHGLEPSKDPLLLKPIELHEIPCDKRKFNSEHMDSL 542 Query: 1700 DDFXXXXXXXXXXKTLDTGESNGCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNR 1521 ++F K T + +GCKRCNCKK+KCLKLYCDCFAAGIYCAE CAC GC NR Sbjct: 543 EEFNQPSPKKRRKKASSTNDGDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNR 602 Query: 1520 PDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKK 1341 +Y+D VLETR+QIESRNPLAFAPKI+ + P +S G+DG R TP+SARHKRGCNCKK Sbjct: 603 AEYDDRVLETRKQIESRNPLAFAPKIVPPVNGSPINS-GEDG-RSTPSSARHKRGCNCKK 660 Query: 1340 SKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKL 1161 S CLKKYCECYQ+NVGCS GCRCEGC+N+YGRK EY K+++++ N + L+ SFDKKL Sbjct: 661 SMCLKKYCECYQANVGCSAGCRCEGCKNVYGRKEEYGAFKEMMSRRANDDRLESSFDKKL 720 Query: 1160 ELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPY 981 E+ A R L +LCNPHN++P+TPSFQCS+HG A+ + S R SPES + L Y Sbjct: 721 EMVAARSDFLRPELCNPHNMTPMTPSFQCSDHGNDAANFQIPSRRYAQSPESDFSILSSY 780 Query: 980 GMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYGSGETLLPFSPGFDG-----HKRXX 816 G SPV N ++H+++ E E ++ ++ E EYG+ E + F+PG D H Sbjct: 781 GKSPVSPKNSDSHNILPEPGKEIWDMACYEHEFEYGNAEKVDQFTPGCDRLSDVCHFTSL 840 Query: 815 XXXXXXXXPNSS---------NLLKSQMFPGNRNISSSSYLKWRSSPVSPMTQFGGTKLL 663 N S N+ + Q+ PG+ ++ S L+WRSSP+ P TQ G TK+ Sbjct: 841 PSSLPTTAMNPSASSKTSGLTNVSRGQLCPGSDHLVSGGSLRWRSSPIPPSTQLGETKIF 900 Query: 662 EVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKRVXXXXXXXXXXXXXXXXXXX 483 + + D+ LY +L+D+TP ILK+ AVK SSP +KRV Sbjct: 901 QGLESDNELYDILEDDTPTILKDASTPTKAVKASSPKQKRV-SPPKIQSHERGSSSSLAI 959 Query: 482 XXXXRKFILQAVPSFPPLTPTID--SEPYADGQDMNDS 375 RKFILQAVPSFPPLTP ID S + + D DS 
Sbjct: 960 LKSGRKFILQAVPSFPPLTPCIDTKSSSHQNSSDPQDS 997 >ref|XP_004245198.1| PREDICTED: uncharacterized protein LOC101264540 [Solanum lycopersicum] Length = 633 Score = 456 bits (1173), Expect = e-125 Identities = 256/547 (46%), Positives = 324/547 (59%), Gaps = 5/547 (0%) Frame = -3 Query: 1982 QRGSLSILGKKSVSTMSCHSSKN---CSISLNGAEGISVSSDDSRHDGXXXXXXXXXXXX 1812 QRG+LSI GKK S MSCH SKN C IS N S +D H+ Sbjct: 94 QRGNLSIRGKKLTSMMSCHPSKNLKNCLISANVVGSNLTSDNDGIHESYRSDAESAAASL 153 Query: 1811 XPYGVKHFNDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNG 1632 K ND++ KP E PS +KRK SE DS D+ K D + +G Sbjct: 154 SHNNAKLLNDTVLLKPTEHTPS--NKRKLNSEHIDSNMDYNQSSPQKKRKKISDGNDGDG 211 Query: 1631 CKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFA 1452 CKRCNCKKTKCLKLYCDCFAAG+YC +SC CQGC NRP+YEDTVL+ RQQI+SRNPLAFA Sbjct: 212 CKRCNCKKTKCLKLYCDCFAAGVYCVDSCTCQGCFNRPEYEDTVLDVRQQIQSRNPLAFA 271 Query: 1451 PKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRC 1272 PKI+QH PA+ G+ FTP+SARHKRGCNCKKS CLKKYCECYQ+NVGCS GCRC Sbjct: 272 PKIVQHSTNSPANILGEGVASFTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSSGCRC 331 Query: 1271 EGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPL 1092 EGC+N++G K EY + DLVNKH E L+ S ++++E+ LL + N N +PL Sbjct: 332 EGCKNVFGPKEEYGI--DLVNKHCITESLERSVEEEVEMVTATSGLLQSGPINQCNSTPL 389 Query: 1091 TPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEF 912 TPSF+ SN+ AS+SW +SGR +SPESG PYG+SP + NH ET + Sbjct: 390 TPSFRRSNN-VDASKSWFTSGRYLSSPESGQADTAPYGLSPGSPRSSNNHDTHQETIGDM 448 Query: 911 MNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 ++LV+FD EL YG+ + SPGF+ + ++ Q+ P + S Sbjct: 449 LDLVTFDHELSYGNAKLANEISPGFNVTGNMDDILALPKSQDWASNSGGQLIPQTVHFQS 508 Query: 731 SSYLKWRSSPVSPMTQFGGTKL--LEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSS 558 + L WR+SP++ MTQF G+ + LE+ D D + Y +L+D+TPEILK++ VKV+S Sbjct: 509 TDPLSWRNSPMTHMTQFDGSGMNALELLDSDKKPY-VLEDDTPEILKDSSIPQIGVKVNS 567 Query: 557 PNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQDMND 378 PNKKRV RKFIL+AVPSFPPL+P I S+ A N 
Sbjct: 568 PNKKRV-SPPYRHLNEIGSSSSGGGLKTGRKFILRAVPSFPPLSPCIQSKNVAAHSTDNS 626 Query: 377 SQGRSRK 357 + S K Sbjct: 627 EKDSSSK 633 >ref|XP_006364287.1| PREDICTED: CRC domain-containing protein TSO1-like [Solanum tuberosum] Length = 962 Score = 453 bits (1166), Expect = e-124 Identities = 254/548 (46%), Positives = 324/548 (59%), Gaps = 6/548 (1%) Frame = -3 Query: 1982 QRGSLSILGKKSVSTMSCHSSKN---CSISLNGA-EGISVSSDDSRHDGXXXXXXXXXXX 1815 QRG+LSI GKK S MSCH SKN C IS N +++ +D +H+ Sbjct: 425 QRGNLSIRGKKLTSMMSCHPSKNLKNCLISSNVVGSNLTIGDNDGKHESYGSDAEIVAAS 484 Query: 1814 XXPYGVKHFNDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESN 1635 K ND++ KP E PS +KRK SE +S D+ KTLD + + Sbjct: 485 LSLNNAKPLNDTVLLKPTEHTPS--NKRKFNSEHINSNMDYNQSSPQKKRKKTLDGNDGD 542 Query: 1634 GCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAF 1455 GCKRCNCKKTKCLKLYCDCFAAG+YC +SC CQGC NRP+YEDTVL+ RQQI+SRNPLAF Sbjct: 543 GCKRCNCKKTKCLKLYCDCFAAGVYCVDSCTCQGCFNRPEYEDTVLDVRQQIQSRNPLAF 602 Query: 1454 APKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCR 1275 APKI+QH PA+ G+ G FTP+SARHKRGCNCKKS CLKKYCECYQ+NVGCS GCR Sbjct: 603 APKIVQHSTSSPANILGEGGASFTPSSARHKRGCNCKKSMCLKKYCECYQANVGCSSGCR 662 Query: 1274 CEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSP 1095 CEGC+N++G K E + DLVNKH E L+ S ++++E+ LL + N N +P Sbjct: 663 CEGCKNVFGPKEECGI--DLVNKHCITERLERSVEEEVEMVTATSGLLQSGPINQCNSTP 720 Query: 1094 LTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNE 915 LTPSF+CSN AS+SW +SGR +SP+SG YG+SP + NH + ET+ + Sbjct: 721 LTPSFRCSN-SVDASKSWFTSGRYLSSPDSGQANTAAYGLSPGSPRSSNNHDIHQETTGD 779 Query: 914 FMNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNIS 735 ++LV+FD EL YG+ + SPGF + ++ Q+ P + Sbjct: 780 MLDLVTFDHELNYGNAKLANEISPGFHVTGNMDDILALPKSQDWASNSGGQLIPQTVHFQ 839 Query: 734 SSSYLKWRSSPVSPMTQF--GGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVS 561 S+ L W + SPMTQF G LE+ D D + Y +++D+TPEILK + L VKV+ Sbjct: 840 STDPLSWHN---SPMTQFDRSGMNTLELLDSDKKPY-VMEDDTPEILKASSILQIGVKVN 895 Query: 560 
SPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQDMN 381 SPNKKRV RKFIL+AVPSFPPL+P I S+ A N Sbjct: 896 SPNKKRV-SPPYRHLNEIGSSSSGGGLKTGRKFILRAVPSFPPLSPCIQSKDVAVHSTNN 954 Query: 380 DSQGRSRK 357 + S K Sbjct: 955 SEKDSSSK 962 >emb|CBI21012.3| unnamed protein product [Vitis vinifera] Length = 1094 Score = 439 bits (1129), Expect = e-120 Identities = 218/427 (51%), Positives = 280/427 (65%), Gaps = 2/427 (0%) Frame = -3 Query: 1649 TGESNGCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESR 1470 T + +GCKRCNCKK+KCLKLYCDCFAAGIYCAE CAC GC NR +Y+D VLETR+QIESR Sbjct: 567 TNDGDGCKRCNCKKSKCLKLYCDCFAAGIYCAEGCACVGCFNRAEYDDRVLETRKQIESR 626 Query: 1469 NPLAFAPKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGC 1290 NPLAFAPKI+ + P +S G+DG R TP+SARHKRGCNCKKS CLKKYCECYQ+NVGC Sbjct: 627 NPLAFAPKIVPPVNGSPINS-GEDG-RSTPSSARHKRGCNCKKSMCLKKYCECYQANVGC 684 Query: 1289 SDGCRCEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNP 1110 S GCRCEGC+N+YGRK EY K+++++ N + L+ SFDKKLE+ A R L +LCNP Sbjct: 685 SAGCRCEGCKNVYGRKEEYGAFKEMMSRRANDDRLESSFDKKLEMVAARSDFLRPELCNP 744 Query: 1109 HNLSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMIS 930 HN++P+TPSFQCS+HG A+ + S R SPES + L YG SPV N ++H+++ Sbjct: 745 HNMTPMTPSFQCSDHGNDAANFQIPSRRYAQSPESDFSILSSYGKSPVSPKNSDSHNILP 804 Query: 929 ETSNEFMNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPG 750 E E ++ ++ E EYG+ E + F+PG +N+ + Q+ PG Sbjct: 805 EPGKEIWDMACYEHEFEYGNAEKVDQFTPG------SMNPSASSKTSGLTNVSRGQLCPG 858 Query: 749 NRNISSSSYLKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAV 570 + ++ S L+WRSSP+ P TQ G TK+ + + D+ LY +L+D+TP ILK+ AV Sbjct: 859 SDHLVSGGSLRWRSSPIPPSTQLGETKIFQGLESDNELYDILEDDTPTILKDASTPTKAV 918 Query: 569 KVSSPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTID--SEPYAD 396 K SSP +KRV RKFILQAVPSFPPLTP ID S + + Sbjct: 919 KASSPKQKRV-SPPKIQSHERGSSSSLAILKSGRKFILQAVPSFPPLTPCIDTKSSSHQN 977 Query: 395 GQDMNDS 375 D DS Sbjct: 978 SSDPQDS 984 >ref|XP_006450481.1| hypothetical protein CICLE_v10007369mg [Citrus clementina] 
gi|557553707|gb|ESR63721.1| hypothetical protein CICLE_v10007369mg [Citrus clementina] Length = 952 Score = 419 bits (1077), Expect = e-114 Identities = 227/468 (48%), Positives = 291/468 (62%), Gaps = 13/468 (2%) Frame = -3 Query: 1772 PKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNCKKTKCLK 1593 P P++ +P RK SE AD+ ++ K+ T +S+GCKRCNCKKT+CLK Sbjct: 462 PPPVDPNGTPLTMRKFNSEHADNFEEISQLSPKKKRKKSSSTVDSDGCKRCNCKKTRCLK 521 Query: 1592 LYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPAS 1413 LYCDCFAAGIYCAESCACQGC NRP+YEDTVLETRQQIESRNPLAFAPKII + E P Sbjct: 522 LYCDCFAAGIYCAESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKIIPRVTEFP-- 579 Query: 1412 SCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEY 1233 DDG RFTP+S+RHKRGCNCKKS CLKKYCECYQ+ VGCS GCRCE C+N+YGRK EY Sbjct: 580 ---DDGNRFTPSSSRHKRGCNCKKSMCLKKYCECYQAYVGCSSGCRCENCKNVYGRKEEY 636 Query: 1232 SMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTA 1053 +++VN I +G D K E ++ LH +L + NL+PLTPSFQ S+HG+ A Sbjct: 637 VGNEEMVNSR---AIPEGVSDSKPERVTNKNEFLHAELYDLCNLTPLTPSFQFSDHGKDA 693 Query: 1052 SRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYG 873 S+S + SGR SP+S + L Y S + +++ M+ E S E +++ + QE +Y Sbjct: 694 SKSRILSGRYVPSPKSDLTILSSYVKSSRTLNSSDSNEMLLEKSREIVDVDPYGQERDYS 753 Query: 872 SGETLLPFSPG-------------FDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 S + + FSP D +N+ + Q+ P + ++ S Sbjct: 754 SADMVEQFSPRCHSLVDLCDFNPLLDFPSTAMESSASSKATGWTNVSRLQLCPRSGSLLS 813 Query: 731 SSYLKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPN 552 S L+WRSSPV+P+TQ GGTK L+ D D RL +L D+TPE+LK+ L +VKVSSP+ Sbjct: 814 GSSLRWRSSPVTPLTQLGGTKSLQALDSDGRLSGILGDDTPEVLKDASTLIKSVKVSSPS 873 Query: 551 KKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 +KRV RKFIL+AVPSFPPLTP IDS+ Sbjct: 874 RKRV--SPPHGRAHEHGSSSSSMLKSGRKFILKAVPSFPPLTPCIDSK 919 >ref|XP_006483326.1| PREDICTED: CRC domain-containing protein TSO1-like isoform X1 [Citrus sinensis] Length = 952 Score = 417 bits (1073), Expect = e-114 Identities = 226/468 (48%), Positives = 290/468 
(61%), Gaps = 13/468 (2%) Frame = -3 Query: 1772 PKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNCKKTKCLK 1593 P P++ +P RK SE AD+ ++ K+ T +S+GCKRCNCKKT+CLK Sbjct: 462 PPPVDPNGTPLTMRKFNSEHADNFEEISQLSPKKKRKKSSSTVDSDGCKRCNCKKTRCLK 521 Query: 1592 LYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPAS 1413 LYCDCFAAGIYCAESCACQGC NRP+YEDTVLETRQQIESRNPLAFAPKII + E P Sbjct: 522 LYCDCFAAGIYCAESCACQGCFNRPEYEDTVLETRQQIESRNPLAFAPKIIPRVTEFP-- 579 Query: 1412 SCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEY 1233 DDG RFTP+S+RHKRGCNCKKS CLKKYCECYQ+ VGCS GCRCE C+N+YGRK EY Sbjct: 580 ---DDGNRFTPSSSRHKRGCNCKKSMCLKKYCECYQAYVGCSSGCRCENCKNVYGRKEEY 636 Query: 1232 SMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTA 1053 +++VN I +G D K E ++ LH +L + NL+PLTPSFQ S+HG+ A Sbjct: 637 VGNEEMVNSR---AIPEGVSDSKPERVTNKNEFLHAELYDLRNLTPLTPSFQFSDHGKDA 693 Query: 1052 SRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYG 873 S+S + SGR SP+S + L Y S + +++ M+ E S E +++ + QE +Y Sbjct: 694 SKSRILSGRYVPSPKSDLTILSSYVKSSRTLNSSDSNEMLLEKSREIVDVDPYGQERDYS 753 Query: 872 SGETLLPFSPG-------------FDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 S + + FSP D +N+ + Q+ P + ++ S Sbjct: 754 SADMVEQFSPRCHSLADLCDFNPLLDFPSTAMESSASSKATGWTNVSRLQLCPRSGSLLS 813 Query: 731 SSYLKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPN 552 S L+WRSSPV+P+TQ GGTK L+ D D RL +L D+TPE+LK+ +VKVSSP+ Sbjct: 814 GSSLRWRSSPVTPLTQLGGTKSLQALDSDGRLSGILGDDTPEVLKDASTPIKSVKVSSPS 873 Query: 551 KKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 +KRV RKFIL+AVPSFPPLTP IDS+ Sbjct: 874 RKRV--SPPHGRAHEHGSSSSSMLKSGRKFILKAVPSFPPLTPCIDSK 919 >gb|EOY31444.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 2 [Theobroma cacao] Length = 704 Score = 396 bits (1017), Expect = e-107 Identities = 227/523 (43%), Positives = 302/523 (57%), Gaps = 3/523 (0%) Frame = -3 Query: 1973 SLSILGKKSVSTMSCHSSKNCSISLNGAEGISVSSDDSRHDGXXXXXXXXXXXXXPYGVK 1794 S+ I G KS S MSC S +N + E + + D + G Sbjct: 189 
SMGIQGIKSASVMSCQSMENMQSCSDAFEKVLAAPQDGTLEAKACVIP---------GSA 239 Query: 1793 HFNDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNC 1614 + I+ Q + KR+ SE DS + F K+ ++ + GCKRCNC Sbjct: 240 ASESLCTMESIDCQTTLHRKRELSSEHGDSNEMFNQQSPKKKRKKSSNSTDGEGCKRCNC 299 Query: 1613 KKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQH 1434 KKTKCLKLYCDCFAAGIYCA+ C+CQGC NRP+YEDTVLETRQQIESRNPLAFAPKI+Q Sbjct: 300 KKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQP 359 Query: 1433 IAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENM 1254 + E P +S +DG TP+SARHKRGCNCK+S CLKKYCECYQ+NVGCS GCRCEGC+N+ Sbjct: 360 VTEFPVTS-REDGNWKTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 418 Query: 1253 YGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQC 1074 +G+K +Y + +++VN+ G + A + L++DLC+PH L+PLTPSFQC Sbjct: 419 FGKKEDYCVTEEIVNR--------GGGEISESTVAAKKDFLNSDLCDPHYLTPLTPSFQC 470 Query: 1073 SNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSF 894 S+HG+ A +S L S R SPES + L SP +++ M+ ETS E +++ S+ Sbjct: 471 SDHGKNAPKSRLLSRRCLPSPESDLTVL---AKSPRSPRTSDSNDMLLETSKENLDVGSY 527 Query: 893 DQELEYGSGETL---LPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISSSSY 723 + + Y + + L +P H ++L + + P + +SS Sbjct: 528 CEGINYNNADVLGDGCHHTP-LPNHPSIILGSTSSKARELTSLSRFPLGPRSGCLSSGGS 586 Query: 722 LKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKR 543 L+WRSSP++PM+ GTK L+ D D L +L+D+TPEILK+T +VK SSPN KR Sbjct: 587 LRWRSSPITPMSSLDGTKNLQGLDSD-GLSDILEDDTPEILKDTSTPNKSVKTSSPNGKR 645 Query: 542 VXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTID 414 V RKFIL+AVPSFPPLTP ID Sbjct: 646 V---SPPHNLLQLGSSSSGPLRSGRKFILKAVPSFPPLTPCID 685 >gb|EOY31443.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1 [Theobroma cacao] gi|508784189|gb|EOY31445.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1 [Theobroma cacao] gi|508784190|gb|EOY31446.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1 [Theobroma cacao] Length = 940 Score = 396 bits (1017), Expect = e-107 
Identities = 227/523 (43%), Positives = 302/523 (57%), Gaps = 3/523 (0%) Frame = -3 Query: 1973 SLSILGKKSVSTMSCHSSKNCSISLNGAEGISVSSDDSRHDGXXXXXXXXXXXXXPYGVK 1794 S+ I G KS S MSC S +N + E + + D + G Sbjct: 425 SMGIQGIKSASVMSCQSMENMQSCSDAFEKVLAAPQDGTLEAKACVIP---------GSA 475 Query: 1793 HFNDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNC 1614 + I+ Q + KR+ SE DS + F K+ ++ + GCKRCNC Sbjct: 476 ASESLCTMESIDCQTTLHRKRELSSEHGDSNEMFNQQSPKKKRKKSSNSTDGEGCKRCNC 535 Query: 1613 KKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQH 1434 KKTKCLKLYCDCFAAGIYCA+ C+CQGC NRP+YEDTVLETRQQIESRNPLAFAPKI+Q Sbjct: 536 KKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQP 595 Query: 1433 IAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENM 1254 + E P +S +DG TP+SARHKRGCNCK+S CLKKYCECYQ+NVGCS GCRCEGC+N+ Sbjct: 596 VTEFPVTS-REDGNWKTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 654 Query: 1253 YGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQC 1074 +G+K +Y + +++VN+ G + A + L++DLC+PH L+PLTPSFQC Sbjct: 655 FGKKEDYCVTEEIVNR--------GGGEISESTVAAKKDFLNSDLCDPHYLTPLTPSFQC 706 Query: 1073 SNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSF 894 S+HG+ A +S L S R SPES + L SP +++ M+ ETS E +++ S+ Sbjct: 707 SDHGKNAPKSRLLSRRCLPSPESDLTVL---AKSPRSPRTSDSNDMLLETSKENLDVGSY 763 Query: 893 DQELEYGSGETL---LPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISSSSY 723 + + Y + + L +P H ++L + + P + +SS Sbjct: 764 CEGINYNNADVLGDGCHHTP-LPNHPSIILGSTSSKARELTSLSRFPLGPRSGCLSSGGS 822 Query: 722 LKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKR 543 L+WRSSP++PM+ GTK L+ D D L +L+D+TPEILK+T +VK SSPN KR Sbjct: 823 LRWRSSPITPMSSLDGTKNLQGLDSD-GLSDILEDDTPEILKDTSTPNKSVKTSSPNGKR 881 Query: 542 VXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTID 414 V RKFIL+AVPSFPPLTP ID Sbjct: 882 V---SPPHNLLQLGSSSSGPLRSGRKFILKAVPSFPPLTPCID 921 >ref|XP_002515547.1| tso1, putative [Ricinus communis] gi|223545491|gb|EEF46996.1| tso1, putative [Ricinus communis] Length = 873 Score = 376 bits (965), Expect = e-101 
Identities = 205/426 (48%), Positives = 265/426 (62%), Gaps = 14/426 (3%) Frame = -3 Query: 1643 ESNGCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNP 1464 + +GCKRCNCKKTKCLKLYCDCFAAGIYCA+ CACQ C NRP+YEDTVLETRQQIESRNP Sbjct: 448 DGDGCKRCNCKKTKCLKLYCDCFAAGIYCADPCACQDCFNRPEYEDTVLETRQQIESRNP 507 Query: 1463 LAFAPKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSD 1284 LAFAPKI+QH E AS +D + P+ +RHKRGCNCKKS CLKKYCECYQ+NVGCS Sbjct: 508 LAFAPKIVQHAKEFAASR--EDRSSSMPSLSRHKRGCNCKKSMCLKKYCECYQANVGCSS 565 Query: 1283 GCRCEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHN 1104 CRCEGC+N YGRK EY ++++ V+ E L+G D KL + A + LL +L + HN Sbjct: 566 ECRCEGCKNGYGRKEEYGIIEETVSDRVGEERLEGRVDDKLAIVATDEDLLPAELYDLHN 625 Query: 1103 LSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENH-HMISE 927 L+P TPSFQ S+HG++ +S LSS R+ SPES ++ LP S + + I E Sbjct: 626 LTPSTPSFQHSDHGKSTPKSPLSSSRHVPSPESDISILP----SNAKSTRSSRYCDTIPE 681 Query: 926 TSNEFMNLVSFDQELEYGSGETLLPFSPG-------FDGHKRXXXXXXXXXXPNSSNL-- 774 S E +++ S DQ ++Y E + FS +D + SS Sbjct: 682 ASKETVDIDSCDQGIDYNVSEMMSQFSSKCSALADIYDPNPLTNPMTVTLGSSASSKTRD 741 Query: 773 ----LKSQMFPGNRNISSSSYLKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPE 606 + Q+ PG+ + S ++W +SP++PMT+ G K ++ D D LY++L+D+TPE Sbjct: 742 WSSGSRFQLCPGSGRLPSGRSVRWWNSPITPMTRLGENK-IQGHDSDSGLYNILEDDTPE 800 Query: 605 ILKETPPLPNAVKVSSPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLT 426 ILKE +VK SSPNKKRV RKFIL++VPSFPPLT Sbjct: 801 ILKEGSTPSASVKASSPNKKRV--SPPQNHIHDFRSSSSGGLKSGRKFILRSVPSFPPLT 858 Query: 425 PTIDSE 408 P ++SE Sbjct: 859 PCVNSE 864 >gb|EXC26038.1| hypothetical protein L484_005620 [Morus notabilis] Length = 656 Score = 360 bits (924), Expect = 1e-96 Identities = 213/477 (44%), Positives = 282/477 (59%), Gaps = 13/477 (2%) Frame = -3 Query: 1748 SPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNCKKTKCLKLYCDCFAA 1569 +P + KS S AD+ ++ KT TG +GC RCNCKKTKCLKLYCDCFAA Sbjct: 192 APDENGKSNSHNADNFEEHDQPSPRKKRKKT-STGGGDGCNRCNCKKTKCLKLYCDCFAA 250 Query: 1568 
GIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPASSCGDDGTR 1389 GIYC++ C+CQGC NRP+YE TV+ETR+QIESRNPLAFAPKI+Q I E P + G+DG R Sbjct: 251 GIYCSDPCSCQGCFNRPEYEKTVIETREQIESRNPLAFAPKIVQRIPELPPNH-GEDGYR 309 Query: 1388 FTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEYSMLKDLVN 1209 TP+SARHK+GCNCKKS CLKKYCECYQ+NVGCS GCRCEGC+N+YGRK EY+ ++ V Sbjct: 310 STPSSARHKKGCNCKKSMCLKKYCECYQANVGCSSGCRCEGCKNVYGRKEEYAAMEHGVT 369 Query: 1208 KH-DNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTASRSWLSS 1032 + L+ +FDK+LE+ + LL + NPHNL+PLTPSFQ S+HG+ A +S S Sbjct: 370 REMVGNRKLESTFDKELEIVGTKRDLLCTESINPHNLTPLTPSFQYSDHGKDAPKSRFIS 429 Query: 1031 GRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYGSG----- 867 R SP+S + SP RD+ E ++ E S E + S++ +++Y G Sbjct: 430 RRYLPSPDSAILSSNEKTKSPPRDL--EKSDVLLEVSEELPDEGSYEWQVDYNVGIADSS 487 Query: 866 ---ETLLPFSPGFDGH--KRXXXXXXXXXXPNSSNLLKSQMFPGNRNISSSSYLKWRSSP 702 +P P H + NL ++Q+ PG+ + S+S ++ SP Sbjct: 488 SSRNDSVPSVPHLTQHPDTSVPMASATSFRRDYRNLSQNQLCPGSARLLSNS-MRRHGSP 546 Query: 701 VSPMTQF-GGTKLLEVTDFDH-RLYSMLDDETPEILKETPPLPNAVKVSSPNKKRVXXXX 528 ++PMT+ TK + D +L +L+DETP VKVSSPNKKRV Sbjct: 547 LAPMTRLCEATKSRQGLDSGSVQLVDILEDETP------------VKVSSPNKKRV--SP 592 Query: 527 XXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQDMNDSQGRSRK 357 RKFIL++VPSFPPLTP IDS+ + ++ + QG S K Sbjct: 593 PHSHIFELGSGSSGKLKSGRKFILKSVPSFPPLTPCIDSKG-STSENTHKLQGDSVK 648 >ref|XP_006483327.1| PREDICTED: CRC domain-containing protein TSO1-like isoform X2 [Citrus sinensis] Length = 929 Score = 352 bits (902), Expect = 4e-94 Identities = 204/468 (43%), Positives = 268/468 (57%), Gaps = 13/468 (2%) Frame = -3 Query: 1772 PKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNCKKTKCLK 1593 P P++ +P RK SE AD+ ++ K+ T +S+GCKRCNCKKT+CLK Sbjct: 462 PPPVDPNGTPLTMRKFNSEHADNFEEISQLSPKKKRKKSSSTVDSDGCKRCNCKKTRCLK 521 Query: 1592 LYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPAS 1413 L P+YEDTVLETRQQIESRNPLAFAPKII + E P Sbjct: 522 L-----------------------PEYEDTVLETRQQIESRNPLAFAPKIIPRVTEFP-- 556 Query: 
1412 SCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEY 1233 DDG RFTP+S+RHKRGCNCKKS CLKKYCECYQ+ VGCS GCRCE C+N+YGRK EY Sbjct: 557 ---DDGNRFTPSSSRHKRGCNCKKSMCLKKYCECYQAYVGCSSGCRCENCKNVYGRKEEY 613 Query: 1232 SMLKDLVNKHDNVEILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTA 1053 +++VN I +G D K E ++ LH +L + NL+PLTPSFQ S+HG+ A Sbjct: 614 VGNEEMVNSR---AIPEGVSDSKPERVTNKNEFLHAELYDLRNLTPLTPSFQFSDHGKDA 670 Query: 1052 SRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYG 873 S+S + SGR SP+S + L Y S + +++ M+ E S E +++ + QE +Y Sbjct: 671 SKSRILSGRYVPSPKSDLTILSSYVKSSRTLNSSDSNEMLLEKSREIVDVDPYGQERDYS 730 Query: 872 SGETLLPFSPG-------------FDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 S + + FSP D +N+ + Q+ P + ++ S Sbjct: 731 SADMVEQFSPRCHSLADLCDFNPLLDFPSTAMESSASSKATGWTNVSRLQLCPRSGSLLS 790 Query: 731 SSYLKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPN 552 S L+WRSSPV+P+TQ GGTK L+ D D RL +L D+TPE+LK+ +VKVSSP+ Sbjct: 791 GSSLRWRSSPVTPLTQLGGTKSLQALDSDGRLSGILGDDTPEVLKDASTPIKSVKVSSPS 850 Query: 551 KKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 +KRV RKFIL+AVPSFPPLTP IDS+ Sbjct: 851 RKRV--SPPHGRAHEHGSSSSSMLKSGRKFILKAVPSFPPLTPCIDSK 896 >ref|NP_001236112.1| cysteine-rich polycomb-like protein [Glycine max] gi|4218187|emb|CAA09028.1| cysteine-rich polycomb-like protein [Glycine max] Length = 896 Score = 319 bits (818), Expect = 2e-84 Identities = 184/431 (42%), Positives = 248/431 (57%), Gaps = 8/431 (1%) Frame = -3 Query: 1649 TGESNGCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESR 1470 T + NGCKRCNCKK+KCLKLYCDCFAAG YC + CACQGCLNRP+Y +TV+ET+QQIESR Sbjct: 467 TADDNGCKRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLNRPEYVETVVETKQQIESR 526 Query: 1469 NPLAFAPKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGC 1290 NP+AFAPKI+Q + SS DD TP+SARHKRGCNCK+S CLKKYCECYQ+NVGC Sbjct: 527 NPIAFAPKIVQPTTD--ISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECYQANVGC 584 Query: 1289 SDGCRCEGCENMYGRKGEYSMLKDLVNKHDNVEIL----DGSFDKKLELAAPRDSLLHND 1122 S GCRCEGC+N++G+K +Y +K I+ D +F KLE+ A + Sbjct: 585 
SSGCRCEGCKNVHGKKEDYVAFGHTSSKERVSSIVEEGSDCTFHNKLEMVASK------T 638 Query: 1121 LCNPHNLSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENH 942 + + H LSP+TPS QCS+ G+ ++S + SG SPES VN L N Sbjct: 639 VYDLHCLSPITPSLQCSDQGKEDAKSRVISGNYLPSPESDVNMLASCTNYTKSSENLHGS 698 Query: 941 HMISETSNEFMNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQ 762 + +T NE + +D ++E S LL +P + + + LL Sbjct: 699 EALLDT-NEMLGNTPYDSQIEC-SDAALLQLTPLPNPEQSGTFIILICTQMSVQRLLT-- 754 Query: 761 MFPGNRNISSSSYLK----WRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKE 594 P + +SYL P++P T+ G + L+ ++ D +L+ +L++ETP+ILKE Sbjct: 755 --PDSPMDVFASYLAVLFVGVVLPLTPSTRVGEAQYLQCSESDSKLFDILENETPDILKE 812 Query: 593 TPPLPNAVKVSSPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTID 414 +VKV+SP +KRV RKFIL+AVP+FP L+P I+ Sbjct: 813 ASTPMTSVKVNSPTQKRV--SPPQSCHIGIGSSSSGGLRSGRKFILKAVPTFPSLSPCIN 870 Query: 413 SEPYADGQDMN 381 S+ D N Sbjct: 871 SKSNGDEDSCN 881 Score = 58.9 bits (141), Expect = 8e-06 Identities = 24/55 (43%), Positives = 30/55 (54%) Frame = -3 Query: 1421 PASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCEN 1257 P S C A + CNCKKSKCLK YC+C+ + C+D C C+GC N Sbjct: 454 PPSPCKKKKKTSVTADDNGCKRCNCKKSKCLKLYCDCFAAGTYCTDPCACQGCLN 508 >ref|XP_002309611.2| cysteine-rich polycomb-like family protein [Populus trichocarpa] gi|550337153|gb|EEE93134.2| cysteine-rich polycomb-like family protein [Populus trichocarpa] Length = 847 Score = 315 bits (807), Expect = 5e-83 Identities = 180/403 (44%), Positives = 239/403 (59%), Gaps = 26/403 (6%) Frame = -3 Query: 1538 QGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPASSCGDDGTRFTPASARHKR 1359 QGC NRP+YEDTVLETRQQIESRNPLAFAPKI+QH+ E A +D FTP S RHK Sbjct: 442 QGCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQHVTEFQAIDV-EDVDLFTPYSGRHKT 500 Query: 1358 GCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEYSMLKDLVNKHDNVEILDG 1179 GCNCK+S C+KKYCECYQ+NVGCS+ CRCEGC N++GRK EY+M +++V+ N E L+G Sbjct: 501 GCNCKRSMCVKKYCECYQANVGCSNACRCEGCRNIHGRKEEYAMTQEIVSNRANEESLEG 560 Query: 1178 SFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQ-------------CSNHGQTASRSWL 1038 D+KLE+ 
A + LH +L + +L+P TPSF+ +H + A +S L Sbjct: 561 MADEKLEMVA-NNKFLHTELYDLRSLTPPTPSFEYLRYYFFDEDAFVLESHEKDAPKSRL 619 Query: 1037 SSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYGSGETL 858 GR S ES + LP Y S N + + M+ +TS +++VS QEL+Y E Sbjct: 620 LPGRYVLSSESDFSMLPSYAKSVSSPSNSQGNDMLPKTSIT-LDIVSHGQELDYNITEIT 678 Query: 857 LPFSPGFD------GHKRXXXXXXXXXXPNSS-------NLLKSQMFPGNRNISSSSYLK 717 FSP FD H ++S N+ + +++PG+ +SS S L Sbjct: 679 GQFSPQFDELADFSDHTPLPNPSSIMMASSASSKTQDKANVSQPRVYPGSARLSSGSSLH 738 Query: 716 WRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKRVX 537 W SSP++PMT+ G TK + D D LY +L+D+TPEILK++ +VK SSPNKKRV Sbjct: 739 WYSSPITPMTRLGETK-NQAQDSDCGLYDILEDDTPEILKDSSAPITSVKASSPNKKRVS 797 Query: 536 XXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 RKFIL++VPSFPPLTP +DS+ Sbjct: 798 PPHSHIREFQSSSSAGLKSGRGRKFILKSVPSFPPLTPCLDSK 840 >gb|EMJ26483.1| hypothetical protein PRUPE_ppa001375mg [Prunus persica] Length = 842 Score = 315 bits (807), Expect = 5e-83 Identities = 178/398 (44%), Positives = 236/398 (59%), Gaps = 16/398 (4%) Frame = -3 Query: 1553 ESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIAEPPASSCGDDGTRFTPAS 1374 E C C N DYEDTVLETRQ IESRNPLAFAPKI+QH ++ +FTP+S Sbjct: 431 EECKQSRCFNITDYEDTVLETRQHIESRNPLAFAPKIVQH----------EEEIQFTPSS 480 Query: 1373 ARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYGRKGEYSMLKDLVNKHDNV 1194 ARHKRGCNCKKS CLKKYCECYQ+NVGCS GCRC+GC+N+YGRKGE+ + KD ++ Sbjct: 481 ARHKRGCNCKKSMCLKKYCECYQANVGCSSGCRCDGCKNVYGRKGEHGVGKDNISDKAGK 540 Query: 1193 EILDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSFQCSNHGQTASRSWLSSGRNFAS 1014 E ++ +F +KLE+ A + +L +L + HNL+PL PSFQCS+H +S S Sbjct: 541 ERIESTFHEKLEMVATKKDILSTELYDSHNLTPLAPSFQCSDHANNVPKSPCLPTSYLPS 600 Query: 1013 PESGVNFLPPY---GMSPVRDVNPENHHMISETSNEFMNLVSFDQELEYGSGETLLPFSP 843 PES + + Y SP+R + E+ ++ ETS E +L S++ ++Y + + FSP Sbjct: 601 PESDLTIISSYEKSTRSPLR--HSESSDILLETSKELSDLGSYNWRVDYDNIGIVDTFSP 658 Query: 842 GFDG-----HKRXXXXXXXXXXPNS--------SNLLKSQMFPGNRNISSSSYLKWRSSP 702 D H +S +N + Q+ PG+ +SS S L RSSP Sbjct: 659 
RCDAAPTTCHITPMSDLCSMAMASSTSSKTSDWTNASQVQLCPGSHGLSSDSSLHRRSSP 718 Query: 701 VSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKRVXXXXXX 522 V+PMT+ GGTK + DF++ LY +L D+TPEILK++ ++KVSSPNKKRV Sbjct: 719 VTPMTRLGGTKSFQGLDFENGLYDILQDDTPEILKDSSTPIRSLKVSSPNKKRV--SPPH 776 Query: 521 XXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 RKFIL+AVPSFPPLTP I S+ Sbjct: 777 SHNHELGASSSGALRSGRKFILKAVPSFPPLTPCIGSK 814 >gb|AAU14844.1| cysteine-rich polycomb-like protein 1 [Lotus japonicus] Length = 821 Score = 313 bits (803), Expect = 1e-82 Identities = 191/469 (40%), Positives = 258/469 (55%), Gaps = 7/469 (1%) Frame = -3 Query: 1793 HFNDSLEPKPIELQPSPG-DKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCN 1617 +F+ SL+ +PI L P+ G DKRK A + + KT T + NGCKRCN Sbjct: 378 YFSPSLK-EPIALYPASGHDKRKLSPTDAGNSEGLDQHTPGKKKKKTSSTADGNGCKRCN 436 Query: 1616 CKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQ 1437 CKK+KCLKLYCDCFAAG++C + C+CQ C N+P+Y + VLETRQQIESRNPLAFAPKI++ Sbjct: 437 CKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVK 496 Query: 1436 HIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCEN 1257 P++ +D TP+SARH RGCNCK+S CLKKYCECYQSNVGCS GCRCEGC+N Sbjct: 497 SATNAPSNM--EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGCKN 554 Query: 1256 MYGRKGEYSMLKDLVNKH---DNVEI-LDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLT 1089 +YG+K +Y ++K NVE D + K E+ A R+ L + + H+LSP+T Sbjct: 555 VYGKKEDYVAPDHALSKERVSSNVEKGSDSTMLNKPEMVASREDLFQKEFYDQHHLSPIT 614 Query: 1088 PSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHM-ISETSNEF 912 PS QCS+ G + S R+ +S + SP N + + ++ SN Sbjct: 615 PSLQCSDQG----KDMFPSHRSLMLTDSCDSKSYENVQSPAPPCNSASCILQLTPLSNP- 669 Query: 911 MNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 + SG +P P ++ S++ G S Sbjct: 670 ----------DSTSGAPSIPAKP------------------VGTSAPSSRVSHGCVRQLS 701 Query: 731 SSYLKWRSSPVSPM-TQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSP 555 L+WRSSPV+P T G + L+ + D RL+ +++DETP ILKE+ VK +SP Sbjct: 702 GGSLRWRSSPVTPRNTNLGEAQHLQGLESDSRLFDIVEDETPAILKESSTPTKTVKANSP 761 Query: 554 
NKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 +KRV RKFIL++VPSFPPLTP +DS+ Sbjct: 762 IQKRV------SPPRVIGSSSSGGLRTGRKFILKSVPSFPPLTPCMDSK 804 >emb|CAF02297.1| cysteine-rich polycomb-like protein [Lotus japonicus] gi|40241253|emb|CAF02298.1| cysteine-rich polycomb-like protein [Lotus japonicus] Length = 897 Score = 313 bits (803), Expect = 1e-82 Identities = 191/469 (40%), Positives = 258/469 (55%), Gaps = 7/469 (1%) Frame = -3 Query: 1793 HFNDSLEPKPIELQPSPG-DKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCN 1617 +F+ SL+ +PI L P+ G DKRK A + + KT T + NGCKRCN Sbjct: 454 YFSPSLK-EPIALYPASGHDKRKLSPTDAGNSEGLDQHTPGKKKKKTSSTADGNGCKRCN 512 Query: 1616 CKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQ 1437 CKK+KCLKLYCDCFAAG++C + C+CQ C N+P+Y + VLETRQQIESRNPLAFAPKI++ Sbjct: 513 CKKSKCLKLYCDCFAAGVFCLDPCSCQDCFNKPEYGEKVLETRQQIESRNPLAFAPKIVK 572 Query: 1436 HIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCEN 1257 P++ +D TP+SARH RGCNCK+S CLKKYCECYQSNVGCS GCRCEGC+N Sbjct: 573 SATNAPSNM--EDVNLTTPSSARHTRGCNCKRSMCLKKYCECYQSNVGCSSGCRCEGCKN 630 Query: 1256 MYGRKGEYSMLKDLVNKH---DNVEI-LDGSFDKKLELAAPRDSLLHNDLCNPHNLSPLT 1089 +YG+K +Y ++K NVE D + K E+ A R+ L + + H+LSP+T Sbjct: 631 VYGKKEDYVAPDHALSKERVSSNVEKGSDSTMLNKPEMVASREDLFQKEFYDQHHLSPIT 690 Query: 1088 PSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHM-ISETSNEF 912 PS QCS+ G + S R+ +S + SP N + + ++ SN Sbjct: 691 PSLQCSDQG----KDMFPSHRSLMLTDSCDSKSYENVQSPAPPCNSASCILQLTPLSNP- 745 Query: 911 MNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNISS 732 + SG +P P ++ S++ G S Sbjct: 746 ----------DSTSGAPSIPAKP------------------VGTSAPSSRVSHGCVRQLS 777 Query: 731 SSYLKWRSSPVSPM-TQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSP 555 L+WRSSPV+P T G + L+ + D RL+ +++DETP ILKE+ VK +SP Sbjct: 778 GGSLRWRSSPVTPRNTNLGEAQHLQGLESDSRLFDIVEDETPAILKESSTPTKTVKANSP 837 Query: 554 NKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSE 408 +KRV RKFIL++VPSFPPLTP +DS+ Sbjct: 838 IQKRV------SPPRVIGSSSSGGLRTGRKFILKSVPSFPPLTPCMDSK 880 >ref|XP_006596108.1| 
PREDICTED: LOW QUALITY PROTEIN: CRC domain-containing protein TSO1-like [Glycine max] Length = 877 Score = 302 bits (773), Expect = 4e-79 Identities = 178/472 (37%), Positives = 249/472 (52%), Gaps = 5/472 (1%) Frame = -3 Query: 1787 NDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNCKK 1608 +D+ + + SP K+K +SE AD +GCK CNCKK Sbjct: 436 DDTGNSEDFNIPSSPCQKKKKISETAD----------------------DDGCKHCNCKK 473 Query: 1607 TKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQHIA 1428 ++CLKLYC CFAAG YC + CACQGCLNRP+Y +TV+ET+Q IESR+P AF PKI+ + Sbjct: 474 SRCLKLYCHCFAAGTYCTDPCACQGCLNRPEYAETVVETKQLIESRDPSAFDPKIV--LP 531 Query: 1427 EPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENMYG 1248 SS DD TP+SARHKRGCNCK+S CLKKYCECYQ+NVGCS GCRCEGC+N+YG Sbjct: 532 TTDISSHMDDENLTTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSSGCRCEGCKNVYG 591 Query: 1247 RKGEYSMLKDLVNKHDNVEIL----DGSFDKKLELAAPRDSLLHNDLCNPHNLSPLTPSF 1080 +K +Y + +K I+ D +F KKLE A + + H LSP+TPS Sbjct: 592 KKEDYVAFEHTSSKERESSIVEEGSDYTFHKKLERVASK------TVYGLHCLSPITPSL 645 Query: 1079 QCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISETSNEFMNLV 900 QCS G+ A++S + SG SPES VN N + + T NE + Sbjct: 646 QCSEQGKEAAKSIIISGNYLPSPESDVNMFASCANYTKSSENLHSSQALLGT-NEMLGST 704 Query: 899 SFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGNRNIS-SSSY 723 +D ++E S LL +P + S + + P + S + Sbjct: 705 PYDSQIEC-SHAALLQLTPPLSNPELCGTSSFSSIXQMSGEIFLNPNPPTDVLASYPVAL 763 Query: 722 LKWRSSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAVKVSSPNKKR 543 P++P + G + + ++ D RL+ +L++ETP++LKE +VK++SP +KR Sbjct: 764 FVGVVXPLTPSNRVGEAQYFQCSESDSRLFDILENETPDVLKEASTPMTSVKINSPTQKR 823 Query: 542 VXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQD 387 V RKFI ++VPSFP L+P ++S+ D + Sbjct: 824 V--SPPQSRHFEIGSSSSGGLRSDRKFIFESVPSFPSLSPCVNSKSNGDDDE 873 >ref|XP_004498661.1| PREDICTED: protein tesmin/TSO1-like CXC 2-like isoform X2 [Cicer arietinum] Length = 709 Score = 296 bits (757), Expect = 3e-77 Identities = 201/551 (36%), Positives = 286/551 (51%), Gaps = 8/551 (1%) Frame = -3 Query: 
1985 AQRGSLSILGKKSVSTMSCHSSKNCSIS--LNGAEGISVSSDDSRHDGXXXXXXXXXXXX 1812 A R S + G KS T+S H +N + S L+ +G SV D+R++ Sbjct: 226 AVRLSNGLQGIKSKPTISLHKVENVTQSSILSNIDGQSVI--DARNEIHETDASVAADSF 283 Query: 1811 XPYGVKHFNDSLEPKPIELQPSPG-DKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESN 1635 S+ +PI L P DKR+ ++ ++F KT+ T + Sbjct: 284 IS------ESSILTEPIALYPENAHDKRRLSPTDTENTEEFNHPSTSKKKKKTI-TDDGG 336 Query: 1634 GCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAF 1455 G K C+CKK+KCLKLYCDCF AGIYC E CACQ C NR ++E+ V+ET+Q IESRNP AF Sbjct: 337 GSKGCHCKKSKCLKLYCDCFGAGIYCGEGCACQSCGNRIEFEEKVVETKQHIESRNPNAF 396 Query: 1454 APKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCR 1275 APKI+Q +A+ P ++ +D + TPASARHKRGCNCK+SKC KKYCEC+Q+NVGCS GCR Sbjct: 397 APKIVQCVADVPLNNM-EDVSMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSSGCR 455 Query: 1274 CEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKL----ELAAPRDSLLHNDLCNPH 1107 C+GC+N++G+K +Y ++ + I++ D KL ++ R LL P+ Sbjct: 456 CDGCKNVFGKKEDYVAIEHTSSIETESSIIEEGLDDKLYNRQKMVVSRTGLLR----APN 511 Query: 1106 NLSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISE 927 +LSPLTPS QCS+ G+ A++S L+S S + + L + + P +S Sbjct: 512 HLSPLTPSLQCSDQGKQAAKSRLASANWTKSSKKSRSSLAHTARNDSQKNAPP---CVSL 568 Query: 926 TSNEFMNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGN 747 NE+ ++V P+ P SN+ G Sbjct: 569 KENEWTDIV---------------PYQP--------------------SNVC------GI 587 Query: 746 RNISSSSYLKWR-SSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAV 570 R +S S L+W SSP++P FG + + +L+ +L+DETP++LKET +V Sbjct: 588 RQLSGGS-LRWHSSSPITPSANFG-------DESNGKLFDILEDETPDVLKETSTPIKSV 639 Query: 569 KVSSPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQ 390 K +SP KRV RKFILQ+VPSFPPLTP DS+ + Sbjct: 640 KANSPIHKRV-SPPQSHLLRIGSSSSGGGLRSGRKFILQSVPSFPPLTPCADSKVNCNSN 698 Query: 389 DMNDSQGRSRK 357 + DS ++K Sbjct: 699 E--DSSNNAKK 707 >ref|XP_004498660.1| PREDICTED: protein tesmin/TSO1-like CXC 2-like isoform X1 [Cicer arietinum] Length = 869 Score = 296 bits (757), Expect = 3e-77 Identities = 201/551 (36%), Positives = 
286/551 (51%), Gaps = 8/551 (1%) Frame = -3 Query: 1985 AQRGSLSILGKKSVSTMSCHSSKNCSIS--LNGAEGISVSSDDSRHDGXXXXXXXXXXXX 1812 A R S + G KS T+S H +N + S L+ +G SV D+R++ Sbjct: 386 AVRLSNGLQGIKSKPTISLHKVENVTQSSILSNIDGQSVI--DARNEIHETDASVAADSF 443 Query: 1811 XPYGVKHFNDSLEPKPIELQPSPG-DKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESN 1635 S+ +PI L P DKR+ ++ ++F KT+ T + Sbjct: 444 IS------ESSILTEPIALYPENAHDKRRLSPTDTENTEEFNHPSTSKKKKKTI-TDDGG 496 Query: 1634 GCKRCNCKKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAF 1455 G K C+CKK+KCLKLYCDCF AGIYC E CACQ C NR ++E+ V+ET+Q IESRNP AF Sbjct: 497 GSKGCHCKKSKCLKLYCDCFGAGIYCGEGCACQSCGNRIEFEEKVVETKQHIESRNPNAF 556 Query: 1454 APKIIQHIAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCR 1275 APKI+Q +A+ P ++ +D + TPASARHKRGCNCK+SKC KKYCEC+Q+NVGCS GCR Sbjct: 557 APKIVQCVADVPLNNM-EDVSMTTPASARHKRGCNCKRSKCTKKYCECFQANVGCSSGCR 615 Query: 1274 CEGCENMYGRKGEYSMLKDLVNKHDNVEILDGSFDKKL----ELAAPRDSLLHNDLCNPH 1107 C+GC+N++G+K +Y ++ + I++ D KL ++ R LL P+ Sbjct: 616 CDGCKNVFGKKEDYVAIEHTSSIETESSIIEEGLDDKLYNRQKMVVSRTGLLR----APN 671 Query: 1106 NLSPLTPSFQCSNHGQTASRSWLSSGRNFASPESGVNFLPPYGMSPVRDVNPENHHMISE 927 +LSPLTPS QCS+ G+ A++S L+S S + + L + + P +S Sbjct: 672 HLSPLTPSLQCSDQGKQAAKSRLASANWTKSSKKSRSSLAHTARNDSQKNAPP---CVSL 728 Query: 926 TSNEFMNLVSFDQELEYGSGETLLPFSPGFDGHKRXXXXXXXXXXPNSSNLLKSQMFPGN 747 NE+ ++V P+ P SN+ G Sbjct: 729 KENEWTDIV---------------PYQP--------------------SNVC------GI 747 Query: 746 RNISSSSYLKWR-SSPVSPMTQFGGTKLLEVTDFDHRLYSMLDDETPEILKETPPLPNAV 570 R +S S L+W SSP++P FG + + +L+ +L+DETP++LKET +V Sbjct: 748 RQLSGGS-LRWHSSSPITPSANFG-------DESNGKLFDILEDETPDVLKETSTPIKSV 799 Query: 569 KVSSPNKKRVXXXXXXXXXXXXXXXXXXXXXXXRKFILQAVPSFPPLTPTIDSEPYADGQ 390 K +SP KRV RKFILQ+VPSFPPLTP DS+ + Sbjct: 800 KANSPIHKRV-SPPQSHLLRIGSSSSGGGLRSGRKFILQSVPSFPPLTPCADSKVNCNSN 858 Query: 389 DMNDSQGRSRK 357 + DS ++K Sbjct: 859 E--DSSNNAKK 867 >gb|EOY31447.1| Tesmin/TSO1-like CXC domain-containing protein, putative isoform 5 [Theobroma cacao] gi|508784193|gb|EOY31449.1| 
Tesmin/TSO1-like CXC domain-containing protein, putative isoform 5 [Theobroma cacao] Length = 667 Score = 255 bits (652), Expect = 4e-65 Identities = 130/248 (52%), Positives = 160/248 (64%), Gaps = 1/248 (0%) Frame = -3 Query: 1973 SLSILGKKSVSTMSCHSSKNCSISLNGAEGISVSSDDSRHDGXXXXXXXXXXXXXPYGVK 1794 S+ I G KS S MSC S +N + E + + D + G Sbjct: 425 SMGIQGIKSASVMSCQSMENMQSCSDAFEKVLAAPQDGTLEAKACVIP---------GSA 475 Query: 1793 HFNDSLEPKPIELQPSPGDKRKSVSEIADSVDDFXXXXXXXXXXKTLDTGESNGCKRCNC 1614 + I+ Q + KR+ SE DS + F K+ ++ + GCKRCNC Sbjct: 476 ASESLCTMESIDCQTTLHRKRELSSEHGDSNEMFNQQSPKKKRKKSSNSTDGEGCKRCNC 535 Query: 1613 KKTKCLKLYCDCFAAGIYCAESCACQGCLNRPDYEDTVLETRQQIESRNPLAFAPKIIQH 1434 KKTKCLKLYCDCFAAGIYCA+ C+CQGC NRP+YEDTVLETRQQIESRNPLAFAPKI+Q Sbjct: 536 KKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPEYEDTVLETRQQIESRNPLAFAPKIVQP 595 Query: 1433 IAEPPASSCGDDGTRFTPASARHKRGCNCKKSKCLKKYCECYQSNVGCSDGCRCEGCENM 1254 + E P +S +DG TP+SARHKRGCNCK+S CLKKYCECYQ+NVGCS GCRCEGC+N+ Sbjct: 596 VTEFPVTS-REDGNWKTPSSARHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 654 Query: 1253 YGRK-GEY 1233 +G+K GE+ Sbjct: 655 FGKKEGEF 662