BLASTX nr result
ID: Forsythia21_contig00010401
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia21_contig00010401 (1959 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166... 717 0.0 ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583... 652 0.0 ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260... 650 0.0 ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236... 644 0.0 ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260... 644 0.0 ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116... 640 e-180 ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260... 626 e-176 ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236... 625 e-176 emb|CDP05166.1| unnamed protein product [Coffea canephora] 610 e-171 ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160... 604 e-170 ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648... 560 e-156 ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333... 559 e-156 ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun... 559 e-156 ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot... 555 e-155 gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum] 555 e-155 ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765... 553 e-154 ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot... 550 e-153 gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium r... 547 e-152 ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr... 547 e-152 ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm... 546 e-152 >ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum] Length = 479 Score = 717 bits (1851), Expect = 0.0 Identities = 351/479 (73%), Positives = 392/479 (81%), Gaps = 9/479 (1%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSVHNS AESRVQPSTVQKRRWGSCWS+YWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 EP GV AP++ENRN S+TIVLPFI SFLQSDPPSATQSPAGL+SL +LSV+ Sbjct: 61 SEPAAAGVAAPISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSVH 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 SPGGTAPIF +GPYAHETQL+SPPVFS FTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 ANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLL+SSLARNRRN GTNLK+SLSQY+FQPYQYPGSPGG +KSPGSA+STSGTSSPFPDK Sbjct: 181 QLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFPDKH 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFR GEAPKFLGYEHF N KWGSRVGSGSLTP GWGSRLGSG LTPNGGLSRLGSGT Sbjct: 241 PIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSGT 300 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 +TPNGGEPPS+D LLE+QI EVASLANSD +S++ + VVD+RVSFEL GEDIPTC + E Sbjct: 301 LTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVTE 360 Query: 690 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVP---------DMARR 538 S PS+K A A G N KDL KN D CRE++DGET +EVP ++ ++ Sbjct: 361 SAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEVPEIPLDGEGGELHQK 420 Query: 537 HRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 RT+SLGSSKDFNFNNAK E+ EKS+I+CEWWTN+KV +KELGPR W+FFPMLQ G S Sbjct: 421 QRTVSLGSSKDFNFNNAKGEIPEKSSINCEWWTNEKVVRKELGPRNSWSFFPMLQSGAS 479 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 652 bits (1683), Expect = 0.0 Identities = 331/474 (69%), Positives = 367/474 (77%), Gaps = 4/474 (0%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSV N+ AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLLSL +LS+N Sbjct: 61 PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSIN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTA IFA+GPYAHETQL+SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 VTPNGGEPPS+DSYLLE QISEVASLANSDN S+ GE V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKE 360 Query: 690 SVPSY--KTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPD--MARRHRTIS 523 V S+ +T V N M MA+ + Y SE + R+HR I+ Sbjct: 361 PVMSHSQQTLPMDVSNLLANEMKSGSSMAEE----KTYGSPRKASESGEDQCHRKHRNIT 416 Query: 522 LGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 GSSKDF+F+N K EVLEK +IDCEWWT+DK KE G + +WTFFP+LQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum lycopersicum] Length = 470 Score = 650 bits (1677), Expect = 0.0 Identities = 329/474 (69%), Positives = 366/474 (77%), Gaps = 4/474 (0%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSV N+ AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLLSL ALS+N Sbjct: 61 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTA IFA+GPYAHETQL+SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL ED+P+C KE Sbjct: 301 VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 360 Query: 690 SVPSYK--TALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE--VPDMARRHRTIS 523 V S+ T V + M MA+ + Y SE + R+HR I+ Sbjct: 361 PVMSHSQPTLPMDVSNLLASEMRSGSSMAEE----KTYGSPRKASESGEDECHRKHRNIT 416 Query: 522 LGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 GSSKDF+F+N K EVLEK +IDCEWWT+DK KE G + +WTFFP+LQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 >ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana sylvestris] Length = 470 Score = 644 bits (1661), Expect = 0.0 Identities = 326/474 (68%), Positives = 370/474 (78%), Gaps = 4/474 (0%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSV N+ AESRVQPS+VQKRRWGSCWSLYWCFGS+K +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG PV EN N SATIV+PFI SFL SDPPSATQSPAGLLSL + S+N Sbjct: 61 PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTA IFA+GPYAHETQL+SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 VTPNGGEPPS+D YLLE+QISEVASLANSDN S+ E V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360 Query: 690 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG--ETVSEVPD--MARRHRTIS 523 V S+ + +T+P P N++ M ++ E +DG E SE D R+HR I+ Sbjct: 361 PVMSH--SQQTLPMDVPAPSNKE--MRSSSSIVEEKTDGLPEKASERGDDQCHRKHRNIT 416 Query: 522 LGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 GSSKDF+F+N K EVLEK ++DCEWWT+DK KE + +WTFFP+LQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEKHSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470 >ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum lycopersicum] Length = 469 Score = 644 bits (1660), Expect = 0.0 Identities = 328/474 (69%), Positives = 365/474 (77%), Gaps = 4/474 (0%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSV N+ AESRVQPSTVQ RRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLLSL ALS+N Sbjct: 60 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 119 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTA IFA+GPYAHETQL+SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 120 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 179 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 180 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 239 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 240 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 299 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL ED+P+C KE Sbjct: 300 VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 359 Query: 690 SVPSYK--TALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE--VPDMARRHRTIS 523 V S+ T V + M MA+ + Y SE + R+HR I+ Sbjct: 360 PVMSHSQPTLPMDVSNLLASEMRSGSSMAEE----KTYGSPRKASESGEDECHRKHRNIT 415 Query: 522 LGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 GSSKDF+F+N K EVLEK +IDCEWWT+DK KE G + +WTFFP+LQPGVS Sbjct: 416 FGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 469 >ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana tomentosiformis] Length = 470 Score = 640 bits (1650), Expect = e-180 Identities = 322/474 (67%), Positives = 370/474 (78%), Gaps = 4/474 (0%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSV N+ AESRVQPS++QK+RWGSCWSLYWCFGS+K +KRIGHA+LV Sbjct: 1 MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG PV EN N SATIV+PFI SFL SDPPSATQSPAGLLSL + S+N Sbjct: 61 PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTA IFA+GPYAHETQL+SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFPGKC 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 VTPNGGEPPS+D YLLE+QISEVASLANSDN S+ E V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360 Query: 690 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG--ETVSEVPD--MARRHRTIS 523 V S+ + +T+P P N++ M ++ E +DG E SE D R+HR I+ Sbjct: 361 PVMSH--SQQTLPMDVPAPSNKE--MRSSSSNVEEKTDGLPEKASERGDDQCHRKHRNIT 416 Query: 522 LGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 GSSKDF+F+N K EVLE+ ++DCEWWT+DK KE + +WTFFP+LQPGVS Sbjct: 417 FGSSKDFDFDNVKIEVLEEDSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470 >ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum lycopersicum] Length = 476 Score = 626 bits (1615), Expect = e-176 Identities = 312/443 (70%), Positives = 349/443 (78%), Gaps = 4/443 (0%) Frame = -1 Query: 1677 KRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXX 1498 +RRWGSCWSLYWCFGSHK +KRIGHAVLVPEP PG PV EN NHSATIV+PFI Sbjct: 38 ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSATIVIPFIAPPS 97 Query: 1497 XXXSFLQSDPPSATQSPAGLLSLTALSVNVYSPGGTAPIFAVGPYAHETQLISPPVFSAF 1318 SFL SDPPSATQSPAGLLSL ALS+N YSPGGTA IFA+GPYAHETQL+SPPVFS F Sbjct: 98 SPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTF 157 Query: 1317 TTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ 1138 TTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F PYQ Sbjct: 158 TTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ 217 Query: 1137 YPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSG 958 PGSPG L SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVGSG Sbjct: 218 DPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSG 277 Query: 957 SLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDN 778 S+TP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+DSYLLE+QISEVASLANSDN Sbjct: 278 SVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDN 337 Query: 777 ESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSYK--TALKTVPEAAPNGMNQKDLMAKN 604 S+ GE V+D+RVSFEL ED+P+C KE V S+ T V + M MA+ Sbjct: 338 GSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEE 397 Query: 603 TDCCREYSDGETVSE--VPDMARRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDK 430 + Y SE + R+HR I+ GSSKDF+F+N K EVLEK +IDCEWWT+DK Sbjct: 398 ----KTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDK 453 Query: 429 VPKKELGPRKDWTFFPMLQPGVS 361 KE G + +WTFFP+LQPGVS Sbjct: 454 AAVKESGIQNNWTFFPVLQPGVS 476 >ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana sylvestris] Length = 442 Score = 625 bits (1613), Expect = e-176 Identities = 312/445 (70%), Positives = 355/445 (79%), Gaps = 4/445 (0%) Frame = -1 Query: 1683 VQKRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXX 1504 +QKRRWGSCWSLYWCFGS+K +KRIGHAVLVPEP PG PV EN N SATIV+PFI Sbjct: 2 MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVPVTENPNRSATIVIPFIAP 61 Query: 1503 XXXXXSFLQSDPPSATQSPAGLLSLTALSVNVYSPGGTAPIFAVGPYAHETQLISPPVFS 1324 SFL SDPPSATQSPAGLLSL + S+N YSPGGTA IFA+GPYAHETQL+SPPVFS Sbjct: 62 PSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPPVFS 121 Query: 1323 AFTTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQP 1144 FTTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F P Sbjct: 122 TFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVP 181 Query: 1143 YQYPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVG 964 YQ PGSPG L SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVG Sbjct: 182 YQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVG 241 Query: 963 SGSLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANS 784 SGSLTP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+D YLLE+QISEVASLANS Sbjct: 242 SGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASLANS 301 Query: 783 DNESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKN 604 DN S+ E V+D+RVSFEL GED+P+C KE V S+ + +T+P P N++ M + Sbjct: 302 DNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSH--SQQTLPMDVPAPSNKE--MRSS 357 Query: 603 TDCCREYSDG--ETVSEVPD--MARRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTN 436 + E +DG E SE D R+HR I+ GSSKDF+F+N K EVLEK ++DCEWWT+ Sbjct: 358 SSIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWWTS 417 Query: 435 DKVPKKELGPRKDWTFFPMLQPGVS 361 DK KE + +WTFFP+LQPGVS Sbjct: 418 DKATGKESSIQNNWTFFPVLQPGVS 442 >emb|CDP05166.1| unnamed protein product [Coffea canephora] Length = 452 Score = 610 bits (1572), Expect = e-171 Identities = 312/470 (66%), Positives = 353/470 (75%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSVHNS AESRVQP TVQKRRWGSCWS YWCFGS K +KRIG+AVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEPT PG PV +N NHSATIV+PFI SFLQSDPPSATQSPA L L + SVN Sbjct: 61 PEPTVPGSAVPVPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSVN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSP G A IFA+GPYAHETQL+SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 TYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLL SSL NRR++GT++KF LSQY+FQPYQ PGSPG L SPGSAIS SGTSSPFP+K Sbjct: 181 QLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFPEKR 240 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 871 PI+EFR GEAPKFLGYE F RKWGSRVGSGSLTP GWGSRLGSG+LTPNGG+SRLGSGT Sbjct: 241 PIIEFRIGEAPKFLGYELF-TRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSGT 299 Query: 870 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 +TPNGGEP ++DSYLLE+QISEVASLANSDN + + E ++D+RVSFEL E +P C ++E Sbjct: 300 LTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNC-VEE 358 Query: 690 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMARRHRTISLGSS 511 + ++ N R+ DG+ E + +RT SLGSS Sbjct: 359 EMKGQNFCEDCTGDSIHN-------------ITRKALDGQ---EGKQCLKNNRTFSLGSS 402 Query: 510 KDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 KDFNF+N K+E +KSTIDCEWWTN+ KELG + WTFFPMLQPGVS Sbjct: 403 KDFNFDNMKQESPDKSTIDCEWWTNETAAAKELGSKNKWTFFPMLQPGVS 452 >ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum] Length = 466 Score = 604 bits (1557), Expect = e-170 Identities = 314/484 (64%), Positives = 359/484 (74%), Gaps = 14/484 (2%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M+SVHNS AE+R QPSTVQKRRWGSCWSLYWCFGS+K +KRIGHAVL+ Sbjct: 1 MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 EPT APV EN N SAT++LPFI SFLQSDPPSATQS AGL+SL ALSV+ Sbjct: 61 SEPTAQVAVAPVVENLNRSATLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAALSVH 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGGTAPIF +GPYA+ETQL+SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 TYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 1051 QLL+SSLARNRRN+G N+K SLSQY+F Y+ SPGSA+S+SGTSSPFPDK Sbjct: 181 QLLSSSLARNRRNSG-NMKSSLSQYEFLAYE----------SPGSALSSSGTSSPFPDKW 229 Query: 1050 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWG--------------SRLGSGT 913 P+VE R+GEAP F+GYEHF N KWGSRVGSGSLTP G G SRLGSG Sbjct: 230 PVVEIRRGEAPIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLGSGA 289 Query: 912 LTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSF 733 LTPNGGLSRLGSG +TPNGGEPPS+D LL + ISEV SLANS NE ++ + VVD+RVSF Sbjct: 290 LTPNGGLSRLGSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHRVSF 349 Query: 732 ELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVP 553 EL GEDIPTC + E+VPS K + + EA N D MAK ++ R+ S+GET+ E Sbjct: 350 ELSGEDIPTCVVSETVPSPKMESRDLQEATAEVTNHSDFMAKVSETYRKLSNGETMHE-- 407 Query: 552 DMARRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQ 373 + TISLGSS+DFNFNNA E+ + +DCEWWTND V KEL PR +WTFFPMLQ Sbjct: 408 -----NHTISLGSSRDFNFNNADGELSARIAVDCEWWTNDDVVGKELAPRNNWTFFPMLQ 462 Query: 372 PGVS 361 GVS Sbjct: 463 SGVS 466 >ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas] gi|643706116|gb|KDP22248.1| hypothetical protein JCGZ_26079 [Jatropha curcas] Length = 498 Score = 560 bits (1444), Expect = e-156 Identities = 299/502 (59%), Positives = 357/502 (71%), Gaps = 32/502 (6%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV+NS AESRVQP+ VQKRRWG CWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP P AEN+ HS +PFI SFLQSDPPS TQSPAGLLSLTALSV+ Sbjct: 61 PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALSVS 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGG A IFA+GPYAHETQL++PPVFSAFTTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+G N KF+LS Y+FQ Y YPGSPGG+L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPDR 240 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 928 P++EFR GEAPK LG+EHF RKWGSR+GSG+LTP G GSR Sbjct: 241 HPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGSR 300 Query: 927 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 754 LGSG++TP+G GL SRLGSG++TP+ P SQD LLE+QISEVASLANS+N SK+ E + Sbjct: 301 LGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDENI 360 Query: 753 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEA-APNGMNQKDLMAKNTDCCREYSD 577 VD+RVSFEL GE++ C +S+ S +T + ++ A +N ++++ + DC Sbjct: 361 VDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLH---I 417 Query: 576 GETVSEVPDMA----------RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKV 427 GET +E P+ R+HR+I+LGS K+FNF+N+K EV +K TI EWW N+ + Sbjct: 418 GETSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETI 476 Query: 426 PKKELGPRKDWTFFPMLQPGVS 361 KE P +WTFFP+LQP VS Sbjct: 477 AGKEARPANNWTFFPLLQPEVS 498 >ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333182 [Prunus mume] Length = 499 Score = 559 bits (1441), Expect = e-156 Identities = 292/499 (58%), Positives = 350/499 (70%), Gaps = 30/499 (6%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S AE+R QP+TV KRRWGSCWSLYWCFGSHK NKRIGHAVLV Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARAQPTTVPKRRWGSCWSLYWCFGSHK-NKRIGHAVLV 59 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG +N+ S IV+PFI SFL SDPPSATQSPAG LSL +LS N Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGG A IF++GPYA+ETQL+SPPVFS F TEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 120 AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+ Sbjct: 180 QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 892 P++EF GEAPK G++HF RKWGSR+GSGSLTP G GSRLGSG+LTP+G Sbjct: 240 HPVLEFHMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299 Query: 891 --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 754 SRLGSG +TP+G P S+DS+LLE+QISEVASLANS++ + E V Sbjct: 300 LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359 Query: 753 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 574 D+RVSFEL GED+ C +++ S +TA + A + +++D ++ ++ E+S Sbjct: 360 FDHRVSFELTGEDVACCLANKAMASNRTASGSSKVIASDYPSERDALSSDSSNHCEFSVE 419 Query: 573 ETVSEVPDMA---------RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPK 421 E+ S +P+ R+HR+I+LGS+KDFNF+N K EV K I EWW N V Sbjct: 420 ESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPSKPNIGSEWWANKNVAA 479 Query: 420 KELGPRKDWTFFPMLQPGV 364 KE P DWTFFP+LQPGV Sbjct: 480 KESKPCNDWTFFPILQPGV 498 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 559 bits (1441), Expect = e-156 Identities = 293/499 (58%), Positives = 349/499 (69%), Gaps = 30/499 (6%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S AE+R QP+TV KRRWGSCWSLYWCFG HK NKRIGHAVLV Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLV 59 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG +N+ S IV+PFI SFL SDPPSATQSPAG LSL +LS N Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGG A IF++GPYA+ETQL+SPPVFS F TEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 120 AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+ Sbjct: 180 QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 892 P++EFR GEAPK G++HF RKWGSR+GSGSLTP G GSRLGSG+LTP+G Sbjct: 240 HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299 Query: 891 --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 754 SRLGSG +TP+G P S+DS+LLE+QISEVASLANS++ + E V Sbjct: 300 LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359 Query: 753 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 574 D+RVSFEL GED+ C ++V S +TA + A +++D ++ ++ E+S Sbjct: 360 FDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVE 419 Query: 573 ETVSEVPDMA---------RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPK 421 E+ S +P+ R+HR+I+LGS+KDFNF+N K EV K I EWW N V Sbjct: 420 ESSSRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAA 479 Query: 420 KELGPRKDWTFFPMLQPGV 364 KE P DWTFFP+LQPGV Sbjct: 480 KESKPCNDWTFFPILQPGV 498 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 555 bits (1431), Expect = e-155 Identities = 293/485 (60%), Positives = 348/485 (71%), Gaps = 15/485 (3%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S A+SRVQP+TVQK+RWGSCW LYWCFGS K +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG AEN ++ I+LPFI SFLQSDPPSATQSPAGLLSLT+LSVN Sbjct: 61 PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSP G A IFA+GPYAHETQL++PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR 240 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 886 PI+EFR GEAPK LG+E+F RKWGSR+GSGSLTP G GSRLGSG++TP+G GL SR Sbjct: 241 RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSR 300 Query: 885 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 706 LGSG++TP+G P S+D +L+ SQISEVA LAN N K+ E +VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAP 360 Query: 705 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMA------ 544 C +S+ + + + G ++D + K+ + E ET +E + A Sbjct: 361 CLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEE 420 Query: 543 ----RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPML 376 ++HR+++LGS K+FNF+N K E +K TI EWW N+KV KE P WTFFPML Sbjct: 421 EHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPML 480 Query: 375 QPGVS 361 QP VS Sbjct: 481 QPEVS 485 >gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum] Length = 465 Score = 555 bits (1429), Expect = e-155 Identities = 295/475 (62%), Positives = 345/475 (72%), Gaps = 5/475 (1%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG AEN ++ IV+PFI SFLQSDPPSATQSPAGLLSLTALSVN Sbjct: 61 PEPVVPGASVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSP G A IF++GPYAHETQL++PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPRGPASIFSIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 886 PI+EFR GEAPK LG+EHF RKWGSR+GSGSLTP G GSRLGS +TP+G GL SR Sbjct: 241 RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300 Query: 885 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 706 LGSG++TP+G P S+D + +ESQ SEVA L+N N K+ E++VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360 Query: 705 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMARRHRTI 526 C +S+ S +T P+ KDL+A+ E GE +E ++HR++ Sbjct: 361 CLKNKSLVSSRT--------MPDYEYPKDLVAQGRIEKDEKVSGE--AEEDHCYQKHRSV 410 Query: 525 SLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 +LGS K+FNF+N K E EK T+ EWW N+KV KE P +WTFFPMLQP VS Sbjct: 411 TLGSIKEFNFDNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 465 >ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii] gi|763785675|gb|KJB52746.1| hypothetical protein B456_008G275500 [Gossypium raimondii] Length = 465 Score = 553 bits (1424), Expect = e-154 Identities = 295/475 (62%), Positives = 344/475 (72%), Gaps = 5/475 (1%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG AEN ++ IV+PFI SFLQSDPPSATQSPAGLLSLTALSVN Sbjct: 61 PEPVVPGALVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSP G A IFA+GPYAHETQL++PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 886 PI+EFR GEAPK LG+EHF RKWGSR+GSGSLTP G GSRLGS +TP+G GL SR Sbjct: 241 RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300 Query: 885 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 706 LGSG++TP+G P S+D + +ESQ SEVA L+N N K+ E++VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360 Query: 705 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMARRHRTI 526 C +S+ S +T P+ DL+A+ E GE +E ++HR++ Sbjct: 361 CLKNKSLVSSRT--------MPDYEYPNDLVAQGRIEKDEKVSGE--AEEDHCYQKHRSV 410 Query: 525 SLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 +LGS K+FNF+N K E EK T+ EWW N+KV KE P +WTFFPMLQP VS Sbjct: 411 TLGSIKEFNFDNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 465 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 550 bits (1416), Expect = e-153 Identities = 293/489 (59%), Positives = 348/489 (71%), Gaps = 19/489 (3%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQ----KRRWGSCWSLYWCFGSHKPNKRIGH 1603 M SV++S A+SRVQP+TVQ K+RWGSCW LYWCFGS K +KRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 1602 AVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTA 1423 AVLVPEP PG AEN ++ I+LPFI SFLQSDPPSATQSPAGLLSLT+ Sbjct: 61 AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTS 120 Query: 1422 LSVNVYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPE 1243 LSVN YSP G A IFA+GPYAHETQL++PPVFSA TTEPSTA FTPPPESVQ+TTPSSPE Sbjct: 121 LSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPE 180 Query: 1242 VPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSP 1066 VPFAQLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSP Sbjct: 181 VPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSP 240 Query: 1065 FPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-G 895 FPD+ PI+EFR GEAPK LG+E+F RKWGSR+GSGSLTP G GSRLGSG++TP+G G Sbjct: 241 FPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMG 300 Query: 894 L-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGE 718 L SRLGSG++TP+G P S+D +L+ SQISEVA LAN N K+ E +VD+RVSFEL GE Sbjct: 301 LGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGE 360 Query: 717 DIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMA-- 544 D+ C +S+ + + + G ++D + K+ + E ET +E + A Sbjct: 361 DVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASG 420 Query: 543 --------RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTF 388 ++HR+++LGS K+FNF+N K E +K TI EWW N+KV KE P WTF Sbjct: 421 EAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTF 480 Query: 387 FPMLQPGVS 361 FPMLQP VS Sbjct: 481 FPMLQPEVS 489 >gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium raimondii] Length = 464 Score = 547 bits (1410), Expect = e-152 Identities = 295/475 (62%), Positives = 343/475 (72%), Gaps = 5/475 (1%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 M SV++S AESRVQP+TVQKR WGSCWS YWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAASAIVSAESRVQPTTVQKR-WGSCWSFYWCFGSHKSSKRIGHAVLV 59 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP PG AEN ++ IV+PFI SFLQSDPPSATQSPAGLLSLTALSVN Sbjct: 60 PEPVVPGALVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 119 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSP G A IFA+GPYAHETQL++PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 120 AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 179 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+ Sbjct: 180 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 239 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 886 PI+EFR GEAPK LG+EHF RKWGSR+GSGSLTP G GSRLGS +TP+G GL SR Sbjct: 240 RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 299 Query: 885 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 706 LGSG++TP+G P S+D + +ESQ SEVA L+N N K+ E++VD+RVSFEL GED+ Sbjct: 300 LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 359 Query: 705 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMARRHRTI 526 C +S+ S +T P+ DL+A+ E GE +E ++HR++ Sbjct: 360 CLKNKSLVSSRT--------MPDYEYPNDLVAQGRIEKDEKVSGE--AEEDHCYQKHRSV 409 Query: 525 SLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 +LGS K+FNF+N K E EK T+ EWW N+KV KE P +WTFFPMLQP VS Sbjct: 410 TLGSIKEFNFDNRKGEASEKPTVRSEWWANEKVAGKEARPGNNWTFFPMLQPEVS 464 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 547 bits (1410), Expect = e-152 Identities = 289/501 (57%), Positives = 350/501 (69%), Gaps = 31/501 (6%) Frame = -1 Query: 1770 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 1591 MSSVH+S AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVLV Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60 Query: 1590 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVN 1411 PEP G AP AE + HS IVLPFI SFLQSDPPSATQSPAGLLSL +LSVN Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120 Query: 1410 VYSPGGTAPIFAVGPYAHETQLISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 1231 YSPGG A +FA+GPYAHETQL++PPVFSAFTTEPSTA TPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 1230 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 1054 QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 1053 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 928 PI++F APK LG+EHF RKWGSR+GSGS+TP G GSR Sbjct: 241 HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 Query: 927 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 754 LGSGT+TP+G GL SRLGSG++TP+G P S+D ++ E+QISEVASLANSDN +K E + Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360 Query: 753 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 574 +D+RVSFEL GE++ C +S S + + + P G ++D +++ E Sbjct: 361 IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPE 420 Query: 573 ETVSEVPDMA----------RRHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVP 424 E+ + +P+ R+HR+I+LGS K+FNF+N + EV K +I+ EWW N+ V Sbjct: 421 ESSNRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENV- 479 Query: 423 KKELGPRKDWTFFPMLQPGVS 361 KE P +WTFFPMLQ S Sbjct: 480 GKESKPSNNWTFFPMLQSEAS 500 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 546 bits (1407), Expect = e-152 Identities = 290/480 (60%), Positives = 342/480 (71%), Gaps = 31/480 (6%) Frame = -1 Query: 1707 ESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSAT 1528 ESRVQP+TVQKRRWG CWSLYWCFGSHK KRIGHAVL PEP G AEN++ S Sbjct: 36 ESRVQPTTVQKRRWGGCWSLYWCFGSHK-TKRIGHAVLAPEPEVQGAVVTSAENQSQSTA 94 Query: 1527 IVLPFIXXXXXXXSFLQSDPPSATQSPAGLLSLTALSVNVYSPGGTAPIFAVGPYAHETQ 1348 I +PFI SFLQSDPPSATQSPAGLLSLT+LSVN YSPGG A IFA+GPYAHETQ Sbjct: 95 ITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPGGPASIFAIGPYAHETQ 154 Query: 1347 LISPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFS 1168 L++PP FSAFTTEPSTA FTPPPESVQ+TTPSSPEVPFAQLLTSSL R RRN+GTN KF+ Sbjct: 155 LVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGTNQKFA 214 Query: 1167 LSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFA 991 LS Y+FQ Y YPGSPGG+L SPGS IS SGTSSPFPD+ PI+EFR GEAPK LG+EHF Sbjct: 215 LSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPILEFRMGEAPKLLGFEHFT 274 Query: 990 NRKWGSRVGSGSLTP--TGWGSRLGSGTLTPN--GGLSRLGSGTVTPNG----------- 856 RKWGSR+GSG++TP G GSRLGSGT+TP+ G SRLGSGTVTP+G Sbjct: 275 TRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSGTVTPDGVGLRSMLGSGS 334 Query: 855 -----GEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 691 P S+D + LE+QISEVASLANS+N SK E +VD+RVSFEL GE++ C + Sbjct: 335 LTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHRVSFELSGEEVARCLESK 394 Query: 690 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEVPDMA----------R 541 S+ S + + P++ + M + GET E P+ R Sbjct: 395 SLASCRAFSECPPDSMAEDQIKSGKMLMTDE---NLPTGETSGETPEKPSGEMEEEHCYR 451 Query: 540 RHRTISLGSSKDFNFNNAKEEVLEKSTIDCEWWTNDKVPKKELGPRKDWTFFPMLQPGVS 361 +HR+I+LGS K+FNF+N+K EV +K +I+ EWW N+ + KE P +WTFFP+LQP VS Sbjct: 452 KHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510