BLASTX nr result
ID: Forsythia23_contig00022617
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00022617
         (1322 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   559   e-156
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   501   e-139
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   499   e-138
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   496   e-137
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   493   e-136
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   493   e-136
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...   478   e-132
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   475   e-131
emb|CDP05166.1| unnamed protein product [Coffea canephora]            470   e-130
ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160...   459   e-126
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   425   e-116
ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333...   424   e-116
ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prun...   424   e-116
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...   423   e-115
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   421   e-115
gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sin...   419   e-114
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...   419   e-114
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   418   e-114
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...
417 e-113 ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot... 415 e-113 >ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum] Length = 479 Score = 559 bits (1441), Expect = e-156 Identities = 284/405 (70%), Positives = 309/405 (76%), Gaps = 30/405 (7%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSVHNS AESRVQPSTVQKRRWGSCWS+YWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 EP GV AP++ENRN S+TIVLPFI SFLQSDPPSATQSPAGL Sbjct: 61 SEPAAAGVAAPISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSVH 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L SPPVFS FTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 ANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLL+SSLARNRRN GTNLK+SLSQY+FQPYQYPGSPGG +KSPGSA+STSGTSSPFPDK Sbjct: 181 QLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFPDKH 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFR GEAPKFLGYEHF N KWGSRVGSGSLTP GWGSRLGSG LTPNGGLSRLGSGT Sbjct: 241 PIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSGT 300 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 +TPNGGEPPS+D LLE+QI EVASLANSD +S++ + VVD+RVSFEL GEDIPTC + E Sbjct: 301 LTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVTE 360 Query: 137 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSEV 3 S PS+K A A G N KDL KN D CRE++DGET +EV Sbjct: 361 SAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEV 405 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 501 bits (1291), Expect = e-139 Identities = 263/390 (67%), Positives = 284/390 (72%), Gaps = 32/390 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 
MSSV N+ AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLL Sbjct: 61 PEPAAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSIN 120 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 VTPNGGEPPS+DSYLLE QISEVASLANSDN S+ GE V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREKE 360 Query: 137 SVPSY--KTALKTVPEAAPNGMNQKDLMAK 54 V S+ +T V N M MA+ Sbjct: 361 PVMSHSQQTLPMDVSNLLANEMKSGSSMAE 390 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum lycopersicum] Length = 470 Score = 499 bits (1285), Expect = e-138 Identities = 255/365 (69%), Positives = 276/365 (75%), Gaps = 30/365 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSV N+ AESRVQPSTVQKRRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLL Sbjct: 61 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 120 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS FTTEPSTA FTPPPE V +TTP 
SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL ED+P+C KE Sbjct: 301 VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 360 Query: 137 SVPSY 123 V S+ Sbjct: 361 PVMSH 365 >ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana sylvestris] Length = 470 Score = 496 bits (1277), Expect = e-137 Identities = 260/399 (65%), Positives = 290/399 (72%), Gaps = 30/399 (7%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSV N+ AESRVQPS+VQKRRWGSCWSLYWCFGS+K +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG PV EN N SATIV+PFI SFL SDPPSATQSPAGLL Sbjct: 61 PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 
PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 VTPNGGEPPS+D YLLE+QISEVASLANSDN S+ E V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360 Query: 137 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21 V S+ + +T+P P N++ M ++ E +DG Sbjct: 361 PVMSH--SQQTLPMDVPAPSNKE--MRSSSSIVEEKTDG 395 >ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana tomentosiformis] Length = 470 Score = 493 bits (1269), Expect = e-136 Identities = 257/399 (64%), Positives = 290/399 (72%), Gaps = 30/399 (7%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSV N+ AESRVQPS++QK+RWGSCWSLYWCFGS+K +KRIGHA+LV Sbjct: 1 MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG PV EN N SATIV+PFI SFL SDPPSATQSPAGLL Sbjct: 61 PEPAAPGPAVPVTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSIN 120 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 121 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 181 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFPGKC 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGSLTP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 241 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGT 300 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 VTPNGGEPPS+D YLLE+QISEVASLANSDN S+ E V+D+RVSFEL GED+P+C KE Sbjct: 301 VTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREKE 360 Query: 137 SVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21 V S+ + +T+P P N++ M ++ E +DG Sbjct: 361 
PVMSH--SQQTLPMDVPAPSNKE--MRSSSSNVEEKTDG 395 >ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum lycopersicum] Length = 469 Score = 493 bits (1268), Expect = e-136 Identities = 254/365 (69%), Positives = 275/365 (75%), Gaps = 30/365 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSV N+ AESRVQPSTVQ RRWGSCWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG PV EN NHSATIV+PFI SFL SDPPSATQSPAGLL Sbjct: 60 PEPVAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSIN 119 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS FTTEPSTA FTPPPE V +TTP SPEVPFA Sbjct: 120 AYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFA 179 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLLTSSLARNRR +G+N KF LSQY+F PYQ PGSPG L SPGS +S SGTSSPFP K Sbjct: 180 QLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKC 239 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFRKGE PKFLGYEHF+ RKWGSRVGSGS+TP+GWGSRLGSGTLTPNGG+SRLGSGT Sbjct: 240 PIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGT 299 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 VTPNGGEPPS+DSYLLE+QISEVASLANSDN S+ GE V+D+RVSFEL ED+P+C KE Sbjct: 300 VTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREKE 359 Query: 137 SVPSY 123 V S+ Sbjct: 360 PVMSH 364 >ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana sylvestris] Length = 442 Score = 478 bits (1229), Expect = e-132 Identities = 246/370 (66%), Positives = 275/370 (74%), Gaps = 30/370 (8%) Frame = -1 Query: 1040 VQKRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXX 861 +QKRRWGSCWSLYWCFGS+K +KRIGHAVLVPEP PG PV EN N SATIV+PFI Sbjct: 2 MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVPVTENPNRSATIVIPFIAP 61 
Query: 860 XXXXXSFLQSDPPSATQSPAGLLX------------------------------SPPVFS 771 SFL SDPPSATQSPAGLL SPPVFS Sbjct: 62 PSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPPVFS 121 Query: 770 AFTTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQP 591 FTTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F P Sbjct: 122 TFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVP 181 Query: 590 YQYPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVG 411 YQ PGSPG L SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVG Sbjct: 182 YQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVG 241 Query: 410 SGSLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANS 231 SGSLTP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+D YLLE+QISEVASLANS Sbjct: 242 SGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASLANS 301 Query: 230 DNESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKN 51 DN S+ E V+D+RVSFEL GED+P+C KE V S+ + +T+P P N++ M + Sbjct: 302 DNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSH--SQQTLPMDVPAPSNKE--MRSS 357 Query: 50 TDCCREYSDG 21 + E +DG Sbjct: 358 SSIVEEKTDG 367 >ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum lycopersicum] Length = 476 Score = 475 bits (1223), Expect = e-131 Identities = 238/334 (71%), Positives = 259/334 (77%), Gaps = 30/334 (8%) Frame = -1 Query: 1034 KRRWGSCWSLYWCFGSHKPNKRIGHAVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXX 855 +RRWGSCWSLYWCFGSHK +KRIGHAVLVPEP PG PV EN NHSATIV+PFI Sbjct: 38 ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVPVTENPNHSATIVIPFIAPPS 97 Query: 854 XXXSFLQSDPPSATQSPAGLLX------------------------------SPPVFSAF 765 SFL SDPPSATQSPAGLL SPPVFS F Sbjct: 98 SPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFSTF 157 Query: 764 TTEPSTACFTPPPESVQITTPSSPEVPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ 585 TTEPSTA FTPPPE V +TTP SPEVPFAQLLTSSLARNRR +G+N KF LSQY+F PYQ Sbjct: 158 TTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQ 217 Query: 584 YPGSPGGRLKSPGSAISTSGTSSPFPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSG 405 PGSPG L 
SPGS +S SGTSSPFP K PI+EFRKGE PKFLGYEHF+ RKWGSRVGSG Sbjct: 218 DPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSG 277 Query: 404 SLTPTGWGSRLGSGTLTPNGGLSRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDN 225 S+TP+GWGSRLGSGTLTPNGG+SRLGSGTVTPNGGEPPS+DSYLLE+QISEVASLANSDN Sbjct: 278 SVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSDN 337 Query: 224 ESKDGEVVVDYRVSFELIGEDIPTCTIKESVPSY 123 S+ GE V+D+RVSFEL ED+P+C KE V S+ Sbjct: 338 GSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSH 371 >emb|CDP05166.1| unnamed protein product [Coffea canephora] Length = 452 Score = 470 bits (1210), Expect = e-130 Identities = 244/360 (67%), Positives = 269/360 (74%), Gaps = 30/360 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSVHNS AESRVQP TVQKRRWGSCWS YWCFGS K +KRIG+AVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEPT PG PV +N NHSATIV+PFI SFLQSDPPSATQSPA L Sbjct: 61 PEPTVPGSAVPVPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSVN 120 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 TYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLL SSL NRR++GT++KF LSQY+FQPYQ PGSPG L SPGSAIS SGTSSPFP+K Sbjct: 181 QLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFPEKR 240 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 PI+EFR GEAPKFLGYE F RKWGSRVGSGSLTP GWGSRLGSG+LTPNGG+SRLGSGT Sbjct: 241 PIIEFRIGEAPKFLGYELF-TRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSGT 299 Query: 317 VTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPTCTIKE 138 +TPNGGEP ++DSYLLE+QISEVASLANSDN + + E ++D+RVSFEL E +P C +E Sbjct: 300 LTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCVEEE 359 >ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum] Length = 466 Score = 459 
bits (1180), Expect = e-126 Identities = 251/418 (60%), Positives = 285/418 (68%), Gaps = 44/418 (10%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M+SVHNS AE+R QPSTVQKRRWGSCWSLYWCFGS+K +KRIGHAVL+ Sbjct: 1 MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 EPT APV EN N SAT++LPFI SFLQSDPPSATQS AGL Sbjct: 61 SEPTAQVAVAPVVENLNRSATLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAALSVH 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L SPPVFSAFTTEPSTA FTPPPE VQ+TTPSSPEVPFA Sbjct: 121 TYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQYPGSPGGRLKSPGSAISTSGTSSPFPDKL 498 QLL+SSLARNRRN+G N+K SLSQY+F Y+ SPGSA+S+SGTSSPFPDK Sbjct: 181 QLLSSSLARNRRNSG-NMKSSLSQYEFLAYE----------SPGSALSSSGTSSPFPDKW 229 Query: 497 PIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTPTGWGSRLGSGTLTPNGGLSRLGSGT 318 P+VE R+GEAP F+GYEHF N KWGSRVGSGSLTP G GSRLGSG LTPNGGLSRLGSG Sbjct: 230 PVVEIRRGEAPIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLGSGA 289 Query: 317 VTPNGG--------------EPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSF 180 +TPNGG EPPS+D LL + ISEV SLANS NE ++ + VVD+RVSF Sbjct: 290 LTPNGGLSRLGSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHRVSF 349 Query: 179 ELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6 EL GEDIPTC + E+VPS K + + EA N D MAK ++ R+ S+GET+ E Sbjct: 350 ELSGEDIPTCVVSETVPSPKMESRDLQEATAEVTNHSDFMAKVSETYRKLSNGETMHE 407 >ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas] gi|643706116|gb|KDP22248.1| hypothetical protein JCGZ_26079 [Jatropha curcas] Length = 498 Score = 425 bits (1092), Expect = e-116 Identities = 238/426 (55%), Positives = 281/426 (65%), Gaps = 52/426 (12%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV+NS AESRVQP+ VQKRRWG CWSLYWCFGSHK +KRIGHAVLV Sbjct: 1 
MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP P AEN+ HS +PFI SFLQSDPPS TQSPAGL Sbjct: 61 PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFLQSDPPSVTQSPAGLLSLTALSVS 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSAFTTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+G N KF+LS Y+FQ Y YPGSPGG+L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375 P++EFR GEAPK LG+EHF RKWGSR+GSG+LTP G GSR Sbjct: 241 HPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGSR 300 Query: 374 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 LGSG++TP+G GL SRLGSG++TP+ P SQD LLE+QISEVASLANS+N SK+ E + Sbjct: 301 LGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDENI 360 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEA-APNGMNQKDLMAKNTDCCREYSD 24 VD+RVSFEL GE++ C +S+ S +T + ++ A +N ++++ + DC Sbjct: 361 VDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLH---I 417 Query: 23 GETVSE 6 GET +E Sbjct: 418 GETSNE 423 >ref|XP_008234199.1| PREDICTED: uncharacterized protein LOC103333182 [Prunus mume] Length = 499 Score = 424 bits (1091), Expect = e-116 Identities = 233/426 (54%), Positives = 279/426 (65%), Gaps = 51/426 (11%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV++S AE+R QP+TV KRRWGSCWSLYWCFGSHK NKRIGHAVLV Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARAQPTTVPKRRWGSCWSLYWCFGSHK-NKRIGHAVLV 59 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG +N+ S IV+PFI SFL SDPPSATQSPAG L Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS F 
TEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 120 AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+ Sbjct: 180 QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 339 P++EF GEAPK G++HF RKWGSR+GSGSLTP G GSRLGSG+LTP+G Sbjct: 240 HPVLEFHMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299 Query: 338 --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 SRLGSG +TP+G P S+DS+LLE+QISEVASLANS++ + E V Sbjct: 300 LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21 D+RVSFEL GED+ C +++ S +TA + A + +++D ++ ++ E+S Sbjct: 360 FDHRVSFELTGEDVACCLANKAMASNRTASGSSKVIASDYPSERDALSSDSSNHCEFSVE 419 Query: 20 ETVSEV 3 E+ S + Sbjct: 420 ESSSRI 425 >ref|XP_007219041.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] gi|462415503|gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 424 bits (1091), Expect = e-116 Identities = 234/426 (54%), Positives = 278/426 (65%), Gaps = 51/426 (11%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV++S AE+R QP+TV KRRWGSCWSLYWCFG HK NKRIGHAVLV Sbjct: 1 MRSVNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLV 59 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGLLX------- 789 PEP PG +N+ S IV+PFI SFL SDPPSATQSPAG L Sbjct: 60 PEPVVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSAN 119 Query: 788 -----------------------SPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 SPPVFS F TEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 120 AYSPGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFA 179 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPY-QYPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL RNRRN+GTN KF+LS Y+FQPY QYPGSPGG L SPGSA+S SGTSSPFPD+ Sbjct: 180 
QLLTSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDR 239 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNGGL---- 339 P++EFR GEAPK G++HF RKWGSR+GSGSLTP G GSRLGSG+LTP+G Sbjct: 240 HPVLEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSR 299 Query: 338 --------------SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 SRLGSG +TP+G P S+DS+LLE+QISEVASLANS++ + E V Sbjct: 300 LGSGCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETV 359 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDG 21 D+RVSFEL GED+ C ++V S +TA + A +++D ++ ++ E+S Sbjct: 360 FDHRVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVE 419 Query: 20 ETVSEV 3 E+ S + Sbjct: 420 ESSSRI 425 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 423 bits (1087), Expect = e-115 Identities = 230/405 (56%), Positives = 270/405 (66%), Gaps = 51/405 (12%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSVH+S AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVLV Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP G AP AE + HS IVLPFI SFLQSDPPSATQSPAGL Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSAFTTEPSTA TPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375 PI++F APK LG+EHF RKWGSR+GSGS+TP G GSR Sbjct: 241 HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 
Query: 374 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 LGSGT+TP+G GL SRLGSG++TP+G P S+D ++ E+QISEVASLANSDN +K E + Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66 +D+RVSFEL GE++ C +S S + + + P G ++D Sbjct: 361 IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 421 bits (1082), Expect = e-115 Identities = 233/409 (56%), Positives = 275/409 (67%), Gaps = 35/409 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV++S A+SRVQP+TVQK+RWGSCW LYWCFGS K +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP PG AEN ++ I+LPFI SFLQSDPPSATQSPAGL Sbjct: 61 PEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333 PI+EFR GEAPK LG+E+F RKWGSR+GSGSLTP G GSRLGSG++TP+G GL SR Sbjct: 241 RPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSR 300 Query: 332 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153 LGSG++TP+G P S+D +L+ SQISEVA LAN N K+ E +VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAP 360 Query: 152 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6 C +S+ + + + G ++D 
+ K+ + E ET +E Sbjct: 361 CLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNE 409 >gb|KDO51973.1| hypothetical protein CISIN_1g010808mg [Citrus sinensis] Length = 500 Score = 419 bits (1077), Expect = e-114 Identities = 228/405 (56%), Positives = 269/405 (66%), Gaps = 51/405 (12%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSVH+S AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVL+ Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP G AP AE + HS IVLPFI SFLQSDP SATQSPAGL Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLCLNSLSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSAFTTEPSTA TPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375 PI++F APK LG+EHF RKWGSR+GSGS+TP G GSR Sbjct: 241 RPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 Query: 374 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 LGSGT+TP+G GL SRLGSG++TP+G P S+D ++ E+QISEVASLANSDN +K E + Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66 +D+RVSFEL GE++ C +S S + + + P G ++D Sbjct: 361 IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 419 bits (1076), Expect = e-114 Identities = 228/405 (56%), Positives = 269/405 (66%), Gaps = 51/405 (12%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 MSSVH+S AESR++P+ +QKRRWGSCWSLYWCFGSHK +KRI HAVL+ 
Sbjct: 1 MSSVHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLL 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP G AP AE + HS IVLPFI SFLQSDP SATQSPAGL Sbjct: 61 PEPMVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSAFTTEPSTA TPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+GTN K SLS Y +QPYQ YPGSPGG+L SPGS +S SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP------------------TGWGSR 375 PI++F APK LG+EHF RKWGSR+GSGS+TP G GSR Sbjct: 241 HPILDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSR 300 Query: 374 LGSGTLTPNG-GL-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVV 201 LGSGT+TP+G GL SRLGSG++TP+G P S+D ++ E+QISEVASLANSDN +K E + Sbjct: 301 LGSGTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHI 360 Query: 200 VDYRVSFELIGEDIPTCTIKESVPSYKTALKTVPEAAPNGMNQKD 66 +D+RVSFEL GE++ C +S S + + + P G ++D Sbjct: 361 IDHRVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRD 405 >gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum] Length = 465 Score = 418 bits (1075), Expect = e-114 Identities = 232/393 (59%), Positives = 268/393 (68%), Gaps = 35/393 (8%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV++S AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP PG AEN ++ IV+PFI SFLQSDPPSATQSPAGL Sbjct: 61 PEPVVPGASVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 
AYSPRGPASIFSIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333 PI+EFR GEAPK LG+EHF RKWGSR+GSGSLTP G GSRLGS +TP+G GL SR Sbjct: 241 RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300 Query: 332 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153 LGSG++TP+G P S+D + +ESQ SEVA L+N N K+ E++VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360 Query: 152 CTIKESVPSYKTALKTVPEAAPNGMNQKDLMAK 54 C +S+ S +T P+ KDL+A+ Sbjct: 361 CLKNKSLVSSRT--------MPDYEYPKDLVAQ 385 >ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii] gi|763785675|gb|KJB52746.1| hypothetical protein B456_008G275500 [Gossypium raimondii] Length = 465 Score = 417 bits (1071), Expect = e-113 Identities = 227/372 (61%), Positives = 260/372 (69%), Gaps = 35/372 (9%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQKRRWGSCWSLYWCFGSHKPNKRIGHAVLV 948 M SV++S AESRVQP+TVQK+RWGSCWS YWCFGSHK +KRIGHAVLV Sbjct: 1 MRSVNDSVETVNAAASAIVSAESRVQPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLV 60 Query: 947 PEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL--------- 795 PEP PG AEN ++ IV+PFI SFLQSDPPSATQSPAGL Sbjct: 61 PEPVVPGALVSTAENASNPTGIVMPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVN 120 Query: 794 ---------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPEVPFA 678 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPEVPFA Sbjct: 121 AYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFA 180 Query: 677 QLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSPFPDK 501 QLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGS IS SGTSSPFPD+ Sbjct: 181 QLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDR 240 Query: 500 LPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-GL-SR 333 
PI+EFR GEAPK LG+EHF RKWGSR+GSGSLTP G GSRLGS +TP+G GL SR Sbjct: 241 RPILEFRMGEAPKTLGFEHFTTRKWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSR 300 Query: 332 LGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGEDIPT 153 LGSG++TP+G P S+D + +ESQ SEVA L+N N K+ E++VD+RVSFEL GED+ Sbjct: 301 LGSGSLTPDGLGPASRDGFPIESQNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVAR 360 Query: 152 CTIKESVPSYKT 117 C +S+ S +T Sbjct: 361 CLKNKSLVSSRT 372 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 415 bits (1067), Expect = e-113 Identities = 233/413 (56%), Positives = 275/413 (66%), Gaps = 39/413 (9%) Frame = -1 Query: 1127 MSSVHNSXXXXXXXXXXXXXAESRVQPSTVQ----KRRWGSCWSLYWCFGSHKPNKRIGH 960 M SV++S A+SRVQP+TVQ K+RWGSCW LYWCFGS K +KRIGH Sbjct: 1 MRSVNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGH 60 Query: 959 AVLVPEPTPPGVEAPVAENRNHSATIVLPFIXXXXXXXSFLQSDPPSATQSPAGL----- 795 AVLVPEP PG AEN ++ I+LPFI SFLQSDPPSATQSPAGL Sbjct: 61 AVLVPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTS 120 Query: 794 -------------------------LXSPPVFSAFTTEPSTACFTPPPESVQITTPSSPE 690 L +PPVFSA TTEPSTA FTPPPESVQ+TTPSSPE Sbjct: 121 LSVNAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPE 180 Query: 689 VPFAQLLTSSLARNRRNNGTNLKFSLSQYDFQPYQ-YPGSPGGRLKSPGSAISTSGTSSP 513 VPFAQLLTSSL R RRN+G N KF LS Y+FQ YQ YPGSPGG L SPGSAIS SGTSSP Sbjct: 181 VPFAQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSP 240 Query: 512 FPDKLPIVEFRKGEAPKFLGYEHFANRKWGSRVGSGSLTP--TGWGSRLGSGTLTPNG-G 342 FPD+ PI+EFR GEAPK LG+E+F RKWGSR+GSGSLTP G GSRLGSG++TP+G G Sbjct: 241 FPDRRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMG 300 Query: 341 L-SRLGSGTVTPNGGEPPSQDSYLLESQISEVASLANSDNESKDGEVVVDYRVSFELIGE 165 L SRLGSG++TP+G P S+D +L+ SQISEVA LAN N K+ E +VD+RVSFEL GE Sbjct: 301 LGSRLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGE 360 Query: 164 
DIPTCTIKESVPSYKTALKTVPEAAPNGMNQKDLMAKNTDCCREYSDGETVSE 6 D+ C +S+ + + + G ++D + K+ + E ET +E Sbjct: 361 DVAPCLESKSLLPSRAVSEYPKDLVAEGRKERDGIKKDLESSCELFIRETSNE 413