BLASTX nr result
ID: Rehmannia27_contig00003712
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia27_contig00003712
         (2320 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   466   e-154
ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160...   434   e-142
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...   386   e-123
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...   384   e-122
ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015...   382   e-121
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   379   e-120
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   379   e-120
emb|CDP05166.1| unnamed protein product [Coffea canephora]            376   e-119
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...   375   e-119
ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015...   375   e-119
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   373   e-118
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   363   e-114
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...   326   e-100
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]   320   5e-98
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...   320   9e-98
ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648...   320   2e-97
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...   318   2e-97
ref|XP_015888763.1| PREDICTED: uncharacterized protein LOC107423...   317   4e-96
gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium r...
313 3e-95 ref|XP_009368760.1| PREDICTED: uncharacterized protein LOC103958... 313 3e-95 >ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum] Length = 479 Score = 466 bits (1198), Expect = e-154 Identities = 261/480 (54%), Positives = 292/480 (60%), Gaps = 54/480 (11%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV NS QPSTVQKRRW SCWS+Y C GS+K SKRIGHAVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPSTVQKRRWGSCWSIYWCFGSHKQSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 S+P AP + N N+S+T QSDPPSAT Sbjct: 61 SEPAAAGVAAP-ISENRNQSSTIVLPFIAPPSSPASFLQSDPPSATQSPAGLISLASLSV 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 H SPGGTAPIFTIGPYA+ETQLVSPPVFSTFTTEPSTASF EV F Sbjct: 120 HANSPGGTAPIFTIGPYAHETQLVSPPVFSTFTTEPSTASFTPPPEPVQMTTPSSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTN----------------------IKSPGSAISTSGTSSPFPDK 1040 TN IKSPGSA+STSGTSSPFPDK Sbjct: 180 AQLLSSSLARNRRNCGTNLKYSLSQYEFQPYQYPGSPGGHIKSPGSALSTSGTSSPFPDK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 I+EF +GE KF+GYEHF NYKWGSRVGSGS LTPNGG+SRLGSG Sbjct: 240 HPIMEFRMGEAPKFLGYEHFPNYKWGSRVGSGSLTPNGWGSRLGSGALTPNGGLSRLGSG 299 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 +LTPNG EP SRD NLLE+QI EVASLANSD +S+NDD VV+HRVSFEL GEDIPTCVV Sbjct: 300 TLTPNGGEPPSRDGNLLENQIYEVASLANSDRKSQNDDAVVDHRVSFELFGEDIPTCVVT 359 Query: 1359 ETVRSPK---------IALDTNQKDLMTKNEDSCRENNNAKNVNAIP---------GFDQ 1484 E+ S K A TN KDL TKN DSCRE+N+ + N +P Q Sbjct: 360 ESAPSHKNASGYPGVATAEGTNNKDLTTKNADSCREHNDGETTNEVPEIPLDGEGGELHQ 419 Query: 1485 KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 K RTVS+GSSKDFNFNN+K E+ +KS+++CEWW N+KVV KELGP NSW+FFPMLQSG S Sbjct: 420 KQRTVSLGSSKDFNFNNAKGEIPEKSSINCEWWTNEKVVRKELGPRNSWSFFPMLQSGAS 479 >ref|XP_011076135.1| PREDICTED: uncharacterized protein LOC105160458 [Sesamum indicum] Length = 466 Score = 434 bits 
(1116), Expect = e-142 Identities = 246/474 (51%), Positives = 282/474 (59%), Gaps = 48/474 (10%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 M+SV NS QPSTVQKRRW SCWSLY C GSYKHSKRIGHAVL+ Sbjct: 1 MTSVHNSAETLNAAATAIVTAENRAQPSTVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLI 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 S+PT QV+VAP V+N NRSAT QSDPPSAT Sbjct: 61 SEPTAQVAVAPVVENL-NRSATLMLPFIAPPSSPASFLQSDPPSATQSAAGLVSLAALSV 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 HT+SPGGTAPIFTIGPYAYETQLVSPPVFS FTTEPSTASF EV F Sbjct: 120 HTYSPGGTAPIFTIGPYAYETQLVSPPVFSAFTTEPSTASFTPPPEPVQMTTPSSPEVPF 179 Query: 927 -----------XXXXXXXXXXXXXXXWTNIKSPGSAISTSGTSSPFPDKRAIIEFCVGEG 1073 + +SPGSA+S+SGTSSPFPDK ++E GE Sbjct: 180 AQLLSSSLARNRRNSGNMKSSLSQYEFLAYESPGSALSSSGTSSPFPDKWPVVEIRRGEA 239 Query: 1074 SKFIGYEHFLNYKWGSRVGSGS----------------------------LTPNGGISRL 1169 FIGYEHF N+KWGSRVGSGS LTPNGG+SRL Sbjct: 240 PIFIGYEHFFNHKWGSRVGSGSLTPNGRGSRLGSGALTPNGGLSRLGSGALTPNGGLSRL 299 Query: 1170 GSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTC 1349 GSG+LTPNG EP SRDCNLL + ISEV SLANS E +N D VV+HRVSFEL GEDIPTC Sbjct: 300 GSGALTPNGGEPPSRDCNLLGNPISEVVSLANSGNELQNCDAVVDHRVSFELSGEDIPTC 359 Query: 1350 VVKETVRSPKI---------ALDTNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVS 1502 VV ETV SPK+ A TN D M K ++ R+ +N + ++ ++ T+S Sbjct: 360 VVSETVPSPKMESRDLQEATAEVTNHSDFMAKVSETYRKLSNGETMH-------ENHTIS 412 Query: 1503 MGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 +GSS+DFNFNN+ E++ + VDCEWW ND VV KEL P N+W FFPMLQSGVS Sbjct: 413 LGSSRDFNFNNADGELSARIAVDCEWWTNDDVVGKELAPRNNWTFFPMLQSGVS 466 >ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1 [Nicotiana sylvestris] Length = 470 Score = 386 bits (991), Expect = e-123 Identities = 228/473 (48%), Positives = 265/473 (56%), Gaps = 47/473 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPS+VQKRRW SCWSLY 
C GSYKHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAATAIVTAESRVQPSSVQKRRWGSCWSLYWCFGSYKHSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPNRSAT SDPPSAT Sbjct: 61 PEPAAPGPAVP-VTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSI 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRDC LLE+QISEVASLANSD S +GV++HRVSFEL GED+P+C K Sbjct: 300 TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359 Query: 1359 ETVRSPKIALDTNQKDLMTKNEDSCRENNN--AKNVNAIP------GFDQ---KHRTVSM 1505 E V S + T D+ + R +++ + + +P G DQ KHR ++ Sbjct: 360 EPVMSH--SQQTLPMDVPAPSNKEMRSSSSIVEEKTDGLPEKASERGDDQCHRKHRNITF 417 Query: 1506 GSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 GSSKDF+F+N K EV +K +VDCEWW +DK KE N+W FFP+LQ GVS Sbjct: 418 GSSKDFDFDNVKIEVLEKHSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470 >ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana tomentosiformis] Length = 470 Score = 384 bits (986), Expect = e-122 Identities = 222/471 (47%), Positives = 263/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPS++QK+RW SCWSLY C GSYKHSKRIGHA+LV Sbjct: 1 MSSVQNTVDTVNAAATAIITAESRVQPSSIQKKRWGSCWSLYWCFGSYKHSKRIGHAILV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPNRSAT SDPPSAT Sbjct: 61 
PEPAAPGPAVP-VTENPNRSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSFSI 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSSLISPGSVVSNSGTSSPFPGK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRDC LLE+QISEVASLANSD S +GV++HRVSFEL GED+P+C K Sbjct: 300 TVTPNGGEPPSRDCYLLENQISEVASLANSDNGSEIAEGVIDHRVSFELTGEDVPSCREK 359 Query: 1359 ETVRS------PKIALDTNQKDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511 E V S P + K++ + + + + + + G DQ KHR ++ GS Sbjct: 360 EPVMSHSQQTLPMDVPAPSNKEMRSSSSNVEEKTDGLPEKASERGDDQCHRKHRNITFGS 419 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV ++ +VDCEWW +DK KE N+W FFP+LQ GVS Sbjct: 420 SKDFDFDNVKIEVLEEDSVDCEWWTSDKATGKESSIQNNWTFFPVLQPGVS 470 >ref|XP_015070691.1| PREDICTED: uncharacterized protein LOC107015045 isoform X1 [Solanum pennellii] Length = 470 Score = 382 bits (981), Expect = e-121 Identities = 223/471 (47%), Positives = 261/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPN SAT SDPPSAT Sbjct: 61 PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 120 
NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRD LLE+QISEVASLANSD S +GV++HRVSFEL GED+P+C K Sbjct: 300 TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359 Query: 1359 ETVRS---PKIALD------TNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVSMGS 1511 E V S P + +D + K + E+ + + + +KHR ++ GS Sbjct: 360 EPVMSHSQPTLPMDVSNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 419 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV +K ++DCEWW +DK KE G N+W FFP+LQ GVS Sbjct: 420 SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 379 bits (974), Expect = e-120 Identities = 227/471 (48%), Positives = 262/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPN SAT SDPPSAT Sbjct: 61 PEPAAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSI 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 
Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRD LLE QISEVASLANSD S +GV++HRVSFEL GED+P+C K Sbjct: 300 TVTPNGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 359 Query: 1359 ETVRS-PKIALDTNQKDLMT---KNEDSCRENN--NAKNVNAIPGFDQ---KHRTVSMGS 1511 E V S + L + +L+ K+ S E + + G DQ KHR ++ GS Sbjct: 360 EPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGS 419 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV +K ++DCEWW +DK KE G N+W FFP+LQ GVS Sbjct: 420 SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum lycopersicum] Length = 470 Score = 379 bits (974), Expect = e-120 Identities = 222/471 (47%), Positives = 263/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPSTVQKRRW SCWSLY C GS+KHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPN SAT SDPPSAT Sbjct: 61 PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 120 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 180 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 240 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 299 Query: 1179 
SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRD LLE+QISEVASLANSD S + V++HRVSFEL ED+P+C K Sbjct: 300 TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 359 Query: 1359 ETVRS---PKIALDTNQ---KDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511 E V S P + +D + ++ + + + + + + G D+ KHR ++ GS Sbjct: 360 EPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 419 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV +K ++DCEWW +DK VKE G N+W FFP+LQ GVS Sbjct: 420 SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 >emb|CDP05166.1| unnamed protein product [Coffea canephora] Length = 452 Score = 376 bits (965), Expect = e-119 Identities = 223/470 (47%), Positives = 260/470 (55%), Gaps = 44/470 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV NS QP TVQKRRW SCWS Y C GS K+SKRIG+AVLV Sbjct: 1 MSSVHNSVETVNAAATAIVTAESRVQPPTVQKRRWGSCWSFYWCFGSVKNSKRIGNAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +PT S P V +N N SAT QSDPPSAT Sbjct: 61 PEPTVPGSAVP-VPDNLNHSATIVIPFIAPPSSPASFLQSDPPSATQSPAKFLPLASFSV 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 +T+SP G A IF IGPYA+ETQLVSPPVFS FTTEPSTASF EV F Sbjct: 120 NTYSPSGAASIFAIGPYAHETQLVSPPVFSAFTTEPSTASFTPPPEPVQLTTPSSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 T+IK SPGSAIS SGTSSPFP+K Sbjct: 180 AQLLVSSLTHNRRHSGTSIKFPLSQYEFQPYQCPGSPGSHLISPGSAISNSGTSSPFPEK 239 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNY-------------KWGSRVGSGSLTPNGGISRLGSGS 1181 R IIEF +GE KF+GYE F WGSR+GSGSLTPNGGISRLGSG+ Sbjct: 240 RPIIEFRIGEAPKFLGYELFTRKWGSRVGSGSLTPNGWGSRLGSGSLTPNGGISRLGSGT 299 Query: 1182 LTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKE 1361 LTPNG EP +RD LLE+QISEVASLANSD + N++G+++HRVSFEL E +P CV +E Sbjct: 300 LTPNGGEPAARDSYLLENQISEVASLANSDNGTHNEEGLMDHRVSFELTAEHVPNCVEEE 359 Query: 1362 
TVRSPKIALDTNQKDLMTKNEDSCRE--NNNAKNV--NAIPGFDQK-----HRTVSMGSS 1514 K ++ C + ++ N+ A+ G + K +RT S+GSS Sbjct: 360 -----------------MKGQNFCEDCTGDSIHNITRKALDGQEGKQCLKNNRTFSLGSS 402 Query: 1515 KDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 KDFNF+N K+E DKST+DCEWW N+ KELG N W FFPMLQ GVS Sbjct: 403 KDFNFDNMKQESPDKSTIDCEWWTNETAAAKELGSKNKWTFFPMLQPGVS 452 >ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2 [Nicotiana sylvestris] Length = 442 Score = 375 bits (964), Expect = e-119 Identities = 219/444 (49%), Positives = 254/444 (57%), Gaps = 47/444 (10%) Frame = +3 Query: 474 VQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXXXXXX 653 +QKRRW SCWSLY C GSYKHSKRIGHAVLV +P P V NPNRSAT Sbjct: 2 MQKRRWGSCWSLYWCFGSYKHSKRIGHAVLVPEPAAPGPAVP-VTENPNRSATIVIPFIA 60 Query: 654 XXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVSPPVF 833 SDPPSAT + +SPGGTA IF IGPYA+ETQLVSPPVF Sbjct: 61 PPSSPASFLPSDPPSATQSPAGLLSLKSFSINAYSPGGTASIFAIGPYAHETQLVSPPVF 120 Query: 834 STFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK--------- 986 STFTTEPSTA+F EV F +N K Sbjct: 121 STFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFV 180 Query: 987 -------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYKWGSRV 1127 SPGS +S SGTSSPFP K IIEF GE KF+GYEHF KWGSRV Sbjct: 181 PYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRV 240 Query: 1128 GSGS--------------LTPNGGISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLAN 1265 GSGS LTPNGGISRLGSG++TPNG EP SRDC LLE+QISEVASLAN Sbjct: 241 GSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDCYLLENQISEVASLAN 300 Query: 1266 SDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQKDLMTKNEDSCRENN 1445 SD S +GV++HRVSFEL GED+P+C KE V S + T D+ + R ++ Sbjct: 301 SDNGSEIAEGVIDHRVSFELTGEDVPSCREKEPVMSH--SQQTLPMDVPAPSNKEMRSSS 358 Query: 1446 N--AKNVNAIP------GFDQ---KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIND 1592 + + + +P G DQ KHR ++ GSSKDF+F+N K EV +K +VDCEWW +D Sbjct: 359 SIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWWTSD 418 Query: 1593 KVVVKELGPHNSWNFFPMLQSGVS 
1664 K KE N+W FFP+LQ GVS Sbjct: 419 KATGKESSIQNNWTFFPVLQPGVS 442 >ref|XP_015070692.1| PREDICTED: uncharacterized protein LOC107015045 isoform X2 [Solanum pennellii] Length = 469 Score = 375 bits (964), Expect = e-119 Identities = 222/471 (47%), Positives = 260/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPSTVQ RRW SCWSLY C GS+KHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPN SAT SDPPSAT Sbjct: 60 PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 118 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 119 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 178 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 179 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 238 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 239 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSG 298 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRD LLE+QISEVASLANSD S +GV++HRVSFEL GED+P+C K Sbjct: 299 TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPSCREK 358 Query: 1359 ETVRS---PKIALD------TNQKDLMTKNEDSCRENNNAKNVNAIPGFDQKHRTVSMGS 1511 E V S P + +D + K + E+ + + + +KHR ++ GS Sbjct: 359 EPVMSHSQPTLPMDVSNLLASEMKSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 418 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV +K ++DCEWW +DK KE G N+W FFP+LQ GVS Sbjct: 419 SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 469 >ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum lycopersicum] Length = 469 Score = 373 bits (957), Expect = e-118 Identities = 
221/471 (46%), Positives = 262/471 (55%), Gaps = 45/471 (9%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 MSSV+N+ QPSTVQ RRW SCWSLY C GS+KHSKRIGHAVLV Sbjct: 1 MSSVQNTVDTVNAAASAIVNAESRVQPSTVQ-RRWGSCWSLYWCFGSHKHSKRIGHAVLV 59 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P P V NPN SAT SDPPSAT Sbjct: 60 PEPVAPGPAVP-VTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSI 118 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 + +SPGGTA IF IGPYA+ETQLVSPPVFSTFTTEPSTA+F EV F Sbjct: 119 NAYSPGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPF 178 Query: 927 XXXXXXXXXXXXXXXWTNIK----------------------SPGSAISTSGTSSPFPDK 1040 +N K SPGS +S SGTSSPFP K Sbjct: 179 AQLLTSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGK 238 Query: 1041 RAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGS--------------LTPNGGISRLGSG 1178 IIEF GE KF+GYEHF KWGSRVGSGS LTPNGGISRLGSG Sbjct: 239 CPIIEFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSG 298 Query: 1179 SLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVK 1358 ++TPNG EP SRD LLE+QISEVASLANSD S + V++HRVSFEL ED+P+C K Sbjct: 299 TVTPNGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPSCREK 358 Query: 1359 ETVRS---PKIALDTNQ---KDLMTKNEDSCRENNNAKNVNAIPGFDQ---KHRTVSMGS 1511 E V S P + +D + ++ + + + + + + G D+ KHR ++ GS Sbjct: 359 EPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGS 418 Query: 1512 SKDFNFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 SKDF+F+N K EV +K ++DCEWW +DK VKE G N+W FFP+LQ GVS Sbjct: 419 SKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 469 >ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum lycopersicum] Length = 476 Score = 363 bits (933), Expect = e-114 Identities = 210/440 (47%), Positives = 250/440 (56%), Gaps = 45/440 (10%) Frame = +3 Query: 480 KRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXXXXXXXX 659 +RRW SCWSLY C GS+KHSKRIGHAVLV +P P V NPN SAT Sbjct: 38 
ERRWGSCWSLYWCFGSHKHSKRIGHAVLVPEPVAPGPAVP-VTENPNHSATIVIPFIAPP 96 Query: 660 XXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVSPPVFST 839 SDPPSAT + +SPGGTA IF IGPYA+ETQLVSPPVFST Sbjct: 97 SSPASFLPSDPPSATQSPAGLLSLKALSINAYSPGGTASIFAIGPYAHETQLVSPPVFST 156 Query: 840 FTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----------- 986 FTTEPSTA+F EV F +N K Sbjct: 157 FTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLLTSSLARNRRYSGSNYKFPLSQYEFVPY 216 Query: 987 -----------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYKWGSRVGS 1133 SPGS +S SGTSSPFP K IIEF GE KF+GYEHF KWGSRVGS Sbjct: 217 QDPGSPGSNLISPGSVVSNSGTSSPFPGKCPIIEFRKGEPPKFLGYEHFSTRKWGSRVGS 276 Query: 1134 GS--------------LTPNGGISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSD 1271 GS LTPNGGISRLGSG++TPNG EP SRD LLE+QISEVASLANSD Sbjct: 277 GSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTPNGGEPPSRDSYLLENQISEVASLANSD 336 Query: 1272 EESRNDDGVVEHRVSFELIGEDIPTCVVKETVRS---PKIALDTNQ---KDLMTKNEDSC 1433 S + V++HRVSFEL ED+P+C KE V S P + +D + ++ + + + Sbjct: 337 NGSEIGEAVIDHRVSFELTEEDVPSCREKEPVMSHSQPTLPMDVSNLLASEMRSGSSMAE 396 Query: 1434 RENNNAKNVNAIPGFDQ---KHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVV 1604 + + + G D+ KHR ++ GSSKDF+F+N K EV +K ++DCEWW +DK V Sbjct: 397 EKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAV 456 Query: 1605 KELGPHNSWNFFPMLQSGVS 1664 KE G N+W FFP+LQ GVS Sbjct: 457 KESGIQNNWTFFPVLQPGVS 476 >ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] gi|508776010|gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 326 bits (836), Expect = e-100 Identities = 204/463 (44%), Positives = 248/463 (53%), Gaps = 62/463 (13%) Frame = +3 Query: 462 QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638 QP+TVQK+RW SCW LY C GS K+SKRIGHAVLV +P P SV+ N + Sbjct: 26 QPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVS--TAENVSNPTGII 83 Query: 639 XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818 QSDPPSAT + +SP G A IF IGPYA+ETQLV Sbjct: 84 
LPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHETQLV 143 Query: 819 SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986 +PPVFS TTEPSTA F EV F N K Sbjct: 144 TPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLS 203 Query: 987 -------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109 SPGSAIS SGTSSPFPD+R I+EF +GE K +G+E+F Sbjct: 204 HYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFENFTTR 263 Query: 1110 KWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLES 1235 KWGSR+GSGSLTP+G G+ SRLGSGSLTP+GL P SRD L+ S Sbjct: 264 KWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGFLVGS 323 Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQKDLMT 1415 QISEVA LAN +ND+ +V+HRVSFEL GED+ C+ +++ P A+ KDL+ Sbjct: 324 QISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL-LPSRAVSEYPKDLVA 382 Query: 1416 KN-----------EDSC----RENNNAKNVNAIPGFD-----QKHRTVSMGSSKDFNFNN 1535 + E SC RE +N A + QKHR+V++GS K+FNF+N Sbjct: 383 EGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDN 442 Query: 1536 SKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 +K E +DK T+ EWW N+KV KE P NSW FFPMLQ VS Sbjct: 443 TKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485 >gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum] Length = 465 Score = 320 bits (821), Expect = 5e-98 Identities = 196/446 (43%), Positives = 243/446 (54%), Gaps = 45/446 (10%) Frame = +3 Query: 462 QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638 QP+TVQK+RW SCWS Y C GS+K SKRIGHAVLV +P P SV+ N + Sbjct: 26 QPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGASVS--TAENASNPTGIV 83 Query: 639 XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818 QSDPPSAT + +SP G A IF+IGPYA+ETQLV Sbjct: 84 MPFIAPPSSPASFLQSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFSIGPYAHETQLV 143 Query: 819 SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986 +PPVFS TTEPSTA F EV F N K Sbjct: 144 TPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLS 203 Query: 987 
-------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109 SPGS IS SGTSSPFPD+R I+EF +GE K +G+EHF Sbjct: 204 HYEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTR 263 Query: 1110 KWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLES 1235 KWGSR+GSGSLTP+G G+ SRLGSGSLTP+GL P SRD +ES Sbjct: 264 KWGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIES 323 Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLM 1412 Q SEVA L+N +ND+ +V+HRVSFEL GED+ C+ +++ S + D KDL+ Sbjct: 324 QNSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPKDLV 383 Query: 1413 TKN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWI 1586 + E + + A+ + QKHR+V++GS K+FNF+N K E ++K TV EWW Sbjct: 384 AQGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWA 439 Query: 1587 NDKVVVKELGPHNSWNFFPMLQSGVS 1664 N+KV KE P N+W FFPMLQ VS Sbjct: 440 NEKVAGKEARPGNNWTFFPMLQPEVS 465 >ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] gi|508776011|gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 320 bits (821), Expect = 9e-98 Identities = 204/467 (43%), Positives = 248/467 (53%), Gaps = 66/467 (14%) Frame = +3 Query: 462 QPSTVQ----KRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRS 626 QP+TVQ K+RW SCW LY C GS K+SKRIGHAVLV +P P SV+ N + Sbjct: 26 QPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEPVVPGASVS--TAENVSNP 83 Query: 627 ATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYE 806 QSDPPSAT + +SP G A IF IGPYA+E Sbjct: 84 TGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSPRGPASIFAIGPYAHE 143 Query: 807 TQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK 986 TQLV+PPVFS TTEPSTA F EV F N K Sbjct: 144 TQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQK 203 Query: 987 -----------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEH 1097 SPGSAIS SGTSSPFPD+R I+EF +GE K +G+E+ Sbjct: 204 FGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPILEFRMGEAPKLLGFEN 263 Query: 1098 
FLNYKWGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCN 1223 F KWGSR+GSGSLTP+G G+ SRLGSGSLTP+GL P SRD Sbjct: 264 FTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGSGSLTPDGLGPASRDGF 323 Query: 1224 LLESQISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTNQK 1403 L+ SQISEVA LAN +ND+ +V+HRVSFEL GED+ C+ +++ P A+ K Sbjct: 324 LVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLESKSL-LPSRAVSEYPK 382 Query: 1404 DLMTKN-----------EDSC----RENNNAKNVNAIPGFD-----QKHRTVSMGSSKDF 1523 DL+ + E SC RE +N A + QKHR+V++GS K+F Sbjct: 383 DLVAEGRKERDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEF 442 Query: 1524 NFNNSKEEVTDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGVS 1664 NF+N+K E +DK T+ EWW N+KV KE P NSW FFPMLQ VS Sbjct: 443 NFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489 >ref|XP_012090213.1| PREDICTED: uncharacterized protein LOC105648441 [Jatropha curcas] gi|643706116|gb|KDP22248.1| hypothetical protein JCGZ_26079 [Jatropha curcas] Length = 498 Score = 320 bits (819), Expect = 2e-97 Identities = 194/500 (38%), Positives = 251/500 (50%), Gaps = 74/500 (14%) Frame = +3 Query: 387 MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566 M SV NS QP+ VQKRRW CWSLY C GS+K+SKRIGHAVLV Sbjct: 1 MRSVNNSVETINAAATAIISAESRVQPTVVQKRRWGGCWSLYWCFGSHKNSKRIGHAVLV 60 Query: 567 SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746 +P +V +N + +A QSDPPS T Sbjct: 61 PEPEVPQAVVTSAENQTHSTAAAVPFIAPPSSPASFL-QSDPPSVTQSPAGLLSLTALSV 119 Query: 747 HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926 +SPGG A IF IGPYA+ETQLV+PPVFS FTTEPSTA F EV F Sbjct: 120 SAYSPGGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179 Query: 927 XXXXXXXXXXXXXXXWTNIK-----------------------SPGSAISTSGTSSPFPD 1037 N K SPGS IS SGTSSPFPD Sbjct: 180 AQLLTSSLERARRNSGANQKFALSHYEFQSYPLYPGSPGGQLISPGSIISNSGTSSPFPD 239 Query: 1038 KRAIIEFCVGEGSKFIGYEHFLNYKWGSRVGSGSLTPNG---------------GI---- 1160 + ++EF +GE K +G+EHF KWGSR+GSG+LTP+G G+ Sbjct: 240 RHPLLEFRMGEAPKLLGFEHFTTRKWGSRLGSGTLTPDGVGLGSRLCSGTATPDGVGLGS 299 Query: 1161 
---------------SRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDG 1295
            SRLGSGSLTP+ + P S+D LLE+QISEVASLANS+ S+ND+
Sbjct: 300  RLGSGSVTPDGVGLRSRLGSGSLTPDCVVPASQDGLLLENQISEVASLANSENASKNDEN 359

Query: 1296 VVEHRVSFELIGEDIPTCVVKETVRSPKIALD----------TNQKDLMTKNEDSCRENN 1445
            +V+HRVSFEL GE++ C+ +++ S + + N ++++ + D
Sbjct: 360  IVDHRVSFELSGEEVARCLESKSMTSSRTFSECPQDSMAEEQINSEEILINSNDCLHIGE 419

Query: 1446 NAKNVNAIPGFD-------QKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWINDKVVV 1604
            + P + +KHR++++GS K+FNF+NSK EV DK T+ EWW N+ +
Sbjct: 420  TSNETPEKPSGETEEEPCYRKHRSITLGSIKEFNFDNSK-EVPDKPTISSEWWANETIAG 478

Query: 1605 KELGPHNSWNFFPMLQSGVS 1664
            KE P N+W FFP+LQ VS
Sbjct: 479  KEARPANNWTFFPLLQPEVS 498

>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium raimondii]
 gi|763785675|gb|KJB52746.1| hypothetical protein B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score = 318 bits (816), Expect = 2e-97
 Identities = 193/445 (43%), Positives = 241/445 (54%), Gaps = 44/445 (9%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXX 641
            QP+TVQK+RW SCWS Y C GS+K SKRIGHAVLV +P ++ +N N +
Sbjct: 26   QPTTVQKKRWGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGIVMP 85

Query: 642  XXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVS 821
            QSDPPSAT + +SP G A IF IGPYA+ETQLV+
Sbjct: 86   FIAPPSSPASFL-QSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQLVT 144

Query: 822  PPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----- 986
            PPVFS TTEPSTA F EV F N K
Sbjct: 145  PPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSH 204

Query: 987  ------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYK 1112
            SPGS IS SGTSSPFPD+R I+EF +GE K +G+EHF K
Sbjct: 205  YEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTRK 264

Query: 1113 WGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLESQ 1238
            WGSR+GSGSLTP+G G+ SRLGSGSLTP+GL P SRD +ESQ
Sbjct: 265  WGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIESQ 324

Query: 1239 ISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLMT 1415
            SEVA L+N +ND+ +V+HRVSFEL GED+ C+ +++ S + D DL+
Sbjct: 325  NSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDLVA 384

Query: 1416 KN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIN 1589
            + E + + A+ + QKHR+V++GS K+FNF+N K E ++K TV EWW N
Sbjct: 385  QGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWAN 440

Query: 1590 DKVVVKELGPHNSWNFFPMLQSGVS 1664
            +KV KE P N+W FFPMLQ VS
Sbjct: 441  EKVAGKEARPGNNWTFFPMLQPEVS 465

>ref|XP_015888763.1| PREDICTED: uncharacterized protein LOC107423668 [Ziziphus jujuba]
          Length = 501

 Score = 317 bits (811), Expect = 4e-96
 Identities = 201/506 (39%), Positives = 248/506 (49%), Gaps = 80/506 (15%)
 Frame = +3

Query: 387  MSSVRNSXXXXXXXXXXXXXXXXXXQPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLV 566
            M SV NS QP+ V KRRW SCWSLY C GS+K++KRI HAVLV
Sbjct: 1    MRSVNNSVETINAAASAIVSAETRAQPTAVPKRRWGSCWSLYWCFGSHKNTKRISHAVLV 60

Query: 567  SQPTPQVSVAPFVDNNPNRSATXXXXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXX 746
            + + P +N +A QSDPPSAT
Sbjct: 61   PEQVVPGAAVPAAENQIPSTAVVLPFIAPPSSPASFL-QSDPPSATQSPAGLLSLTSLSV 119

Query: 747  HTFSPGGTAPIFTIGPYAYETQLVSPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSF 926
            + +SPGG A IF IGPYAYETQLVSPPVFSTFTTEPSTA F EV F
Sbjct: 120  NAYSPGGPASIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPF 179

Query: 927  XXXXXXXXXXXXXXXWTNIK-----------------------SPGSAISTSGTSSPFPD 1037
            TN K SPGS IS SGTSSPFPD
Sbjct: 180  AQLLTSSLDRTRRNNGTNQKFALSHCEFQPYQPYPGSPGGQLISPGSVISNSGTSSPFPD 239

Query: 1038 KRAIIEFCVGEGSKFIGYEHFLNYKW--------------------------------GS 1121
            + I+EF +GE + +G+EHF KW GS
Sbjct: 240  RHPILEFRMGEAPRLLGFEHFTTRKWGSRLGSGSITPDGLGLGSRLGSGCLTPDGNGLGS 299

Query: 1122 RVGSGSLTPNGG--ISRLGSGSLTPNGLEPLSRDCNLLESQISEVASLANSDEESRNDDG 1295
            R+GSGSLTPNG SRLGSG LTP+G+ P S D +E+QISEVASLANS+ + D
Sbjct: 300  RIGSGSLTPNGAGLASRLGSGCLTPDGVGPASGDSFPMENQISEVASLANSESGCQLDGN 359

Query: 1296 VVEHRVSFELIGEDIPTCVVKETVRSPKIALDT---------NQKDLMTKN-----EDSC 1433
            V+ HRVSFEL GED+ C+ +++ S + A D +KD M +SC
Sbjct: 360  VINHRVSFELTGEDVARCLANKSMASVRTASDPLKDTPSECGVKKDRMISTGTDHFSESC 419

Query: 1434 RENNNAKNVNAIPGFD---------QKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWI 1586
            E + + +P D +KHR++++GS K+FNF+++K E +DK T EWW
Sbjct: 420  VEETSVE----LPENDHGEWEDQCYRKHRSITLGSIKEFNFDSTKSEFSDKPTNGSEWWA 475

Query: 1587 NDKVVVKELGPHNSWNFFPMLQSGVS 1664
            N+KV KE P N W FFP+LQ GVS
Sbjct: 476  NEKVAGKESKPGNGWTFFPILQPGVS 501

>gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium raimondii]
          Length = 464

 Score = 313 bits (802), Expect = 3e-95
 Identities = 193/445 (43%), Positives = 240/445 (53%), Gaps = 44/445 (9%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPTPQVSVAPFVDNNPNRSATXXX 641
            QP+TVQKR W SCWS Y C GS+K SKRIGHAVLV +P ++ +N N +
Sbjct: 26   QPTTVQKR-WGSCWSFYWCFGSHKSSKRIGHAVLVPEPVVPGALVSTAENASNPTGIVMP 84

Query: 642  XXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLVS 821
            QSDPPSAT + +SP G A IF IGPYA+ETQLV+
Sbjct: 85   FIAPPSSPASFL-QSDPPSATQSPAGLLSLTALSVNAYSPRGPASIFAIGPYAHETQLVT 143

Query: 822  PPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK----- 986
            PPVFS TTEPSTA F EV F N K
Sbjct: 144  PPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLTSSLERARRNSGINQKFGLSH 203

Query: 987  ------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNYK 1112
            SPGS IS SGTSSPFPD+R I+EF +GE K +G+EHF K
Sbjct: 204  YEFQSYQIYPGSPGGNLISPGSVISNSGTSSPFPDRRPILEFRMGEAPKTLGFEHFTTRK 263

Query: 1113 WGSRVGSGSLTPNG-----------------GI-SRLGSGSLTPNGLEPLSRDCNLLESQ 1238
            WGSR+GSGSLTP+G G+ SRLGSGSLTP+GL P SRD +ESQ
Sbjct: 264  WGSRLGSGSLTPDGLGQGSRLGSECVTPDGMGLGSRLGSGSLTPDGLGPASRDGFPIESQ 323

Query: 1239 ISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVRSPKIALDTN-QKDLMT 1415
            SEVA L+N +ND+ +V+HRVSFEL GED+ C+ +++ S + D DL+
Sbjct: 324  NSEVALLSNPPNGPKNDEIIVDHRVSFELSGEDVARCLKNKSLVSSRTMPDYEYPNDLVA 383

Query: 1416 KN--EDSCRENNNAKNVNAIPGFDQKHRTVSMGSSKDFNFNNSKEEVTDKSTVDCEWWIN 1589
            + E + + A+ + QKHR+V++GS K+FNF+N K E ++K TV EWW N
Sbjct: 384  QGRIEKDEKVSGEAEEDHCY----QKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWAN 439

Query: 1590 DKVVVKELGPHNSWNFFPMLQSGVS 1664
            +KV KE P N+W FFPMLQ VS
Sbjct: 440  EKVAGKEARPGNNWTFFPMLQPEVS 464

>ref|XP_009368760.1| PREDICTED: uncharacterized protein LOC103958237 isoform X2 [Pyrus x bretschneideri]
          Length = 478

 Score = 313 bits (803), Expect = 3e-95
 Identities = 189/457 (41%), Positives = 246/457 (53%), Gaps = 57/457 (12%)
 Frame = +3

Query: 462  QPSTVQKRRWASCWSLYSCNGSYKHSKRIGHAVLVSQPT-PQVSVAPFVDNNPNRSATXX 638
            QP+ + KRRW SCWSLY C GS+K SKRIGHAVLV +P P +V+ +N S
Sbjct: 23   QPTNISKRRWGSCWSLYWCFGSHKTSKRIGHAVLVPEPVVPGAAVS--TSDNQTTSTAIV 80

Query: 639  XXXXXXXXXXXXXXQSDPPSATHXXXXXXXXXXXXXHTFSPGGTAPIFTIGPYAYETQLV 818
            SDPPSA+ + +S G A +F+IGPYAYETQLV
Sbjct: 81   LPFIAPPSSPASFLPSDPPSASQSPAGFLSLTSLSVNAYSSGEPASMFSIGPYAYETQLV 140

Query: 819  SPPVFSTFTTEPSTASFXXXXXXXXXXXXXXXEVSFXXXXXXXXXXXXXXXWTNIK---- 986
            SPPVFSTF TEPSTA F EV F N K
Sbjct: 141  SPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLSSSLDRQRRNSSNNQKFPLS 200

Query: 987  -------------------SPGSAISTSGTSSPFPDKRAIIEFCVGEGSKFIGYEHFLNY 1109
            SPGSAIS SGTSSPFPD+ ++EF +GEG K G++HF N+
Sbjct: 201  QYEYQPYQQYPGSPGGDLISPGSAISNSGTSSPFPDRHPMLEFRMGEGPKLYGFDHFTNH 260

Query: 1110 KWGSRVGSGSLTPNG---------------GI---SRLGSGSLTPNGLEPLSRDCNLLES 1235
            KWGSR+GSG+LTP+G G+ SR+ SG LTP+G P SRD +E+
Sbjct: 261  KWGSRLGSGTLTPDGYELGSRLGSGCLTPNGVGVGSRMSSGCLTPDGTGPASRDGFHMEN 320

Query: 1236 QISEVASLANSDEESRNDDGVVEHRVSFELIGEDIPTCVVKETVR-----SPKIALDTN- 1397
            QISEVASLAN++ N + +HRVSFEL GED+ C+ + +R S IA + +
Sbjct: 321  QISEVASLANTESGCHNGGTIFDHRVSFELTGEDVACCLANKALRTATESSNDIAAENSI 380

Query: 1398 QKDLMTKNEDSCRENNNAKNVNAIP------GFDQ---KHRTVSMGSSKDFNFNNSKEEV 1550
            + D + + ++ RE N ++++ IP G DQ K R++++GS+K+FNF+++K EV
Sbjct: 381  ETDGLLTDSNNHREFNVEESLSRIPENASGEGEDQGYRKQRSITLGSTKEFNFDHTKAEV 440

Query: 1551 TDKSTVDCEWWINDKVVVKELGPHNSWNFFPMLQSGV 1661
            KS + EWW N V KE P N W FFP+LQ GV
Sbjct: 441  PSKSNIGSEWWANKNVAAKESKPCNDWTFFPILQPGV 477