BLASTX nr result
ID: Achyranthes22_contig00006230
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00006230
         (1991 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264...  407   e-111
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...  396   e-107
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...  389   e-105
ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr...  389   e-105
gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i...  385   e-104
ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-...  385   e-104
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...  381   e-103
ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr...  381   e-103
gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i...  380   e-102
gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe...  374   e-101
ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210...  374   e-101
ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225...  374   e-100
gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis]    370   e-100
gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus...  352   4e-94
gb|AFK46430.1| unknown [Medicago truncatula]                         350   2e-93
ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494...  348   4e-93
ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806...  337   1e-89
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...  337   1e-89
ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798...
335 6e-89 ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818... 331 6e-88 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 407 bits (1046), Expect = e-111 Identities = 246/454 (54%), Positives = 282/454 (62%), Gaps = 30/454 (6%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 VNNSV+T+N AIV+AESRVQP +VQKRRWGSC +L WCFGS + SKRI HA LVPEP Sbjct: 4 VNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 V + + ++N N STSIVLPFIAPPSSP SFLQSDPP++T SP G L++T LSVNA S Sbjct: 64 MVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 1235 SG A +F IGPYAHETQLVSPPVFS F TEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYE-----------VHHLISPGSVNSTSGTSSPYLDKRSV 1088 N KL+LS YE V HLISP S SGTSSP+ D+R + Sbjct: 184 TSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPI 240 Query: 1087 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSAGYVPYDSIPLENQISEVVSLANS 908 V+A KL + FST +WGSRLGSGSL PD AG DS LENQISEV SLANS Sbjct: 241 -----VEAPKLLGFEHFSTRRWGSRLGSGSLTPD-GAGPASRDSFLLENQISEVASLANS 294 Query: 907 ETGSPIAEALIDHRVSFELLQEDI-------PTALVGGCQGT------RDPYTRIDDVVP 767 E+GS E +IDHRVSFEL ED+ P A Q T R D + Sbjct: 295 ESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGIS 354 Query: 766 HKVKNC--FLCSQXXXXXXXXXXXXXXEDCLHKK-SSVSLGSAKEFNFDSSTGETIGKSN 596 +NC F + E+ HKK + GS KEFNFD++ GE K N Sbjct: 355 ESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPN 414 Query: 595 -LSSEWWADEK-IGKSIDAQKNWTFFPLLQPGVS 500 + SEWW +EK +GK Q NWTFFPLLQPG+S Sbjct: 415 IIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 396 bits (1018), Expect = e-107 Identities = 
230/472 (48%), Positives = 274/472 (58%), Gaps = 48/472 (10%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 V N+VDTVN AIV AESRVQP +VQKRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 VQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 V +NPN S +IV+PFIAPPSSP SFL SDPP+AT SP GLL++ LS+NA S Sbjct: 64 VAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 1235 G A IF IGPYAHETQLVSPPVFS FTTEPSTA TPP EPV TTP SPEVPFAQ Sbjct: 124 PGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEV----------HHLISPGSVNSTSGTSSPYLDKRSVL 1085 N+K LS YE +LISPGSV S SGTSSP+ K ++ Sbjct: 184 TSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPII 243 Query: 1084 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDY-------------------------- 983 EFR + K + FST KWGSR+GSGS+ P Sbjct: 244 EFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303 Query: 982 SAGYVPY-DSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQG 806 + G P DS LENQISEV SLANS+ GS I EA+IDHRVSFEL +ED+P+ C+ Sbjct: 304 NGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPS-----CRE 358 Query: 805 TRDPYTRIDDVVPHKVKNCF---------LCSQXXXXXXXXXXXXXXEDCLHKKSSVSLG 653 + +P V N + + ++C K +++ G Sbjct: 359 KEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFG 418 Query: 652 SAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KSIDAQKNWTFFPLLQPGVS 500 S+K+F+FD+ E + K ++ EWW +K K Q NWTFFP+LQPGVS Sbjct: 419 SSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 389 bits (1000), Expect = e-105 Identities = 229/472 (48%), Positives = 271/472 (57%), Gaps = 48/472 (10%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 V N+VDTVN AIV AESRVQP +VQKRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 
VQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 V +NPN S +IV+PFIAPPSSP SFL SDPP+AT SP GLL++ LS+NA S Sbjct: 64 AAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 1235 G A IF IGPYAHETQLVSPPVFS FTTEPSTA TPP E V TTP SPEVPFAQ Sbjct: 124 PGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEV----------HHLISPGSVNSTSGTSSPYLDKRSVL 1085 N+K LS YE +LISPGSV S SGTSSP+ K ++ Sbjct: 184 TSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPII 243 Query: 1084 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDY-------------------------- 983 EFR + K + FST KWGSR+GSGSL P Sbjct: 244 EFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303 Query: 982 SAGYVPY-DSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQG 806 + G P DS LE QISEV SLANS+ GS I E +IDHRVSFEL ED+P+ C+ Sbjct: 304 NGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPS-----CRE 358 Query: 805 TRDPYTRIDDVVPHKVKNCF---------LCSQXXXXXXXXXXXXXXEDCLHKKSSVSLG 653 + +P V N + + + C K +++ G Sbjct: 359 KEPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFG 418 Query: 652 SAKEFNFDSSTGETIGKSNLSSEWW-ADEKIGKSIDAQKNWTFFPLLQPGVS 500 S+K+F+FD+ E + K ++ EWW +D+ GK Q NWTFFP+LQPGVS Sbjct: 419 SSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 389 bits (1000), Expect = e-105 Identities = 236/497 (47%), Positives = 284/497 (57%), Gaps = 73/497 (14%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 V++SV+TVN AIV+AESR++P ++QKRRWGSCW+L WCFGS KTSKRISHA LVPEP Sbjct: 4 VHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 
1412 VT + + A+ ST+IVLPFIAPPSSP SFLQSDPP+AT SP GLL++ LSVNA S Sbjct: 64 MVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVNAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 1235 G A +F IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 1088 N KL+LS Y LISPGSV S SGTSSP+ D+ + Sbjct: 184 TSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPI 243 Query: 1087 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 947 L+F A KL + F+T KWGSRLGSGS+ PD +G + D + L Sbjct: 244 LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303 Query: 946 ----------------------------------ENQISEVVSLANSETGSPIAEALIDH 869 ENQISEV SLANS+ G+ E +IDH Sbjct: 304 GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363 Query: 868 RVSFELLQEDIPTALVG-GCQGTRDPYTRIDDVVPH----------KVKNCF-LCSQXXX 725 RVSFEL E++ L R D+VP +N F LC + Sbjct: 364 RVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEESS 423 Query: 724 XXXXXXXXXXXED--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSI 551 E+ C K S++LGS KEFNFD++ GE K +++SEWWA+E +GK Sbjct: 424 NRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKES 483 Query: 550 DAQKNWTFFPLLQPGVS 500 NWTFFP+LQ S Sbjct: 484 KPSNNWTFFPMLQSEAS 500 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 385 bits (990), Expect = e-104 Identities = 234/486 (48%), Positives = 277/486 (56%), Gaps = 62/486 (12%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 VN+SV+TVN AIV+A+SRVQP +VQK+RWGSCW L WCFGS K SKRI HA LVPEP Sbjct: 4 VNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 V ++ + A+N + T I+LPFIAPPSSP SFLQSDPP+AT SP GLL++T LSVNA S Sbjct: 64 
VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 1235 G A IF IGPYAHETQLV+PPVFSA TTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 1088 N K LS YE +LISPGS S SGTSSP+ D+R + Sbjct: 184 TSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI 243 Query: 1087 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 947 LEFRM +A KL + F+T KWGSRLGSGSL PD +G V D + L Sbjct: 244 LEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGS 303 Query: 946 ------------------ENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALV 821 +QISEV LAN G E ++DHRVSFEL ED+ L Sbjct: 304 GSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLE 363 Query: 820 GG----------------CQGTRDPYTRIDDVVPHKVKNC--FLCSQXXXXXXXXXXXXX 695 +G ++ D + +C F+ Sbjct: 364 SKSLLPSRAVSEYPKDLVAEGRKER----DGIKKDLESSCELFIRETSNETVEKASGEAE 419 Query: 694 XEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQKNWTFFPL 518 E K SV+LGS KEFNFD++ GE K + SEWWA+EK+ GK +WTFFP+ Sbjct: 420 EEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPM 479 Query: 517 LQPGVS 500 LQP VS Sbjct: 480 LQPEVS 485 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 385 bits (989), Expect = e-104 Identities = 234/497 (47%), Positives = 283/497 (56%), Gaps = 73/497 (14%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 V++SV+TVN AIV+AESR++P ++QKRRWGSCW+L WCFGS KTSKRISHA L+PEP Sbjct: 4 VHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLLPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 VT + + A+ ST+IVLPFIAPPSSP SFLQSDP +AT SP GLL++ LSVNA S Sbjct: 64 MVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVNAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 1235 G A +F 
IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 1088 N KL+LS Y LISPGSV S SGTSSP+ D+ + Sbjct: 184 TSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPI 243 Query: 1087 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 947 L+F A KL + F+T KWGSRLGSGS+ PD +G + D + L Sbjct: 244 LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303 Query: 946 ----------------------------------ENQISEVVSLANSETGSPIAEALIDH 869 ENQISEV SLANS+ G+ E +IDH Sbjct: 304 GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363 Query: 868 RVSFELLQEDIPTALVG-GCQGTRDPYTRIDDVVPH----------KVKNCF-LCSQXXX 725 RVSFEL E++ L R D+VP +N F LC + Sbjct: 364 RVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEESS 423 Query: 724 XXXXXXXXXXXED--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSI 551 E+ C K S++LGS KEFNFD++ GE K +++SEWWA+E +GK Sbjct: 424 NRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKES 483 Query: 550 DAQKNWTFFPLLQPGVS 500 NWTFFP+LQ S Sbjct: 484 KPSNNWTFFPMLQSEAS 500 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 381 bits (978), Expect = e-103 Identities = 238/494 (48%), Positives = 279/494 (56%), Gaps = 71/494 (14%) Frame = -2 Query: 1768 NNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEPT 1589 N+SVDT+N AIV+AESRVQP +VQKRRWG CW+L WCFGS KT KRI HA L PEP Sbjct: 19 NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFGSHKT-KRIGHAVLAPEPE 77 Query: 1588 VTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALSS 1409 V + A+N + ST+I +PFIAPPSSP SFLQSDPP+AT SP GLL++T LSVNA S Sbjct: 78 VQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSP 137 Query: 1408 SGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXXX 1232 G A IF IGPYAHETQLV+PP FSAFTTEPSTA TPP E VQ TTPSSPEVPFAQ Sbjct: 138 
GGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLT 197 Query: 1231 XXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSVL 1085 N K ALS YE LISPGSV S SGTSSP+ D+ +L Sbjct: 198 SSLERARRNSGTNQKFALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPIL 257 Query: 1084 EFRMVDASKLFDSKKFSTCKWGSRLGS----------------GSLIPD----------- 986 EFRM +A KL + F+T KWGSRLGS G++ PD Sbjct: 258 EFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSG 317 Query: 985 --------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALIDHR 866 + G D LENQISEV SLANSE GS E ++DHR Sbjct: 318 TVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHR 377 Query: 865 VSFELLQEDIPTAL----VGGCQGTRD--PYTRIDDVVPH-----KVKNCFLCSQXXXXX 719 VSFEL E++ L + C+ + P + +D + +N Sbjct: 378 VSFELSGEEVARCLESKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETP 437 Query: 718 XXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQ 542 E C K S++LGS KEFNFD+S E K +++SEWWA+E I GK Sbjct: 438 EKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEARPA 496 Query: 541 KNWTFFPLLQPGVS 500 NWTFFPLLQP VS Sbjct: 497 NNWTFFPLLQPEVS 510 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 381 bits (978), Expect = e-103 Identities = 244/502 (48%), Positives = 284/502 (56%), Gaps = 79/502 (15%) Frame = -2 Query: 1768 NNSVDTVNXXXXAIVTAESRVQPVS--VQKRRWGSCWNLSWCFGSL---KTSKRISHAAL 1604 N+S++TVN AIV+AESRVQP S VQKRRWG CW+L WCFGS K SKRI HA L Sbjct: 6 NSSIETVNAAATAIVSAESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVL 65 Query: 1603 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 1424 VPEP V + S+ +N ST I+LPFIAPPSSP SFLQSDPP++T SP GLL++T LS Sbjct: 66 VPEPEVPGAVSSSTENQTQSTPILLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSA 125 Query: 1423 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 1247 NA S G A IF IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPF Sbjct: 126 NAYSPRGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPF 185 
Query: 1246 AQXXXXXXXXXXXXXXXNHKLALSCYEV--HHL---------ISPGSVNSTSGTSSPYLD 1100 AQ N K +LS YE +HL ISPGS S SGTSSP+ D Sbjct: 186 AQLLTSSLERARRNSGPNQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPD 245 Query: 1099 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS-------------------- 980 + +LEFRM +A KL + FST KWGSRLGSGSL PD + Sbjct: 246 RHPMLEFRMGEAPKLLGFEHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMG 305 Query: 979 -------------AG--------------YVPYDSIP--LENQISEVVSLANSETGSPIA 887 AG +VP I LENQISEV SL NSE GS Sbjct: 306 LSRLCSGTATPDGAGLRSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTE 365 Query: 886 EALIDHRVSFELLQEDIPTAL-VGGCQGTRDPYTRIDDVVPH----------KVKNCFLC 740 E ++ HRVSFEL E++ L + TR D +P + C Sbjct: 366 ENVVHHRVSFELSGEEVARCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQN 425 Query: 739 SQXXXXXXXXXXXXXXEDCLHKK-SSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI 563 + ED +++K S++LGS KEFNFD+S GE K +SSEWWA+E I Sbjct: 426 GEASSEMPEKNSEETEEDHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANETI 485 Query: 562 -GKSIDAQKNWTFFPLLQPGVS 500 GK +WTFFPLLQP VS Sbjct: 486 AGKEARPANSWTFFPLLQPEVS 507 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 380 bits (975), Expect = e-102 Identities = 234/490 (47%), Positives = 277/490 (56%), Gaps = 66/490 (13%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQ----KRRWGSCWNLSWCFGSLKTSKRISHAAL 1604 VN+SV+TVN AIV+A+SRVQP +VQ K+RWGSCW L WCFGS K SKRI HA L Sbjct: 4 VNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVL 63 Query: 1603 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 1424 VPEP V ++ + A+N + T I+LPFIAPPSSP SFLQSDPP+AT SP GLL++T LSV Sbjct: 64 VPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSV 123 Query: 1423 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 1247 NA S G A IF IGPYAHETQLV+PPVFSA TTEPSTA T PPE VQ TTPSSPEVPF Sbjct: 124 NAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPF 183 Query: 1246 AQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 1100 AQ N K LS YE +LISPGS S 
SGTSSP+ D Sbjct: 184 AQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPD 243 Query: 1099 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL-- 947 +R +LEFRM +A KL + F+T KWGSRLGSGSL PD +G V D + L Sbjct: 244 RRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGS 303 Query: 946 ----------------------ENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIP 833 +QISEV LAN G E ++DHRVSFEL ED+ Sbjct: 304 RLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVA 363 Query: 832 TALVGG----------------CQGTRDPYTRIDDVVPHKVKNC--FLCSQXXXXXXXXX 707 L +G ++ D + +C F+ Sbjct: 364 PCLESKSLLPSRAVSEYPKDLVAEGRKER----DGIKKDLESSCELFIRETSNETVEKAS 419 Query: 706 XXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQKNWT 530 E K SV+LGS KEFNFD++ GE K + SEWWA+EK+ GK +WT Sbjct: 420 GEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWT 479 Query: 529 FFPLLQPGVS 500 FFP+LQP VS Sbjct: 480 FFPMLQPEVS 489 >gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 374 bits (961), Expect = e-101 Identities = 229/496 (46%), Positives = 269/496 (54%), Gaps = 73/496 (14%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 VN+SVDT+N AIV+AE+R QP +V KRRWGSCW+L WCFG K +KRI HA LVPEP Sbjct: 4 VNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLVPEP 62 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 V + + DN ST+IV+PFIAPPSSP SFL SDPP+AT SP G L++ LS NA S Sbjct: 63 VVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSANAYS 122 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 1235 G A IF+IGPYA+ETQLVSPPVFS F TEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 123 PGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 182 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 1088 N K ALS YE +LISPGS S SGTSSP+ D+ V Sbjct: 183 TSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPV 242 Query: 1087 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------------------------- 986 
LEFRM +A KLF F+T KWGSR+GSGSL PD Sbjct: 243 LEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGS 302 Query: 985 ---------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALIDH 869 G DS LENQISEV SLANSE+G E + DH Sbjct: 303 GCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVFDH 362 Query: 868 RVSFELLQEDIPTALV-----------GGCQGTRDPYTRIDDVVPHKVKN-C-FLCSQXX 728 RVSFEL ED+ L G + Y D + N C F + Sbjct: 363 RVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESS 422 Query: 727 XXXXXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSI 551 + K S++LGS K+FNFD++ E K N+ SEWWA++ + K Sbjct: 423 SRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAKES 482 Query: 550 DAQKNWTFFPLLQPGV 503 +WTFFP+LQPGV Sbjct: 483 KPCNDWTFFPILQPGV 498 >ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus] Length = 497 Score = 374 bits (961), Expect = e-101 Identities = 230/498 (46%), Positives = 279/498 (56%), Gaps = 72/498 (14%) Frame = -2 Query: 1777 ARVNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFG--SLKTSKRISHAAL 1604 A +NNSVDTVN AIV+AE+RVQP + KRRWGSCW+L WCFG S K++KRI HA L Sbjct: 2 ASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVL 61 Query: 1603 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 1424 VPEP V + + ++ PST++VLPFIAPPSSP SFLQS+P + T SP GLL++T LSV Sbjct: 62 VPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSV 121 Query: 1423 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 1247 N S +G A IF IGPY ++TQLVSPPVFSAFTTEPSTA IT PPE VQ TTPSSPEVPF Sbjct: 122 NNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPF 181 Query: 1246 AQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 1100 A+ N K LS + HLISPGSV S SGTSSP+ D Sbjct: 182 AKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPD 241 Query: 1099 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------- 977 K +LEFRM DA KL + F+T KW SR+GSGSL PD + Sbjct: 242 KHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGS 301 Query: 976 
----------------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEA 881 G+ DS L+NQISEV SLANSETG Sbjct: 302 RLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETG--CQND 359 Query: 880 LIDHRVSFELLQEDIP--------TALVGGCQGTRDPYTRIDDVVPHKVKNCFLCSQXXX 725 + +HRVSFEL ED+ T++ + + T + + C Sbjct: 360 VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDI 419 Query: 724 XXXXXXXXXXXED--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KS 554 ED C + +V+LGS KEFNFD + GE +++ +EWWA+EK+G K Sbjct: 420 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 479 Query: 553 IDAQKNWTFFPLLQPGVS 500 NWTFFPLLQPGVS Sbjct: 480 ASPGNNWTFFPLLQPGVS 497 >ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus] Length = 497 Score = 374 bits (959), Expect = e-100 Identities = 230/498 (46%), Positives = 278/498 (55%), Gaps = 72/498 (14%) Frame = -2 Query: 1777 ARVNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFG--SLKTSKRISHAAL 1604 A +NNSVDTVN AIV+AE+RVQP + KRRWGSCW+L WCFG S K++KRI HA L Sbjct: 2 ASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVL 61 Query: 1603 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 1424 VPEP V + + ++ PST++VLPFIAPPSSP SFLQS+P + T SP GLL+ T LSV Sbjct: 62 VPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSV 121 Query: 1423 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 1247 N S +G A IF IGPY ++TQLVSPPVFSAFTTEPSTA IT PPE VQ TTPSSPEVPF Sbjct: 122 NNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPF 181 Query: 1246 AQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 1100 A+ N K LS + HLISPGSV S SGTSSP+ D Sbjct: 182 AKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPD 241 Query: 1099 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------- 977 K +LEFRM DA KL + F+T KW SR+GSGSL PD + Sbjct: 242 KHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGS 301 Query: 976 ----------------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEA 881 G+ DS L+NQISEV SLANSETG Sbjct: 302 
RLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETG--CQND 359 Query: 880 LIDHRVSFELLQEDIP--------TALVGGCQGTRDPYTRIDDVVPHKVKNCFLCSQXXX 725 + +HRVSFEL ED+ T++ + + T + + C Sbjct: 360 VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDI 419 Query: 724 XXXXXXXXXXXED--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KS 554 ED C + +V+LGS KEFNFD + GE +++ +EWWA+EK+G K Sbjct: 420 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 479 Query: 553 IDAQKNWTFFPLLQPGVS 500 NWTFFPLLQPGVS Sbjct: 480 ASPGNNWTFFPLLQPGVS 497 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 370 bits (951), Expect = e-100 Identities = 234/518 (45%), Positives = 281/518 (54%), Gaps = 94/518 (18%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 1592 VNNSV+T+N AIV+AE+R QP +V KRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 VNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLVPEP 63 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 + + + +N PST+IVLPFIAPPSSP SFLQSDPP+AT SP GLL++T LS+NA S Sbjct: 64 VLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSINAYS 123 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 1235 G IF IGPYA+ETQLVSPPVFS FTTEPSTA TPP E VQ TTPSSPEVPFAQ Sbjct: 124 PGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 1234 XXXXXXXXXXXXXNH-KLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRS 1091 + K +LS E +LISPGSV S SGTSSP+ DK Sbjct: 184 TSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDKHP 243 Query: 1090 VLEFRMVDASKLFDSKKFSTCKWGSRLGS------------------------------- 1004 +L FRM +A +L + F+T KWGSRLGS Sbjct: 244 ILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLGSRLG 303 Query: 1003 -GSLIPD-YSAG------------------------------YVPYDSIPLENQISEVVS 920 GSL PD Y G V DS LENQISEV S Sbjct: 304 SGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVAS 363 Query: 919 LANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQGTR-------------DPYTRID 779 LANS+ G +++DHRVSFEL 
ED+ L + + T+ D Sbjct: 364 LANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECPTKKD 423 Query: 778 DVVPHKVK--NCFLCSQXXXXXXXXXXXXXXED--CLHKKSSVSLGSAKEFNFDSSTGET 611 + + V N C + ED K S++LGS KEFNFD++ + Sbjct: 424 GISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDNTKADV 483 Query: 610 IGKSNLSSEWWADEKI-GKSIDAQKNWTFFPLLQPGVS 500 K + SEWWA+EK+ GK A +W+FFP+LQPGVS Sbjct: 484 SVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521 >gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus vulgaris] Length = 500 Score = 352 bits (902), Expect = 4e-94 Identities = 220/501 (43%), Positives = 278/501 (55%), Gaps = 77/501 (15%) Frame = -2 Query: 1783 MTARVNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKT---SKRISH 1613 M +R N +++TVN AIVTAESRVQP +V K+RWG CW+ WCFGS K+ SKRI H Sbjct: 1 MGSRNNTTLETVNAAATAIVTAESRVQPTTVPKKRWGGCWSQYWCFGSYKSTKSSKRIGH 60 Query: 1612 AALVPEPTV-TESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAIT 1436 A LVPEP T +A A PNPST+IV+PFIAPPSSP S +QSDPP+A SP GLL+++ Sbjct: 61 AVLVPEPVAPTGPAAAAAAPPNPSTAIVMPFIAPPSSPASLIQSDPPSAIQSPPGLLSLS 120 Query: 1435 PLSVNALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSP 1259 L+ +A SS G A +FTIGPYA+ETQLVSPPVFS FTTEPSTA T PPE V TTPSSP Sbjct: 121 SLAASAYSSGGPASMFTIGPYAYETQLVSPPVFSNFTTEPSTAPFTPPPESVHQTTPSSP 180 Query: 1258 EVPFAQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSS 1112 +VPFAQ N K AL Y+ LISPGS STSGTS+ Sbjct: 181 DVPFAQ-LLASSLDRARKSNGNQKFALYNYDFQPYHQYPGSPGGQLISPGSAFSTSGTST 239 Query: 1111 PYLDKRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS---------------- 980 P+ D+ LEFR + K+ + FST +W SRLGSGSL PD + Sbjct: 240 PFPDRPPTLEFRKGETPKILGVEHFSTQRWSSRLGSGSLTPDGAGQGSRLGSGSVTPDGV 299 Query: 979 -------------------------------AGYVPYDSIPLENQISEVVSLANSETGSP 893 G + +++P++NQIS+ +LANS+ G P Sbjct: 300 GLASRLGSGCATPDGLGQESRLGSGCLTPDGVGQINENNLPVQNQISKEATLANSDNGHP 359 Query: 892 IAEALIDHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKN 752 LIDHRVSFEL ED+ L G QG +DP R + V+ + Sbjct: 360 SNATLIDHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILAKDPVDR-ERVLRDTDAS 
418 Query: 751 CFLCSQXXXXXXXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWAD 572 C +C++ E C HK++SV+ S+KEFNFDSS G G SEWW + Sbjct: 419 CNVCTE-KTDDKPYNPIGEGEQCFHKQNSVN--SSKEFNFDSSKGVVSGTGGSGSEWWTN 475 Query: 571 EKI-GKSIDAQKNWTFFPLLQ 512 ++ G+ + +W FFP+LQ Sbjct: 476 RRVAGREGRSANSWAFFPMLQ 496 >gb|AFK46430.1| unknown [Medicago truncatula] Length = 487 Score = 350 bits (897), Expect = 2e-93 Identities = 228/498 (45%), Positives = 277/498 (55%), Gaps = 75/498 (15%) Frame = -2 Query: 1768 NNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSL-KTSKRISHAALVPEP 1592 NNS+DTVN AIV+AESRVQP S K+RWGSC++L CFGS KTS+RI HA LVPEP Sbjct: 6 NNSIDTVNAAATAIVSAESRVQPTSSPKKRWGSCFSLPSCFGSHNKTSERIGHAVLVPEP 65 Query: 1591 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSP-GGLLAITPLSVNAL 1415 V + PNPST+IV+PFIAPPSSP SFLQSDPP++THSP GLL+++ LS NA Sbjct: 66 -VAPTVPVANAAPNPSTAIVIPFIAPPSSPASFLQSDPPSSTHSPAAGLLSLSSLSANAY 124 Query: 1414 SSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQX 1238 S+SG A +FTIGPYA+ETQLVSPPVFS FT EPSTA T PPE V TTPSSPEVPFAQ Sbjct: 125 STSGPASMFTIGPYAYETQLVSPPVFSNFTAEPSTANFTPPPESVLMTTPSSPEVPFAQ- 183 Query: 1237 XXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRS 1091 NHK AL YE L+SPGSV STSGTS+P+ D+RS Sbjct: 184 --LLASSLDRARKSNHKFALYNYEYQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRS 241 Query: 1090 VLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS----------------------- 980 LE R +A K+ + FST KW SR+GSGSL PD + Sbjct: 242 SLELRKGEAPKILGFEHFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLG 301 Query: 979 ------------------------AGYVPYDSIPLENQISEVVSLANSETGSPIAEALID 872 G SI ++NQI VS+ANS+ GS L+D Sbjct: 302 SGCATPDGLGQDSRLGSGSLTPDGVGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVD 361 Query: 871 HRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQX 731 HRVSFEL ED+ L QG +DP R + ++ C +CS Sbjct: 362 HRVSFELTGEDVARCLANKTGALLRNMSSSSQGILAKDPIDR-EKILKETNSCCDVCS-- 418 Query: 730 XXXXXXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKS 554 E C K++SVS S+KEFNFD+ G+ G S S WW ++K+ GK Sbjct: 419 
-------GKAIGGEHCCPKRNSVS--SSKEFNFDNRKGDVSGTSANGSSWWTNKKVDGKE 469 Query: 553 IDAQKNWTFFPLLQPGVS 500 + +W FFP+LQP +S Sbjct: 470 SKSVNSWAFFPMLQPDIS 487 >ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum] Length = 492 Score = 348 bits (894), Expect = 4e-93 Identities = 226/494 (45%), Positives = 273/494 (55%), Gaps = 71/494 (14%) Frame = -2 Query: 1768 NNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEPT 1589 NNS+DTVN AIVTAESRVQP + K+RWGSC++LS CFGS K+SKRI HA LVPEP Sbjct: 6 NNSIDTVNAAATAIVTAESRVQPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEP- 64 Query: 1588 VTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALSS 1409 V PNPST IV+PFIAPPSSP SFLQSDPP++THSP L ++P A SS Sbjct: 65 VAPIVPVAHSAPNPSTVIVMPFIAPPSSPASFLQSDPPSSTHSPAAGL-LSPSVNAAYSS 123 Query: 1408 SGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXXX 1232 SG A IFTIGPYA+ETQLVSPPVFS FTTEPSTA+ T PPE VQ TTPSSPEVPFAQ Sbjct: 124 SGSASIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQ-LL 182 Query: 1231 XXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSVL 1085 +HK AL YE L+SPGSV STSGTS+P+ D+RS L Sbjct: 183 ASSLDRARKNNGSHKFALYNYEFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSL 242 Query: 1084 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------------ 977 E + K+ + FST +W SR+GSGSL PD + Sbjct: 243 ELSRGETPKILGFEHFSTRRWNSRIGSGSLTPDGAGQGSRLGSGSLTPDGFAHASRLGSG 302 Query: 976 ---------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEALIDHRVS 860 G P ++NQISE VS+ANSE GS L+DHRVS Sbjct: 303 CTTPDGLGQDSRLGSGSLTPDGAGPTTRESMQNQISEDVSVANSEHGSQSNATLVDHRVS 362 Query: 859 FELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQXXXXX 719 FEL ED+ L QG +DP R + ++ C +CS+ Sbjct: 363 FELTGEDVARCLANKAGALLRNMSSSSQGILAKDPIDR-ERILKETNGCCDVCSR-KTND 420 Query: 718 XXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSIDAQK 539 E C K++SVS S+KEFNFD+ G+ S S WW+++K+G+ Sbjct: 421 KSDNSCAGGEQCCQKRNSVS--SSKEFNFDNRKGDVSDTSANGSGWWSNKKVGEKEGRSV 478 Query: 538 N-WTFFPLLQPGVS 500 N W FFP+LQP +S Sbjct: 479 NSWAFFPMLQPDIS 492 
>ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806399 [Glycine max] Length = 515 Score = 337 bits (864), Expect = 1e-89 Identities = 218/497 (43%), Positives = 269/497 (54%), Gaps = 78/497 (15%) Frame = -2 Query: 1768 NNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSK---RISHAALVP 1598 N + +TV AIV AESRVQP K+RWG CW+ WCFGS K+SK RI HA LVP Sbjct: 20 NATAETVFAAANAIVAAESRVQPTDAPKKRWGGCWSQYWCFGSRKSSKSSKRIGHAVLVP 79 Query: 1597 EPTVTE--STSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 1424 EP + +A A PNPST+IV+PFIAPPSSP SFLQSDPP+ SP GLL+++ L+ Sbjct: 80 EPAAPTGPAAAATAAAPNPSTAIVMPFIAPPSSPASFLQSDPPSGIQSPPGLLSLSALAA 139 Query: 1423 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPF 1247 NA SS G A +FTIGPYA+ETQLVSPPVFSAFTTEPSTA TPP E VQ TTPSSP+VPF Sbjct: 140 NAYSSGGPATMFTIGPYAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPF 199 Query: 1246 AQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 1100 AQ K L YE H LISPGS STSGTS+P+ D Sbjct: 200 AQLLASSLDRARKCNGH-QKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPD 258 Query: 1099 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGS----------------GSLIPD------ 986 + LEF + K+ + FST +WGSRLGS GSL PD Sbjct: 259 RPPTLEFPKGETPKILGVEHFSTRRWGSRLGSGSLTPDSAWQGSRLGSGSLTPDGVGLAS 318 Query: 985 -------------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEA 881 SAG ++I ++NQIS+ +LA+S+ G P Sbjct: 319 RLGSGCVTPDGLGQESRLGSGCLTPDSAGPTNQNNISVQNQISKEATLADSDNGHPSNAT 378 Query: 880 LIDHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLC 740 L+DHRVSFEL ED+ L G QG T+DP R + V +C C Sbjct: 379 LVDHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILTKDPVDR-ERVQIDTNSSCNAC 437 Query: 739 SQXXXXXXXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI- 563 ++ E CLHK++SV+ S+KEFNFD+ G+ + EWW + K+ Sbjct: 438 TE-KTDDKPDNPVGKGEQCLHKQNSVN--SSKEFNFDNRKGDVSVTTGSGYEWWTNRKVA 494 Query: 562 GKSIDAQKNWTFFPLLQ 512 GK + +W FFP+LQ Sbjct: 495 GKEGRSANSWAFFPMLQ 511 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 337 bits (863), Expect = 1e-89 
Identities = 215/477 (45%), Positives = 266/477 (55%), Gaps = 50/477 (10%) Frame = -2 Query: 1780 TARVNNSVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALV 1601 T +N++++T+N AI +AE+RV +VQKRRWGSCW WCF S K KRI HA L Sbjct: 8 TRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPK-DKRIGHAVLA 66 Query: 1600 PEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVN 1421 PE S A+N + +IVLPF+APPSSP SFLQS+PP+AT SP GLL++T ++ N Sbjct: 67 PESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINAN 126 Query: 1420 ALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFA 1244 S G A IF IGPYAHETQLVSPPVFS FTTEPSTA T PPE V TTPSSPEVPFA Sbjct: 127 IYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA 186 Query: 1243 QXXXXXXXXXXXXXXXNHKLALSCYE-----------VHHLISPGSVNSTSGTSSPYLDK 1097 Q H+ LS YE V HLISP S S SGTSSP+ D+ Sbjct: 187 Q----LFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR 242 Query: 1096 RSV-------LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA-----GYV----- 968 V LEFR KL K S +WGSR+GSGS+ PD G V Sbjct: 243 DFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVLDRQV 302 Query: 967 -------PYDSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDI-------PT 830 D L+ QIS+V S + S++G P E ++DHRVSFEL ED+ Sbjct: 303 SDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSA 362 Query: 829 ALVGGCQGT-RDPYT-RID----DVVPHKVKNCFLCSQXXXXXXXXXXXXXXEDCLHKKS 668 ALV + ++P T ID +VV + HK+ Sbjct: 363 ALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQR 422 Query: 667 SVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEK-IGKSIDAQKNWTFFPLLQPGVS 500 S++LGSAKEFNFD++ G K N+SS+WWA+EK +GK + A KNW+ F ++QP VS Sbjct: 423 SITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479 >ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798631 isoform X1 [Glycine max] Length = 504 Score = 335 bits (858), Expect = 6e-89 Identities = 220/504 (43%), Positives = 266/504 (52%), Gaps = 84/504 (16%) Frame = -2 Query: 1771 VNNSVDTVNXXXXAIVTAESRVQPVS-VQKRRWGSCWNLSWCFGSLKTSKRISHAALVPE 1595 VNN+VDTVN AIV AESR+QP + V K+RWGSCW+L WCFG K 
SKR+ +A LVPE Sbjct: 4 VNNTVDTVNAAASAIVYAESRIQPTTTVPKKRWGSCWSLCWCFGPHKNSKRVGNAVLVPE 63 Query: 1594 PTVTESTSAVADNP-----NPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPL 1430 P E V +P NPST+IV+PFI PPSSP SFLQSDPP+AT SP GL +++ L Sbjct: 64 PV--EPIGPVGFHPATAAPNPSTAIVMPFIVPPSSPASFLQSDPPSATQSPVGLFSLSSL 121 Query: 1429 SVNALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEV 1253 +VNA S G A IF IGPY +ETQLVSPPVFS FTTEPSTA T PPE VQ TTPSSPEV Sbjct: 122 TVNA--SGGPASIFAIGPYTYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEV 179 Query: 1252 PFAQXXXXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPY 1106 PFAQ N + ALS YE L+SP S+ STSG+S+P+ Sbjct: 180 PFAQLLASSLDRNCKSNGTNQRFALSNYEFQPYQQYPGSPGTQLVSPRSIISTSGSSTPF 239 Query: 1105 LDKRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------------------- 986 D+ VLEF +A KL + F T KW SRLGSGSL PD Sbjct: 240 PDRHPVLEFHKGEAPKLLGFENFLTHKWNSRLGSGSLTPDSAGQGSRLGSGSFTPDAVKL 299 Query: 985 -------------------YSAGYVPYDS--------IPLENQISEVVSLANSETGSPIA 887 + +G + D+ I + QISEV S+ NSE Sbjct: 300 ASQLGSGCLTPDGLCQDSRFGSGSLTPDAVAPTARNDIDIGKQISEVTSIVNSENECQPK 359 Query: 886 EALIDHRVSFELLQEDIPTALV------------GGCQGT--RDPYTRIDDVVPHKVKNC 749 AL+DHRVSFEL D+P L G QGT DP I+ + + +C Sbjct: 360 AALVDHRVSFELTGVDVPRCLANKSGSSLLGNMSGSSQGTLVEDP-VDIEKIQKNSNSSC 418 Query: 748 FLCSQXXXXXXXXXXXXXXED----CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEW 581 CS+ + C K S S+KEFNFD+ G SS W Sbjct: 419 AFCSRKTSNASNDKSCNSPGEGAEQCCRKHH--SFNSSKEFNFDNRKGVVSDTPANSSNW 476 Query: 580 WADEKI-GKSIDAQKNWTFFPLLQ 512 W ++KI GK + +WTFFP+LQ Sbjct: 477 WTNKKIVGKEGRSSNSWTFFPMLQ 500 >ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818313 isoform X1 [Glycine max] Length = 509 Score = 331 bits (849), Expect = 6e-88 Identities = 214/495 (43%), Positives = 266/495 (53%), Gaps = 78/495 (15%) Frame = -2 Query: 1762 SVDTVNXXXXAIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSK---RISHAALVPEP 1592 + +TV AIV AESRVQP K+RWG CW+ WCFGS K+SK RI HA LVPEP Sbjct: 21 TAETVFAAANAIVAAESRVQPTDAPKKRWGGCWSQYWCFGSCKSSKSSKRIGHAVLVPEP 80 Query: 1591 
TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 1412 +A A PNPS +IV+PFIAPPSSP SFLQSDPP+ SP GLL+++ L+ NA S Sbjct: 81 AAPTGPAAAAAAPNPSAAIVMPFIAPPSSPASFLQSDPPSGIQSPPGLLSLSALAANAYS 140 Query: 1411 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 1235 S G A +FTIGPYA+ETQLVSPPVFSAFTTEPSTA TPP E VQ TTPSSP+VPFAQ Sbjct: 141 SGGPASMFTIGPYAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLL 200 Query: 1234 XXXXXXXXXXXXXNHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 1088 HK L YE H LISPGS STSGTS+P+ D+ Sbjct: 201 ASSLDRARKSNGN-HKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPT 259 Query: 1087 LEFRMV--DASKLFDSKKFSTCKWGSRLGS----------------GSLIPD-------- 986 LEF + ++ + FST +WGSRLGS GSL PD Sbjct: 260 LEFPFPKGETPRILGFEHFSTRRWGSRLGSGSLTPDGAWQGSRLGSGSLTPDGIGLASRL 319 Query: 985 -----------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALI 875 SAG + ++I ++NQIS+ +LA+++ G LI Sbjct: 320 GSGCVTPDGLGLESRLGSGCLTPDSAGPINQNNISVQNQISKEATLADTDNGHSSNATLI 379 Query: 874 DHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQ 734 DHRVSFEL ED+ L G QG ++DP R K+ C++ Sbjct: 380 DHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILSKDPVDR-----ERVQKDTDTCTE 434 Query: 733 XXXXXXXXXXXXXXEDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GK 557 E CLHK++SV+ S+KEFNFD+ G+ + SEWW + K+ GK Sbjct: 435 --KTDDKPDNSVGGEQCLHKQNSVN--SSKEFNFDNRKGDVSVTAGSGSEWWTNRKVAGK 490 Query: 556 SIDAQKNWTFFPLLQ 512 + +W FFP+LQ Sbjct: 491 EGRSANSWAFFPMLQ 505