BLASTX nr result
ID: Achyranthes23_contig00014521
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Achyranthes23_contig00014521 (1988 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264... 407 e-111 ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260... 396 e-107 ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583... 389 e-105 ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citr... 389 e-105 gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein i... 385 e-104 ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-... 385 e-104 ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm... 381 e-103 ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus tr... 381 e-103 gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein i... 380 e-102 gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus pe... 374 e-101 ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210... 374 e-101 ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225... 374 e-100 gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] 370 e-100 gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus... 352 4e-94 gb|AFK46430.1| unknown [Medicago truncatula] 350 2e-93 ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494... 348 4e-93 ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806... 337 1e-89 ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241... 337 1e-89 ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798... 335 6e-89 ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818... 331 6e-88 >ref|XP_002272322.1| PREDICTED: uncharacterized protein LOC100264629 [Vitis vinifera] Length = 448 Score = 407 bits (1046), Expect = e-111 Identities = 243/454 (53%), Positives = 279/454 (61%), Gaps = 30/454 (6%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 VNNSV+T+N IV+AESRVQP +VQKRRWGSC +L WCFGS + SKRI HA LVPEP Sbjct: 4 VNNSVETINAAATAIVSAESRVQPTTVQKRRWGSCLSLYWCFGSHRHSKRIGHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 V + + ++N N STSIVLPFIAPPSSP SFLQSDPP++T SP G L++T LSVNA S Sbjct: 64 MVPGAVAPASENLNLSTSIVLPFIAPPSSPASFLQSDPPSSTQSPAGFLSLTALSVNAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 784 SG A +F IGPYAHETQLVSPPVFS F TEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PSGPASMFAIGPYAHETQLVSPPVFSTFPTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYE-----------VHHLISPGSVNSTSGTSSPYLDKRSV 931 KL+LS YE V HLISP S SGTSSP+ D+R + Sbjct: 184 TSSLDRSRRNSGTNQKLSLSNYEFQPYQLYPESPVGHLISP---ISNSGTSSPFPDRRPI 240 Query: 932 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSAGYVPYDSIPLENQISEVVSLANS 1111 V+A KL + FST +WGSRLGSGSL PD AG DS LENQISEV SLANS Sbjct: 241 -----VEAPKLLGFEHFSTRRWGSRLGSGSLTPD-GAGPASRDSFLLENQISEVASLANS 294 Query: 1112 ETGSPIAEALIDHRVSFELLQEDI-------PTALVGGCQGT------RDPYTRIDDVVP 1252 E+GS E +IDHRVSFEL ED+ P A Q T R D + Sbjct: 295 ESGSQNGETVIDHRVSFELAGEDVAVCVEKKPVASAETVQNTLQDIVEEGEIERERDGIS 354 Query: 1253 HKVKNC--FLCSQXXXXXXXXXXXXXXXDCLHKK-SSVSLGSAKEFNFDSSTGETIGKSN 1423 +NC F + + HKK + GS KEFNFD++ GE K N Sbjct: 355 ESTENCCEFCVGEALKAASEKASAEGEEEQCHKKHPPIRHGSIKEFNFDNTKGEVSAKPN 414 Query: 1424 -LSSEWWADEK-IGKSIDAQKNWTFFPLLQPGVS 1519 + SEWW +EK +GK Q NWTFFPLLQPG+S Sbjct: 415 IIGSEWWVNEKVVGKGTGPQTNWTFFPLLQPGIS 448 >ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 [Solanum lycopersicum] Length = 470 Score = 396 bits (1018), Expect = e-107 Identities = 228/472 (48%), Positives = 271/472 (57%), Gaps = 48/472 (10%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 V N+VDTVN IV AESRVQP +VQKRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 VQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 V +NPN S +IV+PFIAPPSSP SFL SDPP+AT SP GLL++ LS+NA S Sbjct: 64 VAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKALSINAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 784 G A IF IGPYAHETQLVSPPVFS FTTEPSTA TPP EPV TTP SPEVPFAQ Sbjct: 124 PGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPEPVHMTTPPSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYEV----------HHLISPGSVNSTSGTSSPYLDKRSVL 934 +K LS YE +LISPGSV S SGTSSP+ K ++ Sbjct: 184 TSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPII 243 Query: 935 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDY-------------------------- 1036 EFR + K + FST KWGSR+GSGS+ P Sbjct: 244 EFRKGEPPKFLGYEHFSTRKWGSRVGSGSVTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303 Query: 1037 SAGYVPY-DSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQG 1213 + G P DS LENQISEV SLANS+ GS I EA+IDHRVSFEL +ED+P+ C+ Sbjct: 304 NGGEPPSRDSYLLENQISEVASLANSDNGSEIGEAVIDHRVSFELTEEDVPS-----CRE 358 Query: 1214 TRDPYTRIDDVVPHKVKNCF---------LCSQXXXXXXXXXXXXXXXDCLHKKSSVSLG 1366 + +P V N + + +C K +++ G Sbjct: 359 KEPVMSHSQPTLPMDVSNLLASEMRSGSSMAEEKTYGSPRKASESGEDECHRKHRNITFG 418 Query: 1367 SAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KSIDAQKNWTFFPLLQPGVS 1519 S+K+F+FD+ E + K ++ EWW +K K Q NWTFFP+LQPGVS Sbjct: 419 SSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAVKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum] Length = 470 Score = 389 bits (1000), Expect = e-105 Identities = 227/472 (48%), Positives = 268/472 (56%), Gaps = 48/472 (10%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 V N+VDTVN IV AESRVQP +VQKRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 VQNTVDTVNAAASAIVNAESRVQPSTVQKRRWGSCWSLYWCFGSHKHSKRIGHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 V +NPN S +IV+PFIAPPSSP SFL SDPP+AT SP GLL++ LS+NA S Sbjct: 64 AAPGPAVPVTENPNHSATIVIPFIAPPSSPASFLPSDPPSATQSPAGLLSLKSLSINAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 784 G A IF IGPYAHETQLVSPPVFS FTTEPSTA TPP E V TTP SPEVPFAQ Sbjct: 124 PGGTASIFAIGPYAHETQLVSPPVFSTFTTEPSTANFTPPPELVHMTTPPSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYEV----------HHLISPGSVNSTSGTSSPYLDKRSVL 934 +K LS YE +LISPGSV S SGTSSP+ K ++ Sbjct: 184 TSSLARNRRYSGSNYKFPLSQYEFVPYQDPGSPGSNLISPGSVVSNSGTSSPFPGKCPII 243 Query: 935 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDY-------------------------- 1036 EFR + K + FST KWGSR+GSGSL P Sbjct: 244 EFRKGEPPKFLGYEHFSTRKWGSRVGSGSLTPSGWGSRLGSGTLTPNGGISRLGSGTVTP 303 Query: 1037 SAGYVPY-DSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQG 1213 + G P DS LE QISEV SLANS+ GS I E +IDHRVSFEL ED+P+ C+ Sbjct: 304 NGGEPPSRDSYLLEYQISEVASLANSDNGSEIGEGVIDHRVSFELTGEDVPS-----CRE 358 Query: 1214 TRDPYTRIDDVVPHKVKNCF---------LCSQXXXXXXXXXXXXXXXDCLHKKSSVSLG 1366 + +P V N + + C K +++ G Sbjct: 359 KEPVMSHSQQTLPMDVSNLLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFG 418 Query: 1367 SAKEFNFDSSTGETIGKSNLSSEWW-ADEKIGKSIDAQKNWTFFPLLQPGVS 1519 S+K+F+FD+ E + K ++ EWW +D+ GK Q NWTFFP+LQPGVS Sbjct: 419 SSKDFDFDNVKIEVLEKDSIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470 >ref|XP_006421977.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] gi|557523850|gb|ESR35217.1| hypothetical protein CICLE_v10004813mg [Citrus clementina] Length = 500 Score = 389 bits (1000), Expect = e-105 Identities = 233/497 (46%), Positives = 281/497 (56%), Gaps = 73/497 (14%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 V++SV+TVN IV+AESR++P ++QKRRWGSCW+L WCFGS KTSKRISHA LVPEP Sbjct: 4 VHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 VT + + A+ ST+IVLPFIAPPSSP SFLQSDPP+AT SP GLL++ LSVNA S Sbjct: 64 MVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLNSLSVNAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 784 G A +F IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 931 KL+LS Y LISPGSV S SGTSSP+ D+ + Sbjct: 184 TSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPI 243 Query: 932 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 1072 L+F A KL + F+T KWGSRLGSGS+ PD +G + D + L Sbjct: 244 LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303 Query: 1073 ----------------------------------ENQISEVVSLANSETGSPIAEALIDH 1150 ENQISEV SLANS+ G+ E +IDH Sbjct: 304 GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363 Query: 1151 RVSFELLQEDIPTALVG-GCQGTRDPYTRIDDVVPH----------KVKNCF-LCSQXXX 1294 RVSFEL E++ L R D+VP +N F LC + Sbjct: 364 RVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEESS 423 Query: 1295 XXXXXXXXXXXXD--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSI 1468 + C K S++LGS KEFNFD++ GE K +++SEWWA+E +GK Sbjct: 424 NRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKES 483 Query: 1469 DAQKNWTFFPLLQPGVS 1519 NWTFFP+LQ S Sbjct: 484 KPSNNWTFFPMLQSEAS 500 >gb|EOY23266.1| Hydroxyproline-rich glycoprotein family protein isoform 1 [Theobroma cacao] Length = 485 Score = 385 bits (990), Expect = e-104 Identities = 231/486 (47%), Positives = 274/486 (56%), Gaps = 62/486 (12%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 VN+SV+TVN IV+A+SRVQP +VQK+RWGSCW L WCFGS K SKRI HA LVPEP Sbjct: 4 VNDSVETVNAAATAIVSADSRVQPTTVQKKRWGSCWGLYWCFGSQKNSKRIGHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 V ++ + A+N + T I+LPFIAPPSSP SFLQSDPP+AT SP GLL++T LSVNA S Sbjct: 64 VVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 784 G A IF IGPYAHETQLV+PPVFSA TTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 931 K LS YE +LISPGS S SGTSSP+ D+R + Sbjct: 184 TSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPDRRPI 243 Query: 932 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 1072 LEFRM +A KL + F+T KWGSRLGSGSL PD +G V D + L Sbjct: 244 LEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGSRLGS 303 Query: 1073 ------------------ENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIPTALV 1198 +QISEV LAN G E ++DHRVSFEL ED+ L Sbjct: 304 GSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVAPCLE 363 Query: 1199 GG----------------CQGTRDPYTRIDDVVPHKVKNC--FLCSQXXXXXXXXXXXXX 1324 +G ++ D + +C F+ Sbjct: 364 SKSLLPSRAVSEYPKDLVAEGRKER----DGIKKDLESSCELFIRETSNETVEKASGEAE 419 Query: 1325 XXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQKNWTFFPL 1501 K SV+LGS KEFNFD++ GE K + SEWWA+EK+ GK +WTFFP+ Sbjct: 420 EEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWTFFPM 479 Query: 1502 LQPGVS 1519 LQP VS Sbjct: 480 LQPEVS 485 >ref|XP_006490432.1| PREDICTED: uncharacterized protein FLJ40925-like [Citrus sinensis] Length = 500 Score = 385 bits (989), Expect = e-104 Identities = 231/497 (46%), Positives = 280/497 (56%), Gaps = 73/497 (14%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 V++SV+TVN IV+AESR++P ++QKRRWGSCW+L WCFGS KTSKRISHA L+PEP Sbjct: 4 VHDSVETVNAAATAIVSAESRLRPAAIQKRRWGSCWSLYWCFGSHKTSKRISHAVLLPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 VT + + A+ ST+IVLPFIAPPSSP SFLQSDP +AT SP GLL++ LSVNA S Sbjct: 64 MVTGAAAPAAETQAHSTAIVLPFIAPPSSPASFLQSDPSSATQSPAGLLSLNSLSVNAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 784 G A +F IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 124 PGGPASMFAIGPYAHETQLVTPPVFSAFTTEPSTALCTPPPESVQLTTPSSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 931 KL+LS Y LISPGSV S SGTSSP+ D+ + Sbjct: 184 TSSLERARRNSGTNQKLSLSHYGYQPYQLYPGSPGGQLISPGSVVSYSGTSSPFPDRHPI 243 Query: 932 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL------ 1072 L+F A KL + F+T KWGSRLGSGS+ PD +G + D + L Sbjct: 244 LDFSAAAAPKLLGFEHFTTRKWGSRLGSGSVTPDGVGIGSRMGSGSLTPDGVGLGSRLGS 303 Query: 1073 ----------------------------------ENQISEVVSLANSETGSPIAEALIDH 1150 ENQISEV SLANS+ G+ E +IDH Sbjct: 304 GTVTPDGAGLGSRLGSGSLTPDGMGPTSRDGFVRENQISEVASLANSDNGTKSDEHIIDH 363 Query: 1151 RVSFELLQEDIPTALVG-GCQGTRDPYTRIDDVVPH----------KVKNCF-LCSQXXX 1294 RVSFEL E++ L R D+VP +N F LC + Sbjct: 364 RVSFELSGEEVARCLANKSAASPRIVPEFPQDIVPEGEIRRDGKLTDSENHFELCPEESS 423 Query: 1295 XXXXXXXXXXXXD--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSI 1468 + C K S++LGS KEFNFD++ GE K +++SEWWA+E +GK Sbjct: 424 NRMPEKTMRDGEEEYCYRKHRSITLGSIKEFNFDNTEGEVSNKPSINSEWWANENVGKES 483 Query: 1469 DAQKNWTFFPLLQPGVS 1519 NWTFFP+LQ S Sbjct: 484 KPSNNWTFFPMLQSEAS 500 >ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis] gi|223547583|gb|EEF49078.1| conserved hypothetical protein [Ricinus communis] Length = 510 Score = 381 bits (978), Expect = e-103 Identities = 235/494 (47%), Positives = 276/494 (55%), Gaps = 71/494 (14%) Frame = +2 Query: 251 NNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEPT 430 N+SVDT+N IV+AESRVQP +VQKRRWG CW+L WCFGS KT KRI HA L PEP Sbjct: 19 NSSVDTINAAATAIVSAESRVQPTTVQKRRWGGCWSLYWCFGSHKT-KRIGHAVLAPEPE 77 Query: 431 VTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALSS 610 V + A+N + ST+I +PFIAPPSSP SFLQSDPP+AT SP GLL++T LSVNA S Sbjct: 78 VQGAVVTSAENQSQSTAITVPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSVNAYSP 137 Query: 611 SGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXXX 787 G A IF IGPYAHETQLV+PP FSAFTTEPSTA TPP E VQ TTPSSPEVPFAQ Sbjct: 138 GGPASIFAIGPYAHETQLVTPPAFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLLT 197 Query: 788 XXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSVL 934 K ALS YE LISPGSV S SGTSSP+ D+ +L Sbjct: 198 SSLERARRNSGTNQKFALSHYEFQSYPLYPGSPGGQLISPGSVISNSGTSSPFPDRYPIL 257 Query: 935 EFRMVDASKLFDSKKFSTCKWGSRLGS----------------GSLIPD----------- 1033 EFRM +A KL + F+T KWGSRLGS G++ PD Sbjct: 258 EFRMGEAPKLLGFEHFTTRKWGSRLGSGTVTPDGVGLGSRLGSGTVTPDGVGQGSRLGSG 317 Query: 1034 --------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALIDHR 1153 + G D LENQISEV SLANSE GS E ++DHR Sbjct: 318 TVTPDGVGLRSMLGSGSLTPDAVGPASRDGFFLENQISEVASLANSENGSKTDENIVDHR 377 Query: 1154 VSFELLQEDIPTAL----VGGCQGTRD--PYTRIDDVVPH-----KVKNCFLCSQXXXXX 1300 VSFEL E++ L + C+ + P + +D + +N Sbjct: 378 VSFELSGEEVARCLESKSLASCRAFSECPPDSMAEDQIKSGKMLMTDENLPTGETSGETP 437 Query: 1301 XXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQ 1477 C K S++LGS KEFNFD+S E K +++SEWWA+E I GK Sbjct: 438 EKPSGEMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDKPSINSEWWANETIAGKEARPA 496 Query: 1478 KNWTFFPLLQPGVS 1519 NWTFFPLLQP VS Sbjct: 497 NNWTFFPLLQPEVS 510 >ref|XP_002318209.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|222858882|gb|EEE96429.1| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 507 Score = 381 bits (978), Expect = e-103 Identities = 241/502 (48%), Positives = 281/502 (55%), Gaps = 79/502 (15%) Frame = +2 Query: 251 NNSVDTVNXXXXXIVTAESRVQPVS--VQKRRWGSCWNLSWCFGSL---KTSKRISHAAL 415 N+S++TVN IV+AESRVQP S VQKRRWG CW+L WCFGS K SKRI HA L Sbjct: 6 NSSIETVNAAATAIVSAESRVQPSSSSVQKRRWGGCWSLYWCFGSHGSHKNSKRIGHAVL 65 Query: 416 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 595 VPEP V + S+ +N ST I+LPFIAPPSSP SFLQSDPP++T SP GLL++T LS Sbjct: 66 VPEPEVPGAVSSSTENQTQSTPILLPFIAPPSSPASFLQSDPPSSTQSPAGLLSLTSLSA 125 Query: 596 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 772 NA S G A IF IGPYAHETQLV+PPVFSAFTTEPSTA T PPE VQ TTPSSPEVPF Sbjct: 126 NAYSPRGPASIFAIGPYAHETQLVTPPVFSAFTTEPSTAPFTPPPESVQLTTPSSPEVPF 185 Query: 773 AQXXXXXXXXXXXXXXXXHKLALSCYEV--HHL---------ISPGSVNSTSGTSSPYLD 919 AQ K +LS YE +HL ISPGS S SGTSSP+ D Sbjct: 186 AQLLTSSLERARRNSGPNQKFSLSHYEFQSYHLYPGSPGGQIISPGSAISNSGTSSPFPD 245 Query: 920 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS-------------------- 1039 + +LEFRM +A KL + FST KWGSRLGSGSL PD + Sbjct: 246 RHPMLEFRMGEAPKLLGFEHFSTRKWGSRLGSGSLTPDATPDGMGLSRLGSGTVTPDGMG 305 Query: 1040 -------------AG--------------YVPYDSIP--LENQISEVVSLANSETGSPIA 1132 AG +VP I LENQISEV SL NSE GS Sbjct: 306 LSRLCSGTATPDGAGLRSRLGSGTLTPDCFVPASQIGFLLENQISEVASLTNSENGSKTE 365 Query: 1133 EALIDHRVSFELLQEDIPTAL-VGGCQGTRDPYTRIDDVVPH----------KVKNCFLC 1279 E ++ HRVSFEL E++ L + TR D +P + C Sbjct: 366 ENVVHHRVSFELSGEEVARCLEIKSVASTRTFPEYPQDTMPEDPVRGDRLAMNGERCLQN 425 Query: 1280 SQXXXXXXXXXXXXXXXDCLHKK-SSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI 1456 + D +++K S++LGS KEFNFD+S GE K +SSEWWA+E I Sbjct: 426 GEASSEMPEKNSEETEEDHVYRKHRSITLGSIKEFNFDNSKGEVSDKPAISSEWWANETI 485 Query: 1457 -GKSIDAQKNWTFFPLLQPGVS 1519 GK +WTFFPLLQP VS Sbjct: 486 AGKEARPANSWTFFPLLQPEVS 507 >gb|EOY23267.1| Hydroxyproline-rich glycoprotein family protein isoform 2 [Theobroma cacao] Length = 489 Score = 380 bits (975), Expect = e-102 Identities = 231/490 (47%), Positives = 274/490 (55%), Gaps = 66/490 (13%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQ----KRRWGSCWNLSWCFGSLKTSKRISHAAL 415 VN+SV+TVN IV+A+SRVQP +VQ K+RWGSCW L WCFGS K SKRI HA L Sbjct: 4 VNDSVETVNAAATAIVSADSRVQPTTVQVHVYKKRWGSCWGLYWCFGSQKNSKRIGHAVL 63 Query: 416 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 595 VPEP V ++ + A+N + T I+LPFIAPPSSP SFLQSDPP+AT SP GLL++T LSV Sbjct: 64 VPEPVVPGASVSTAENVSNPTGIILPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSV 123 Query: 596 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 772 NA S G A IF IGPYAHETQLV+PPVFSA TTEPSTA T PPE VQ TTPSSPEVPF Sbjct: 124 NAYSPRGPASIFAIGPYAHETQLVTPPVFSALTTEPSTAPFTPPPESVQLTTPSSPEVPF 183 Query: 773 AQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 919 AQ K LS YE +LISPGS S SGTSSP+ D Sbjct: 184 AQLLTSSLERARRNSGINQKFGLSHYEFQSYQIYPGSPGGNLISPGSAISNSGTSSPFPD 243 Query: 920 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------YSAGYVPYDSIPL-- 1072 +R +LEFRM +A KL + F+T KWGSRLGSGSL PD +G V D + L Sbjct: 244 RRPILEFRMGEAPKLLGFENFTTRKWGSRLGSGSLTPDGLGQGSRLGSGSVTPDGMGLGS 303 Query: 1073 ----------------------ENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDIP 1186 +QISEV LAN G E ++DHRVSFEL ED+ Sbjct: 304 RLGSGSLTPDGLGPASRDGFLVGSQISEVALLANPANGPKNDETIVDHRVSFELSGEDVA 363 Query: 1187 TALVGG----------------CQGTRDPYTRIDDVVPHKVKNC--FLCSQXXXXXXXXX 1312 L +G ++ D + +C F+ Sbjct: 364 PCLESKSLLPSRAVSEYPKDLVAEGRKER----DGIKKDLESSCELFIRETSNETVEKAS 419 Query: 1313 XXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSIDAQKNWT 1489 K SV+LGS KEFNFD++ GE K + SEWWA+EK+ GK +WT Sbjct: 420 GEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEKVAGKEARPGNSWT 479 Query: 1490 FFPLLQPGVS 1519 FFP+LQP VS Sbjct: 480 FFPMLQPEVS 489 >gb|EMJ20240.1| hypothetical protein PRUPE_ppa004616mg [Prunus persica] Length = 499 Score = 374 bits (961), Expect = e-101 Identities = 227/496 (45%), Positives = 266/496 (53%), Gaps = 73/496 (14%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 VN+SVDT+N IV+AE+R QP +V KRRWGSCW+L WCFG K +KRI HA LVPEP Sbjct: 4 VNSSVDTINAAATAIVSAEARPQPTTVPKRRWGSCWSLYWCFGPHK-NKRIGHAVLVPEP 62 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 V + + DN ST+IV+PFIAPPSSP SFL SDPP+AT SP G L++ LS NA S Sbjct: 63 VVPGAAVSAIDNQTTSTAIVVPFIAPPSSPASFLPSDPPSATQSPAGFLSLKSLSANAYS 122 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXX 784 G A IF+IGPYA+ETQLVSPPVFS F TEPSTA T PPE VQ TTPSSPEVPFAQ Sbjct: 123 PGGPASIFSIGPYAYETQLVSPPVFSTFNTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 182 Query: 785 XXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 931 K ALS YE +LISPGS S SGTSSP+ D+ V Sbjct: 183 TSSLDRNRRNSGTNQKFALSHYEFQPYQQYPGSPGGNLISPGSAVSNSGTSSPFPDRHPV 242 Query: 932 LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------------------------- 1033 LEFRM +A KLF F+T KWGSR+GSGSL PD Sbjct: 243 LEFRMGEAPKLFGFDHFTTRKWGSRIGSGSLTPDGVGLGSRLGSGSLTPDGNELGSRLGS 302 Query: 1034 ---------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALIDH 1150 G DS LENQISEV SLANSE+G E + DH Sbjct: 303 GCVTPNGAGIGSRLGSGCLTPDGPGPASRDSFLLENQISEVASLANSESGCQTVETVFDH 362 Query: 1151 RVSFELLQEDIPTALV-----------GGCQGTRDPYTRIDDVVPHKVKN-C-FLCSQXX 1291 RVSFEL ED+ L G + Y D + N C F + Sbjct: 363 RVSFELTGEDVACCLANKAVASNRTASGSSKVIASEYPSERDALSSDSSNHCEFSVEESS 422 Query: 1292 XXXXXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKSI 1468 K S++LGS K+FNFD++ E K N+ SEWWA++ + K Sbjct: 423 SRIPENVSGEGEDQGYRKHRSITLGSTKDFNFDNTKAEVPNKPNIGSEWWANKNVAAKES 482 Query: 1469 DAQKNWTFFPLLQPGV 1516 +WTFFP+LQPGV Sbjct: 483 KPCNDWTFFPILQPGV 498 >ref|XP_004140832.1| PREDICTED: uncharacterized protein LOC101210841 [Cucumis sativus] Length = 497 Score = 374 bits (961), Expect = e-101 Identities = 227/498 (45%), Positives = 276/498 (55%), Gaps = 72/498 (14%) Frame = +2 Query: 242 ARVNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFG--SLKTSKRISHAAL 415 A +NNSVDTVN IV+AE+RVQP + KRRWGSCW+L WCFG S K++KRI HA L Sbjct: 2 ASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVL 61 Query: 416 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 595 VPEP V + + ++ PST++VLPFIAPPSSP SFLQS+P + T SP GLL++T LSV Sbjct: 62 VPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSLTALSV 121 Query: 596 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 772 N S +G A IF IGPY ++TQLVSPPVFSAFTTEPSTA IT PPE VQ TTPSSPEVPF Sbjct: 122 NNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPF 181 Query: 773 AQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 919 A+ K LS + HLISPGSV S SGTSSP+ D Sbjct: 182 AKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPD 241 Query: 920 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------- 1042 K +LEFRM DA KL + F+T KW SR+GSGSL PD + Sbjct: 242 KHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGS 301 Query: 1043 ----------------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEA 1138 G+ DS L+NQISEV SLANSETG Sbjct: 302 RLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETG--CQND 359 Query: 1139 LIDHRVSFELLQEDIP--------TALVGGCQGTRDPYTRIDDVVPHKVKNCFLCSQXXX 1294 + +HRVSFEL ED+ T++ + + T + + C Sbjct: 360 VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDI 419 Query: 1295 XXXXXXXXXXXXD--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KS 1465 D C + +V+LGS KEFNFD + GE +++ +EWWA+EK+G K Sbjct: 420 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 479 Query: 1466 IDAQKNWTFFPLLQPGVS 1519 NWTFFPLLQPGVS Sbjct: 480 ASPGNNWTFFPLLQPGVS 497 >ref|XP_004157195.1| PREDICTED: uncharacterized protein LOC101225370 [Cucumis sativus] Length = 497 Score = 374 bits (959), Expect = e-100 Identities = 227/498 (45%), Positives = 275/498 (55%), Gaps = 72/498 (14%) Frame = +2 Query: 242 ARVNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFG--SLKTSKRISHAAL 415 A +NNSVDTVN IV+AE+RVQP + KRRWGSCW+L WCFG S K++KRI HA L Sbjct: 2 ASINNSVDTVNAAATAIVSAEARVQPTTPPKRRWGSCWSLYWCFGIGSQKSNKRIGHAVL 61 Query: 416 VPEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 595 VPEP V + + ++ PST++VLPFIAPPSSP SFLQS+P + T SP GLL+ T LSV Sbjct: 62 VPEPAVPGAVAPAVEHRTPSTTMVLPFIAPPSSPASFLQSEPTSNTQSPAGLLSFTALSV 121 Query: 596 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPF 772 N S +G A IF IGPY ++TQLVSPPVFSAFTTEPSTA IT PPE VQ TTPSSPEVPF Sbjct: 122 NNYSPNGPASIFAIGPYTYDTQLVSPPVFSAFTTEPSTAPITPPPESVQLTTPSSPEVPF 181 Query: 773 AQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 919 A+ K LS + HLISPGSV S SGTSSP+ D Sbjct: 182 AKLLTSSLSHTNKSFGTNQKFTLSHCDFQPYQPYPGSPGAHLISPGSVISNSGTSSPFPD 241 Query: 920 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------- 1042 K +LEFRM DA KL + F+T KW SR+GSGSL PD + Sbjct: 242 KHPILEFRMADAPKLLGLEHFTTRKWISRMGSGSLTPDGTGLCSRLGSGTLTPDGMGMGS 301 Query: 1043 ----------------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEA 1138 G+ DS L+NQISEV SLANSETG Sbjct: 302 RLGSGSVTPNGMRQDSRLGSGTLTPDGLGHGLQDSPLLDNQISEVASLANSETG--CQND 359 Query: 1139 LIDHRVSFELLQEDIP--------TALVGGCQGTRDPYTRIDDVVPHKVKNCFLCSQXXX 1294 + +HRVSFEL ED+ T++ + + T + + C Sbjct: 360 VTNHRVSFELTGEDVARCLANKSLTSIRTESESPKQTSTSNQNENKESSREAETCEFFDI 419 Query: 1295 XXXXXXXXXXXXD--CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIG-KS 1465 D C + +V+LGS KEFNFD + GE +++ +EWWA+EK+G K Sbjct: 420 KTSAAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEIHNTASIGAEWWANEKVGVKE 479 Query: 1466 IDAQKNWTFFPLLQPGVS 1519 NWTFFPLLQPGVS Sbjct: 480 ASPGNNWTFFPLLQPGVS 497 >gb|EXB93840.1| hypothetical protein L484_004326 [Morus notabilis] Length = 521 Score = 370 bits (951), Expect = e-100 Identities = 232/518 (44%), Positives = 279/518 (53%), Gaps = 94/518 (18%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEP 427 VNNSV+T+N IV+AE+R QP +V KRRWGSCW+L WCFGS K SKRI HA LVPEP Sbjct: 4 VNNSVETINAAATAIVSAEARAQPAAVPKRRWGSCWSLYWCFGSHKNSKRIGHAVLVPEP 63 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 + + + +N PST+IVLPFIAPPSSP SFLQSDPP+AT SP GLL++T LS+NA S Sbjct: 64 VLPGAAAPAPENQAPSTAIVLPFIAPPSSPASFLQSDPPSATQSPAGLLSLTSLSINAYS 123 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 784 G IF IGPYA+ETQLVSPPVFS FTTEPSTA TPP E VQ TTPSSPEVPFAQ Sbjct: 124 PGGPTSIFAIGPYAYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEVPFAQLL 183 Query: 785 XXXXXXXXXXXXXXH-KLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRS 928 + K +LS E +LISPGSV S SGTSSP+ DK Sbjct: 184 TSSLDRTRRNSSGANQKFSLSHCEFQPYQLYPGSPGGNLISPGSVVSNSGTSSPFPDKHP 243 Query: 929 VLEFRMVDASKLFDSKKFSTCKWGSRLGS------------------------------- 1015 +L FRM +A +L + F+T KWGSRLGS Sbjct: 244 ILGFRMGEAPRLLGFEHFTTWKWGSRLGSGSLTPDGVGLGSRLGSGSVTPDGVGLGSRLG 303 Query: 1016 -GSLIPD-YSAG------------------------------YVPYDSIPLENQISEVVS 1099 GSL PD Y G V DS LENQISEV S Sbjct: 304 SGSLTPDGYGLGSRLGSGCMTPNGPGLGSRLGSGTLTPDGFLVVSGDSFLLENQISEVAS 363 Query: 1100 LANSETGSPIAEALIDHRVSFELLQEDIPTALVGGCQGTR-------------DPYTRID 1240 LANS+ G +++DHRVSFEL ED+ L + + T+ D Sbjct: 364 LANSDNGCQNDGSVVDHRVSFELTGEDVARCLASKSASSNGRTTSESLEDSPAECPTKKD 423 Query: 1241 DVVPHKVK--NCFLCSQXXXXXXXXXXXXXXXD--CLHKKSSVSLGSAKEFNFDSSTGET 1408 + + V N C + D K S++LGS KEFNFD++ + Sbjct: 424 GISANNVDSPNDQSCVEETSNKTPQSDCREGEDDHFYQKHRSITLGSIKEFNFDNTKADV 483 Query: 1409 IGKSNLSSEWWADEKI-GKSIDAQKNWTFFPLLQPGVS 1519 K + SEWWA+EK+ GK A +W+FFP+LQPGVS Sbjct: 484 SVKPTIGSEWWANEKVAGKEAKAGNSWSFFPILQPGVS 521 >gb|ESW24210.1| hypothetical protein PHAVU_004G111400g [Phaseolus vulgaris] Length = 500 Score = 352 bits (902), Expect = 4e-94 Identities = 217/501 (43%), Positives = 275/501 (54%), Gaps = 77/501 (15%) Frame = +2 Query: 236 MTARVNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKT---SKRISH 406 M +R N +++TVN IVTAESRVQP +V K+RWG CW+ WCFGS K+ SKRI H Sbjct: 1 MGSRNNTTLETVNAAATAIVTAESRVQPTTVPKKRWGGCWSQYWCFGSYKSTKSSKRIGH 60 Query: 407 AALVPEPTV-TESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAIT 583 A LVPEP T +A A PNPST+IV+PFIAPPSSP S +QSDPP+A SP GLL+++ Sbjct: 61 AVLVPEPVAPTGPAAAAAAPPNPSTAIVMPFIAPPSSPASLIQSDPPSAIQSPPGLLSLS 120 Query: 584 PLSVNALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSP 760 L+ +A SS G A +FTIGPYA+ETQLVSPPVFS FTTEPSTA T PPE V TTPSSP Sbjct: 121 SLAASAYSSGGPASMFTIGPYAYETQLVSPPVFSNFTTEPSTAPFTPPPESVHQTTPSSP 180 Query: 761 EVPFAQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSS 907 +VPFAQ K AL Y+ LISPGS STSGTS+ Sbjct: 181 DVPFAQ-LLASSLDRARKSNGNQKFALYNYDFQPYHQYPGSPGGQLISPGSAFSTSGTST 239 Query: 908 PYLDKRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS---------------- 1039 P+ D+ LEFR + K+ + FST +W SRLGSGSL PD + Sbjct: 240 PFPDRPPTLEFRKGETPKILGVEHFSTQRWSSRLGSGSLTPDGAGQGSRLGSGSVTPDGV 299 Query: 1040 -------------------------------AGYVPYDSIPLENQISEVVSLANSETGSP 1126 G + +++P++NQIS+ +LANS+ G P Sbjct: 300 GLASRLGSGCATPDGLGQESRLGSGCLTPDGVGQINENNLPVQNQISKEATLANSDNGHP 359 Query: 1127 IAEALIDHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKN 1267 LIDHRVSFEL ED+ L G QG +DP R + V+ + Sbjct: 360 SNATLIDHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILAKDPVDR-ERVLRDTDAS 418 Query: 1268 CFLCSQXXXXXXXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWAD 1447 C +C++ C HK++SV+ S+KEFNFDSS G G SEWW + Sbjct: 419 CNVCTE-KTDDKPYNPIGEGEQCFHKQNSVN--SSKEFNFDSSKGVVSGTGGSGSEWWTN 475 Query: 1448 EKI-GKSIDAQKNWTFFPLLQ 1507 ++ G+ + +W FFP+LQ Sbjct: 476 RRVAGREGRSANSWAFFPMLQ 496 >gb|AFK46430.1| unknown [Medicago truncatula] Length = 487 Score = 350 bits (897), Expect = 2e-93 Identities = 225/498 (45%), Positives = 274/498 (55%), Gaps = 75/498 (15%) Frame = +2 Query: 251 NNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSL-KTSKRISHAALVPEP 427 NNS+DTVN IV+AESRVQP S K+RWGSC++L CFGS KTS+RI HA LVPEP Sbjct: 6 NNSIDTVNAAATAIVSAESRVQPTSSPKKRWGSCFSLPSCFGSHNKTSERIGHAVLVPEP 65 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSP-GGLLAITPLSVNAL 604 V + PNPST+IV+PFIAPPSSP SFLQSDPP++THSP GLL+++ LS NA Sbjct: 66 -VAPTVPVANAAPNPSTAIVIPFIAPPSSPASFLQSDPPSSTHSPAAGLLSLSSLSANAY 124 Query: 605 SSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQX 781 S+SG A +FTIGPYA+ETQLVSPPVFS FT EPSTA T PPE V TTPSSPEVPFAQ Sbjct: 125 STSGPASMFTIGPYAYETQLVSPPVFSNFTAEPSTANFTPPPESVLMTTPSSPEVPFAQ- 183 Query: 782 XXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRS 928 HK AL YE L+SPGSV STSGTS+P+ D+RS Sbjct: 184 --LLASSLDRARKSNHKFALYNYEYQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRS 241 Query: 929 VLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYS----------------------- 1039 LE R +A K+ + FST KW SR+GSGSL PD + Sbjct: 242 SLELRKGEAPKILGFEHFSTRKWMSRIGSGSLTPDGTGQGSRLGSGSLTPDGVSHTSRLG 301 Query: 1040 ------------------------AGYVPYDSIPLENQISEVVSLANSETGSPIAEALID 1147 G SI ++NQI VS+ANS+ GS L+D Sbjct: 302 SGCATPDGLGQDSRLGSGSLTPDGVGPTTRGSIDVQNQIPVGVSVANSDHGSQTNATLVD 361 Query: 1148 HRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQX 1288 HRVSFEL ED+ L QG +DP R + ++ C +CS Sbjct: 362 HRVSFELTGEDVARCLANKTGALLRNMSSSSQGILAKDPIDR-EKILKETNSCCDVCS-- 418 Query: 1289 XXXXXXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GKS 1465 C K++SVS S+KEFNFD+ G+ G S S WW ++K+ GK Sbjct: 419 -------GKAIGGEHCCPKRNSVS--SSKEFNFDNRKGDVSGTSANGSSWWTNKKVDGKE 469 Query: 1466 IDAQKNWTFFPLLQPGVS 1519 + +W FFP+LQP +S Sbjct: 470 SKSVNSWAFFPMLQPDIS 487 >ref|XP_004512830.1| PREDICTED: uncharacterized protein LOC101494240 [Cicer arietinum] Length = 492 Score = 348 bits (894), Expect = 4e-93 Identities = 224/494 (45%), Positives = 270/494 (54%), Gaps = 71/494 (14%) Frame = +2 Query: 251 NNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALVPEPT 430 NNS+DTVN IVTAESRVQP + K+RWGSC++LS CFGS K+SKRI HA LVPEP Sbjct: 6 NNSIDTVNAAATAIVTAESRVQPSTSPKKRWGSCFSLSSCFGSHKSSKRIGHAVLVPEP- 64 Query: 431 VTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALSS 610 V PNPST IV+PFIAPPSSP SFLQSDPP++THSP L ++P A SS Sbjct: 65 VAPIVPVAHSAPNPSTVIVMPFIAPPSSPASFLQSDPPSSTHSPAAGL-LSPSVNAAYSS 123 Query: 611 SGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFAQXXX 787 SG A IFTIGPYA+ETQLVSPPVFS FTTEPSTA+ T PPE VQ TTPSSPEVPFAQ Sbjct: 124 SGSASIFTIGPYAYETQLVSPPVFSNFTTEPSTASFTPPPESVQMTTPSSPEVPFAQ-LL 182 Query: 788 XXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSVL 934 HK AL YE L+SPGSV STSGTS+P+ D+RS L Sbjct: 183 ASSLDRARKNNGSHKFALYNYEFQPYQQYPGSPGAQLVSPGSVISTSGTSTPFPDRRSSL 242 Query: 935 EFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA------------------------ 1042 E + K+ + FST +W SR+GSGSL PD + Sbjct: 243 ELSRGETPKILGFEHFSTRRWNSRIGSGSLTPDGAGQGSRLGSGSLTPDGFAHASRLGSG 302 Query: 1043 ---------------------GYVPYDSIPLENQISEVVSLANSETGSPIAEALIDHRVS 1159 G P ++NQISE VS+ANSE GS L+DHRVS Sbjct: 303 CTTPDGLGQDSRLGSGSLTPDGAGPTTRESMQNQISEDVSVANSEHGSQSNATLVDHRVS 362 Query: 1160 FELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQXXXXX 1300 FEL ED+ L QG +DP R + ++ C +CS+ Sbjct: 363 FELTGEDVARCLANKAGALLRNMSSSSQGILAKDPIDR-ERILKETNGCCDVCSR-KTND 420 Query: 1301 XXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKIGKSIDAQK 1480 C K++SVS S+KEFNFD+ G+ S S WW+++K+G+ Sbjct: 421 KSDNSCAGGEQCCQKRNSVS--SSKEFNFDNRKGDVSDTSANGSGWWSNKKVGEKEGRSV 478 Query: 1481 N-WTFFPLLQPGVS 1519 N W FFP+LQP +S Sbjct: 479 NSWAFFPMLQPDIS 492 >ref|XP_003549033.2| PREDICTED: uncharacterized protein LOC100806399 [Glycine max] Length = 515 Score = 337 bits (864), Expect = 1e-89 Identities = 216/497 (43%), Positives = 267/497 (53%), Gaps = 78/497 (15%) Frame = +2 Query: 251 NNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSK---RISHAALVP 421 N + +TV IV AESRVQP K+RWG CW+ WCFGS K+SK RI HA LVP Sbjct: 20 NATAETVFAAANAIVAAESRVQPTDAPKKRWGGCWSQYWCFGSRKSSKSSKRIGHAVLVP 79 Query: 422 EPTVTE--STSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSV 595 EP + +A A PNPST+IV+PFIAPPSSP SFLQSDPP+ SP GLL+++ L+ Sbjct: 80 EPAAPTGPAAAATAAAPNPSTAIVMPFIAPPSSPASFLQSDPPSGIQSPPGLLSLSALAA 139 Query: 596 NALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPF 772 NA SS G A +FTIGPYA+ETQLVSPPVFSAFTTEPSTA TPP E VQ TTPSSP+VPF Sbjct: 140 NAYSSGGPATMFTIGPYAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPF 199 Query: 773 AQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLD 919 AQ K L YE H LISPGS STSGTS+P+ D Sbjct: 200 AQLLASSLDRARKCNGH-QKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPD 258 Query: 920 KRSVLEFRMVDASKLFDSKKFSTCKWGSRLGS----------------GSLIPD------ 1033 + LEF + K+ + FST +WGSRLGS GSL PD Sbjct: 259 RPPTLEFPKGETPKILGVEHFSTRRWGSRLGSGSLTPDSAWQGSRLGSGSLTPDGVGLAS 318 Query: 1034 -------------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEA 1138 SAG ++I ++NQIS+ +LA+S+ G P Sbjct: 319 RLGSGCVTPDGLGQESRLGSGCLTPDSAGPTNQNNISVQNQISKEATLADSDNGHPSNAT 378 Query: 1139 LIDHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLC 1279 L+DHRVSFEL ED+ L G QG T+DP R + V +C C Sbjct: 379 LVDHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILTKDPVDR-ERVQIDTNSSCNAC 437 Query: 1280 SQXXXXXXXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI- 1456 ++ CLHK++SV+ S+KEFNFD+ G+ + EWW + K+ Sbjct: 438 TE-KTDDKPDNPVGKGEQCLHKQNSVN--SSKEFNFDNRKGDVSVTTGSGYEWWTNRKVA 494 Query: 1457 GKSIDAQKNWTFFPLLQ 1507 GK + +W FFP+LQ Sbjct: 495 GKEGRSANSWAFFPMLQ 511 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 337 bits (863), Expect = 1e-89 Identities = 214/477 (44%), Positives = 265/477 (55%), Gaps = 50/477 (10%) Frame = +2 Query: 239 TARVNNSVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSKRISHAALV 418 T +N++++T+N I +AE+RV +VQKRRWGSCW WCF S K KRI HA L Sbjct: 8 TRSMNSALETINAAATAIASAENRVPQPTVQKRRWGSCWGEYWCFRSPK-DKRIGHAVLA 66 Query: 419 PEPTVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVN 598 PE S A+N + +IVLPF+APPSSP SFLQS+PP+AT SP GLL++T ++ N Sbjct: 67 PESRAPGSGVPAAENLTQAPTIVLPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINAN 126 Query: 599 ALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEVPFA 775 S G A IF IGPYAHETQLVSPPVFS FTTEPSTA T PPE V TTPSSPEVPFA Sbjct: 127 IYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFA 186 Query: 776 QXXXXXXXXXXXXXXXXHKLALSCYE-----------VHHLISPGSVNSTSGTSSPYLDK 922 Q H+ LS YE V HLISP S S SGTSSP+ D+ Sbjct: 187 Q----LFDPNNRNGEAGHRFLLSQYEFQSYQLYPGSPVGHLISPSSGISGSGTSSPFPDR 242 Query: 923 RSV-------LEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPDYSA-----GYV----- 1051 V LEFR KL K S +WGSR+GSGS+ PD G V Sbjct: 243 DFVCSGSSQFLEFRAGGPPKLLTLDKLSNHEWGSRIGSGSITPDALGPPSRDGSVLDRQV 302 Query: 1052 -------PYDSIPLENQISEVVSLANSETGSPIAEALIDHRVSFELLQEDI-------PT 1189 D L+ QIS+V S + S++G P E ++DHRVSFEL ED+ Sbjct: 303 SDVIHPPSGDDSVLDRQISDVASHSLSDSGCPNNEIMVDHRVSFELTAEDVVRCVEKDSA 362 Query: 1190 ALVGGCQGT-RDPYT-RID----DVVPHKVKNCFLCSQXXXXXXXXXXXXXXXDCLHKKS 1351 ALV + ++P T ID +VV + HK+ Sbjct: 363 ALVKAVSASLQNPATVEIDENSREVVVDSEGRVGETANNPPEKAPEDANGEEGQPHHKQR 422 Query: 1352 SVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEK-IGKSIDAQKNWTFFPLLQPGVS 1519 S++LGSAKEFNFD++ G K N+SS+WWA+EK +GK + A KNW+ F ++QP VS Sbjct: 423 SITLGSAKEFNFDNADGGHSDKPNISSDWWANEKVVGKEVGASKNWSIFHMMQPSVS 479 >ref|XP_006589528.1| PREDICTED: uncharacterized protein LOC100798631 isoform X1 [Glycine max] Length = 504 Score = 335 bits (858), Expect = 6e-89 Identities = 218/504 (43%), Positives = 264/504 (52%), Gaps = 84/504 (16%) Frame = +2 Query: 248 VNNSVDTVNXXXXXIVTAESRVQPVS-VQKRRWGSCWNLSWCFGSLKTSKRISHAALVPE 424 VNN+VDTVN IV AESR+QP + V K+RWGSCW+L WCFG K SKR+ +A LVPE Sbjct: 4 VNNTVDTVNAAASAIVYAESRIQPTTTVPKKRWGSCWSLCWCFGPHKNSKRVGNAVLVPE 63 Query: 425 PTVTESTSAVADNP-----NPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPL 589 P E V +P NPST+IV+PFI PPSSP SFLQSDPP+AT SP GL +++ L Sbjct: 64 PV--EPIGPVGFHPATAAPNPSTAIVMPFIVPPSSPASFLQSDPPSATQSPVGLFSLSSL 121 Query: 590 SVNALSSSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAIT-PPEPVQFTTPSSPEV 766 +VNA S G A IF IGPY +ETQLVSPPVFS FTTEPSTA T PPE VQ TTPSSPEV Sbjct: 122 TVNA--SGGPASIFAIGPYTYETQLVSPPVFSTFTTEPSTAPFTPPPESVQLTTPSSPEV 179 Query: 767 PFAQXXXXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPY 913 PFAQ + ALS YE L+SP S+ STSG+S+P+ Sbjct: 180 PFAQLLASSLDRNCKSNGTNQRFALSNYEFQPYQQYPGSPGTQLVSPRSIISTSGSSTPF 239 Query: 914 LDKRSVLEFRMVDASKLFDSKKFSTCKWGSRLGSGSLIPD-------------------- 1033 D+ VLEF +A KL + F T KW SRLGSGSL PD Sbjct: 240 PDRHPVLEFHKGEAPKLLGFENFLTHKWNSRLGSGSLTPDSAGQGSRLGSGSFTPDAVKL 299 Query: 1034 -------------------YSAGYVPYDS--------IPLENQISEVVSLANSETGSPIA 1132 + +G + D+ I + QISEV S+ NSE Sbjct: 300 ASQLGSGCLTPDGLCQDSRFGSGSLTPDAVAPTARNDIDIGKQISEVTSIVNSENECQPK 359 Query: 1133 EALIDHRVSFELLQEDIPTALV------------GGCQGT--RDPYTRIDDVVPHKVKNC 1270 AL+DHRVSFEL D+P L G QGT DP I+ + + +C Sbjct: 360 AALVDHRVSFELTGVDVPRCLANKSGSSLLGNMSGSSQGTLVEDP-VDIEKIQKNSNSSC 418 Query: 1271 FLCSQXXXXXXXXXXXXXXXD----CLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEW 1438 CS+ + C K S S+KEFNFD+ G SS W Sbjct: 419 AFCSRKTSNASNDKSCNSPGEGAEQCCRKHH--SFNSSKEFNFDNRKGVVSDTPANSSNW 476 Query: 1439 WADEKI-GKSIDAQKNWTFFPLLQ 1507 W ++KI GK + +WTFFP+LQ Sbjct: 477 WTNKKIVGKEGRSSNSWTFFPMLQ 500 >ref|XP_003533172.2| PREDICTED: uncharacterized protein LOC100818313 isoform X1 [Glycine max] Length = 509 Score = 331 bits (849), Expect = 6e-88 Identities = 212/495 (42%), Positives = 264/495 (53%), Gaps = 78/495 (15%) Frame = +2 Query: 257 SVDTVNXXXXXIVTAESRVQPVSVQKRRWGSCWNLSWCFGSLKTSK---RISHAALVPEP 427 + +TV IV AESRVQP K+RWG CW+ WCFGS K+SK RI HA LVPEP Sbjct: 21 TAETVFAAANAIVAAESRVQPTDAPKKRWGGCWSQYWCFGSCKSSKSSKRIGHAVLVPEP 80 Query: 428 TVTESTSAVADNPNPSTSIVLPFIAPPSSPTSFLQSDPPTATHSPGGLLAITPLSVNALS 607 +A A PNPS +IV+PFIAPPSSP SFLQSDPP+ SP GLL+++ L+ NA S Sbjct: 81 AAPTGPAAAAAAPNPSAAIVMPFIAPPSSPASFLQSDPPSGIQSPPGLLSLSALAANAYS 140 Query: 608 SSGHAHIFTIGPYAHETQLVSPPVFSAFTTEPSTAAITPP-EPVQFTTPSSPEVPFAQXX 784 S G A +FTIGPYA+ETQLVSPPVFSAFTTEPSTA TPP E VQ TTPSSP+VPFAQ Sbjct: 141 SGGPASMFTIGPYAYETQLVSPPVFSAFTTEPSTAPYTPPPESVQQTTPSSPDVPFAQLL 200 Query: 785 XXXXXXXXXXXXXXHKLALSCYEVH-----------HLISPGSVNSTSGTSSPYLDKRSV 931 HK L YE H LISPGS STSGTS+P+ D+ Sbjct: 201 ASSLDRARKSNGN-HKFPLYNYEFHPYQQYPGSPGGQLISPGSAFSTSGTSTPFPDRPPT 259 Query: 932 LEFRMV--DASKLFDSKKFSTCKWGSRLGS----------------GSLIPD-------- 1033 LEF + ++ + FST +WGSRLGS GSL PD Sbjct: 260 LEFPFPKGETPRILGFEHFSTRRWGSRLGSGSLTPDGAWQGSRLGSGSLTPDGIGLASRL 319 Query: 1034 -----------------------YSAGYVPYDSIPLENQISEVVSLANSETGSPIAEALI 1144 SAG + ++I ++NQIS+ +LA+++ G LI Sbjct: 320 GSGCVTPDGLGLESRLGSGCLTPDSAGPINQNNISVQNQISKEATLADTDNGHSSNATLI 379 Query: 1145 DHRVSFELLQEDIPTALV-----------GGCQG--TRDPYTRIDDVVPHKVKNCFLCSQ 1285 DHRVSFEL ED+ L G QG ++DP R K+ C++ Sbjct: 380 DHRVSFELTGEDVARCLANKTGVLLRNMSGSSQGILSKDPVDR-----ERVQKDTDTCTE 434 Query: 1286 XXXXXXXXXXXXXXXDCLHKKSSVSLGSAKEFNFDSSTGETIGKSNLSSEWWADEKI-GK 1462 CLHK++SV+ S+KEFNFD+ G+ + SEWW + K+ GK Sbjct: 435 --KTDDKPDNSVGGEQCLHKQNSVN--SSKEFNFDNRKGDVSVTAGSGSEWWTNRKVAGK 490 Query: 1463 SIDAQKNWTFFPLLQ 1507 + +W FFP+LQ Sbjct: 491 EGRSANSWAFFPMLQ 505