BLASTX nr result
ID: Catharanthus23_contig00013248
seq
BLASTX 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= Catharanthus23_contig00013248
         (2020 letters)
Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value
gb|EOY30097.1| Hydroxyproline-rich glycoprotein family protein [...   352   3e-94
ref|XP_002516170.1| hypothetical protein RCOM_0708150 [Ricinus c...   351   8e-94
ref|XP_006474533.1| PREDICTED: dual specificity protein kinase s...   345   6e-92
ref|XP_006452941.1| hypothetical protein CICLE_v10008490mg [Citr...   340   2e-90
gb|EMJ25889.1| hypothetical protein PRUPE_ppa025477mg [Prunus pe...   326   3e-86
ref|XP_004291283.1| PREDICTED: uncharacterized protein LOC101308...   300   2e-78
ref|XP_006361526.1| PREDICTED: uncharacterized protein LOC102597...   289   3e-75
ref|XP_002308507.2| hydroxyproline-rich glycoprotein [Populus tr...   273   2e-70
gb|EXB74925.1| hypothetical protein L484_018634 [Morus notabilis]     264   1e-67
ref|XP_004501405.1| PREDICTED: epidermal growth factor receptor ...   243   3e-61
ref|XP_003526088.1| PREDICTED: transcription factor btd-like [Gl...   241   9e-61
ref|XP_003523660.1| PREDICTED: myb-like protein I-like isoform X...   236   4e-59
gb|ESW08909.1| hypothetical protein PHAVU_009G084600g [Phaseolus...   234   8e-59
ref|XP_003603253.1| hypothetical protein MTR_3g105580 [Medicago ...   207   1e-50
ref|XP_006843859.1| hypothetical protein AMTR_s00007p00264200 [A...   205   7e-50
emb|CBI20749.3| unnamed protein product [Vitis vinifera]              185   7e-44
ref|XP_004245206.1| PREDICTED: uncharacterized protein LOC101266...   179   3e-42
ref|XP_004498967.1| PREDICTED: uncharacterized protein LOC101502...   162   7e-37
ref|XP_002465160.1| hypothetical protein SORBIDRAFT_01g033040 [S...
147 2e-32 gb|EMS49134.1| hypothetical protein TRIUR3_24556 [Triticum urartu] 140 2e-30 >gb|EOY30097.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao] Length = 415 Score = 352 bits (904), Expect = 3e-94 Identities = 209/425 (49%), Positives = 252/425 (59%), Gaps = 8/425 (1%) Frame = -1 Query: 1351 GRSSKQGSKEGLDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEG 1172 GRSSKQ +KEG+++DK+ KLK+EPHLSGAYIRSLVKQLTS+RTKDP+N K+ S++ +G Sbjct: 3 GRSSKQETKEGMEIDKISKLKEEPHLSGAYIRSLVKQLTSSRTKDPMNPKDPGSVDA-DG 61 Query: 1171 FSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVT 992 F+ QNL K G+GFSE VRRRLHTSRPYQERLLNMAEARREIVT Sbjct: 62 FNG-QNLAKFGEGFSETPQTQQPQPPQQQKKQ-VRRRLHTSRPYQERLLNMAEARREIVT 119 Query: 991 ALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTS 812 ALKFHRAAMK K KSRRN R S+T+ Sbjct: 120 ALKFHRAAMKQANEQQQQQGLQQQQSSETSHLSSPPPFEQES-KKKSRRNPRIYP-SNTN 177 Query: 811 NNLTNYVDNFSYQAF--------PCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGX 656 N T ++NFSY + P PP PYS W P+ P + LNF LPNQ LG Sbjct: 178 NFSTYNLENFSYSSCSQRYPPLPPPPPPNPYS-WPA-SPIPFPSATDTLNFTLPNQPLGL 235 Query: 655 XXXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQEGIPMMVT 476 YH +NNP+ IY L EEV S I E P + Sbjct: 236 NLNFHDFNNIDTTLYHNSNNPS-IYSSSSPSSSSSPTLSVVTEEVASAAISHEVGPTAMA 294 Query: 475 NNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEID 296 + E+ G GLH A++D+EMAEIRS+GEQHQMEWNDT+NLVTSAWWFKFLKTME+ Sbjct: 295 DLAESY----GGGGLHQAINDEEMAEIRSLGEQHQMEWNDTMNLVTSAWWFKFLKTMELG 350 Query: 295 PQEQSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXX 116 P+ ++ED PFD++MEFP+WLNAN+ CL+QH +D D+Y QDPALPC Sbjct: 351 PEVKAEDDGYQPFDQVMEFPAWLNANDSCLQQHFNDLCPDDYFQDPALPCMDIGEIEGMD 410 Query: 115 XEWLS 101 EWL+ Sbjct: 411 GEWLA 415 >ref|XP_002516170.1| hypothetical protein RCOM_0708150 [Ricinus communis] gi|223544656|gb|EEF46172.1| hypothetical protein RCOM_0708150 [Ricinus communis] Length = 425 Score = 351 bits (900), Expect = 8e-94 Identities = 206/433 (47%), Positives = 255/433 (58%), Gaps = 16/433 (3%) 
Frame = -1 Query: 1351 GRSSKQGSKEGLDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEG 1172 GRSSKQ ++EG+++DKL+KLK+EPHLSGAYIRSLVKQLTS+RTKDP+NSK++ ++ + Sbjct: 3 GRSSKQETREGMEIDKLNKLKEEPHLSGAYIRSLVKQLTSSRTKDPMNSKDRSCVDD-DS 61 Query: 1171 FSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVT 992 FS QN+ K G+G SEN QVRRRLHTSRPYQERLLNMAEARREIV Sbjct: 62 FSG-QNMAKFGEGVSENQQTQQPQQPQQQHRKQVRRRLHTSRPYQERLLNMAEARREIVA 120 Query: 991 ALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV------KAKSRRNSRTS 830 ALKFHRA+MK K KSRRN R Sbjct: 121 ALKFHRASMKQANEQQQQHHQQQEQQQQQHQQQSLSVQLSPPPCFEQEGKMKSRRNPRIY 180 Query: 829 AASDTSNNLTNYVDNFSYQAFP--------CPPNYPYSSWSTFPPVASPFQPENLNFALP 674 ++ N +NY+D+ S +F PP YP+ W T PPV ENLNF LP Sbjct: 181 PSNTA--NFSNYLDSVSCTSFSHAPPPPSASPPPYPFC-WPT-PPVLPSTINENLNFPLP 236 Query: 673 NQTLGXXXXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXP--LQPAMEEVPSLVIPQ 500 NQTLG YH +NNP+++Y A E+VPS+ Q Sbjct: 237 NQTLGLNLNFQDFNDLDTSLYHNSNNPSSVYSSSSPSSFSSPSPSFSIATEDVPSVAKSQ 296 Query: 499 EGIPMMVTNNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFK 320 EG+P + + E+ G GLH +DD+EMAE+RSIGEQHQMEWNDT+NLVTSAWWFK Sbjct: 297 EGMPPAICDLTESY----GGGGLHQVVDDEEMAEMRSIGEQHQMEWNDTMNLVTSAWWFK 352 Query: 319 FLKTMEIDPQEQSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXX 140 FLKTM+ + ++ED PFDE+MEFPSWLNAN+ CL+QH DY +++Y DP+L C Sbjct: 353 FLKTMDSGHEVKTEDDGYQPFDEVMEFPSWLNANDACLQQHFDDYSTEDYYHDPSLRCMD 412 Query: 139 XXXXXXXXXEWLS 101 EWLS Sbjct: 413 IGEIEGMGGEWLS 425 >ref|XP_006474533.1| PREDICTED: dual specificity protein kinase splA-like isoform X1 [Citrus sinensis] gi|568841172|ref|XP_006474534.1| PREDICTED: dual specificity protein kinase splA-like isoform X2 [Citrus sinensis] gi|568841174|ref|XP_006474535.1| PREDICTED: dual specificity protein kinase splA-like isoform X3 [Citrus sinensis] Length = 403 Score = 345 bits (884), Expect = 6e-92 Identities = 218/422 (51%), Positives = 251/422 (59%), Gaps = 5/422 (1%) Frame = -1 Query: 1351 GRSSKQGSKEGLDMD-KLDKLK-QEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVV 1178 GRSSKQ +KEG+++D 
KL KL+ +PHLSGAYIRSLVKQLTS+RTKDP++ K+ D Sbjct: 3 GRSSKQETKEGMELDSKLSKLRGDQPHLSGAYIRSLVKQLTSSRTKDPMSPKDPD----F 58 Query: 1177 EGFS-SYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARRE 1001 +G S S Q LTK G+GFSE QVRRRLHTSRPYQERLLNMAEARRE Sbjct: 59 DGDSVSSQKLTKFGEGFSE-IPETQQPQQPQQHKKQVRRRLHTSRPYQERLLNMAEARRE 117 Query: 1000 IVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAAS 821 IVTALKFHRAAMK E K KSRRN R ++ Sbjct: 118 IVTALKFHRAAMKQASEQQQQQEQQQQLRQSQPLHLSTQPCFEQEGKLKSRRNPRIYPSN 177 Query: 820 DTSNNLTNYVDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXX 641 + NFSY +F CPP SW V S F PE LNF LPNQTLG Sbjct: 178 ---------IANFSYSSFSCPPPPNSYSWPA-SQVPSAF-PEALNFPLPNQTLGLNLNLH 226 Query: 640 XXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQE-GIPMMVTNNEE 464 Y+ +NNP+ IY PL A EE P I Q+ G P +TN + Sbjct: 227 DFNNLDTTIYNNSNNPS-IYSYSSPSSSSSPPLSVATEEHPFTAISQDMGGPTAMTNVLD 285 Query: 463 TSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDPQE- 287 +S G +GLH A+ D+EMAEIRSIGEQHQMEWND +NLVTSAWWFKFLK ME P+E Sbjct: 286 SS----GGIGLHPALGDEEMAEIRSIGEQHQMEWNDKMNLVTSAWWFKFLKNMEPGPEEM 341 Query: 286 QSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEW 107 SED HPFDE+MEFP+WLNANE CL+QH +DY D+Y QDPALPC EW Sbjct: 342 NSEDDGFHPFDEVMEFPAWLNANESCLQQHFNDYCPDDYFQDPALPCMDIGEFEGMDSEW 401 Query: 106 LS 101 LS Sbjct: 402 LS 403 >ref|XP_006452941.1| hypothetical protein CICLE_v10008490mg [Citrus clementina] gi|557556167|gb|ESR66181.1| hypothetical protein CICLE_v10008490mg [Citrus clementina] Length = 407 Score = 340 bits (871), Expect = 2e-90 Identities = 219/426 (51%), Positives = 252/426 (59%), Gaps = 9/426 (2%) Frame = -1 Query: 1351 GRSSKQGSKEGLDMD-KLDKLK-QEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVV 1178 GRSSKQ +KEG+++D KL KL+ +PHLSGAYIRSLVKQLTS+RTKDP++ K+ D Sbjct: 3 GRSSKQETKEGMELDSKLSKLRGDQPHLSGAYIRSLVKQLTSSRTKDPMSPKDPD----F 58 Query: 1177 EGFS-SYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARRE 1001 +G S S QNLTK G+GFSE QVRRRLHTSRPYQERLLNMAEARRE Sbjct: 59 
DGDSVSSQNLTKFGEGFSE-IPETQQPQQPQQHKKQVRRRLHTSRPYQERLLNMAEARRE 117 Query: 1000 IVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV---KAKSRRNSRTS 830 IVTALKFHRAAMK K KSRRN R Sbjct: 118 IVTALKFHRAAMKQASEQQHQQEQAEQQQQLQQSQPLHLSTQPCFEQEGKLKSRRNPRIY 177 Query: 829 AASDTSNNLTNYVDNFSYQAFPCPPNYPYS-SWSTFPPVASPFQPENLNFALPNQTLGXX 653 ++ + NFSY +F C P P S SW V S F PE LNF LPNQTLG Sbjct: 178 PSN---------IANFSYSSFSCRPPPPNSYSWPA-SQVPSAF-PEALNFPLPNQTLGLN 226 Query: 652 XXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQE-GIPMMVT 476 Y+ +NNP+ IY PL A EE P I Q+ G P +T Sbjct: 227 LNLHDFNNLDTTIYNNSNNPS-IYSYSSPSSSSSPPLSVATEEHPFTAISQDMGGPTAMT 285 Query: 475 NNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEID 296 N ++S G +GLH A+ D+EMAEIRSIGEQHQMEWND +NLVTSAWWFKFLK ME Sbjct: 286 NVLDSS----GGIGLHPALGDEEMAEIRSIGEQHQMEWNDKMNLVTSAWWFKFLKNMEPG 341 Query: 295 PQE-QSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXX 119 P+E SED HPFDE+MEFP+WLNANE CL+QH +DY D+Y QDPALPC Sbjct: 342 PEEMNSEDDGFHPFDEVMEFPAWLNANESCLQQHFNDYCPDDYFQDPALPCMDIGEFEGM 401 Query: 118 XXEWLS 101 EWLS Sbjct: 402 DSEWLS 407 >gb|EMJ25889.1| hypothetical protein PRUPE_ppa025477mg [Prunus persica] Length = 421 Score = 326 bits (835), Expect = 3e-86 Identities = 202/427 (47%), Positives = 246/427 (57%), Gaps = 10/427 (2%) Frame = -1 Query: 1351 GRSSKQGSKEGLDMDKLDKLKQEPH-LSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVE 1175 GRSSKQ +K+ +++ KL KLK+EPH LSGAYIRSLVKQLTS+RTKDP+N + + + Sbjct: 4 GRSSKQETKDLMELGKLSKLKEEPHHLSGAYIRSLVKQLTSSRTKDPMNPNKDLDCDALP 63 Query: 1174 GFSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIV 995 QN+ K G+GFSE QVRRRLHTSRPYQERLLNMAEARREIV Sbjct: 64 N----QNMAKYGEGFSETQQHTQQPHQPQQHKKQVRRRLHTSRPYQERLLNMAEARREIV 119 Query: 994 TALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV-KAKSRRNSRTSAASD 818 TALKFHRAAMK + + KSRRN R +S Sbjct: 120 TALKFHRAAMKQASEQQQQQHQDQEQQPQSQQLQPQPHPCFEQEGRTKSRRNPRIYPSST 179 Query: 817 TSNNLT-NYVDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXX 641 + T + +FS+ +P PN PYS W T +A P PEN +F LPNQTLG Sbjct: 
180 ANYPETLPFPSDFSHHQYPSVPN-PYS-W-TASTIALP--PENFDFTLPNQTLGLNLNFQ 234 Query: 640 XXXXXXXXFYHCNNNP---TTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQEGIPM-MVTN 473 YH +N+P +T + E S I Q + VT+ Sbjct: 235 DFNNINTTLYHSSNSPPFYSTSASSPSSSSSPGLSVATDQEIPGSAAISQMEVEAPAVTD 294 Query: 472 NEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDP 293 ++ + SG GLH AMDD+EMAEIRSIGEQHQMEWNDT+NLVTSAWWFKFLKTME+ Sbjct: 295 VTDSGITISGGGGLHAAMDDEEMAEIRSIGEQHQMEWNDTMNLVTSAWWFKFLKTMELGG 354 Query: 292 QE---QSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXX 122 E + ++ HPFDE MEFP+WLNANE C + HL+DYYS++Y QDPALPC Sbjct: 355 PEGKPEDDNVWYHPFDEAMEFPAWLNANESCFQHHLNDYYSEDYFQDPALPCMDIGEIEG 414 Query: 121 XXXEWLS 101 EWL+ Sbjct: 415 IDGEWLA 421 >ref|XP_004291283.1| PREDICTED: uncharacterized protein LOC101308340 [Fragaria vesca subsp. vesca] Length = 414 Score = 300 bits (767), Expect = 2e-78 Identities = 198/433 (45%), Positives = 239/433 (55%), Gaps = 17/433 (3%) Frame = -1 Query: 1348 RSSKQGSKEGLDMDKLDKLKQEPH-LSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEG 1172 RSSKQ + + +++D L KLK+EPH LSGAYIRSLVKQLTS+RTKD +NSK D L+ Sbjct: 5 RSSKQETDQLMEIDDLSKLKEEPHHLSGAYIRSLVKQLTSSRTKDTMNSK--DLLDCG-- 60 Query: 1171 FSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVT 992 TK +GFSE VRRRLHTSRPYQERLLNMAEAR+EIVT Sbjct: 61 -------TKFREGFSETQITQQPQQPQQHKKQ-VRRRLHTSRPYQERLLNMAEARKEIVT 112 Query: 991 ALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTS 812 ALKFHRAAMK E + KSRRN R ++ T+ Sbjct: 113 ALKFHRAAMKQASEQQQQHQEQEQQPQSQEMLPQPHPCSEPEGRIKSRRNPRIYPSNSTN 172 Query: 811 NNLT-NYVDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPENLNF-ALPNQTLGXXXXXXX 638 T Y NFS+Q P + +SS S PP P EN +F ALP+QTLG Sbjct: 173 YLETLPYSSNFSHQYPSVPNQFFWSSASRIPP---PPLYENFDFTALPSQTLGLNLNFQD 229 Query: 637 XXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQE--------GIPMM 482 YH +N+P +Y P+ PSL I + GI M Sbjct: 230 FNNISTTLYHNSNSPP-VYSTSGSVSFSASASSPSSSSSPSLSIATDHELPGSSAGISHM 288 Query: 481 ------VTNNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFK 320 V + E++ + SG GLH 
AMDD+EMAEIRSIGEQHQMEWNDT+NLVTSAWWFK Sbjct: 289 EMEAPAVADAEDSGITTSGGDGLHAAMDDEEMAEIRSIGEQHQMEWNDTMNLVTSAWWFK 348 Query: 319 FLKTMEIDPQEQSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXX 140 FLK + +D N PFDE++EFP+W +ANEG +QHL+DYYS +Y QDPALPC Sbjct: 349 FLKA-------EDDDGYNLPFDEVLEFPAWSSANEGSFQQHLNDYYSADYFQDPALPCMD 401 Query: 139 XXXXXXXXXEWLS 101 EWL+ Sbjct: 402 IGEFEGFDGEWLA 414 >ref|XP_006361526.1| PREDICTED: uncharacterized protein LOC102597274 [Solanum tuberosum] Length = 378 Score = 289 bits (740), Expect = 3e-75 Identities = 193/426 (45%), Positives = 231/426 (54%), Gaps = 9/426 (2%) Frame = -1 Query: 1351 GRSS-KQGSKEGLDMDKLDKLKQEP-HLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVV 1178 GRSS KQG +++ K+DKLKQEP HLSGAYIRSLVKQLTS+RTKDPLN K+ D Sbjct: 3 GRSSPKQGGV--MEVAKVDKLKQEPPHLSGAYIRSLVKQLTSSRTKDPLNPKDHDD---- 56 Query: 1177 EGFSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQ--VRRRLHTSRPYQERLLNMAEARR 1004 S +N TK D + + VRRRLHTSRPYQERLLNMAEARR Sbjct: 57 ----SLENSTKLDDTLCSDTQLSSPQPAAPPQIKKKQVRRRLHTSRPYQERLLNMAEARR 112 Query: 1003 EIVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAA 824 EIVTALKFHRA+MK K KSRRN R A+ Sbjct: 113 EIVTALKFHRASMKQQQQSHTEPQSLQTWPQESLGEEQG--------KPKSRRNPRIYAS 164 Query: 823 SDTSNNLTNY-VDNFSYQAFPCPPNYPYSSWSTFPPVAS--PFQPENLNFALPNQTLGXX 653 + +NNL Y ++NFS FPC P YPYS PV+S PFQ +N+NF LPNQTLG Sbjct: 165 NTMANNLPCYNMENFSNSTFPCLPQYPYSC-----PVSSQLPFQ-DNMNFPLPNQTLGLN 218 Query: 652 XXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQEGIPMMVTN 473 Y +N +I+ + +E + I MV + Sbjct: 219 LNFHDFNNLDATPYCSISNKNSIFSSSS---------SSSSDEFHYVGEGVGPIGEMVNS 269 Query: 472 NEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDP 293 LH +MDD+EM EIRS+G+QH+MEWNDTLNL TSAWWFKFLK MEI P Sbjct: 270 R------------LHPSMDDQEMEEIRSVGKQHEMEWNDTLNLATSAWWFKFLKAMEIGP 317 Query: 292 QEQ--SEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXX 119 E+ +ED+ +PFDE+MEFP W N NE CL+QH+ D YS P LPC Sbjct: 318 DEKNIAEDYGCYPFDEVMEFPPWFNPNETCLQQHVDDTYS-----HPTLPCMDIEGIEGM 372 Query: 118 XXEWLS 101 EWL+ Sbjct: 373 DVEWLA 378 
>ref|XP_002308507.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] gi|550336935|gb|EEE92030.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 333 Score = 273 bits (699), Expect = 2e-70 Identities = 174/421 (41%), Positives = 218/421 (51%), Gaps = 5/421 (1%) Frame = -1 Query: 1348 RSSKQGSKEGLDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGF 1169 RSSKQ +KEG++ KL KLK+EPHLSGAYIRSLVKQLTS+RTKDP+N K S + Sbjct: 4 RSSKQETKEGMETGKLRKLKEEPHLSGAYIRSLVKQLTSSRTKDPMNPKGHGSAD----- 58 Query: 1168 SSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTA 989 DG S+N VRRRLHTSRPYQERLLNMAEARREIVTA Sbjct: 59 ---------SDGLSKNQKSQQPQEPQPHKKQ-VRRRLHTSRPYQERLLNMAEARREIVTA 108 Query: 988 LKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTSN 809 LK RN R ++ T Sbjct: 109 LK---------------------------------------------RNPRIYPSNST-- 121 Query: 808 NLTNYVDNFSYQAF----PCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXX 641 + +NY+DNFSY+ F PCPP YP+S W + + P EN+NF LPNQTLG Sbjct: 122 DFSNYLDNFSYKPFTPPPPCPPPYPFS-WPS-SSILDPTTAENINFPLPNQTLGLNLNFH 179 Query: 640 XXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQEGIPMMVTNNEET 461 Y+ ++NP ++Y A EE+PS+ EG+P Sbjct: 180 DFNNIDTTLYYSSDNPPSVYSSSSPSSSSFPSPFIATEEIPSVSNTCEGMP--------P 231 Query: 460 SMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDPQEQS 281 + F+ D S GEQHQMEWNDT+NLVTSAWWFKF+KT +DP+ +S Sbjct: 232 AAFDETD----------------SYGEQHQMEWNDTMNLVTSAWWFKFMKTTGLDPEVKS 275 Query: 280 -EDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWL 104 ED HPF+++MEFP+WLNAN+ +QH +D++S +Y D ALPC EWL Sbjct: 276 TEDDGCHPFEQVMEFPAWLNAND---QQHFNDHFSQDYFHDAALPCMDIGEIEGIDGEWL 332 Query: 103 S 101 + Sbjct: 333 A 333 >gb|EXB74925.1| hypothetical protein L484_018634 [Morus notabilis] Length = 432 Score = 264 bits (674), Expect = 1e-67 Identities = 187/432 (43%), Positives = 227/432 (52%), Gaps = 31/432 (7%) Frame = -1 Query: 1351 GRSSKQGSKEGLDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEG 1172 GRSS + +G+++DK+ KLK+EPHLSGAYIRSLVKQLTS++TKD +N +D+ + V 
Sbjct: 4 GRSSSKHETKGMEIDKISKLKEEPHLSGAYIRSLVKQLTSSKTKDSMNINPKDNPDCVVL 63 Query: 1171 FSSY------QNLTKTGD-GFSENXXXXXXXXXXXXXXXQ---VRRRLHTSRPYQERLLN 1022 + QNLTK G+ G SE VRRRLHTSRPYQERLLN Sbjct: 64 VDGHHQSFPGQNLTKLGEAGLSETQQQTQQTQQTQRPQQHKKQVRRRLHTSRPYQERLLN 123 Query: 1021 MAEARREIVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV----KAK 854 MAEARREIVTALKFHRAAMK K K Sbjct: 124 MAEARREIVTALKFHRAAMKQANEQQQQQQEQQQPPSPQQPQSQLLQPPLHFEQQEGKVK 183 Query: 853 SRRN-----SRTSAASDTSNNLTNYVDNF-SYQAFPCPPNYPYSSWSTFPPVASPF-QPE 695 RRN S TS S+ +NL++Y NF S+ PP P S P PF + Sbjct: 184 HRRNHRIYPSNTSIFSNYLDNLSSYSSNFSSHHNHQLPPPNPLFWPSNSPISTLPFGGDQ 243 Query: 694 NLNFALPNQTLGXXXXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQP---AMEE 524 NLNF LPNQ LG + +N P A+ + Sbjct: 244 NLNFTLPNQALGLNLNFHDFNNIDTTSLYHHNPSFHSNSASASASSPSSSSSPTISAVTD 303 Query: 523 VPSLVIPQEGIPMMVTNNE--ETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTL 350 V P P V+ + ++S S GLH AMDD+EMAEIRSIGEQHQMEWNDT+ Sbjct: 304 YDPYVSPAMAGPAAVSPADFIDSSATTSSGGGLHAAMDDEEMAEIRSIGEQHQMEWNDTM 363 Query: 349 NLVTSAWWFKFLKTMEIDPQEQSEDFVN---HPF-DEIMEFPSWLNANEGCLEQHLSDYY 182 NLVTSA+W KFLK E D + D +N +PF D+ MEFP+WL+ANE +QH DYY Sbjct: 364 NLVTSAYWLKFLKA-EGDHHDDDGDGINGFKYPFDDQFMEFPAWLSANE---QQHFDDYY 419 Query: 181 -SDEYMQDPALP 149 SD+Y Q PALP Sbjct: 420 CSDDYFQSPALP 431 >ref|XP_004501405.1| PREDICTED: epidermal growth factor receptor substrate 15 homolog [Cicer arietinum] Length = 405 Score = 243 bits (619), Expect = 3e-61 Identities = 167/419 (39%), Positives = 225/419 (53%), Gaps = 13/419 (3%) Frame = -1 Query: 1318 LDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTG 1139 +++DKL KL++EP LS AY++SL+KQL ++R KD +N K QD + V +G QNL+ Sbjct: 1 MELDKLCKLEEEPPLSSAYMQSLMKQLNTSRAKDLMNPKGQDFV-VNDGIFVGQNLSSRK 59 Query: 1138 DGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKX 959 +G + QVRRRLHTS+PYQERLLNMAEARREIVTALKFHRAAMK Sbjct: 60 NG--KVVDAKQRSTKPQQHKKQVRRRLHTSKPYQERLLNMAEARREIVTALKFHRAAMKE 117 Query: 958 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV----KAKSRRNSRTSAASDTSNNLTNY- 794 + 
KSRRN R + T+ N ++Y Sbjct: 118 ASEQQKQQQQEPQEQLQRPSLSSQPSHPHPSFEQDGRYKSRRNPRIYPSCTTTTNSSSYF 177 Query: 793 VDNFSYQAFP-CPPNYPYS-SWSTFPPVA-SPFQPENLNFALPNQTLGXXXXXXXXXXXX 623 +D FSY PP+ P S +W +A P EN NF LPNQTLG Sbjct: 178 LDGFSYPYLSHTPPSVPNSYTWPVASSIAPPPLLAENPNFILPNQTLGLNLNFHDFNNLD 237 Query: 622 XXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQ-EGIPMMV---TNNEETSM 455 + NN ++ P ++EVPS+ Q EG ++V +++ T + Sbjct: 238 VTVH--LNNTSSSSSSSYSSQSSGTSSSPPLQEVPSVGTSQVEGFSLLVDTIDSHDATHV 295 Query: 454 FESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEID-PQEQSE 278 + GLH +MDD+ MAEIRS+G+Q+QMEWNDT+NLV SA WFK+LK ME P+ + E Sbjct: 296 TK----GLHTSMDDEAMAEIRSLGDQYQMEWNDTMNLVKSACWFKYLKHMEHGAPETKDE 351 Query: 277 DFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 + H D+ +EFP+WLNAN+ CLEQ SD+Y +D +LPC +WL+ Sbjct: 352 NDAYHNLDQHLEFPAWLNANDSCLEQ-----CSDDYFRDSSLPCMDIGDIDGMDDDWLA 405 >ref|XP_003526088.1| PREDICTED: transcription factor btd-like [Glycine max] Length = 388 Score = 241 bits (615), Expect = 9e-61 Identities = 168/408 (41%), Positives = 214/408 (52%), Gaps = 2/408 (0%) Frame = -1 Query: 1318 LDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTG 1139 +++D+L KL +EP LSGA IR LVKQLT++RTKD +N K Q+ + V G S QN TK G Sbjct: 1 MELDRLSKL-EEPPLSGACIRGLVKQLTTSRTKDFMNPKYQNCV-VNGGVSHGQNSTKHG 58 Query: 1138 DGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKX 959 + QVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMK Sbjct: 59 K--VADTRQAQQSSQPQQQKKQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKE 116 Query: 958 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTSNNLTNYVDNFS 779 + K KSRRN R A T + +D+ S Sbjct: 117 ASEQKQQQAQEQQQRPSVSLQPSQFPSFDQDGKFKSRRNPRIYPACTTKFSSYMDMDDLS 176 Query: 778 YQAFPCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXXXXXXXXXXFYHCNN 599 P +Y + + S P P EN NF LPNQTLG + N+ Sbjct: 177 QPPPLVPNSYTWPAASPITP--PPLMAENPNFVLPNQTLGLNLNLQDFNNLDATLHLNNS 234 Query: 598 NPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQ-EGIPMMVTNNEETSMFESGDLGLHHA 422 + ++ Y +E+PS+ I Q EG +V E + + GLH A Sbjct: 235 
SSSSSYSSATSSSPP--------QELPSVGISQGEGFSSLVDTIESNAATQVTG-GLHTA 285 Query: 421 MDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDPQE-QSEDFVNHPFDEIM 245 MDD+ MAE+RS+GEQ+QMEWNDT+NLV SA WFK+LK ME E + D + FD+++ Sbjct: 286 MDDEGMAEMRSLGEQYQMEWNDTMNLVKSACWFKYLKNMEHRAHEANTVDVAYNNFDQLL 345 Query: 244 EFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 EFP+WLNANE CLEQ DY+ Q+ +LPC +WL+ Sbjct: 346 EFPAWLNANESCLEQCSVDYF-----QESSLPCMDIGDIDSMDDDWLA 388 >ref|XP_003523660.1| PREDICTED: myb-like protein I-like isoform X1 [Glycine max] gi|571449320|ref|XP_006578105.1| PREDICTED: myb-like protein I-like isoform X2 [Glycine max] Length = 396 Score = 236 bits (601), Expect = 4e-59 Identities = 172/417 (41%), Positives = 220/417 (52%), Gaps = 11/417 (2%) Frame = -1 Query: 1318 LDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTG 1139 +++D+L KL +EP LSGA IR LVKQLT++RTK +N K+Q+ + V G S QN TK G Sbjct: 1 MELDRLSKL-EEPPLSGACIRGLVKQLTTSRTKHFMNPKDQNCV-VNGGVSHGQNSTKHG 58 Query: 1138 DGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKX 959 + QVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMK Sbjct: 59 K--VADARQAQQSSQPQQQKKQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKE 116 Query: 958 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEV--KAKSRRNSRTSAASDTSNNLTNYVDN 785 K KSRRN R A T+ + +D+ Sbjct: 117 ASEQKQQQQQAQEQQQRPSVSLQPSQFPSFNQDGKFKSRRNPRIYPACTTNFSSYMDMDD 176 Query: 784 FSYQAF----PCPPN---YPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXXXXXXX 626 S+ P PN +P +S T PP+ + EN NF LPNQTLG Sbjct: 177 LSHSCLSQRPPLVPNSYTWPAASQITPPPLMA----ENPNFVLPNQTLGLNLNLQDFNNL 232 Query: 625 XXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQ-EGIPMMVTNNEETSMFE 449 H NN+ ++ A E+PS+ I Q EG+ +V E + + Sbjct: 233 DATL-HLNNSSSS------SSPYSSATSSSAPLELPSVGISQGEGLSSLVDTIESNAATQ 285 Query: 448 SGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTME-IDPQEQSEDF 272 GLH AMDD+ MAEIRS+GEQ+QMEWNDT+NLV SA WFK+LK ME P+ + D Sbjct: 286 VTG-GLHTAMDDEGMAEIRSLGEQYQMEWNDTMNLVKSACWFKYLKNMEHRAPEANTVDV 344 Query: 271 VNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 + FD+++EFP+WLNAN+ CLEQ DY+ 
Q+ +LPC +WL+ Sbjct: 345 AYNNFDQLLEFPAWLNANDSCLEQCSVDYF-----QESSLPCLDIGDIESMDDDWLA 396 >gb|ESW08909.1| hypothetical protein PHAVU_009G084600g [Phaseolus vulgaris] Length = 384 Score = 234 bits (598), Expect = 8e-59 Identities = 172/411 (41%), Positives = 216/411 (52%), Gaps = 5/411 (1%) Frame = -1 Query: 1318 LDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTG 1139 +++DKL KL+Q P L AYIRSLVKQLT++ TK +N K+Q+ VV G S+ + +K G Sbjct: 1 MELDKLSKLEQPP-LPSAYIRSLVKQLTTSGTKVSVNPKDQNF--VVNGGVSHGHNSKHG 57 Query: 1138 DGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKX 959 + QVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMK Sbjct: 58 K--VADACRATQSTQPQQHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKE 115 Query: 958 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTSNNLTNYVDNFS 779 VK KSRRN R A ++N + Y D+ S Sbjct: 116 ASEQKQQQQKQTQEQQRRASVSLRPDQD---VKFKSRRNQRIYPAC--TSNFSGYKDDLS 170 Query: 778 YQAFPCPP-NYP-YSSWSTFPPVASP-FQPENLNFALPNQTLGXXXXXXXXXXXXXXFYH 608 Y PP + P + +W + P+ P EN NF LPNQTLG Sbjct: 171 YSCLSQPPPSVPKFYTWPSASPITPPHLMAENPNFILPNQTLGLNLNFQDFN-------- 222 Query: 607 CNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQ-EGIPMMVTNNEETSMFESGDLGL 431 N + T + EE+PSL I Q EG M + E++ +GL Sbjct: 223 -NLDATFLLNNSSSSSNSSATSSSPPEELPSLGISQGEGFYSM-GDTVESNAATQVTVGL 280 Query: 430 HHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTME-IDPQEQSEDFVNHPFD 254 H AMDD+ MAEIRS+GEQ+QMEWNDT+NLV SA WFKFLK ME P+ ++ D H FD Sbjct: 281 HTAMDDEGMAEIRSLGEQYQMEWNDTMNLVKSACWFKFLKNMENRAPEAKNGDDAYHNFD 340 Query: 253 EIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 +++EFP+WL NE CLEQ DY+ QD +LP +WL+ Sbjct: 341 QLLEFPAWL--NESCLEQCSVDYF-----QDSSLPRMDIGDFDSMDDDWLA 384 >ref|XP_003603253.1| hypothetical protein MTR_3g105580 [Medicago truncatula] gi|355492301|gb|AES73504.1| hypothetical protein MTR_3g105580 [Medicago truncatula] Length = 376 Score = 207 bits (528), Expect = 1e-50 Identities = 153/397 (38%), Positives = 198/397 (49%), Gaps = 12/397 (3%) Frame = -1 Query: 1255 SLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTGDGFSENXXXXXXXXXXXXXXX 
1076 SLVKQL ++R KD +N K Q + V +G S QNL+K G + Sbjct: 5 SLVKQLNTSRAKDLMNPKNQHCV-VNDGVSVGQNLSKIGK-VVDAQHAQKRSTQPQEYKK 62 Query: 1075 QVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXX 896 QVRRRLHT +PYQERLLNMAEARREIVTALKFHRA+MK Sbjct: 63 QVRRRLHTVKPYQERLLNMAEARREIVTALKFHRASMKEASEQQKQQQQEAQEQQQRQSP 122 Query: 895 XXXXXXXXXEV---KAKSRRNSRTSAASDTSNNLTNYVDNFSYQAFP-CPPNYPYS-SWS 731 + KSRRN R + T N ++Y+++FSY PP+ P S +W Sbjct: 123 SPQPSQHQSFDQDGRYKSRRNPRIYPSCTTKTNSSSYLNDFSYPYLSHIPPSVPNSHTW- 181 Query: 730 TFPPVASPFQP-----ENLNFALPNQTLGXXXXXXXXXXXXXXFYHCNNNPTTIYXXXXX 566 P ASP P EN NF LPNQTLG + N++ T+ Y Sbjct: 182 ---PAASPINPPPLLAENPNFILPNQTLGLNLNFHDFNNLEFTVHLNNSSSTSSYSSGTS 238 Query: 565 XXXXXXPLQPAMEEVPSL-VIPQEGIPMMVTNNEETSMFESGDLGLHHAMDDKEMAEIRS 389 A ++VPS+ + EG + + + +G GLH MDD+ MAEIRS Sbjct: 239 --------SSASQDVPSVGTLQAEGFSI----DSHAATHVTG--GLHTVMDDEAMAEIRS 284 Query: 388 IGEQHQMEWNDTLNLVTSAWWFKFLKTMEID-PQEQSEDFVNHPFDEIMEFPSWLNANEG 212 +G Q+QMEWNDT+NLV SA W K+LK ME P+ + E+ FD+ +EFP+WLNANE Sbjct: 285 LGNQYQMEWNDTMNLVKSACWVKYLKHMEHGAPEAKGENDAYRDFDQPLEFPAWLNANES 344 Query: 211 CLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 LE S++ QD LPC +WL+ Sbjct: 345 SLE-----LCSEDNFQDSTLPCMDIGDIDGMDDDWLA 376 >ref|XP_006843859.1| hypothetical protein AMTR_s00007p00264200 [Amborella trichopoda] gi|548846227|gb|ERN05534.1| hypothetical protein AMTR_s00007p00264200 [Amborella trichopoda] Length = 432 Score = 205 bits (521), Expect = 7e-50 Identities = 159/443 (35%), Positives = 199/443 (44%), Gaps = 44/443 (9%) Frame = -1 Query: 1300 DKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKT----GDG 1133 +KLK+EPHLSGAYIRSLVKQLTS+R + +S E+DS+ + + KT G Sbjct: 19 EKLKEEPHLSGAYIRSLVKQLTSSRNRAS-SSSEEDSMNGKSPDNESPDFPKTPAKFNGG 77 Query: 1132 FSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKXXX 953 F EN VRRRLHT+RPYQERLLNMAEARREIVTALKFHRAAMK Sbjct: 78 FGENEQPNQPQVKKQ-----VRRRLHTTRPYQERLLNMAEARREIVTALKFHRAAMKQQN 132 Query: 952 XXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSR-----TS-------AASDTSN 809 K KSRRN R 
TS ++S +S Sbjct: 133 INSSQDSTAPLSSETNN-------------KIKSRRNPRIYPSSTSFPDYPIFSSSCSSL 179 Query: 808 NLTNYVDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPENLNFALPNQTLGXXXXXXXXXX 629 + + +FS PP +P+ S P + +P L LP QTLG Sbjct: 180 SSSTPCSSFSPSLSTSPPCFPWVS----PTLCAPLTCNEL--PLPQQTLG----LNLNLQ 229 Query: 628 XXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVIPQ---------EGIPMMVT 476 ++CNNNP+ ++ P + E P Sbjct: 230 QCYNSFYCNNNPSIYSDPSSVFFNGVSETLQIIDSSPETLFSPPVSSDFSNFETPPENPN 289 Query: 475 NNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEID 296 N T + S + H AMDD MAEIRSIGE+H+MEWNDT+NLV SAWW KFL +E Sbjct: 290 PNSSTPLRPSETVSFHSAMDDAGMAEIRSIGERHEMEWNDTMNLVHSAWWLKFLNNLESG 349 Query: 295 PQEQSEDFVN----------HPFDEIMEFPSWLNAN---------EGCLEQHLSDYYSDE 173 + + F+E+M+ P+WL N EGC HLSD SD Sbjct: 350 HENHGSSCNSSCSRNNRGEYELFEELMDIPAWLQDNAEHLGISSVEGCCLNHLSD-ASDG 408 Query: 172 YMQDPALPCXXXXXXXXXXXEWL 104 ++QD LP WL Sbjct: 409 FLQDTTLPSMDLGEVEGLDGGWL 431 >emb|CBI20749.3| unnamed protein product [Vitis vinifera] Length = 290 Score = 185 bits (469), Expect = 7e-44 Identities = 98/200 (49%), Positives = 118/200 (59%), Gaps = 1/200 (0%) Frame = -1 Query: 697 ENLNFALPNQTLGXXXXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVP 518 +N+NFALPNQTLG YH NNP++IY L A EEV Sbjct: 96 DNINFALPNQTLGLNLNFHDFNNLDASLYHTTNNPSSIYSSSSPSSSSSPTLSIATEEVR 155 Query: 517 SLVIPQEGIPMMVTNNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVT 338 QEG P++ ++S+ G G H AMDD+EMAEIR IGEQHQMEWND LNL T Sbjct: 156 CFATSQEGPPVVAA---DSSVMTGG--GFHPAMDDEEMAEIRWIGEQHQMEWNDNLNLAT 210 Query: 337 SAWWFKFLKTMEIDPQEQS-EDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQD 161 SAWWFKFLK MEI P+ + ED HPFDE+M++P+WL+ NE CL L D S +Y+QD Sbjct: 211 SAWWFKFLKAMEIGPEATNVEDDGYHPFDEVMDYPAWLSGNESCLVPRLDDCCSMDYLQD 270 Query: 160 PALPCXXXXXXXXXXXEWLS 101 PALPC EWL+ Sbjct: 271 PALPCMDIGEIEGMDGEWLA 290 Score = 108 bits (270), Expect = 9e-21 Identities = 63/111 (56%), Positives = 73/111 (65%) Frame = -1 Query: 1318 LDMDKLDKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTG 1139 +D+DK+ 
KLK+EPHLSGAYIRSLVKQLTS+RTKDP+N K+ D+ ++Q Sbjct: 1 MDIDKIHKLKEEPHLSGAYIRSLVKQLTSSRTKDPMNPKDSDT-------QAHQT----- 48 Query: 1138 DGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTAL 986 QVRRRLHTSRPYQERLLNMAEARREIVTAL Sbjct: 49 -------------QQPQQHKKQVRRRLHTSRPYQERLLNMAEARREIVTAL 86 >ref|XP_004245206.1| PREDICTED: uncharacterized protein LOC101266961 [Solanum lycopersicum] Length = 260 Score = 179 bits (455), Expect = 3e-42 Identities = 109/257 (42%), Positives = 136/257 (52%), Gaps = 3/257 (1%) Frame = -1 Query: 862 KAKSRRNSRTSAASDTSNNLTNY-VDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPENLN 686 K KSRRN R A+++ +NNL Y ++NFS FPC P YPY T P + +NLN Sbjct: 33 KQKSRRNPRIYASNNMANNLPCYNMENFSNSTFPCHPQYPY----TCPVSSFGSLQDNLN 88 Query: 685 FALPNQTLGXXXXXXXXXXXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSLVI 506 F LPNQTLG Y NN +I+ + +E + Sbjct: 89 FPLPNQTLGLNLNFHDFNNLDATPYCSMNNKNSIFSSSSP--------SSSSDEFHCVGE 140 Query: 505 PQEGIPMMVTNNEETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWW 326 I MV + LH +MDD+EM EIRS+GEQH+MEWNDTLNL TSAWW Sbjct: 141 GVGPISEMVNSR------------LHPSMDDQEMEEIRSVGEQHEMEWNDTLNLATSAWW 188 Query: 325 FKFLKTMEIDPQEQ--SEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSDEYMQDPAL 152 FKFLKTMEI P ++ +ED+ +PFDE+MEFP W N NE L+QH+ D YS P L Sbjct: 189 FKFLKTMEIGPDDKNVAEDYGCYPFDEVMEFPPWFNPNETFLQQHVDDTYS-----HPTL 243 Query: 151 PCXXXXXXXXXXXEWLS 101 PC EWL+ Sbjct: 244 PCMDIEEIEGMDVEWLA 260 >ref|XP_004498967.1| PREDICTED: uncharacterized protein LOC101502455 [Cicer arietinum] Length = 326 Score = 162 bits (409), Expect = 7e-37 Identities = 126/357 (35%), Positives = 167/357 (46%), Gaps = 11/357 (3%) Frame = -1 Query: 1213 LNSKEQDSLEVVEGFSSYQNLTKTGDGFSENXXXXXXXXXXXXXXXQVRRRLHTSRPYQE 1034 +N+K+ D + V S+ QNL K G N VRRRLHT+RPYQE Sbjct: 1 MNTKDHDFVSV----STRQNLRKHGKVKQHNKKQ-------------VRRRLHTTRPYQE 43 Query: 1033 RLLNMAEARREIVTALKFHRAAMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEVKAK 854 +LLNMAEARREIV+ALKFHRA MK ++ K Sbjct: 44 KLLNMAEARREIVSALKFHRATMKQANEKQEEESLVLSHSNHSSHLPNFEQD----LRFK 99 Query: 853 
SRRNSRTSAASDTSNNLTNYVDNFSYQAFPCPPNYPYSSWSTFPPVASPFQPE----NLN 686 S+R R + T NFSY +F P+ + ++P +S F P N N Sbjct: 100 SKRYPRIYPSCTT---------NFSYSSFSSHPSLSLPNSHSWPLSSSSFSPPLVVANTN 150 Query: 685 FALPNQTLGXXXXXXXXXXXXXXFY--HCNNNPTTIYXXXXXXXXXXXPLQPAMEEVPSL 512 F LPN TLG + H NNN +++ +VPS Sbjct: 151 FTLPNHTLGLNLSLDDDFNNLEATFLLHNNNNNSSLCSYSSQTSSSPPLSLANDCQVPSN 210 Query: 511 VIPQ-EGIPMMVTNNEETSMF-ESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVT 338 I Q +G MV E ++ +S + G H +MDD+ M E+RSIGEQ+QMEWNDT+N VT Sbjct: 211 RISQGKGFSSMVDTIETSATTTQSNEGGFHASMDDEGMEEMRSIGEQYQMEWNDTMNFVT 270 Query: 337 SAWWFKFLKTME--IDPQ-EQSEDFVNHPFDEIMEFPSWLNANEGCLEQHLSDYYSD 176 S WF FLK ME + PQ + +D +H FDE+ + E LE+ DY+ D Sbjct: 271 STLWFNFLKKMEHDVSPQVNKEDDACHHVFDELFD------VQENYLEEWSEDYFLD 321 >ref|XP_002465160.1| hypothetical protein SORBIDRAFT_01g033040 [Sorghum bicolor] gi|241919014|gb|EER92158.1| hypothetical protein SORBIDRAFT_01g033040 [Sorghum bicolor] Length = 447 Score = 147 bits (370), Expect = 2e-32 Identities = 131/425 (30%), Positives = 187/425 (44%), Gaps = 41/425 (9%) Frame = -1 Query: 1300 DKLKQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTGDGFSEN 1121 +K ++PHLSGAYIRSLVKQL+S+ + SK+ S + S Q + D + Sbjct: 22 NKAAEQPHLSGAYIRSLVKQLSSSSSA-AARSKDHHSTMGTKPHSQPQP-PQEDDLLQQQ 79 Query: 1120 XXXXXXXXXXXXXXXQ--VRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKXXXXX 947 + VRRRLHTSRPYQERLLNMAEARREIVTALK HRA+M+ Sbjct: 80 AQQTATPQQQQQQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMRQAKEH 139 Query: 946 XXXXXXXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTSNNLTNYVDN--FSYQ 773 E + + S+A + + ++Y+ N FS+ Sbjct: 140 QQQQQQQQLMHLHLQRQQEVHHHLVQE-PSHAAATGGASSAPPSYASYSDYLYNSPFSHF 198 Query: 772 AFPCPPNYPYS-----SWSTFPPVASP--------FQPENLNFALPNQTLGXXXXXXXXX 632 P P +Y S S+ P VA P F ++L LP Q LG Sbjct: 199 TAPTPSSYSSSPLTMISYGGAPVVAPPMLNSSEHNFDDQSL-VPLPAQPLGLNLSFQGFN 257 Query: 631 XXXXXFYHCNNNPTTIYXXXXXXXXXXXPLQPA------MEEVPSLVIPQEGIPMMVTNN 470 N+ + LQP+ + PS+ + + + N Sbjct: 258 GSSVAADDTKNSTCSF--------DPPSLLQPSPASSYSVYSSPSVTMASHDLSAVTMEN 309 Query: 469 
EETSMFESGDLGLHHAMDDKEMAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTME---- 302 + + D LH +DD+EMA I SIG+QH +EW+DT+NLVTSAWW K L+++E Sbjct: 310 TSLA---AADPSLHRVLDDEEMAAIYSIGQQHDIEWSDTMNLVTSAWWSKLLESIEDKGS 366 Query: 301 ----IDPQEQSEDFVNHPFDEIMEFPSWLNANEGCLEQ----------HLSDYYSDEYMQ 164 +D ++ V +++ PSW++ + G + HLSDYY + Sbjct: 367 NGTVVDQEDGGAANVTEDPSSLVDMPSWVSDSLGHVATQESNSDFPGIHLSDYYHPDEDV 426 Query: 163 DPALP 149 ALP Sbjct: 427 SLALP 431 >gb|EMS49134.1| hypothetical protein TRIUR3_24556 [Triticum urartu] Length = 380 Score = 140 bits (353), Expect = 2e-30 Identities = 122/403 (30%), Positives = 164/403 (40%), Gaps = 6/403 (1%) Frame = -1 Query: 1291 KQEPHLSGAYIRSLVKQLTSTRTKDPLNSKEQDSLEVVEGFSSYQNLTKTGDGFSENXXX 1112 ++EPHLSGAYIRSLVK L+ST T ++ +D + G S+Q + + Sbjct: 21 REEPHLSGAYIRSLVKHLSSTST-----ARSKDHHHIAMGTKSHQEEQQAPQTTPPSLQQ 75 Query: 1111 XXXXXXXXXXXXQVRRRLHTSRPYQERLLNMAEARREIVTALKFHRAAMKXXXXXXXXXX 932 VRRRLHTSRPYQERLLNMAEARREIVTALK HRA+M+ Sbjct: 76 QQPHKKQ------VRRRLHTSRPYQERLLNMAEARREIVTALKIHRASMREAKEQQQHQQ 129 Query: 931 XXXXXXXXXXXXXXXXXXXXXEVKAKSRRNSRTSAASDTSNNL-TNYVDNFSYQAFPCPP 755 + S +S + +S T N + PP Sbjct: 130 LVQQLQHQQEVQVVQDHRVAFSAPSMSSYSSFSDYLHSSSPFAHTTATSNSGSSYYSSPP 189 Query: 754 NYPYSSWSTFPPVASPFQPE----NLNFALPNQTLGXXXXXXXXXXXXXXFYHCNNNPTT 587 PY + PV +P P + LP Q LG N T Sbjct: 190 LLPYHT-----PVVAPMVPMVDALDQFLPLPTQPLGLNLTFDGFGGGVAAEDAKNCTATG 244 Query: 586 IYXXXXXXXXXXXPLQPAMEEVPSLVIPQEGIPMMVTNNEETSMFESGDLGLHHAMDDKE 407 + +Q A V MV+ + + E+ LH +D++E Sbjct: 245 PF-------DPPSLVQQASPASSYSVYSSPPPATMVSQDMASVAAENTSQSLHRVLDEEE 297 Query: 406 MAEIRSIGEQHQMEWNDTLNLVTSAWWFKFLKTMEIDPQEQSEDFVNHPFDEIMEFPSWL 227 MA I S GE+H +EW+DT+NL TSAWW + L+++E D ++ Sbjct: 298 MAAIHSAGERHDIEWSDTVNLATSAWWSRLLESVEGDGATAAQ----------------- 340 Query: 226 NANE-GCLEQHLSDYYSDEYMQDPALPCXXXXXXXXXXXEWLS 101 AN + HLSD Y Y QD + PC +WLS Sbjct: 341 QANTVDAMGMHLSDEY---YGQDVSFPCMDIGEIKGWDAQWLS 380