BLASTX nr result
ID: Catharanthus22_contig00011701
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00011701 (1532 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241... 238 7e-60 ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258... 230 1e-57 ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585... 228 4e-57 gb|EMJ01755.1| hypothetical protein PRUPE_ppa009673mg [Prunus pe... 223 1e-55 ref|XP_002523082.1| conserved hypothetical protein [Ricinus comm... 210 1e-51 ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206... 209 3e-51 ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citr... 206 2e-50 ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313... 203 2e-49 gb|EOX95076.1| LYR motif-containing protein 7 isoform 1 [Theobro... 195 4e-47 gb|EOX95077.1| LYR motif-containing protein 7 isoform 2 [Theobro... 194 6e-47 ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795... 188 5e-45 gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis] 186 2e-44 gb|ESW16312.1| hypothetical protein PHAVU_007G146100g [Phaseolus... 179 2e-42 ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798... 178 6e-42 ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Popu... 175 4e-41 ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Popu... 155 6e-35 ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493... 153 2e-34 ref|XP_006288487.1| hypothetical protein CARUB_v10001746mg [Caps... 142 5e-31 ref|NP_849288.1| uncharacterized protein [Arabidopsis thaliana] ... 140 1e-30 ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago ... 134 8e-29 >ref|XP_002268658.1| PREDICTED: uncharacterized protein LOC100241933 [Vitis vinifera] Length = 269 Score = 238 bits (606), Expect = 7e-60 Identities = 139/273 (50%), Positives = 160/273 (58%), Gaps = 13/273 (4%) Frame = +3 Query: 294 LDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDS--PKPYS--TTANGFHSKRPCRADRA 461 + + SIESCT Q HSWRPFQ P PKTL+ DS KPYS T++NG HSKRPC +DR Sbjct: 1 MSPKTSIESCTFQLHSWRPFQLPTT-PKTLEPDSHNSKPYSITTSSNGLHSKRPCLSDRK 59 Query: 462 TSFSIEALDMSKLSLFDDDRPLSSAHK-----RWFAXXXXXXXXXXXXXXXXXXXXTHXX 626 TSF I+ALD+SKLSL +DD+P SSA + RW T Sbjct: 60 TSFPIDALDISKLSLLEDDKPASSAPRNRGNVRWIDRKRRRRGSRSVSGRSSDRSGTRRC 119 Query: 627 XXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGEK 806 DFP+AAGTDSSGELFVNGD NWSSDVSE AKNSR++RD GSGEK Sbjct: 120 CSVGASAAYATCS---DFPVAAGTDSSGELFVNGDSNWSSDVSE-AKNSRKDRDGGSGEK 175 Query: 807 DNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFG--V 980 +NL SGF +G E DAEFGYG +RLLFWG+ G Sbjct: 176 ENLGSGFGHIGIFETQGNESGYGSEPGYRGDAEFGYGDELDEEEDDARLLFWGEQLGDND 235 Query: 981 SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 ++ME VGEN QKAHHRCRRKKHD RM+D L Sbjct: 236 TNMEMVGENTFSEQKAHHRCRRKKHDYRMIDAL 268 >ref|XP_004240545.1| PREDICTED: uncharacterized protein LOC101258757 [Solanum lycopersicum] Length = 269 Score = 230 bits (586), Expect = 1e-57 Identities = 135/270 (50%), Positives = 157/270 (58%), Gaps = 8/270 (2%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYS-TTANGFHSKRPCRAD 455 MS +TLDSRH+IESCT HSW+PFQFP KTLD DSPK YS +T G H+KR CRAD Sbjct: 1 MSPKTLDSRHAIESCTYHLHSWKPFQFPSPNSKTLDLDSPKTYSPSTHGGLHTKRQCRAD 60 Query: 456 RATSFSIEALDMSKLSLFDDDRPLSSAHKR----WFAXXXXXXXXXXXXXXXXXXXXTHX 623 R TS IEALDMSKLSLF++D+PLS HKR A T Sbjct: 61 RTTSIPIEALDMSKLSLFEEDKPLS-VHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119 Query: 624 XXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS-G 800 DFP+AAGTDSSGELFVNGD +W+ DVSE K+ R+E++ G G Sbjct: 120 RCCSVGASAAYGTCS--DFPVAAGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVG 177 Query: 801 EKDNLSSGFA-QVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFG 977 E++N +G + Q GN E L DAEFGYG RL FWG FG Sbjct: 178 ERENNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFG 237 Query: 978 -VSSMERVGENMLQKAHHRCRRKKHDLRMV 1064 +S ME+VGEN LQK HHRCRR+K D RMV Sbjct: 238 ALSRMEKVGENSLQKVHHRCRRRKQDCRMV 267 >ref|XP_006355807.1| PREDICTED: uncharacterized protein LOC102585515 [Solanum tuberosum] Length = 269 Score = 228 bits (582), Expect = 4e-57 Identities = 135/270 (50%), Positives = 155/270 (57%), Gaps = 8/270 (2%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYS-TTANGFHSKRPCRAD 455 MS +TLDSRH+IESCT HSW+PFQFP KTLD DSPK YS +T G H+KR CRAD Sbjct: 1 MSPKTLDSRHAIESCTYHLHSWKPFQFPTPNSKTLDLDSPKTYSPSTHGGVHTKRQCRAD 60 Query: 456 RATSFSIEALDMSKLSLFDDDRPLSSAHKR----WFAXXXXXXXXXXXXXXXXXXXXTHX 623 R TS IEALDMSKLSLF++DRPLS HKR A T Sbjct: 61 RTTSIPIEALDMSKLSLFEEDRPLS-VHKRENLRLIAGKRRRRGSRSVSGRSSDRSGTRR 119 Query: 624 XXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS-G 800 DFP+A GTDSSGELFVNGD +W+ DVSE K+ R+E++ G G Sbjct: 120 RCCSVGASAAYGTCS--DFPVAVGTDSSGELFVNGDMHWTLDVSEVTKSLRKEKEGGGVG 177 Query: 801 EKD-NLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFG 977 E++ NL+ Q GN E L DAEFGYG RL FWG FG Sbjct: 178 ERESNLNGLSVQSGNFEGLGNESGYGSEPGYRGDAEFGYGDEFDEEEDDQRLSFWGDEFG 237 Query: 978 -VSSMERVGENMLQKAHHRCRRKKHDLRMV 1064 +S ME+VGEN LQK HHRCRR+K D RMV Sbjct: 238 ALSRMEKVGENTLQKVHHRCRRRKQDCRMV 267 >gb|EMJ01755.1| hypothetical protein PRUPE_ppa009673mg [Prunus persica] Length = 282 Score = 223 bits (569), Expect = 1e-55 Identities = 135/285 (47%), Positives = 161/285 (56%), Gaps = 20/285 (7%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPF---QFPVAIPKTLDSD----SPKPYSTTANGF--H 431 MSH+ L+ RH I+SC Q HSWRPF Q KTLDSD +PKPY++++NG H Sbjct: 1 MSHKALEHRHPIDSCAFQLHSWRPFHLHQQTTPTSKTLDSDPSLPNPKPYNSSSNGLVVH 60 Query: 432 SKRPCRADRATSFSIEALDMSKLSLFDDDRPLSSAHK------RWFAXXXXXXXXXXXXX 593 +KRPC ++RATSFSI+A+DMS+L+L DDDR +S H R+ A Sbjct: 61 TKRPCLSNRATSFSIDAIDMSRLTLVDDDRTISGGHHNRHGSFRFIAKKRRRHGSRSVSG 120 Query: 594 XXXXXXXTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNS 773 T DFP+A GTDSSGELF NGD NW+SDVSE A+NS Sbjct: 121 RSSDRSGTRRCCSVGASAAYGTCS---DFPVAVGTDSSGELFGNGDANWASDVSE-ARNS 176 Query: 774 RRERD-NGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSR 950 R+ERD GSGEK+NL GF +G + DAEFGYG +R Sbjct: 177 RKERDGGGSGEKENLGIGFGPIGGFDVQGNESGYGSEPGYRGDAEFGYGDELDEEEEDTR 236 Query: 951 LLFWGQGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 LLFWG FG S ME VGEN QK+HHRCRRKKHD RMVD L Sbjct: 237 LLFWGDQFGDADSMMEIVGENTFVDQKSHHRCRRKKHDCRMVDTL 281 >ref|XP_002523082.1| conserved hypothetical protein [Ricinus communis] gi|223537644|gb|EEF39267.1| conserved hypothetical protein [Ricinus communis] Length = 261 Score = 210 bits (534), Expect = 1e-51 Identities = 121/270 (44%), Positives = 149/270 (55%), Gaps = 7/270 (2%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRADR 458 MSHR+LDSRHSI+SCT Q HSWRPF +TLDSD PKPYS+T +KRPC +DR Sbjct: 1 MSHRSLDSRHSIDSCTFQLHSWRPFHL-----QTLDSDPPKPYSST-----TKRPCLSDR 50 Query: 459 ATSFSIEALDMSKLSLFDDDRPLSSAHKRWF---AXXXXXXXXXXXXXXXXXXXXTHXXX 629 TSF I+++D+SKLS+ DDD+P+S + + + Sbjct: 51 TTSFPIDSIDISKLSIIDDDKPISVSAATAYNSRGSLRLIARKRRRRGSRSVSGRSSDRS 110 Query: 630 XXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGEKD 809 DFP+A GTDSSGELF NGD NW SDVSEA + +RE+D EK+ Sbjct: 111 GTRRCCSVGAHGTCSDFPVAVGTDSSGELFGNGDSNWGSDVSEAKNSIKREKDREREEKE 170 Query: 810 NLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFGVS-- 983 N+ G+ Q G EN DAEFGY ++LLFWG FG + Sbjct: 171 NM--GYGQFGTFENQGNESGYGSEPGYRGDAEFGYEDEIDEEEDDAKLLFWGDHFGGTGP 228 Query: 984 SMERVGENML--QKAHHRCRRKKHDLRMVD 1067 ME VGEN QK+HHRCRRKKHD RM+D Sbjct: 229 KMEMVGENSFSDQKSHHRCRRKKHDNRMLD 258 >ref|XP_004152251.1| PREDICTED: uncharacterized protein LOC101206482 [Cucumis sativus] Length = 266 Score = 209 bits (531), Expect = 3e-51 Identities = 129/276 (46%), Positives = 151/276 (54%), Gaps = 11/276 (3%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSD--------SPKPYSTTANGFHS 434 MS R LDSRHSI+SCTL+FH W PF +PKTLDSD + KPY ++ H+ Sbjct: 1 MSRRPLDSRHSIDSCTLKFHGWTPFH----LPKTLDSDPHNTSAPTNSKPYYSSTP-LHT 55 Query: 435 KRPCRADRATSFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXX 614 KRPC +DR TSF+++A+DMS LSL DDD+P S R F Sbjct: 56 KRPCLSDRTTSFNVDAIDMSALSLIDDDKP-SIPPARSFRLIARKRRRRGSRSVSGRSSD 114 Query: 615 THXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNG 794 DFP+A GTDSSGELFVNGD NWSSDVSE AKNSRRER+ Sbjct: 115 RSGTRRCCSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSE-AKNSRRERE-- 171 Query: 795 SGEKDNLSSGF-AQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQG 971 EKD+L SGF + G + D EFGYG +RLL WG+ Sbjct: 172 --EKDHLGSGFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 229 Query: 972 FGVSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 G S ME VGEN QK+HHRCRRKKH+ RMVD L Sbjct: 230 LGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDAL 265 >ref|XP_006444350.1| hypothetical protein CICLE_v10021537mg [Citrus clementina] gi|568852594|ref|XP_006479957.1| PREDICTED: uncharacterized protein LOC102627953 [Citrus sinensis] gi|557546612|gb|ESR57590.1| hypothetical protein CICLE_v10021537mg [Citrus clementina] Length = 283 Score = 206 bits (525), Expect = 2e-50 Identities = 134/284 (47%), Positives = 160/284 (56%), Gaps = 20/284 (7%) Frame = +3 Query: 282 SHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDS-DSPKPYSTTANGFHSKRPCRADR 458 SH+ LDSRHSI+SC LQ H+WRPF + LDS DS KP + ++ H+KRPC +DR Sbjct: 4 SHKPLDSRHSIDSCALQLHNWRPFH----LQNPLDSSDSTKPSYSPSSWVHTKRPCLSDR 59 Query: 459 ATSFSI---EALDMSKLSLFDDD---RPLSSA----HKRWFAXXXXXXXXXXXXXXXXXX 608 ATSFSI A+D+SKLSLFDDD +P+++A + + Sbjct: 60 ATSFSIIDAAAIDLSKLSLFDDDNVIKPMTAATAPQSRGGYRLIARKRRRRGSRSVSGRS 119 Query: 609 XXTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERD 788 DFP+A GTDSSGELF NG+ NW+SDVSE A+NSRRERD Sbjct: 120 SDRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGEANWASDVSE-ARNSRRERD 178 Query: 789 --NGSGEKDNLSSGF-AQVGNLEN--LXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRL 953 NGSGEK+N +GF QVG LE L DAEFGYG ++L Sbjct: 179 NGNGSGEKENSGTGFGGQVGCLEAQVLGNESGYGSEPGYRGDAEFGYGDELDEEEEDAKL 238 Query: 954 LFWGQGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 LFWG FG S ME VGEN QK+HHRCRRKKHD RMVD L Sbjct: 239 LFWGNRFGDVDSKMEMVGENTFTDQKSHHRCRRKKHDCRMVDAL 282 >ref|XP_004292660.1| PREDICTED: uncharacterized protein LOC101313678 [Fragaria vesca subsp. vesca] Length = 271 Score = 203 bits (516), Expect = 2e-49 Identities = 130/277 (46%), Positives = 151/277 (54%), Gaps = 15/277 (5%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSD--SPKPYSTTANGFHSKRPCRA 452 MSH+ LDSR S++SCT Q HSWRPFQ KTLDSD +PKPY H+KRPC + Sbjct: 1 MSHKALDSRPSLDSCTFQLHSWRPFQLQQQPTKTLDSDPANPKPY-------HTKRPCLS 53 Query: 453 DRATS-FSIEALDMSKLSLFDDDRPLSSAHK------RWFAXXXXXXXXXXXXXXXXXXX 611 +RATS FSI+A+DMS+L+L DDDR +S H R+ A Sbjct: 54 NRATSSFSIDAIDMSRLTLVDDDRTISGGHHHKHGSFRFLARKRRRHGSRSVSGRSSDRS 113 Query: 612 XTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDN 791 T DFP+A GTDSSGELF NGD NW+SDVSE A+N R+ERD Sbjct: 114 GTRRCCSVGASAAHGTCS---DFPVAIGTDSSGELFGNGDANWASDVSE-ARNLRKERDG 169 Query: 792 -GSGEKDNLSS-GFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWG 965 GSGEK+ GF G + DAEFGYG +RLLFWG Sbjct: 170 VGSGEKETTPGVGFGPGGGFDAQGNESGYGSEPGYRGDAEFGYGDELDEEEEDARLLFWG 229 Query: 966 QGFGVSS--MERVGENML--QKAHHRCRRKKHDLRMV 1064 FG S ME VGEN QK+HHRCRRKKHD RMV Sbjct: 230 NRFGDSDTMMEVVGENTFTDQKSHHRCRRKKHDCRMV 266 >gb|EOX95076.1| LYR motif-containing protein 7 isoform 1 [Theobroma cacao] Length = 271 Score = 195 bits (496), Expect = 4e-47 Identities = 128/279 (45%), Positives = 148/279 (53%), Gaps = 16/279 (5%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPK---PYSTTANGFHSKRPCR 449 MSH+ L+ RHSI+SCT Q HSWRPFQ + +TLDS P+ P + N FHSKRPC Sbjct: 1 MSHKALEPRHSIDSCTFQLHSWRPFQ----LQQTLDSSDPQQTPPKRASTNCFHSKRPCL 56 Query: 450 ADRATSFSIEALDMSKLSLFDDDR-----PLSSAHKRW-FAXXXXXXXXXXXXXXXXXXX 611 +DR TSFSI D+SKL+L DDD P+++ KR F Sbjct: 57 SDRTTSFSI---DLSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSS 113 Query: 612 XTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNG-DPNWSSDVSEAAKNSRRER- 785 DFP+A GTDSSGELF NG D W+SDVSE A+NSRRER Sbjct: 114 DRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSE-ARNSRRERG 172 Query: 786 DNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWG 965 D GSGEK++L Q G + D EFGYG +RLLFWG Sbjct: 173 DGGSGEKESLG---GQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWG 229 Query: 966 QGFGV---SSMERVGENML--QKAHHRCRRKKHDLRMVD 1067 FG S ME VGEN QKAHHRCRRKKHD RMVD Sbjct: 230 HHFGADTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVD 268 >gb|EOX95077.1| LYR motif-containing protein 7 isoform 2 [Theobroma cacao] Length = 270 Score = 194 bits (494), Expect = 6e-47 Identities = 128/278 (46%), Positives = 148/278 (53%), Gaps = 15/278 (5%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPK---PYSTTANGFHSKRPCR 449 MSH+ L+ RHSI+SCT Q HSWRPFQ + +TLDS P+ P + N FHSKRPC Sbjct: 1 MSHKALEPRHSIDSCTFQLHSWRPFQ----LQQTLDSSDPQQTPPKRASTNCFHSKRPCL 56 Query: 450 ADRATSFSIEALDMSKLSLFDDDR-----PLSSAHKRW-FAXXXXXXXXXXXXXXXXXXX 611 +DR TSFSI D+SKL+L DDD P+++ KR F Sbjct: 57 SDRTTSFSI---DLSKLTLLDDDNNSSYNPIAANPKRGSFRLFARKRRRRGSRSVSGRSS 113 Query: 612 XTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNG-DPNWSSDVSEAAKNSRRER- 785 DFP+A GTDSSGELF NG D W+SDVSE A+NSRRER Sbjct: 114 DRSGTRRCCSVGASAAYGTCSDFPVAVGTDSSGELFGNGADAYWASDVSE-ARNSRRERG 172 Query: 786 DNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWG 965 D GSGEK++L Q G + D EFGYG +RLLFWG Sbjct: 173 DGGSGEKESLG---GQFGGFDAQGNESGYGSEPGYRGDGEFGYGDEVDEEEEDARLLFWG 229 Query: 966 QGFG--VSSMERVGENML--QKAHHRCRRKKHDLRMVD 1067 FG S ME VGEN QKAHHRCRRKKHD RMVD Sbjct: 230 HHFGDTDSKMEMVGENTFSDQKAHHRCRRKKHDYRMVD 267 >ref|XP_003519006.1| PREDICTED: uncharacterized protein LOC100795813 [Glycine max] Length = 260 Score = 188 bits (478), Expect = 5e-45 Identities = 123/273 (45%), Positives = 139/273 (50%), Gaps = 7/273 (2%) Frame = +3 Query: 276 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 455 +MSH+ LDSRH+ +SC LQ +W+PF+ D PKPY + KRPC +D Sbjct: 4 SMSHKPLDSRHTTDSCLLQLRTWKPFKLQQ------DGPHPKPY-------YHKRPCLSD 50 Query: 456 RAT-SFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXXTHXXXX 632 R T SFS LDMSKL+L DDD + + Sbjct: 51 RTTTSFS---LDMSKLTLADDDNHNPNNRATNYRLVARKRRRRGSRSVSGRSSDRSGTRR 107 Query: 633 XXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSR--RERDNGSGEK 806 DFP+A GTDSSGELF NGDPNWSSDVSE AKNSR RERD GSGEK Sbjct: 108 CCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERERDGGSGEK 166 Query: 807 DNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFGV-- 980 +NL GF G E DAEFGYG RLLFWG G Sbjct: 167 ENLGVGFGVSGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAVD 226 Query: 981 SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 S ME VGEN L QK+HHRCRR+KHD RMVD L Sbjct: 227 SKMEMVGENTLLDQKSHHRCRRRKHDCRMVDAL 259 >gb|EXC10295.1| hypothetical protein L484_006190 [Morus notabilis] Length = 275 Score = 186 bits (473), Expect = 2e-44 Identities = 123/276 (44%), Positives = 145/276 (52%), Gaps = 13/276 (4%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIP-KTLDSDSPKPYSTTANGFHS--KRPCR 449 MS + LDSRHSI+SC Q HSWRPFQ P KTLD+ + + + G H+ KRPC Sbjct: 1 MSPKLLDSRHSIDSCAFQLHSWRPFQQHSTPPTKTLDAANNPRHYRSNGGAHAITKRPCL 60 Query: 450 ADRATSFSIEALDMSKLSLFDDD--RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXXTHX 623 +DRATSF I+A+DMS+LSL DDD RP ++ Sbjct: 61 SDRATSFPIDAIDMSRLSLVDDDTARPHHHQYRGSLRLLARKRRRRGSRSVSGRSSDRSG 120 Query: 624 XXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVN-GDPNWSSDVSEAAKNSRRERD---N 791 DFP+A GTDSSGELF+N GD NWSSDVSE A+NSRRERD Sbjct: 121 TRRCCSVGASAAYGTCSDFPVAVGTDSSGELFLNTGDANWSSDVSE-ARNSRRERDGAGG 179 Query: 792 GSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQG 971 GSGEK++ +G ++ DAEFGYG +RLLFWG Sbjct: 180 GSGEKESFG---GVIGGFDSQGAESGYGSEPGYRGDAEFGYGDEHDEEEDDARLLFWGNR 236 Query: 972 F--GVSSMERVGENML--QKAHHRCRRKKHDLRMVD 1067 F S E VGEN QK HHRCRRKKHD RMVD Sbjct: 237 FEDTDSMTEIVGENTFSDQKVHHRCRRKKHDCRMVD 272 >gb|ESW16312.1| hypothetical protein PHAVU_007G146100g [Phaseolus vulgaris] Length = 261 Score = 179 bits (455), Expect = 2e-42 Identities = 124/279 (44%), Positives = 142/279 (50%), Gaps = 13/279 (4%) Frame = +3 Query: 276 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 455 +MSH+ LDSRHSI+SC LQ SW+PF K D PKPY + KRPC +D Sbjct: 4 SMSHKPLDSRHSIDSCMLQLRSWKPF-------KLQDGPHPKPY-------YYKRPCLSD 49 Query: 456 RAT-SFSIEALDMSKLSLFDDDRPLSSAHK--------RWFAXXXXXXXXXXXXXXXXXX 608 RAT SFS LD++KL+L D D + A+ R A Sbjct: 50 RATTSFS---LDIAKLTLADADDTTTIANNPNHRATNYRLVARKRRRRGSRSVSGRSSDR 106 Query: 609 XXTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERD 788 T DFP+A GTDSSGELF NGDPNWSSDVSE AKNSRRER+ Sbjct: 107 SGTRRCCSVGASAAYGTCS---DFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERE 162 Query: 789 NGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQ 968 GE++N+ GF G E DAEFGYG RLLFWG Sbjct: 163 R-DGERENVGVGFGVSGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGD 221 Query: 969 GFGV--SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 FG S E VGEN L QK+HHRCRR+KHD RMVD L Sbjct: 222 QFGAVDSKREMVGENTLLDQKSHHRCRRRKHDCRMVDAL 260 >ref|XP_006588589.1| PREDICTED: uncharacterized protein LOC100798288 [Glycine max] Length = 260 Score = 178 bits (451), Expect = 6e-42 Identities = 119/274 (43%), Positives = 138/274 (50%), Gaps = 8/274 (2%) Frame = +3 Query: 276 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRAD 455 +MSH+ LDSRHSI+SC LQ SW+PF+ D PKPY + KRPC +D Sbjct: 4 SMSHKPLDSRHSIDSCLLQLRSWKPFKLQQ------DGPHPKPY-------YHKRPCLSD 50 Query: 456 RAT-SFSIEALDMSKLSLFDDDRPLSSAHKRW---FAXXXXXXXXXXXXXXXXXXXXTHX 623 R T SFS LDMSKL+L DD + + + + Sbjct: 51 RTTTSFS---LDMSKLTLAADDDTIHNPNNNRATNYRLVARKRRRRGSRSLSGRSSDRSG 107 Query: 624 XXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSGE 803 DFP+A GTDSSGELF NGDPNWSSDVSE AKNSRRER+ GE Sbjct: 108 TRRCCSVGASAAYGTCSDFPVAMGTDSSGELFGNGDPNWSSDVSE-AKNSRRERER-DGE 165 Query: 804 KDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFGV- 980 K+N+ GF G + DAEFGYG RLLFWG G Sbjct: 166 KENVGVGFGVSGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDPRLLFWGDQLGAV 225 Query: 981 -SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 S E VGEN L QK+HHRCRR+KHD RMVD L Sbjct: 226 DSKREMVGENTLLDQKSHHRCRRRKHDCRMVDAL 259 >ref|XP_002321007.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa] gi|550324059|gb|EEE99322.2| hypothetical protein POPTR_0014s12400g [Populus trichocarpa] Length = 279 Score = 175 bits (444), Expect = 4e-41 Identities = 123/283 (43%), Positives = 149/283 (52%), Gaps = 18/283 (6%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSP---KPYSTTANGFHSKRPCR 449 MSH +SRHSI+SCTLQ HSWRPF LDSD P KPY+++ KRPC Sbjct: 19 MSH---NSRHSIDSCTLQLHSWRPF---------LDSDPPTNSKPYASSRT--LPKRPCL 64 Query: 450 ADRATSF--SIEALDMSKLSLFDDD-----RPL-------SSAHKRWFAXXXXXXXXXXX 587 +DRATSF +I+++D+SKLSL DD +P+ +S +KR Sbjct: 65 SDRATSFPSNIDSIDISKLSLLQDDDNNNNKPIPATPAVTNSPYKRG-TLRLIERKRRRR 123 Query: 588 XXXXXXXXXTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAK 767 + DFP+A GTDSSGELFVNGD NW+SDVSE AK Sbjct: 124 GSRSVSGRSSDRSGTWRCCSVGAAHGTCSDFPVAVGTDSSGELFVNGDANWASDVSE-AK 182 Query: 768 NSRRERDNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXS 947 NS +ER+ EK+NL + GNL++ DAEFGYG + Sbjct: 183 NSIKERE----EKENLLGVGSAFGNLDS---ESGYGSEPGYRGDAEFGYGDEVDEEEDDA 235 Query: 948 RLLFWGQGFGVSSMERVGENMLQ-KAHHRCRRKKHDLRMVDIL 1073 RLLFWG F S ME VGEN K HHRCRR+KHD RMVD L Sbjct: 236 RLLFWGHHFQDSKMEMVGENTFDPKTHHRCRRRKHDYRMVDSL 278 >ref|XP_002301478.1| hypothetical protein POPTR_0002s20610g [Populus trichocarpa] gi|222843204|gb|EEE80751.1| hypothetical protein POPTR_0002s20610g [Populus trichocarpa] Length = 263 Score = 155 bits (391), Expect = 6e-35 Identities = 113/282 (40%), Positives = 138/282 (48%), Gaps = 19/282 (6%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANG-FHSKRPCRAD 455 MSH +SR S++SCTLQ HSWRPF LDSD Y A+ +KRPC +D Sbjct: 1 MSH---NSRQSLDSCTLQLHSWRPF---------LDSDPTTSYKPHASSPTLTKRPCLSD 48 Query: 456 RATSF--SIEALDMSKLSLFDDD--------------RPLSSAHKRWFAXXXXXXXXXXX 587 R+TSF +++++D+SKL+L +DD RP R Sbjct: 49 RSTSFPSNVDSIDLSKLTLLEDDHNNTNNKPIPAVTSRPYKRGTLRLIQRKRRRRGSRSV 108 Query: 588 XXXXXXXXXTHXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAK 767 T DF +A GTDSSGELFVNGD NW+SDVS+ AK Sbjct: 109 SGRSSDRSGTRRCCSVGAASAAHATCS--DFHVAVGTDSSGELFVNGDANWASDVSQ-AK 165 Query: 768 NSRRERDNGSGEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXS 947 NS +ER+ EK+NL +GNL++ DAE GYG + Sbjct: 166 NSVKERE----EKENLLGVGNVIGNLDS---ESGYGSEPGYRGDAEVGYGDEVDEEEDDA 218 Query: 948 RLLFWGQGFGVSSMERVGENML-QKAHHRCRRKKHDL-RMVD 1067 RLLFWG F S ME VGEN K HHRCRRKKHD RMVD Sbjct: 219 RLLFWGHHFQDSKMEMVGENTFDSKTHHRCRRKKHDCSRMVD 260 >ref|XP_004495959.1| PREDICTED: uncharacterized protein LOC101493408 [Cicer arietinum] Length = 261 Score = 153 bits (386), Expect = 2e-34 Identities = 117/278 (42%), Positives = 141/278 (50%), Gaps = 12/278 (4%) Frame = +3 Query: 276 AMSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDS--DSPKPYSTTANGFHSKRPCR 449 +MSH++ +I+SC LQ +WRPF P+T S S P + N KRPC Sbjct: 4 SMSHKS-----TIDSCVLQLRTWRPFHH--LHPQTTSSLDGSHNPTKPSLN----KRPCL 52 Query: 450 ADRAT-SFSIEALDMSKLSLFDDDRPLSS-AHKRWFAXXXXXXXXXXXXXXXXXXXXTHX 623 +DR T SFS LD+SKL+L DDDRP+++ A+ R A T Sbjct: 53 SDRTTTSFS---LDLSKLTLADDDRPINNTANHRLIARKRRRRCSRSVSGRSSDRSATRR 109 Query: 624 XXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSG- 800 DFP+A GTDSSGELF NGD NWSSDVSE AKNS RD GSG Sbjct: 110 CCSVGASAAYGTCS---DFPVAMGTDSSGELFGNGDANWSSDVSE-AKNS---RDGGSGE 162 Query: 801 -EKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFG 977 EK+N++ GF G E DAEFGYG R+LFWG G Sbjct: 163 KEKENVALGFGVNGCSEANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRVLFWGNQLG 222 Query: 978 ----VSSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 S ME VGEN L QK+HHR RR+K+D RM+D L Sbjct: 223 GAAVDSKMEMVGENTLLDQKSHHRLRRRKNDCRMIDAL 260 >ref|XP_006288487.1| hypothetical protein CARUB_v10001746mg [Capsella rubella] gi|482557193|gb|EOA21385.1| hypothetical protein CARUB_v10001746mg [Capsella rubella] Length = 261 Score = 142 bits (357), Expect = 5e-31 Identities = 107/270 (39%), Positives = 130/270 (48%), Gaps = 7/270 (2%) Frame = +3 Query: 279 MSHRTLD-SRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHS--KRPCR 449 MS + L+ SR SIESCT Q SWRPFQ KTLDS P + NGFHS KRPC Sbjct: 1 MSQKHLEPSRSSIESCTSQLLSWRPFQRS----KTLDSPDHPPQT---NGFHSTTKRPCF 53 Query: 450 ADRATSFSIEALDMSKLSLFDDD---RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXXTH 620 +DR+TSFSIEA MS+LSL DDD + LS+++ + Sbjct: 54 SDRSTSFSIEA--MSRLSLADDDNGGKTLSASNYSNRGSFRLVARKRRRRNSRSVSGRSS 111 Query: 621 XXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGSG 800 D P A GTDSSGELF G+ NW SDVSEAA+NSRRER + G Sbjct: 112 DRSGTRRCCSIGAHGTCSDLPFAVGTDSSGELF--GEANWGSDVSEAARNSRRERRDSGG 169 Query: 801 EKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFGV 980 EK+ S GF ++ + DAEFGYG + LFWG Sbjct: 170 EKE-ASGGFGFAIGIDPMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFWGDTDST 228 Query: 981 SSMERVGENMLQKAHHRCRRKK-HDLRMVD 1067 M + K RCRR++ HD + VD Sbjct: 229 MGMAGDTKFSDNKPQFRCRRRRQHDYKTVD 258 >ref|NP_849288.1| uncharacterized protein [Arabidopsis thaliana] gi|26450275|dbj|BAC42254.1| unknown protein [Arabidopsis thaliana] gi|28973027|gb|AAO63838.1| unknown protein [Arabidopsis thaliana] gi|332656769|gb|AEE82169.1| uncharacterized protein AT4G02425 [Arabidopsis thaliana] Length = 262 Score = 140 bits (354), Expect = 1e-30 Identities = 105/271 (38%), Positives = 130/271 (47%), Gaps = 8/271 (2%) Frame = +3 Query: 279 MSHRTLDS-RHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHS---KRPC 446 MS + L+S R SIESCT Q SWRPF KTLDS P + NGFHS KRPC Sbjct: 1 MSPKHLESSRSSIESCTSQLLSWRPFHRS----KTLDSSDQPPQT---NGFHSFTPKRPC 53 Query: 447 RADRATSFSIEALDMSKLSLFDDD---RPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXXT 617 +DR+TSF+IEA MS+LSL DDD + LS+++ + Sbjct: 54 FSDRSTSFTIEA--MSRLSLADDDNGGKTLSASNYSNRGSFRLVARKRRRRNSRSVSGRS 111 Query: 618 HXXXXXXXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNGS 797 D P A GTDSSGELF G+ NW+SDVSEAA+NSRRER + Sbjct: 112 SDRSGTRRCCSIGAHGTCSDLPFAVGTDSSGELF--GEANWASDVSEAARNSRRERRDSG 169 Query: 798 GEKDNLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQGFG 977 GEK+ S GF ++ + DAEFGYG + LFWG Sbjct: 170 GEKE-ASGGFGFANGVDPMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLFWGDTDS 228 Query: 978 VSSMERVGENMLQKAHHRCRRKK-HDLRMVD 1067 M + K RCRR++ HD + VD Sbjct: 229 TMGMSGETKFSDSKPQFRCRRRRQHDYKTVD 259 >ref|XP_003591530.1| hypothetical protein MTR_1g088580 [Medicago truncatula] gi|355480578|gb|AES61781.1| hypothetical protein MTR_1g088580 [Medicago truncatula] Length = 249 Score = 134 bits (338), Expect = 8e-29 Identities = 106/273 (38%), Positives = 130/273 (47%), Gaps = 8/273 (2%) Frame = +3 Query: 279 MSHRTLDSRHSIESCTLQFHSWRPFQFPVAIPKTLDSDSPKPYSTTANGFHSKRPCRADR 458 MSH+ ++++C LQ +W+PF + D S + N KRPC +DR Sbjct: 1 MSHKP-----TLDTCVLQLRTWKPFH------QIHDHGSHSHNNNNIN----KRPCLSDR 45 Query: 459 AT-SFSIEALDMSKLSLFDDDRPLSSAHKRWFAXXXXXXXXXXXXXXXXXXXXTHXXXXX 635 T SFS LD+SKL+L D++ P A+ R A T Sbjct: 46 TTTSFS---LDLSKLTLTDNNPP---ANYRLIARKRRRRGSRSVSGRSSDRSATRRCCSV 99 Query: 636 XXXXXXXXXXXXXDFPMAAGTDSSGELFVNGDPNWSSDVSEAAKNSRRERDNG--SGEKD 809 DFP+A GTDSSGELF NGD NWSSDVSE AKNSR +G EK+ Sbjct: 100 GASAAYGTCS---DFPVAMGTDSSGELFGNGDANWSSDVSE-AKNSRDCGGSGEKEKEKE 155 Query: 810 NLSSGFAQVGNLENLXXXXXXXXXXXXXXDAEFGYGXXXXXXXXXSRLLFWGQ---GFGV 980 N+ GF G + DAEFGYG RLLFWG G Sbjct: 156 NVGVGFGVNGCSDANGNESGYGSEPGYRGDAEFGYGDEFDEEEDDHRLLFWGNQLVGAVD 215 Query: 981 SSMERVGENML--QKAHHRCRRKKHDLRMVDIL 1073 S ME VGEN L QK+HHRCRR+K+D RM+D L Sbjct: 216 SKMEMVGENTLLDQKSHHRCRRRKNDCRMIDAL 248