DM BNF

BYOND Forums

Announcements · BYOND Help · Bug Reports · Feature Requests · Beta Testers · Beta Bugs · Developer Help · Design Philosophy · Demos & Libraries · Tutorials & Snippets · Art & Sound · Classified Ads · Game Updates · Contests & Events · Linux Talk · On Topic · Off Topic

DM BNF

ID:1567644 May 5 2014, 5:32 pm Keywords: dm, dreammaker, language
Liquidweaver	I just finished creating a lexer for the DM language, and I was wondering: is there a [E]BNF somewhere for the language? I assume the C BNF might be a good start, but if one exists already that would be better :)

May 5 2014, 5:43 pm
Nadrew	Nopers, sorry.

May 5 2014, 5:52 pm
Liquidweaver	Out of curiosity: is a standard lexer/parser not used internally (i.e., is the parsing done entirely by hand?)

May 5 2014, 6:11 pm In response to Liquidweaver
Ter13	Liquidweaver wrote: Out of curiosity: is a standard lexer/parser not used internally (i.e., is the parsing done entirely by hand?) BYOND's VM/Bytecode was written by Dan back in 95-98. Back then there really wasn't much open-source information on compilers freely available outside of university settings. It's anyone's guess how he constructed the lexical parser.

May 6 2014, 5:38 am
Audeuro	I recommend trying to get in touch with Jp. I believe he was working on this some time ago for Scintilla or a similar project. He might have something, at the very least, for you to go on.

May 6 2014, 10:13 am
Liquidweaver	Reply - thanks, I'll probably do that.

May 6 2014, 10:20 am
Lummox JR	DM does not use a standard lexical parser. (I'm not entirely sure how that would work with the indent-sensitive language, but presumably EBNF can manage.) Good luck working up the EBNF grammar. I'm very curious to see how it ends up.

May 6 2014, 10:45 am
Liquidweaver	Sure. You are completely correct that a lexer designed for a context-free grammar would not properly tokenize an "off-sides" language like DM. The indentation gives it context. I had to take a standard tokenizer and add state. Also, handling the nested bracket syntax in strings requires a stateful tokenizer.

May 8 2014, 3:12 am

Coincidentally I just dropped by to upload something to my member filespace.

Very simple lexing/parsing code I hacked up over a few hours here: http://www.byond.com/members/Jp/files/dreamcatcher.zip . Almost certainly awful and primitive, but might be worth looking at. Parses out types and variables from a DM file without procs, verbs, preprocessor statements, couple of other things probably.

This is a Scintilla lexer for DM: http://files.byondhome.com/Jp/dmlex/LexDM.cxx . That's for code highlighting and folding, not full-blown parsing, so it's much simpler and probably less useful.

IIRC my approach was a stateful Flex lexer - kept a count of how many indent levels I was at, when that increased generated an INDENT token, when it decreased generated a DEDENT token (and had braces generate those too). Had to have NEWLINE as a token, though, because otherwise distinguishing these two cases wasn't possible:

a/b

a
/b

Python lexing/grammar might be worth looking into. It was a thought I always had, but I was too lazy to do it.

May 22 2014, 9:02 am
N3X15	I'm working on a Bison/Flex parser as part of an open source C++ API I'm creating. https://github.com/N3X15/OpenBYOND/blob/dev/openbyond-core/ grammar

May 25 2014, 3:44 am

If I'm not very much mistaken, that grammar you're basing stuff on - the one you've credited to 'nan0desu' - is, in fact, the code I wrote all the way back in 2010. I've linked to my version of the files above, here's the blog post I wrote about it in 2010.

I don't mind too much - not only is the code in your openbyond-core very, very different by now, but the original grammar stuff I was fiddling with was pretty primitive, and also as far as I'm concerned it was public domain. I am a bit put out by nan0desu claiming to have written the code.

May 26 2014, 7:40 am
Laser50	Wouldn't calling such project "OpenBYOND" be some sort of a trademark issue or something?

Dec 3 2014, 6:20 pm

In response to Laser50

N3X15

Jp wrote:

If I'm not very much mistaken, that grammar you're basing stuff on - the one you've credited to 'nan0desu' - is, in fact, the code I wrote all the way back in 2010. I've linked to my version of the files above, here's the blog post I wrote about it in 2010.

I don't mind too much - not only is the code in your openbyond-core very, very different by now, but the original grammar stuff I was fiddling with was pretty primitive, and also as far as I'm concerned it was public domain. I am a bit put out by nan0desu claiming to have written the code.

My apologies, I'll correct that ASAP.

Laser50 wrote:

Wouldn't calling such project "OpenBYOND" be some sort of a trademark issue or something?

Probably. I will happy change the name of the project if BYOND asks. It was just a quick and easy name, since I have the creativity of a doorknob.