Serus VM

Welcome to my register-based virtual machine project!
This is a personal project aimed at deepening my understanding of virtual machines, programming language implementations, and Rust.

Happy coding! 🚀

TODOs

VM

Instructions

Every Instruction is 4 bytes, where the first byte is the Opcode, the next 3 bytes are for the operands. For instructions that have a "result" the second byte is the register to store the result.

RR = Result register
IO = integer operand

# 1 byte | 1 byte | 2 bytes
LOAD RR Operand

# 1 byte | 1 byte | 1 byte | 1 byte
ADD RR IO IO # Adds number in first IO to number in second IO and stores result in RR

# 1 byte | 1 byte | 1 byte | 1 byte
DIV RR IO IO # Divides number in first IO with number in second IO and stores result in RR

# 1 byte | 1 byte | 1 byte | 1 byte
MUL RR IO IO # Multiplies number in first IO with number in second IO and stores result in RR

# 1 byte | 1 byte | 1 byte | 1 byte
SUB RR IO IO # Subtracks number in first IO from number in second IO and stores result in RR

Bytecode Format

Byte 0-4: Magic number
Byte 5: Version number
Byte 6-63: Header section
Byte 64-71: Code Start section (This will point to at what byte the code section start)
Byte 72-199: Data section

Assembler

Lexing and Parsing

The lexer goes over all the source code and turns it into Tokens, lexer needs better error handling.
Parser groups Tokens into instructions. It also filter out LabelDeclarations to later be used to build up a symboltable

Grammar

EBNF representation of the grammar for the assembler


Program             ::= { LabelDeclaration | Instruction | Directive } .
LabelDeclaration    ::= identifier ":" .
Instruction         ::= opcode [LabelRef] | [operand] .
Directive           ::= "." identifier [operand] .

LabelRef            ::= "@" identifier ":" .
identifier          ::= letter { letter | digit } .
letter              ::= "a" | "b" | ... | "z" | "A" | "B" | ... | "Z" .
digit               ::= "0" | "1" | ... | "9" .
opcode              ::= "LOAD" | "ADD" | "DIV" | "MUL" | "SUB" | "HLT"
                        | "JMP" | "JMPB" | "JMPF" | "EQ" | "NEQ" | "GT"
                        | "LT" | "GTQ" | "LTQ" | "JEQ" | "JNEQ" | "ALOC"
                        | "INC" | "DEC" | "IGL" .
operand             ::= register | number | string .

register            ::= "$" (identifier | number) .
number              ::= "#" digit { digit } .
string              ::= "\"" {character} "\"" .

character           ::= letter | digit | special_character .
special_character   ::= " " | "!" | "#" | ... | "~" .

test1: LOAD $0 #100 // LabelDeclaration, Opcode, register, number
DJMP @test1 // Opcode, LabelRef

my_string: .asciiz "Hello world" // LabelDeclaration, Directive, string

LOAD $1 #10 // Opcode, register, number

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Serus VM

TODOs

VM

Instructions

Bytecode Format

Assembler

Lexing and Parsing

Grammar

About

Releases

Packages

Contributors 2

Languages

bjoroen/serus

Folders and files

Latest commit

History

Repository files navigation

Serus VM

TODOs

VM

Instructions

Bytecode Format

Assembler

Lexing and Parsing

Grammar

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages