serenity - The Serenity Operating System 🐞

Age	Commit message (Collapse)	Author
2020-11-25	LibJS: Fix possible OOB read during Lexer construction	Linus Groh
	The Lexer constructor calls consume() once, which initializes m_position to be > 0 and sets m_character. consume() calls is_line_terminator(), which wasn't accounting for this state.
2020-10-29	LibJS: "-->" preceded by token on same line isn't start of HTML-like comment	Linus Groh
	B.1.3 HTML-like Comments The syntax and semantics of 11.4 is extended as follows except that this extension is not allowed when parsing source code using the goal symbol Module: Syntax (only relevant part included) SingleLineHTMLCloseComment :: LineTerminatorSequence HTMLCloseComment HTMLCloseComment :: WhiteSpaceSequence[opt] SingleLineDelimitedCommentSequence[opt] --> SingleLineCommentChars[opt] Fixes #3810.
2020-10-26	LibJS: Emit token message for invalid numeric literals	Linus Groh

2020-10-26	LibJS: Emit TokenType::Invalid for unterminated multi-line comments	Linus Groh

2020-10-26	LibJS: Add message string to Token	Linus Groh
	This allows us to communicate details about invalid tokens to the parser without having to invent a bunch of specific invalid tokens like TokenType::InvalidNumericLiteral.
2020-10-22	LibJS: Support all line terminators (LF, CR, LS, PS)	Linus Groh
	https://tc39.es/ecma262/#sec-line-terminators
2020-10-19	LibJS: Unprefixed octal numbers are a syntax error in strict mode	Linus Groh

2020-10-18	LibJS: Fix parsing of invalid numeric literals	Stephan Unverwerth
	i.e. "1e" "0x" "0b" "0o" used to be parsed as valid literals. They now produce invalid tokens. Fixes #3716
2020-10-17	LibJS: Avoid creating temporary Strings to look up tokens while lexing	Andreas Kling
	It would be cool to solve this in a general way so that looking up a string literal or StringView in a HashMap with String keys avoids creating a temp string. For now, this patch simply addresses the issue in JS::Lexer. This is a 2-3% speed-up on test-js.
2020-10-05	LibJS: Implement logical assignment operators (&&=, \|\|=, ??=)	Linus Groh
	TC39 proposal, stage 4 as of 2020-07. https://tc39.es/proposal-logical-assignment/
2020-09-12	LibJS: Fix start position of multi-line tokens	Ben Wiederhake
	This broke in case of unterminated regular expressions, causing goofy location numbers, and 'source_location_hint' to eat up all memory: Unexpected token UnterminatedRegexLiteral. Expected statement (line: 2, column: 4294967292)
2020-06-08	LibJS: Move regex logic to main Lexer if statement	Matthew Olsson
	This prevents a regex such as /=/ from lexing into TokenType::SlashEquals, preventing the regex logic from working.
2020-06-08	LibJS: Properly consume escaped backslash in regex literal	Matthew Olsson

2020-06-07	LibJS: Fix big int division lexing as UnterminatedRegexLiteral	Matthew Olsson

2020-06-07	LibJS: Add BigInt	Linus Groh

2020-06-07	LibJS: Lex and parse regex literals, add RegExp objects	Matthew Olsson
	This adds regex parsing/lexing, as well as a relatively empty RegExpObject. The purpose of this patch is to allow the engine to not get hung up on parsing regexes. This will aid in finding new syntax errors (say, from google or twitter) without having to replace all of their regexes first!
2020-05-26	LibJS: Fix incorrect token column values (#2401)	Paul Redmond
	- initializing m_line_column to 1 in the lexer results in incorrect column values in tokens on the first line of input. - not incrementing m_line_column when EOF is reached results in an incorrect column value on the last token.
2020-05-15	LibJS: Remove syntax errors from lexer	Linus Groh
	Giving the lexer the ability to generate errors adds unnecessary complexity - also it only calls its syntax_error() function in one place anyway ("unterminated string literal"). But since the lexer also emits tokens like Eof or UnterminatedStringLiteral, it should be up to the consumer of these tokens to decide what to do. Also remove the option to not print errors to stderr as that's not relevant anymore.
2020-05-12	LibJS: Add missing keywords/tokens	Linus Groh
	Some of these are required for syntax we have not implemented yet, some are future reserved words in strict mode.
2020-05-05	LibJS: Implement exponentiation assignment operator (**=)	Linus Groh

2020-05-05	LibJS: Implement bitwise assignment operators (&=, \|=, ^=)	Linus Groh

2020-05-04	LibJS: Add template literals	mattco98
	Adds fully functioning template literals. Because template literals contain expressions, most of the work has to be done in the Lexer rather than the Parser. And because of the complexity of template literals (expressions, nesting, escapes, etc), the Lexer needs to have some template-related state. When entering a new template literal, a TemplateLiteralStart token is emitted. When inside a literal, all text will be parsed up until a '${' or '`' (or EOF, but that's a syntax error) is seen, and then a TemplateLiteralExprStart token is emitted. At this point, the Lexer proceeds as normal, however it keeps track of the number of opening and closing curly braces it has seen in order to determine the close of the expression. Once it finds a matching curly brace for the '${', a TemplateLiteralExprEnd token is emitted and the state is updated accordingly. When the Lexer is inside of a template literal, but not an expression, and sees a '`', this must be the closing grave: a TemplateLiteralEnd token is emitted. The state required to correctly parse template strings consists of a vector (for nesting) of two pieces of information: whether or not we are in a template expression (as opposed to a template string); and the count of the number of unmatched open curly braces we have seen (only applicable if the Lexer is currently in a template expression). TODO: Add support for template literal newlines in the JS REPL (this will cause a syntax error currently): > `foo > bar` 'foo bar'
2020-05-01	LibJS: Implement (no-op) debugger statement	Linus Groh

2020-04-27	LibJS: Add spreading in array literals	mattco98
	Implement the syntax and behavor necessary to support array literals such as [...[1, 2, 3]]. A type error is thrown if the target of the spread operator does not evaluate to an array (though it should eventually just check for an iterable). Note that the spread token's name is TripleDot, since the '...' token is used for two features: spread and rest. Calling it anything involving 'spread' or 'rest' would be a bit confusing.
2020-04-24	LibJS: Add TokenType::TemplateLiteral	Linus Groh
	This is required for template literals - we're not quite there yet, but at least the parser can now tell us when this token is encountered - currently this yields "Unexpected token Invalid". Not really helpful. The character is a "backtick", but as we already have TokenType::{StringLiteral,RegexLiteral} this seemed like a fitting name. This also enables syntax highlighting for template literals in the js REPL and LibGUI's JSSyntaxHighlighter.
2020-04-14	LibJS: Handle HTML-style comments	Stephan Unverwerth

2020-04-13	LibJS: Parse "this" as ThisExpression	Stephan Unverwerth

2020-04-05	LibJS: Report the start position of a token as its line column	AnotherTest

2020-04-05	LibJS: Allow lexer to run without logging errors	AnotherTest

2020-04-05	LibJS: Add numeric literal parsing for different bases and exponents	Stephan Unverwerth

2020-04-05	LibJS: Plumb line and column information through Lexer / Parser	Brian Gianforcaro
	While debugging test failures, it's pretty frustrating to have to go do printf debugging to figure out what test is failing right now. While watching your JS Raytracer stream it seemed like this was pretty furstrating as well. So I wanted to start working on improving the diagnostics here. In the future I hope we can eventually be able to plumb the info down to the Error classes so any thrown exceptions will contain enough metadata to know where they came from.
2020-04-05	LibJS: Add support for "continue" inside "for" statements :^)	Andreas Kling

2020-04-04	LibJS: Hack the lexer to allow numbers with decimals	Andreas Kling
	This is very hackish and should definitely be improved. :^)
2020-04-03	LibJS: Remove UndefinedLiteral, add undefined to global object	Linus Groh
	There is no such thing as a "undefined literal" in JS - undefined is just a property on the global object with a value of undefined. This is pretty similar to NaN. var undefined = "foo"; is a perfectly fine AssignmentExpression :^)
2020-03-30	LibJS: Add support for arrow functions	Jack Karamanian

2020-03-29	LibJS: Lexer and parser support for "switch" statements	Andreas Kling

2020-03-24	LibJS: Implement "throw"	Andreas Kling
	You can now throw an expression to the nearest catcher! :^) To support throwing arbitrary values, I added an Exception class that sits as a wrapper around whatever is thrown. In the future it will be a logical place to store a call stack.
2020-03-23	LibJS: Teach the lexer to recognize ">=" and "<=" :^)	Andreas Kling

2020-03-21	LibJS: Parse object expressions	0xtechnobabble

2020-03-16	LibJS: Implement null and undefined literals	0xtechnobabble

2020-03-14	LibJS: Lex single quote strings, escaped chars and unterminated strings	Stephan Unverwerth

2020-03-14	LibJS: Add operator precedence parsing	Stephan Unverwerth
	Obey precedence and associativity rules when parsing expressions with chained operators.
2020-03-13	LibJS: Fix endless loop in string lexing	Oriko

2020-03-13	LibJS: Fix lexing of the last character in a file	Stephan Unverwerth
	Before this commit the last character in a file would be swallowed. This also fixes parsing of empty files which would previously ASSERT.
2020-03-12	LibJS: Fix some coding style mistakes in Lexer	Andreas Kling

2020-03-12	LibJS: Implement for statement	Conrad Pankoff

2020-03-12	LibJS: Parse === and !== binary operators	Conrad Pankoff

2020-03-12	LibJS: Implement basic lexing + parsing of StringLiteral	Andreas Kling
	This still includes the double-quote characters (") but at least the AST comes out right.
2020-03-12	LibJS: Add Javascript lexer and parser	Stephan Unverwerth
	This adds a basic Javascript lexer and parser. It can parse the currently existing demo programs. More work needs to be done to turn it into a complete parser than can parse arbitrary JS Code. The lexer outputs tokens with preceeding whitespace and comments in the trivia member. This should allow us to generate the exact source code by concatenating the generated tokens. The parser is written in a way that it always returns a complete syntax tree. Error conditions are represented as nodes in the tree. This simplifies the code and allows it to be used as an early stage parser, e.g for parsing JS documents in an IDE while editing the source code.: