beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-22 15:40:19 +00:00

Author	SHA1	Message	Date
Linus Groh	69845ae460	LibJS: "-->" preceded by token on same line isn't start of HTML-like comment B.1.3 HTML-like Comments The syntax and semantics of 11.4 is extended as follows except that this extension is not allowed when parsing source code using the goal symbol Module: Syntax (only relevant part included) SingleLineHTMLCloseComment :: LineTerminatorSequence HTMLCloseComment HTMLCloseComment :: WhiteSpaceSequence[opt] SingleLineDelimitedCommentSequence[opt] --> SingleLineCommentChars[opt] Fixes #3810.	2020-10-29 22:28:15 +01:00
Linus Groh	15642874f3	LibJS: Support all line terminators (LF, CR, LS, PS) https://tc39.es/ecma262/#sec-line-terminators	2020-10-22 10:06:30 +02:00
Stephan Unverwerth	2c888b3c6e	LibJS: Fix parsing of invalid numeric literals i.e. "1e" "0x" "0b" "0o" used to be parsed as valid literals. They now produce invalid tokens. Fixes #3716	2020-10-18 15:38:57 +02:00
Matthew Olsson	61ac1d3ffa	LibJS: Lex and parse regex literals, add RegExp objects This adds regex parsing/lexing, as well as a relatively empty RegExpObject. The purpose of this patch is to allow the engine to not get hung up on parsing regexes. This will aid in finding new syntax errors (say, from google or twitter) without having to replace all of their regexes first!	2020-06-07 19:06:55 +02:00
Paul Redmond	11405c5139	LibJS: Fix incorrect token column values (#2401) - initializing m_line_column to 1 in the lexer results in incorrect column values in tokens on the first line of input. - not incrementing m_line_column when EOF is reached results in an incorrect column value on the last token.	2020-05-26 19:00:30 +02:00
Linus Groh	00b61a212f	LibJS: Remove syntax errors from lexer Giving the lexer the ability to generate errors adds unnecessary complexity - also it only calls its syntax_error() function in one place anyway ("unterminated string literal"). But since the lexer also emits tokens like Eof or UnterminatedStringLiteral, it should be up to the consumer of these tokens to decide what to do. Also remove the option to not print errors to stderr as that's not relevant anymore.	2020-05-15 09:53:52 +02:00
mattco98	adb4accab3	LibJS: Add template literals Adds fully functioning template literals. Because template literals contain expressions, most of the work has to be done in the Lexer rather than the Parser. And because of the complexity of template literals (expressions, nesting, escapes, etc), the Lexer needs to have some template-related state. When entering a new template literal, a TemplateLiteralStart token is emitted. When inside a literal, all text will be parsed up until a '${' or '`' (or EOF, but that's a syntax error) is seen, and then a TemplateLiteralExprStart token is emitted. At this point, the Lexer proceeds as normal, however it keeps track of the number of opening and closing curly braces it has seen in order to determine the close of the expression. Once it finds a matching curly brace for the '${', a TemplateLiteralExprEnd token is emitted and the state is updated accordingly. When the Lexer is inside of a template literal, but not an expression, and sees a '`', this must be the closing grave: a TemplateLiteralEnd token is emitted. The state required to correctly parse template strings consists of a vector (for nesting) of two pieces of information: whether or not we are in a template expression (as opposed to a template string); and the count of the number of unmatched open curly braces we have seen (only applicable if the Lexer is currently in a template expression). TODO: Add support for template literal newlines in the JS REPL (this will cause a syntax error currently): > `foo > bar` 'foo bar'	2020-05-04 16:46:31 +02:00
Stephan Unverwerth	9477efe970	LibJS: Handle HTML-style comments	2020-04-14 12:54:09 +02:00
AnotherTest	cdb627a516	LibJS: Allow lexer to run without logging errors	2020-04-05 16:11:13 +02:00
Stephan Unverwerth	500f6d9e3a	LibJS: Add numeric literal parsing for different bases and exponents	2020-04-05 16:01:22 +02:00
Brian Gianforcaro	dd112421b4	LibJS: Plumb line and column information through Lexer / Parser While debugging test failures, it's pretty frustrating to have to go do printf debugging to figure out what test is failing right now. While watching your JS Raytracer stream it seemed like this was pretty frustrating as well. So I wanted to start working on improving the diagnostics here. In the future I hope we can eventually plumb the info down to the Error classes so any thrown exceptions will contain enough metadata to know where they came from.	2020-04-05 12:43:39 +02:00
Stephan Unverwerth	c0e6234219	LibJS: Lex single quote strings, escaped chars and unterminated strings	2020-03-14 12:13:53 +01:00
Conrad Pankoff	e88f2f15ee	LibJS: Parse === and !== binary operators	2020-03-12 13:42:23 +01:00
Stephan Unverwerth	f3a9eba987	LibJS: Add Javascript lexer and parser This adds a basic Javascript lexer and parser. It can parse the currently existing demo programs. More work needs to be done to turn it into a complete parser that can parse arbitrary JS code. The lexer outputs tokens with preceding whitespace and comments in the trivia member. This should allow us to generate the exact source code by concatenating the generated tokens. The parser is written in a way that it always returns a complete syntax tree. Error conditions are represented as nodes in the tree. This simplifies the code and allows it to be used as an early stage parser, e.g. for parsing JS documents in an IDE while editing the source code.	2020-03-12 09:25:49 +01:00
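
Commit 2c888b3c6e above says that "1e", "0x", "0b" and "0o" used to lex as valid numeric literals and now produce invalid tokens. The check below is a minimal sketch of that rule, not the LibJS code: the function name is_valid_numeric_literal is hypothetical, and numeric separators, BigInt suffixes and legacy octal literals are deliberately left out.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper: returns true if `text` is a complete numeric literal.
// A base prefix or an exponent marker must be followed by at least one digit.
bool is_valid_numeric_literal(const std::string& text)
{
    const std::size_t n = text.size();
    if (n == 0)
        return false;

    // Prefixed forms: 0x..., 0b..., 0o... (case-insensitive prefix letter).
    if (n >= 2 && text[0] == '0') {
        const char prefix = static_cast<char>(std::tolower(static_cast<unsigned char>(text[1])));
        if (prefix == 'x' || prefix == 'b' || prefix == 'o') {
            if (n == 2)
                return false; // "0x", "0b", "0o": prefix without digits
            for (std::size_t i = 2; i < n; ++i) {
                const unsigned char c = static_cast<unsigned char>(text[i]);
                const bool ok = prefix == 'x' ? std::isxdigit(c) != 0
                              : prefix == 'b' ? (c == '0' || c == '1')
                                              : (c >= '0' && c <= '7');
                if (!ok)
                    return false;
            }
            return true;
        }
    }

    // Decimal form: digits [ '.' digits ] [ ('e' | 'E') [sign] digits ]
    std::size_t i = 0;
    std::size_t mantissa_digits = 0;
    while (i < n && std::isdigit(static_cast<unsigned char>(text[i]))) {
        ++i;
        ++mantissa_digits;
    }
    if (i < n && text[i] == '.') {
        ++i;
        while (i < n && std::isdigit(static_cast<unsigned char>(text[i]))) {
            ++i;
            ++mantissa_digits;
        }
    }
    if (mantissa_digits == 0)
        return false;

    if (i < n && (text[i] == 'e' || text[i] == 'E')) {
        ++i;
        if (i < n && (text[i] == '+' || text[i] == '-'))
            ++i;
        std::size_t exponent_digits = 0;
        while (i < n && std::isdigit(static_cast<unsigned char>(text[i]))) {
            ++i;
            ++exponent_digits;
        }
        if (exponent_digits == 0)
            return false; // "1e", "1e+": exponent marker without digits
    }
    return i == n; // any trailing characters make the literal invalid
}
```

With this sketch, is_valid_numeric_literal("1e") and is_valid_numeric_literal("0x") return false, while "1e5" and "0xFF" are accepted.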

14 commits
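
Commit adb4accab3 describes the lexer state for template literals as a vector (for nesting) of two fields: whether the lexer is inside a template expression, and the number of unmatched open curly braces seen in that expression. The following is a rough sketch of that bookkeeping under stated assumptions, not the LibJS implementation. The names TemplateState, TemplateText and lex_template_tokens are hypothetical; the other token names are taken from the commit message. Only template-related tokens are emitted, and braces inside string literals or comments within an expression are not handled.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

enum class TokenType {
    TemplateLiteralStart,
    TemplateText, // hypothetical name for the raw text between delimiters
    TemplateLiteralExprStart,
    TemplateLiteralExprEnd,
    TemplateLiteralEnd,
};

// One entry per template literal that is currently open (innermost last).
struct TemplateState {
    bool in_expression = false; // inside ${ ... } rather than template text
    int open_brace_count = 0;   // unmatched '{' seen inside the expression
};

std::vector<TokenType> lex_template_tokens(const std::string& source)
{
    std::vector<TokenType> tokens;
    std::vector<TemplateState> states;
    const std::size_t n = source.size();
    std::size_t i = 0;

    auto at_expr_start = [&](std::size_t pos) {
        return source[pos] == '$' && pos + 1 < n && source[pos + 1] == '{';
    };

    while (i < n) {
        const char c = source[i];
        const bool in_text = !states.empty() && !states.back().in_expression;

        if (in_text) {
            if (c == '`') {
                // A grave while in template text closes the innermost literal.
                tokens.push_back(TokenType::TemplateLiteralEnd);
                assert(!states.empty());
                states.pop_back();
                ++i;
            } else if (at_expr_start(i)) {
                tokens.push_back(TokenType::TemplateLiteralExprStart);
                states.back().in_expression = true;
                i += 2;
            } else {
                // Consume text up to the next '`' or "${", skipping escapes.
                while (i < n && source[i] != '`' && !at_expr_start(i)) {
                    if (source[i] == '\\' && i + 1 < n)
                        ++i;
                    ++i;
                }
                tokens.push_back(TokenType::TemplateText);
            }
            continue;
        }

        // Ordinary code, possibly inside a template expression.
        if (c == '`') {
            tokens.push_back(TokenType::TemplateLiteralStart);
            states.push_back(TemplateState {});
        } else if (!states.empty()) {
            TemplateState& state = states.back();
            if (c == '{') {
                ++state.open_brace_count;
            } else if (c == '}') {
                if (state.open_brace_count == 0) {
                    // This brace matches the "${" that opened the expression.
                    tokens.push_back(TokenType::TemplateLiteralExprEnd);
                    state.in_expression = false;
                } else {
                    --state.open_brace_count;
                }
            }
        }
        ++i;
    }
    return tokens;
}
```

For the input `a${b}c` (including the surrounding graves), this sketch yields TemplateLiteralStart, TemplateText, TemplateLiteralExprStart, TemplateLiteralExprEnd, TemplateText, TemplateLiteralEnd. Using a vector rather than a single flag is what lets a template literal nested inside an expression be closed without losing track of the outer one.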