tre - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	Use new parser and DFA compiler	Ryo Nihei	2021-12-10	1	-862/+0
\|
*	Move UTF8-related processes to utf8 package	Ryo Nihei	2021-12-01	1	-591/+17
\|
*	Make contributory properties unavailable except internal use	Ryo Nihei	2021-11-28	1	-1/+30
\| \| \| \| \| \| \| \| \| \| \| \|	This change follows [UAX #44 5.13 Property APIs]. > The following subtypes of Unicode character properties should generally not be exposed in APIs, > except in limited circumstances. They may not be useful, particularly in public API collections, > and may instead prove misleading to the users of such API collections. > > * Contributory properties are not recommended for public APIs. > ... https://unicode.org/reports/tr44/#Property_APIs
*	Move all UCD-related processes to ucd package	Ryo Nihei	2021-11-27	1	-4/+5
\|
*	Make character properties available in an inverse expression (Make ↵	Ryo Nihei	2021-11-25	1	-0/+4
\| \| \| \|	[^\p{...}] available)
*	Support Lowercase and Uppercase property (Meet RL1.2 of UTS #18 partially)	Ryo Nihei	2021-11-25	1	-21/+34
\|
*	Support White_Space property (Meet RL1.2 of UTS #18 partially)	Ryo Nihei	2021-11-24	1	-6/+24
\|
*	Keep the order of AST nodes constant	Ryo Nihei	2021-09-22	1	-13/+23
\|
*	Change APIs	Ryo Nihei	2021-08-01	1	-5/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Change fields of tokens, results of lexical analysis, as follows: - Rename: mode -> mode_id - Rename: kind_id -> mode_kind_id - Add: kind_id The kind ID is unique across all modes, but the mode kind ID is unique only within a mode. Change fields of a transition table as follows: - Rename: initial_mode -> initial_mode_id - Rename: modes -> mode_names - Rename: kinds -> kind_names - Rename: specs[].kinds -> specs[].kind_names - Rename: specs[].dfa.initial_state -> specs[].dfa.initial_state_id Change public types defined in the spec package as follows: - Rename: LexModeNum -> LexModeID - Rename: LexKind -> LexKindName - Add: LexKindID - Add: StateID
*	Add fragment expression	Ryo Nihei	2021-05-25	1	-29/+158
\| \| \| \|	A fragment entry is defined by an entry whose `fragment` field is `true`, and is referenced by a fragment expression (`\f{...}`).
*	Fix parser to recognize property expressions in bracket expressions	Ryo Nihei	2021-05-02	1	-0/+3
\|
*	Add character property expression (Meet RL1.2 of UTS #18 partially)	Ryo Nihei	2021-04-30	1	-1/+54
\| \| \| \| \| \| \| \| \| \|	\p{property name=property value} matches a character has the property. When the property name is General_Category, it can be omitted. That is, \p{Letter} equals \p{General_Category=Letter}. Currently, only General_Category is supported. This feature meets RL1.2 of UTS #18 partially. RL1.2 Properties: https://unicode.org/reports/tr18/#RL1.2
*	Add code point expression (Meet RL1.1 of UTS #18)	Ryo Nihei	2021-04-24	1	-0/+55
\| \| \| \| \| \| \| \|	\u{hex string} matches a character has the code point represented by the hex string. For instance, \u{3042} matches hiragana あ (U+3042). The hex string must have 4 or 6 digits. This feature meets RL1.1 of UTS #18. RL1.1 Hex Notation: https://unicode.org/reports/tr18/#RL1.1
*	Change the lexical specs of regexp and define concrete syntax error values	Ryo Nihei	2021-04-17	1	-44/+142
\| \| \| \| \|	* Make the lexer treat ']' as an ordinary character in default mode * Define values of the syntax error type that represents error information concretely
*	Increase the maximum number of symbol positions per pattern	Ryo Nihei	2021-04-12	1	-1/+4
\| \| \| \| \|	This commit increases the maximum number of symbol positions per pattern to 2^15 (= 32,768). When the limit is exceeded, the parse method returns an error.
*	Fix grammar the parser accepts	Ryo Nihei	2021-04-11	1	-35/+68
\| \| \| \| \|	* Add cases test the parse method. * Fix the parser to pass the cases.
*	Add logical inverse expression	Ryo Nihei	2021-04-01	1	-6/+91
\| \| \| \|	[^a-z] matches any character that is not in the range a-z.
*	Pass values in error type to panic()	Ryo Nihei	2021-03-07	1	-2/+2
\| \| \| \|	Because parser.parse() expects that recover() returns a value in error type, apply this change.
*	Refactoring	Ryo Nihei	2021-02-25	1	-449/+301
\| \| \| \| \| \|	* Remove token field from symbolNode * Simplify notation of nested nodes * Simplify arguments of newSymbolNode()
*	Add range expression	Ryo Nihei	2021-02-24	1	-4/+705
\| \| \| \|	[a-z] matches any one character from a to z. The order of the characters depends on Unicode code points.
*	Add + and ? operators	Ryo Nihei	2021-02-20	1	-3/+9
\| \| \| \| \|	* a+ matches 'a' one or more times. This is equivalent to aa. a? matches 'a' zero or one time.
*	Add bracket expression matching specified character	Ryo Nihei	2021-02-14	1	-6/+25
\| \| \| \|	The bracket expression matches any single character specified in it. In the bracket expression, the special characters like ., *, and so on are also handled as normal characters.
*	Add dot symbol matching any single character	Ryo Nihei	2021-02-14	1	-3/+104
\| \| \| \| \| \| \| \| \|	The dot symbol matches any single character. When the dot symbol appears, the parser generates an AST matching all of the well-formed UTF-8 byte sequences. Refelences: * https://www.unicode.org/versions/Unicode13.0.0/ch03.pdf#G7404 * Table 3-6. UTF-8 Bit Distribution * Table 3-7. Well-Formed UTF-8 Byte Sequences
*	Add compiler	Ryo Nihei	2021-02-14	1	-0/+221
	The compiler takes a lexical specification expressed by regular expressions and generates a DFA accepting the tokens. Operators that you can use in the regular expressions are concatenation, alternation, repeat, and grouping.