Skip to content

fix: replacing specific unicode characters with all unicode letters#30

Open
HackZers7 wants to merge 1 commit into
chrisbottin:masterfrom
HackZers7:fix/expanded-unicode
Open

fix: replacing specific unicode characters with all unicode letters#30
HackZers7 wants to merge 1 commit into
chrisbottin:masterfrom
HackZers7:fix/expanded-unicode

Conversation

@HackZers7
Copy link
Copy Markdown

What Changed

The closing-tag parsing regex was updated from:

^<\/[\w-:.\u00C0-\u00FF]+\s*>

to:

^<\/[\p{L}\w\-:.]+\s*>/u

The previous range \u00C0-\u00FF only covers a limited subset of Latin characters.
The new pattern uses \p{L} in Unicode mode (u), which properly supports letters from many writing systems and improves parsing for international tag names.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant