Update for Unicode 16
No released version of Go uses this Unicode version yet, so this doesn't
include any autoversion updates.
apparentlymart committed Oct 13, 2024
1 parent 34c68c2 commit 0377bdd
Showing 7 changed files with 3,214 additions and 2,929 deletions.
2 changes: 1 addition & 1 deletion go.mod
@@ -1,3 +1,3 @@
-module github.com/apparentlymart/go-textseg/v15
+module github.com/apparentlymart/go-textseg/v16

 go 1.16
18 changes: 12 additions & 6 deletions textseg/emoji_table.rl
@@ -1,5 +1,5 @@
 # The following Ragel file was autogenerated with unicode2ragel.rb
-# from: https://www.unicode.org/Public/15.0.0/ucd/emoji/emoji-data.txt
+# from: https://www.unicode.org/Public/16.0.0/ucd/emoji/emoji-data.txt
 #
 # It defines ["Extended_Pictographic"].
 #
@@ -513,27 +513,33 @@
 | 0xF0 0x9F 0xAA 0x80..0x82 #E12.0 [3] (🪀..🪂) yo-yo..parachute
 | 0xF0 0x9F 0xAA 0x83..0x86 #E13.0 [4] (🪃..🪆) boomerang..nesting ...
 | 0xF0 0x9F 0xAA 0x87..0x88 #E15.0 [2] (🪇..🪈) maracas..flute
-| 0xF0 0x9F 0xAA 0x89..0x8F #E0.0 [7] (🪉..🪏) <reserved-1FA89>..<...
+| 0xF0 0x9F 0xAA 0x89 #E16.0 [1] (🪉) harp
+| 0xF0 0x9F 0xAA 0x8A..0x8E #E0.0 [5] (🪊..🪎) <reserved-1FA8A>..<...
+| 0xF0 0x9F 0xAA 0x8F #E16.0 [1] (🪏) shovel
 | 0xF0 0x9F 0xAA 0x90..0x95 #E12.0 [6] (🪐..🪕) ringed planet..banjo
 | 0xF0 0x9F 0xAA 0x96..0xA8 #E13.0 [19] (🪖..🪨) military helmet..rock
 | 0xF0 0x9F 0xAA 0xA9..0xAC #E14.0 [4] (🪩..🪬) mirror ball..hamsa
 | 0xF0 0x9F 0xAA 0xAD..0xAF #E15.0 [3] (🪭..🪯) folding hand fan..k...
 | 0xF0 0x9F 0xAA 0xB0..0xB6 #E13.0 [7] (🪰..🪶) fly..feather
 | 0xF0 0x9F 0xAA 0xB7..0xBA #E14.0 [4] (🪷..🪺) lotus..nest with eggs
 | 0xF0 0x9F 0xAA 0xBB..0xBD #E15.0 [3] (🪻..🪽) hyacinth..wing
-| 0xF0 0x9F 0xAA 0xBE #E0.0 [1] (🪾) <reserved-1FABE>
+| 0xF0 0x9F 0xAA 0xBE #E16.0 [1] (🪾) leafless tree
 | 0xF0 0x9F 0xAA 0xBF #E15.0 [1] (🪿) goose
 | 0xF0 0x9F 0xAB 0x80..0x82 #E13.0 [3] (🫀..🫂) anatomical heart..p...
 | 0xF0 0x9F 0xAB 0x83..0x85 #E14.0 [3] (🫃..🫅) pregnant man..perso...
-| 0xF0 0x9F 0xAB 0x86..0x8D #E0.0 [8] (🫆..🫍) <reserved-1FAC6>..<...
+| 0xF0 0x9F 0xAB 0x86 #E16.0 [1] (🫆) fingerprint
+| 0xF0 0x9F 0xAB 0x87..0x8D #E0.0 [7] (🫇..🫍) <reserved-1FAC7>..<...
 | 0xF0 0x9F 0xAB 0x8E..0x8F #E15.0 [2] (🫎..🫏) moose..donkey
 | 0xF0 0x9F 0xAB 0x90..0x96 #E13.0 [7] (🫐..🫖) blueberries..teapot
 | 0xF0 0x9F 0xAB 0x97..0x99 #E14.0 [3] (🫗..🫙) pouring liquid..jar
 | 0xF0 0x9F 0xAB 0x9A..0x9B #E15.0 [2] (🫚..🫛) ginger root..pea pod
-| 0xF0 0x9F 0xAB 0x9C..0x9F #E0.0 [4] (🫜..🫟) <reserved-1FADC>..<...
+| 0xF0 0x9F 0xAB 0x9C #E16.0 [1] (🫜) root vegetable
+| 0xF0 0x9F 0xAB 0x9D..0x9E #E0.0 [2] (🫝..🫞) <reserved-1FADD>..<...
+| 0xF0 0x9F 0xAB 0x9F #E16.0 [1] (🫟) splatter
 | 0xF0 0x9F 0xAB 0xA0..0xA7 #E14.0 [8] (🫠..🫧) melting face..bubbles
 | 0xF0 0x9F 0xAB 0xA8 #E15.0 [1] (🫨) shaking face
-| 0xF0 0x9F 0xAB 0xA9..0xAF #E0.0 [7] (🫩..🫯) <reserved-1FAE9>..<...
+| 0xF0 0x9F 0xAB 0xA9 #E16.0 [1] (🫩) face with bags under eyes
+| 0xF0 0x9F 0xAB 0xAA..0xAF #E0.0 [6] (🫪..🫯) <reserved-1FAEA>..<...
 | 0xF0 0x9F 0xAB 0xB0..0xB6 #E14.0 [7] (🫰..🫶) hand with index fin...
 | 0xF0 0x9F 0xAB 0xB7..0xB8 #E15.0 [2] (🫷..🫸) leftwards pushing h...
 | 0xF0 0x9F 0xAB 0xB9..0xBF #E0.0 [7] (🫹..🫿) <reserved-1FAF9>..<...
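
Each row in this table is written as UTF-8 bytes rather than as a code point. As a rough illustration only (not part of the commit), the Go sketch below decodes the bytes of the "harp" row back to its code point. `decodeFirst` is a hypothetical helper name, and the sketch assumes nothing beyond the standard `unicode/utf8` package.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// decodeFirst is a hypothetical helper: it decodes the first UTF-8
// sequence in b and returns its code point, or false if b is empty
// or does not start with a valid encoding.
func decodeFirst(b []byte) (rune, bool) {
	r, size := utf8.DecodeRune(b)
	if r == utf8.RuneError && size <= 1 {
		return 0, false
	}
	return r, true
}

func main() {
	// Bytes copied from the "harp" row: 0xF0 0x9F 0xAA 0x89.
	harp := []byte{0xF0, 0x9F, 0xAA, 0x89}
	if r, ok := decodeFirst(harp); ok {
		fmt.Printf("U+%04X\n", r) // prints "U+1FA89"
	}
}
```

The result agrees with the `<reserved-1FA89>` label that the same position carried before this change.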
4 changes: 2 additions & 2 deletions textseg/generate.go
@@ -2,7 +2,7 @@ package textseg

 //go:generate go run make_tables.go -output tables.go
 //go:generate go run make_test_tables.go -output tables_test.go
-//go:generate ruby unicode2ragel.rb --url=https://www.unicode.org/Public/15.0.0/ucd/auxiliary/GraphemeBreakProperty.txt -m GraphemeCluster -p "Prepend,CR,LF,Control,Extend,Regional_Indicator,SpacingMark,L,V,T,LV,LVT,ZWJ" -o grapheme_clusters_table.rl
-//go:generate ruby unicode2ragel.rb --url=https://www.unicode.org/Public/15.0.0/ucd/emoji/emoji-data.txt -m Emoji -p "Extended_Pictographic" -o emoji_table.rl
+//go:generate ruby unicode2ragel.rb --url=https://www.unicode.org/Public/16.0.0/ucd/auxiliary/GraphemeBreakProperty.txt -m GraphemeCluster -p "Prepend,CR,LF,Control,Extend,Regional_Indicator,SpacingMark,L,V,T,LV,LVT,ZWJ" -o grapheme_clusters_table.rl
+//go:generate ruby unicode2ragel.rb --url=https://www.unicode.org/Public/16.0.0/ucd/emoji/emoji-data.txt -m Emoji -p "Extended_Pictographic" -o emoji_table.rl
 //go:generate ragel -Z grapheme_clusters.rl
 //go:generate gofmt -w grapheme_clusters.go
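
The two `unicode2ragel.rb` directives differ from their predecessors only in the version segment of the URL. As a loose sketch of that pattern, the URL can be derived from a version string. The helper name `ucdURL` is made up for illustration and does not exist in the repository; the directives themselves hard-code the URLs.

```go
package main

import "fmt"

// ucdURL is a hypothetical helper that builds a Unicode Character
// Database URL in the shape used by the go:generate directives.
func ucdURL(version, path string) string {
	return fmt.Sprintf("https://www.unicode.org/Public/%s/ucd/%s", version, path)
}

func main() {
	const version = "16.0.0" // the only value this commit changes in the URLs
	fmt.Println(ucdURL(version, "auxiliary/GraphemeBreakProperty.txt"))
	fmt.Println(ucdURL(version, "emoji/emoji-data.txt"))
}
```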