README.md in unicode-scripts-1.9.0 vs README.md in unicode-scripts-1.10.0
- old
+ new
@@ -1,14 +1,14 @@
# Unicode::Scripts [![[version]](https://badge.fury.io/rb/unicode-scripts.svg)](https://badge.fury.io/rb/unicode-scripts) [![[ci]](https://github.com/janlelis/unicode-scripts/workflows/Test/badge.svg)](https://github.com/janlelis/unicode-scripts/actions?query=workflow%3ATest)
Retrieve the [Unicode script(s)](https://en.wikipedia.org/wiki/Script_%28Unicode%29) a string belongs to. It can also return the *Script_Extension* property, which covers characters that are "commonly used with more than one script, but with a limited number of scripts".
-Unicode version: **15.1.0** (September 2023)
+Unicode version: **16.0.0** (September 2024)
-Supported Rubies: **3.2**, **3.1**, **3.0**
+Supported Rubies: **3.3**, **3.2**, **3.1**, **3.0**
-Old Rubies that might still work: **2.X**
+Old Rubies that might still work: **2.7**, **2.6**, **2.5**, **2.4**, **2.3**, **2.X**
## Gemfile
```ruby
gem "unicode-scripts"
@@ -27,13 +27,13 @@
# Single character
Unicode::Scripts.script("ᴦ") # => "Greek"
# Script_Extension property
Unicode::Scripts.script_extensions("॥")
-# => ["Bengali", "Devanagari", "Dogra", "Grantha", "Gujarati","Gunjala_Gondi", "Gurmukhi", "Kannada",
-"Khudawadi", "Limbu", "Mahajani", "Malayalam", "Masaram_Gondi", "Nandinagari", "Oriya", "Sinhala",
-"Syloti_Nagri", "Takri", "Tamil", "Telugu", "Tirhuta"]
+# => ["Bengali", "Devanagari", "Dogra", "Grantha", "Gujarati", "Gunjala_Gondi", "Gurmukhi", "Gurung_Khema",
+ "Kannada", "Khudawadi", "Limbu", "Mahajani", "Malayalam", "Masaram_Gondi", "Nandinagari", "Ol_Onal",
+ "Oriya", "Sinhala", "Syloti_Nagri", "Takri", "Tamil", "Telugu", "Tirhuta"]
```
## Hints
### Regex Matching
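As a minimal sketch of script-based regex matching, Ruby's built-in `\p{...}` property syntax can test a character against a known script name without loading the gem. The helper `first_matching_script` and the `CANDIDATE_SCRIPTS` list are hypothetical names used for illustration and are not part of unicode-scripts. Which script names the regex engine accepts depends on the Unicode version bundled with the running Ruby, so newly added scripts may not be recognized there.

```ruby
# Hypothetical candidate list; extend it with any script name
# that the running Ruby's regex engine recognizes.
CANDIDATE_SCRIPTS = %w[Latin Greek Cyrillic Devanagari].freeze

# Returns the first candidate script whose \p{...} property matches
# the given single-character string, or nil if none matches.
def first_matching_script(char, candidates = CANDIDATE_SCRIPTS)
  candidates.find do |name|
    Regexp.new("\\A\\p{#{name}}\\z").match?(char)
  end
end

p first_matching_script("α") # => "Greek"
p first_matching_script("a") # => "Latin"
p first_matching_script("1") # => nil (digits belong to the Common script)
```

This only answers "which of these candidates matches"; it does not enumerate all scripts or return Script_Extension data the way the gem does.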
@@ -92,18 +92,20 @@
Duployan
Egyptian_Hieroglyphs
Elbasan
Elymaic
Ethiopic
+Garay
Georgian
Glagolitic
Gothic
Grantha
Greek
Gujarati
Gunjala_Gondi
Gurmukhi
+Gurung_Khema
Han
Hangul
Hanifi_Rohingya
Hanunoo
Hatran
@@ -123,10 +125,11 @@
Kharoshthi
Khitan_Small_Script
Khmer
Khojki
Khudawadi
+Kirat_Rai
Lao
Latin
Lepcha
Limbu
Linear_A
@@ -160,10 +163,11 @@
Nko
Nushu
Nyiakeng_Puachue_Hmong
Ogham
Ol_Chiki
+Ol_Onal
Old_Hungarian
Old_Italic
Old_North_Arabian
Old_Permic
Old_Persian
@@ -191,10 +195,11 @@
Sinhala
Sogdian
Sora_Sompeng
Soyombo
Sundanese
+Sunuwar
Syloti_Nagri
Syriac
Tagalog
Tagbanwa
Tai_Le
@@ -208,11 +213,13 @@
Thaana
Thai
Tibetan
Tifinagh
Tirhuta
+Todhri
Toto
+Tulu_Tigalari
Ugaritic
Unknown
Vai
Vithkuqi
Wancho
@@ -221,11 +228,10 @@
Yi
Zanabazar_Square
```
### Short Script Names
-
You can extract all four-letter script names from the gem like this:
```ruby
require "unicode/scripts"
puts Unicode::Scripts.names(format: :short)
@@ -267,18 +273,20 @@
Dupl
Egyp
Elba
Elym
Ethi
+Gara
Geor
Glag
Gong
Gonm
Goth
Gran
Grek
Gujr
+Gukh
Guru
Hang
Hani
Hano
Hatr
@@ -297,10 +305,11 @@
Khar
Khmr
Khoj
Kits
Knda
+Krai
Kthi
Lana
Laoo
Latn
Lepc
@@ -333,10 +342,11 @@
Newa
Nkoo
Nshu
Ogam
Olck
+Onao
Orkh
Orya
Osge
Osma
Ougr
@@ -366,10 +376,11 @@
Sogd
Sogo
Sora
Soyo
Sund
+Sunu
Sylo
Syrc
Tagb
Takr
Tale
@@ -383,11 +394,13 @@
Thaa
Thai
Tibt
Tirh
Tnsa
+Todr
Toto
+Tutg
Ugar
Vaii
Vith
Wara
Wcho
@@ -403,7 +416,7 @@
See [unicode-x](https://github.com/janlelis/unicode-x) for more Unicode related micro libraries.
## MIT License
-- Copyright (C) 2016-2023 Jan Lelis <https://janlelis.com>. Released under the MIT license.
+- Copyright (C) 2016-2024 Jan Lelis <https://janlelis.com>. Released under the MIT license.
- Unicode data: https://www.unicode.org/copyright.html#Exhibit1