README.md in unicode-scripts-1.3.0 vs README.md in unicode-scripts-1.4.0
- old
+ new
@@ -1,14 +1,14 @@
-# Unicode::Scripts [![[version]](https://badge.fury.io/rb/unicode-scripts.svg)](http://badge.fury.io/rb/unicode-scripts) [![[travis]](https://travis-ci.org/janlelis/unicode-scripts.png)](https://travis-ci.org/janlelis/unicode-scripts)
+# Unicode::Scripts [![[version]](https://badge.fury.io/rb/unicode-scripts.svg)](https://badge.fury.io/rb/unicode-scripts) [![[travis]](https://travis-ci.org/janlelis/unicode-scripts.png)](https://travis-ci.org/janlelis/unicode-scripts)
Retrieve the [Unicode script(s)](https://en.wikipedia.org/wiki/Script_%28Unicode%29) a string belongs to. Can also return the *Script_Extension* property which is defined as characters which are "commonly used with more than one script, but with a limited number of scripts".
-Unicode version: **11.0.0**
+Unicode version: **12.0.0** (March 2019)
-Supported Rubies: **2.5**, **2.4**, **2.3**
+Supported Rubies: **2.6**, **2.5**, **2.4**
-Old Rubies that might still work: **2.2**, **2.1**, **2.0**
+Old Rubies that might still work: **2.3**, **2.2**, **2.1**, **2.0**
## Gemfile
```ruby
gem "unicode-scripts"
@@ -27,25 +27,29 @@
# Single character
Unicode::Scripts.script("ᴦ") # => "Greek"
# Script_Extension property
Unicode::Scripts.script_extensions("॥")
-# => ["Bengali", "Devanagari", "Dogra", "Grantha", "Gujarati",
-"Gunjala_Gondi", "Gurmukhi", "Kannada", "Khudawadi", "Limbu",
-"Mahajani", "Malayalam", "Oriya", "Sinhala", "Syloti_Nagri",
-"Takri", "Tamil", "Telugu", "Tirhuta"]
+# => ["Bengali", "Devanagari", "Dogra", "Grantha", "Gujarati","Gunjala_Gondi", "Gurmukhi", "Kannada",
+"Khudawadi", "Limbu", "Mahajani", "Malayalam", "Masaram_Gondi", "Nandinagari", "Oriya", "Sinhala",
+"Syloti_Nagri", "Takri", "Tamil", "Telugu", "Tirhuta"]
```
+
+
+
## Hints
### Regex Matching
-If you have a string and want to match a substring/character from a specific Unicode script, you actually won't need this gem. Instead, you can use the [Regexp Unicode Property Syntax `\p{}`](http://ruby-doc.org/core-2.3.0/Regexp.html#class-Regexp-label-Character+Properties):
+If you have a string and want to match a substring/character from a specific Unicode script, you actually won't need this gem. Instead, you can use the [Regexp Unicode Property Syntax `\p{}`](http://ruby-doc.org/core/Regexp.html#class-Regexp-label-Character+Properties):
```ruby
"Coptic letter: ⲁ".scan(/\p{Coptic}/) # => ["ⲁ"]
```
+See [Idiosyncratic Ruby: Proper Unicoding](https://idiosyncratic-ruby.com/41-proper-unicoding.html) for more info.
+
### Script Names
You can extract all script names from the gem like this:
```ruby
@@ -86,10 +90,11 @@
Devanagari
Dogra
Duployan
Egyptian_Hieroglyphs
Elbasan
+Elymaic
Ethiopic
Georgian
Glagolitic
Gothic
Grantha
@@ -144,14 +149,16 @@
Mongolian
Mro
Multani
Myanmar
Nabataean
+Nandinagari
New_Tai_Lue
Newa
Nko
Nushu
+Nyiakeng_Puachue_Hmong
Ogham
Ol_Chiki
Old_Hungarian
Old_Italic
Old_North_Arabian
@@ -199,10 +206,11 @@
Tifinagh
Tirhuta
Ugaritic
Unknown
Vai
+Wancho
Warang_Citi
Yi
Zanabazar_Square
```
@@ -246,10 +254,11 @@
Dogr
Dsrt
Dupl
Egyp
Elba
+Elym
Ethi
Geor
Glag
Gong
Gonm
@@ -264,10 +273,11 @@
Hatr
Hebr
Hira
Hluw
Hmng
+Hmnp
Hrkt
Hung
Ital
Java
Kali
@@ -301,10 +311,11 @@
Mong
Mroo
Mtei
Mult
Mymr
+Nand
Narb
Nbat
Newa
Nkoo
Nshu
@@ -359,10 +370,11 @@
Tibt
Tirh
Ugar
Vaii
Wara
+Wcho
Xpeo
Xsux
Yiii
Zanb
Zinh
@@ -372,7 +384,7 @@
See [unicode-x](https://github.com/janlelis/unicode-x) for more Unicode related micro libraries.
## MIT License
-- Copyright (C) 2016-2018 Jan Lelis <http://janlelis.com>. Released under the MIT license.
+- Copyright (C) 2016-2019 Jan Lelis <http://janlelis.com>. Released under the MIT license.
- Unicode data: http://www.unicode.org/copyright.html#Exhibit1