README.md in unicode-scripts-1.5.0 vs README.md in unicode-scripts-1.6.0
- old
+ new
@@ -1,12 +1,12 @@
# Unicode::Scripts [![[version]](https://badge.fury.io/rb/unicode-scripts.svg)](https://badge.fury.io/rb/unicode-scripts) [![[travis]](https://travis-ci.org/janlelis/unicode-scripts.png)](https://travis-ci.org/janlelis/unicode-scripts)
Retrieve the [Unicode script(s)](https://en.wikipedia.org/wiki/Script_%28Unicode%29) a string belongs to. Can also return the *Script_Extension* property which is defined as characters which are "commonly used with more than one script, but with a limited number of scripts".
-Unicode version: **12.1.0** (May 2019)
+Unicode version: **13.0.0** (March 2020)
-Supported Rubies: **2.6**, **2.5**, **2.4**
+Supported Rubies: **2.7**, **2.6**, **2.5**, **2.4**
Old Rubies that might still work: **2.3**, **2.2**, **2.1**, **2.0**
## Gemfile
@@ -32,17 +32,14 @@
# => ["Bengali", "Devanagari", "Dogra", "Grantha", "Gujarati","Gunjala_Gondi", "Gurmukhi", "Kannada",
"Khudawadi", "Limbu", "Mahajani", "Malayalam", "Masaram_Gondi", "Nandinagari", "Oriya", "Sinhala",
"Syloti_Nagri", "Takri", "Tamil", "Telugu", "Tirhuta"]
```
-
-
-
## Hints
### Regex Matching
-If you have a string and want to match a substring/character from a specific Unicode script, you actually won't need this gem. Instead, you can use the [Regexp Unicode Property Syntax `\p{}`](http://ruby-doc.org/core/Regexp.html#class-Regexp-label-Character+Properties):
+If you have a string and want to match a substring/character from a specific Unicode script, you actually won't need this gem. Instead, you can use the [Regexp Unicode Property Syntax `\p{}`](https://ruby-doc.org/core/Regexp.html#class-Regexp-label-Character+Properties):
```ruby
"Coptic letter: ⲁ".scan(/\p{Coptic}/) # => ["ⲁ"]
```
@@ -79,17 +76,19 @@
Carian
Caucasian_Albanian
Chakma
Cham
Cherokee
+Chorasmian
Common
Coptic
Cuneiform
Cypriot
Cyrillic
Deseret
Devanagari
+Dives_Akuru
Dogra
Duployan
Egyptian_Hieroglyphs
Elbasan
Elymaic
@@ -118,10 +117,11 @@
Kannada
Katakana
Katakana_Or_Hiragana
Kayah_Li
Kharoshthi
+Khitan_Small_Script
Khmer
Khojki
Khudawadi
Lao
Latin
@@ -208,10 +208,11 @@
Ugaritic
Unknown
Vai
Wancho
Warang_Citi
+Yezidi
Yi
Zanabazar_Square
```
### Short Script Names
@@ -245,14 +246,16 @@
Cakm
Cans
Cari
Cham
Cher
+Chrs
Copt
Cprt
Cyrl
Deva
+Diak
Dogr
Dsrt
Dupl
Egyp
Elba
@@ -283,10 +286,11 @@
Kali
Kana
Khar
Khmr
Khoj
+Kits
Knda
Kthi
Lana
Laoo
Latn
@@ -373,10 +377,11 @@
Vaii
Wara
Wcho
Xpeo
Xsux
+Yezi
Yiii
Zanb
Zinh
Zyyy
Zzzz
@@ -384,7 +389,7 @@
See [unicode-x](https://github.com/janlelis/unicode-x) for more Unicode related micro libraries.
## MIT License
-- Copyright (C) 2016-2019 Jan Lelis <http://janlelis.com>. Released under the MIT license.
-- Unicode data: http://www.unicode.org/copyright.html#Exhibit1
+- Copyright (C) 2016-2020 Jan Lelis <https://janlelis.com>. Released under the MIT license.
+- Unicode data: https://www.unicode.org/copyright.html#Exhibit1