| rivo/uniseg |
500 |
|
31 |
7,628 |
about 2 years ago |
17 |
February 21, 2023 |
2 |
mit |
Go |
| Unicode Text Segmentation, Word Wrapping, and String Width Calculation in Go |
| unicode-rs/unicode-segmentation |
496 |
|
2,864 |
511 |
over 2 years ago |
21 |
January 31, 2023 |
26 |
other |
Rust |
| Grapheme Cluster and Word boundaries according to UAX#29 rules |
| tonton-pixel/unicopedia-plus |
144 |
|
0 |
0 |
over 2 years ago |
0 |
|
0 |
mit |
JavaScript |
| Developer-oriented set of Unicode, Unihan & emoji utilities wrapped into one single app, built with Electron. |
| tc39/proposal-intl-segmenter |
118 |
|
0 |
0 |
about 4 years ago |
0 |
|
12 |
|
HTML |
| Unicode text segmentation for ECMAScript |
| blevesearch/segment |
74 |
|
165 |
260 |
over 3 years ago |
3 |
December 19, 2022 |
5 |
apache-2.0 |
Go |
| A Go library for performing Unicode Text Segmentation as described in Unicode Standard Annex #29 |
| kipcole9/text |
61 |
|
0 |
2 |
over 3 years ago |
2 |
June 29, 2020 |
1 |
other |
Elixir |
| Text detection and processing for Elixir |
| tapeinosyne/hyphenation |
48 |
|
7 |
11 |
about 2 years ago |
18 |
August 19, 2021 |
6 |
apache-2.0 |
Rust |
| Text hyphenation for Rust |
| clipperhouse/uax29 |
35 |
|
0 |
6 |
over 2 years ago |
40 |
May 26, 2023 |
1 |
mit |
Go |
| A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes. |
| dbuenzli/uuseg |
21 |
|
0 |
0 |
over 2 years ago |
0 |
|
2 |
isc |
OCaml |
| Unicode text segmentation for OCaml |
| cldf/segments |
17 |
|
13 |
9 |
almost 4 years ago |
17 |
July 08, 2022 |
6 |
apache-2.0 |
Python |
| Unicode Standard tokenization routines and orthography profile segmentation |