Digital Language Vitality | Upper Sorbian

Upper Sorbian (hsb)



English name Upper Sorbian
Native name Hornjoserbsce
SIL code hsb
Alternative names bautzen, eastern sorbian, haut sorabe, hornjoserbsce, hornjoserbski, hornjoserbšćina, hornoserbski, kamenz, lusatian upper, obersorbisch, sorbian, sorbian upper, sorbianupper, upper, wendisch, wendish
Speakers L1: 13,490 (ethnologue),
L1: 16,540 (aggregate),
L1: 18,240 (Ethnologue: Languages of the World, 16th Edition (2009)” . M. Paul Lewis · SIL Interna),
L1: 15,000 (World Oral Literature Project” .)
Country Germany
Region Western Europe
Champion N/A
ISO scope I
ISO type L
ISO active yes
Integrated code hsb_____________A
Last updated Feb. 1, 2017, 3:36 p.m.


EGIDS (Ethnologue) 4
In yes
Vitality (Kornai, 2013) living

Language packs

Windows10 input method no
Mac input no
Ubuntu input yes
Windows language pack no
Mac language pack no
Ubuntu language pack yes
Firefox language pack yes
Firefox dictionary yes
Office language pack no
Office interface pack no


Has Wikipedia yes
Wikipedia articles 11,057
Wikipedia real articles 2,594
Wikipedia adjusted size 14,912,529
Wikipedia total 28,585
Wikipedia edits 355,671
Wikipedia admins 3
Wikipedia users 14,689
Wikipedia active users 37
Wikipedia images 133
Wikipedia depth 31
Has Wikipedia Incubator no

NLP tools

Hunspell status yes
Hunspell coverage 0.63
TreeTagger no

Open Language Archives Community

Primary texts online 19
Primary texts all 19
Lexical resources online 1
Lexical resources all 2
Language descriptions online 5
Language descriptions all 5
Language in online resource 1
Language in any resource 1
Online resource about the language 12
Any resource about the language 13


Source Crubadan
Number of documents 2,567
Number of words 855,015
Number of characters None
Has FLOSS spell checkers yes
In Watchtower no
Has UDHR translation yes

Indigenous Tweets project

Number of blogs 2
Number of posts 116
Number of words 11,942
Number of users 6
Number of tweets 442

Swadesh lists

Has Swadesh 110 yes
Has Swadesh 207 no

Other databases

Panlex translations 611,966
In WALS no
In Omniglot no
On no
Uriel features 0
In Leipzig Corpora yes
In SIREN project no