Unicode Language Detection

Attempts to infer a language based on Unicode property values and ranges


This is just a very basic demo. Language detection is very difficult, and this shortcut is really more of a script detector 🤔

This shortcut attempts to use the Unicode ranges and property values of the given text to infer a language.
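To make the idea concrete, here is a minimal Python sketch of range-based detection. It is not the shortcut's actual logic: the table name `SCRIPT_PATTERNS`, the function `detect`, the labels, and the choice of ranges are all assumptions for illustration. Each entry is checked in order and the first match wins, so kana is tested before Han because Japanese text usually contains Chinese characters too.

```python
import re

# Hypothetical table of (label, pattern) pairs built from Unicode block ranges.
# Order matters: the first pattern that matches decides the label.
SCRIPT_PATTERNS = [
    ("Japanese", re.compile(r"[\u3040-\u30ff]")),  # Hiragana + Katakana
    ("Chinese", re.compile(r"[\u4e00-\u9fff]")),   # CJK Unified Ideographs
    ("Korean", re.compile(r"[\uac00-\ud7af]")),    # Hangul Syllables
    ("Russian", re.compile(r"[\u0400-\u04ff]")),   # Cyrillic block
    ("Arabic", re.compile(r"[\u0600-\u06ff]")),    # Arabic block
    ("English", re.compile(r"[A-Za-z]")),          # Basic Latin letters only
]


def detect(text):
    """Return the label of the first script pattern found in text."""
    for label, pattern in SCRIPT_PATTERNS:
        if pattern.search(text):
            return label
    return "Unknown"


if __name__ == "__main__":
    for sample in ["你好世界", "こんにちは", "Привет", "hello", "123"]:
        print(sample, "->", detect(sample))
```

A label here really names a script, not a language: any text written with Cyrillic letters would come back as "Russian" in this sketch.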

A result of "English" does not mean the text is English. It just means the text uses the A-Z Latin alphabet 😅

Languages like Spanish use accented letters, so they will sometimes be incorrectly labeled as Vietnamese. To fix this, add a Spanish regex entry and remove the characters the two languages share.
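One way to picture that fix is the Python sketch below. The character sets are my own guesses, not the shortcut's real regex entries, and the names `SPANISH_ONLY`, `VIETNAMESE_ONLY`, `SHARED_ACCENTS`, and `classify_accented` are hypothetical. The acute-accented vowels that both languages use are kept out of the Spanish and Vietnamese patterns, so they can no longer decide the result on their own.

```python
import re
import unicodedata

# Assumed character sets for illustration only.
SPANISH_ONLY = re.compile(r"[ñÑ¿¡üÜ]")
# Letters specific to Vietnamese plus its precomposed tone-marked vowels
# (U+1EA0 to U+1EF9 in the Latin Extended Additional block).
VIETNAMESE_ONLY = re.compile(r"[ăĂđĐơƠưƯ\u1ea0-\u1ef9]")
# Acute-accented vowels used by both languages, so they prove nothing alone.
SHARED_ACCENTS = re.compile(r"[áéíóúÁÉÍÓÚ]")


def classify_accented(text):
    """Label accented Latin text using only characters the languages do not share."""
    # Normalize so base letter + combining mark becomes one precomposed character.
    text = unicodedata.normalize("NFC", text)
    if VIETNAMESE_ONLY.search(text):
        return "Vietnamese"
    if SPANISH_ONLY.search(text):
        return "Spanish"
    if SHARED_ACCENTS.search(text):
        return "Ambiguous"
    return "No accents"


if __name__ == "__main__":
    for sample in ["¿Dónde está el baño?", "Tiếng Việt", "canción", "hello"]:
        print(sample, "->", classify_accented(sample))
```

Text that only contains shared characters, such as "canción", stays ambiguous in this sketch, which is about as far as a character-based check can go.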

I don't have time to improve this shortcut. I only worked on it to detect Chinese for my Android phone, but I felt it might be useful to iPhone users.

I made a webview edition of this shortcut. Try it out here: https://routinehub.co/shortcut/13028/

Latest Release Notes

1.01 - Sept. 24, 2022, 3:37 a.m.

Fixed prompt to say "input text to detect language"

Version history