Unicode Language Detection
Attempts to infer a language based on Unicode property values and ranges
This is just a very basic demo, language detection is very difficult. It's more of a script detector🤔
This shortcut attempts to use the Unicode ranges and property values of the given text to infer a language.
English does not mean it's English. It just means it uses A-Z Latin alphabet😅
Languages like Spanish use accents so they will sometimes incorrectly be labeled as Vietnamese so in order to fix it add a Spanish regex entry and remove the shared characters between them.
I don't have time to improve this shortcut, I only worked on this to detect Chinese for my Android phone but I felt it might be useful to iphone users.
I made a webview edition of this shortcut try it out here: https://routinehub.co/shortcut/13028/
Latest Release Notes
1.01 - Sept. 24, 2022, 3:37 a.m.
Fixed prompt to say "input text to detect language"