Unicode Language Detection (WebView edition)

Attempts to infer a language based on Unicode property values and ranges


Description

This is just a very basic demo, language detection is very difficult. It's more of a script detector🤔

This shortcut attempts to use the Unicode ranges and property values of the given text to infer a language.

English does not mean it's English. It just means it uses A-Z Latin alphabet😅

Languages like Spanish use accents so they will sometimes incorrectly be labeled as Vietnamese so in order to fix it add a Spanish regex entry and remove the shared characters between them.

I don't have time to improve this shortcut, I only worked on this to detect Chinese for my Android phone but I felt it might be useful to iphone users.

I made a version without WebView, try it out here: https://routinehub.co/shortcut/13029/


Latest Release Notes

1 - Sept. 23, 2022, 1:28 p.m.

This is the first release on here - The process to get webiew working correctly on here was a bit tricky since the "get contents of webpage" method caused errors but the "get contents of url" worked.