Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List of common words to add that trick the transliterator #59

Open
dreamingfifi opened this issue Nov 1, 2019 · 1 comment
Open

List of common words to add that trick the transliterator #59

dreamingfifi opened this issue Nov 1, 2019 · 1 comment

Comments

@dreamingfifi
Copy link
Collaborator

So, I have been thinking of this for a while, and I think it would be a good partial step between just having an entire dictionary in the transliterator and letting it be fooled by every little spelling weirdness that English does.

Why not have a list of function words added to it? These are words like determiners, pronouns, helping verbs, prepositions, pronouns and so on. We wouldn't have to add all of them either, just ones that wouldn't be rendered correctly because they "trick" the transliterator.

I found a handy list of function words: https://semanticsimilarity.wordpress.com/function-word-lists/

If there is anything I'm good at, it's gathering lists and transliterating them. It'd take me a couple hours tops. Hell, before you get a chance to read this I may have already finished and posted the list.

It should be easy to implement too... you're already doing this with the shorthands of "of, the, of the, and" so this would basically be the same thing.

@dreamingfifi
Copy link
Collaborator Author

Here is a list of tricky words.
list-of-function-words.pdf

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant