Skip to content

Fix apostrophe replace order#8

Open
HarikalarKutusu wants to merge 1 commit into
dabinat:masterfrom
HarikalarKutusu:master
Open

Fix apostrophe replace order#8
HarikalarKutusu wants to merge 1 commit into
dabinat:masterfrom
HarikalarKutusu:master

Conversation

@HarikalarKutusu

Copy link
Copy Markdown

First of all, thank you for this repo...

When using --strip_apostrophes option on Turkish Wikipedia resource (while working with cv-sentence-extractor), I've got many words with apostrophes (same word with different suffixes). When checking the code I saw that alternative Unicode versions are replaced after the stripping (in clean function).

This PR fixes this...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant