Leona Zhang compares the FlashText Python library to using regular expressions:
If you have done any text/data analysis, you might already be familiar with Regular Expressions (RegEx). RegEx evolved as a necessary tool for text editing. If you are still using RegEx to deal with text processing, then you may have some problems to deal with. Why? When it comes to large-sized texts, the low efficiency of RegEx can make data analysis unacceptably slow.
In this article, we will discuss how you can use FlashText, a Python library that is 100 times faster than RegEx to perform data analysis.
Learn more on the GitHub repo. I haven’t used this before, but I could see it being handy.
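For a sense of what the swap might look like, here is a minimal sketch that runs the same keyword lookup two ways: once with a compiled regular expression and once with FlashText's `KeywordProcessor`. The keyword map, the sample sentence, and both helper function names are made up for illustration and do not come from the article. FlashText is a third-party package (`pip install flashtext`), so it is imported inside its function and the regex half runs without it.

```python
import re

# Hypothetical keyword map for illustration: surface form -> normalized name.
KEYWORDS = {
    "Big Apple": "New York",
    "NYC": "New York",
    "Bay Area": "Bay Area",
}


def extract_with_regex(text):
    """Find keywords with one compiled alternation, anchored on word boundaries."""
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(k) for k in KEYWORDS) + r")\b"
    )
    # Map each matched surface form to its normalized name.
    return [KEYWORDS[m.group(0)] for m in pattern.finditer(text)]


def extract_with_flashtext(text):
    """Same lookup using FlashText's KeywordProcessor."""
    from flashtext import KeywordProcessor  # third-party, imported lazily

    processor = KeywordProcessor(case_sensitive=True)
    for surface, normalized in KEYWORDS.items():
        processor.add_keyword(surface, normalized)
    return processor.extract_keywords(text)


sample = "I moved from the Big Apple to the Bay Area, but I still miss NYC."
regex_hits = extract_with_regex(sample)
print(regex_hits)
```

With only three keywords the two approaches should be indistinguishable in speed. Any gap would be expected to show up as the keyword list grows into the thousands, so it is worth timing both on your own data before switching.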