As with any neural network, we need to convert our data into a numeric format; in Keras and TensorFlow we work with tensors. The IMDB example data from the keras package has already been preprocessed into lists of integers, where each integer stands for a word and words are indexed by descending frequency, so the most frequent word gets the lowest index.

So, how do we get from raw text to such a list of integers? Luckily, Keras offers a few convenience functions that make our lives much easier.
This is a very nice tutorial if you’re new to the process.
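
To make the idea of a frequency-ranked integer encoding concrete, here is a minimal sketch in plain Python. It is not the Keras implementation: the helper names (`tokenize`, `build_word_index`, `encode_texts`), the simple regex tokenizer, the alphabetical tie-breaking, and the three toy reviews are all assumptions made for illustration. The Keras convenience functions do the same kind of mapping and take care of further details.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (hypothetical helper)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_word_index(texts):
    """Map each word to an integer rank, where 1 is the most frequent word."""
    counts = Counter(word for text in texts for word in tokenize(text))
    # Sort by descending frequency; break ties alphabetically so the result is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return {word: rank for rank, (word, _) in enumerate(ranked, start=1)}


def encode_texts(texts, word_index):
    """Replace every known word with its integer index; unknown words are dropped."""
    return [
        [word_index[word] for word in tokenize(text) if word in word_index]
        for text in texts
    ]


# Toy data, made up for this example.
reviews = [
    "The movie was great",
    "The movie was terrible",
    "Great cast, great plot",
]

word_index = build_word_index(reviews)
sequences = encode_texts(reviews, word_index)

print(word_index)
print(sequences)
```

Because "great" occurs most often in the toy reviews, it receives index 1, and each review becomes a list of such indices. These lists still vary in length, so they would typically need to be padded or truncated to a common length before being fed to a network as a tensor.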