Thursday, March 13, 2025

War, what is it good for... NLP!

On the back of the last post about sentiment analysis word clouds for pub reviews on TripAdvisor, I decided on another quick-and-dirty exercise: a word cloud for reviews of one of my favourite films, Saving Private Ryan (directed by Steven Spielberg, starring Tom Hanks, yes that one). IMDb was the source this time:




I like this one because it really captures the essence of the reviews. The colours are set at random so they don't represent anything, but words like 'masterpiece', 'best' and other positives really stand out. And OK, 'movie', 'film' and 'war' are prominent, but they are descriptive and you wouldn't want to filter them out.
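For anyone curious about what sits underneath a cloud like this, here is a minimal sketch of the counting step, using only the Python standard library. It is not the code used for the image above: the sample reviews, the tiny stopword list and the helper names word_frequencies and random_colour are all made up for illustration. It shows the two points made above: descriptive words such as 'movie', 'film' and 'war' are deliberately left out of the stopword list, and each word's colour is picked at random, so it carries no meaning.

```python
import random
import re
from collections import Counter

# Hypothetical sample reviews, standing in for the real IMDb text.
SAMPLE_REVIEWS = [
    "A masterpiece. The best war film ever made.",
    "This movie is the best war movie I have seen.",
    "A masterpiece of a film, and the war scenes are brutal.",
]

# A deliberately small, illustrative stopword list.
# 'movie', 'film' and 'war' are left out of it on purpose.
STOPWORDS = {"a", "the", "is", "i", "have", "of", "and", "are", "this", "ever"}


def word_frequencies(reviews, stopwords=STOPWORDS):
    """Count lower-cased words across all reviews, skipping stopwords."""
    words = re.findall(r"[a-z']+", " ".join(reviews).lower())
    return Counter(w for w in words if w not in stopwords)


def random_colour(rng):
    """Return a random hex colour; it encodes nothing about the word."""
    return "#{:06x}".format(rng.randrange(0x1000000))


freqs = word_frequencies(SAMPLE_REVIEWS)
rng = random.Random(0)  # seeded only so repeated runs give the same colours
colours = {word: random_colour(rng) for word in freqs}

# In a word cloud, the count would drive the font size of each word.
for word, count in freqs.most_common(5):
    print(word, count, colours[word])
```

In a real cloud the counts would be handed to a plotting library to set the font sizes; the sketch stops at the frequencies because that is the part that decides which words stand out.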



Evaluating Embeddings for NLP and Document Clustering

Embeddings are an alternative to traditional methods of vectorisation, first proposed by Bengio et al. in 2003, who developed the first langua...