So Prof. Mark Reidl of Georgia Tech is the best kind of geek, and used some cool scripting to extract all the things on Wikipedia with plot summaries: movies, books, tv episodes, video games, etc. That’s a lot of plot summaries: 112,936, to be exact. With a dataset