Messing around with STM, part IIIb: model analysis

This is the fourth of a series of posts where I explore the application of structural topic models and its R implementation. The other posts are here, here and here; while this is the github repository with the jupyter notebooks where the contents of the posts are better integrated with code snippets and visualizations. The …

Continue reading Messing around with STM, part IIIb: model analysis

Messing around with STM, part IIIa: model selection

This is the third of a series of posts where I explore the application of structural topic models and its R implementation. The other posts are here and here; while here is the github repository with the jupyter notebooks where the contents of the posts are better integrated with code snippets and visualizations. The notebook with the …

Continue reading Messing around with STM, part IIIa: model selection

Messing around with STM – Part II – Data cleaning

This is the second of a series of posts where I explore the application of structural topic models and its R implementation. The first post is here; while here is the github repository with the jupyter notebook containing basically the contents of the posts and the code chunks, and some of the functions I used.  …

Continue reading Messing around with STM – Part II – Data cleaning

Messing around with STM – Part I, Web Scraping

Introduction As the second semester of my Master’s degree is finally wrapping up (two more to go), I suddenly found myself with some spare time, which I decided to employ to mess around a bit with Structural Topic Model (STM) and its R implementation, aptly named stm. As I mentioned in my previous post, it …

Continue reading Messing around with STM – Part I, Web Scraping

Impressions on Udacity Data Analyst Nanodegree

I have recently completed the first term of the Udacity Nanodegree in Data Analytics (which is structured in two independent “terms” of three months each, even though the first one is optional). It mostly covered Python (in particular numpy and pandas, the Python libraries for data analysis), SQL and practical statistics (basically from the basics …

Continue reading Impressions on Udacity Data Analyst Nanodegree