A Byte of my 2.2-lb Brain

Just sharing stuff…

SONAlysis since 1936 with R and Python

State of the Nation Addresses since 1936
Tomorrow, the President will deliver his fourth State of the Nation Address (SONA). This morning, I decided to do some text analysis (really just word frequencies) on past SONAs (http://www.gov.ph/past-sona-speeches/). I tracked a set of keywords of my own choosing in each SONA delivered from 1936 to 2012. Each keyword was tracked in both its English and Filipino forms, e.g. “poverty” and “kahirapan”, or “economy” and “ekonomiya”. The x-axis shows the year each SONA was given, and the y-axis shows the frequency of the chosen words. The word frequencies are normalized across presidents.
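For readers who want to try something similar, here is a minimal Python sketch of this kind of keyword tracking. It is not the code behind the plot. The `KEYWORDS` mapping, the `tokenize` and `keyword_frequencies` helpers, and the toy speeches are all made up for illustration, and dividing by each speech's total word count is only one assumed way to normalize (it may differ from the normalization used for the figure).

```python
import re
from collections import Counter

# Hypothetical keyword groups: each label is matched in English and Filipino.
# Only these two pairs are mentioned in the post; other pairs would have to
# be filled in by hand.
KEYWORDS = {
    "poverty": ["poverty", "kahirapan"],
    "economy": ["economy", "ekonomiya"],
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-zñ]+", text.lower())


def keyword_frequencies(text, keywords=KEYWORDS):
    """Return each keyword group's count divided by the speech's word count.

    Assumption: normalizing by speech length. Other schemes are possible.
    """
    tokens = tokenize(text)
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return {
        label: sum(counts[variant] for variant in variants) / total
        for label, variants in keywords.items()
    }


# Toy stand-ins for speeches, keyed by year (not real SONA text).
speeches = {
    1936: "The economy grows while poverty remains",
    2012: "Ang ekonomiya at ang kahirapan ng bayan",
}

for year, text in sorted(speeches.items()):
    print(year, keyword_frequencies(text))
```

Plotting the resulting values against the year would give one line per keyword, similar in spirit to the figure below.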

Since I have not really read the individual speeches, I have opted not to give any political interpretation of the trends. I hope to refine the analysis with the help of my more politically inclined friends, who have a better intuition for the various states of our nation.

keywords: development, economy, population, health, education, poverty, infrastructure

SONA

Below, I have plotted the comparison and commonality clouds of PNoy’s first three SONAs (in Filipino), generated with R. The commonality cloud shows the top words shared across his three speeches, while the comparison cloud compares word frequencies across them.
PNoy SONA Filipino
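To make the two kinds of cloud more concrete, here is a rough Python sketch of the numbers behind them. It assumes the clouds work the way I understand the `comparison.cloud` and `commonality.cloud` functions in R's wordcloud package to work: a commonality cloud sizes words that occur in every document, and a comparison cloud assigns each word to the document where its rate most exceeds its average rate. The post does not say which R functions were actually used, and the helper names and toy documents here are hypothetical.

```python
from collections import Counter


def term_rates(tokens):
    """Map each word to its share of the document's tokens."""
    counts = Counter(tokens)
    total = sum(counts.values()) or 1
    return {word: count / total for word, count in counts.items()}


def commonality(docs):
    """Return words present in every document, with their summed counts."""
    counters = [Counter(doc) for doc in docs]
    shared = set(counters[0]).intersection(*counters[1:])
    return {word: sum(c[word] for c in counters) for word in shared}


def comparison(docs):
    """For each word, find the document where its rate most exceeds the mean.

    Returns {word: (document_index, deviation_from_mean_rate)}.
    """
    rates = [term_rates(doc) for doc in docs]
    vocabulary = set().union(*rates)
    result = {}
    for word in vocabulary:
        per_doc = [r.get(word, 0.0) for r in rates]
        mean = sum(per_doc) / len(per_doc)
        best = max(range(len(per_doc)), key=lambda j: per_doc[j] - mean)
        result[word] = (best, per_doc[best] - mean)
    return result


# Toy tokenized "speeches" (not real SONA text).
docs = [
    ["bayan", "tayo", "bayan"],
    ["tayo", "daan"],
    ["tayo", "bayan", "boss"],
]

print(commonality(docs))
print(comparison(docs))
```

In a plot, the commonality values and the comparison deviations would drive the word sizes, and the document index would decide which section of the comparison cloud a word lands in.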

Same thing for the image below. Here, I used Python to restructure and concatenate Pres. Arroyo’s speeches (nine in total) and Pres. Aquino’s (three in total). Note that for this analysis, I used the English (translated) versions of PNoy’s three speeches. As above, I constructed the comparison and commonality clouds of the concatenated speeches.
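The concatenation step could look roughly like the sketch below. This is an illustration rather than the script I ran: the `(president, year, text)` tuple layout, the `concatenate_by_president` helper, and the placeholder strings are all assumptions.

```python
from collections import defaultdict


def concatenate_by_president(speeches):
    """Join each president's speeches, in year order, into one text.

    `speeches` is an iterable of (president, year, text) tuples.
    """
    grouped = defaultdict(list)
    for president, _year, text in sorted(speeches, key=lambda s: (s[0], s[1])):
        grouped[president].append(text.strip())
    return {president: "\n".join(texts) for president, texts in grouped.items()}


# Placeholder records (not real SONA text).
speeches = [
    ("Aquino", 2011, "second speech "),
    ("Arroyo", 2001, "first speech"),
    ("Aquino", 2010, " first speech"),
]

combined = concatenate_by_president(speeches)
for president, text in combined.items():
    print(president, repr(text))
```

Each president then becomes a single document, which is the shape the comparison and commonality clouds need.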

Again, since I have not really read the individual speeches, I have opted not to give any political interpretation of the trends.

cloud-noynoy-arroyo

More word clouds from our last three presidents



ERAP
Erap

GMA
Arroyo

NOY-NOY
Noynoy


 

I have done some more text analyses, but it’s already getting late (1:12 AM). I’ll try to update this post ASAP.

2 comments on “SONAlysis since 1936 with R and Python”

  1. geekborj
    August 3, 2013

    I like the results of the clouds from arroyo!


  2. genkumag
    August 26, 2013

    In other words, not much substance in PNOY’s speeches!


Information

This entry was posted on July 22, 2013 in Philippines.