Home | Posts | Projects | About



Applying natural language processing and scraping financial news sources to gauge public sentiment of the stock market.

The modern day investor has tools such as MSN Money, Yahoo Finance and Google Finance at their disposal. Problem is these tools aggregate news about companies, but don’t analyze the sentiment of the news sources. Given the wealth of data and the collective intelligence of the internet, DaBuzz aims to use natural language processing and machine learning techniques to gauge the overall public perception of companies in the stock market.

Investors could then get an idea of present or future trends in the market by uploading their portfolios and viewing rankings by browsing DaBuzz. In the future, DaBuzz could be used as part of a more advanced computer program to make semi or fully autonomous trading decisions.

DaBuzz was a Rensselaer Center for Open Source (RCOS) project developed by Lucas Doyle, Christian Johnson, Michael Horowitz and Dan Kimball. We made this as a group of friends in their last semester at RPI who had the idea kicking around for awhile.


Check out the source code of DaBuzz on github

Lucas DoyleWritten by Lucas Doyle