Conductor

In-progress tutorial project to collect data from NYC Subway real-time data feeds and do modeling, visualization,and predictive analytics with Python

View the Project on GitHub johngergely/conductor

About Conductor

Conductor collects, analyzes, and visualizes data for the New York City Subway system from real-time feeds maintained by the MTA.

It is an open source software and data science project that aims to offer some practical as well as pedagogical value by applying standard modeling, analysis, and visualization techniques to an interesting real-time data feed and documenting the project well enough to serve as a tutorial for others.

The tools and analysis are written in Python using popular packages including numpy, scikit-learn, pandas, and bokeh. One goal of the project is to evaluate the performance of the all-Python stack for data science by applying these tools to a real-world data set of considerable public interest.

Links and Documentation

For further documentation and data analysis from the project, visit the wiki.

Jump to the Analysis and Visualization section.

Authors and Contributors

@johngergely - contact with questions or to contribute.