Community Data Science Workshops (Fall 2014)/Day 3 projects/Matplotlib Session
== Visualizing data with Matplotlib and Wiki-bios ==

In this session, we will explore how to produce clear, informative charts, graphs, and plots with [http://matplotlib.org/ Matplotlib], one of the most popular toolkits for scientific data visualization in Python.

We'll be focusing on a dataset drawn from Wikipedia and [http://dbpedia.org DBpedia], containing the names, birth dates, genders, article creation dates, and edit counts of over 180,000 Wikipedia biography articles.

After installing matplotlib and downloading and unpacking the Wikibios bundle, move into that directory with '''cd'''. You can test your installation by running '''python histograms.py'''. If matplotlib is installed correctly, a chart file named '''histograms.pdf''' will appear in the current directory.

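If you want to check matplotlib on its own, before the bundle is unpacked, a minimal sketch along the following lines should work. It is not the bundle's '''histograms.py''': the function name, the output file name '''install_check.pdf''', and the randomly generated "birth years" are all placeholders chosen for illustration.

```python
import random

import matplotlib

matplotlib.use("Agg")  # non-interactive backend: render to files, no window needed
import matplotlib.pyplot as plt


def save_test_histogram(path="install_check.pdf", n=1000, seed=0):
    """Draw a histogram of made-up values and save it to `path`."""
    rng = random.Random(seed)
    # Placeholder data only: random "birth years", not taken from the Wikibios dataset.
    years = [rng.randint(1800, 2000) for _ in range(n)]

    fig, ax = plt.subplots()
    ax.hist(years, bins=20)
    ax.set_xlabel("Birth year (synthetic)")
    ax.set_ylabel("Number of articles")
    ax.set_title("Matplotlib installation check")
    fig.savefig(path)  # output format is inferred from the file extension
    plt.close(fig)
    return path


if __name__ == "__main__":
    print("wrote", save_test_histogram())
```

Saved under a name of your choosing and run with '''python''', this should leave a PDF in the current directory if matplotlib is working.
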
Wikibios bundle for all platforms: '''[https://github.com/makoshark/cdsw-wikibios/archive/master.zip latest]''', [http://mako.cc/teaching/2014/cdsw/wikibios.zip old mirror]

Original link (does not contain the Matplotlib scripts): [http://mako.cc/teaching/2014/cdsw-autumn/wikibios.zip Mediawiki client test programs]

=== References ===
* [http://matplotlib.org/api/pyplot_summary.html matplotlib API reference]
* [http://matplotlib.org/examples/index.html matplotlib Examples] (many, with source)
* Other plotting resources
** [http://blog.olgabotvinnik.com/prettyplotlib/ prettyplotlib]: hip-aesthetic matplotlib plots
** [http://web.stanford.edu/~mwaskom/software/seaborn/ Seaborn]: fancy matplotlib-based visualizations
** [http://ggplot.yhathq.com/ ggplot]: port of the R language's ggplot2 library to Python
** [http://d3js.org/ D3.js]: interactive data visualization for the browser (JavaScript)

=== Example topics to cover in Lecture ===