Anonymous user
Community Data Science Workshops (Spring 2014)/Saturday May 31st lecture: Difference between revisions
Community Data Science Workshops (Spring 2014)/Saturday May 31st lecture (view source)
Revision as of 22:07, 15 March 2015
, 9 years agono edit summary
imported>Mako |
imported>Mako No edit summary |
||
(6 intermediate revisions by the same user not shown) | |||
Line 1:
{{CDSW Moved}}
== Material for the lecture ==
Line 9 ⟶ 11:
* Lecture
** New tools!
** Our philosophy around data visualization
** We're going to walk through some analysis of edits to Harry Potter in Wikipedia, start to finish
** We'll focus on manipulating data in Python
Line 27 ⟶ 30:
** break
** string.join()
* My philosophy about data analysis: ''use the tools you have''
* Walk-through of <code>get_hpwp_dataset.py</code>
* Look at dataset with <code>more</code> and/or in spreadsheet
Line 38 ⟶ 42:
** Answer question: ''What proportion of edits to Wikipedia Harry Potter articles are made by "anonymous" contributors?''
*** Count the number of anonymous edits and calculate proportion
* Moire advanced counting▼
We mostly worked on these questions in the afternoon:
** Answer question: ''What are the most edited articles on Harry Potter?''
*** Count the number of edits per articles
|