Community Data Science Workshops (Fall 2014)/Day 2 SQL project: Difference between revisions

links, title
imported>Jtmorgan
(swap logo)
imported>Jtmorgan
(links, title)
Line 3:
__NOTOC__
 
== Building a Dataset using the Twitter APIMySQL ==
 
In this project, we will explore a few ways to gather data from Wikimedia and StackExchange projects using [https://en.wikipedia.org/wiki/MySQL MySQL] and the [http://quarry.wmflabs.org/ Quarry] and [http://data.stackexchange.com/ Data Explorer] applications. Once we've done that, we will download the results of our queries in [https://en.wikipedia.org/wiki/Comma-separated_values CSV filesformat] which can be used to ask and answer questions and visualize data in the final session.
 
=== Goals ===
 
* Learn how to use SQLMySQL (a [https://en.wikipedia.org/wiki/SQL Structured Query Language]) to build datasets.
* Get set up to run SQLMySQL queries to gather data from Wikimedia and StackExchange projects.
* Practice running SQLMySQL queries on your own to get data about who is editing particular Wikipedia articles and answering questions on StackOverflow.
* Create a few collections of Wikipedia data that you can do research with in the final section.
 
Anonymous user