GitHub Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis.
GitHub provides 18 event types, ranging from new commits and fork events to opening new tickets, commenting, and adding members to a project. The activity is aggregated into hourly archives, which you can access with any HTTP client.
Each archive contains a stream of JSON-encoded GitHub events, which you can process in any language.
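As an illustration of processing an hourly archive, here is a minimal Python sketch using only the standard library. It rests on assumptions not stated above: that an archive is a gzip-compressed file with one JSON event per line, that each event carries a "type" field, and that archives follow the URL pattern shown in the comment. The helper names (iter_events, count_event_types, fetch_archive) are made up for this example.

```python
import gzip
import io
import json
import urllib.request
from collections import Counter


def iter_events(gz_bytes):
    """Yield one dict per event.

    Assumes the payload is gzip-compressed, newline-delimited JSON.
    """
    with gzip.open(io.BytesIO(gz_bytes), mode="rt", encoding="utf-8") as stream:
        for line in stream:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


def count_event_types(gz_bytes):
    """Count events by their "type" field (assumed to be present on each event)."""
    return Counter(event.get("type", "unknown") for event in iter_events(gz_bytes))


def fetch_archive(url):
    """Download one archive over HTTP and return its raw bytes."""
    with urllib.request.urlopen(url) as response:
        return response.read()


# Offline demonstration on a tiny hand-made payload, so no network is needed.
sample = gzip.compress(b'{"type": "PushEvent"}\n{"type": "ForkEvent"}\n')
print(count_event_types(sample))

# To try a real archive, something like the following should work, assuming
# the URL pattern https://data.gharchive.org/YYYY-MM-DD-H.json.gz:
#   data = fetch_archive("https://data.gharchive.org/2015-01-01-15.json.gz")
#   print(count_event_types(data).most_common(5))
```

Reading line by line keeps memory use modest, since only one event is decoded at a time even though the whole compressed file is held in memory.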
The GitHub Archive dataset is also available via Google BigQuery.
Online resources: