--- a/index.md +++ b/index.md @@ -1,21 +1,42 @@ Welcome to the GovHack toolkit. This page provides all the information you need to prepare hackfest entries. These tools can be used to make entries like: mobile apps, web apps, data visualisations/infographics - -# General Data Hacking and Programming References {#general-data-hacking-and-programming-references} +# How to register and submit your entry +## registering your team +how to use website "Hacker Space" to register and find teams etc. + +## preparing your submission + + record a 3 minute speech and mix images/text to accompany +http://www.screenr.com/ and other screencasting tools allow you to demo apps. + youtube video editor http://www.youtube.com/editor +or local software like http://www.videolan.org/vlmc/ or http://www.lwks.com/ + +you also need to submit your "source material". For an application this may be source code, for another work it might be your notes or prototypes. +The key thing here is that your source material demonstrates to the judges that some of the end result was your own work and that it is possible for another person to replicate that work. + + +# General References {#general-data-hacking-and-programming-references} + +## Who can be a hack day participant + - roles; coder, designer UX/graphics + +## Definitions + - definitions, open licence reuse permissive hacker hack data journalism data vis UX etc. + ## The basics of being a data scientist -* Have a hypothesis � even if you’re making a tool/api that helps people with their questions too, remember what the objective of that is. +* Have a hypothesis - even if you're making a tool/api that helps people with their questions too, remember what the objective of that is. * Find the people and tools you need to prove/show/find. This rest of this page will help with the latter. -* Analyse and present results � were they what you expected? Do they help explain to others what you have found out? Can present as a interactive data visualisation or a web/mobile application or just a infographic/motion graphics video that tells a story. -Please note, there are a combination of Analysis and Visualisation tools in each of the major categories below. +* Analyse and present results - were they what you expected? Do they help explain to others what you have found out? +Can present as a interactive data visualisation or a web/mobile application or just a infographic/motion graphics video that tells a story. [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6a65720f-300x199.gif "Data Journalism Diagram")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6a65720f.gif)</dt> Illustration from Data Journalism Handbook, CC BY-SA 3.0</dd> -The best high level reference is the �Understanding Data� and �Delivering Data� chapters of the Data Journalism Handbook which is available online for free at +The best high level reference is the 'Understanding Data' and 'Delivering Data' chapters of the Data Journalism Handbook which is available online for free at [datajournalismhandbook.org](http://datajournalismhandbook.org/) @@ -35,26 +56,17 @@ **Programming** Programming is valuable skill for manipulating and displaying data. - Basic tutorials for a variety of languages are available for free online or you can learn -interactively with websites like [http://www.codecademy.com/](http://www.codecademy.com/#!/exercises/0\. for JavaScript or [http://www.learnpython.org/ ](http://www.learnpython.org/)or [http://tryruby.org](http://tryruby.org/) - -[https://developer.mozilla.org/en/JavaScript](https://developer.mozilla.org/en/JavaScript) –\. especially for web applications and visualisations, you’ll need a basic understanding of JS. Common libraries like prototype or jQuery can help +interactively with websites like [http://www.codecademy.com/](http://www.codecademy.com/#!/exercises/0). for JavaScript or [http://www.learnpython.org/ ](http://www.learnpython.org/)or [http://tryruby.org](http://tryruby.org/) + +[https://developer.mozilla.org/en/JavaScript](https://developer.mozilla.org/en/JavaScript) - especially for web applications and visualisations, you'll need a basic understanding of JS. Common libraries like prototype or jQuery can help **Accessibility/User Experience** WCAG guidelines not only make a web app accessible but make it a better experience for all users! Even if not making an app, good to consider these things to do and not do: [http://www.w3.org/TR/WCAG/](http://www.w3.org/TR/WCAG/) -## Who can be a hack day participant - - roles; coder, designer UX/graphics - -## Definitions - - definitions, open licence reuse permissive hacker hack data journalism data bis UCX etc. - - -## key datasets - - key datasets, directory.gov.au gazetter/AEC electorates/suburbs/postcodes/LGAs + ## examples @@ -102,11 +114,15 @@ server admin / technical tools many projects will require some kind of internet presence, webpage etc. - css framework like bootstrap or zurb foundation - video tools, youtube video editor/slideshow, FOSS video editing tools +- css gauges http://www.larentis.eu/donuts/ +- bootstrap themes, web fonts, css sprites, icon fonts + - http://designmodo.com/flat-free/ http://designmodo.github.com/Flat-UI/ + - http://ubuntu-tutorials.com/2008/11/11/relaying-postfix-smtp-via-smtpgmailcom/ - amon -### Source Control –\. Git / Subversion +### Source Control + Git / Subversion [![](http://www.govhack.org/wp-content/uploads/Screenshot-at-2012-04-29-172132-300x235.png "Git Screenshot")](http://progit.org/book/) @@ -148,7 +164,7 @@ # API Development {#api-development} -So an API isn’t just an XML file ![;)](http://www.govhack.org/wp-includes/images/smilies/icon_wink.gif) +So an API isn't just an XML file ![;)](http://www.govhack.org/wp-includes/images/smilies/icon_wink.gif) A good web based data API: @@ -162,7 +178,7 @@ Some people like sensis [http://](http://developers.sensis.com.au/)[developers.sensis.com.<wbr>au</wbr>](http://developers.sensis.com.au/)[/](http://developers.sensis.com.au/) use a provider like[http://](http://mashery.com/)[mashery.com](http://mashery.com/)[/](http://mashery.com/) or [https](https://apigee.com/)[://](https://apigee.com/)[apigee.com](https://apigee.com/) or [http://](http://apiaxle.com/)[apiaxle.com](http://apiaxle.com/)[/](http://apiaxle.com/) or [http://www.3scale.net/](http://www.3scale.net/) which handles making a good API for them. -Atlassian have a great page on what makes a good API [https](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[://](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[developer.atlassian.<wbr>com</wbr>](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[/display/REST/](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[Atlassian](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[+<wbr>REST+API+Design+Guidelines+<wbr>version+1</wbr></wbr>](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1) +Atlassian have a great page on what makes a good API https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1) API - howto.gov api tutorial @@ -182,7 +198,7 @@ Most of the categories to follow have visualisation tools specific to their purpose. -You can find some data visualisation “essential”\. tools below: +You can find some data visualisation tools below: [http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/](http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/) @@ -200,6 +216,7 @@ # Mobile +bom water, nz gov budget html5 jquery mobile like directory.gov.au - android datviz - http://code.google.com/p/afreechart/ http://code.google.com/p/snowdon/ http://code.google.com/p/chartdroid/ http://androidplot.com/ http://code.google.com/p/achartengine/ @@ -208,7 +225,7 @@ # Geographical Data Tools {#geographical-data-tools} -Check out the[ GeoRabble Boundary Mapper’s Cookbook](http://georabble.org/2012/05/31/the-boundary-mappers-cookbook/) to see how you can tie all these things together! +Check out the[ GeoRabble Boundary Mapper's Cookbook](http://georabble.org/2012/05/31/the-boundary-mappers-cookbook/) to see how you can tie all these things together! ## Key datasets - base layers like agri http://agri.openstreetmap.org/, http://irs.gis-lab.info/ wms or http://www.gdal.org/frmt_wms_openstreetmap_tms.xml @@ -245,7 +262,7 @@ ### Google Fusion Tables/ChartsBin/[OpenHeatMap](http://www.openheatmap.com/) -[![](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot-300x168.jpg "fusiontablesscreenshot")](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot.jpg)Input a numerical values and areas to a spreadsheet and maps are produced +[![](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot-300x168.jpg "fusiontablesscreenshot")](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot.jpg)Input numerical values and areas to a spreadsheet and maps are produced where the areas are colored on a scale of the values ### [Cartographer.js](http://cartographer.visualmotive.com/) @@ -284,14 +301,13 @@ Great basic analysis and viewing. Older versions can be limited to 6500\. or so rows. Eg [http://www.tcij.org/training-material/car/data-mining/3474](http://www.tcij.org/training-material/car/data-mining/3474) - ### PostgreSQL/MySQL [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_209ee972.jpg "SQL screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_209ee972.jpg)Next step up, large datasets can be manipulated/extracted efficiently for example [http://www.postgresql.org/docs/8.4/static/tutorial-window.html](http://www.postgresql.org/docs/8.4/static/tutorial-window.html) , no built-in data visualisation though. ### [Miso Dataset](http://misoproject.com/dataset/) -[![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m53b7ee38-293x300.png "miso screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m53b7ee38.png)Javascript data transformation library � especially good if you want to use the output for javascript interactive visualisations because the transformations can be done on-the-fly by users. +[![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m53b7ee38-293x300.png "miso screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m53b7ee38.png)Javascript data transformation library - especially good if you want to use the output for javascript interactive visualisations because the transformations can be done on-the-fly by users. ### R Statistical Language @@ -330,16 +346,23 @@ # Unstructured (text documents, webpages, metadata, tweets etc) Data Tools -## wranglying +## wrangling Scraperwiki pytemplate scrapy - -Overviewer/ Jigsaw -http://www.cc.gatech.edu/gvu/ii/jigsaw/ +regex + +#analysing - opennlp/nltk / https://github.com/clips/pattern - lucene/solr - http://www.r-bloggers.com/simple-text-mining-with-r/ - http://blog.josephwilk.net/ruby/latent-semantic-analysis-in-ruby.html similar terms usually found together +#visualising + +Overviewer/ Jigsaw +http://www.cc.gatech.edu/gvu/ii/jigsaw/ + +http://www.jasondavies.com/wordtree/ + # Graph (relationships and networks) Data Tools {#graph-relationships-and-networks-data-tools} - http://www.slideshare.net/OReillyStrata/visualizing-networks-beyond-the-hairball @@ -356,9 +379,10 @@ ### Neo4j / OrientDB -[![](http://www.govhack.org/wp-content/uploads/webadmin-data-300x127.png "Neo4\. web admin screenshot")](http://www.govhack.org/wp-content/uploads/webadmin-data.png)Help understand relationships � how is X connected to Y and via what other entities they both are connected to. Imports and exports +[![](http://www.govhack.org/wp-content/uploads/webadmin-data-300x127.png "Neo4\. web admin screenshot")](http://www.govhack.org/wp-content/uploads/webadmin-data.png)Help understand relationships - how is X connected to Y and via what other entities they both are connected to. Imports and exports - http://www.slideshare.net/maxdemarzi/etl-into-neo4j + http://blog.neo4j.org/2013/03/importing-data-into-neo4j-spreadsheet.html http://www.orientdb.org/ @@ -375,7 +399,8 @@ ## Visualisation ### Tree/Hierarchy visualisation - - don't use network viz if what you actually have is a tree/hierarchy with no interconnections http://www.randelshofer.ch/treeviz/ http://thejit.org/demos/ http://mbostock.github.com/protovis/ex/treemap.html http://blog.pixelingene.com/2011/07/building-a-tree-diagram-in-d3-js/d3 for Trees and Hierarchies +Sometimes what you actually have is a tree/hierarchy with no interconnections. + http://www.randelshofer.ch/treeviz/ http://thejit.org/demos/ http://mbostock.github.com/protovis/ex/treemap.html http://blog.pixelingene.com/2011/07/building-a-tree-diagram-in-d3-js/d3 for Trees and Hierarchies http://mbostock.github.com/d3/ex/pack.html http://mbostock.github.com/d3/ex/tree.html ### NodeXL for Microsoft Excel @@ -383,7 +408,7 @@ ### [Graphviz](http://www.graphviz.org/) -[![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_7579906d-300x184.png "Graphviz Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_7579906d.png)Classic directed graph visualisation tool, can even [generate images online without installing](http://ashitani.jp/gv/) or use in webpages with [javascript port of software](http://code.google.com/p/canviz/). File format [�dot� very easy to learn](http://en.wikipedia.org/wiki/DOT_language) +[![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_7579906d-300x184.png "Graphviz Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_7579906d.png)Classic directed graph visualisation tool, can even [generate images online without installing](http://ashitani.jp/gv/) or use in webpages with [javascript port of software](http://code.google.com/p/canviz/). File format ["dot" very easy to learn](http://en.wikipedia.org/wiki/DOT_language) ### Gephi