Welcome to the GovHack toolkit. This page provides all the information you need to prepare hackfest entries.
These tools can be used to make entries such as mobile apps, web apps, and data visualisations/infographics.

- govhack toolkit!!!!
- key datasets, directory.gov.au gazetteer/AEC electorates/suburbs/postcodes/LGAs

- definitions, open licence reuse permissive hacker hack data journalism data bis UCX etc.
- what do govhack entries look like?
- roles; coder, designer UX/graphics

geo
- http://converter.mygeodata.eu/vector kml exporter for shp
- http://bost.ocks.org/mike/map/

R
- http://blog.yhathq.com/posts/10-R-packages-I-wish-I-knew-about-earlier.html
- excel -> R/rattle/ deducer? http://www.r-bloggers.com/updates-to-the-deducer-family-of-packages/

- tools.disclo.gs - how to use data
- developer tools inc. linked data
- gephi -> neo4j
- neo4j lets you build on, do massive queries of who is friends with who
- don't use network viz if what you actually have is a tree/hierarchy with no interconnections http://www.randelshofer.ch/treeviz/ http://thejit.org/demos/ http://mbostock.github.com/protovis/ex/treemap.html http://blog.pixelingene.com/2011/07/building-a-tree-diagram-in-d3-js/
- http://mbostock.github.com/d3/ex/pack.html http://mbostock.github.com/d3/ex/tree.html
- postgis/quantum gis
- (google earth is alright but many limitations) NASA World Wind?
- ABS statistical areas
- http://irs.gis-lab.info/ wms or http://www.gdal.org/frmt_wms_openstreetmap_tms.xml
- can do nearest/isin/union queries - personal geocoder
- cloudmade geocoder, google maps my maps
- base layers like agri or nearmap
- scraperwiki with new pytemplate libraries
- makes an API for your data to get in sqlite/json/csv
- three.js with cables vis? Overviewer from AP?
- govhack library
- http://hpneo.github.com/gmaps/examples.html
- https://graphics.stanford.edu/wikis/cs448b-12-fall/ data viz theory
- https://mixpanel.com/education/understanding-the-first-five-minutes
- http://drawingbynumbers.org/toolsandresources
- http://wmbriggs.com/blog/?p=6465
- http://ofps.oreilly.com/titles/9781449339739/k_00000002.html list of d3 alternatives
- http://craigkerstiens.com/2012/10/01/understanding-postgres-performance/
- https://github.com/clips/pattern for easy NLP/network analysis/data mining
- https://github.com/theodi/open-data-tech-review/wiki other cleanup/linked data tools
- http://selection.datavisualization.ch/ data viz tools catalog
- manipulating data - grep/find replace/sed/regex
- d3 tools and tutorial http://enjalot.com/ http://news.ycombinator.com/item?id=4608440
- Why d3 is the way it is and how to make charts http://bost.ocks.org/mike/chart/
- how to make an xkcd chart http://bl.ocks.org/3914862
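
One of the notes above warns against network visualisation when the data is really a tree/hierarchy with no interconnections. A quick way to find out which one you have is to test the edge list before choosing a chart type. The sketch below is only an illustration: the `is_tree` helper and both sample edge lists are made up for this page, and it assumes your links are undirected pairs.

```python
from collections import defaultdict, deque


def is_tree(edges):
    """Return True if the undirected edge list forms a single tree."""
    nodes = {node for edge in edges for node in edge}
    if not nodes:
        return False
    # A tree on n nodes has exactly n - 1 edges.
    if len(edges) != len(nodes) - 1:
        return False
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)
    # With n - 1 edges, being connected is enough to rule out cycles.
    start = next(iter(nodes))
    seen = {start}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for other in neighbours[current]:
            if other not in seen:
                seen.add(other)
                queue.append(other)
    return seen == nodes


# Hypothetical org chart: every unit hangs off exactly one parent.
org_chart = [
    ("Department", "Division A"),
    ("Department", "Division B"),
    ("Division A", "Team 1"),
]
# Hypothetical collaboration data: the extra link creates an interconnection.
collaboration = org_chart + [("Team 1", "Division B")]

print(is_tree(org_chart))
print(is_tree(collaboration))
```

If the check passes, a treemap, pack or tree layout from the links above is probably a better fit than a force-directed network graph.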

- data viz
- http://k2company.com/blog/2012/09/06/toolbox-for-learning-machine-learning-and-data-science/
- http://williamparry.blogspot.com.au/2011/04/putting-data-into-google-fusion-tables.html google fusion tutorial
- andrewharvey4.wordpress.com postgis/asgs tutorial
- http://www.slideshare.net/maxdemarzi/etl-into-neo4j

- http://www.twotorials.com/ for R
- http://www.r-bloggers.com/gradient-word-clouds/ http://www.rstudio.com/shiny/ http://blog.ouseful.info/2012/11/28/quick-shiny-demo-exploring-nhs-winter-sit-rep-data/ https://github.com/timelyportfolio/shiny-d3-plot https://github.com/trestletech/shiny-sandbox/tree/master/grn
- http://is-r.tumblr.com/post/38240018815/making-prettier-network-graphs-with-sna-and-igraph
- http://www.r-bloggers.com/video-simpler-tricks-and-tools-help-debugging-git-latex-and-workflow-with-r-by-prof-rob-hyndman/
- http://yihui.name/knitr/ makes reports including google widgets/charts/maps via http://www.r-bloggers.com/googlevis-0-3-2-is-released-better-integration-with-knitr/
- http://chartsnthings.tumblr.com/post/36978271916/r-tutorial-simple-charts http://flowingdata.com/2012/12/17/getting-started-with-charts-in-r/

- http://dydra.com/
- http://selection.datavisualization.ch/ data viz tools list
- http://nodexl.codeplex.com/ network graphs for excel
- http://sunfoundation.tumblr.com/
- analysing - linked data tools
- http://govcampau.wikispaces.com/useful+tools
- http://linkeddata.org/home

# Hosted Developer Tools {#hosted-developer-tools}

### Sourceforge

Subversion, Git, Mercurial, Bazaar, CVS, issue tracker, wiki, release file downloads. Unlimited free use for open source projects.

You can create your own Sourceforge project at [http://sourceforge.net/](http://sourceforge.net/)

### Google Code Project Hosting

Git, Mercurial, and Subversion code. Issue tracker, wiki, release file downloads. Unlimited free use for open source projects.

You can host your Google Code project and get access to developer tools, APIs and documentation at [http://code.google.com/](http://code.google.com/)

# The basics of being a data scientist

* Have a hypothesis. Even if you’re making a tool/API that helps people with their own questions, remember what the objective of that is.
* Find the people and tools you need to prove/show/find. The rest of this page will help with the latter.
* Analyse and present results. Were they what you expected? Do they help explain to others what you have found out? You can present them as an interactive data visualisation, a web/mobile application, or just an infographic/motion graphics video that tells a story.

Please note, there is a combination of Analysis and Visualisation tools in each of the major categories below.

# API Development {#api-development}

So an API isn’t just an XML file ;)

A good web-based data API:

* Is logically organised
* Can filter returned data
* Can return results in different open formats (CSV/JSON etc.)
* Is efficient and responsive by using caching and databases appropriately
* Handles errors gracefully
* Monitors and controls access (to show the benefit realised by the API and to prevent abuse)
* Provides appropriate documentation with examples
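
To make a few of those points concrete, here is a minimal sketch of the filtering, multiple-format and error-handling ideas using only the Python standard library. Everything in it is assumed for illustration: the `query` function, the `RECORDS` sample rows and the `format` parameter name are hypothetical, and a real API would sit behind a web framework and a database rather than an in-memory list.

```python
import csv
import io
import json

# Hypothetical sample records; a real API would read these from a database.
RECORDS = [
    {"suburb": "Braddon", "postcode": "2612", "state": "ACT"},
    {"suburb": "Newtown", "postcode": "2042", "state": "NSW"},
    {"suburb": "Turner", "postcode": "2612", "state": "ACT"},
]
FIELDS = ["suburb", "postcode", "state"]


def error(message):
    """Build a 400 response with a machine-readable error body."""
    return 400, "application/json", json.dumps({"error": message})


def query(params):
    """Return (status, content_type, body) for a dict of query parameters."""
    params = dict(params)
    fmt = params.pop("format", "json")
    # Handle errors gracefully: reject bad input with a clear message.
    if fmt not in ("json", "csv"):
        return error("format must be json or csv")
    unknown = sorted(set(params) - set(FIELDS))
    if unknown:
        return error("unknown filter: " + ", ".join(unknown))
    # Filter returned data: keep rows matching every remaining parameter.
    rows = [r for r in RECORDS if all(r[k] == v for k, v in params.items())]
    # Return results in different open formats.
    if fmt == "csv":
        out = io.StringIO()
        writer = csv.DictWriter(out, fieldnames=FIELDS, lineterminator="\n")
        writer.writeheader()
        writer.writerows(rows)
        return 200, "text/csv", out.getvalue()
    return 200, "application/json", json.dumps(rows)


status, content_type, body = query({"state": "ACT", "format": "csv"})
print(status, content_type)
print(body)
```

Caching, access control and documentation are left out here; the hosted providers and documentation tools mentioned elsewhere on this page cover those.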

Some people, like Sensis ([http://developers.sensis.com.au/](http://developers.sensis.com.au/)), use a provider like [http://mashery.com/](http://mashery.com/), [https://apigee.com](https://apigee.com/), [http://apiaxle.com/](http://apiaxle.com/) or [http://www.3scale.net/](http://www.3scale.net/), which handles making a good API for them.

Atlassian have a great page on what makes a good API: [https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)

API
- howto.gov api tutorial
- http://shancarter.com/data_converter/
- http://training.sunlightfoundation.com/module/data-visualizations-google-docs/
- api documentation
- WSDL or http://swagger.wordnik.com/ or https://github.com/mashery/iodocs
- http://symfony.com/ https://github.com/FriendsOfSymfony/FOSRestBundle http://williamdurand.fr/2012/08/02/rest-apis-with-symfony2-the-right-way/ https://github.com/nelmio/NelmioApiDocBundle better apis https://github.com/liip/LiipHelloBundle
- https://github.com/elc/rapi_doc https://github.com/Pajk/apipie-rails

# Infographics and Data Visualisation {#data-visualisation}

Infographics try to contextualise charts and graphs to tell a story. Data vis builds on this to find new ways to design insight.

Most of the categories to follow have visualisation tools specific to their purpose.

You can find some “essential” data visualisation tools below:

[http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/](http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/)

Also check out [http://thejit.org](http://thejit.org/) and [http://www.senchalabs.org/philogl/](http://www.senchalabs.org/philogl/) (contributed by Matt Adcock)

# Mobile
html5 jquery mobile like directory.gov.au