edits
[tools.git] / index.md
Alex Sadleir 1 Below is a whole bunch of information to help you prepare mashups!
Maxious 2
Alex Sadleir 3 # [Hosted Developer Tools](#hosted-developer-tools)
4
5 ### Sourceforge
Maxious 6
Maxious 7 Subversion, Git, Mercurial, Bazaar, CVS, issue tracker, wiki, release file downloads. Unlimited free use for open source projects.
Alex Sadleir 8
9 You can create your own Sourceforge project at [http://sourceforge.net/](http://sourceforge.net/)
Maxious 10
Alex Sadleir 11 ### Google Code Project Hosting
Maxious 12
Maxious 13 Git, Mercurial, and Subversion code. Issue tracker, wiki, release file downloads. Unlimited free use for open source projects.
Maxious 14
Maxious 15 You can host your Google Code project and get access to developer tools, APIs and documentation at [http://code.google.com/](http://code.google.com/)
Maxious 16
Maxious 17
Maxious 18
Maxious 19 # The basics of being a data scientist
20
Alex Sadleir 21
Maxious 22 * Find the people and tools you need to prove/show/find. This rest of this page will help with the latter.
Maxious 23
Maxious 24 Please note, there are a combination of Analysis and Visualisation tools in each of the major categories below.
25
maxious 26 # [API Development](#api-development)
Maxious 27
Maxious 28
29 So an API isn’t just an XML file ![;)](http://www.govhack.org/wp-includes/images/smilies/icon_wink.gif)
Maxious 30
Alex Sadleir 31 A good web based data API:
Maxious 32
33 * Is logically organised
maxious 34 * Can filter returned data
Maxious 35 * Can return results in different open formats (CSV/JSON etc.)
36 * Is efficient and responsive by using caching and databases appropriately
37 * Handles errors gracefully
38 * Monitors and controls access (to show benefit realised of API and prevent abuse)
39 * Provides appropriate documentation with examples
40
41 Some people like sensis [http://](http://developers.sensis.com.au/)[developers.sensis.com.<wbr>au</wbr>](http://developers.sensis.com.au/)[/](http://developers.sensis.com.au/) use a provider like[http://](http://mashery.com/)[mashery.com](http://mashery.com/)[/](http://mashery.com/) or [https](https://apigee.com/)[://](https://apigee.com/)[apigee.com](https://apigee.com/) or [http://](http://apiaxle.com/)[apiaxle.com](http://apiaxle.com/)[/](http://apiaxle.com/) or [http://www.3scale.net/](http://www.3scale.net/) which handles making a good API for them.
42
43 Atlassian have a great page on what makes a good API [https](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[://](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[developer.atlassian.<wbr>com</wbr>](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[/display/REST/](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[Atlassian](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)[+<wbr>REST+API+Design+Guidelines+<wbr>version+1</wbr></wbr>](https://developer.atlassian.com/display/REST/Atlassian+REST+API+Design+Guidelines+version+1)
Alex Sadleir 44
45
46
Maxious 47
48
49 # [Data Visualisation](#data-visualisation)
50
Alex Sadleir 51 Most of the categories to follow have visualisation tools specific to their purpose.
Maxious 52
Alex Sadleir 53 You can find some data visualisation &#8220;essential&#8221\. tools below:
Maxious 54
55 [http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/](http://www.visualisingdata.com/index.php/2011/07/part-6-the-essential-collection-of-visualisation-resources/)
56
Alex Sadleir 57 Also check out [http://thejit.org](http://thejit.org/) &amp; [http://www.senchalabs.org/<wbr>philogl/</wbr>](http://www.senchalabs.org/philogl/) (contributed by Matt Adcock)
Maxious 58
59
60
Alex Sadleir 61 # [Augmented Reality Tools](#augmented-reality-tools)
Alex Sadleir 62
Maxious 63 ### buildAR
maxious 64
Alex Sadleir 65 buildAR was the world&#8217;s first AR Content Management System with the first early version launched back in 200\. as a bit of a test to see if people were interested in being able to create their own augmented experiences. MOB Labs officially launched the new improved buildAR at the Augmented Reality event in Silicon Valley in May 2011.
Maxious 66
Alex Sadleir 67 [http://buildAR.com](http://buildar.com/) (normally a 1\. day free trial but GovHack attendees on the day will get a fully activated account for a month. If you are a GovHack attendee want full access now to play, just sign up for a trial and then forward your GovHack Eventbrite registration email to govhack [at] buildar.com and they&#8217;ll set you up for the free month fully activated account within a couple of days.
Maxious 68
Maxious 69 ### Other AR tools and references
Maxious 70
Maxious 71 theAWEsomeWEB &#8211\. Future of the Augmented Web [http://theAWEsomeWEB.com](http://theawesomeweb.com/)
72
Alex Sadleir 73 Layar &#8211\. AR Browser - [http://layar.com](http://layar.com/)
Maxious 74
75 Junaio &#8211\. AR Browser - [http://junaio.com](http://junaio.com/)
76
77 Aurasma &#8211\. AR Browser - [http://aurasma.com/](http://aurasma.com/)
78
Maxious 79 Vuforia &#8211\. NFT based AR SDK - [http://qualcomm.com/solutions/<wbr>augmented-reality</wbr>](http://qualcomm.com/solutions/augmented-reality)
80
81 String &#8211\. Marker based AR SDK - [http://poweredbystring.com/](http://poweredbystring.com/)
Alex Sadleir 82
Maxious 83 [pointcloud.io](http://pointcloud.io/) - SLAM based AR SDK - [http://pointcloud.io/](http://pointcloud.io/)
Alex Sadleir 84
Maxious 85 ARToolkit &#8211\. Marker based AR SDK - [http://en.wikipedia.org/wiki/<wbr>ARToolKit</wbr>](http://en.wikipedia.org/wiki/ARToolKit)
Maxious 86
87 FLARToolkit &#8211\. Marker based AR SDK - [http://saqoo.sh/a/en/<wbr>flartoolkit/start-up-guide</wbr>](http://saqoo.sh/a/en/flartoolkit/start-up-guide)
88
89 Kooaba &#8211\. Visual search platform - [http://kooaba.com](http://kooaba.com/)
90
91 Catchoom &#8211\. Visual search platform - [http://catchoom.com/](http://catchoom.com/)
92
93 Google Googles &#8211\. Visual search app - [http://www.google.com/mobile/<wbr>goggles/#text</wbr>](http://www.google.com/mobile/goggles/#text)
94
95 Vuzix 920AR &#8211\. Wearable display - [http://www.vuzix.com/consumer/<wbr>products_wrap920ar.html</wbr>](http://www.vuzix.com/consumer/products_wrap920ar.html)
96
97 Project Glass &#8211\. Googles wearable display project - [http://www.youtube.com/watch?<wbr>v=9c6W4CCU9M4</wbr>](http://www.youtube.com/watch?v=9c6W4CCU9M4)
98
99 Mixed Reality &#8211\. Nokia&#8217;s vision for wearable displays [http://www.youtube.com/watch?<wbr>v=CGwvZWyLiBU</wbr>](http://www.youtube.com/watch?v=CGwvZWyLiBU)
100
101 City Lens &#8211\. Nokia&#8217;s latest AR app - [http://www.youtube.com/watch?<wbr>v=63lNPkl-EPE&amp;feature=youtu.be</wbr>](http://www.youtube.com/watch?v=63lNPkl-EPE&amp;feature=youtu.be)
Alex Sadleir 102
Alex Sadleir 103 AR &#8211\. Wikipedia page - [http://en.wikipedia.org/wiki/<wbr>Augmented_reality</wbr>](http://en.wikipedia.org/wiki/Augmented_reality)
104
Alex Sadleir 105 Steve Mann &#8211\. Wearable Computing Pioneer - [http://en.wikipedia.org/wiki/<wbr>Steve_Mann</wbr>](http://en.wikipedia.org/wiki/Steve_Mann)
Maxious 106
107 Thad Starner &#8211\. Wearable Computing Pioneer [http://en.wikipedia.org/wiki/<wbr>Thad_Starner</wbr>](http://en.wikipedia.org/wiki/Thad_Starner)
Alex Sadleir 108
109 Mark Weiser &#8211\. Father of Pervasive Computing [http://en.wikipedia.org/wiki/<wbr>Mark_Weiser</wbr>](http://en.wikipedia.org/wiki/Mark_Weiser)
110
111 Geographical Data Tools](#geographical-data-tools)
112
113 Check out the[ GeoRabble Boundary Mapper&#8217;s Cookbook](http://georabble.org/2012/05/31/the-boundary-mappers-cookbook/) to see how you can tie all these things together!
114
115 ## Analysis
116
117 ### PostGIS
118
119 [![](http://www.govhack.org/wp-content/uploads/postgisexample-300x130.jpg "postgisexample")](http://www.govhack.org/wp-content/uploads/postgisexample.jpg)PostGIS is an extension for the PostgreSQL database server that allows you to store and manipulate geospatial data on a large scale. For example finding which points are in an area or what points are closest . It is also very useful for storing geospatial data because it can convert between all major formats including ESRI Shape files and Google Earth/Maps KML.
120
121 ### Quantum GIS
Maxious 122
123 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m50afbe88-300x160.jpg "QGIS Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m50afbe88.jpg)QGIS is a graphical desktop application that allows viewing and editing of geospatial data. Some good base maps are available by adding the WMS layer/server [http://irs.gis-lab.info/](http://irs.gis-lab.info/)
Maxious 124
Alex Sadleir 125 ## Visualisation
Maxious 126
Alex Sadleir 127 ### Google Fusion Tables/ChartsBin/[OpenHeatMap](http://www.openheatmap.com/)
maxious 128
Maxious 129 [![](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot-300x168.jpg "fusiontablesscreenshot")](http://www.govhack.org/wp-content/uploads/fusiontablesscreenshot.jpg)Input a numerical values and areas to a spreadsheet and maps are produced
Alex Sadleir 130
Maxious 131 ### [Cartographer.js](http://cartographer.visualmotive.com/)
132
Alex Sadleir 133 [![](http://www.govhack.org/wp-content/uploads/cartographerjs-300x187.png "cartographerjs screenshot")](http://www.govhack.org/wp-content/uploads/cartographerjs.png)Input data as JSON and maps are produced.
Alex Sadleir 134
135
136
137 ### OpenLayers/Google Maps/[Leaflet](http://leaflet.cloudmade.com/)
138
maxious 139 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_512fcbe1-300x173.jpg "OpenLayers Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_512fcbe1.jpg)Display points and different layers. Leaflet is the easiest to use if you just want to show points with popups when clicked on.
140
Maxious 141 ### NASA World Wind/Google Earth
142
143 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_4dda24a4-300x261.jpg "WorldWind screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_4dda24a4.jpg)Google Earth provides 3\. viewing of KML/GML files which represent points and shapes, both through a desktop application and a web plugin. These can be extended with interactive features that allow you to view by timeline or have animated tours between different points. You can also develop and customise your own viewer with the open source [NASA World Wind toolkit.](http://goworldwind.org/demos/)
144
145 ###
Maxious 146
147 Tabular Data Tools](#tabular-data-tools)
Maxious 148
Maxious 149 ## Analysis
Alex Sadleir 150
Maxious 151 ### Excel / Calc
Alex Sadleir 152
maxious 153 Great basic analysis and viewing. Older versions can be limited to 6500\. or so rows. Eg [http://www.tcij.org/training-material/car/data-mining/3474](http://www.tcij.org/training-material/car/data-mining/3474)
154
Maxious 155 ### [DataWrangler](http://vis.stanford.edu/wrangler/)/[Google Refine](http://code.google.com/p/google-refine/)
156
maxious 157 [![](http://www.govhack.org/wp-content/uploads/google_refine_interface.png "google_refine_interface")](http://www.govhack.org/wp-content/uploads/google_refine_interface.png)Clean up duplicate or inconsistent data entries.
Maxious 158
Alex Sadleir 159 ### PostgreSQL/MySQL
160
161 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_209ee972.jpg "SQL screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_209ee972.jpg)Next step up, large datasets can be manipulated/extracted efficiently for example [http://www.postgresql.org/docs/8.4/static/tutorial-window.html](http://www.postgresql.org/docs/8.4/static/tutorial-window.html) , no built-in data visualisation though.
162
Alex Sadleir 163 [Miso Dataset](http://misoproject.com/dataset/)
Alex Sadleir 164
165
166
167 ### R Statistical Language
168
169 [![](http://www.govhack.org/wp-content/uploads/rstudio-windows-300x249.png "rstudio-windows")](http://www.govhack.org/wp-content/uploads/rstudio-windows.png)Advanced data analysis, can find and visualise trends in large datasets. Some reference resources to learn the language [http://cran.r-project.org/doc/manuals/R-intro.html ](http://cran.r-project.org/doc/manuals/R-intro.html)There are also some addons that provide graphical interfaces that make it easier to use such as Rattle [http://rattle.togaware.com/](http://rattle.togaware.com/) , RStudio [http://rstudio.org/](http://rstudio.org/) or Deducer [http://www.deducer.org/pmwiki/pmwiki.php?n=Main.DeducerManual](http://www.deducer.org/pmwiki/pmwiki.php?n=Main.DeducerManual)
170
171 ## Visualisation
172
Alex Sadleir 173 ### [Tableau Desktop](http://www.tableausoftware.com/)
Alex Sadleir 174
175 Create visualisations from various data formats by dragging and dropping. Free trial available on website. [![](http://www.govhack.org/wp-content/uploads/Tableau-Screenshot-300x190.jpg "Tableau Screenshot")](http://www.govhack.org/wp-content/uploads/Tableau-Screenshot.jpg)
176
177 ### [Flotr2](http://www.humblesoftware.com/flotr2/)/[Google Chart Tools](https://developers.google.com/chart/)
178
179 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m11006fce-300x199.jpg "flotr2 screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m11006fce.jpg)Javascript based charts for webpages.
180
181 ### D3.js (Data-Driven Documents)
182
183 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m90d8020-300x277.jpg "d3 screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m90d8020.jpg)Javascript visualisations that are more interactive or intricate than charts. Can be hard to learn but there are examples and easier to use premade visualisations such as [word clouds](http://www.jasondavies.com/wordcloud/), [realtime filtering of barcharts](http://square.github.com/crossfilter/), or [bubble trees for comparing amount sizes](https://github.com/okfn/bubbletree).
Maxious 184
Alex Sadleir 185 ### Processing.js
Alex Sadleir 186
187 Graph (relationships and networks) Data Tools](#graph-relationships-and-networks-data-tools)
Maxious 188
Alex Sadleir 189 ## Analysis
190
191 ### Neo4j
192
193
194
195 can be done using a preexisting tool like Gremlin or by writing a simple Java/Python/Ruby application. Queries can be tested in the built in data browser.
196
197 ### [NetworkX](http://networkx.lanl.gov/index.html)
198
Maxious 199 [![](http://www.govhack.org/wp-content/uploads/chess_masters-300x300.png "NetworkX")](http://www.govhack.org/wp-content/uploads/chess_masters.png)
Alex Sadleir 200
Maxious 201 NetworkX is a social network analysis library for python. Many advanced analyses built in like finding communities within a graph. Also good for converting data into graphs.
Alex Sadleir 202
Alex Sadleir 203 ### Palantir
204
Maxious 205 Palantir make a good computer forensics tool, which they will showcase and give GovHack attendees access to for GovHack data analysis purposes. For more information check out:
Maxious 206
Alex Sadleir 207 [http://palantir.com.au/](http://palantir.com.au/)
208
maxious 209 ## Visualisation
Alex Sadleir 210
maxious 211 ### [Graphviz](http://www.graphviz.org/)
Alex Sadleir 212
Alex Sadleir 213
Alex Sadleir 214
215 ### Gephi
maxious 216
217 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_74d01d05-300x195.jpg "Gephi Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_74d01d05.jpg)Desktop graph editor and renderer. Many good automatic layout algorithms even for very large graphs.
Maxious 218
Maxious 219 ### [sigma.js](http://sigmajs.org/)
Maxious 220
maxious 221 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6006eaf3-300x130.jpg "Sigma.js Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6006eaf3.jpg)Javascript graph viewer, can use GEXF files exported from tools like neo4j, gephi and NetworkX.
Maxious 222
223
maxious 224
225 Below is some additional information including a few previous GovHack projects to get you thinking ![:)](http://www.govhack.org/wp-includes/images/smilies/icon_smile.gif) More examples are available at [http://mashupaustralia.org/](http://mashupaustralia.org/).
226
Maxious 227 Previous Project Examples](#previous-project-examples)
228
Maxious 229 ## PlanningAlerts
Maxious 230
231 [![Planning Alerts Screenshot](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_2f0199ff1-300x221.png "Planning Alerts Screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_2f0199ff1.png)Description: Planning Alerts takes data from local government development applications and sends alerts to users based on what applications are lodged in their area.
Maxious 232
Maxious 233 Programming Language: Ruby
Maxious 234
235 Source Control: [Git](https://github.com/openaustralia/planningalerts-app)
236
237 Issue Tracking: [Atlassian JIRA](http://tickets.openaustraliafoundation.org.au/browse/PA/)
Maxious 238
Maxious 239 ## LobbyLens
Maxious 240
241 [![](http://www.govhack.org/wp-content/uploads/129-Screenshot-LobbyClue_-_Chromium-300x180.png "LobbyLens screenshot")](http://www.govhack.org/wp-content/uploads/129-Screenshot-LobbyClue_-_Chromium.png)
242
243 Description: Displays connections between government contracts, business details, politician responsibilities, lobbyists, clients of lobbyists, political donors and the location of these entities.
244
245 Programing Language: PHP
Alex Sadleir 246
247 Source Control: SVN (Subversion)
248
249 Issue Tracking: A whiteboard
250
251 ## Bus.lambdacomplex.org
Maxious 252
253 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_3789acae-300x253.jpg "Bus.lambda screenshot")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_3789acae.jpg)
254
255 Description: Online Canberra Bus Timetables and Trip Planner.
maxious 256
Maxious 257 Programing Language: PHP/Ruby
maxious 258
259 Source Control: Git
260
Alex Sadleir 261 Issue Tracking: Github
262
Maxious 263 # General Data Hacking and Programming References](#general-data-hacking-and-programming-references)
264
265
266 [![](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6a65720f-300x199.gif "Data Journalism Diagram")](http://www.govhack.org/wp-content/uploads/How-to-participate-in-GovHack_html_m6a65720f.gif)</dt>
Maxious 267 Illustration from Data Journalism Handbook, CC BY-SA 3.0</dd>
268
269
270
271
272 [datajournalismhandbook.org](http://datajournalismhandbook.org/)
Alex Sadleir 273
Maxious 274 You can learn the technical skills from scratch in Visualize This: The FlowingData Guide to Design, Visualization, and Statistics by Nathan Yau or for more advanced
275
Alex Sadleir 276 practical advice check out Data Analysis with Open Source Tools by Philipp K. Janert
Maxious 277
278 For further reading in this space
279
Alex Sadleir 280 [http://flowingdata.com/2012/04/27/data-and-visualization-blogs-worth-following/](http://flowingdata.com/2012/04/27/data-and-visualization-blogs-worth-following/)
281
282 **Statistics**
283
Alex Sadleir 284 [http://greenteapress.com/thinkstats/html/index.html](http://greenteapress.com/thinkstats/html/index.html)
285
Maxious 286 **Programming**
287
Maxious 288 Programming is valuable skill for manipulating and displaying data.
289
290 Basic tutorials for a variety of languages are available for free online or you can learn
291
292 interactively with websites like [http://www.codecademy.com/](http://www.codecademy.com/#!/exercises/0\. for JavaScript or [http://www.learnpython.org/ ](http://www.learnpython.org/)or [http://tryruby.org](http://tryruby.org/)
293
294 [https://developer.mozilla.org/en/JavaScript](https://developer.mozilla.org/en/JavaScript) &#8211\. especially for web applications and visualisations, you&#8217;ll need a basic understanding of JS. Common libraries like prototype or jQuery can help
295
296 **Accessibility/User Experience**
297
298 WCAG guidelines not only make a web app accessible but make it a better experience for all users! Even if not making an app, good to consider these things to do and not do: [http://www.w3.org/TR/WCAG/](http://www.w3.org/TR/WCAG/)
299
300
301
302 #[Developer Tools For Your Computer](#developer-tools-for-your-computer)
Maxious 303
maxious 304 ### Source Control &#8211\. Git / Subversion
Maxious 305
Maxious 306 [![](http://www.govhack.org/wp-content/uploads/Screenshot-at-2012-04-29-172132-300x235.png "Git Screenshot")](http://progit.org/book/)
Maxious 307
308 [http://progit.org/book/](http://progit.org/book/)
309
Alex Sadleir 310 [http://svnbook.red-bean.com/](http://svnbook.red-bean.com/)
Maxious 311
Maxious 312 [http://tortoisesvn.net/](http://tortoisesvn.net/)
313
Maxious 314 [http://code.google.com/p/tortoisegit/](http://code.google.com/p/tortoisegit/)
Maxious 315
316 ### Task Tracking - [Trac](http://trac.edgewall.org/) / [Redmine](http://www.redmine.org/) / [Bugzilla](http://www.bugzilla.org/)
317
Alex Sadleir 318 ## [![](http://www.govhack.org/wp-content/uploads/trac-300x207.png "trac screenshot")](http://www.govhack.org/wp-content/uploads/trac.png)
319
Maxious 320 Issue/task trackers allow you to outline the tasks required for your project and assign them to people to do.