This event has ended. Visit the official site or create your own event on Sched.
Click here to return to main conference site. For a one page, printable overview of the schedule, see this.
Back To Schedule
Tuesday, June 28 • 5:21pm - 5:39pm
The challenge of combining 176 x #otherpeoplesdata to create the Biomass And Allometry Database (BAAD)

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Despite the hype around "big data", a more immediate problem facing many scientific analyses is that large-scale databases must be assembled from a collection of small independent and heterogeneous fragments -- the outputs of many and isolated scientific studies conducted around the globe. Together with 92 other co-authors, we recently published the Biomass And Allometry Database (BAAD) as a data paper in the journal Ecology, combining data from 176 different scientific studies into a single unified database. BAAD is unique in that the workflow -- from raw fragments to homogenised database -- is entirely open and reproducible. In this talk I introduce BAAD and illustrate solutions (using R) for some of the challenges of working with and distributing lots and lots of #otherpeople's data.

avatar for Mine  Cetinkaya-Rundel

Mine Cetinkaya-Rundel

Duke University

avatar for Daniel Stein Falster

Daniel Stein Falster

Biological Sciences, Macquarie University, Australia

Tuesday June 28, 2016 5:21pm - 5:39pm PDT
Econ 140