Best Practices for Reproducible, Collaborative Data Science
Sophisticated analytical organizations are learning they can improve collaboration and accelerate analysis by applying best practices from scientific research and software engineering teams. This talk will describe these best practices and their benefits, specifically reproducibility, collaboration, and reusability. We’ll describe how to address the unique challenges of applying these practices to analytical work—for example, the need to track data and results, not just code — both at the level of general principles as well some specific techniques.
How to connect to the webinar:
- Go to http://berkeleydatascience.adobeconnect.com/myseminar/
- Type in your name and enter the room as a guest.
- When you get into the room, a popup window will ask you to join the meeting audio. To do this, type in your phone number and click "connect".
- You will receive a call directly from the room's conference line. Answer it to connect to the lecture's audio.
Nick Elprin is the CEO and co-founder of Domino Data Lab. Before starting Domino, Nick was a senior technologist and technology manager at a large hedge fund, where he managed a team that designed, developed, and delivered the firm’s next generation research platform. He has a BA and MS in computer science from Harvard.