Data Science for Doctors Abbreviated 1-Day Course

- 3 mins

It’s been a while since I’ve blogged. Life has become a little bit more hectic as I’ve returned to clinical training from my 3-year period Out-of-Programme doing Research… As some people might know, through my involvement with the Data Science Breakfast Club, I’ve been teaching the basics of coding in R to novice researchers, concentrating on an audience of clinicians with expertise in various specialties (primarily anaesthetists) but with little or no experience in coding or data analysis.

To date, we have run the course about 6 times, and usually the course has been conducted over 2 days. The course has been immensely popular, the last couple of times we ran it, it sold out within a week of advertising (20 attendees usually).

The course has also since been ported across to the UCL Clinician Coders scheme, and there’s also talk about making it a formal training programme for other organisations looking to increase the research capacity in the UK for healthcare data.

As part of the evolution of these courses, we delivered a one-day version at Guy’s Hospital to a group of 15 anaesthetists, who were really interested in picking up some data analysis basics, but weren’t able to find the time to commit to 2 days away from clinical practice. Many thanks to Arun Sahni and James Bedford for helping me to run this version of the course — without the high ratio of faculty to students, this course would never be able to run!

I think it is important to open up participation to a wider audience, and some people might be interested enough to get a taste of coding but can only find time in their busy schedules for one full day. So I ported the presentations into a seried of 6 lectures. Heavily borrowing from the material that my friends and I in the Data Science Breakfast Club created over the last 3 or 4 years. Special thanks to Steve Harris, Ed Palmer, Ahmed Al-Hindawi, Finn Catling, and Paolo Perella, who have all contributed to the development of the 2-day course material over this period.

In the spirit of open collaboration, I feel that this abbreviated 1-day course could easily be taught in other hospitals and therefore I want to share the content with anybody who might be interested.

The course agenda is below, and links to the slides are included, each lesson is 1 hour long, and there’s an hour-long break for lunch built in:

  1. Basics of R (09:00-10:00)
  2. Spreadsheet tips (10:00-11:00)
  3. Getting data into R (11:00-12:00)
  4. Lunch break! (12:00-13:00)
  5. Basic data wrangling (13:00-14:00)
  6. Real simple stats (14:00-15:00)
  7. Data visualisation with ggplot2 (15:00-16:00)

When and if I get the time, I’ll put this all on a Github repo with the data files and .Rmd files used to make the slides.

sessionInfo()
## R version 3.5.2 (2018-12-20)
## Platform: x86_64-w64-mingw32/x64 (64-bit)
## Running under: Windows 10 x64 (build 17763)
## 
## Matrix products: default
## 
## locale:
## [1] LC_COLLATE=English_United Kingdom.1252 
## [2] LC_CTYPE=English_United Kingdom.1252   
## [3] LC_MONETARY=English_United Kingdom.1252
## [4] LC_NUMERIC=C                           
## [5] LC_TIME=English_United Kingdom.1252    
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] knitr_1.20
## 
## loaded via a namespace (and not attached):
## [1] compiler_3.5.2  magrittr_1.5    tools_3.5.2     stringi_1.1.7  
## [5] stringr_1.3.1   packrat_0.4.9-3 evaluate_0.11
Danny Wong

Danny Wong

Anaesthetist & Health Services Researcher

comments powered by Disqus
rss facebook twitter github youtube mail spotify instagram linkedin google pinterest medium