This course is an introduction to Data Science for students of Molecular Biology. We use the R language to learn the basic tools to handle structured data and extract valuable scientific information from it.
Slides used in classes
These slides contain the subjects that were evaluated on the Midterm Exam. New slides will be available at the last minute. Please take good notes on classes to improve your learning. We recommend taking notes with pen and paper using the Cornell Method.
Structured Documents. (Sep 17, 2019).
Introduction to Rstudio and to Markdown.[Slides].Why “Computing in Molecular Biology”?. (Sep 17, 2019).
What is a computer? Why do we care?[Slides].Practice with Structured Documents. (Sep 24, 2019).
Introduction to Rstudio and to Markdown.[Slides].Using R and RStudio. (Oct 1, 2019).
Basic usage of RStudio. Introduction to R. Basic Data Types: Numeric, Character, Logic and Factor.[Slides].Making and Indexing Vectors. (Oct 8, 2019).
Handling structured data.[Slides].Combining Markdown and R. (Oct 8, 2019).
How to answers Quizzes, Exams and Make-ups.[class06.Rmd], [slides06.Rmd], [Slides].Lists: Mixing different types of data. (Oct 15, 2019).
Also, a comment about digital signatures, and a Quiz you have to do.[Slides].Welcome to the Matrix. (Oct 15, 2019).
Structures in two dimensions. Matrices and Data Frames.[Slides].Telling stories. (Oct 22, 2019).
Introduction to Descriptive Statistics.[Slides].Using Data Frames. (Oct 22, 2019).
Telling stories[Slides].Data Visualization. (Nov 19, 2019).
Telling stories with pictures. “One image worths a thousand words”. Plots, barplots, histograms. Making “nice” drawings. Adding points and lines.[survey1-tidy.txt], [midterm.txt], [Slides].More Data Visualization. (Nov 26, 2019).
Plotting two vectors, numeric or factor. Formulas.[Slides].Handling Lists and Data Frames. (Nov 26, 2019).
[Slides].Hooke’s Law. (Dec 3, 2019).
A simple application of linear models.[rubber.txt], [Slides].Subsets and formulas. (Dec 3, 2019).
Easier ways to plot. Also, introduction to Linear Models.[survey2019.txt], [Slides].Logarithmic scales. (Dec 10, 2019).
Not all lines are straight lines. Exponential growth in Science and Technology. What will be your future?[kleiber.txt], [Transistor_count.txt], [dna_price.txt], [Slides].Logarithmic models. (Dec 10, 2019).
Not all lines are straight lines[Slides].Practice with Linear Models. (Dec 17, 2019).
Get ready.[sra_bases.txt], [Slides].Polynomial Models. (Dec 17, 2019).
Not all lines are straight lines.[free-fall.txt], [Slides].
Attendance
By regulation from the Rectory, students need to attend at least 70% of the classes. The attendance book is updated every week and can be seen in Google Sheets. Please check it and verify if it is right.