Bayesian Analyst
Bayesian Analyst
About me
Data Nuggets
Projects
Contact
CV
Light
Dark
Automatic
Posts
Base R vs dplyr vs data.table
Introduction One of the many reasons why people do not like R is because you find more than one way of performing an operation. But i woul like to say this could be an advantage as every possible way can have more advantage for a given task .
Last updated on Sep 17, 2023
13 min read
dplyr
,
wrangling
,
munging
,
querying
Tidymodels in Healthcare
Last updated on Sep 17, 2023
0 min read
classification
,
datascience
Loan default prediction using Tidymodels
Machine Learning Scientist With an R track View certificate A gentle introduction to classification Classification is a form of machine learning in which you train a model to predict which category an item belongs to.
Last updated on Sep 15, 2023
26 min read
classification
,
datascience
,
dataviz
,
ggplot2
,
GLMS
Mastering data visualisation in R
Introduction ggplot2 is an R package by Hadley Wickham and Winston Chang that implements Wilkinson’s Grammar of Graphics. The emphasis of ggplot2 is on rapid exploration of data, and especially high-dimensional data.
Last updated on Sep 14, 2023
23 min read
ggplot2
,
janitor
,
Programming
dplyr vs Pandas
Introduction Over the years , the debate has always been …which one is better for data science ,R or Python? . I say it all depends with where you are coming from and as long as you get the job done .
Last updated on Sep 13, 2023
15 min read
dplyr
,
pandas
,
python
,
R
Generalised Linear Models 1
Generalized linear models Exponential Family The theory of GLMs is developed for data with distribution given y the exponential family. The form of the data distribution that is useful for GLMs is
Sep 9, 2023
14 min read
A glimpse of the tidyverse
Introduction Goal for Today Introduce you to R and Rstudio. What Are R and Rstudio? What Is R? R is an open source programming language with origins in C and FORTRAN.
Last updated on Sep 9, 2023
33 min read
dplyr
,
datascience
,
Rstats
Beyond Linear Regression
What do we mean by the linear model? Regression is the workhorse of statistics. It allows us to model a response variable as a function of predictors plus error. Linear regression is what most people first encounter in statistics.
Last updated on Sep 9, 2023
15 min read
regression
,
GAMS
,
GLMS
,
Time to event
Control flows in R
Every time some operations have to be repeated, a loop may come in handy. Loops are good for: Doing something for every element of an object; Doing something until the processed data runs out; Doing something for every file in a folder; Doing something that can fail, until it succeeds; Iterating a calculation until it reaches convergence.
Last updated on Sep 9, 2023
7 min read
Programming
,
Rstats
,
introduction
,
classification
Generalised Linear Models 2
Multinomial If we have more than two categories or groups that we want to model relative to covariates (e.g., we have observations \(i = 1,\cdots,n\) and groups covariates \(j = 1,\cdots,J\)), multinomial is our candidate model
Last updated on Sep 9, 2023
11 min read
classification
«
»
Cite
×