Blas M. Benito, PhD
Blas M. Benito, PhD
Resume
Experience
Skills
Software
Blog
Publications
Tags
Light
Dark
Automatic
Posts
My Reading List: Data Science
Live post with a curated list of high-quality data science posts and videos I found enlightening.
Blas M. Benito
Last updated on Mar 5, 2024
5 min read
Mapping Categorical Predictors to Numeric With Target Encoding
Target encoding is commonly used to map categorical variables to numeric with the objective of facilitating exploratory data analysis and machine learning modeling. This post covers the basics of this method, and explains how and when to use it.
Blas M. Benito
Last updated on Nov 15, 2023
24 min read
Everything You Don't Need to Know About Variance Inflation Factors
Deep explanation of what Variance Inflation Factors (VIF) are, how they work, what they really mean, and how they are used to manage multicollinearity in linear models.
Blas M. Benito
Last updated on Nov 6, 2023
16 min read
Multicollinearity Hinders Model Interpretability
In this post, I delve into the intricacies of model interpretation under the influence of multicollinearity, and use R and a toy data set to demonstrate how this phenomenon impacts both linear and machine learning models.
Blas M. Benito
Last updated on Nov 6, 2023
15 min read
Designing R functions to compute betadiversity indices from species lists
This is a tutorial written for R users needing to compute betadiversity indices from species lists rather than from presence-absence matrices, and for R beginners or intermediate users that want to start using their own functions.
Last updated on Jan 8, 2021
12 min read
Setup of a shared folder in a home cluster
In the previous posts I have covered how to setup a home cluster, and how to run parallel processes with foreach in R. However, so far I haven’t covered how to setup a folder shared among the cluster nodes to store the results of parallel computations.
Last updated on Jan 5, 2021
8 min read
Parallelized loops with R
Note: to better follow this tutorial you can download the .Rmd file from here. In a previous post I explained how to set up a small home cluster.
Last updated on Apr 1, 2021
19 min read
Setting up a home cluster
In this post I explain how to setup a small Beowulf cluster with a personal PC running Ubuntu 20.04 and a couple of Intel NUCs running Ubuntu Server 20.04, with the end-goal of parallelizing R tasks.
Last updated on Sep 6, 2023
11 min read
Cite
×