• Home
  • About
  • Blog
  • Projects
  • Talks
Categories
All (63)
book (2)
color palettes (3)
emoji (3)
Feature Engineering (1)
generative art (1)
ggplot2 (16)
package development (4)
positron (1)
quarto (13)
slidecraft 101 (13)
smltar (2)
textrecipes (8)
textrecipes series (5)
tidymodels (10)
tidytext (12)
tidytuesday (5)
xaringan (2)

Dummy variables, sparse vs dense

tidymodels
Over the last year, I have worked on improving sparsity support in tidymodels. With that work, I spend time with various levels of abstraction, one of the low levels…
Apr 28, 2025

Fun Feature Engineering with Gen 1 Pokémon

Feature Engineering
tidytuesday

A tidytuesday inspired feature engineering journey into Pokémon

Apr 5, 2025

Positron: My Key Bindings

positron

Positron is a new and existing IDE for data science. Using and adding key bindings to improve my experience.

Jan 10, 2025

Slidecraft 101: Fragments - JS

slidecraft 101
quarto

Fragments are powerful tools in revealjs to allow for changes within slides

Aug 22, 2024

Slidecraft 101: Fragments - CSS

slidecraft 101
quarto

Fragments are powerful tools in revealjs to allow for changes within slides

Aug 21, 2024

Slidecraft 101: plot sizing

slidecraft 101
quarto

Plots and charts are useful in slides. Sizing them correctly makes them stand out and be readable.

Aug 1, 2024

Slidecraft 101: plot backgrounds

slidecraft 101
quarto

Plots and charts are useful in slides. Changing the background makes them fit in.

Jul 8, 2024

Slidecraft 101: Creative uses of SASS loops

slidecraft 101
quarto

Using nested for loops in SASS to create many CSS classes

Jul 7, 2024

Slidecraft 101: Using asciicast

slidecraft 101
quarto

Using sass interpolation, functions and mixin for the most in your style sheets

Jul 1, 2024

Slidecraft 101: Better SCSS files

slidecraft 101
quarto

Using sass interpolation, functions and mixin for the most in your style sheets

Jun 24, 2024

Managing python for R package development

package development

Some R packages use python, and setting up good practices makes the development easier.

Feb 19, 2024

Slidecraft 101: Layout

slidecraft 101
quarto

Placement of the different elements on a slide can change everything

Jan 16, 2024

Sharing Pictures of Code

Many different ways to share pictures of code

Dec 6, 2023

Slidecraft 101: 7 Tips and Tricks

slidecraft 101
quarto

7 quick tips and tricks

Oct 14, 2023

Slidecraft 101: Advanced slide themes

slidecraft 101
quarto

Creating full themes with sass

Aug 7, 2023

Slidecraft 101: theme variants

slidecraft 101
quarto

All about theme variants in slidecrafting

May 26, 2023

Slidecraft 101: Code and Output

slidecraft 101
quarto

All about styling code and output in slidecrafting

Sep 6, 2022

Slidecraft 101: Colors and Fonts

slidecraft 101
quarto

All about colors and fonts in slidecrafting

Aug 24, 2022

Supervised Machine Learning for Text Analysis in R

smltar
book

Story about “Supervised Machine Learning for Text Analysis in R” Book

Aug 4, 2021

xaringan first-letter

xaringan
I recently saw the use of the ::first-letter pseudo selector and I was hooked! and I was hooked! This selector allows you to style the first letter in a block, sometimes…
Mar 22, 2021

xaringancolor announcement

xaringan

Announcement post for the xaringancolor package.

Feb 4, 2021

Textrecipes Version 0.4.0

tidymodels
textrecipes
I’m happy to announce that version 0.4.0 of textrecipes got on CRAN a couple of days ago. This will be a brief post going over the major additions and changes.
Nov 13, 2020

Textrecipes series: Pretrained Word Embedding

tidymodels
textrecipes
textrecipes series

This is the fifth blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes. This post looks at how to use pretrained word embeddings.

Aug 7, 2020

Supervised Machine Learning for Text Analysis in R

smltar
book

Announcement post for Supervised Machine Learning for Text Analysis in R book.

Jul 27, 2020

Textrecipes series: Feature Hashing

tidymodels
textrecipes
textrecipes series

This is the fourth blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes. This post looks at how to perform feature hashing.

May 28, 2020

Textrecipes series: TF-IDF

tidymodels
textrecipes
tidytuesday
textrecipes series

This is the first blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes. This post looks at how

May 22, 2020

Textrecipes series: lexicons

tidymodels
textrecipes
tidytuesday
textrecipes series

This is the second blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes. This post talks about how to use lexicons.

May 12, 2020

Textrecipes series: Term Frequency

tidymodels
textrecipes
tidytuesday
textrecipes series

This is the first blog post in a series I am starting to go over the various text preprocessing workflows you can do with textrecipes.

May 5, 2020

tidytuesday: Part-of-Speech and textrecipes with The Office

tidymodels
textrecipes
tidytuesday

Using Part-of-Speech features to enhance text classification.

Mar 19, 2020

Word Rank Slope Charts

tidytext
I have been working on visualizing how different kinds of words are used in texts and I finally found a good visualization style with the slope chart. More specifically I’m…
Mar 17, 2020

Using stm to Investigate if Stemming is Appropriate

tidytext

Using stm to Investigate if Stemming is Appropriate

Mar 16, 2020

Use prismatic with after_scale() for finer control of colors in ggplot2

color palettes
ggplot2
With the release of version 3.3.0 of ggplot2 came the ability to have more control over the aesthetic evaluation. This allows us to modify the colors of the mapped palettes…
Feb 25, 2020

Deploy your bookdown project to Netlify with Github Actions

his post will show you how you can set up a bookdown site with Netlify using Github Actions.

Jan 20, 2020

Real Emojis in ggplot2

ggplot2
emoji
Emojis are now fully supported in {ggplot2} thanks to the {ragg} package. Read more about it here: Modern Text Features in R.
Jan 2, 2020

Paletteer version 1.0.0

ggplot2
color palettes

Announcement of version 1.0.0 of the {paletteer} package that.

Dec 18, 2019

Refactoring Tests

package development

This shows an experimental take on how you can refactor tests into functions.

Nov 25, 2019

Manipulating colors with {prismatic}

ggplot2
color palettes
I’m happy to announce my newest package prismatic which facilitates simple manipulations of colors. I had been working on this package online and offline for some time, but…
Oct 1, 2019

Authorship classification with tidymodels and textrecipes

tidymodels
textrecipes

An early attempt at using tidymodels to perform text classification.

Aug 9, 2019

Creating RStudio addin to modify selection

Creating addins to be used in RStudio.

Jul 30, 2019

Changing Glyph in legend in ggplot2

ggplot2
The newest version of ggplot2 3.2.0 gave us the ability to change the glyph in the legend like so
Jun 17, 2019

Custom Profiler in R

package development

This blog post is going to describe how to write a customizable profiling function

May 25, 2019

Center continuous palettes in ggplot2

ggplot2

This blog post will show you how to properly use a divergent color palette with a user-specified midpoint.

May 21, 2019

Circle Love - making hearts with circles

generative art

Generative art, pretty hearts with circles.

May 8, 2019

Text Classification with Tidymodels

tidymodels

An early attempt at using tidymodels to perform text classification.

Dec 29, 2018

usethis workflow for package development

package development

A quick overview on how to get started with creating an R package with the help of usethis.

Sep 2, 2018

What are the reviews telling us?

ggplot2
tidytext

We will scrape and analyze IMDb reviews. We will try using log odds to tell good and bad reviews apart.

Aug 17, 2018

ggplot2 trial and error - US trade data

ggplot2

This blog post will showcase an example of a workflow and its associated thought process when iterating through visualization styles working with ggplot2.

Jun 12, 2018

Emoji use on Twitter

ggplot2
tidytext
emoji

This post will be a short demonstration of how the occurrence of emojis on Twitter can be analyzed using tidytools.

Jun 4, 2018

Using PCA for word embedding in R

tidytext

A brief look at using PCA as a word embedding technique.

May 22, 2018

Analysing ethnic diversity in Californian school

ggplot2

An interesting study of a measure of diversity. This diversity measure was then applied to schools in California.

May 1, 2018

ggpage version 0.2.0 showcase

ggplot2

highlights of the ggpage package. It provides unique text-based visualizations.

Apr 7, 2018

Binary text classification with tidytext and caret

tidytext

One of my first attempts at text classification. This example uses tidytext and caret. There are mistakes here methodically and it should not be used as a guide.

Mar 31, 2018

Recreate - Sankey flow chart

ggplot2

This entry in the recreate series explores how gganimate can be used to create Sankey flow charts.

Mar 21, 2018

Tidy Text Summarization using TextRank

tidytext

I’m taking a look at text summarization using the TextRank package.

Mar 15, 2018

Co Occurrence of Characters in Les Miserable

ggplot2
tidytext

I’ll be creating a new spin on the Les Miserable character co-occurrence graph by using Named Entity Recognition to detect the characters.

Feb 23, 2018

Rvision: A first look

Recently I stumbled across the Rvision package, which frankly looks amazing so far. So I decided to take it for a spin and show you what I found.

Feb 15, 2018

Predicting authorship in The Federalist Papers with tidytext

tidytext

My first attempt at performing supervised predictive modeling using text features.

Jan 30, 2018

Visualizing trigrams with the Tidyverse

tidytext
ggplot2

I try my hand at a unique visualization of token trigrams.

Jan 23, 2018

Purrr - tips and tricks

With the advent of purrrresolution on Twitter I’ll throw my 2 cents in in form of my bag of tips and tricks.

Jan 8, 2018

Recreate - Sunshine Report

ggplot2

This mini-series (of in-determined length) will I try as best as I can to recreate great visualizations in tidyverse.

Jan 1, 2018

Analysing useR!2017 schedule data

tidytext

I enjoyed my first useR! conference so very much, so it is only natural thing for me was to explore the schedule to create some visualizations.

Jul 20, 2017

Repetition in musicals with tidytext

tidytext

A project looking at repetition in song lyrics. Using {rvest} to perform web scraping (now outdated) to fetch lyrics from genius.com.

Jun 5, 2017

2017 World Press Freedom Index with emojis

ggplot2
emoji

A first (now outclassed) attempt at using emojis in ggplot2. This post combines the hadley/emo package and data from the 2017 World Press Freedom Index.

Apr 26, 2017
No matching items