My Blog Posts

My Medium.com blog posts:

Gemini Data Science Agent

Data Analysis Made Easier

Published on Mar 9, 2025 4 min read

TabPFN: New Way to Analyze Data

Supervised learning is a machine learning method to predict an outcome using data that has a known label

or outcome as part of the data. If…

Published on Jan 11, 2025. 5 min read

Artificial Intelligence: Battle of the Biases

It seems to me that artificial intelligence can reduce confirmation bias but at the same time worsens automation bias. Let’s take a look.

Published on Dec 2, 2024. 4 min read

OpenEvidence: Free AI-Powered Medical Search Engine

Physicians make multiple decisions every day. According to one study, a group of 10 pediatric cardiologists made on average 158 decisions…

Published on Aug 22, 2024. 3 min read

Vizly: AI-Enabled Data Analytics

Part 2

Published on Aug 8, 2024. 7 min read

Vizly: AI-Enabled Data Analytics

Part 1

Published on Aug 1, 2024. 7 min read

High-Performance Predictive Analytics Without Programming

Robert E Hoyt and David Patrishkoff

Published on Jul 23, 2024. 17 min read

Can Large Language Models Create Tabular Synthetic Data?

They can, but they have a variety of challenges

Published on Apr 8, 2024. 8 min read

Beyond Static Models: Boost Your Results with Dynamic Optimization in Orange

Introduction

Published on Mar 13, 2024. 4 min read

No-Code Data Science: Part 1

Are You Ready?

Published on Nov 10, 2023. 6 min read

No-Code Data Science: Part 2

Our Story

Published on Nov 10, 2023. 5 min read

Synthetic Tabular Data Created by AI

Robert E. Hoyt

Published on Feb 5, 2023. 10 min read

Explainable Models — Unlock the Black Box

Introduction

Published on Jul 30, 2022. 6 min read

Imbalanced Datasets—What Are Possible Solutions?

Imbalanced Datasets Common and Challenging?

Published on Jul 26, 2022. 6 min read

Maximizing Orange for Data Science Education — Part 2

In part 1 of this series, I provided an overview of the data mining platform Orange which focuses on data science education. In part 2 I…

Published on Jul 23, 2022. 7 min read

Maximizing Orange for Data Science Education — Part 1

What is Orange?

Published on Jul 23, 2022. 7 min read

Microsoft Lobe: Image Recognition Made Simple

Published on Dec 18, 2020. 5 min read

Synthea: Do-It-Yourself Data

It is difficult to find patient-level data of sufficient size for research, modeling, or software development. This is largely due to…

Published on Dec 15, 2020. 5 min read

Data World: Platform for Data Science Collaboration

While there are multiple excellent commercial data science platforms available (Dataiku, Databricks, DataRobot, etc.), they are expensive…

Published on Nov 14, 2020. 4 min read

Evidence-Based Data Science — We Aren’t There Yet

Part 2

Published on Jul 16, 2020. 6 min read

Evidence-Based Data Science — We Aren’t There Yet

Part I

Published on Jul 14, 2020. 3 min read

Jamovi — A Free Statistical Package For Your Data Science Toolkit

In this data-centric world we live in we need lots of tools in our data science tool kit. The kit should include expertise in spreadsheets…

Published on Jul 10, 2020. 4 min read

How and Why We Used Google Apps to Write a Textbook

Creating a new textbook is a complex process, requiring collaboration and commitment by everyone involved. It is clearly different from…

Published on Jul 8, 2020. 5 min read

Google Dataset Search: Out of Beta

Google Dataset Search was launched in September 2018 with the goal to create a searchable public data repository. The search engine…

Published on Jan 30, 2020. 3 min read

Clinician Data Scientists?

It is widely accepted that data science is an in-demand profession for all industries including healthcare. The unfortunate reality is…

Published on Jan 12, 2020. 5 min read