Deep Learning | Towards Data Science

June Must-Reads: AI Agents, Dashboards, and More

TDS Editors — Thu, 03 Jul 2025 11:00:00 +0000

A selection of our most-read and -shared articles of the past month.

The post June Must-Reads: AI Agents, Dashboards, and More appeared first on Towards Data Science.

Taking ResNet to the Next Level

Muhammad Ardi — Thu, 03 Jul 2025 04:11:06 +0000

Understanding how ResNeXt improves upon ResNet, with a comprehensive PyTorch implementation guide

How I Automated My Machine Learning Workflow with Just 10 Lines of Python

Himanshu Sharma — Fri, 06 Jun 2025 13:11:46 +0000

Use LazyPredict and PyCaret to skip the grunt work and jump straight to performance.

Detecting Malicious URLs Using LSTM and Google’s BERT Models

Toluwase Babalola — Wed, 28 May 2025 18:38:05 +0000

A progressive approach to taking AI-powered webpage detection applications into production

Bayesian Optimization for Hyperparameter Tuning of Deep Learning Models

Kuriko Iwai — Tue, 27 May 2025 21:02:02 +0000

Explore how Bayesian Optimization outperforms Grid Search in both efficiency and performance on binary classification tasks.

Why Regularization Isn’t Enough: A Better Way to Train Neural Networks with Two Objectives

Mehdi Yazdani — Tue, 27 May 2025 18:09:12 +0000

Why splitting your objectives and your model might be the key to better performance and clearer trade-offs in deep learning.

Demystifying Policy Optimization in RL: An Introduction to PPO and GRPO

Joshua Nishanth A — Mon, 26 May 2025 18:25:23 +0000

A beginner-friendly guide to PPO and GRPO: simplifying policy optimization in reinforcement learning

The CNN That Challenges ViT

Muhammad Ardi — Tue, 06 May 2025 01:44:13 +0000

A PyTorch implementation of the ConvNeXt architecture

Why Are Convolutional Neural Networks Great For Images?

Caroline Arnold — Thu, 01 May 2025 01:00:07 +0000

How data symmetry informs neural network architectures

Adding Training Noise To Improve Detections In Transformers

Uri Almog — Mon, 28 Apr 2025 17:56:52 +0000

Denoising, explained

The Art of Noise

Muhammad Ardi — Thu, 03 Apr 2025 01:12:22 +0000

Understanding and implementing a diffusion model from scratch with PyTorch

Show and Tell

Muhammad Ardi — Mon, 03 Feb 2025 16:30:24 +0000

Implementing one of the earliest neural image caption generator models with PyTorch.

DeepSeek-V3 Explained 1: Multi-head Latent Attention

Shirley Li — Fri, 31 Jan 2025 10:02:05 +0000

Key architecture innovation behind DeepSeek-V2 and DeepSeek-V3 for faster inference

The Three Phases of Learning Machine Learning

Pascal Janetzky — Tue, 28 Jan 2025 13:02:11 +0000

Part One: The beginner phase

Understanding the Evolution of ChatGPT: Part 3- Insights from Codex and InstructGPT

Shirley Li — Tue, 21 Jan 2025 18:19:27 +0000

Mastering the art of fine-tuning: Learnings for training your own LLMs.

Influential Time-Series Forecasting Papers of 2023-2024: Part 1

Nikos Kafritsas — Fri, 17 Jan 2025 12:02:18 +0000

Exploring the latest advancements in time series

Why Data Scientists Can’t Afford Too Many Dimensions and What They Can Do About It

Niklas Lang — Thu, 16 Jan 2025 13:32:00 +0000

An in-depth article about dimensionality reduction and its most popular methods

Understanding Flash Attention: Writing the Algorithm from Scratch in Triton

Alex Dremov — Wed, 15 Jan 2025 17:01:59 +0000

Find out how Flash Attention works. Afterward, we'll refine our understanding by writing a GPU kernel of the algorithm in Triton.

LossVal Explained: Efficiently Estimate the Importance of Your Training Data

Tim Wibiral — Wed, 15 Jan 2025 14:01:59 +0000

How to Exploit the Loss Function for Efficient Data Valuation

From Darwin to Deep Work

Pascal Janetzky — Tue, 14 Jan 2025 13:02:21 +0000

Focus Strategies for Machine Learning Practitioners
