Avishek Biswas, Author at Towards Data Science https://towardsdatascience.com Publish AI, ML & data-science insights to a global community of data professionals. Wed, 09 Jul 2025 01:59:32 +0000

How to Fine-Tune Small Language Models to Think with Reinforcement Learning https://towardsdatascience.com/how-to-finetune-small-language-models-to-think-with-reinforcement-learning/ Wed, 09 Jul 2025 01:59:20 +0000 https://towardsdatascience.com/?p=606531 A visual tour and from-scratch guide to train GRPO reasoning models in PyTorch

The post How to Fine-Tune Small Language Models to Think with Reinforcement Learning appeared first on Towards Data Science.

Sesame Speech Model: How This Viral AI Model Generates Human-Like Speech https://towardsdatascience.com/sesame-speech-model-how-this-viral-ai-model-generates-human-like-speech/ Sat, 12 Apr 2025 01:09:27 +0000 https://towardsdatascience.com/?p=605722 A deep dive into residual vector quantizers, conversational speech AI, and talkative transformers.

The post Sesame Speech Model: How This Viral AI Model Generates Human-Like Speech appeared first on Towards Data Science.

The Ultimate Guide to RAGs – Each Component Dissected https://towardsdatascience.com/the-ultimate-guide-to-rags-each-component-dissected-3cd51c4c0212/ Tue, 29 Oct 2024 23:53:10 +0000 https://towardsdatascience.com/the-ultimate-guide-to-rags-each-component-dissected-3cd51c4c0212/ A visual tour of what it takes to build CHAD-level LLM pipelines

The post The Ultimate Guide to RAGs – Each Component Dissected appeared first on Towards Data Science.

The Evolution of Text to Video Models https://towardsdatascience.com/the-evolution-of-text-to-video-models-1577878043bd/ Thu, 19 Sep 2024 19:04:03 +0000 https://towardsdatascience.com/the-evolution-of-text-to-video-models-1577878043bd/ Simplifying the neural nets behind Generative Video Diffusion

The post The Evolution of Text to Video Models appeared first on Towards Data Science.

Segment Anything 2: What Is the Secret Sauce? (A Deep Learner’s Guide) https://towardsdatascience.com/segment-anything-2-what-is-the-secret-sauce-a-deep-learners-guide-1c43dd07a6f8/ Tue, 06 Aug 2024 11:35:10 +0000 https://towardsdatascience.com/segment-anything-2-what-is-the-secret-sauce-a-deep-learners-guide-1c43dd07a6f8/ Foundation + Promptable + Interactive + Video. How?

The post Segment Anything 2: What Is the Secret Sauce? (A Deep Learner’s Guide) appeared first on Towards Data Science.

Monocular Depth Estimation with Depth Anything V2 https://towardsdatascience.com/monocular-depth-estimation-with-depth-anything-v2-54b6775abc9f/ Wed, 24 Jul 2024 06:24:11 +0000 https://towardsdatascience.com/monocular-depth-estimation-with-depth-anything-v2-54b6775abc9f/ How do neural networks learn to estimate depth from 2D images?

The post Monocular Depth Estimation with Depth Anything V2 appeared first on Towards Data Science.

The History of Convolutional Neural Networks for Image Classification (1989- Today) https://towardsdatascience.com/the-history-of-convolutional-neural-networks-for-image-classification-1989-today-5ea8a5c5fe20/ Fri, 28 Jun 2024 01:51:23 +0000 https://towardsdatascience.com/the-history-of-convolutional-neural-networks-for-image-classification-1989-today-5ea8a5c5fe20/ A tour through the history of Computer Vision!

The post The History of Convolutional Neural Networks for Image Classification (1989- Today) appeared first on Towards Data Science.
