Andrea D'Agostino, Author at Towards Data Science https://towardsdatascience.com Publish AI, ML & data-science insights to a global community of data professionals. Mon, 07 Jul 2025 22:20:21 +0000 en-US hourly 1 https://wordpress.org/?v=6.8.1 https://towardsdatascience.com/wp-content/uploads/2025/02/cropped-Favicon-32x32.png Andrea D'Agostino, Author at Towards Data Science https://towardsdatascience.com 32 32 POSET Representations in Python Can Have a Huge Impact on Business https://towardsdatascience.com/poset-representations-in-python-have-huge-impact-on-business/ Mon, 07 Jul 2025 22:19:51 +0000 https://towardsdatascience.com/?p=606512 Discover how POSET indicators transform data into coherent scoring systems, enabling meaningful comparisons while preserving the data’s multi-dimensional semantic structure.

The post POSET Representations in Python Can Have a Huge Impact on Business appeared first on Towards Data Science.

]]>
Feature Engineering Techniques for Numerical Variables in Python https://towardsdatascience.com/feature-engineering-techniques-for-numerical-variables-in-python-4bd42e8bded7/ Tue, 24 Sep 2024 16:59:40 +0000 https://towardsdatascience.com/feature-engineering-techniques-for-numerical-variables-in-python-4bd42e8bded7/ Learn the most useful feature engineering techniques to convert numerical values ​​into useful information for your predictive model using...

The post Feature Engineering Techniques for Numerical Variables in Python appeared first on Towards Data Science.

]]>
Guiding an LLM’s Response to Create Structured Output https://towardsdatascience.com/guiding-an-llms-response-to-create-structured-output-5dde0d3e426b/ Fri, 21 Jun 2024 07:17:27 +0000 https://towardsdatascience.com/guiding-an-llms-response-to-create-structured-output-5dde0d3e426b/ Learn how to structure a language model's response to ensure that the response format is respected, such as JSON

The post Guiding an LLM’s Response to Create Structured Output appeared first on Towards Data Science.

]]>
Extract any entity from text with GLiNER https://towardsdatascience.com/extract-any-entity-from-text-with-gliner-32b413cea787/ Sun, 24 Mar 2024 14:41:50 +0000 https://towardsdatascience.com/extract-any-entity-from-text-with-gliner-32b413cea787/ GLiNER is an NER model that can identify any type of entity using a bidirectional transformer encoder (similar to BERT) that outperforms...

The post Extract any entity from text with GLiNER appeared first on Towards Data Science.

]]>
Introduction to PCA in Python with Sklearn, Pandas, and Matplotlib https://towardsdatascience.com/introduction-to-pca-in-python-with-sklearn-pandas-and-matplotlib-476880f30238/ Wed, 06 Sep 2023 22:55:43 +0000 https://towardsdatascience.com/introduction-to-pca-in-python-with-sklearn-pandas-and-matplotlib-476880f30238/ Learn the intuition behind PCA in Python and Sklearn by transforming a multidimensional dataset into an arbitrary number of dimensions and...

The post Introduction to PCA in Python with Sklearn, Pandas, and Matplotlib appeared first on Towards Data Science.

]]>
Data Leakage: What It Is and Why It Causes Our Predictive Systems to Fail https://towardsdatascience.com/data-leakage-what-it-is-and-why-it-causes-our-predictive-systems-to-fail-4e5ed008489b/ Fri, 04 Aug 2023 06:56:45 +0000 https://towardsdatascience.com/data-leakage-what-it-is-and-why-it-causes-our-predictive-systems-to-fail-4e5ed008489b/ Data leakage represents, together with over/underfitting, the main cause of failure of machine learning projects that go into production

The post Data Leakage: What It Is and Why It Causes Our Predictive Systems to Fail appeared first on Towards Data Science.

]]>
A guide to handling categorical variables in Python https://towardsdatascience.com/guide-to-handling-categorical-variables-in-python-854d0b65a6ae/ Fri, 16 Jun 2023 08:59:52 +0000 https://towardsdatascience.com/guide-to-handling-categorical-variables-in-python-854d0b65a6ae/ A guide on how to approach categorical variables for machine learning and data science purposes

The post A guide to handling categorical variables in Python appeared first on Towards Data Science.

]]>
Introduction to statistical sampling and resampling https://towardsdatascience.com/introduction-to-statistical-sampling-and-resampling-1a6110965c3a/ Tue, 16 May 2023 18:27:17 +0000 https://towardsdatascience.com/introduction-to-statistical-sampling-and-resampling-1a6110965c3a/ Statistical sampling is a fundamental block of statistics that allows us to obtain information on the population of interest efficiently...

The post Introduction to statistical sampling and resampling appeared first on Towards Data Science.

]]>
How to identify outliers in data with Python https://towardsdatascience.com/how-to-identify-outliers-in-data-with-python-a9fe40235064/ Fri, 12 May 2023 19:29:23 +0000 https://towardsdatascience.com/how-to-identify-outliers-in-data-with-python-a9fe40235064/ An article exploring techniques for outlier detection in datasets. Learn how to use data visualization, z-scores, and clustering techniques...

The post How to identify outliers in data with Python appeared first on Towards Data Science.

]]>
Vector Representations for Machine Learning https://towardsdatascience.com/vector-representations-for-machine-learning-5047c50aaeff/ Tue, 25 Apr 2023 15:56:06 +0000 https://towardsdatascience.com/vector-representations-for-machine-learning-5047c50aaeff/ How data scientists convert real-world objects in numerical representation for the development of machine learning models

The post Vector Representations for Machine Learning appeared first on Towards Data Science.

]]>
Introduction to PyTorch: from training loop to prediction https://towardsdatascience.com/introduction-to-pytorch-from-training-loop-to-prediction-a70372764432/ Tue, 28 Mar 2023 16:48:28 +0000 https://towardsdatascience.com/introduction-to-pytorch-from-training-loop-to-prediction-a70372764432/ An introduction to PyTorch's training loop and general approach to tackle the library's steeper initial learning curve

The post Introduction to PyTorch: from training loop to prediction appeared first on Towards Data Science.

]]>
L1 vs L2 Regularization in Machine Learning: Differences, Advantages and How to Apply Them in… https://towardsdatascience.com/l1-vs-l2-regularization-in-machine-learning-differences-advantages-and-how-to-apply-them-in-72eb12f102b5/ Thu, 23 Feb 2023 14:03:50 +0000 https://towardsdatascience.com/l1-vs-l2-regularization-in-machine-learning-differences-advantages-and-how-to-apply-them-in-72eb12f102b5/ Delving into L1 and L2 regularization techniques in Machine Learning to explain why they are important to prevent model overfitting

The post L1 vs L2 Regularization in Machine Learning: Differences, Advantages and How to Apply Them in… appeared first on Towards Data Science.

]]>
How to Train a Word2Vec Model from Scratch with Gensim https://towardsdatascience.com/how-to-train-a-word2vec-model-from-scratch-with-gensim-c457d587e031/ Mon, 06 Feb 2023 20:13:04 +0000 https://towardsdatascience.com/how-to-train-a-word2vec-model-from-scratch-with-gensim-c457d587e031/ In this article we will explore Gensim, a very popular Python library for training text-based machine learning models, to train a Word2Vec...

The post How to Train a Word2Vec Model from Scratch with Gensim appeared first on Towards Data Science.

]]>
Benchmarking Machine Learning Models with Cross-Validation and Matplotlib in Python https://towardsdatascience.com/benchmarking-machine-learning-models-with-cross-validation-and-matplotlib-in-python-4957a41149e/ Mon, 23 Jan 2023 13:19:21 +0000 https://towardsdatascience.com/benchmarking-machine-learning-models-with-cross-validation-and-matplotlib-in-python-4957a41149e/ Learn how to create an object-oriented approach to compare and evaluate the performance of machine learning models using cross-validation...

The post Benchmarking Machine Learning Models with Cross-Validation and Matplotlib in Python appeared first on Towards Data Science.

]]>
How to prepare data for K-fold cross-validation in Machine Learning https://towardsdatascience.com/how-to-prepare-data-for-k-fold-cross-validation-in-machine-learning-924a44ec322c/ Mon, 19 Dec 2022 18:42:54 +0000 https://towardsdatascience.com/how-to-prepare-data-for-k-fold-cross-validation-in-machine-learning-924a44ec322c/ Cross-validation is the first technique to use to avoid overfitting and data leakage when we want to train a predictive model on our data. Its function is essential as it allows us to test functions and logics on our data in a safe way – namely, avoiding that these processes contaminate our validation data. If […]

The post How to prepare data for K-fold cross-validation in Machine Learning appeared first on Towards Data Science.

]]>
Get started with TensorFlow 2.0 – Introduction to deep learning https://towardsdatascience.com/a-comprehensive-introduction-to-tensorflows-sequential-api-and-model-for-deep-learning-c5e31aee49fa/ Tue, 22 Nov 2022 19:46:16 +0000 https://towardsdatascience.com/a-comprehensive-introduction-to-tensorflows-sequential-api-and-model-for-deep-learning-c5e31aee49fa/ Kickstart your understanding of one of TensorFlow's most powerful set of tools for deep learning

The post Get started with TensorFlow 2.0 – Introduction to deep learning appeared first on Towards Data Science.

]]>
Why having many features can hinder your model’s performance https://towardsdatascience.com/why-having-many-features-can-hinder-your-models-performance-865369b6b8b1/ Fri, 21 Oct 2022 15:41:32 +0000 https://towardsdatascience.com/why-having-many-features-can-hinder-your-models-performance-865369b6b8b1/ The activity of feature engineering can be very useful for improving the performance of a predictive model. However, it could worsen our...

The post Why having many features can hinder your model’s performance appeared first on Towards Data Science.

]]>
Content tagging with fuzzy logic in Python https://towardsdatascience.com/content-tagging-with-fuzzy-logic-in-python-d9041c572cac/ Tue, 11 Oct 2022 14:02:37 +0000 https://towardsdatascience.com/content-tagging-with-fuzzy-logic-in-python-d9041c572cac/ Learn how to use a simple script to tag textual content with fuzzy logic

The post Content tagging with fuzzy logic in Python appeared first on Towards Data Science.

]]>
What Is the Difference Between Machine Learning and Deep Learning? https://towardsdatascience.com/what-is-the-difference-between-machine-learning-and-deep-learning-d0fb539e07a1/ Thu, 15 Sep 2022 14:23:38 +0000 https://towardsdatascience.com/what-is-the-difference-between-machine-learning-and-deep-learning-d0fb539e07a1/ An introduction for new learners to these two, often misleading, concepts in data science

The post What Is the Difference Between Machine Learning and Deep Learning? appeared first on Towards Data Science.

]]>
What is cross-validation in machine learning https://towardsdatascience.com/what-is-cross-validation-in-machine-learning-14d2a509d6a5/ Tue, 30 Aug 2022 15:40:08 +0000 https://towardsdatascience.com/what-is-cross-validation-in-machine-learning-14d2a509d6a5/ Learn what cross-validation is - a fundamental technique for building generalizable models

The post What is cross-validation in machine learning appeared first on Towards Data Science.

]]>