Petr Koráb, Author at Towards Data Science https://towardsdatascience.com Publish AI, ML & data-science insights to a global community of data professionals. Mon, 14 Jul 2025 23:44:33 +0000 en-US hourly 1 https://wordpress.org/?v=6.8.1 https://towardsdatascience.com/wp-content/uploads/2025/02/cropped-Favicon-32x32.png Petr Koráb, Author at Towards Data Science https://towardsdatascience.com 32 32 Topic Model Labelling with LLMs https://towardsdatascience.com/topic-model-labelling-with-llms/ Mon, 14 Jul 2025 23:44:18 +0000 https://towardsdatascience.com/?p=606581 Python tutorial for reproducible labeling of cutting-edge topic models with GPT4-o-mini.

The post Topic Model Labelling with LLMs appeared first on Towards Data Science.

]]>
Choose the Right One: Evaluating Topic Models for Business Intelligence https://towardsdatascience.com/choose-the-right-one-evaluating-topic-models-for-business-intelligence/ Thu, 24 Apr 2025 19:50:50 +0000 https://towardsdatascience.com/?p=605801 Python tutorial for evaluating top-notch bigram topic models in customer email classification

The post Choose the Right One: Evaluating Topic Models for Business Intelligence appeared first on Towards Data Science.

]]>
Topic Modelling in Business Intelligence: FASTopic and BERTopic in Code https://towardsdatascience.com/topic-modelling-in-business-intelligence-fastopic-and-bertopic-in-code-2d3949260a37/ Wed, 22 Jan 2025 18:02:13 +0000 https://towardsdatascience.com/topic-modelling-in-business-intelligence-fastopic-and-bertopic-in-code-2d3949260a37/ A comparison of two cutting-edge dynamic topic models solving consumer complaints classification exercise

The post Topic Modelling in Business Intelligence: FASTopic and BERTopic in Code appeared first on Towards Data Science.

]]>
Bigram Word Cloud Animates Your Data Stories https://towardsdatascience.com/bigram-word-cloud-animates-your-data-stories-753649d5cf4d/ Wed, 08 May 2024 05:50:27 +0000 https://towardsdatascience.com/bigram-word-cloud-animates-your-data-stories-753649d5cf4d/ Hands-on tutorial explaining how to create an Animated Word Cloud of bigram frequencies to display a text dataset in an MP4 video

The post Bigram Word Cloud Animates Your Data Stories appeared first on Towards Data Science.

]]>
Topic Modelling with BERTtopic in Python https://towardsdatascience.com/topic-modelling-with-berttopic-in-python-8a80d529de34/ Mon, 01 Apr 2024 12:25:04 +0000 https://towardsdatascience.com/topic-modelling-with-berttopic-in-python-8a80d529de34/ Hands-on tutorial on modeling political statements with a state-of-the-art transformer-based topic model

The post Topic Modelling with BERTtopic in Python appeared first on Towards Data Science.

]]>
Data Storytelling with Animated Word Clouds https://towardsdatascience.com/data-storytelling-with-animated-word-clouds-1889fdeb97b8/ Wed, 22 Nov 2023 02:34:37 +0000 https://towardsdatascience.com/data-storytelling-with-animated-word-clouds-1889fdeb97b8/ Introduction An animated word cloud displays absolute frequencies of n-grams (contiguous sequences of text sample items) over time as a sequence of images in a video file. It **** gives greater importance to words that appear more frequently in a source text. The bigger and bolder the n-gram displays, the more frequently it appears in […]

The post Data Storytelling with Animated Word Clouds appeared first on Towards Data Science.

]]>
Customer Satisfaction Measurement with N-gram and Sentiment Analysis https://towardsdatascience.com/customer-satisfaction-measurement-with-n-gram-and-sentiment-analysis-547e291c13a6/ Mon, 10 Apr 2023 18:52:40 +0000 https://towardsdatascience.com/customer-satisfaction-measurement-with-n-gram-and-sentiment-analysis-547e291c13a6/ Product reviews are an excellent source of information for qualified management decisions. Learn more about the right text mining...

The post Customer Satisfaction Measurement with N-gram and Sentiment Analysis appeared first on Towards Data Science.

]]>
Sentiment Analysis and Structural Breaks in Time-Series Text Data https://towardsdatascience.com/sentiment-analysis-and-structural-breaks-in-time-series-text-data-8109c712ca2/ Mon, 06 Mar 2023 19:04:48 +0000 https://towardsdatascience.com/sentiment-analysis-and-structural-breaks-in-time-series-text-data-8109c712ca2/ Arabica now offers a structural break and sentiment analysis module to enrich time-series text mining

The post Sentiment Analysis and Structural Breaks in Time-Series Text Data appeared first on Towards Data Science.

]]>
Text Data Pre-processing for Time-Series Models https://towardsdatascience.com/text-data-pre-processing-for-time-series-models-162c0d01f5c5/ Thu, 09 Feb 2023 13:29:44 +0000 https://towardsdatascience.com/text-data-pre-processing-for-time-series-models-162c0d01f5c5/ Have you ever thought about how sentiment from text data can be used as a regressor in time-series models?

The post Text Data Pre-processing for Time-Series Models appeared first on Towards Data Science.

]]>
Visualization Module in Arabica Speeds Up Text Data Exploration https://towardsdatascience.com/visualization-module-in-arabica-speeds-up-text-data-exploration-47114ad646ce/ Mon, 09 Jan 2023 21:51:17 +0000 https://towardsdatascience.com/visualization-module-in-arabica-speeds-up-text-data-exploration-47114ad646ce/ Arabica now offers unigram, bigram, and trigram word cloud, heatmap, and line chart to further accelerate time-series text data analysis

The post Visualization Module in Arabica Speeds Up Text Data Exploration appeared first on Towards Data Science.

]]>
Contour Plots and Word Embedding Visualisation in Python https://towardsdatascience.com/contour-plots-and-word-embedding-visualisation-in-python-9dd2dacff6ac/ Tue, 15 Nov 2022 14:50:53 +0000 https://towardsdatascience.com/contour-plots-and-word-embedding-visualisation-in-python-9dd2dacff6ac/ Contour plots are simple and very useful graphics for word embedding visualization. This end-to-end tutorial uses IMDb data to illustrate...

The post Contour Plots and Word Embedding Visualisation in Python appeared first on Towards Data Science.

]]>
Text as Time Series: Arabica 1.0 Brings New Features for Exploratory Text Data Analysis https://towardsdatascience.com/text-as-time-series-arabica-1-0-brings-new-features-for-exploratory-text-data-analysis-88eaabb84deb/ Thu, 20 Oct 2022 04:34:41 +0000 https://towardsdatascience.com/text-as-time-series-arabica-1-0-brings-new-features-for-exploratory-text-data-analysis-88eaabb84deb/ Arabica 1.0 improves time series text data analysis with an extended set of features

The post Text as Time Series: Arabica 1.0 Brings New Features for Exploratory Text Data Analysis appeared first on Towards Data Science.

]]>
Arabica: A Python Package for Exploratory Analysis of Text Data https://towardsdatascience.com/arabica-a-python-package-for-exploratory-analysis-of-text-data-3bb8d7379bd7/ Sat, 10 Sep 2022 18:46:22 +0000 https://towardsdatascience.com/arabica-a-python-package-for-exploratory-analysis-of-text-data-3bb8d7379bd7/ Arabica provides unigrams, bigrams, and trigrams frequencies by a period in a single line of code. Learn more in this tutorial.

The post Arabica: A Python Package for Exploratory Analysis of Text Data appeared first on Towards Data Science.

]]>
Text Network Analysis: Generate Beautiful Network Visualisations https://towardsdatascience.com/text-network-analysis-generate-beautiful-network-visualisations-a373dbe183ca/ Thu, 04 Aug 2022 00:23:47 +0000 https://towardsdatascience.com/text-network-analysis-generate-beautiful-network-visualisations-a373dbe183ca/ Discover how to visualize text networks in the circular, radial, and matrix forms: circos, hive, and matrix plots. At the same time, learn...

The post Text Network Analysis: Generate Beautiful Network Visualisations appeared first on Towards Data Science.

]]>
Text Network Analysis: A Concise Review of Network Construction Methods https://towardsdatascience.com/text-network-analysis-a-concise-review-of-network-construction-methods-8e875108f058/ Thu, 30 Jun 2022 12:34:54 +0000 https://towardsdatascience.com/text-network-analysis-a-concise-review-of-network-construction-methods-8e875108f058/ A concise, methodical guide, from research question definition to network structure estimation.

The post Text Network Analysis: A Concise Review of Network Construction Methods appeared first on Towards Data Science.

]]>
Text Network Analysis: Theory and Practice https://towardsdatascience.com/text-network-analysis-theory-and-practice-223ac81c5f07/ Mon, 06 Jun 2022 00:58:23 +0000 https://towardsdatascience.com/text-network-analysis-theory-and-practice-223ac81c5f07/ Text network analysis belongs to the broader skill set of most text data-oriented analysts.

The post Text Network Analysis: Theory and Practice appeared first on Towards Data Science.

]]>
Exploratory Data Analysis of Text Data – Do it Faster with TextData https://towardsdatascience.com/exploratory-data-analysis-of-text-data-do-it-faster-with-textdata-f7354dc72269/ Thu, 14 Apr 2022 21:54:01 +0000 https://towardsdatascience.com/exploratory-data-analysis-of-text-data-do-it-faster-with-textdata-f7354dc72269/ TextData simplifies exploratory data analysis of text data and saves a significant amount of coding time

The post Exploratory Data Analysis of Text Data – Do it Faster with TextData appeared first on Towards Data Science.

]]>
Fixing Google Trends Data Limitations https://towardsdatascience.com/trendecon-essential-r-package-to-construct-time-series-from-google-trends-36365ea48382/ Wed, 23 Mar 2022 10:40:55 +0000 https://towardsdatascience.com/trendecon-essential-r-package-to-construct-time-series-from-google-trends-36365ea48382/ Google Trends data suffer from several drawbacks. TrendEcon, a marvelous R package, tackles them and helps create consistent long-run time...

The post Fixing Google Trends Data Limitations appeared first on Towards Data Science.

]]>
Create Digital Holograms in Python https://towardsdatascience.com/create-digital-holograms-in-python-with-holopy-eacacad989ae/ Mon, 07 Mar 2022 00:12:19 +0000 https://towardsdatascience.com/create-digital-holograms-in-python-with-holopy-eacacad989ae/ Exploring the world of digital holograms with HoloPy - a python package for computational light scattering and digital holography

The post Create Digital Holograms in Python appeared first on Towards Data Science.

]]>
The Most Favorable Pre-trained Sentiment Classifiers in Python https://towardsdatascience.com/the-most-favorable-pre-trained-sentiment-classifiers-in-python-9107c06442c6/ Fri, 11 Feb 2022 03:15:49 +0000 https://towardsdatascience.com/the-most-favorable-pre-trained-sentiment-classifiers-in-python-9107c06442c6/ Inspecting the performance of Vader, Happy Transformer, TextBlob, and Google NL API, discussing their limitations and tips for selecting...

The post The Most Favorable Pre-trained Sentiment Classifiers in Python appeared first on Towards Data Science.

]]>