Creating a Wordcloud using NLP and TF-IDF in Python

提供方
在此指導項目中,您將:

Learn how to clean a dataset by removing encodings and unwanted words/characters

Learn how to lemmatize a text and fit a TF-IDF model

Learn how to create a wordcloud using TF-IDF scores

1.5 hours
初級
無需下載
分屏視頻
英語(English)
僅限桌面

By the end of this project, you will learn how to create a professional looking wordcloud from a text dataset in Python. You will use an open source dataset containing Christmas recipes and will create a wordcloud of the most important ingredients used in these recipes. I will teach you how load a JSON dataset, clean the dataset by removing encodings and unwanted characters, and lemmatize your dataset. I will also teach you how to calculate TF-IDF weights of words in your dataset and use these weights to create a wordcloud. You will create a ready-to-use Jupyter notebook for creating a wordcloud on any text dataset. Lemmatization is a process of removing inflectional endings only and to return the base or dictionary form of a word, which is known as the lemma. TF-IDF stands for term frequency-inverse document frequency. TF-IDF gives a weight to each word which tells how important that term is. Using both lemmatization and TF-IDF, one can find the important words in the text dataset and use these important words to create the wordcloud. For example, these datasets could be customer complaints and the business can focus on the important issues that the customers are facing. Wordcloud is a powerful resource which can be used in reports and presentations. Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

您要培養的技能

  • Natural Language Toolkit (NLTK)

  • Python Programming

  • Term Frequency Inverse Document Frequency (TF-IDF)

  • Wordnet

分步進行學習

在與您的工作區一起在分屏中播放的視頻中,您的授課教師將指導您完成每個步驟:

  1. Load a JSON dataset in Python

  2. Clean the dataset

  3. Remove encodings

  4. Lemmatize the text

  5. Fit TF-IDF model

  6. Create a Wordcloud

指導項目工作原理

您的工作空間就是瀏覽器中的雲桌面,無需下載

在分屏視頻中,您的授課教師會為您提供分步指導

常見問題

購買指導項目後,您將獲得完成指導項目所需的一切,包括通過 Web 瀏覽器訪問云桌面工作空間,工作空間中包含您需要了解的文件和軟件,以及特定領域的專家提供的分步視頻說明。

由於您的工作空間包含適合筆記本電腦或台式計算機使用的雲桌面,因此指導項目不在移動設備上提供。

指導項目授課教師是特定領域的專家,他們在項目的技能、工具或領域方面經驗豐富,並且熱衷於分享自己的知識以影響全球數百萬的學生。

您可以從指導項目中下載並保留您創建的任何文件。為此,您可以在訪問云桌面時使用‘文件瀏覽器’功能。

指導項目不符合退款條件。 請查看我們完整的退款政策

指導項目不提供助學金。

指導項目不支持旁聽。

您可在頁面頂部點按此指導項目的經驗級別,查看任何知識先決條件。對於指導項目的每個級別,您的授課教師會逐步為您提供指導。

是,您可以在瀏覽器的雲桌面中獲得完成指導項目所需的一切。

您可以直接在瀏覽器中於分屏環境下完成任務,以此從做中學。在屏幕的左側,您將在工作空間中完成任務。在屏幕的右側,您將看到有授課教師逐步指導您完成項目。