Course

Limpieza de datos para el procesamiento de lenguaje natural

Universidad Austral

This course provides comprehensive knowledge for extracting, cleaning, and preparing data for NLP applications. It requires basic to intermediate programming skills, preferably in Python, and familiarity with the Anaconda Jupyter Notebooks environment. Python 3.6 or higher is used for application development. The course utilizes Anaconda's Notebook as the primary code editor but allows the use of any text editor supporting Anaconda notebooks. Essential libraries include NLTK, Pandas, Scikit-learn, and data extraction libraries.

The course comprises four modules:

  1. Web Scraping for Natural Language Processing
  2. HTML Parsing for Natural Language Processing
  3. Advanced Scraping Techniques
  4. Text Manipulation Techniques

Each module explores specific topics such as web scraping, HTML parsing, advanced scraping, and text manipulation. The content includes lectures, tests, and supplementary materials to ensure a comprehensive learning experience.

Certificate Available ✔

Get Started / More Info
Limpieza de datos para el procesamiento de lenguaje natural
Course Modules

This course encompasses modules on web scraping, HTML parsing, advanced scraping techniques, and text manipulation, providing a comprehensive understanding of data preparation for NLP.

Web Scraping para Procesamiento de Lenguaje Natural

This module delves into web scraping for NLP, covering its stages, problem-solving capabilities, and techniques for extracting various web page formats. The content also includes a test, evaluation, and a podcast sharing professional experiences.

HTML Parsing para Procesamiento de Lenguaje Natural

Explore HTML parsing techniques, including parser construction, parsing types, and in-depth parsing processes. The module also addresses parsing techniques, evaluation, and professional insights through a podcast.

Técnicas avanzadas de Scraping

This module focuses on advanced parsing, concurrent and distributed programming, programming advanced parsers, and selecting the appropriate programming language for parser construction. It also includes a test, evaluation, and a professional podcast.

Técnicas de Manipulación de texto

Discover techniques for text manipulation, including connector construction, web scraping connectors, alternatives in connector construction, and modular connector construction. The module features tests, evaluations, and professional insights.

More Machine Learning Courses

Recommender Systems

University of Minnesota

Recommender Systems Specialization provides a comprehensive understanding of building and evaluating recommender systems for various applications.

Deploying a Pytorch Computer Vision Model API to Heroku

Coursera Project Network

Deploying a PyTorch Computer Vision Model API to Heroku offers hands-on experience in deploying a Flask REST API using a pre-trained PyTorch image classification...

Series Temporales con Pycaret y Python

Coursera Project Network

In this practical project, you will learn to train models using Pycaret in Python to predict time series data, including advanced ensemble techniques.

Generative AI Essentials: Overview and Impact

University of Michigan

Generative AI Essentials: Overview and Impact introduces learners to the fundamentals of generative AI, exploring its ethical use, implications for authorship, and...