This article and related courses on DataCamp discuss how AI is used in retail to enhance operational efficiency and personalize customer experiences, including AI-driven visual merchandising and predictive analytics.
This course focuses on A/B testing, a common application of hypothesis testing in the industry. You will learn how to design and analyze A/B tests using Python. The course covers topics such as sample size calculation, statistical power, and the interpretation of results.
For those who want to go beyond the basics, this course covers advanced deep learning topics using Keras. You'll learn about functional APIs, custom loss functions, and how to build more complex models.
Learn the role Generative Artificial Intelligence plays today and will play in the future in a business environment. This course covers how a well-designed AI initiative connects business goals, data readiness, people, and technology into one framework.
This course provides a deep dive into ARIMA models, teaching you how to fit, forecast, and interpret these powerful time series models in Python.
This case study focuses on analyzing churn rates for a telecom company using Tableau. It covers creating calculated fields, various visualizations, and combining them into a story to share insights.
An interactive DataCamp course teaching the fundamentals of causal inference and how to implement various methods in R.
A hands-on DataCamp course focused on applying causal inference techniques using Python libraries.
This course provides a comprehensive introduction to data cleaning with Python. You'll learn to diagnose your data's dirtiness, and develop the skills to clean it. You'll deal with common data problems like missing values, inconsistent data types, and duplicates.
This course provides a practical guide to data cleaning in R, covering everything from common data problems to techniques for tidying data.
This course focuses on unsupervised learning, specifically clustering algorithms. It covers popular methods like K-Means and hierarchical clustering, and teaches how to apply them to real-world datasets.
This course focuses on various techniques to handle missing data in Python using libraries like pandas and scikit-learn, covering both simple and advanced imputation methods.
This course covers the process of exploring and analyzing data, from understanding a dataset to incorporating findings into a data science workflow. You will use Python to summarize, validate, and clean data.
This course teaches how to use graphical and numerical techniques in R to uncover the structure of your data and identify interesting relationships and unusual observations.
A project-based course where you'll apply EDA techniques to a real-world dataset of UN voting records, using R packages like dplyr and ggplot2.
This course provides a deep dive into gradient boosting and the popular XGBoost library. You'll learn how to build and tune high-performance machine learning models.
A hands-on course that covers various aspects of feature engineering for both categorical and continuous variables, as well as text data.
Learn to extract useful information from text and format it for machine learning models. The course covers POS tagging, named entity recognition, readability scores, and implementing tf-idf models using scikit-learn and spaCy.
Learn various feature engineering techniques in R to develop meaningful features. The course covers changing categorical features to numerical, manipulating numeric features, and transformation techniques like Box-Cox.
This course focuses on data wrangling and feature engineering with large datasets using PySpark. It covers preparing and cleaning data, creating new features, and building and evaluating a machine learning model.
This course provides a thorough introduction to handling missing data in R, with a focus on multiple imputation using the 'mice' package.
Learn how to tune your model's hyperparameters to get the best predictive results for your supervised learning models in R.
This hands-on course from DataCamp teaches you how to conduct hypothesis tests in Python. You will learn about different types of tests, including t-tests and chi-squared tests, and how to interpret their results. The course is interactive and includes many coding exercises.
This course focuses on image processing and computer vision using Keras. It covers techniques for image manipulation, feature extraction, and building image classification models.
This non-technical course equips you with the knowledge to ask the right questions about data and choose the right tools to read, interpret, and communicate data. You'll learn how to get from data to insights and how data drives decision-making.
This course provides a comprehensive introduction to Data Version Control (DVC) for managing and versioning machine learning data. Students will learn about the machine learning product lifecycle and the differences between data and code versioning.
This course provides a gentle introduction to deep learning using PyTorch. It covers the fundamentals of neural networks and how to build and train them for tasks like image classification.
This course introduces you to regression analysis using the statsmodels library in Python. You'll learn how to build, interpret, and evaluate linear regression models.
Learn the essentials of the Tidyverse in R for data wrangling and visualization, which includes powerful tools for cleaning and transforming data.
Learn to build, interpret, and tune linear classifiers, including logistic regression and support vector machines, using scikit-learn.
This course teaches how to implement machine learning use cases for marketing in Python, including predicting customer churn, measuring and forecasting customer lifetime value, and building customer segments.
This course teaches you how to apply machine learning techniques to time series data. It covers feature engineering, spectrograms, and advanced techniques for classification and prediction tasks.
This course provides a thorough introduction to the caret package in R for building and evaluating supervised learning models.
This course provides a comprehensive introduction to the scikit-learn library, the most popular Python library for machine learning. You'll learn how to use scikit-learn for a variety of machine learning tasks, including regression.
This course covers the fundamentals of tree-based models, including decision trees, random forests, and gradient boosting. You will learn how to build, tune, and evaluate these models using Python's scikit-learn library.
This course teaches you how to use tree-based models and ensembles for classification and regression in R.
Learn the fundamentals of optimization and how to apply them to data science problems using Python.
This course focuses on the pandas library, a powerful tool for data manipulation and analysis in Python. It is a great precursor to learning about regression and other machine learning techniques.
This course provides a deep dive into regression analysis using Python. You will learn about simple and multiple linear regression, as well as techniques for model evaluation and selection.
This course focuses on regularization techniques, such as Ridge and Lasso regression, which are used to prevent overfitting in machine learning models. You will learn the theory behind these techniques and how to apply them in practice.
This course teaches the fundamentals of statistical thinking using Python. You will learn to perform exploratory data analysis, think probabilistically, and understand the core concepts of statistical inference.
Learn to generate, explore, and evaluate machine learning models in R using the Tidyverse. The course covers multiple and logistic regression, tree-based models, and support vector machines.
This course covers four of the most common classification algorithms in R: k-nearest neighbors, logistic regression, Naive Bayes, and decision trees.
This course teaches you how to build predictive models using scikit-learn. You'll learn about classification and regression and apply your skills to real-world datasets.
Learn the basics of time series analysis in Python, including concepts like autocorrelation, stationarity, and how to use the statsmodels library for modeling and forecasting.
Learn the fundamentals of time series analysis using the R programming language, covering data manipulation, visualization, and modeling.
This course focuses on the critical skill of visualizing time series data to identify patterns, trends, and seasonality using Python libraries like Matplotlib and Seaborn.
A DataCamp project that dives into India's telecom sector to analyze customer churn. You'll use pandas and machine learning to study datasets from top telecom firms.
A 26-hour interactive course focused on building AI applications like chatbots and semantic search engines using LLMs and vector databases. It covers tools like the OpenAI API, Hugging Face, Langchain, and Pinecone for vector embeddings.
A career track focused on using R for data analysis, covering data manipulation, visualization, and case studies to build practical EDA skills.
Unlock the power of data and empower yourself to tackle complex data challenges. There's no prior knowledge or coding skills required. You'll become more data literate through hands-on exercises and explore reading, working with, analyzing, and communicating with data.
This track covers the fundamental concepts of machine learning, with a focus on supervised learning techniques using Python.
A career track on DataCamp that provides a comprehensive curriculum for aspiring machine learning scientists. It covers a wide range of topics, including supervised and unsupervised learning, deep learning, and natural language processing, with a focus on practical coding exercises.
A skill track on DataCamp that focuses on natural language processing (NLP). It covers techniques for text classification, sentiment analysis, and other NLP tasks, providing a solid foundation in this specialized area of AI.
A comprehensive skill track that covers various aspects of time series analysis in Python, from manipulation and visualization to statistical modeling and machine learning.
A tutorial that covers the essentials of synthetic data generation, including various techniques and tools. It provides practical Python code examples for creating synthetic data for AI and machine learning.
This training focuses on managing features for machine learning models to save time and improve consistency. It teaches best practices for feature engineering and how to reuse features across projects using a feature store.
Compare AI courses across all education providers
Explore courses by topic — data science, deep learning, NLP, and more
Follow structured learning paths for AI and machine learning
Start your AI journey with beginner-friendly courses
Take your AI skills to the next level
Master advanced AI techniques and research topics