Data Science - Machine learning nuggets

Sign in Subscribe

Data Science

A collection of 12 posts

Top 20 Pandas Functions You Aren't Using, Which You Should Be Using

This blog post will explore 20 powerful and unique Pandas functions that can significantly enhance your data analysis workflow. We will be using the famous Iris dataset as an example to demonstrate each function. The Iris dataset contains four features: Sepal Length, Sepal Width, Petal Length, and Petal Width, along

Image by author

Entropy, information gain, and Gini impurity(Decision tree splitting criteria)

Decision trees are supervised machine-learning models used to solve classification and regression problems. They help to make decisions by breaking down a problem with a bunch of if-else-then-like evaluations that result in a tree-like structure. For quality and viable decisions to be made, a decision tree builds itself by splitting

decision trees and random forests featured image

Decision Trees and Random Forests(Building and optimizing decision tree and random forest models)

In the modern world, so much data is present on the internet. Organizations need efficient and rigorous algorithms to handle these huge chunks of data, make practical analyses, and provide appropriate decisions relevant to maximizing their profits and market presence. There are such algorithms commonly used today for decision-making processes.

Gradio tutorial (Build machine learning applications)

Gradio tutorial (Build machine learning applications)

You have built your optimally performing machine learning model. What next? This tutorial explores the use of Gradio in building machine learning applications. What is Gradio? Gradio is an open-source Python package that allows you to quickly create easy-to-use, customizable UI components for your ML model, any API, or even

logistic regression

Logistic regression in Python with Scikit-learn

In linear regression, we tried to understand the relationship between one or more predictor variables and a continuous response variable. This article will explore logistic regression, where the response variable will be discrete or categorical. What is classification? Classification is a supervised machine learning problem of predicting which category or

scikit-learn linear regression mlnuggets feature image

Linear regression in Python with Scikit-learn (With examples, code, and notebook)

Scikit-learn is a handy and robust library with efficient tools for machine learning. It provides a variety of supervised and unsupervised machine learning algorithms. The library is written in Python and is built on Numpy, Pandas, Matplotlib, and Scipy. In this tutorial, we will discuss linear regression with Scikit-learn. What

Seaborn tutorial mlnuggets feat image

Seaborn tutorial

Seaborn is a simple, easier-to-learn open-source data visualization Python library that provides fantastic default styles and color palettes to create attractive and informative statistical plots. Seaborn is built on top of Matplotlib. Matplotlib treats Figures and Axes as objects and focuses on how to draw them. Seaborn has a dataset-oriented,

Data visualization with Matplotlib

Data visualization with Matplotlib

💡"A good sketch is better than a long speech"(Napoleon Bonaparte). Organizations collect and analyze vast amounts of data from sales revenue, marketing performance, customer interactions, inventory levels, production metrics, staffing levels, costs, etc. This can be too much data that it is impossible to effectively understand and

python for data science(mlnuggets)

Data Science Featured

Python for data science tutorial (Complete guide with examples and notebook)

This article will dive into fundamental Python concepts you need to understand before using Python for data science and machine learning. Let's dive right in! What is Python? Python is the language of preference for most data scientists. It is a general-purpose, high-level programming language that supports object-oriented,

Streamlit tutorial(How to build
machine learning applications)

Data Science Featured

Streamlit tutorial(How to build machine learning applications)

Data science deals with large volumes of data using modern tools and methods to extract hidden patterns, obtain meaningful information, and inform business decisions. The application of data science in business, education, and economics has led to the emergence of various tools. Applying data science requires understanding the main components

Pandas tutorial (A complete guide with examples and notebook)

Data Science Featured

Pandas tutorial (A complete guide with examples and notebook)

Pandas is an open-source Python library that provides a rich collection of data analysis tools for working with datasets. It borrows most of its functionality from the NumPy library. Therefore, we advise that you go through our NumPy tutorial first. As we dive into familiarizing ourselves with Pandas, it is

Data Science Featured

NumPy tutorial(Everything you need to know about NumPy with examples)

So, you have decided to venture into data science and machine learning, and maybe you have been using Python for other projects or you are new to Python. Well, you just pointed yourself to the right path. However, to venture into data science and machine learning, you will need to