So far we have discussed how to use a trained Decision Tree to make classifications. However, in order to truly understand the implications that underlie these models, we should also take a dive into the actual training process.
Find the best attribute.
Imagine we have a dataset with m attributes for each instance. The first step would be to find the best attribute to test. To do this, we need to define a criterion that allows us to measure how informative testing on a particular attribute will be.
There are many variations of criteria, but for the sake of simplicity, we will focus on Information Gain.
Before we introduce Information Gain, it is important that we define precisely what the information of a dataset is. In the context of Decision Trees, the information of a dataset is more commonly known as the entropy of that dataset. To better illustrate what entropy is, let us work through an example:
Dataset =
{
Amanda: iPhone,
John: iPhone,
Sam: Blackberry,
Robert: iPhone,
Ruby: Blackberry,
Shirley: iPhone
}
There are two unique classes, iPhone and Blackberry, for which we need to find the probabilities. This step is a simple counting problem:
- Pr(iPhone) = # of instances with class iPhone / # of instances = 4/6 = 2/3
- Pr(Blackberry) = # of instances with class Blackberry / # of instances = 2/6 = 1/3
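In code, this counting step could look like the following minimal sketch (the variable names and the dictionary representation of the dataset are illustrative, not part of the original example):

```python
from collections import Counter
from fractions import Fraction

# The toy dataset from above: person -> phone class.
dataset = {
    "Amanda": "iPhone",
    "John": "iPhone",
    "Sam": "Blackberry",
    "Robert": "iPhone",
    "Ruby": "Blackberry",
    "Shirley": "iPhone",
}

# Count how many instances fall into each class.
counts = Counter(dataset.values())
total = sum(counts.values())

# Pr(class) = # of instances with that class / total # of instances
class_probabilities = {label: Fraction(n, total) for label, n in counts.items()}
print(class_probabilities)  # {'iPhone': Fraction(2, 3), 'Blackberry': Fraction(1, 3)}
```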
From here we plug the probabilities into the formula for entropy shown below.

I(p1, ..., pn) = -p1 * log2(p1) - p2 * log2(p2) - ... - pn * log2(pn)

In our case n = 2, because there are only two unique classes, with p1 = 2/3 and p2 = 1/3. Thus, we end up with an entropy of

I(2/3, 1/3) = -2/3 * log2(2/3) - 1/3 * log2(1/3) ≈ 0.918
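As a quick check, here is a minimal sketch of that formula as a Python function (the name entropy is my own; terms with a probability of 0 are skipped, since they contribute nothing):

```python
import math

def entropy(probabilities):
    # I(p1, ..., pn) = -sum(pi * log2(pi)), skipping pi == 0 terms.
    return -sum(p * math.log2(p) for p in probabilities if p > 0)

print(entropy([2/3, 1/3]))  # ~0.918 bits
```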
We can now start to form an intuition as to what exactly entropy is telling us about the dataset.
Entropy tells us how heterogeneous our dataset is.
Note that heterogeneous means "how mixed up is our data, particularly the class labels". So, when making a prediction about a certain class, we want the data to be as homogeneous as possible.
Imagine having 5 apples and 5 oranges in a bag (very heterogeneous, not homogeneous). Would you be able to confidently predict which fruit you are going to get if you choose blindly? Probably not. On the other hand, if we had 9 apples and 1 orange in a bag (very homogeneous, not very heterogeneous), it becomes much easier to predict that an apple is the fruit you are going to get, as the quick comparison below shows.
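Running the two bags through the entropy sketch from above makes the difference concrete:

```python
print(entropy([5/10, 5/10]))  # 1.000 bits: maximally mixed, hardest to predict
print(entropy([9/10, 1/10]))  # ~0.469 bits: mostly apples, much easier to predict
```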
Consequently, the lower the entropy → the more homogeneous the dataset → the more informative it is about a particular class.