Our main task in machine learning is to select a learning algorithm and train it on some data. So, the two things that can go wrong here are a “bad algorithm” and/or “bad data”.
Machine learning algorithms require thousands of examples for fairly simple problems, and for complex problems like image or speech recognition we may require millions of training examples.
It is important to use a training set that is representative of the cases we want to generalize to. If the sample is too small, we may have non-representative data due to chance (called sampling noise), and even very large samples can be non-representative if the sampling method is flawed (called sampling bias).
It is also important to look out for nonresponse bias (which occurs when the individuals willing to take part in a research study differ from those who do not want to, or are unable to, take part) during sampling.
If the training data is full of errors, missing values, outliers, and noise, it will be harder for the system to detect the underlying patterns during training, and so it will not perform well. It is usually well worth the effort to spend time cleaning up the training data.
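As a quick illustration, here is a minimal cleaning sketch with pandas; the columns (`age`, `salary`) and the values are made up for this example.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "age":    [25, 32, np.nan, 41, 29, 350],      # 350 is an obvious entry error
    "salary": [48000, 54000, 61000, np.nan, 52000, 58000],
})

df["salary"] = df["salary"].fillna(df["salary"].median())  # fill a missing value
df = df.dropna(subset=["age"])                             # or drop rows missing a value
df = df[df["age"].between(0, 120)]                         # discard an impossible outlier
print(df)
```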
A machine learning system can only learn if the training data contains enough relevant features and not too many irrelevant ones. Coming up with a good set of features to train on is called feature engineering, and it involves (see the sketch after this list):
- Feature selection
- Feature extraction (combining related features)
- Creating new features (by gathering new data).
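Here is a minimal sketch of all three steps on a synthetic dataset with scikit-learn; the dataset and the derived ratio feature are purely illustrative.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_regression

X, y = make_regression(n_samples=200, n_features=10, n_informative=3, random_state=42)

# Feature selection: keep the 3 features most correlated with the target.
X_selected = SelectKBest(f_regression, k=3).fit_transform(X, y)

# Feature extraction: combine related features into 3 principal components.
X_extracted = PCA(n_components=3).fit_transform(X)

# Feature creation: derive a new feature from existing ones (an arbitrary ratio here).
ratio = X[:, 0] / (X[:, 1] + 1e-9)
X_new = np.column_stack([X, ratio])

print(X_selected.shape, X_extracted.shape, X_new.shape)
```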
Overfitting means that the model performs well on the training data but does not generalize well to new instances.
Overfitting happens when the model is too complex relative to the amount and noisiness of the data, so the model ends up learning patterns in the noise itself.
Possible solutions:
- Simplify the model (select a model with fewer parameters, reduce the number of attributes in the training data, or constrain the model).
- Gather more training data.
- Reduce the noise in the training data (fix errors, handle missing values, remove outliers, etc.).
Constraining a model to make it simpler and reduce the risk of overfitting is called regularization. We need to find the right balance between fitting the training data perfectly and keeping the model simple enough to ensure that it generalizes well.
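As a minimal sketch of regularization, Ridge regression is ordinary linear regression with an L2 penalty on the weights; its `alpha` hyperparameter controls how strongly the model is constrained. The toy data below is made up for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

rng = np.random.default_rng(42)
X = rng.normal(size=(30, 15))             # few samples, many features: easy to overfit
y = X[:, 0] + 0.1 * rng.normal(size=30)   # only the first feature actually matters

unconstrained = LinearRegression().fit(X, y)
regularized = Ridge(alpha=10.0).fit(X, y)  # larger alpha = stronger constraint

# The regularized model keeps its weights much smaller overall.
print(np.abs(unconstrained.coef_).sum(), np.abs(regularized.coef_).sum())
```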
Underfitting is the opposite of overfitting. It means that our model is too simple to learn the underlying patterns in the data.
Possible solutions (see the sketch after this list):
- Select a more powerful model (with more parameters).
- Feed better features to the learning algorithm.
- Reduce the constraints on the model (e.g., lower the regularization strength).
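For example, a straight line cannot capture a quadratic relationship, so selecting a more powerful (polynomial) model fixes the underfit. The quadratic toy data below is made up for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(100, 1))
y = X[:, 0] ** 2 + 0.1 * rng.normal(size=100)   # quadratic ground truth

simple = LinearRegression().fit(X, y)            # too simple: underfits
powerful = make_pipeline(PolynomialFeatures(degree=2), LinearRegression()).fit(X, y)

print("linear R^2:    ", simple.score(X, y))     # low even on the training data
print("polynomial R^2:", powerful.score(X, y))   # close to 1.0
```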
Reference Book: Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow (https://www.oreilly.com/library/view/hands-on-machine-learning/9781492032632/)