Machine Learning - Geeky Codes

Why RAG Chatbots Struggle in Production

December 25, 2025 Geeky Codes No Comments

“Our RAG chatbot worked perfectly in the POC. But once we scaled to 50,000 documents… accuracy dropped to 60%.” If you’ve worked with enterprise RAG systems, you’ve probably heard this story.…

Data Science Decision Tree Machine Learning Random Forest

What is Stacking of Models in Machine Learning?

July 2, 2025 No Comments

The last Ensemble method we will discuss in this series is called stacking (short for stacked generalization). It is based on a simple idea: instead of using trivial functions (such…

Data Science Interview Interview Questions Machine Learning

Ensemble Learning: A Comprehensive Guide to AdaBoost and Gradient Boosting

April 14, 2024 No Comments

Introduction: In the realm of machine learning, ensemble learning techniques such as AdaBoost and Gradient Boosting have revolutionized the way we approach classification and regression tasks. These powerful algorithms harness…

Data Science Machine Learning Pandas

Introduction to Dimensionality Reduction

February 27, 2024 No Comments

The text discusses the curse of dimensionality in machine learning, highlighting challenges in high-dimensional spaces. It suggests reducing features to improve training efficiency and visualization, while addressing potential information loss…

Data Science Machine Learning Python

Sending Data in Unstructured File Form

February 23, 2024 No Comments

Unstructured data files consist of a series of bits. The file doesn’t separate the bits from each other in any way. You can’t simply look into the file and see…

Data Science Machine Learning Random Forest

Random Forests | Machine Learning from Scratch

February 23, 2024 No Comments

As we have discussed, a Random Forest is an ensemble of Decision Trees, generally trained via the bagging method (or sometimes pasting), typically with max_samples set to the size of…

Pandas Python

Accessing Data in Structured Flat-File Form

February 22, 2024 No Comments

In many cases, the data you need to work with won’t appear within a library, such as the toy datasets in the Scikit-learn library. Real-world data usually appears in a…

Data Science Decision Tree Machine Learning

Gini Impurity or Entropy? How to decide the root node in decision tree?

February 12, 2024 No Comments

By default, the Gini impurity measure is used, but you can select the entropy impurity measure instead by setting the criterion hyperparameter to “entropy”. The concept of entropy originated in…
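
As a rough illustration of the two impurity measures named in this excerpt, here is a minimal sketch that computes both for a list of class labels. It assumes the standard textbook definitions (Gini = 1 − Σ p², entropy = −Σ p·log₂ p). The helper names `gini` and `entropy` are hypothetical, written only for this example, and are not part of Scikit-learn.

```python
import math
from collections import Counter


def gini(labels):
    """Gini impurity of a node: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Entropy of a node in bits: -sum of p * log2(p) over the classes present."""
    n = len(labels)
    return -sum(
        (count / n) * math.log2(count / n) for count in Counter(labels).values()
    )


# Hypothetical node contents, chosen only to show the two extremes.
mixed_node = ["yes", "yes", "no", "no"]    # evenly split between two classes
pure_node = ["yes", "yes", "yes", "yes"]   # a single class

print(gini(mixed_node), entropy(mixed_node))
print(gini(pure_node), entropy(pure_node))
```

Both measures are zero for a pure node and largest for an evenly mixed one, which is why either can be used to score candidate splits.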

Data Science Decision Tree Machine Learning

Decision Trees | Machine Learning from Scratch

February 10, 2024 No Comments

Like SVMs, Decision Trees are versatile Machine Learning algorithms that can perform both classification and regression tasks, and even multioutput tasks. They are very powerful algorithms, capable of fitting complex…

Data Science Machine Learning Python

How can A linear model learn non-linear/discrete patterns?

February 8, 2024 No Comments

Introduction: During model development, one of the techniques that many don’t experiment with is feature discretization. The core idea is to transform a continuous feature into discrete features, mostly one-hot…

Data Science Machine Learning Support Vector Machine

Support Vector Machines (SVM) Algorithms

January 28, 2024 No Comments

A Support Vector Machine (SVM) is a very powerful and versatile Machine Learning model, capable of performing linear or nonlinear classification, regression, and even outlier detection. It is one of…

Data Science Machine Learning Python

What is early stopping? | Machine Learning from Scratch

January 23, 2024 No Comments

Machine learning models, particularly those trained iteratively using algorithms like Gradient Descent, face the risk of overfitting the training data. One powerful and elegant solution to this challenge is known…

Data Science Decision Tree Machine Learning Random Forest

Information Gain in Machine Learning

January 21, 2024 No Comments

Information Gain (IG) is critical in machine learning and decision tree algorithms, particularly in data classification and feature selection. Information Gain is a concept used in the field…

Data Science Machine Learning Python

What is Lasso Regression? | Machine Learning from Scratch

January 19, 2024 No Comments

Least Absolute Shrinkage and Selection Operator Regression (simply called Lasso Regression) is another regularized version of Linear Regression: just like Ridge Regression, it adds a regularization term to the cost…

Data Science Machine Learning

Regularized Linear Models (Ridge Regression) | Machine Learning from Scratch

January 18, 2024 No Comments

As we saw in previous posts, a good way to reduce overfitting is to regularize the model (i.e., to constrain it): the fewer degrees of freedom it has, the harder…

Data Science Decision Tree Machine Learning

Learning Curves | Machine Learning from Scratch

December 27, 2023 Geeky Codes No Comments

So far, we have read about Gradient Descent, Mini-Batch Gradient Descent, Stochastic Gradient Descent, and other types of Gradient Descent, as well as Polynomial Regression. In this post we will learn about Learning Curves…

Tag: Machine Learning
