How can A linear model learn non-linear/discrete patterns?
Introduction During model development, one of the techniques that many don’t experiment with is feature discretization. The core idea is to transform a continuous feature into discrete features, mostly one-hot…
Tips For Writing Efficient And Faster SQL Queries
INTRODUCTION Query optimization is an important skill for SQL developers and database administrators (DBAs). In order to improve the performance of SQL queries, developers and DBAs need to understand the…
Questions asked in Data Scientist Interviews Part 7
In this series we bring new Data Scientist Interview Questions. You can read other tutorials related to same topic on our website. What is Cross Validation? Cross-Validation in Machine Learning…
Nonlinear SVM Classification | Machine Learning from Scratch
In previous blog, We read about Soft margin Classification. In this blog we will talk about Non Linear SVM Classification. Introduction Although linear SVM classifiers are efficient and work surprisingly…
Soft Margin Classification | Machine Learning from Scratch
In previous blog, We read about Linear SVM. In this blog we will talk about soft margin classification. If we strictly impose that all instances be off the street and…
ROC and AUC in Evaluating Classification Models
In the dynamic world of business, where data-driven decisions reign supreme, the accuracy and reliability of classification models play a pivotal role. Whether you’re involved in lead scoring or any…
Support Vector Machines (SVM) Algorithms
A Support Vector Machine (SVM) is a very powerful and versatile Machine Learning model, capable of performing linear or nonlinear classification, regression, and even outlier detection. It is one of…
What is early stopping? | Machine Learning from Scratch
Machine learning models, particularly those trained iteratively using algorithms like Gradient Descent, face the risk of overfitting the training data. One powerful and elegant solution to this challenge is known…
Information Gain in Machine Learning
Information Gain (IG) is critical in machine learning and decision tree algorithms, particularly in data classification and 𝐟𝐞𝐚𝐭𝐮𝐫𝐞 𝐬𝐞𝐥𝐞𝐜𝐭𝐢𝐨𝐧. Information Gain Information Gain is a concept used in the field…
Pyspark Tutorial for beginners
PySpark, it is a Python library for Apache Spark, an open-source distributed computing system. PySpark allows you to write Spark applications using Python programming language, providing a Python API for…
Extracting Financial Year from Date in Pandas and PySpark DataFrames
Introduction Working with date data often involves extracting relevant information, such as the financial year. In this blog post, we’ll explore how to extract the financial year from a date…
What is Lasso Regression? | Machine Learning from Scratch
Least Absolute Shrinkage and Selection Operator Regression (simply called Lasso Regression) is another regularized version of Linear Regression: just like Ridge Regression, it adds a regularization term to the cost…
Regularized Linear Models(Ridge Regression) | Machine Learning from Scratch
As we saw in previous posts, a good way to reduce overfitting is to regularize the model (i.e., to constrain it): the fewer degrees of freedom it has, the harder…
Hackerrank | Longest Subarray with Absolute Difference Constraint
Problem Overview Given an array of integers, our task is to find the longest subarray such that the absolute difference between any two elements is less than or equal to…
Hackerrank | Magic Square
We define a magic square to be an n*n matrix of distinct positive integers from 1 to n^2 where the sum of any row, column, or diagonal of length n…
A Python Script to Check IP Address Reachability
Introduction In the world of networking, ensuring that devices are reachable is a fundamental aspect of troubleshooting and monitoring. One common method to test this reachability is by using the…
Hackerrank | Cats and a Mouse
Two cats and a mouse are at various positions on a line. You will be given their starting positions. Your task is to determine which cat will reach the mouse…
Hackerrank | Birthday Candles Problem
You are in charge of the cake for a child’s birthday. You have decided the cake will have one candle for each year of their total age. They will only…
Hackerrank | Min Max Sum of N-1 Elements
Given five positive integers, find the minimum and maximum values that can be calculated by summing exactly four of the five integers. Then print the respective minimum and maximum values…
HackerRank Problem | Staircase detail
Staircase detail This is a staircase of size n=4: Its base and height are both equal to 4 . It is drawn using # symbols and spaces. The last line…