About: geekycodesco
Posts by geekycodesco:
What is Data Imputation and it’s different techniques
Data imputation is an essential technique in data science that involves filling in missing values in a dataset. Missing values can affect the accuracy of predictive models and cause biased results. In this article, we will explore various data imputation techniques to help you choose the best approach for your project. There are several methods […]
How to do Ensembling in machine learning?
Ensembling is a powerful technique for improving the performance of machine learning models. This article will provide an overview of ensembling and explore popular techniques such as bagging, boosting, and stacking. By using ensembling methods, you can improve model accuracy and generalization. There are several types of ensembling techniques, including: To select the best ensemble […]
How to handle categorical data in machine learning
Understanding Categorical Data and its Importance in Machine Learning Categorical data is a type of data that can be divided into distinct groups or categories. In machine learning, it is common to encounter categorical data in the form of labels, such as a classification problem where the output is a set of predefined categories. Handling […]
How to connect OpenAI api with python code
Steps to Create a Tensorflow Model
There are 3 fundamental steps to creating a model Create a Model -> Connect the layers of NN yourself by using Sequential or Functional API or import a previously built model(Transfer Learning) Compile a Model -> Define how a model’s performance should be measured(metrics) and how to improve it by using an optimizer(Adam, SGD, etc.) […]
How to deal with outliers
In this Notebook, we will describe how to deal with outliers Trimming outliers from the dataset Performing winsorization Winsorizing is different from trimming because the extreme values are not removed, but are instead replaced byother values. Data greater than quantile 90 percent is replaced by value at 90 quantiles similarly less thenquantile 5 percent is […]
What is data leakage in Machine Learning
When training a machine learning model, we normally prefer selecting a generalized model which is performing well both on training and validation/test data. However, there can be a situation where the model performs well during testing but fails to achieve the same level of performance with real-world (production data) usage. For example, your model is […]
Most Popular SQL Commands everyone should know
SQL Commands:SQL Commands are instructions. It is used to communicate with the database. It is also used to perform specific tasks, functions, and queries of data. SQL can perform various tasks like creating a table. add data to tables, drop the table modify the table. set permission for users.Types of SQL Commands: There are five […]
Five Courses that can be finished in one week to advance Pandas skills
𝟏. 𝐖𝐫𝐢𝐭𝐢𝐧𝐠 𝐄𝐟𝐟𝐢𝐜𝐢𝐞𝐧𝐭 𝐂𝐨𝐝𝐞 𝐰𝐢𝐭𝐡 𝐩𝐚𝐧𝐝𝐚𝐬: This course will build on your knowledge of Python and the panda’s library and introduce you to efficient built-in pandas functions to perform tasks faster. Link:- Get the course here 𝟐. 𝐉𝐨𝐢𝐧𝐢𝐧𝐠 𝐃𝐚𝐭𝐚 𝐰𝐢𝐭𝐡 𝐩𝐚𝐧𝐝𝐚𝐬: In this course, you will learn to handle multiple DataFrames by combining, organizing, joining, […]