There was some JSON error while directly posting the code, so pasting the screenshot This whole process is called One Hot Encoding. So, Data Science, Software Development, Testing, etc. So, to get things sorted you need to specify this to the model that ‘bro, these are all categorical numbers and you dare not treat it as numbers’Ĭreate new column as binary column. Now the problem is that 2>1>0 and the model might treat it as this way. So, you will have Data Science, Software Development, Testing will be new columns with values as 0, 1, 2, etc. and you want to use these categorical variable in your model, then the best way to do is to make a column with binary variable with all the variables. Data Science, Software Development, Testing, etc. This is how One Hot Encoding worksīasically, if you have a column with Course Details like. What this means is that we want to transform a categorical variable or variables to a format that works better with classification and regression algorithms. One hot encoding is a representation of categorical variables as binary vectors. If you don’t know about it yet, then you are definitely missing out on something which can boost your rank. Be it a real-life Data Science problem or a Hackathon, one-hot encoding is one of the most important part of your data preparation. So, I just started solving the latest Hackathon on Analytics Vidhya, Women in the loop.
0 Comments
Leave a Reply. |