Size: a a a

AI / Искусственный Интеллект

2019 July 22

О

Орхан in AI / Искусственный Интеллект
May the force be with you
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
Haha thanks
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
I am mixed dataset. With 150 features (96 numeric /rest categorical) what is the best way to do feature selection before applying any ML algorithm?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
*have
источник

О

Орхан in AI / Искусственный Интеллект
Why do you want to reduce features?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
Because performance is very bad with all feature
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
I thought it may help for better performance
источник

О

Орхан in AI / Искусственный Интеллект
Have you tried PCA?
источник

DP

Defragmented Panda in AI / Искусственный Интеллект
Luke Skywalker
I am mixed dataset. With 150 features (96 numeric /rest categorical) what is the best way to do feature selection before applying any ML algorithm?
are all of this 96 numerical features indepentant of each other?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
I tried PCA after 1 hot encoding of categorical. But not suitable as Principal component 1 only explains 6% variance
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
Defragmented Panda
are all of this 96 numerical features indepentant of each other?
How to find out if independent?
источник

DP

Defragmented Panda in AI / Искусственный Интеллект
Luke Skywalker
How to find out if independent?
you were mentioning 1 hot encoding of age

so that you have ~60 inputs for age (from 20 to 80 years), right?

are this 60 input for age included in this 96 features as 1 feature or as 60?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
No age is only 1 feature /column like 18,30,25
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
I mean 1 hot encoding of categorical features not numeric
источник

DP

Defragmented Panda in AI / Искусственный Интеллект
okay, thats good
источник

DP

Defragmented Panda in AI / Искусственный Интеллект
is this task solvable by a human with the performance you want?

how do people detect cancer in your data?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
There is a target variable which says : healthy /cancer
источник

DP

Defragmented Panda in AI / Искусственный Интеллект
Luke Skywalker
There is a target variable which says : healthy /cancer
can people guess it from the data available to your forest?
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
No it's not possible
источник

LS

Luke Skywalker in AI / Искусственный Интеллект
There are 1800 rows and 150 features
источник