A necessary feature for Decision Trees? #10399

Closed
sky660099 opened this issue Jan 4, 2018 · 1 comment

Comments

sky660099 commented Jan 4, 2018

Hi, I am a graduate student using sklearn for some data work.
While working with the decision tree module, I ran into an inconvenience:
Neither the classification tree nor the regression tree supports categorical features. That means the decision tree models can only accept continuous features.
For example, a categorical feature such as an app name (google, facebook) cannot be fed into the model, because it cannot be properly transformed into a continuous value. And there is no corresponding algorithm for splitting on discrete features in the decision tree module.
However, the CART algorithm itself accounts for categorical features. So I modified the decision tree code based on CART and applied the new model to my own work. In my experiments, support for categorical features did improve performance, so I think it is a necessary feature for decision trees.
I am very willing to contribute this to the sklearn community, but I'm new here and not familiar with the procedure.
Could you give some suggestions or comments on this new feature? Or has anyone already worked on it? Thank you so much.
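
To make the idea of a CART-style categorical split concrete, here is a minimal pure-Python sketch. It is not the modification described above and not scikit-learn code; the function name `best_categorical_split` and the toy data are hypothetical. It assumes a regression target scored by squared error, where ordering the categories by their mean target and testing only the splits along that ordering is a commonly used shortcut for finding the best two-way partition.

```python
from collections import defaultdict


def best_categorical_split(categories, targets):
    """Return (left_categories, sse) for the best two-way partition.

    Hypothetical helper for illustration only. Categories are sorted by
    their mean target, and each "prefix" of that ordering is scored by the
    total sum of squared errors (SSE) of the two resulting groups.
    Returns None when fewer than two distinct categories are present.
    """
    if len(categories) != len(targets):
        raise ValueError("categories and targets must have the same length")

    # Per-category count, sum, and sum of squares of the target.
    count = defaultdict(int)
    total = defaultdict(float)
    total_sq = defaultdict(float)
    for c, y in zip(categories, targets):
        count[c] += 1
        total[c] += y
        total_sq[c] += y * y

    ordered = sorted(count, key=lambda c: total[c] / count[c])
    if len(ordered) < 2:
        return None

    all_n = len(targets)
    all_sum = sum(total.values())
    all_sq = sum(total_sq.values())

    best = None
    left_n, left_sum, left_sq = 0, 0.0, 0.0
    # Move one category at a time from the right group to the left group.
    for i, c in enumerate(ordered[:-1]):
        left_n += count[c]
        left_sum += total[c]
        left_sq += total_sq[c]
        right_n = all_n - left_n
        right_sum = all_sum - left_sum
        right_sq = all_sq - left_sq
        # SSE of a group = sum(y^2) - (sum(y))^2 / n
        sse = (left_sq - left_sum ** 2 / left_n) + (right_sq - right_sum ** 2 / right_n)
        if best is None or sse < best[1]:
            best = (set(ordered[: i + 1]), sse)
    return best


# Toy data: app name as the categorical feature, a numeric target.
apps = ["google", "facebook", "google", "twitter", "facebook", "twitter"]
y = [1.0, 5.0, 1.2, 5.2, 4.8, 5.0]
left, sse = best_categorical_split(apps, y)
print(left, round(sse, 4))
```

On this toy data the chosen split separates "google" from the other two apps, without ever assigning the app names an arbitrary numeric order.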

Member

jnothman commented Jan 4, 2018 via email

@jnothman jnothman closed this as completed Jan 4, 2018