Institutsseminar/2021-04-16
Datum | Freitag, 16. April 2021 | |
---|---|---|
Uhrzeit | 11:30 – 12:00 Uhr (Dauer: 30 min) | |
Ort | ||
Webkonferenz | https://conf.dfn.de/webapp/conference/979160755 | |
Vorheriger Termin | Fr 9. April 2021 | |
Nächster Termin | Fr 23. April 2021 |
Termin in Kalender importieren: iCal (Download)
Vorträge
Vortragende(r) | Patrick Ehrler |
---|---|
Titel | Feature Selection using Bayesian Optimization |
Vortragstyp | Bachelorarbeit |
Betreuer(in) | Jakob Bach |
Vortragssprache | |
Vortragsmodus | |
Kurzfassung | Datasets, like gene profiles from cancer patients, can have a large number of features. In order to apply prediction techniques, a lot of computing time and memory is needed. A solution to this problem is to reduce the number of features, whereby the main challenge is to still receive a satisfactory prediction performance afterwards. There are many state-of-the-art feature selection techniques, but they all have their limitations. We use Bayesian optimization, a technique to optimize expensive black-box-functions, and apply it to the problem of feature selection. Thereby, we face the challenge to adjust the standard optimization procedure to work with a discrete-valued search space, but also to find a way to optimize the acquisition function efficiently.
Overall, we propose 10 different Bayesian optimization feature selection approaches and evaluate their performance experimentally on 28 OpenML classification datasets. We do not only compare the approaches among themselves, but also to 9 state-of-the-art feature selection approaches. Our results state that especially four of our approaches perform well and can compete to most state-of-the-art approaches in terms of prediction performance. In terms of runtime, all our approaches do not perform outstandingly good, but similar to some filter and wrapper approaches. |
- Neuen Vortrag erstellen