Patient Rule Induction Method with Active Learning: Unterschied zwischen den Versionen

Aus SDQ-Institutsseminar
(Die Seite wurde neu angelegt: „{{Vortrag |vortragender=Emmanouil Emmanouilidis |email=ubesb@student.kit.edu |vortragstyp=Proposal |betreuer=Vadim Arzamasov |termin=Institutsseminar/2019-11-2…“)
 
Keine Bearbeitungszusammenfassung
 
(4 dazwischenliegende Versionen von 2 Benutzern werden nicht angezeigt)
Zeile 4: Zeile 4:
|vortragstyp=Proposal
|vortragstyp=Proposal
|betreuer=Vadim Arzamasov
|betreuer=Vadim Arzamasov
|termin=Institutsseminar/2019-11-29
|termin=Institutsseminar/2019-11-29 Zusatztermin
|kurzfassung=Kurzfassung
|kurzfassung=PRIM (Patient Rule Induction Method) is an algorithm for discovering scenarios from simulations, by creating hyperboxes, that are human-comprehensible. Yet PRIM alone requires relatively large datasets and computational simulations are usually quite expensive. Consequently, one wants to obtain a plausible scenario, with a minimal number of simulations. It has been shown, that combining PRIM with  ML models, which generalize faster, can reduce the number of necessary simulation runs by around 75%.
We will try to reduce the number of simulation runs even further, using an active learning approach to train an intermediate ML model.
Additionally, we extend the previously proposed methodology to not only cover classification but also regression problems. A preliminary experiment indicated, that the combination of these methods, does indeed help reduce the necessary runs even further. In this thesis, I will analyze different AL sampling strategies together with several intermediate ML models to find out if AL can systematically improve existing scenario discovery methods and if a most beneficial combination of sampling method and intermediate ML model exists for this purpose.
}}
}}

Aktuelle Version vom 26. November 2019, 19:44 Uhr

Vortragende(r) Emmanouil Emmanouilidis
Vortragstyp Proposal
Betreuer(in) Vadim Arzamasov
Termin Fr 29. November 2019
Vortragsmodus
Kurzfassung PRIM (Patient Rule Induction Method) is an algorithm for discovering scenarios from simulations, by creating hyperboxes, that are human-comprehensible. Yet PRIM alone requires relatively large datasets and computational simulations are usually quite expensive. Consequently, one wants to obtain a plausible scenario, with a minimal number of simulations. It has been shown, that combining PRIM with ML models, which generalize faster, can reduce the number of necessary simulation runs by around 75%.

We will try to reduce the number of simulation runs even further, using an active learning approach to train an intermediate ML model. Additionally, we extend the previously proposed methodology to not only cover classification but also regression problems. A preliminary experiment indicated, that the combination of these methods, does indeed help reduce the necessary runs even further. In this thesis, I will analyze different AL sampling strategies together with several intermediate ML models to find out if AL can systematically improve existing scenario discovery methods and if a most beneficial combination of sampling method and intermediate ML model exists for this purpose.