DomainML: A modular framework for domain knowledge-guided machine learning: Unterschied zwischen den Versionen

Aus SDQ-Institutsseminar
(Die Seite wurde neu angelegt: „{{Vortrag |vortragender=Lena Witterauf |email=lena.emma77@gmail.com |vortragstyp=Proposal |betreuer=Pawel Bielski |termin=Institutsseminar/2021-07-30 |kurzfass…“)
 
Keine Bearbeitungszusammenfassung
 
Zeile 2: Zeile 2:
|vortragender=Lena Witterauf
|vortragender=Lena Witterauf
|email=lena.emma77@gmail.com
|email=lena.emma77@gmail.com
|vortragstyp=Proposal
|vortragstyp=Masterarbeit
|betreuer=Pawel Bielski
|betreuer=Pawel Bielski
|termin=Institutsseminar/2021-07-30
|termin=Institutsseminar/2021-10-11 Zusatztermin
|kurzfassung=Standard, data-driven machine learning approaches learn relevant patterns solely from data. In some fields however, learning only from data is not sufficient. A prominent example for this is healthcare, where the problem of data insufficiency for rare diseases is tackled by integrating high-quality domain knowledge into the machine learning process.
|kurzfassung=Standard, data-driven machine learning approaches learn relevant patterns solely from data. In some fields however, learning only from data is not sufficient. A prominent example for this is healthcare, where the problem of data insufficiency for rare diseases is tackled by integrating high-quality domain knowledge into the machine learning process.



Aktuelle Version vom 5. Oktober 2021, 14:01 Uhr

Vortragende(r) Lena Witterauf
Vortragstyp Masterarbeit
Betreuer(in) Pawel Bielski
Termin Mo 11. Oktober 2021
Vortragsmodus
Kurzfassung Standard, data-driven machine learning approaches learn relevant patterns solely from data. In some fields however, learning only from data is not sufficient. A prominent example for this is healthcare, where the problem of data insufficiency for rare diseases is tackled by integrating high-quality domain knowledge into the machine learning process.

Despite the existing work in the healthcare context, making general observations about the impact of domain knowledge is difficult, as different publications use different knowledge types, prediction tasks and model architectures. It further remains unclear if the findings in healthcare are transferable to other use-cases, as well as how much intellectual effort this requires.

With this Thesis we introduce DomainML, a modular framework to evaluate the impact of domain knowledge on different data science tasks. We demonstrate the transferability and flexibility of DomainML by applying the concepts from healthcare to a cloud system monitoring. We then observe how domain knowledge impacts the model’s prediction performance across both domains, and suggest how DomainML could further be used to refine both the given domain knowledge as well as the quality of the underlying dataset.