Change-Adaptive Active Learning on Data Streams: Unterschied zwischen den Versionen

Aus SDQ-Institutsseminar
(Die Seite wurde neu angelegt: „{{Vortrag |vortragender=Julien Aziz |email=ukewr@student.kit.edu |vortragstyp=Proposal |betreuer=Marco Heyden |termin=Institutsseminar/2023-03-24 |vortragsmodu…“)
 
Keine Bearbeitungszusammenfassung
Zeile 6: Zeile 6:
|termin=Institutsseminar/2023-03-24
|termin=Institutsseminar/2023-03-24
|vortragsmodus=in Präsenz
|vortragsmodus=in Präsenz
|kurzfassung=TBA
|kurzfassung=Data streams are ubiquitous in modern applications such as predictive maintenance or quality control. Data streams can change in unpredictable ways, challenging existing supervised learning algorithms that assume a stationary relationship between input data and labels. Supervised learning algorithms for data streams must therefore "adapt" to changing data distributions. Active learning (AL), a sub-field of supervised learning, aims to reduce the total cost of labeling by identifying the most valuable data points for training. However, existing stream-based AL methods have difficulty adapting to changes in data streams as they rely mainly on the sparsely labeled data and ignore the regionality of changes, resulting in slow change adaptions.
 
To address these issues, the goal of this thesis is to develop an active learning policy for data streams that adapts to regional changes in the underlying data stream. Our idea is to enrich hierarchical data stream clustering with labeling statistics to measure the regionality and relevance of changes. We hypothesise that using such information in stream-based active learning leads to more effective labeling, resulting in faster change adaption.
}}
}}

Version vom 20. März 2023, 11:37 Uhr

Vortragende(r) Julien Aziz
Vortragstyp Proposal
Betreuer(in) Marco Heyden
Termin Fr 24. März 2023
Vortragssprache
Vortragsmodus in Präsenz
Kurzfassung Data streams are ubiquitous in modern applications such as predictive maintenance or quality control. Data streams can change in unpredictable ways, challenging existing supervised learning algorithms that assume a stationary relationship between input data and labels. Supervised learning algorithms for data streams must therefore "adapt" to changing data distributions. Active learning (AL), a sub-field of supervised learning, aims to reduce the total cost of labeling by identifying the most valuable data points for training. However, existing stream-based AL methods have difficulty adapting to changes in data streams as they rely mainly on the sparsely labeled data and ignore the regionality of changes, resulting in slow change adaptions.

To address these issues, the goal of this thesis is to develop an active learning policy for data streams that adapts to regional changes in the underlying data stream. Our idea is to enrich hierarchical data stream clustering with labeling statistics to measure the regionality and relevance of changes. We hypothesise that using such information in stream-based active learning leads to more effective labeling, resulting in faster change adaption.