Formulas Flashcards by Pascal Pianezzi

Optimizer

Eta / Alpha

Learning rate: the step size in gradient descent.
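To make the role of the step size concrete, here is a minimal sketch of gradient descent on a one-dimensional problem. The objective f(w) = (w - 3)^2, the helper names `gradient_descent` and `grad_f`, and the value eta = 0.1 are assumptions chosen for illustration, not part of the original card.

```python
def gradient_descent(grad, w0, eta, steps):
    """Repeatedly apply the update w <- w - eta * grad(w)."""
    w = w0
    for _ in range(steps):
        w = w - eta * grad(w)  # eta scales how far each step moves
    return w


def grad_f(w):
    """Gradient of the hypothetical objective f(w) = (w - 3)^2."""
    return 2.0 * (w - 3.0)


# With a moderate learning rate the iterate approaches the minimum at w = 3.
w_final = gradient_descent(grad_f, w0=0.0, eta=0.1, steps=100)
```

A larger eta takes bigger steps and can overshoot the minimum; a smaller one converges more slowly.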

Beta

Adam / Momentum

Exponential decay rates:
Beta 1 for the momentum (first moment),
Beta 2 for the scaled variance (second moment).
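The two decay rates can be seen at work in a single Adam update for one scalar parameter. This is a sketch under stated assumptions: the function name `adam_step` is hypothetical, and the default values (eta = 0.001, beta1 = 0.9, beta2 = 0.999, eps = 1e-8) are commonly used settings rather than values taken from the card.

```python
import math


def adam_step(w, g, m, v, t, eta=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a scalar parameter w with gradient g at step t >= 1."""
    m = beta1 * m + (1.0 - beta1) * g        # running mean of gradients (momentum)
    v = beta2 * v + (1.0 - beta2) * g * g    # running mean of squared gradients
    m_hat = m / (1.0 - beta1 ** t)           # bias correction for the zero init
    v_hat = v / (1.0 - beta2 ** t)
    w = w - eta * m_hat / (math.sqrt(v_hat) + eps)
    return w, m, v


# First step from zero-initialised moment estimates.
w_new, m_new, v_new = adam_step(w=1.0, g=2.0, m=0.0, v=0.0, t=1)
```

Values of beta1 and beta2 closer to 1 make the running averages change more slowly from step to step.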

Gamma

Batch Norm

Scale parameter: learns the optimal scaling after normalization.

Beta

Batch Norm

Shift parameter: learns the optimal bias (shift) after normalization.
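The scale and shift parameters of batch norm can be sketched for a single feature over one mini-batch. The function name `batch_norm`, the sample values, and eps = 1e-5 are assumptions for illustration. In training, gamma and beta would be learned; here they are fixed by hand.

```python
def batch_norm(xs, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize one feature over a mini-batch, then scale by gamma and shift by beta."""
    n = len(xs)
    mu = sum(xs) / n                              # batch mean
    var = sum((x - mu) ** 2 for x in xs) / n      # batch variance (population form)
    return [gamma * (x - mu) / (var + eps) ** 0.5 + beta for x in xs]


# After normalization the batch is centred at 0; beta moves the centre to 5,
# and gamma stretches the spread by a factor of 2.
ys = batch_norm([1.0, 2.0, 3.0], gamma=2.0, beta=5.0)
```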

Lambda

Regularization

Weight decay coefficient: strength of the L1/L2 penalty.
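As a rough illustration of how lambda weights the penalty against the data loss, the sketch below adds an L2 or L1 term to a given loss value. The function names and numbers are hypothetical. Some texts write the L2 term with an extra factor of 1/2, which is omitted here.

```python
def l2_penalized_loss(data_loss, weights, lam):
    """Data loss plus lam times the sum of squared weights (L2 penalty)."""
    return data_loss + lam * sum(w * w for w in weights)


def l1_penalized_loss(data_loss, weights, lam):
    """Data loss plus lam times the sum of absolute weights (L1 penalty)."""
    return data_loss + lam * sum(abs(w) for w in weights)


weights = [3.0, -4.0]
loss_l2 = l2_penalized_loss(1.0, weights, lam=0.01)  # penalty term: 0.01 * 25
loss_l1 = l1_penalized_loss(1.0, weights, lam=0.01)  # penalty term: 0.01 * 7
```

Setting lam to 0 recovers the unregularized loss; larger values push the weights toward zero more strongly.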

Epsilon

Stability

Smoothing term: a very small number that prevents division by zero.
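A constant mini-batch is one case where the smoothing term matters, because its standard deviation is exactly zero. The helper `normalize` and the value eps = 1e-8 are assumptions for this sketch. Where exactly eps is added (inside or outside the square root) varies between formulas.

```python
def normalize(xs, eps=1e-8):
    """Centre xs and divide by (standard deviation + eps)."""
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    return [(x - mu) / (var ** 0.5 + eps) for x in xs]


# All values are equal, so the variance is 0; eps keeps the denominator nonzero.
safe = normalize([2.0, 2.0, 2.0])
```

Calling `normalize([2.0, 2.0, 2.0], eps=0.0)` raises `ZeroDivisionError` in plain Python, which is the failure the smoothing term guards against.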

Statistics

Mean: the average (e.g. of a mini-batch).

Sigma squared

Statistics

Variance: the spread of the data.

Theta

General model

A placeholder for all learnable parameters (w and b); one often writes f(x, Theta).
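The notation f(x, Theta) can be read as a function that receives all learnable parameters bundled into one object. The sketch below assumes a simple linear model and packs w and b into a tuple; the model choice and the numbers are illustrative only.

```python
def f(x, theta):
    """Linear model: theta bundles the weight vector w and the bias b."""
    w, b = theta
    return sum(wi * xi for wi, xi in zip(w, x)) + b


theta = ([0.5, -1.0], 2.0)   # hypothetical parameters (w, b)
y = f([4.0, 1.0], theta)     # 0.5 * 4.0 + (-1.0) * 1.0 + 2.0
```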

Phi

Activation / feature map: often used as a symbol for the activation function sigma or for a nonlinear mapping.

Formulas Flashcards

(10 cards)