INDEX
Explanations
variations of specific German umlauted letters and words associated with the humanities
New Auto-Interp
Negative Logits
nak
-0.17
enic
-0.17
ets
-0.17
oo
-0.17
eda
-0.16
edReader
-0.15
ré
-0.15
able
-0.15
ed
-0.15
eral
-0.15
POSITIVE LOGITS
zung
0.26
igung
0.20
ftar
0.19
nung
0.19
fter
0.18
entlich
0.18
hte
0.18
lung
0.18
fte
0.17
ÃŁ
0.17
Activations Density 0.041%