INDEX
Explanations
specific terms and concepts related to film, education, and cultural institutions
Negative Logits
Token    Logit
yleft    -0.16
5        -0.15
3        -0.15
2        -0.15
7        -0.14
łĢ       -0.14
6        -0.14
contr    -0.14
Sez      -0.14
9        -0.14
Positive Logits
Token    Logit
-,       0.30
unter    0.27
ver      0.27
-/       0.27
gesch    0.27
-        0.26
vere     0.26
bes      0.25
geb      0.25
bere     0.25
Activations Density: 0.046%