INDEX
Explanations
references to sports events and achievements
New Auto-Interp
Negative Logits
Binder
-0.16
Manit
-0.15
Ľ
-0.15
dij
-0.15
Fat
-0.14
EDA
-0.14
scope
-0.14
beiten
-0.14
cro
-0.14
Cult
-0.13
POSITIVE LOGITS
wurde
0.30
kam
0.28
ging
0.25
gte
0.21
konnte
0.20
mus
0.20
hi
0.20
war
0.19
trat
0.19
warf
0.18
Activations Density 0.030%