Explanations
class inheritance (`extends` or `class(...)`)
Negative Logits
,        0.78
бна      0.66
tới      0.65
(        0.64
нения    0.61
чному    0.61
når      0.60
ния      0.60
ünden    0.59
،        0.59
Positive Logits
in       1.26
a        0.85
w        0.84
in       0.82
f        0.71
had      0.70
has      0.68
d        0.65
um       0.65
el       0.64
Activations Density 0.397%