INDEX
Explanations
instances of the word "actually" in various contexts
New Auto-Interp
Negative Logits
stras
-0.19
esh
-0.18
edly
-0.15
rous
-0.15
est
-0.15
ese
-0.15
еÑĪ
-0.14
ë¬
-0.14
ers
-0.14
542
-0.14
POSITIVE LOGITS
-ÑĤаки
0.15
mente
0.15
ifar
0.15
gili
0.14
undo
0.14
áli
0.14
Rodrigo
0.14
ondo
0.14
EGIN
0.13
mland
0.13
Activations Density 0.049%