INDEX
Explanations
instances of the word "even," suggesting a focus on emphasizing unexpected or contrasting situations
New Auto-Interp
Negative Logits
именно
-0.17
also
-0.17
Guth
-0.16
superf
-0.15
only
-0.14
alth
-0.14
not
-0.14
Declare
-0.14
повÑĸд
-0.14
te
-0.14
POSITIVE LOGITS
omid
0.17
bedo
0.15
necessarily
0.15
mium
0.14
hint
0.14
remot
0.14
677
0.14
iese
0.14
kowski
0.14
بÙĪØ§Ø¨Ø©
0.14
Activations Density 0.043%