Explanations
instances of the word "very" indicating emphasis or intensity
Negative Logits
odom      -0.15
orre      -0.14
curacy    -0.13
rus       -0.13
uri       -0.13
стю       -0.13
оряд      -0.13
rew       -0.13
ivet      -0.13
olland    -0.12
Positive Logits
same      0.27
essence   0.24
same      0.23
thing     0.21
SAME      0.20
opposite  0.18
reason    0.17
act       0.17
Same      0.17
SAME      0.17
Activations Density 0.017%