INDEX
Explanations
references to food and communal experiences
New Auto-Interp
Negative Logits
rello
-0.16
Khi
-0.15
vay
-0.14
.SelectedItems
-0.13
olia
-0.13
ilon
-0.13
ника
-0.13
sy
-0.13
ona
-0.13
sj
-0.13
POSITIVE LOGITS
everywhere
0.63
Everywhere
0.42
wherever
0.38
every
0.34
ubiquitous
0.34
every
0.31
ubiqu
0.31
EVERY
0.29
_every
0.27
anywhere
0.26
Activations Density 0.280%