INDEX
Explanations
references to "off the beaten path" or similar phrases indicating unconventional or alternative experiences
New Auto-Interp
Negative Logits
;br
-0.15
638
-0.15
neod
-0.15
reverse
-0.15
Bruno
-0.15
%B
-0.14
078
-0.14
kem
-0.14
Reverse
-0.14
Cres
-0.14
POSITIVE LOGITS
beaten
0.31
cuff
0.28
grid
0.26
bat
0.26
radar
0.24
mark
0.23
bat
0.21
hook
0.20
map
0.20
Grid
0.20
Activations Density 0.018%