INDEX
Explanations
references to personal leisure activities and off-time experiences
Negative Logits
gratis    -0.16
ROTO      -0.15
avit      -0.14
locker    -0.14
xec       -0.14
VRT       -0.14
ONGL      -0.14
outine    -0.14
entic     -0.14
ais       -0.14
Positive Logits
Shelf     0.14
ra        0.14
RA        0.13
otch      0.13
CZ        0.13
çIJ       0.13
<=(       0.13
akan      0.13
ighton    0.13
CASCADE   0.13
Activations Density: 0.018%