Explanations
references to recreational activities and sports
Negative Logits
Token         Logit
lox           -0.17
ceph          -0.16
/tos          -0.15
麻            -0.15
stru          -0.14
dek           -0.14
boa           -0.14
Herrera       -0.14
eterangan     -0.14
epad          -0.14
Positive Logits
Token         Logit
pist          0.28
ski           0.28
al            0.25
cha           0.24
lift          0.24
lifts         0.24
skiing        0.23
Ski           0.23
chair         0.23
Sav           0.22
Activations Density: 0.043%