INDEX
Explanations
terms related to slot machines and gambling
New Auto-Interp
Negative Logits
ury
-0.16
rtl
-0.16
esy
-0.16
hone
-0.15
ight
-0.15
ously
-0.15
urate
-0.14
Hobby
-0.14
alta
-0.14
ho
-0.14
POSITIVE LOGITS
ting
0.33
tery
0.23
swana
0.19
tement
0.18
machines
0.17
tingham
0.17
à¹ģม
0.17
td
0.16
OKIE
0.16
machine
0.16
Activations Density 0.016%