INDEX
Explanations
countries or names related to gambling
references to gambling-related terms
New Auto-Interp
Negative Logits
tery
-0.80
NL
-0.78
Centauri
-0.76
FORM
-0.72
Mare
-0.66
Mayo
-0.65
Mour
-0.63
ogether
-0.61
izabeth
-0.60
core
-0.60
POSITIVE LOGITS
charism
0.88
lapt
0.86
unden
0.85
keye
0.84
hett
0.83
itsu
0.82
olin
0.81
ilib
0.80
enthusi
0.80
destro
0.80
Activations Density 0.013%