Explanations
references to individuals with the name "Ronald" or similar variations, particularly in relation to their actions or statements
Negative Logits
AFX        -0.15
em         -0.15
rowsable   -0.15
éĹĺ        -0.15
VML        -0.14
lfw        -0.14
okud       -0.14
ingga      -0.14
mates      -0.14
ITTE       -0.14
Positive Logits
ised       0.17
ism        0.17
ously      0.17
ogy        0.17
.gdx       0.16
ysis       0.15
.FONT      0.15
ise        0.15
izers      0.15
igne       0.15
Activations Density 0.048%