INDEX
Explanations
references to specific names or locations, particularly those related to Eastern European culture
names or references to people and places, particularly those associated with the name "Savile" and related terms
New Auto-Interp
Negative Logits
threaded
-0.71
deaf
-0.70
bom
-0.70
WAYS
-0.69
hower
-0.68
slee
-0.68
Reloaded
-0.68
inclined
-0.65
filament
-0.64
proximity
-0.64
POSITIVE LOGITS
ille
1.08
inelli
1.08
iour
1.01
Sav
1.00
anas
0.96
annah
0.92
itri
0.91
chenko
0.91
illas
0.91
vy
0.91
Activations Density 0.007%