INDEX
Explanations
names of individuals, particularly last names
mentions of specific names or entities related to a legal or administrative context
New Auto-Interp
Negative Logits
ycle
-0.79
undai
-0.73
bered
-0.73
illard
-0.72
culosis
-0.72
tan
-0.71
Thieves
-0.70
tle
-0.68
arest
-0.67
stall
-0.65
POSITIVE LOGITS
Hasan
0.96
abis
0.93
IELD
0.85
EEK
0.84
Sack
0.81
onne
0.80
OUND
0.79
Hawkins
0.79
AGES
0.75
IGHTS
0.74
Activations Density 0.016%