INDEX
Explanations
Names of individuals, particularly those involved in legal or personal conflicts
Negative Logits
erna      -0.15
677       -0.15
inand     -0.15
dup       -0.14
winds     -0.14
rush      -0.14
oyo       -0.14
acc       -0.13
itness    -0.13
imi       -0.12
Positive Logits
NU        0.15
olum      0.15
OLUM      0.14
otte      0.14
atta      0.14
oreal     0.14
RIA       0.14
Ùħات      0.13
otal      0.13
.G        0.13
Activations Density: 0.334%