INDEX
Explanations
names of individuals, particularly focusing on particular prominent figures
New Auto-Interp
Negative Logits
akan
-0.15
ansi
-0.15
ysz
-0.15
MB
-0.15
egis
-0.15
Hans
-0.14
imd
-0.14
mojom
-0.14
tb
-0.13
DK
-0.13
POSITIVE LOGITS
aldi
0.16
oteca
0.16
utch
0.15
boom
0.15
HONE
0.15
acia
0.15
ston
0.15
ifest
0.14
eparator
0.14
LEMENT
0.14
Activations Density 0.000%