INDEX
Explanations
words related to legal cases and criminal events
New Auto-Interp
Negative Logits
creen
-0.77
ABE
-0.71
destro
-0.69
shroud
-0.67
Belg
-0.67
Tasman
-0.67
wagen
-0.67
Mirage
-0.67
iewicz
-0.66
Doodle
-0.64
POSITIVE LOGITS
ª
1.31
ł
1.24
IJ
1.23
ij
1.14
¹
1.08
Ĵ
1.06
ı
1.05
«
1.02
Ķ
0.99
¤
0.98
Activations Density 2.055%