INDEX
Explanations
people's names, potentially related to crimes or legal matters
New Auto-Interp
Negative Logits
Fract
-0.75
BIP
-0.74
Gemini
-0.72
âĵĺ
-0.72
MODE
-0.72
ãĥĩ
-0.69
!/
-0.67
ãĥ¼ãĤ¯
-0.67
Galileo
-0.65
Tibetan
-0.65
POSITIVE LOGITS
aughlin
1.26
endon
0.98
erm
0.86
arks
0.85
ussen
0.84
enn
0.84
uggets
0.83
ough
0.83
atch
0.82
arty
0.81
Activations Density 0.015%