INDEX
Explanations
the proper noun "Malcolm."
mentions of the name "Malcolm."
Negative Logits
Token          Logit
Tart           -0.82
ube            -0.72
liquid         -0.70
iquid          -0.69
controllers    -0.69
PSP            -0.68
stamina        -0.68
cig            -0.66
Ring           -0.66
Å              -0.65
Positive Logits
Token          Logit
Malcolm        3.63
colm           2.29
Turnbull       1.63
Desmond        1.50
Mal            1.36
Malik          1.33
Tony           1.18
Elijah         1.17
Lyndon         1.15
Gavin          1.14
Activations Density 0.015%