INDEX
Explanations
names and titles associated with authority or prominent roles
Capitalized proper nouns
names of arrested people
New Auto-Interp
Negative Logits
DrawerToggle
-0.66
ukone
-0.64
⎩
-0.59
onCancelled
-0.57
OGND
-0.53
Lana
-0.53
Tripp
-0.52
stoke
-0.52
Nicki
-0.51
utafitiHapana
-0.51
POSITIVE LOGITS
Innoc
0.67
Godwin
0.63
Moses
0.63
Innocent
0.62
Comfort
0.61
Evans
0.58
Isaac
0.58
Joseph
0.58
Isaac
0.57
Mercy
0.56
Activations Density 0.054%