Explanations
mentions of personal names, particularly with the letter 'm'
mentions of the letter "m" in various contexts
Negative Logits
Token         Logit
pitch         -0.81
tip           -0.71
foul          -0.69
EStream       -0.67
tips          -0.67
Intercept     -0.65
Prompt        -0.65
freeze        -0.62
shortened     -0.62
Mechdragon    -0.62
Positive Logits
Token         Logit
ichael        1.54
agn           1.37
otor          1.36
ixed          1.36
ovies         1.35
arijuana      1.35
obiles        1.35
useum         1.32
igration      1.31
obil          1.29
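One way to read the positive-logit tokens against the explanations is to note that each is a word fragment that forms a familiar word, or the start of one, when the letter "m" is placed in front of it. The sketch below only illustrates that reading: the token/logit pairs are copied from the listing, and the helper `complete_with_m` is a hypothetical name introduced here, not part of any tool.

```python
# Top positive-logit tokens and their values, copied from the listing.
positive_logits = [
    ("ichael", 1.54),
    ("agn", 1.37),
    ("otor", 1.36),
    ("ixed", 1.36),
    ("ovies", 1.35),
    ("arijuana", 1.35),
    ("obiles", 1.35),
    ("useum", 1.32),
    ("igration", 1.31),
    ("obil", 1.29),
]


def complete_with_m(fragment: str) -> str:
    """Prepend 'm' to a token fragment (hypothetical helper, for illustration only)."""
    return "m" + fragment


# Build the candidate completion for every promoted token.
completions = [complete_with_m(token) for token, _ in positive_logits]

# Print each fragment, its logit, and the string it forms after a leading "m".
for (token, logit), word in zip(positive_logits, completions):
    print(f"{token:>10}  {logit:5.2f}  ->  {word}")
```

Under this reading, the feature fires on a leading "m" and promotes tokens that continue the word, which fits both explanations.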
Activations Density: 0.035%