INDEX
Explanations
instances of the word "Max" with a numerical value attached to it
the repeated mention of a specific name, likely "Max," in various contexts
New Auto-Interp
Negative Logits
vernment
-0.78
ilitary
-0.77
ribing
-0.75
uez
-0.75
lay
-0.74
alam
-0.72
bah
-0.71
Tea
-0.70
ahime
-0.70
kef
-0.69
POSITIVE LOGITS
Max
3.67
Max
3.19
MAX
2.06
max
2.05
max
1.90
MAX
1.85
Maxim
1.80
Maxwell
1.50
Maximum
1.41
Chloe
1.37
Activations Density 0.017%