INDEX
Explanations
mentions of the name "Max" with varying degrees of intensity
mentions of the name "Max."
New Auto-Interp
Negative Logits
GROUND
-0.79
RECT
-0.71
wark
-0.69
alam
-0.68
CHO
-0.67
ADRA
-0.67
FORE
-0.65
keeper
-0.65
cipline
-0.65
bare
-0.64
POSITIVE LOGITS
imus
1.46
imil
1.28
imize
1.23
imal
1.02
ima
0.98
ims
0.93
imates
0.92
Payne
0.91
itar
0.91
imen
0.89
Activations Density 0.013%