INDEX
Explanations
the name "Alexander" in various contexts
mentions of the name "Alexander."
New Auto-Interp
Negative Logits
tub
-0.78
eals
-0.75
swing
-0.75
suc
-0.71
warped
-0.70
isec
-0.68
Switch
-0.66
Wii
-0.66
surf
-0.66
recy
-0.65
POSITIVE LOGITS
Alexander
3.64
Alexander
2.99
ALE
1.48
Anton
1.38
Alexandra
1.37
Maced
1.35
Alex
1.31
Alexis
1.24
Igor
1.23
Darius
1.22
Activations Density 0.013%