INDEX
Explanations
the name "Jonathan" with high activations
mentions of the name "Jonathan."
Negative Logits
Token        Logit
ials         -0.76
Ħ¢           -0.75
eatures      -0.75
shots        -0.73
atically     -0.73
spring       -0.71
ebin         -0.71
housing      -0.71
algia        -0.70
agall        -0.69
Positive Logits
Token        Logit
athan        0.88
Pearce       0.86
Daniels      0.80
Swift        0.78
Edwards      0.78
Cape         0.77
Gru          0.77
Coul         0.77
Kushner      0.75
Bernstein    0.74
Activations Density: 0.028%