INDEX
Explanations
occurrences of the name "Jerry."
New Auto-Interp
Negative Logits
ngth
-0.82
atively
-0.72
alid
-0.72
awaru
-0.71
hips
-0.71
yrim
-0.69
totality
-0.68
heed
-0.68
hip
-0.66
ebin
-0.66
POSITIVE LOGITS
Springer
1.09
Kramer
0.89
Coy
0.76
stein
0.75
Fal
0.75
ono
0.74
Angelo
0.73
Garcia
0.71
Reese
0.70
Lee
0.70
Activations Density 0.004%