INDEX
Explanations
mentions of the name "Ellison."
New Auto-Interp
Negative Logits
worthiness
-0.71
fulness
-0.69
nces
-0.67
ework
-0.66
kered
-0.65
esome
-0.64
eric
-0.64
fully
-0.64
assium
-0.63
ctica
-0.60
POSITIVE LOGITS
Ellison
0.81
icut
0.72
irez
0.72
mann
0.70
berg
0.69
eering
0.66
urations
0.66
Franken
0.65
endorsed
0.64
wald
0.64
Activations Density 0.003%