INDEX
Explanations
names of a specific individual, "Ellis"
mentions of specific individuals, particularly political figures
New Auto-Interp
Negative Logits
dial
-0.68
united
-0.68
wheel
-0.67
lapt
-0.66
LOAD
-0.66
Balt
-0.66
Mald
-0.65
mand
-0.65
cou
-0.65
mog
-0.65
POSITIVE LOGITS
Ellis
2.76
Levi
1.58
Isaac
1.28
Cullen
1.13
Mercer
1.07
Spl
1.04
Rebecca
1.03
Marin
1.02
Solomon
0.97
Arc
0.96
Activations Density 0.031%