Explanations
names of specific individuals
repeated mentions of specific names, particularly "Lyons" and "Stephens."
Negative Logits
Token        Logit
itarian      -0.80
worthiness   -0.79
apixel       -0.73
ITY          -0.71
itized       -0.69
Kids         -0.69
efeated      -0.69
aido         -0.68
iov          -0.68
tor          -0.66
Positive Logits
Token        Logit
Stephens     1.15
Lyons        0.94
slopes       0.81
terday       0.78
agher        0.75
anwhile      0.74
isson        0.72
Barker       0.72
udence       0.72
Greenwood    0.71
Activations Density: 0.031%