INDEX
Explanations
dates or decades
references to specific decades, particularly the 1950s, 1960s, and 1970s
New Auto-Interp
Negative Logits
masc
-0.61
aukee
-0.54
probing
-0.54
venge
-0.54
avorite
-0.53
DEM
-0.52
tonight
-0.51
ebus
-0.51
Rivals
-0.48
ingred
-0.48
POSITIVE LOGITS
s
1.96
sie
1.03
sburg
1.00
si
0.95
sis
0.95
sat
0.94
sand
0.91
ties
0.90
ies
0.88
sg
0.85
Activations Density 0.047%