Explanations
references to specific time periods, particularly those related to the mid-1900s
Negative Logits
chy          -0.16
Ferd         -0.16
s            -0.15
ered         -0.15
indsight     -0.14
Hours        -0.14
ead          -0.14
Sor          -0.14
buz          -0.14
bam          -0.14
Positive Logits
wife         0.25
range        0.25
wives        0.25
wner         0.24
day          0.23
summer       0.23
week         0.23
dle          0.23
western      0.23
dle          0.23
Activations Density: 0.016%