INDEX
Explanations
references to specific years, particularly focusing on the 1970s
New Auto-Interp
Negative Logits
entirety
-0.73
tro
-0.72
opher
-0.72
hed
-0.71
cho
-0.67
Edge
-0.66
wer
-0.64
unders
-0.64
trader
-0.63
pace
-0.63
POSITIVE LOGITS
ILCS
0.93
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
0.89
å¹
0.81
1968
0.80
1966
0.79
1971
0.76
ĸļ
0.76
"$:/
0.75
çļ
0.73
1945
0.73
Activations Density 0.018%