INDEX
Explanations
dates from the early 1960s
specific years, particularly from the 1960s and late 1950s
New Auto-Interp
Negative Logits
hed
-0.88
ipel
-0.71
tro
-0.70
unders
-0.69
ournals
-0.69
lying
-0.68
lex
-0.68
amn
-0.67
ittee
-0.66
entirety
-0.66
POSITIVE LOGITS
å¹
0.89
onwards
0.76
1962
0.75
ĸļ
0.73
-'
0.72
1968
0.71
1958
0.70
1966
0.70
1965
0.68
1963
0.68
Activations Density 0.028%