INDEX
Explanations
references to historical events or time periods
specific years, particularly those in the 1920s
New Auto-Interp
Negative Logits
anamo
-0.73
ramer
-0.72
ioned
-0.71
ledged
-0.71
holder
-0.70
parency
-0.70
imon
-0.70
iaries
-0.69
amin
-0.69
RAM
-0.69
POSITIVE LOGITS
Osw
0.79
1936
0.78
1938
0.73
1934
0.73
âĢķ
0.67
1926
0.67
1939
0.66
1927
0.65
é¾įå¥ij士
0.65
1946
0.64
Activations Density 0.026%