INDEX
Explanations
elements related to timelines and historical data
New Auto-Interp
Negative Logits
195
-0.21
196
-0.20
194
-0.20
WWII
-0.19
193
-0.19
iaux
-0.18
Soviet
-0.16
twentieth
-0.16
Nazi
-0.15
dale
-0.15
POSITIVE LOGITS
172
0.72
173
0.71
171
0.70
174
0.69
170
0.68
175
0.67
176
0.65
169
0.62
168
0.61
167
0.59
Activations Density 0.101%