INDEX
Explanations
references to historical events or time periods, particularly connected to significant years like 17th or 18th century
New Auto-Interp
Negative Logits
orc
-0.84
enhagen
-0.73
regenerate
-0.71
ovie
-0.66
tremend
-0.66
odynamic
-0.65
ensed
-0.65
lication
-0.65
senal
-0.64
mble
-0.64
POSITIVE LOGITS
06
0.99
76
0.96
08
0.94
03
0.93
rd
0.90
05
0.89
07
0.88
87
0.87
89
0.87
09
0.87
Activations Density 0.036%