INDEX
Explanations
historical references and notable figures in literature and philosophy
New Auto-Interp
Negative Logits
inger
-0.18
199
-0.17
iment
-0.15
Kaiser
-0.15
wend
-0.14
Trafford
-0.14
Hindered
-0.14
anou
-0.14
INGER
-0.14
misuse
-0.13
POSITIVE LOGITS
Enlightenment
0.18
SION
0.17
eenth
0.17
176
0.15
ayi
0.15
.nl
0.15
GI
0.14
175
0.14
Ukr
0.14
182
0.14
Activations Density 0.189%