INDEX
Explanations
quotes and statements from individuals in the text
New Auto-Interp
Negative Logits
wan
-0.19
ugo
-0.15
Hud
-0.15
IGO
-0.14
stor
-0.14
tah
-0.14
clusion
-0.14
olor
-0.14
ETCH
-0.14
tas
-0.14
POSITIVE LOGITS
apult
0.16
IRM
0.15
hir
0.15
éIJĺ
0.14
ÏĥÏĦε
0.14
ULSE
0.14
gil
0.14
dle
0.14
à¤ķन
0.14
.Startup
0.13
Activations Density 0.028%