INDEX
Explanations
quotes or dialogue within the text
New Auto-Interp
Negative Logits
regor
-0.17
uer
-0.16
ummy
-0.15
Kay
-0.15
acre
-0.15
hva
-0.14
way
-0.14
ather
-0.14
Kay
-0.14
uste
-0.13
POSITIVE LOGITS
edik
0.17
uls
0.16
elsey
0.15
737
0.15
Lint
0.15
vos
0.14
asa
0.14
/Internal
0.14
andid
0.14
422
0.14
Activations Density 0.045%