Explanations
long multisyllabic foreign words with special characters
punctuation and symbols associated with structuring information
Negative Logits
htaking     -0.77
urances     -0.76
ubes        -0.71
hesda       -0.71
istries     -0.69
ocent       -0.68
athing      -0.68
uties       -0.68
epad        -0.67
eches       -0.66
Positive Logits
pron        1.30
abbre       1.24
abbrevi     1.22
pronounced  1.20
acronym     1.11
shorthand   1.10
Literally   1.09
slang       1.04
literally   1.04
hereafter   1.03
Activations Density: 0.185%