INDEX
Explanations
proper nouns
the definite article "the" in various contexts
New Auto-Interp
Negative Logits
someday
-0.67
itiz
-0.66
outweigh
-0.65
strap
-0.64
pointers
-0.64
apon
-0.63
emate
-0.62
whoever
-0.62
onto
-0.62
ional
-0.62
POSITIVE LOGITS
meantime
1.31
midst
1.16
aftermath
1.12
absence
1.03
guise
1.03
context
0.99
simplest
0.99
same
0.96
nutshell
0.94
latter
0.94
Activations Density 0.161%