Explanations
Repeated mentions of the word "lot," indicating a focus on abundance or quantity
Negative Logits
slightly    -0.17
orns        -0.16
apol        -0.16
elden       -0.15
iors        -0.15
hores       -0.15
ors         -0.15
ject        -0.14
alore       -0.14
axter       -0.14
Positive Logits
tery        0.22
tering      0.19
ting        0.18
ãĤĵãģ©      0.17
zheimer     0.16
geme        0.16
terr        0.16
ITE         0.16
Scri        0.16
TA          0.15
Activations Density 0.035%
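
The density figure is usually read as the fraction of sampled tokens on which the feature has a nonzero activation. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from this page, and the tool that produced the number may define it differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means active tokens divided by total tokens.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 7 active tokens out of 20,000 sampled tokens.
sample = [0.0] * 19_993 + [1.2, 0.4, 2.1, 0.9, 0.3, 1.7, 0.6]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.035% would mean the feature fires on roughly 3 or 4 tokens in every 10,000, which marks it as a sparse feature.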