Explanations
keywords related to programming or coding
the prefix "Pre" at the beginning of words
Negative Logits
tears       -0.75
wonder      -0.72
entertain   -0.66
odd         -0.65
dynam       -0.64
darts       -0.64
infinity    -0.63
elevator    -0.63
Yon         -0.63
laughter    -0.62
Positive Logits
Pre         3.55
pre         2.16
PRE         2.03
Pre         2.00
PRE         1.68
Prep        1.57
Pref        1.52
pre         1.43
Prior       1.36
Prep        1.28
Activations Density 0.015%