INDEX
Explanations
words related to destruction or impactful events
occurrences of the word "ashes."
New Auto-Interp
Negative Logits
PF
-0.72
muse
-0.67
Metall
-0.66
requ
-0.63
regulating
-0.63
composition
-0.62
rightfully
-0.62
PG
-0.61
nery
-0.61
magnet
-0.61
POSITIVE LOGITS
ashes
4.40
ashing
2.51
ashed
2.37
ASH
2.04
ash
1.90
asher
1.38
ushes
1.16
oots
1.13
tails
1.10
andals
1.09
Activations Density 0.006%