INDEX
Explanations
mentions of the word "Saf"
references to the word "Saf" in various contexts
Negative Logits
lore          -0.74
anooga        -0.70
hatt          -0.69
overs         -0.69
lift          -0.69
ional         -0.65
boiling       -0.64
baptism       -0.64
iste          -0.64
Dragonbound   -0.64
Positive Logits
mented        1.03
eco           0.96
eties         0.95
vous          0.95
meric         0.90
mbol          0.86
PLE           0.86
ple           0.80
egu           0.79
saf           0.78
Activations Density 0.059%