INDEX
Explanations
words related to physical fractures or weaknesses
instances of the word "crack" in various contexts
New Auto-Interp
Negative Logits
ikk
-0.82
Leth
-0.78
urse
-0.65
ucky
-0.64
Pwr
-0.63
Jinn
-0.62
Fenrir
-0.61
yss
-0.60
phis
-0.58
Rath
-0.58
POSITIVE LOGITS
pots
1.15
ible
0.99
Berry
0.92
ibly
0.90
acies
0.85
lings
0.84
pot
0.83
buster
0.80
cracking
0.79
bowl
0.79
Activations Density 0.006%