INDEX
Explanations
instances of words related to physical scarring
instances of the word "scar" and its variations, indicating a focus on the concept of scarring or the effects of scars in various contexts
New Auto-Interp
Negative Logits
ĨĴ
-0.81
gaard
-0.75
ablishment
-0.74
£ı
-0.73
hower
-0.71
ullivan
-0.70
ostics
-0.68
gencies
-0.68
guyen
-0.67
ablish
-0.66
POSITIVE LOGITS
ring
1.05
red
1.02
lets
0.93
crow
0.92
fed
0.90
fing
0.88
face
0.87
abs
0.85
pered
0.85
uler
0.84
Activations Density 0.033%