INDEX
Explanations
text related to physical scars
references to scars and related terms
Negative Logits
ablishment    -0.83
eger          -0.75
gaard         -0.74
ullivan       -0.73
hower         -0.71
geist         -0.70
£ı            -0.69
GEAR          -0.66
aminer        -0.65
ostics        -0.63
Positive Logits
scar          1.16
scars         1.01
red           0.91
ring          0.91
crow          0.88
lets          0.88
Scar          0.83
fing          0.81
bed           0.79
let           0.78
Activations Density 0.005%