INDEX
Explanations
instances of the word 'razor' with varying levels of activation
references to "razor" and related terms or concepts
New Auto-Interp
Negative Logits
estation
-0.80
ablishment
-0.75
κ
-0.73
izations
-0.72
estine
-0.69
erest
-0.68
ãĤ±
-0.67
camp
-0.67
anooga
-0.67
ership
-0.64
POSITIVE LOGITS
shave
0.92
Razor
0.92
razor
0.90
blades
0.90
stal
0.86
shaving
0.86
brush
0.86
azor
0.84
blade
0.77
ulic
0.77
Activations Density 0.009%