INDEX
Explanations
words related to branding or labeling
occurrences of the word "bl" in various contexts
New Auto-Interp
Negative Logits
Gund
-0.75
reon
-0.65
Democr
-0.64
Solitaire
-0.64
HER
-0.62
Condition
-0.61
Simulator
-0.61
Gould
-0.60
pants
-0.59
depend
-0.59
POSITIVE LOGITS
anca
1.29
ossom
1.20
umenthal
1.13
anche
1.11
anco
1.09
acks
1.07
azer
1.06
izzard
1.01
itzer
1.01
oom
1.00
Activations Density 0.025%