INDEX
Explanations
words that are often found in code or technical documentation
terms related to votes or voting processes
Negative Logits
Baz         -0.68
grooming    -0.67
pretext     -0.67
Boh         -0.64
shrew       -0.64
Mu          -0.61
Minute      -0.61
Magn        -0.61
distracted  -0.60
mog         -0.60
Positive Logits
ES          4.05
ESA         1.76
ESH         1.73
es          1.73
ES          1.65
EST         1.64
ESE         1.61
ED          1.59
EC          1.53
EW          1.53
Activations Density: 0.010%