INDEX
Explanations
repeated phrases or elements indicating emphasis and intensity
New Auto-Interp
Negative Logits
SPONSORED
-1.04
elson
-0.83
enced
-0.82
yll
-0.81
edit
-0.78
uin
-0.78
thood
-0.77
nir
-0.76
erson
-0.75
isu
-0.73
POSITIVE LOGITS
slack
0.89
basics
0.88
stink
0.86
courage
0.85
stairs
0.81
seams
0.80
pace
0.79
defenses
0.78
playbook
0.76
ante
0.76
Activations Density 0.119%