INDEX
Explanations
terms related to commitment or adherence to a particular course of action
the repeated use of the word "sticking."
New Auto-Interp
Negative Logits
unes
-0.81
RT
-0.73
une
-0.71
BER
-0.67
uned
-0.64
uf
-0.64
OTO
-0.63
ept
-0.63
ogram
-0.62
eded
-0.62
POSITIVE LOGITS
sticking
1.13
plaster
0.99
stick
0.91
caut
0.84
suspic
0.83
sticks
0.82
slic
0.81
proble
0.78
burner
0.77
pole
0.76
Activations Density 0.007%