INDEX
Explanations
words related to motivation or the underlying reasons for actions
references to reasons behind actions or behaviors
Negative Logits
Token         Logit
esan          -0.84
opic          -0.76
hold          -0.75
Honour        -0.74
ocker         -0.73
oys           -0.71
iannopoulos   -0.69
Islands       -0.68
abb           -0.65
rooms         -0.65
Positive Logits
Token         Logit
ivated        1.04
motivated     0.99
TextColor     0.91
ItemTracker   0.84
motivate      0.84
motivation    0.84
motivations   0.82
ivation       0.80
motivating    0.79
compel        0.79
Activations Density: 0.014%