INDEX
Explanations
words related to willingness, ability, and inability to act or make changes
terms related to willingness and ability, particularly in the context of actions or behaviors
New Auto-Interp
Negative Logits
Surv
-0.80
adish
-0.71
Siber
-0.69
verbs
-0.69
Gleaming
-0.69
MpServer
-0.66
Wrap
-0.66
hematic
-0.65
gone
-0.65
gar
-0.65
POSITIVE LOGITS
rate
0.76
jriwal
0.72
shift
0.71
level
0.70
willingness
0.70
adherence
0.67
ibilities
0.66
BILITY
0.66
to
0.65
Ľ
0.64
Activations Density 0.092%