INDEX
Explanations
phrases related to the concept of change
instances of the word "change" and its related forms
Negative Logits
Token       Logit
ngth        -0.76
sie         -0.69
esm         -0.65
glas        -0.64
PU          -0.62
amina       -0.62
tic         -0.62
sol         -0.59
imentary    -0.58
ractor      -0.58
Positive Logits
Token        Logit
perceptions  1.14
attitudes    1.03
minds        0.97
tack         0.95
course       0.94
diapers      0.93
gears        0.91
direction    0.89
perception   0.87
hearts       0.84
Activations Density: 0.056%
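
The density figure can be read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes density means "share of positions with activation above zero", and the function name, threshold, and sample data are all hypothetical rather than taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions where the
    feature is active. The dashboard itself does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 56 of which activate the feature.
sample = [0.0] * 99_944 + [1.0] * 56
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.056% would correspond to roughly 56 active positions per 100,000 tokens, which fits a feature that fires on a narrow set of contexts such as forms of the word "change".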