INDEX
Explanations
verbs related to significant change or transformation
concepts related to significant changes or transformations in society and technology
New Auto-Interp
Negative Logits
endment
-0.69
esm
-0.68
erity
-0.66
rants
-0.66
trak
-0.64
rogens
-0.64
ulhu
-0.64
tics
-0.62
REDACTED
-0.60
contained
-0.60
POSITIVE LOGITS
perceptions
1.30
attitudes
1.13
lives
1.07
fortunes
1.03
minds
0.98
how
0.97
perception
0.97
hearts
0.94
society
0.92
conceptions
0.89
Activations Density 0.171%