INDEX
Explanations
phrases expressing personal responsibility and participation in social or economic systems
New Auto-Interp
Negative Logits
opia
-0.16
ignite
-0.16
andon
-0.15
alles
-0.15
arden
-0.14
requires
-0.14
ascus
-0.14
sür
-0.14
emes
-0.14
inya
-0.13
POSITIVE LOGITS
thereby
0.32
indirectly
0.23
effectively
0.20
essentially
0.20
hereby
0.19
hopes
0.18
hope
0.17
hoped
0.17
hoping
0.17
tac
0.17
Activations Density 0.223%