INDEX
Explanations
words related to actions, processes, or concepts
gerunds and actions that relate to social, educational, or organizational activities
Negative Logits
si          -0.81
ensis       -0.74
word        -0.73
ses         -0.68
bet         -0.68
behind      -0.67
fet         -0.66
shed        -0.66
tes         -0.65
calling     -0.64
Positive Logits
oneself        0.83
redients       0.82
Yourself       0.74
HAM            0.72
orphans        0.65
peoples        0.59
yourself       0.59
technologies   0.59
Efficiency     0.58
POWER          0.58
Activations Density: 0.373%