INDEX
Explanations
words related to goals or intended outcomes
instances of the term "objective" and related concepts
New Auto-Interp
Negative Logits
oles
-0.85
artifacts
-0.77
sing
-0.73
aps
-0.72
jen
-0.71
Tycoon
-0.70
ocket
-0.69
paces
-0.69
ingers
-0.68
Lago
-0.68
POSITIVE LOGITS
objective
1.07
Objective
1.05
observer
0.83
ives
0.81
goal
0.80
objectives
0.80
ignty
0.78
guiActiveUn
0.73
ablishment
0.73
isable
0.69
Activations Density 0.010%