INDEX
Explanations
phrases related to goals or purposes
references to goals or aims
Negative Logits
Torrent      -0.67
zz           -0.65
Pratt        -0.62
Cumber       -0.62
irl          -0.61
Parks        -0.61
ming         -0.60
ines         -0.59
strains      -0.59
Ruff         -0.59
Positive Logits
objective     3.86
objectives    2.22
Objective     2.19
goal          1.62
aim           1.50
unbiased      1.45
objectively   1.42
subjective    1.41
object        1.36
goal          1.32
Activations Density 0.018%