INDEX
Explanations
specific targets or goals mentioned in the text
instances of the word "target" in various contexts
Negative Logits
ansk          -0.75
GGGG          -0.69
Geological    -0.66
IGH           -0.66
lycer         -0.65
fo            -0.65
Created       -0.64
aucas         -0.64
ISTORY        -0.63
aug           -0.63
Positive Logits
ted           1.18
izen          0.89
targets       0.84
target        0.82
ting          0.75
topic         0.75
oided         0.72
ishes         0.71
audience      0.70
range         0.70
Activations Density 0.020%