INDEX
Explanations
terms related to targeting specific entities or groups
instances of the word "targeting"
New Auto-Interp
Negative Logits
batch
-0.81
mpeg
-0.69
birth
-0.68
sis
-0.67
gamer
-0.66
aut
-0.66
SourceFile
-0.66
shire
-0.66
vol
-0.65
FIN
-0.64
POSITIVE LOGITS
targeting
1.02
targets
0.94
eering
0.85
targeted
0.84
eers
0.84
oided
0.77
target
0.75
specificity
0.72
ataka
0.71
killings
0.71
Activations Density 0.014%