INDEX
Explanations
mentions of government positions or officials, particularly those related to defense and health
references to government officials and their titles
Negative Logits
::::::::    -0.73
ucha        -0.72
Artists     -0.68
Tree        -0.65
forming     -0.65
VALUE       -0.65
liga        -0.62
Artist      -0.62
Film        -0.62
#$#$        -0.62
Positive Logits
Shaun       0.83
Ernest      0.82
Marino      0.81
vette       0.80
Salman      0.78
Suzanne     0.78
Steven      0.77
Liz         0.77
Michele     0.77
Khalid      0.77
Activations Density: 0.079%