INDEX
Explanations
mentions of press conferences or statements made to journalists
New Auto-Interp
Negative Logits
tein
-0.54
oute
-0.53
ez
-0.50
Lives
-0.49
inho
-0.46
Feminist
-0.46
course
-0.45
Femin
-0.45
lives
-0.44
Bastard
-0.44
POSITIVE LOGITS
perty
0.65
onstage
0.55
ubs
0.55
orage
0.53
afterward
0.53
aptic
0.53
conference
0.52
aboard
0.50
plaza
0.50
odder
0.50
Activations Density 9.860%