INDEX
Explanations
names of individuals or entities
prominent names of individuals mentioned in the context of events or statements
Negative Logits
riet      -0.92
raltar    -0.73
ment      -0.65
uttered   -0.64
rica      -0.64
psc       -0.63
rave      -0.62
Olive     -0.62
BAT       -0.62
arette    -0.61
Positive Logits
sembly       0.86
atra         0.80
ners         0.78
uctor        0.76
terday       0.75
numbered     0.74
ivas         0.73
DonaldTrump  0.73
tomat        0.70
Sabha        0.70
Activations Density: 0.036%