INDEX
Explanations
names of political figures or notable personalities
statements about political figures and their actions
New Auto-Interp
Negative Logits
âĶĢâĶĢ
-0.74
Unit
-0.68
soc
-0.64
mination
-0.64
PRODUCT
-0.62
etary
-0.59
metab
-0.59
uras
-0.59
urry
-0.58
grids
-0.57
POSITIVE LOGITS
enegger
0.99
rhet
0.82
bernatorial
0.80
govtrack
0.78
himself
0.78
omics
0.74
Äĩ
0.70
assassinated
0.70
veto
0.70
hower
0.70
Activations Density 0.595%