Explanations
key political and sports events, particularly relating to campaigns, games, and significant historical moments
Negative Logits
ãĥ³ãĤ¸    -0.60
issu      -0.55
okin      -0.53
pestic    -0.53
nic       -0.52
explan    -0.52
pim       -0.51
excel     -0.50
cyt       -0.50
inen      -0.50
Positive Logits
.ãĢį      0.70
*.        0.69
.).       0.69
.[        0.67
.         0.65
clave     0.65
ãĢĤ       0.64
.*        0.61
UTERS     0.61
.]        0.60
Activations Density: 0.283%
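
Token strings such as "ãĥ³ãĤ¸", ".ãĢį" and "ãĢĤ" in the logit lists look like the printable byte-to-character form that GPT-2-style byte-level BPE vocabularies use for non-ASCII text. The sketch below assumes that is the tokenizer behind this feature (the page itself does not say so) and maps such strings back to readable UTF-8. The helper names `byte_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def byte_to_unicode():
    """Build the byte -> printable-character table used by GPT-2-style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point starting at U+0100.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in byte_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a displayed vocabulary string back into text (assumes byte-level BPE)."""
    try:
        raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    except KeyError:
        # A character outside the table means the assumption does not hold;
        # return the token unchanged rather than guess.
        return token
    # Partial UTF-8 sequences are shown with the replacement character.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ³ãĤ¸", ".ãĢį", "ãĢĤ", "clave"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under this assumption, the three non-ASCII entries decode to Japanese characters: the katakana pair "ンジ", a period followed by the closing bracket "」", and the full stop "。". ASCII tokens such as "clave" pass through unchanged.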