INDEX
Explanations
mentions of political challenges and candidates
New Auto-Interp
Negative Logits
965
-0.15
indre
-0.14
avr
-0.14
Congress
-0.13
ONUS
-0.13
furt
-0.13
áno
-0.13
ura
-0.13
||(
-0.13
.fix
-0.12
POSITIVE LOGITS
running
0.44
Running
0.36
running
0.35
ran
0.34
Running
0.34
RUNNING
0.32
-running
0.32
run
0.29
_running
0.29
runs
0.28
Activations Density 0.119%