INDEX
Explanations
mentions of Kamala Harris
New Auto-Interp
Negative Logits
ti
-0.16
-handed
-0.16
ed
-0.16
ts
-0.16
c
-0.15
ardo
-0.15
i
-0.15
tn
-0.15
il
-0.15
tl
-0.15
POSITIVE LOGITS
engin
0.18
ullo
0.16
ased
0.15
radi
0.15
htable
0.15
elijke
0.15
BYTES
0.14
addslashes
0.14
ôm
0.14
ControlEvents
0.14
Activations Density 0.004%