INDEX
Explanations
terms related to political figures and actions
occurrences of the word "ence."
Negative Logits
âĵĺ        -0.82
syn        -0.74
icable     -0.74
ding       -0.73
olicited   -0.72
STON       -0.71
acca       -0.70
ISO        -0.69
Spot       -0.68
ocular     -0.67
Positive Logits
llor       0.95
ence       0.91
lihood     0.89
lement     0.85
phal       0.85
ment       0.80
ENCE       0.78
mble       0.77
rences     0.72
pend       0.70
Activations Density 0.009%