INDEX
Explanations
mentions of individuals' names or identifiers
repetitions of the name "Biden" and its variations
New Auto-Interp
Negative Logits
pmwiki
-0.72
Canadians
-0.67
mable
-0.66
uncontrolled
-0.66
exha
-0.65
weeds
-0.63
thora
-0.62
Rita
-0.62
semif
-0.59
vanquished
-0.59
POSITIVE LOGITS
iden
1.68
unci
0.88
furt
0.82
zen
0.79
emy
0.79
ovo
0.79
acity
0.79
ovic
0.78
isen
0.78
nen
0.78
Activations Density 0.010%