INDEX
Explanations
mentions of specific names, likely related to a legal case or incident
repeated mentions of the Bundy family
Negative Logits
Token      Logit
oen        -0.76
unction    -0.69
ORED       -0.67
OC         -0.66
ored       -0.66
Lovely     -0.65
Bok        -0.64
Io         -0.64
ORE        -0.62
nep        -0.61
Positive Logits
Token      Logit
Bundy      1.73
ranch      0.91
anan       0.80
undy       0.79
querque    0.77
inez       0.77
BLM        0.76
illas      0.76
erers      0.75
ranc       0.75
Activations Density 0.014%