INDEX
Explanations
mentions of the name "Bon" with varying activations
references to a specific individual named "Bon."
New Auto-Interp
Negative Logits
dfx
-0.83
Ethics
-0.77
gamer
-0.68
æĸ¹
-0.68
Marijuana
-0.68
INAL
-0.67
ELD
-0.65
Editorial
-0.65
Regulatory
-0.64
Nicotine
-0.64
POSITIVE LOGITS
anza
1.11
iton
1.08
uses
1.04
Bon
0.99
gey
0.98
itors
0.97
illas
0.95
etooth
0.93
isson
0.92
vill
0.91
Activations Density 0.009%