INDEX
Explanations
mentions of specific individuals or personalities, particularly focusing on the term "Eh"
New Auto-Interp
Negative Logits
Anthem
-0.72
itized
-0.66
aution
-0.66
oons
-0.65
icious
-0.64
simul
-0.64
Militia
-0.63
Charl
-0.63
ciating
-0.63
indo
-0.62
POSITIVE LOGITS
azard
1.15
awk
1.08
vre
0.99
awks
0.97
uda
0.92
renheit
0.89
lers
0.87
rences
0.86
ollow
0.81
nder
0.79
Activations Density 0.025%