INDEX
Explanations
mentions of attire or clothing
occurrences of the word "bes," indicating a focus on representation or status
New Auto-Interp
Negative Logits
Marijuana
-0.68
Examiner
-0.66
District
-0.65
Kubrick
-0.65
Records
-0.64
osaurs
-0.62
Lilly
-0.62
Reds
-0.62
Oval
-0.61
Roosevelt
-0.61
POSITIVE LOGITS
challeng
1.08
bes
0.96
entimes
0.95
poke
0.87
iege
0.84
sembly
0.84
semb
0.83
icker
0.82
aved
0.81
bridge
0.80
Activations Density 0.006%