INDEX
Explanations
mentions of military and political actions involving the United States
occurrences of the letter "U" in uppercase
New Auto-Interp
Negative Logits
Noir
-0.86
Wicked
-0.79
Bach
-0.74
Aph
-0.69
bars
-0.67
Cth
-0.67
Preferred
-0.67
sinks
-0.67
Redditor
-0.65
Arcade
-0.65
POSITIVE LOGITS
prising
1.19
nexpected
1.05
NA
0.98
seless
0.97
lyss
0.93
rug
0.90
wan
0.90
uge
0.89
uan
0.88
gh
0.88
Activations Density 0.051%