Explanations
mentions of rumors or speculation
instances of the word "rumor"
Negative Logits
artney      -0.73
DOS         -0.73
ĸļ          -0.72
alone       -0.71
acted       -0.70
hematic     -0.69
nea         -0.69
itar        -0.69
instance    -0.69
mand        -0.68
Positive Logits
rumors       1.12
rumor        1.09
rumours      1.03
Rum          0.87
gossip       0.86
cov          0.83
speculate    0.82
speculation  0.81
indu         0.80
leak         0.79
Activations Density: 0.048%