Explanations
mentions or instances of spreading or sharing information
references to the concept of spreading, especially in relation to communication or distribution
Negative Logits
Token         Logit
*/(           -0.82
deen          -0.78
Zup           -0.73
--+           -0.70
herty         -0.70
omez          -0.67
aults         -0.67
puted         -0.65
clamation     -0.64
udeau         -0.63
Positive Logits
Token             Logit
sheets            1.87
sheet             1.38
shirt             1.03
spreads           0.92
awareness         0.85
misinformation    0.84
disinformation    0.83
across            0.83
eagle             0.82
spread            0.81
Activations Density: 0.069%