INDEX
Explanations
references to the mainstream media
references to mainstream media and its influence
Negative Logits
cific     -0.81
atoon     -0.80
otos      -0.78
uana      -0.76
ursed     -0.76
arcity    -0.74
thur      -0.69
tein      -0.65
alid      -0.65
vag       -0.63
Positive Logits
mainstream       0.86
outlets          0.81
media            0.76
arily            0.76
ization          0.74
iary             0.72
ablishment       0.71
establishment    0.70
outlet           0.68
circulation      0.68
Activations Density: 0.026%