INDEX
Explanations
references to the "Free Press."
repeated mentions of the term "Free," indicating a focus on content related to the Free Press
New Auto-Interp
Negative Logits
therap
-0.88
ENTS
-0.76
iop
-0.72
è¦ļéĨĴ
-0.72
deceive
-0.71
uality
-0.71
ional
-0.71
iot
-0.68
ENT
-0.67
positively
-0.66
POSITIVE LOGITS
zing
1.05
bies
1.01
zes
1.01
bie
0.92
zers
0.89
zer
0.85
edom
0.83
BSD
0.79
boot
0.78
gans
0.77
Activations Density 0.025%