INDEX
Explanations
mentions of the concept of freedom, especially freedom of expression
terms related to freedom, particularly freedom of expression and speech
New Auto-Interp
Negative Logits
eor
-0.76
ded
-0.67
mers
-0.65
nos
-0.64
DERR
-0.63
itated
-0.63
ented
-0.62
acan
-0.62
ents
-0.62
Dynasty
-0.62
POSITIVE LOGITS
Fighters
0.91
bies
0.88
fighters
0.82
roam
0.79
fighter
0.78
freedoms
0.78
fighters
0.78
Reviewer
0.77
guaranteed
0.74
boot
0.74
Activations Density 0.034%