INDEX
Explanations
terms related to freedom or unrestricted access
mentions of freedom and availability without restrictions
New Auto-Interp
Negative Logits
ulhu
-0.83
ynasty
-0.72
anan
-0.70
usted
-0.69
Derby
-0.68
awk
-0.68
Achievement
-0.67
arij
-0.66
elve
-0.64
need
-0.64
POSITIVE LOGITS
freely
1.10
unrestricted
0.93
decomp
0.89
roam
0.86
flowing
0.84
bies
0.79
uncontrolled
0.79
amnesty
0.76
distribut
0.76
ãĥ¼ãĤ¯
0.75
Activations Density 0.005%