Explanations
phrases related to fears and warnings
phrases that express existential or philosophical concepts
Negative Logits
Token        Logit
TPS          -0.79
NM           -0.79
CF           -0.78
Cosponsors   -0.74
Proced       -0.71
CB           -0.70
related      -0.69
reporting    -0.68
seys         -0.67
LOG          -0.67
Positive Logits
Token        Logit
thy          1.11
..."         1.09
…"           1.09
fools        1.07
thou         1.07
nig          1.05
evil         1.02
shalt        0.98
mankind      0.98
tyranny      0.96
Activations Density 0.462%
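
The density figure above is not defined on this page. One common reading is the share of sampled tokens on which the feature fires at all. Below is a minimal sketch under that assumption; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of tokens where the feature
    is active; the dashboard itself does not state its definition.
    """
    values = list(activations)
    if not values:
        return 0.0
    fired = sum(1 for a in values if a > threshold)
    return fired / len(values)


# Hypothetical sample: 1000 tokens, 5 of which activate the feature.
sample = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
print(f"{activation_density(sample):.3%}")  # prints 0.500%
```

Under this reading, a density of 0.462% would mean the feature is active on roughly 1 token in every 216 sampled.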