INDEX
Explanations
words related to bureaucratic processes or obstacles
occurrences of the word "red."
Negative Logits
UGH          -0.83
ernel        -0.81
ILA          -0.79
Reloaded     -0.77
Lank         -0.77
XT           -0.76
agall        -0.74
Ö¼           -0.73
OTOS         -0.71
=-=-         -0.69
Positive Logits
neck         1.15
oubt         1.11
rawn         1.10
efined       1.09
velvet       1.05
oub          1.02
headed       1.00
iscovered    0.99
iscovery     0.98
iscover      0.97
Activations Density 0.024%