INDEX
Explanations
instances of direct quotes or dialogue in the text
New Auto-Interp
Negative Logits
usz
-0.15
ationale
-0.13
Brigade
-0.13
actionTypes
-0.13
Poh
-0.12
ena
-0.12
ãĥ¼ãĥģ
-0.12
achu
-0.12
FAIL
-0.12
stitial
-0.12
POSITIVE LOGITS
_LICENSE
0.15
nem
0.14
.Butter
0.14
’ta
0.14
acomp
0.13
socio
0.13
.GPIO
0.13
recently
0.13
trad
0.13
Sutton
0.13
Activations Density 0.010%