INDEX
Explanations
phrases indicating the absence of barriers or requirements
New Auto-Interp
Negative Logits
ãĤ¤ãĥī
-0.15
ibold
-0.14
isle
-0.14
imenti
-0.13
opo
-0.13
пол
-0.13
kâ
-0.13
OLT
-0.13
zc
-0.13
ault
-0.13
POSITIVE LOGITS
require
0.17
abox
0.17
need
0.16
required
0.16
requ
0.15
unlike
0.15
ä»»ä½ķ
0.15
any
0.15
Require
0.15
requirement
0.15
Activations Density 0.092%