INDEX
Explanations
concepts and expressions related to acceptance and self-acceptance
Negative Logits
boss        -0.07
flix        -0.07
ve          -0.07
oron        -0.06
escape      -0.06
escaping    -0.06
ÑĶ          -0.06
ef          -0.06
etwork      -0.06
uplat       -0.06
Positive Logits
reality     0.09
fact        0.08
Reality     0.08
realities   0.08
rằng        0.07
_fact       0.07
inevitable  0.07
facts       0.07
Fact        0.07
Fact        0.07
Activations Density 0.010%