INDEX
Explanations
concepts related to mindfulness and self-awareness
New Auto-Interp
Negative Logits
Permanent
-0.18
perman
-0.16
permanent
-0.15
Permanent
-0.14
Platt
-0.14
wide
-0.14
bove
-0.14
ausible
-0.14
elter
-0.13
ANDING
-0.13
POSITIVE LOGITS
815
0.17
rame
0.15
yne
0.14
impartial
0.14
-caret
0.14
à¸Ĥ
0.14
apart
0.14
åł
0.13
icut
0.13
sacred
0.13
Activations Density 0.013%