INDEX
Explanations
phrases and terms related to awareness and self-awareness in various contexts
New Auto-Interp
Negative Logits
sta
-0.17
otype
-0.15
justice
-0.14
reo
-0.14
REA
-0.14
Äįek
-0.14
иÑģÑĤÑĢа
-0.14
ÅĻ
-0.14
ernals
-0.14
bons
-0.13
POSITIVE LOGITS
ness
0.26
fulness
0.23
-aware
0.21
/alert
0.20
-ra
0.19
aware
0.18
Aware
0.18
Aware
0.18
aware
0.18
onso
0.18
Activations Density 0.030%