INDEX
Explanations
concepts related to reality checks and self-awareness in various contexts
New Auto-Interp
Negative Logits
ใจ
-0.45
ģ
-0.45
dington
-0.43
referenties
-0.42
Jeografia
-0.42
pitch
-0.41
telli
-0.41
Fein
-0.41
expandindo
-0.41
banderas
-0.41
POSITIVE LOGITS
reality
1.45
Reality
1.30
Reality
1.24
reality
1.23
realism
1.18
realities
1.13
realista
1.07
realist
1.06
realistic
1.06
grounded
0.99
Activations Density 0.163%