INDEX
Explanations
terms related to consciousness and perception
New Auto-Interp
Negative Logits
Orth
-0.17
zen
-0.15
eldom
-0.15
orna
-0.15
Engine
-0.15
cust
-0.15
Orth
-0.15
ÄĽk
-0.14
pedo
-0.14
ÄĽÅ¾
-0.14
POSITIVE LOGITS
Mein
0.20
phenomenal
0.18
simples
0.18
objects
0.18
.objects
0.18
accidents
0.18
Ding
0.18
Erf
0.18
Relations
0.17
sensible
0.17
Activations Density 0.036%