Explanations
expressions of personal experiences and reflections
Negative Logits
observation     -0.16
observations    -0.16
observer        -0.16
observe         -0.15
ç·ł             -0.14
utilus          -0.14
Observ          -0.14
okoj            -0.14
observing       -0.14
apter           -0.13
Positive Logits
owns            0.17
stan            0.16
ç§Ł             0.16
owning          0.16
zon             0.15
_sink           0.15
हल              0.15
owned           0.15
buying          0.15
boyc            0.14
Activations Density 0.215%