INDEX
Explanations
concepts related to the mind and mental states
New Auto-Interp
Negative Logits
utscher
-0.81
Scherer
-0.81
Schröder
-0.78
Douglass
-0.77
Walsh
-0.76
=>"
-0.74
hysema
-0.72
zieher
-0.70
führer
-0.69
allenges
-0.69
POSITIVE LOGITS
MIND
1.83
Mind
1.71
MIND
1.62
mind
1.62
mind
1.58
Mind
1.52
minds
1.43
Mindy
1.25
Minds
1.25
minds
1.11
Activations Density 0.039%