INDEX
Explanations
phrases related to mental activities or states
references to the concept of "mind" and related mental states
Negative Logits
ModLoader    -0.72
Merit        -0.69
Engineers    -0.62
Licensed     -0.62
Household    -0.61
unequal      -0.60
NYPD         -0.60
Sharp        -0.60
Antar        -0.59
Sponsor      -0.59
Positive Logits
storms       0.98
fulness      0.98
share        0.95
ets          0.93
bender       0.91
swer         0.85
wand         0.79
grass        0.79
ings         0.78
ulsion       0.77
Activations Density 0.055%
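
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes that "activation density" means the fraction of token positions on which the feature's activation is nonzero. That definition, the function name `activation_density`, the `threshold` parameter, and the toy data are all assumptions made for illustration; they do not come from the page itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 11 active positions out of 20,000 tokens.
toy = [0.0] * 19_989 + [1.3] * 11
density = activation_density(toy)
print(f"{density:.3%}")  # formats the fraction as a percentage
```

Under this reading, a density of 0.055% would correspond to the feature firing on roughly 1 token in 1,800.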