INDEX
Explanations
references to "mind" and related concepts involving thought processes or mental states
Negative Logits
миÑĢ      -0.16
instein   -0.15
baugh     -0.15
stoff     -0.15
banks     -0.14
Micha     -0.14
typed     -0.14
åĵ        -0.14
вне       -0.14
anou      -0.13
POSITIVE LOGITS
ãģĭãģ®        0.15
perty         0.15
еÑĢÑĸв         0.15
rekl          0.14
.eye          0.14
perfect       0.14
pies          0.14
circulation   0.13
/plugins      0.13
langs         0.13
Activations Density 0.014%