INDEX
Explanations
words related to the brain and medical conditions
New Auto-Interp
Negative Logits
Hitman
-0.69
FT
-0.69
lihood
-0.69
Else
-0.63
Kerry
-0.62
artisan
-0.60
anded
-0.58
makers
-0.58
Lennon
-0.58
itsch
-0.57
POSITIVE LOGITS
ricular
1.51
urous
1.25
ilated
1.24
uring
1.20
ral
1.19
rue
1.18
ilation
1.12
uri
1.02
orial
1.00
ories
1.00
Activations Density 0.019%