INDEX
Explanations
words related to medical conditions and treatments concerning the brain
references to the brain and its conditions or attributes
Negative Logits
Bundy      -0.70
impunity   -0.67
Dialog     -0.66
adoes      -0.66
Arabia     -0.64
FANTASY    -0.64
Bowie      -0.64
inen       -0.64
Yanuk      -0.63
Faust      -0.61
Positive Logits
stem       1.35
washed     1.27
washing    1.18
wash       1.12
iac        1.00
waves      0.95
caps       0.94
fuck       0.91
storms     0.85
cap        0.84
Activations Density: 0.024%