INDEX
Explanations
references to cerebral conditions and significant people associated with them
New Auto-Interp
Negative Logits
arted
-0.80
alling
-0.77
aries
-0.74
riks
-0.73
nir
-0.71
aret
-0.70
olit
-0.69
unes
-0.67
ministic
-0.67
urated
-0.67
POSITIVE LOGITS
Shields
0.86
zinski
0.78
swer
0.76
Springer
0.68
pals
0.67
issance
0.65
itz
0.63
idges
0.63
Diet
0.63
auer
0.62
Activations Density 0.004%