INDEX
Explanations
mentions of schizophrenia and its related contexts
New Auto-Interp
Negative Logits
/Instruction
-0.15
alles
-0.14
hiba
-0.14
urge
-0.14
leur
-0.14
Trident
-0.14
irling
-0.14
xuyên
-0.14
Ïĥο
-0.14
viz
-0.13
POSITIVE LOGITS
treatments
0.30
cure
0.30
treatment
0.26
breakthrough
0.24
Cure
0.23
research
0.23
discoveries
0.22
Treatment
0.21
research
0.20
researchers
0.20
Activations Density 0.196%