INDEX
Explanations
references to emotional and mental health, particularly in the context of community and recovery
New Auto-Interp
Negative Logits
ivet
-0.18
irl
-0.16
itta
-0.15
ocab
-0.15
sole
-0.14
vens
-0.14
istr
-0.14
ocal
-0.14
obic
-0.14
ukkit
-0.14
POSITIVE LOGITS
iaz
0.15
Soph
0.15
uraa
0.15
uy
0.14
dee
0.14
egl
0.14
heimer
0.14
íľ´
0.14
indirect
0.14
goto
0.14
Activations Density 0.394%