INDEX
Explanations
keywords and phrases associated with health risks, particularly regarding eating disorders and their consequences
New Auto-Interp
Negative Logits
posedge
-0.54
vợ
-0.48
новниш
-0.48
pending
-0.47
zionali
-0.45
кӀ
-0.43
ttä
-0.43
cza
-0.43
Noice
-0.43
áklad
-0.42
POSITIVE LOGITS
orexia
0.86
suicidal
0.84
suicide
0.81
SequentialGroup
0.76
suicides
0.72
anorexia
0.69
dangerously
0.68
endangering
0.68
obsession
0.67
bleach
0.67
Activations Density 0.278%