INDEX
Explanations
connections between health issues and wider societal impacts
New Auto-Interp
Negative Logits
illez
-0.17
aby
-0.16
LineColor
-0.14
æľĭ
-0.14
á»§
-0.14
ÅĻen
-0.14
WND
-0.14
esktop
-0.14
ê¶ģ
-0.13
icie
-0.13
POSITIVE LOGITS
obot
0.15
ách
0.15
igg
0.14
19
0.14
apart
0.13
functioning
0.13
483
0.13
328
0.13
326
0.13
Burl
0.12
Activations Density 0.232%