INDEX
Explanations
actions and emotions related to health and well-being
New Auto-Interp
Negative Logits
verture
-0.15
agma
-0.15
nob
-0.15
iets
-0.15
urer
-0.14
imore
-0.14
eteria
-0.14
esture
-0.14
Migration
-0.14
å¬
-0.14
POSITIVE LOGITS
lo
0.18
anik
0.17
ή
0.15
.Dom
0.14
arts
0.14
etik
0.14
_SENS
0.14
rl
0.14
Ø®ÙĦ
0.14
eral
0.13
Activations Density 0.301%