INDEX
Explanations
evidence of awareness and reflection on personal actions and conditions in various contexts
New Auto-Interp
Negative Logits
addCriterion
-0.18
.opens
-0.16
ugu
-0.16
:numel
-0.15
.labelX
-0.15
δÏĮν
-0.15
Coch
-0.14
ameda
-0.14
YC
-0.14
mlin
-0.13
POSITIVE LOGITS
serious
0.16
desperation
0.15
desperate
0.15
maturity
0.15
indeed
0.15
åį³
0.14
burg
0.14
ipers
0.14
already
0.14
atti
0.14
Activations Density 0.217%