INDEX
Explanations
themes related to emotional distress and physical challenges
Negative Logits
ubo       -0.16
alice     -0.14
>{!!      -0.14
nero      -0.14
quared    -0.14
ercial    -0.14
.isFile   -0.14
SG        -0.13
uru       -0.13
nex       -0.13
Positive Logits
or        0.21
ビー      0.19
或者      0.18
或        0.18
или       0.18
或        0.17
або       0.16
hoặc      0.16
nebo      0.15
ή         0.15
Activations Density 0.110%
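
Token strings in logit lists like the ones above are sometimes displayed in the tokenizer's byte-level form, where each byte of a token's UTF-8 encoding is shown as a stand-in character (for example `æĪĸèĢħ`). The source does not name the tokenizer, so the following is only a minimal sketch, assuming a GPT-2-style byte-to-character table; the helper name `decode_token` is hypothetical and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build a GPT-2-style table mapping each byte value (0-255) to a printable character."""
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # All remaining bytes are shifted to code points starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: stand-in character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(raw: str) -> str:
    """Decode a byte-level token string; return it unchanged if it is not in byte-level form."""
    # Assumption: a string containing characters outside the table is already readable text.
    if any(ch not in CHAR_TO_BYTE for ch in raw):
        return raw
    data = bytes(CHAR_TO_BYTE[ch] for ch in raw)
    # Tokens may be partial UTF-8 sequences, so replace undecodable bytes instead of failing.
    return data.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for raw in ["or", "æĪĸèĢħ", "æĪĸ", "или"]:
        print(repr(raw), "->", repr(decode_token(raw)))
```

Strings that are already readable, such as `или` or `hoặc`, contain characters outside the byte table and pass through untouched, so the helper can be applied to a whole list without first checking which entries are byte-encoded.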