INDEX
Explanations
content related to questioning or examining social and personal dynamics
New Auto-Interp
Negative Logits
437
-0.15
/copyleft
-0.14
[](
-0.14
mue
-0.14
iglia
-0.13
ibling
-0.13
ÂŃi
-0.13
adeon
-0.13
amax
-0.13
%(
-0.12
POSITIVE LOGITS
ÌĨ
0.19
Äĩi
0.16
hd
0.15
sic
0.15
ients
0.14
been
0.14
dden
0.14
Ì
0.14
dd
0.13
md
0.13
Activations Density 0.652%