INDEX
Explanations
emotional engagement with characters
New Auto-Interp
Negative Logits
ulas
-0.17
aca
-0.15
çĵ
-0.14
ushman
-0.14
ãĥ©ãĥĥãĤ¯
-0.14
helm
-0.14
WORDS
-0.14
æĴ
-0.14
quat
-0.14
MeasureSpec
-0.13
POSITIVE LOGITS
rooting
0.36
root
0.32
Root
0.30
Root
0.27
root
0.26
(root
0.25
/root
0.25
investment
0.25
ROOT
0.24
invested
0.23
Activations Density 0.136%