INDEX
Explanations
themes of struggle and resilience in difficult circumstances
New Auto-Interp
Negative Logits
---
-0.27
--
-0.26
-
-0.25
“â̦
-0.23
---
-0.23
-----
-0.22
‘
-0.21
‘s
-0.21
(~
-0.20
--
-0.20
POSITIVE LOGITS
_
0.66
_(
0.46
↵
0.44
_.
0.42
_$
0.41
-_
0.40
**
0.39
↵
0.36
_↵↵
0.35
↵↵
0.34
Activations Density 3.253%