Explanations
expressions of internal struggle and the need for personal strength and motivation
Negative Logits
nip          -0.17
Nichols      -0.16
907          -0.15
_Generic     -0.14
Alic         -0.14
elong        -0.14
Mercer       -0.14
257          -0.14
905          -0.14
getManager   -0.14
Positive Logits
courage      0.47
strength     0.42
Courage      0.36
nerve        0.34
strength     0.33
勇           0.33
guts         0.32
Strength     0.31
ourage       0.30
Strength     0.29
Activations Density: 0.146%