INDEX
Explanations
instances of the word "broken" and its variations, indicating a focus on themes of damage or distress
New Auto-Interp
Negative Logits
اÙĦاØŃ
-0.16
æ¹¾
-0.16
ropa
-0.16
नल
-0.15
.synthetic
-0.14
IFI
-0.14
ellen
-0.14
azzo
-0.14
rex
-0.14
shint
-0.14
POSITIVE LOGITS
-hearted
0.28
broken
0.28
broken
0.28
heart
0.28
Broken
0.24
Broken
0.23
promises
0.21
pieces
0.21
-down
0.20
promise
0.20
Activations Density 0.029%