INDEX
Explanations
verbs related to storytelling and narration
references to legal proceedings or criminal activities
New Auto-Interp
Negative Logits
!".
-0.74
'.
-0.73
".[
-0.73
!.
-0.72
."[
-0.70
".
-0.67
.<
-0.67
.�
-0.67
.""
-0.67
.(
-0.66
POSITIVE LOGITS
Ĭ±
0.55
©¶æ
0.53
)]
0.52
initial
0.50
amina
0.48
essor
0.47
workload
0.47
lid
0.46
otomy
0.46
ensibly
0.45
Activations Density 1.737%