INDEX
Explanations
elements related to abrupt changes or impactful moments in narratives
Negative Logits
culo      -0.16
chk       -0.16
yal       -0.16
istine    -0.16
astes     -0.15
788       -0.15
аÑĪ       -0.15
Äĵ        -0.15
erial     -0.15
Watt      -0.14
Positive Logits
EIF        0.16
lint       0.16
ugins      0.15
izr        0.15
ÂŃi        0.15
oeff       0.14
reesome    0.14
TickCount  0.14
_HINT      0.14
oader      0.14
Activations Density 0.024%