INDEX
Explanations
expressions of inevitability or eventual outcomes in narratives
Negative Logits
Token         Logit
ade           -0.15
วรร           -0.15
ons           -0.15
↵↵            -0.14
amble         -0.14
/w            -0.14
aug           -0.14
.localized    -0.14
sudden        -0.13
lần           -0.13
Positive Logits
Token         Logit
mente         0.25
ities         0.22
/current      0.21
s             0.20
y             0.18
succ          0.17
lest          0.17
idades        0.17
arily         0.16
ments         0.16
Activations Density: 0.029%