INDEX
Explanations
references to persistence and continuity in narratives
New Auto-Interp
Negative Logits
overall
-0.15
athe
-0.15
ound
-0.15
upon
-0.15
prox
-0.14
akan
-0.14
Upon
-0.14
Anc
-0.14
.Compute
-0.14
cleared
-0.13
POSITIVE LOGITS
escorte
0.16
áºŃn
0.15
claimer
0.14
ालय
0.14
ató
0.14
ldb
0.13
ÙĨÛĮÙĨ
0.13
089
0.13
ynchronous
0.13
adoo
0.13
Activations Density 0.134%