INDEX
Explanations
specific mentions of actions or key events relevant to discussions or narratives
Negative Logits
,       -0.89
and     -0.75
gi      -0.66
-       -0.65
.       -0.64
_       -0.63
y       -0.59
pu      -0.57
when    -0.57
u       -0.56
Positive Logits
itſelf           1.32
ſelves           1.31
ſelf             1.27
Houſe            1.24
tvguidetime      1.24
Monfieur         1.22
myſelf           1.22
Jefus            1.21
Personendaten    1.20
Anſ              1.17
Activations Density 0.682%