INDEX
Explanations
instances of the word "before" in various contexts
New Auto-Interp
Negative Logits
ſelf
-0.83
KommentareTeilen
-0.81
Jefus
-0.78
Efq
-0.74
ษัท
-0.73
bolista
-0.72
følge
-0.71
habet
-0.68
Cæsar
-0.68
genstein
-0.68
POSITIVE LOGITS
before
1.48
before
1.45
BEFORE
1.41
Before
1.40
BEFORE
1.35
Before
1.31
innan
1.09
sebelum
1.06
befo
1.03
før
0.94
Activations Density 0.100%