Explanations
the keyword "after" and variations in the context of sequences or events
Negative Logits
Token       Logit
Theſe       -1.41
myſelf      -1.35
ſtate       -1.30
purpoſe     -1.28
leaſt       -1.26
Jefus       -1.25
pleaſure    -1.24
houſe       -1.23
Efq         -1.22
Houſe       -1.19
Positive Logits
Token       Logit
after       3.03
after       2.55
After       2.39
After       2.31
dopo        2.27
après       2.25
após        2.10
AFTER       2.04
setelah     1.94
после       1.91
Activations Density: 0.235%
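
As a rough illustration of what a density figure like this usually means, the sketch below assumes it is the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample values are assumptions made for illustration; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (positions where the feature fires) / (all positions).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 2000 token positions, the feature fires on 5 of them.
sample = [0.0] * 1995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.235% would correspond to the feature firing on roughly 2 or 3 of every 1,000 sampled tokens.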