INDEX
Explanations
interactions involving the pronoun "it" in various contexts
New Auto-Interp
Negative Logits
cauſe
-1.16
whoſe
-1.12
Eſ
-1.08
pleaſure
-1.08
Efq
-1.05
Reſ
-1.05
faſt
-1.05
Theſe
-1.03
raiſ
-1.02
deſt
-1.02
POSITIVE LOGITS
it
0.92
him
0.88
them
0.79
виправивши
0.73
in
0.70
Посилання
0.69
doit
0.66
ujarnya
0.65
It
0.65
as
0.65
Activations Density 0.174%