INDEX
Explanations
instances of the word "using" in various forms
using specific terms
New Auto-Interp
Negative Logits
ſtate
-0.68
houſe
-0.66
ſche
-0.65
ſtre
-0.65
pleaſure
-0.64
purpoſe
-0.58
ſch
-0.57
perſon
-0.56
ſie
-0.55
neceff
-0.55
POSITIVE LOGITS
using
1.33
Using
1.28
using
1.25
USING
1.24
Using
1.24
USING
1.16
usando
1.02
utilising
0.98
utilizando
0.98
utilizing
0.93
Activations Density 0.036%