Explanations
instances of honey and jars, particularly in a context that suggests measurement or quantification
Negative Logits
itſelf     -0.86
Monfieur   -0.86
Jefus      -0.85
houſe      -0.79
purpoſe    -0.76
ſtand      -0.76
ſta        -0.76
iſt        -0.76
raiſ       -0.75
ſeveral    -0.75
Positive Logits
lenker     0.58
gi         0.48
sch        0.44
altor      0.44
extra      0.43
thus       0.43
b          0.43
Schroeder  0.42
bu         0.41
g          0.41
Activations Density 0.027%