Explanations
the word "as" in various contexts
Negative Logits
itſelf    -0.69
こと      -0.69
Jefus     -0.68
smtplib   -0.66
ſever     -0.65
juſt      -0.65
pleaſure  -0.65
ſche      -0.64
againſt   -0.60
raiſ      -0.59
Positive Logits
well      1.07
follows   1.02
part      1.00
opposed   0.94
far       0.91
pires     0.88
follows   0.87
soon      0.83
usual     0.76
perity    0.70
Activations Density: 0.357%