INDEX
Explanations
instances of the word "better" in various contexts
New Auto-Interp
Negative Logits
InitVars
-0.96
Cæsar
-0.94
betweenstory
-0.91
Majefty
-0.91
ſtate
-0.90
occaf
-0.86
fubject
-0.81
meriva
-0.81
fevere
-0.81
raiſ
-0.79
POSITIVE LOGITS
better
1.55
Better
1.51
Better
1.42
better
1.37
BETTER
1.25
besser
1.05
nor
0.85
beter
0.81
mejor
0.79
bättre
0.78
Activations Density 0.107%