INDEX
Explanations
references to steam in various contexts
New Auto-Interp
Negative Logits
ſta
-1.38
myſelf
-1.34
pleaſure
-1.34
itſelf
-1.29
chofe
-1.28
ſtate
-1.28
raiſ
-1.26
ſever
-1.23
ſeveral
-1.23
faſt
-1.22
POSITIVE LOGITS
ot
0.68
"
0.63
0.60
ENT
0.60
(
0.58
'
0.58
A
0.57
«
0.56
of
0.54
تم
0.54
Activations Density 0.148%