INDEX
Explanations
variations and forms of the word "free" in different contexts
New Auto-Interp
Negative Logits
sell
-0.19
ernal
-0.19
son
-0.19
sm
-0.17
si
-0.17
Bloss
-0.17
ska
-0.16
rocket
-0.16
Suff
-0.16
sen
-0.16
POSITIVE LOGITS
estyle
0.26
est
0.25
ighting
0.25
ighth
0.24
ights
0.24
ck
0.23
esty
0.21
eways
0.21
nds
0.20
eware
0.19
Activations Density 0.004%