INDEX
Explanations
numerical values associated with precise quantities
occurrences of the word "just" in various contexts
New Auto-Interp
Negative Logits
hement
-0.70
Fact
-0.58
amen
-0.58
agically
-0.58
eli
-0.57
esta
-0.57
topic
-0.57
anytime
-0.56
illin
-0.55
antis
-0.55
POSITIVE LOGITS
ifiable
1.04
shy
0.98
ijn
0.74
ifications
0.74
below
0.71
0
0.67
¾
0.67
39
0.66
marginally
0.66
326
0.66
Activations Density 0.073%