INDEX
Explanations
mentions of the word "feeling" in various contexts
New Auto-Interp
Negative Logits
eson
-0.17
pio
-0.16
raith
-0.16
iesel
-0.15
reon
-0.15
fahren
-0.15
fried
-0.15
fp
-0.15
/hash
-0.14
ieg
-0.14
POSITIVE LOGITS
Fe
0.31
fe
0.30
Fe
0.27
(fe
0.25
-fe
0.24
bruary
0.23
.fe
0.22
igned
0.22
FE
0.20
fe
0.19
Activations Density 0.016%