INDEX
Explanations
mentions of the word 'heart' and its variations
New Auto-Interp
Negative Logits
egas
-0.18
olson
-0.16
.extern
-0.16
ritch
-0.16
erate
-0.16
illard
-0.15
intage
-0.15
ña
-0.15
atrix
-0.15
.getTarget
-0.15
POSITIVE LOGITS
less
0.20
rending
0.19
red
0.18
ening
0.16
ened
0.16
ÙĨج
0.16
rend
0.15
Moy
0.15
lessly
0.15
/body
0.15
Activations Density 0.048%