INDEX
Explanations
mentions of the word "Love" with varying degrees of activation
the word "Love" and related uses in various contexts
New Auto-Interp
Negative Logits
ocument
-0.84
todd
-0.82
ulhu
-0.79
acco
-0.79
ħĭ
-0.77
reluct
-0.75
aution
-0.74
monary
-0.73
emonium
-0.71
NRS
-0.69
POSITIVE LOGITS
lihood
1.25
joy
1.10
birds
1.03
bird
0.95
Actually
0.89
good
0.86
watching
0.83
tsky
0.83
fully
0.83
hound
0.82
Activations Density 0.028%