INDEX
Explanations
mentions of strong emotions of interest or enthusiasm
expressions of strong enthusiasm or dedication towards activities or interests
Negative Logits
pta       -0.77
å¸        -0.76
annis     -0.76
eor       -0.74
avis      -0.73
ramid     -0.66
WATCHED   -0.65
reens     -0.64
ãĥĥãĥī    -0.64
helicop   -0.63
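Tokens such as "å¸" and "ãĥĥãĥī" look like encoding damage, but they may simply be byte-level BPE tokens shown in their printable form. The sketch below assumes the model uses a GPT-2-style byte-to-unicode table, which the page itself does not state; the helper names (bytes_to_unicode, token_to_bytes, decode_token) are chosen here for illustration and are not part of the dashboard.

```python
def bytes_to_unicode():
    """Rebuild the byte -> printable-character table used by GPT-2-style
    byte-level BPE (assumed here, not confirmed by the source page)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable character -> original byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Map a displayed token string back to its raw bytes."""
    return bytes(BYTE_DECODER[ch] for ch in token)


def decode_token(token):
    """Decode a displayed token as UTF-8; incomplete sequences are replaced."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥĥãĥī", "å¸", "helicop"]:
        print(repr(tok), token_to_bytes(tok), repr(decode_token(tok)))
```

Under that assumption, "ãĥĥãĥī" corresponds to the bytes E3 83 83 E3 83 89, which is the katakana string "ッド", while "å¸" is the two-byte prefix E5 B8 of a longer UTF-8 sequence and so has no standalone character.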
Positive Logits
passion        1.05
atical         1.03
passionately   0.91
iously         0.86
passionate     0.84
passions       0.83
uous           0.83
uality         0.82
ful            0.80
edIn           0.80
Activations Density 0.014%