INDEX
Explanations
words related to being interested or showing interest in something
references to the concept of interest in various contexts
New Auto-Interp
Negative Logits
apple
-0.70
é¾į
-0.66
rome
-0.63
stacked
-0.63
helicop
-0.63
llan
-0.61
abby
-0.60
sled
-0.60
bunk
-0.59
seams
-0.59
POSITIVE LOGITS
interest
0.81
Interest
0.79
enza
0.77
="#
0.74
trolling
0.73
ATURE
0.73
edIn
0.71
interested
0.70
topic
0.68
curiosity
0.66
Activations Density 0.018%