INDEX
Explanations
words related to interest or enthusiasm in various contexts
instances of the word "interest" and its variations
Negative Logits
rome         -0.72
prus         -0.68
apple        -0.64
pex          -0.62
seams        -0.62
ut           -0.61
xon          -0.61
stacked      -0.59
testament    -0.59
abby         -0.58
Positive Logits
enza         0.87
Groups       0.76
Interest     0.72
ocene        0.71
reprene      0.71
ATURE        0.70
Rate         0.69
trolling     0.68
interest     0.68
phal         0.67
Activations Density: 0.024%