INDEX
Explanations
descriptions of experiencing something physically
instances of experiencing or tasting new things
Negative Logits
blaming        -0.85
redict         -0.69
citing         -0.68
ItemImage      -0.67
refrain        -0.67
refusing       -0.67
claiming       -0.67
advertising    -0.66
blamed         -0.64
comprom        -0.64
Positive Logits
firsthand      1.31
glimps         0.84
prototypes     0.84
demos          0.83
preview        0.78
scenery        0.78
whats          0.77
majesty        0.77
goodies        0.73
glimpse        0.73
Activations Density: 0.406%