INDEX
Explanations
references to the word "paint" at varying activation levels
references to paint and painting-related activities
New Auto-Interp
Negative Logits
htt
-0.80
doms
-0.78
indal
-0.75
zbek
-0.73
Kenyan
-0.69
atches
-0.67
AMY
-0.66
cgi
-0.65
otide
-0.65
scill
-0.64
POSITIVE LOGITS
brush
1.25
thinner
0.94
balls
0.89
ball
0.85
isans
0.84
brushes
0.83
pain
0.83
painter
0.81
acrylic
0.81
painting
0.80
Activations Density 0.049%