INDEX
Explanations
prompts asking users to click on specific elements or buttons on a webpage
actions related to clicking buttons or links in digital content
Negative Logits
ngth        -0.80
lycer       -0.70
ilic        -0.69
ufact       -0.68
angering    -0.67
udeb        -0.66
contained   -0.62
noon        -0.62
ensical     -0.62
church      -0.61
Positive Logits
button       1.12
buttons      0.96
ĵĺ           0.77
slider       0.73
accelerator  0.73
zoom         0.72
switches     0.71
arrow        0.71
tab          0.71
arrows       0.71
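The source does not say how the logit values above were computed. One common convention in feature dashboards, assumed here, is to project the feature's decoder direction through the model's unembedding matrix and list the tokens with the largest and smallest resulting values. The sketch below illustrates that ranking in plain Python. The names `logit_effects`, `top_and_bottom`, `decoder_direction`, and `unembedding` are hypothetical, and the numbers are toy values, not this feature's real weights.

```python
def logit_effects(decoder_direction, unembedding, vocab):
    """Project a decoder direction through an unembedding matrix.

    decoder_direction: list of d_model floats (hypothetical feature vector).
    unembedding: d_model rows, each a list of len(vocab) floats.
    Returns one (token, effect) pair per vocabulary entry.
    """
    effects = []
    for col, token in enumerate(vocab):
        # Dot product of the direction with this token's unembedding column.
        effect = sum(d * row[col] for d, row in zip(decoder_direction, unembedding))
        effects.append((token, effect))
    return effects


def top_and_bottom(effects, k):
    """Return the k most positive and the k most negative (token, effect) pairs."""
    ranked = sorted(effects, key=lambda pair: pair[1], reverse=True)
    return ranked[:k], ranked[::-1][:k]


# Toy values for illustration only (assumption: not taken from the dashboard).
vocab = ["button", "slider", "church", "noon"]
decoder_direction = [1.0, -0.5]
unembedding = [
    [1.0, 0.5, -0.5, 0.0],   # weights for hidden dimension 0
    [-0.5, 0.0, 0.5, 1.0],   # weights for hidden dimension 1
]

positive, negative = top_and_bottom(
    logit_effects(decoder_direction, unembedding, vocab), k=2
)
```

Under this assumed convention, the positive list holds tokens the feature pushes the model toward predicting, and the negative list holds tokens it pushes against.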
Activations Density 0.087%
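Read as a fraction of tokens, a density of 0.087% means the feature is active on roughly 87 of every 100,000 tokens. A minimal sketch of that calculation follows, assuming density is the percentage of sampled tokens whose activation exceeds zero. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`."""
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Toy per-token activations for illustration only: 2 of the 8 values are nonzero.
sample = [0.0, 0.0, 1.7, 0.0, 0.0, 0.0, 0.3, 0.0]
density = activation_density(sample)
```

A density this low is consistent with a feature that responds to a narrow class of inputs, such as the button and link prompts described in the explanations.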