INDEX
Explanations
HTML and scripting elements, specifically related to webpage navigation
New Auto-Interp
Negative Logits
poster
-0.17
bsp
-0.15
opal
-0.15
510
-0.14
permission
-0.14
UTOR
-0.14
Consent
-0.14
ango
-0.14
perms
-0.14
dden
-0.13
POSITIVE LOGITS
skip
0.23
Skip
0.20
Skip
0.20
skip
0.20
skips
0.18
Skipping
0.17
_SKIP
0.16
SKIP
0.16
ippers
0.16
skipped
0.16
Activations Density 0.007%