INDEX
Explanations
web page-related terms or instructions
occurrences of the word "page."
New Auto-Interp
Negative Logits
CAST
-0.71
mids
-0.69
ALLY
-0.68
creen
-0.66
dayName
-0.65
rived
-0.65
xon
-0.64
oise
-0.63
ivari
-0.62
kowski
-0.62
POSITIVE LOGITS
antry
1.34
pages
1.04
views
1.01
pages
0.97
page
0.93
page
0.91
lists
0.82
mails
0.80
chin
0.77
PAGE
0.76
Activations Density 0.019%