Explanations
references to specific web page interactions and instructions
references to specific pages or sections within a document or platform
Negative Logits
Token         Logit
ALLY          -0.77
CAST          -0.77
Instruments   -0.72
creen         -0.69
Franch        -0.66
borgh         -0.64
xon           -0.64
Rated         -0.63
kowski        -0.62
igham         -0.62
Positive Logits
Token         Logit
antry         1.39
pages         1.09
views         1.09
page          0.87
page          0.84
pages         0.81
earance       0.80
lists         0.76
ants          0.76
unks          0.75
Activations Density 0.018%