INDEX
Explanations
URLs and website links
URLs and file paths
New Auto-Interp
Negative Logits
angry
-0.72
hugged
-0.70
apologised
-0.69
Ammo
-0.69
creep
-0.69
creeping
-0.68
Romo
-0.66
hugging
-0.65
wards
-0.64
enraged
-0.64
POSITIVE LOGITS
index
1.24
comments
1.11
etc
1.09
sites
1.07
photos
1.04
products
1.04
dq
1.03
detail
1.03
archives
1.01
Pages
1.01
Activations Density 0.036%