INDEX
Explanations
instances of the word "website."
New Auto-Interp
Negative Logits
well
-0.21
wig
-0.17
ajo
-0.16
wel
-0.16
inn
-0.16
shit
-0.15
oom
-0.15
essor
-0.15
mand
-0.15
ager
-0.14
POSITIVE LOGITS
/app
0.22
/web
0.20
/mobile
0.18
/blog
0.18
0.17
Sharper
0.17
/apps
0.17
visitor
0.16
/port
0.16
/App
0.16
Activations Density 0.023%