Explanations
references to online activities or online platforms
Negative Logits
ily      -0.19
seau     -0.18
iances   -0.17
ful      -0.16
runs     -0.16
nt       -0.16
anca     -0.16
Ïĩε      -0.15
pered    -0.15
äd       -0.15
Positive Logits
/off                   0.40
/mobile                0.26
presence               0.22
presence               0.20
-only                  0.20
(token text missing)   0.20
/web                   0.19
Presence               0.19
(token text missing)   0.19
-off                   0.18
Activations Density 0.040%