Explanations
occurrences of hyperlinks and linking actions in the text
Negative Logits
vÃŃ      -0.19
/Linux   -0.17
imler    -0.16
473      -0.16
بÙĪØ±    -0.16
ccione   -0.16
zÅij     -0.15
442      -0.15
çĥĪ      -0.15
duc      -0.15
Positive Logits
ages     0.44
age      0.32
edin     0.29
AGES     0.28
able     0.25
/button  0.23
aged     0.22
din      0.22
sys      0.22
ups      0.22
Activations Density 0.033%
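
Several entries in the Negative Logits list (for example vÃŃ, zÅij and çĥĪ) look like tokens shown in a byte-level BPE display alphabet rather than as readable text. The page does not say which tokenizer produced them, so the sketch below assumes the GPT-2 style byte-to-unicode table; the function names `gpt2_byte_decoder` and `decode_token` are hypothetical helpers, not part of any dashboard API. Characters outside that table are passed through unchanged as UTF-8.

```python
def gpt2_byte_decoder():
    """Build display-character -> byte, the reverse of the GPT-2 style table.

    Assumption: tokens were rendered with the GPT-2 byte-to-unicode mapping.
    """
    # Bytes that the mapping shows as themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    decoder = {chr(b): b for b in printable}
    # Every other byte is shown as the next unused code point from 256 upward.
    shift = 0
    for b in range(256):
        if b not in printable:
            decoder[chr(256 + shift)] = b
            shift += 1
    return decoder


_DECODER = gpt2_byte_decoder()


def decode_token(token):
    """Convert a displayed token string back to readable text."""
    raw = bytearray()
    for ch in token:
        if ch in _DECODER:
            raw.append(_DECODER[ch])
        else:
            # Not in the table (already-decoded text): keep it as UTF-8.
            raw.extend(ch.encode("utf-8"))
    # Token fragments may be partial UTF-8 sequences, so do not raise on errors.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["vÃŃ", "zÅij", "çĥĪ", "/Linux"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as /Linux or ages come back unchanged, so the helper can be applied to both lists without special-casing. If the feature comes from a model with a different tokenizer, the mapping above does not apply.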