INDEX
Explanations
capital letters with unusual accents and punctuation marks
numeric values and significant numerical statistics in discussions
Negative Logits
boro        -0.73
utics       -0.70
undet       -0.69
utic        -0.68
lifes       -0.65
hust        -0.63
wiser       -0.62
utical      -0.61
utsche      -0.60
attent      -0.60
Positive Logits
However         0.95
SPONSORED       0.95
Meanwhile       0.95
Advertisement   0.94
Newsletter      0.92
Article         0.90
Nevertheless    0.89
Despite         0.88
Related         0.87
Unlike          0.87
Activations Density: 1.714%