INDEX
Explanations
names of authors or contributors at the beginning of an article or piece of content
mentions of authors or individuals associated with written works or articles
Negative Logits
sic                -0.77
coni               -0.74
derog              -0.70
nown               -0.67
externalToEVAOnly  -0.66
successors         -0.66
jri                -0.65
disg               -0.64
bis                -0.64
hereafter          -0.64
Positive Logits
Updated     0.90
Vegan       0.85
zbollah     0.83
CLUS        0.82
Expand      0.79
›           0.79
WASHINGTON  0.75
UPDATE      0.74
Welcome     0.73
Calculator  0.73
Activations Density 0.915%