INDEX
Explanations
names of people and associated actions or roles
reporting speech or attribution
New Auto-Interp
Negative Logits
autorytatywna
-0.57
HideFlags
-0.57
WriteTagHelper
-0.52
mobileqq
-0.52
ंदीखरीदारी
-0.48
Cyfeiriadau
-0.47
новништво
-0.47
uxxxx
-0.46
الحره
-0.46
Tembelea
-0.45
POSITIVE LOGITS
said
0.51
setVerticalGroup
0.45
explained
0.42
UnusedPrivate
0.42
meinte
0.39
explains
0.39
spiega
0.38
said
0.37
indisponible
0.36
says
0.35
Activations Density 0.008%