Explanations
references to individuals and groups involved in various contexts or stories
Negative Logits
Token             Logit
IsContent         -0.71
uç                -0.69
traditionnels     -0.66
StoreMessageInfo  -0.66
väli              -0.65
defStyle          -0.65
undang            -0.64
Controllo         -0.63
cauza             -0.63
arany             -0.63
Positive Logits
Token    Logit
who      1.33
person   1.26
people   1.12
persons  1.07
Person   1.05
PERSON   0.98
Persons  0.98
whom     0.98
Persons  0.93
person   0.93
Activations Density: 0.318%