INDEX
Explanations
mentions of organizations and formal groups involved in charitable activities
New Auto-Interp
Negative Logits
еÑĢÑĤи
-0.17
otime
-0.16
unter
-0.16
Rise
-0.15
unting
-0.15
ormsg
-0.15
coles
-0.14
exion
-0.14
Dabei
-0.14
unt
-0.14
POSITIVE LOGITS
etto
0.16
ελ
0.15
dubious
0.14
themselves
0.14
ائ
0.13
Mend
0.13
alic
0.13
Bro
0.13
ittle
0.13
attery
0.13
Activations Density 0.224%