INDEX
Explanations
references to charitable organizations and services
New Auto-Interp
Negative Logits
561
-0.16
aly
-0.15
ething
-0.15
mega
-0.14
abin
-0.14
ek
-0.14
Mata
-0.14
θεν
-0.14
upd
-0.13
pedia
-0.13
POSITIVE LOGITS
charity
0.21
Action
0.19
Samar
0.18
Alzheimer
0.17
Campaign
0.17
charities
0.16
Royal
0.16
Forces
0.16
British
0.16
Citizens
0.16
Activations Density 0.178%