INDEX
Explanations
mentions of places of worship, particularly churches
references to churches and related activities
New Auto-Interp
Negative Logits
Rog
-0.68
schild
-0.63
neurot
-0.61
Consumer
-0.61
Fuj
-0.61
DonaldTrump
-0.61
Novel
-0.59
Yak
-0.59
Ake
-0.59
Paste
-0.59
POSITIVE LOGITS
goers
1.33
yard
1.19
yards
1.01
bells
0.98
going
0.97
congregation
0.95
choir
0.94
helps
0.93
attendance
0.93
fires
0.89
Activations Density 0.032%