INDEX
Explanations
references to believers and skeptics
terms related to belief and followers
New Auto-Interp
Negative Logits
RAW
-0.77
COL
-0.71
Liqu
-0.67
ony
-0.64
Mond
-0.64
Board
-0.64
artment
-0.62
oot
-0.62
ONY
-0.61
Toxic
-0.61
POSITIVE LOGITS
believers
3.73
believer
2.99
skeptics
1.73
adherents
1.73
ievers
1.63
atheists
1.49
unbel
1.44
disbel
1.41
proponents
1.40
Christians
1.37
Activations Density 0.017%