INDEX
Explanations
mentions of atheism or related terms
references to atheism and related concepts
New Auto-Interp
Negative Logits
Zub
-0.75
FORMATION
-0.70
DERR
-0.69
DOE
-0.68
Mandela
-0.64
ENDED
-0.63
âĺħâĺħ
-0.63
Lemon
-0.63
uggest
-0.62
Westbrook
-0.61
POSITIVE LOGITS
neum
1.15
rency
0.93
Athe
0.92
azi
0.88
ist
0.86
lington
0.86
nect
0.85
ists
0.84
ĪĴ
0.83
istical
0.83
Activations Density 0.010%