INDEX
Explanations
the word "mother" in various contexts
references to the concept of monotheism
New Auto-Interp
Negative Logits
Challenger
-0.68
2000
-0.65
Dem
-0.65
Labrador
-0.63
2012
-0.62
gangs
-0.61
2008
-0.61
modules
-0.60
2010
-0.59
trenches
-0.59
POSITIVE LOGITS
othe
5.12
othes
1.43
ithe
1.41
othing
1.37
athe
1.32
ocry
1.30
ophy
1.25
othy
1.24
oth
1.20
ethe
1.18
Activations Density 0.032%