INDEX
Explanations
references to creationism
Negative Logits
EGA        -0.74
phia       -0.70
Ago        -0.67
lob        -0.64
olulu      -0.64
Caf        -0.64
Mahar      -0.63
RESULTS    -0.63
nerves     -0.63
Fulton     -0.62
Positive Logits
ivist      0.94
ivism      0.89
ism        0.86
ist        0.82
idable     0.81
smanship   0.80
ally       0.80
istically  0.78
emis       0.78
flags      0.77
Activations Density: 0.036%