INDEX
Explanations
words related to beliefs and convictions
references to beliefs or ideas
New Auto-Interp
Negative Logits
agher
-0.73
apeake
-0.72
nice
-0.70
bye
-0.69
avis
-0.68
Sieg
-0.68
nova
-0.66
yna
-0.66
Amen
-0.65
hma
-0.65
POSITIVE LOGITS
belief
0.99
ieve
0.90
fulness
0.90
beliefs
0.88
fully
0.80
ually
0.79
itious
0.77
faith
0.76
conviction
0.76
disbelief
0.76
Activations Density 0.020%