INDEX
Explanations
statements expressing beliefs
expressions of belief or conviction
New Auto-Interp
Negative Logits
conservancy
-0.82
practice
-0.73
pmwiki
-0.71
tnc
-0.68
ammy
-0.65
yna
-0.65
details
-0.62
umber
-0.62
skill
-0.62
agher
-0.62
POSITIVE LOGITS
believe
0.91
believes
0.78
BEL
0.76
ieve
0.71
itial
0.71
ibles
0.69
believing
0.68
rill
0.68
oglu
0.67
convinced
0.67
Activations Density 0.031%