INDEX
Explanations
phrases related to belief or conviction
expressions related to belief and trust
New Auto-Interp
Negative Logits
rawl
-0.65
Interface
-0.60
iol
-0.60
cleanup
-0.59
cture
-0.59
overhead
-0.58
arc
-0.57
interface
-0.56
breakdown
-0.56
++
-0.56
POSITIVE LOGITS
believing
3.53
trusting
1.83
fearing
1.71
belief
1.65
believe
1.45
knowing
1.45
disbel
1.44
believer
1.41
expecting
1.40
believed
1.35
Activations Density 0.011%