Explanations
assertions of claims or allegations within a context
Head Attr Weights
0:0.07
1:0.02
2:0.04
3:0.23
4:0.03
5:0.07
6:0.02
7:0.05
8:0.03
9:0.02
10:0.36
11:0.02
Negative Logits
irlf        -1.96
arag        -1.94
ocamp       -1.89
ummer       -1.89
amaru       -1.88
ELL         -1.87
trl         -1.85
abruptly    -1.83
alloween    -1.82
awaits      -1.82
Positive Logits
assumptions     3.56
convictions     3.47
conviction      3.37
assumption      3.21
observations    3.02
perceptions     3.00
intuition       2.99
premise         2.83
perception      2.77
belief          2.77
Activations Density 0.107%