INDEX
Explanations
statements asserting the truth of claims or opinions
New Auto-Interp
Negative Logits
DockStyle
-0.64
oredCriteria
-0.63
InjectAttribute
-0.57
>>>>>>>
-0.54
يتيمه
-0.53
ftagPool
-0.52
<bos>
-0.52
曖昧さ回避
-0.51
setVerticalGroup
-0.51
-0.49
POSITIVE LOGITS
true
2.32
true
1.99
True
1.55
TRUE
1.54
True
1.54
TRUE
1.35
vrai
1.30
truer
1.25
cierto
1.22
truest
1.16
Activations Density 0.475%