INDEX
Explanations
instances of specific phrases or terms related to evidence and claims
New Auto-Interp
Negative Logits
.Undef
-0.16
styleType
-0.15
EMPLARY
-0.15
paque
-0.15
alace
-0.14
ascar
-0.14
rias
-0.14
iefs
-0.14
γά
-0.14
Äijỡ
-0.14
POSITIVE LOGITS
idea
0.44
notion
0.39
suggestion
0.34
belief
0.32
impression
0.32
idea
0.32
prospect
0.31
question
0.31
possibility
0.30
proposition
0.29
Activations Density 0.251%