INDEX
Explanations
words related to evidence or proof
references to evidence in various contexts
New Auto-Interp
Negative Logits
skill
-0.76
noxious
-0.72
ija
-0.70
sf
-0.68
frey
-0.68
nas
-0.67
oos
-0.67
sie
-0.63
jc
-0.62
bum
-0.62
POSITIVE LOGITS
evidence
1.12
evidence
1.11
Evidence
1.08
Evidence
0.96
evid
0.89
proofs
0.79
proof
0.78
validity
0.78
edly
0.77
proof
0.72
Activations Density 0.023%