INDEX
Explanations
mention of evidence
references to the concept of evidence
Negative Logits
skill    -0.74
nas      -0.71
nee      -0.66
ija      -0.66
frey     -0.65
care     -0.62
jc       -0.61
dear     -0.60
awar     -0.60
Care     -0.60
Positive Logits
evidence  1.28
evidence  1.27
Evidence  1.18
Evidence  1.00
evid      0.95
proof     0.89
uments    0.86
proof     0.83
proofs    0.81
andum     0.81
Activations Density: 0.024%