INDEX
Explanations
evidence related to legal cases
references to various forms of evidence in a context
Negative Logits
ategory    -0.95
ttle       -0.72
Hop        -0.71
TOR        -0.70
awar       -0.69
orks       -0.68
frey       -0.68
ernel      -0.66
orus       -0.66
onen       -0.66
Positive Logits
tampering    1.03
evidence     1.00
evidence     1.00
Evidence     0.83
edly         0.81
evid         0.79
proof        0.79
Evidence     0.78
testifying   0.77
gathered     0.77
Activations Density 0.027%