INDEX
Explanations
legal or investigative terminology
expressions related to legal and procedural contexts
New Auto-Interp
Negative Logits
domest
-0.61
obyl
-0.60
interstitial
-0.60
roofs
-0.60
scenic
-0.59
animate
-0.58
indoors
-0.57
conserve
-0.56
skysc
-0.56
flowering
-0.55
POSITIVE LOGITS
redacted
0.84
wording
0.82
contradicted
0.76
furthermore
0.74
Counsel
0.73
Response
0.72
quoting
0.72
Comment
0.71
asserted
0.71
inconsistencies
0.70
Activations Density 2.175%