INDEX
Explanations
references to specific legal systems or institutions
references to specific locations and names associated with excitement or conflict
Negative Logits
itable         -0.90
yip            -0.78
cannon         -0.75
disclaim       -0.73
zik            -0.73
ould           -0.73
ble            -0.70
conservancy    -0.70
ball           -0.69
table          -0.68
Positive Logits
ively          0.91
alez           0.82
gow            0.75
OWER           0.73
iveness        0.71
iar            0.67
ority          0.67
warr           0.66
daq            0.66
endings        0.66
Activations Density 0.044%