INDEX
Explanations
mentions of legal entities i.e., courts
references to legal courts
New Auto-Interp
Negative Logits
ana
-0.67
Austral
-0.63
Stronghold
-0.63
igun
-0.62
NATO
-0.62
ACTED
-0.62
Laf
-0.60
Hab
-0.60
OS
-0.60
Camb
-0.59
POSITIVE LOGITS
hip
0.91
assic
0.88
courts
0.84
wright
0.82
auld
0.82
yer
0.81
ices
0.81
esan
0.80
icing
0.79
ide
0.79
Activations Density 0.010%