INDEX
Explanations
specific mentions of the word "Court."
mentions of courts or related legal terminology
Negative Logits
rooting         -0.78
culturally      -0.61
lihood          -0.60
Horus           -0.59
curfew          -0.59
donor           -0.59
goodbye         -0.59
economically    -0.58
hol             -0.58
flashlight      -0.58
Positive Logits
tyard           1.50
thouse          1.35
ses             1.12
ser             1.10
cour            1.06
sers            1.06
Cour            1.04
eneg            0.97
icol            0.96
cil             0.93
Activations Density 0.010%