INDEX
Explanations
phrases related to legal institutions or proceedings
instances of the word "court" or variations of it
New Auto-Interp
Negative Logits
curfew
-0.69
AMI
-0.64
OV
-0.63
Franks
-0.58
hol
-0.58
ocalypse
-0.56
ARK
-0.56
ORN
-0.56
Shining
-0.55
warp
-0.54
POSITIVE LOGITS
tyard
1.66
thouse
1.28
ser
1.19
ses
1.18
sers
1.16
tes
1.12
ten
1.10
tesy
1.10
gue
1.08
rier
1.07
Activations Density 0.053%