INDEX
Explanations
phrases related to implementing stricter measures or regulations
phrases associated with tightening control or restrictions
New Auto-Interp
Head Attr Weights
0:0.07
1:0.03
2:0.13
3:0.11
4:0.08
5:0.05
6:0.02
7:0.03
8:0.18
9:0.14
10:0.06
11:0.04
Negative Logits
Pal
-1.15
JECT
-1.11
Card
-1.11
Yao
-1.08
pige
-1.07
Pod
-1.05
cooks
-1.03
sacrificed
-1.02
podcast
-1.02
weet
-1.01
POSITIVE LOGITS
ーテ
1.54
nings
1.39
versions
1.28
irements
1.24
notations
1.23
foothold
1.22
龍契士
1.21
req
1.21
urities
1.20
grip
1.19
Activations Density 0.008%