INDEX
Explanations
mentions of legal or regulatory restrictions
terms related to limitations or controls imposed on actions or freedoms
New Auto-Interp
Negative Logits
story
-0.75
tein
-0.75
Sea
-0.72
past
-0.72
Lakes
-0.71
rious
-0.70
nie
-0.70
sis
-0.70
Chemistry
-0.68
Hy
-0.67
POSITIVE LOGITS
restrictions
1.40
restricting
1.22
imposed
1.16
restriction
1.10
prohibited
1.09
restricts
1.05
prohibitions
1.02
exemptions
0.99
restrictive
0.97
restricted
0.96
Activations Density 0.008%