Explanations
phrases related to imposing limitations or regulations
terms related to imposing limitations or controls
Negative Logits
Token                     Logit
FORE                      -0.82
ashington                 -0.74
OF                        -0.67
rious                     -0.66
orah                      -0.66
Sham                      -0.66
Dirt                      -0.65
Äĩ                        -0.64
hower                     -0.64
------------------------  -0.63
Positive Logits
Token         Logit
ively         0.92
restricts     0.88
ricted        0.87
unrestricted  0.81
restricting   0.80
restraints    0.80
restricted    0.80
satell        0.80
restrictive   0.77
territ        0.77
Activations Density: 0.034%