INDEX
Explanations
restrictions imposed on various subjects
mentions of various restrictions imposed by authorities
NEGATIVE LOGITS
rious         -0.75
psc           -0.71
story         -0.70
earth         -0.67
amina         -0.66
Sea           -0.65
Generations   -0.64
Dirt          -0.62
rouse         -0.62
ashington     -0.62
POSITIVE LOGITS
restrictions   1.17
restricting    1.12
imposed        1.12
prohibited     1.02
restricts      0.97
inhib          0.93
restriction    0.93
prohibiting    0.91
restricted     0.90
restrictive    0.90
Activations Density 0.013%
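
The density figure is not defined in the source. As a minimal sketch, assuming it means the fraction of token positions on which the feature fires, it could be computed as below. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions with a nonzero
    (above-threshold) activation; the dashboard does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 100,000 token positions, 13 of which are active.
toy = [0.0] * 99_987 + [1.5] * 13
density = activation_density(toy)
print(f"{density:.3%}")  # prints "0.013%"
```

Under this reading, a density of 0.013% would mean the feature is active on roughly 13 of every 100,000 tokens.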