INDEX
Explanations
phrases related to prohibition or restrictions
terms related to prohibitions or restrictions
New Auto-Interp
Negative Logits
lycer
-0.77
eah
-0.75
ammy
-0.75
pring
-0.73
swick
-0.72
etics
-0.71
eon
-0.68
mond
-0.67
yll
-0.65
edes
-0.64
POSITIVE LOGITS
smoking
0.85
prohibition
0.81
nudity
0.78
untarily
0.78
discrimination
0.76
ĸļ
0.76
gambling
0.74
duplication
0.72
tampering
0.72
restricting
0.72
Activations Density 0.062%