INDEX
Explanations
words related to prohibitions or restrictions
terms related to "prohibition" or restrictions
New Auto-Interp
Negative Logits
squared
-0.76
Primordial
-0.75
Apostles
-0.67
rolling
-0.65
Dover
-0.64
graves
-0.62
Span
-0.61
Spice
-0.60
DOT
-0.60
forth
-0.60
POSITIVE LOGITS
hibit
1.20
hib
1.15
hibition
1.09
oresc
1.04
inous
1.00
itors
0.99
hibited
0.98
itor
0.94
encies
0.94
exhib
0.90
Activations Density 0.008%