INDEX
Explanations
phrases related to rules and constraints
concepts related to social norms and regulations
New Auto-Interp
Negative Logits
ideshow
-0.31
augh
-0.31
)</
-0.30
)."
-0.29
outwe
-0.29
entimes
-0.29
utm
-0.29
hers
-0.29
candy
-0.28
arer
-0.28
POSITIVE LOGITS
Examination
0.35
galitarian
0.33
ombat
0.31
Whilst
0.30
raltar
0.30
Invalid
0.30
Wasteland
0.29
Regarding
0.28
Civilization
0.28
Dragons
0.28
Activations Density 3.587%