Explanations
words related to legal actions or restrictions
references to legal prohibitions or bans
Negative Logits
Token      Logit
ilogy      -0.83
framework  -0.73
lopp       -0.73
LI         -0.68
ERC        -0.67
library    -0.66
icon       -0.66
aeus       -0.66
ruary      -0.64
Nin        -0.63
Positive Logits
Token             Logit
lawfully          1.17
unlawful          1.15
lawful            1.08
prohibited        0.95
legally           0.94
license           0.94
unlawfully        0.94
permissible       0.89
constitutionally  0.88
violates          0.87
Activations Density 0.729%
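
The density figure above is not defined on this page. A common reading, assumed here, is the fraction of sampled token positions on which the feature has a nonzero activation. The sketch below illustrates that reading only: the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of token positions on which
    the feature fires at all. The source page does not confirm this.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 token positions, 7 of which fire.
sample = [0.0] * 993 + [0.4, 1.2, 0.1, 2.5, 0.9, 0.3, 1.7]
density = activation_density(sample)

# Format as a percentage with three decimals, matching the dashboard style.
print(f"{density:.3%}")
```

Under this reading, a density of 0.729% would mean the feature fires on roughly 7 of every 1000 sampled tokens, which fits a feature specific to legal vocabulary.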