INDEX
Explanations
references to regulation and government-related actions concerning substances
New Auto-Interp
Negative Logits
psc
-0.18
DISCLAIM
-0.17
Frag
-0.16
Snowden
-0.16
ako
-0.16
unma
-0.15
nech
-0.14
Fukushima
-0.14
erus
-0.14
Baron
-0.13
POSITIVE LOGITS
boot
0.38
Pro
0.37
Temper
0.34
prohibition
0.34
temper
0.31
Boot
0.31
spe
0.29
Boot
0.27
/boot
0.26
-boot
0.26
Activations Density 0.027%