INDEX
Explanations
mentions of specific legislation or policy names
references to the entity labeled "EN" and its associated context or versions
Negative Logits
vati             -0.81
heng             -0.69
ffe              -0.67
illac            -0.66
fentanyl         -0.65
cies             -0.62
FactoryReloaded  -0.62
undone           -0.62
e                -0.61
Louie            -0.61
Positive Logits
EN               0.95
DERR             0.83
acci             0.83
vironment        0.81
terness          0.80
IG               0.80
EST              0.79
PUT              0.78
ANGEL            0.78
TRY              0.77
Activations Density: 0.009%