INDEX
Explanations
phrases related to political actions and policies
references to political or regulatory actions affecting the environment
New Auto-Interp
Negative Logits
answered
-0.58
aback
-0.58
Canaver
-0.57
explanations
-0.56
disclaimer
-0.55
!:
-0.55
DragonMagazine
-0.53
IRC
-0.52
Patreon
-0.51
ERG
-0.51
POSITIVE LOGITS
)).
0.84
%).
0.80
)."
0.79
).[
0.76
]).
0.72
]."
0.70
).
0.68
)),
0.65
").
0.64
etc
0.63
Activations Density 2.934%