INDEX
Explanations
mentions of public health, environmental issues, and prevention measures
Negative Logits
république    -0.49
female        -0.38
kvinder       -0.38
démocratie    -0.35
badkamer      -0.33
female        -0.33
gewerb        -0.32
République    -0.31
spoloč        -0.31
suom          -0.30
Positive Logits
########.          0.98
autorytatywna      0.91
存于互联网档案馆    0.91
AsUp               0.89
ValueStyle         0.88
disambiguazione    0.88
<unused52>         0.88
<unused68>         0.87
<unused16>         0.87
[@BOS@]            0.87
Activations Density 0.798%