INDEX
Explanations
references to health and safety measures related to COVID-19
New Auto-Interp
Negative Logits
_BLK
-0.15
imprison
-0.14
Lock
-0.14
okus
-0.14
Ñģом
-0.14
lok
-0.13
ãĥªãĥ³
-0.13
ULO
-0.13
getc
-0.13
.sleep
-0.13
POSITIVE LOGITS
socially
0.30
social
0.30
social
0.27
temperature
0.26
_social
0.26
ÑģоÑĨи
0.25
Temperature
0.25
Social
0.25
-social
0.24
Social
0.24
Activations Density 0.064%