INDEX
Negative Logits
лаж
-0.08
작업
-0.07
NAC
-0.07
информа
-0.07
verdadeiro
-0.07
Opin
-0.07
沖
-0.07
Satisfied
-0.07
Opinion
-0.07
-aware
-0.07
POSITIVE LOGITS
uri
0.08
suffering
0.08
ểm
0.08
衡
0.07
medal
0.07
vela
0.07
broaden
0.07
inecraft
0.07
Leiter
0.07
capt
0.07
Activations Density 0.009%