Negative Logits
don            0.60
an             0.53
a              0.52
eing           0.52
on             0.50
enteros        0.50
deforestation  0.49
datos          0.49
clear          0.49
didn           0.49
Positive Logits
повседнев  0.86
ordinary   0.75
日常       0.73
обы        0.71
cotidiana  0.70
生活中     0.69
Ordinary   0.68
mundane    0.68
ordinary   0.67
일상       0.67
Activations Density 0.020%