INDEX
NEGATIVE LOGITS
Token      Logit
\          -2.25
blatantly  -2.14
There      -2.13
When       -2.03
Despite    -2.02
spurred    -1.99
;          -1.98
}          -1.97
subtly     -1.95
           -1.93
POSITIVE LOGITS
Token      Logit
semelh     2.38
淠         2.23
౮          2.11
䛗         2.11
妧         2.11
泐         2.09
tajem      2.09
perigo     2.06
           2.06
糁         2.05
Activations Density: 0.003%