INDEX
Negative Logits
Execution
-0.08
Alberto
-0.07
echo
-0.07
īja
-0.07
ono
-0.07
uttering
-0.07
ceived
-0.07
vertellen
-0.07
.raw
-0.07
raw
-0.07
POSITIVE LOGITS
respectful
0.12
尊
0.11
peacefully
0.11
preferably
0.10
respe
0.10
peaceful
0.10
calmly
0.10
constructive
0.10
gentle
0.10
rather
0.09
Activations Density 0.076%