INDEX
Negative Logits
s
0.86
egin
0.79
doc
0.78
rule
0.76
sample
0.73
Determ
0.72
bb
0.72
tech
0.72
san
0.72
Determines
0.71
POSITIVE LOGITS
contrasting
0.84
䀓
0.82
livre
0.79
bodyguard
0.78
euthanasia
0.78
totalitarian
0.77
día
0.76
biasa
0.75
reação
0.75
speedometer
0.75
Activations Density 0.001%