INDEX
Negative Logits
democracy
0.49
lobbying
0.46
courtesy
0.46
roadways
0.45
incoherent
0.44
uprisings
0.43
recording
0.43
isotropic
0.43
plasmids
0.42
instabilities
0.42
POSITIVE LOGITS
EDIT
0.76
Edit
0.73
edit
0.71
Edit
0.71
EDIT
0.69
edit
0.66
редакти
0.66
Editar
0.63
править
0.60
编辑
0.57
Activations Density 0.000%