INDEX
Negative Logits
It
0.58
That
0.57
They
0.57
When
0.56
This
0.55
Once
0.55
These
0.54
Another
0.54
Security
0.54
That
0.53
POSITIVE LOGITS
dissertation
0.84
essay
0.79
essays
0.77
coursework
0.76
литератур
0.69
thesis
0.68
escritores
0.68
argumentative
0.65
escribir
0.64
escrever
0.63
Activations Density 0.000%