Negative Logits
Token      Logit
coup       -0.08
Colony     -0.07
208        -0.06
saldırı    -0.06
_winner    -0.06
düzen      -0.06
victory    -0.06
divers     -0.06
function   -0.06
fort       -0.06
Positive Logits
Token      Logit
read       0.14
Read       0.13
-read      0.12
reading    0.12
reads      0.11
READ       0.10
Reads      0.10
readings   0.10
read       0.10
read       0.09
Activations Density 0.055%