INDEX
Negative Logits
271
-0.07
269
-0.07
likeness
-0.07
/init
-0.07
663
-0.07
257
-0.07
disposal
-0.07
Two
-0.07
spinner
-0.07
Two
-0.07
POSITIVE LOGITS
read
0.27
Read
0.21
read
0.18
Read
0.17
-read
0.17
READ
0.16
.Read
0.14
reads
0.14
READ
0.14
reading
0.13
Activations Density 0.055%