INDEX
Negative Logits
-0.49
,
-0.46
"
-0.44
-
-0.43
.
-0.41
/
-0.41
仲間
-0.40
or
-0.40
OR
-0.39
a
-0.39
POSITIVE LOGITS
itſelf
0.98
Jefus
0.96
Efq
0.95
Majefty
0.94
myſelf
0.94
లాలు
0.94
tableFuture
0.93
Vidite
0.93
ſche
0.92
незавершена
0.92
Activations Density 2.690%