INDEX
Negative Logits
={},-0.93
᥆
-0.86
ITAS
-0.84
catchError
-0.83
鬯
-0.80
irez
-0.79
Birken
-0.79
鵙
-0.78
erdings
-0.78
asan
-0.78
POSITIVE LOGITS
while
1.12
these
0.81
ịn
0.78
忿
0.78
vola
0.76
just
0.76
ønne
0.75
took
0.75
taking
0.75
рекомендуется
0.74
Activations Density 0.003%