INDEX
Negative Logits
硙              0.39
administratif   0.35
邰              0.34
䣫              0.34
猞              0.34
şeyler          0.33
perusahaan      0.33
özellikle       0.33
颇              0.33
鸹              0.32
Positive Logits
+               0.60
+               0.47
+\              0.46
4               0.44
+(              0.44
f               0.43
6               0.43
x               0.42
(blank)         0.42
(blank)         0.41
Activations Density 0.294%