Negative Logits
hoax	0.80
TP	0.79
postpon	0.76
n	0.75
➤	0.75
jab	0.75
TP	0.74
haw	0.73
kore	0.72
acak	0.72
Positive Logits
际	0.96
鎗	0.93
ش	0.91
බැ	0.90
រស	0.88
يز	0.88
ల	0.87
projetos	0.85
秋	0.84
лни	0.83
Activations Density 0.001%