INDEX
Negative Logits
permiss
-0.09
dependent
-0.09
estação
-0.08
substitution
-0.08
substitutions
-0.08
panes
-0.08
PCR
-0.08
таблетки
-0.08
♪
-0.08
.wav
-0.08
POSITIVE LOGITS
disclosure
0.12
disclose
0.11
ESG
0.11
disclosures
0.11
Disclosure
0.11
Disclosure
0.10
divulg
0.10
BIM
0.10
వెల్లడ
0.09
Truth
0.09
Activations Density 0.012%