Negative Logits
ılmıştır      -0.08
пуст          -0.07
convenience   -0.07
ruined        -0.07
pasar         -0.07
transc        -0.07
intros        -0.07
sinon         -0.07
Brown         -0.07
.Convert      -0.07
Positive Logits
alleged        0.15
allegations    0.13
alleging       0.13
allegation     0.12
allege         0.12
alleges        0.11
allegedly      0.10
LOG            0.09
log            0.08
886            0.08
Activations Density: 0.005%