INDEX
Explanations
references to specific organizations or companies
New Auto-Interp
Negative Logits
in
-0.75
,
-0.71
alone
-0.69
from
-0.69
as
-0.67
with
-0.66
-0.66
(
-0.66
-0.65
followed
-0.63
POSITIVE LOGITS
Paglinawan
0.76
ніципа
0.67
ویکیپدیای
0.66
Mero
0.56
betweenstory
0.55
urably
0.54
Jof
0.52
doubtnut
0.51
ømme
0.50
expandindo
0.50
Activations Density 0.367%