INDEX
Explanations
references to major news publications
Negative Logits
Token               Logit
"..\..\             -0.73
"..\..\..\          -0.71
وتسجيلات            -0.63
numerus             -0.61
__(/*!              -0.61
utafitiHapana       -0.60
ьаж                 -0.59
NOWLEDG             -0.59
reated              -0.59
disambiguazione     -0.59
Positive Logits
Token               Logit
NYT                 1.11
newspaper           0.99
Times               0.97
newspapers          0.87
nytimes             0.86
Times               0.82
wsj                 0.79
Newspaper           0.79
Newspaper           0.75
newspaper           0.75
Activations Density 0.115%