INDEX
Explanations
references to publications such as newspapers or journals
mentions of various journals
Negative Logits
creen      -0.70
holders    -0.69
theirs     -0.62
ours       -0.61
ticking    -0.61
cream      -0.59
regress    -0.59
Staples    -0.59
Michele    -0.59
dh         -0.59
Positive Logits
Sentinel   1.02
istic      0.90
Editors    0.87
ournals    0.87
ism        0.86
ists       0.83
Journal    0.82
Paper      0.81
Journal    0.81
ist        0.80
Activations Density: 0.011%