INDEX
Explanations
phrases related to news reporting or journalism
New Auto-Interp
Negative Logits
wives
-0.65
emort
-0.64
naissance
-0.63
tre
-0.63
lection
-0.61
nerg
-0.61
thing
-0.61
ãĥŃ
-0.59
ordial
-0.59
clamation
-0.58
POSITIVE LOGITS
ometimes
0.69
petertodd
0.69
olate
0.67
favorably
0.66
quotes
0.66
citing
0.64
metics
0.63
omin
0.63
anecdotes
0.63
"...
0.62
Activations Density 0.205%