INDEX
Explanations
"The" followed by a proper noun
Negative Logits
en       0.91
in       0.89
IN       0.89
the      0.89
of       0.86
own      0.84
If       0.83
o        0.83
ο        0.82
(blank)  0.81
Positive Logits
atrical    1.71
odore      1.50
odora      1.40
Beatles    1.39
atres      1.37
ophylline  1.33
mselves    1.32
matic      1.30
Hague      1.30
orems      1.28
Activations Density: 0.146%