INDEX
Explanations
sentences that use the word "like" to initiate comparisons or examples
New Auto-Interp
Negative Logits
åłĤ
-0.18
EITHER
-0.16
Verfügung
-0.16
either
-0.16
illin
-0.14
onical
-0.14
.Îķ
-0.14
plemented
-0.14
orno
-0.14
ifix
-0.13
POSITIVE LOGITS
many
0.41
most
0.33
many
0.33
any
0.27
许å¤ļ
0.26
Many
0.25
MANY
0.25
Many
0.24
with
0.23
everything
0.23
Activations Density 0.073%