INDEX
Explanations
instances of phrases indicating a numerical quantity followed by a general noun or concept
phrases indicating a relative time or duration
New Auto-Interp
Negative Logits
Hollow
-0.63
Prelude
-0.61
Cassandra
-0.59
Emblem
-0.59
LF
-0.59
first
-0.58
1914
-0.57
Osw
-0.56
alist
-0.56
Romeo
-0.56
POSITIVE LOGITS
bered
0.90
oths
0.87
othe
0.86
zin
0.76
éĹ
0.75
oshenko
0.75
ricular
0.74
ooo
0.74
apy
0.73
posium
0.73
Activations Density 0.012%