INDEX
Explanations
phrases indicating temporal continuity
New Auto-Interp
Negative Logits
ignet
-0.16
бом
-0.15
ekl
-0.14
баÑģ
-0.14
eam
-0.14
ãĥªãĥ³ãĤ°
-0.13
elf
-0.13
Contours
-0.13
ringe
-0.13
oplevel
-0.13
POSITIVE LOGITS
kie
0.19
isos
0.18
Hampton
0.15
eve
0.14
lica
0.14
liš
0.14
leigh
0.14
recent
0.14
олоÑģ
0.14
valuable
0.13
Activations Density 0.062%