INDEX
Explanations
phrases indicating progression or transitions
New Auto-Interp
Negative Logits
leston
-0.19
ãĤ§
-0.16
ADO
-0.15
ashi
-0.15
zes
-0.15
è©
-0.15
orum
-0.14
stir
-0.14
eben
-0.14
»
-0.13
POSITIVE LOGITS
884
0.16
brook
0.16
iw
0.16
DebugEnabled
0.16
kla
0.15
erb
0.15
ванов
0.14
iyi
0.14
dcc
0.14
ioc
0.14
Activations Density 0.021%