Explanations
phrases urging action or participation
Negative Logits
Token      Logit
urn        -0.16
ittings    -0.16
rig        -0.15
гот        -0.15
ige        -0.14
락         -0.14
.ov        -0.14
ше         -0.14
ler        -0.14
raft       -0.14
Positive Logits
Token      Logit
join       0.18
come       0.18
pletely    0.18
pcb        0.17
Join       0.17
Come       0.17
Come       0.16
LEC        0.16
%f         0.16
upp        0.15
Activations Density: 0.022%