Explanations
phrases indicating the conclusion or end of events or periods
Negative Logits
Token     Logit
late      -0.17
china     -0.15
æĹ©       -0.15
early     -0.15
late      -0.15
Begin     -0.15
igen      -0.14
ëª        -0.14
Late      -0.14
begin     -0.14
Positive Logits
Token       Logit
tunnel      0.20
tether      0.18
term        0.17
unnel       0.17
usra        0.16
tunnels     0.16
Tunnel      0.16
each        0.16
innocence   0.15
month       0.15
Activations Density: 0.062%