Explanations
statements of confirmation or findings related to incidents or events
Negative Logits
recent      -0.24
lately      -0.24
soon        -0.24
recently    -0.23
soon        -0.20
recent      -0.18
缮åīį        -0.18
later       -0.18
Soon        -0.18
Recently    -0.18
Positive Logits
still       0.39
STILL       0.36
again       0.36
still       0.35
Still       0.32
Still       0.30
again       0.29
ancora      0.27
Again       0.27
already     0.26
Activations Density 0.044%