INDEX
Explanations
terms related to historical changes and developments
a past state or previous time
initial states and past thoughts
New Auto-Interp
Negative Logits
désormais
-1.09
henceforth
-1.05
bientôt
-1.04
inzwischen
-1.03
mittlerweile
-1.02
inmiddels
-1.02
now
-1.01
uiteindelijk
-1.00
subsequently
-0.98
теперь
-0.96
POSITIVE LOGITS
thought
0.80
以為
0.68
以为
0.67
thought
0.65
feared
0.64
متعلقه
0.63
hesitant
0.63
notion
0.59
Thought
0.58
solely
0.57
Activations Density 0.329%