INDEX
Explanations
temporal indicators or phrases that suggest time and continuity
Negative Logits
Token            Logit
EndInit          -0.73
Someday          -0.59
verwijspagina    -0.52
Horizonte        -0.49
BeginContext     -0.48
something        -0.47
Perſ             -0.47
Через            -0.46
continuare       -0.45
someday          -0.45
Positive Logits
Token            Logit
since            1.15
since            1.03
SINCE            0.98
Depuis           0.91
Since            0.89
Since            0.88
depuis           0.83
sejak            0.82
Depuis           0.72
seit             0.71
Activations Density 0.140%