INDEX
Explanations
references to current entities, dates, and events
New Auto-Interp
Negative Logits
kowski
-0.18
initially
-0.18
soon
-0.17
originally
-0.17
аннÑĸ
-0.16
dorf
-0.16
åİŁæľ¬
-0.15
ÑĢанÑĮÑĪе
-0.15
previously
-0.15
first
-0.15
POSITIVE LOGITS
STILL
0.28
now
0.28
still
0.27
still
0.26
Still
0.25
Still
0.23
now
0.22
_now
0.22
artık
0.21
hâlâ
0.21
Activations Density 0.211%