INDEX
Explanations
references to historical events and figures
Past tense auxiliary verbs ("had", "was", "were")
past events by a certain time
New Auto-Interp
Negative Logits
оригіналу
-0.79
selama
-0.62
illier
-0.58
epik
-0.56
cinogenicity
-0.53
bitat
-0.53
همیشه
-0.50
مواليد
-0.49
nahilalakip
-0.49
InSection
-0.48
POSITIVE LOGITS
still
1.19
already
1.19
hadn
1.09
still
1.07
already
1.02
すでに
1.00
既に
0.98
Already
0.98
Still
0.96
had
0.96
Activations Density 0.563%