INDEX
Explanations
dates like 2024
mentions of specific years and dated time markers, especially recent calendar years and fiscal year labels (e.g., FY-year).
New Auto-Interp
Negative Logits
sedative
1.39
surgi
1.34
它
1.32
🙉
1.31
HAD
1.30
quinine
1.26
crippling
1.23
אר
1.19
astray
1.17
THERE
1.15
POSITIVE LOGITS
ن
1.90
ف
1.81
क
1.71
T
1.69
ל
1.52
k
1.36
h
1.33
𝗔
1.30
i
1.29
د
1.27
Activations Density 0.020%