INDEX
Explanations
instances where the text discusses time periods in the past
references to significant events or concepts within a historical context
New Auto-Interp
Negative Logits
arius
-0.80
owl
-0.78
athering
-0.77
adelphia
-0.67
atron
-0.66
throp
-0.64
auder
-0.63
eno
-0.62
overflow
-0.62
haze
-0.62
POSITIVE LOGITS
appell
0.81
olicy
0.80
trials
0.73
iments
0.73
sic
0.70
imental
0.68
ĸļ
0.67
utsche
0.66
CHAT
0.65
ijing
0.64
Activations Density 0.000%