INDEX
Explanations
dates and periods of time, specifically referencing events that happened a certain number of years ago
the occurrence of the word "ago," particularly in various contexts indicating time
New Auto-Interp
Negative Logits
uniqueness
-0.86
simultaneous
-0.76
determination
-0.72
similarity
-0.70
similarities
-0.69
mobility
-0.68
suspic
-0.68
emphasis
-0.66
intensity
-0.66
comfort
-0.66
POSITIVE LOGITS
ago
1.34
vernment
1.29
allo
0.97
edia
0.96
opa
0.94
zzi
0.88
onte
0.87
etta
0.85
onz
0.84
wright
0.81
Activations Density 0.003%