INDEX
Explanations
time-related expressions, particularly referring to specific time intervals and events happening after a certain period
references to time periods and the passage of years
New Auto-Interp
Negative Logits
SourceFile
-0.72
Talk
-0.70
Scrib
-0.70
ãĤ±
-0.69
esson
-0.66
ãĥ¤
-0.65
odox
-0.64
neau
-0.64
eus
-0.63
tainment
-0.63
POSITIVE LOGITS
hindsight
0.78
spent
0.78
schild
0.75
devoted
0.72
culminating
0.67
incarcer
0.65
adic
0.64
ccording
0.64
intertw
0.64
captivity
0.64
Activations Density 0.416%