INDEX
Explanations
instances of the word "time" with relatively high activation values
mentions of time and temporal references
Negative Logits
ramid      -0.76
ouk        -0.70
RD         -0.69
Bey        -0.69
iants      -0.63
reditary   -0.62
atl        -0.61
MAT        -0.60
rim        -0.60
atever     -0.59
Positive Logits
frames     0.89
elapsed    0.86
zone       0.85
interval   0.79
pause      0.76
consuming  0.70
ago        0.69
msec       0.69
traveller  0.69
][         0.69
Activations Density: 0.026%