INDEX
Explanations
timestamps or durations mentioned in seconds
references to time durations
Negative Logits
Token     Logit
Hare      -0.73
hoe       -0.64
guide     -0.63
ribe      -0.61
mascul    -0.61
hare      -0.60
Dy        -0.60
plan      -0.60
cott      -0.59
sth       -0.59
Positive Logits
Token         Logit
seconds       3.49
seconds       2.61
milliseconds  2.45
Seconds       2.40
minutes       2.31
Minutes       1.83
moments       1.70
mins          1.61
millisec      1.51
hours         1.50
Activations Density 0.018%
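
To give a sense of scale for the density figure, here is a minimal sketch in Python. It assumes "activations density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term. The function name `activation_density` and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density is (number of active tokens) / (number of tokens).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 18 of which activate the feature.
sample = [0.0] * 99_982 + [1.5] * 18
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.018%
```

Under that assumption, a density of 0.018% corresponds to roughly 18 active tokens per 100,000 sampled.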