INDEX
Explanations
phrases indicating duration or the passage of time
New Auto-Interp
Negative Logits
Times
-0.97
Times
-0.86
Time
-0.79
TIME
-0.75
times
-0.74
TIMES
-0.71
Time
-0.70
TIMES
-0.69
setTime
-0.66
times
-0.63
POSITIVE LOGITS
ti
0.88
tie
0.85
cime
0.83
tinte
0.72
rime
0.71
tine
0.69
tome
0.69
lime
0.69
tı
0.69
ime
0.67
Activations Density 0.153%