INDEX
Explanations
phrases related to the passage of time
mentions of "passage" in relation to time or events
Negative Logits
pora       -0.89
iversity   -0.75
resid      -0.73
RAW        -0.71
ises       -0.68
Columb     -0.67
olulu      -0.67
ise        -0.66
ikarp      -0.64
ches       -0.64
Positive Logits
passages   0.82
aloud      0.77
phrase     0.77
uality     0.75
through    0.74
passage    0.73
ttes       0.72
bill       0.68
notes      0.66
ahead      0.64
Activations Density 0.030%