INDEX
Explanations
references to temporal sequences and events in a narrative context
New Auto-Interp
Negative Logits
ongo
-0.69
oria
-0.65
olute
-0.63
ees
-0.61
ibaba
-0.60
ORN
-0.60
ken
-0.59
Listener
-0.58
phabet
-0.57
hemy
-0.57
POSITIVE LOGITS
way
0.76
day
0.73
til
0.72
week
0.70
morning
0.70
weekend
0.68
semester
0.66
mornings
0.66
furthe
0.65
night
0.65
Activations Density 0.167%