INDEX
Explanations
phrases denoting a significant event for the first time
references to "first time" occurrences in history
Negative Logits
Token    Logit
$$$$     -0.62
ona      -0.60
dan      -0.58
Gian     -0.57
_{       -0.56
mot      -0.56
hao      -0.55
ahn      -0.55
mo       -0.54
iltr     -0.54
Positive Logits
Token         Logit
history       1.10
history       0.88
clusively     0.88
existence     0.87
awhile        0.85
entirety      0.83
succession    0.79
History       0.76
ever          0.75
decades       0.74
Activations Density 0.157%