Explanations
mentions of actions or events from the past
references to events or actions that have occurred in the past
Negative Logits
hat        -0.72
Franch     -0.69
wagon      -0.68
starting   -0.68
oya        -0.68
MSN        -0.67
hyde       -0.66
atism      -0.65
NEY        -0.65
ODY        -0.64
Positive Logits
ebin       1.42
tense      1.28
decade     1.27
few        1.13
fortnight  1.10
couple     1.06
month      1.05
week       1.00
orate      0.99
century    0.96
Activations Density 0.030%