INDEX
Explanations
time-related phrases, specifically durations such as "two weeks" or "five years"
temporal references and time durations
Negative Logits
fault       -0.81
bluff       -0.68
ente        -0.66
aily        -0.65
incent      -0.63
sympath     -0.63
treadmill   -0.61
incent      -0.60
exhib       -0.60
compuls     -0.60
Positive Logits
increments          0.98
span                0.96
nutshell            0.82
hops                0.78
ago                 0.77
opian               0.77
stride              0.77
inventoryQuantity   0.77
ixties              0.74
ruary               0.73
Activations Density 0.140%