INDEX
Explanations
instances of a specific time duration
phrases indicating frequency or specific timeframes for events
New Auto-Interp
Negative Logits
onto
-0.70
promot
-0.69
tolerated
-0.66
heed
-0.64
brakes
-0.63
orest
-0.63
ourge
-0.62
intimately
-0.62
insulted
-0.61
existed
-0.61
POSITIVE LOGITS
nutshell
1.09
case
0.82
context
0.80
meantime
0.79
guise
0.76
cases
0.75
mode
0.73
ç¥ŀ
0.73
vein
0.67
lance
0.67
Activations Density 0.287%