INDEX
Explanations
instances of phrases indicating a specific timing or condition/event occurrence
instances of the word "when."
New Auto-Interp
Negative Logits
ighed
-0.77
hid
-0.75
vec
-0.74
wered
-0.74
oof
-0.73
sed
-0.70
WER
-0.69
gat
-0.68
arious
-0.67
uren
-0.67
POSITIVE LOGITS
2019
0.88
soever
0.87
hostilities
0.78
2020
0.76
2021
0.75
2018
0.74
reinforcements
0.72
tomorrow
0.71
expiration
0.71
returns
0.70
Activations Density 0.226%