INDEX
Explanations
phrases indicating personal perseverance and continuous effort
New Auto-Interp
Negative Logits
995
-0.15
stan
-0.15
earlier
-0.15
Earlier
-0.15
rib
-0.14
rens
-0.14
sooner
-0.14
eres
-0.14
Reuters
-0.14
ugh
-0.13
POSITIVE LOGITS
since
0.48
thereafter
0.43
since
0.41
Since
0.39
subsequent
0.39
以æĿ¥
0.38
Since
0.38
subsequently
0.37
depuis
0.34
_since
0.34
Activations Density 0.266%