INDEX
Explanations
temporal references in relation to significant events
that often come before other words
time indicators like weeks and dates
New Auto-Interp
Negative Logits
"):
-0.82
autorytatywna
-0.80
'):
-0.80
المعيارى
-0.79
="">
-0.70
")[
-0.70
principalColumn
-0.69
ſelves
-0.68
Diſ
-0.68
Administrativna
-0.67
POSITIVE LOGITS
we
0.96
,
0.89
when
0.80
there
0.68
they
0.65
during
0.61
During
0.60
When
0.59
he
0.59
you
0.57
Activations Density 0.282%