INDEX
Explanations
information about organizational announcements and changes in leadership or service policies
New Auto-Interp
Negative Logits
agg
-0.15
elu
-0.14
Stride
-0.13
Constructor
-0.13
@Before
-0.13
anon
-0.13
izar
-0.13
ame
-0.13
iap
-0.13
ames
-0.12
POSITIVE LOGITS
effective
0.85
effective
0.76
Effective
0.74
Effective
0.71
-effective
0.62
beginning
0.60
starting
0.58
Beginning
0.51
Starting
0.50
EFFECT
0.49
Activations Density 0.326%