INDEX
Explanations
references to the "first" occurrence of events or elements
New Auto-Interp
Negative Logits
tics
-0.82
mbuds
-0.81
athed
-0.66
borg
-0.66
aths
-0.65
md
-0.65
ITH
-0.64
sav
-0.64
gregation
-0.64
rs
-0.64
POSITIVE LOGITS
iteration
1.06
installment
1.04
batch
0.99
couple
0.98
thing
0.98
few
0.97
baseman
0.93
responders
0.92
volley
0.92
lady
0.89
Activations Density 0.067%