INDEX
Explanations
phrases indicating the beginning or initiation of a process or sequence
phrases starting with "first."
New Auto-Interp
Negative Logits
jong
-0.82
bos
-0.74
md
-0.72
tics
-0.71
ingen
-0.70
sav
-0.69
ITH
-0.68
dain
-0.66
crim
-0.66
mbuds
-0.66
POSITIVE LOGITS
thing
1.06
baseman
1.00
responders
0.99
lady
0.93
installment
0.91
impression
0.89
iteration
0.86
attempt
0.86
incarnation
0.86
impressions
0.85
Activations Density 0.075%