INDEX
Explanations
instances of the word "first" in various contexts
New Auto-Interp
Negative Logits
ixa
-0.19
first
-0.18
ixo
-0.17
dal
-0.16
rzy
-0.16
firstly
-0.15
ixin
-0.15
cken
-0.15
further
-0.15
ix
-0.15
POSITIVE LOGITS
s
0.33
-ever
0.31
tiên
0.31
born
0.28
-hand
0.28
-rate
0.28
timers
0.25
-order
0.24
responders
0.24
-degree
0.23
Activations Density 0.128%