Explanations
instances of the word "first" and its variations
Negative Logits
Token       Logit
first       -0.18
ixa         -0.17
further     -0.17
firstly     -0.16
dal         -0.15
rzy         -0.15
yssey       -0.14
ixo         -0.14
ssel        -0.14
essler      -0.14
Positive Logits
Token       Logit
-ever       0.39
s           0.36
-hand       0.33
-rate       0.32
born        0.31
tiên        0.30
-time       0.29
responders  0.27
-order      0.27
-degree     0.27
Activations Density: 0.129%