INDEX
Explanations
occurrences of the word "second" and its variations
New Auto-Interp
Negative Logits
ÙĪØµ
-0.16
first
-0.16
czy
-0.15
uki
-0.15
enough
-0.14
ake
-0.14
egin
-0.14
ζα
-0.14
ixa
-0.14
further
-0.13
POSITIVE LOGITS
arily
0.46
aries
0.34
hand
0.32
/th
0.31
(second
0.27
baseman
0.26
-tier
0.26
-generation
0.26
-hand
0.24
-feira
0.24
Activations Density 0.044%