INDEX
Explanations
the presence of the word "tick" in various forms or contexts
New Auto-Interp
Negative Logits
ocal
-0.69
IZE
-0.65
Dakota
-0.63
ocally
-0.63
¬¼
-0.61
ruciating
-0.61
ONT
-0.60
isSpecialOrderable
-0.60
icably
-0.58
Columb
-0.58
POSITIVE LOGITS
lers
1.13
lish
1.12
ety
1.10
ling
1.05
les
1.02
boxes
0.95
led
0.95
eting
0.93
box
0.92
buck
0.90
Activations Density 0.008%