INDEX
Explanations
occurrences of the number six or its derivatives in different contexts
Negative Logits
ont         -0.17
ings        -0.17
lint        -0.17
ived        -0.16
âĶĶ         -0.16
ishments    -0.16
iams        -0.15
ports       -0.15
unger       -0.15
nds         -0.15
Positive Logits
teenth      0.39
ties        0.33
teen        0.33
ti          0.30
ty          0.27
sense       0.23
six         0.23
Flags       0.22
ix          0.21
ting        0.19
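
The two logit lists read as the lowest and highest entries of a per-token table. A minimal sketch of that selection follows, assuming the values are already available as a token-to-value mapping. The names `logit_effects` and `top_k_logits` are hypothetical, and the sample values are a subset copied from the lists above.

```python
from typing import Dict, List, Tuple

Entry = Tuple[str, float]

def top_k_logits(logit_effects: Dict[str, float], k: int = 10) -> Tuple[List[Entry], List[Entry]]:
    """Return (most negative k, most positive k) entries of a token -> value table."""
    # Sort ascending by value, so the most negative entries come first.
    ranked = sorted(logit_effects.items(), key=lambda kv: kv[1])
    negative = ranked[:k]
    # Reverse the ascending order to list the most positive entries first.
    positive = ranked[::-1][:k]
    return negative, positive

# Hypothetical input: a handful of values taken from the lists above.
logit_effects = {"teenth": 0.39, "ties": 0.33, "six": 0.23, "ont": -0.17, "nds": -0.15}
negative, positive = top_k_logits(logit_effects, k=2)
print("negative:", negative)
print("positive:", positive)
```

Here the positive side is dominated by suffixes such as "teenth", "teen", and "ty", which combine with "six" to form "sixteenth", "sixteen", and "sixty".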
Activations Density 0.101%
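
Activation density is commonly read as the share of sampled tokens on which a feature fires. A rough sketch under that assumption is below. The zero threshold, the function name `activation_density`, and the sample data are all assumptions for illustration, not taken from the source.

```python
from typing import Sequence

def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of activations strictly above the threshold (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)

# Hypothetical sample: the feature fires on 1 of 1000 tokens.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.101% would mean the feature is active on roughly one token in a thousand.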