INDEX
Explanations
the occurrence of the word "three" in various contexts
New Auto-Interp
Negative Logits
Athenians
-0.81
Efq
-0.77
chapels
-0.74
Lipschitz
-0.73
MAV
-0.72
Plutarch
-0.72
Cæsar
-0.71
Mongols
-0.71
Poincar
-0.70
ballads
-0.70
POSITIVE LOGITS
three
2.58
two
2.38
four
2.23
three
2.10
two
1.86
trois
1.82
five
1.82
six
1.69
drei
1.66
four
1.65
Activations Density 0.215%