INDEX
Explanations
references to the word "both" indicating comparisons or dualities
New Auto-Interp
Negative Logits
Chriftian
-0.66
Miao
-0.61
occaf
-0.61
Efq
-0.61
Elie
-0.60
Frie
-0.60
Cæsar
-0.59
goon
-0.59
monica
-0.58
ülerin
-0.58
POSITIVE LOGITS
both
3.67
both
3.41
Both
3.15
Both
3.13
BOTH
3.02
BOTH
2.81
Ambos
2.64
ambos
2.44
entrambi
2.44
beide
2.39
Activations Density 0.084%