INDEX
Explanations
numbers preceded by a space and followed by non-alphabetic characters
references to the number 58 in varying contexts
Negative Logits
Token     Logit
itan      -0.78
ART       -0.76
Pandora   -0.72
Cart      -0.71
Bend      -0.70
Bowman    -0.69
Asgard    -0.69
cart      -0.69
oton      -0.68
fruits    -0.68
Positive Logits
Token     Logit
68        1.66
98        1.64
58        1.64
88        1.48
178       1.47
118       1.46
78        1.45
58        1.44
138       1.44
118       1.44
Activations Density 0.073%