INDEX
Explanations
references to individuals named Julius or related to unary operations
names of famous people
New Auto-Interp
Negative Logits
ppuden
-0.58
ARG
-0.48
Garonne
-0.47
Christophe
-0.47
Kond
-0.45
Diane
-0.45
LayoutConstraint
-0.45
Coulson
-0.44
ondheim
-0.44
ARG
-0.43
POSITIVE LOGITS
Julius
2.23
Julius
1.96
Caesar
0.87
Caesar
0.70
Julio
0.67
Cornelius
0.65
lius
0.63
Giulio
0.63
Darryl
0.62
Cæsar
0.60
Activations Density 0.004%