INDEX
Explanations
occurrences of closing brackets
New Auto-Interp
Negative Logits
émon
-0.67
Sek
-0.59
global
-0.58
leſs
-0.57
Dol
-0.55
Mont
-0.54
ρυσ
-0.54
GEST
-0.54
Wal
-0.54
ness
-0.53
POSITIVE LOGITS
]
1.95
]")]
1.71
)]
1.69
"]
1.69
})]
1.63
']
1.56
])
1.55
″]
1.55
))]
1.54
]
1.48
Activations Density 0.174%