INDEX
Explanations
mentions of specific numbers with a hyphen or slash following the number
punctuations and formatting that signify sentence boundaries
New Auto-Interp
Negative Logits
Rally
-0.66
Colossus
-0.65
Squadron
-0.65
presc
-0.64
oche
-0.62
Classics
-0.62
Hail
-0.62
Transition
-0.61
scrap
-0.61
GP
-0.60
POSITIVE LOGITS
³³³
1.08
³³³³
1.00
³³³³³³³³
1.00
↵Âł
1.00
Âł
0.99
³³
0.97
Âł Âł Âł Âł
0.90
Âł Âł
0.90
³³
0.89
³³³³³³³³³³³³³³³³
0.89
Activations Density 0.513%