INDEX
Explanations
parentheses with a numerical value inside
opening parentheses in various contexts
Negative Logits
76561
-0.82
behavi
-0.75
CLASSIFIED
-0.73
¬¼
-0.68
GoldMagikarp
-0.67
Magikarp
-0.64
corrid
-0.63
exha
-0.62
vous
-0.62
kefeller
-0.61
Positive Logits
(
2.04
("
1.72
([
1.66
('
1.61
(~
1.57
(-
1.53
(<
1.52
((
1.51
(.
1.50
(=
1.49
Activations Density 0.195%