INDEX
Explanations
numerical values and their associated counts or classifications
Code, numbers, and symbols
specific numbers and identifiers
New Auto-Interp
Negative Logits
Baumann
-0.77
Nicola
-0.74
Goy
-0.69
Nicola
-0.67
Stephan
-0.67
Ronde
-0.67
Silva
-0.66
Kessler
-0.64
Lande
-0.64
Nikola
-0.63
POSITIVE LOGITS
AssemblyCulture
0.97
UserScript
0.70
Sumter
0.68
Hathaway
0.67
Guevara
0.66
Dunham
0.65
Alec
0.64
Gerrit
0.63
Coul
0.63
Lecce
0.63
Activations Density 0.841%