INDEX
Explanations
the pattern "X" followed by a number
the occurrences of the token 'X' in various contexts
New Auto-Interp
Negative Logits
getic
-0.84
kson
-0.79
enegger
-0.70
inally
-0.69
ufact
-0.68
captcha
-0.66
¢
-0.66
cffff
-0.66
Ú
-0.65
Poké
-0.64
POSITIVE LOGITS
avier
1.28
peria
1.21
cellence
1.14
posed
0.96
aminer
0.96
III
0.94
odus
0.92
clusive
0.92
posure
0.92
ML
0.92
Activations Density 0.027%