INDEX
Explanations
numbers, URLs, and email addresses in text
characters, symbols, and formats associated with URLs and online content
New Auto-Interp
Negative Logits
Coul
-0.89
Schultz
-0.86
Bulgar
-0.85
kel
-0.84
Jenna
-0.83
729
-0.81
Nun
-0.81
Bun
-0.81
Clover
-0.80
Kemp
-0.79
POSITIVE LOGITS
ardi
1.07
ARD
0.93
ard
0.91
ĩ
0.89
ords
0.88
ARDS
0.86
Jord
0.84
ARI
0.83
Rom
0.83
ART
0.81
Activations Density 0.695%