Explanations
special characters or unusual symbols
Negative Logits
Gillespie   -0.85
Manit       -0.76
Akin        -0.72
wagen       -0.71
Sakuya      -0.70
Bain        -0.69
Abyssal     -0.67
Samson      -0.67
Whitman     -0.67
Sidd        -0.67
Positive Logits
¥ŀ          1.30
ĻĤ          1.25
Ķ           1.24
ĵ           1.23
ĵĺ          1.20
ðŁ          1.20
Ĵ           1.17
į           1.13
Į           1.12
İ           1.08
Activations Density: 0.004%