INDEX
Explanations
proper nouns related to various institutions or entities
proper nouns, particularly names of institutions and characters
New Auto-Interp
Negative Logits
Reps
-0.75
Perez
-0.69
chore
-0.69
andra
-0.67
SNAP
-0.66
nesota
-0.66
oops
-0.66
EStreamFrame
-0.65
Snap
-0.65
Pipeline
-0.65
POSITIVE LOGITS
Britann
4.11
Albion
3.08
Avalon
1.61
ynes
1.18
mund
1.08
Yor
0.99
Atlantis
0.96
Acad
0.96
Bohem
0.96
Conan
0.96
Activations Density 0.039%