INDEX
Explanations
references to well-known or notorious individuals, characters, or entities
famous or iconic things
New Auto-Interp
Negative Logits
preſent
-0.50
purpoſe
-0.50
Reſ
-0.49
cauſe
-0.47
chrétiens
-0.46
juſ
-0.46
eſt
-0.44
Perſ
-0.44
Conſ
-0.43
diſt
-0.42
POSITIVE LOGITS
infamous
0.86
SourceChecksum
0.80
famous
0.73
notorious
0.73
famous
0.69
ніципа
0.69
знамени
0.67
famed
0.67
dreaded
0.67
famously
0.66
Activations Density 0.018%