INDEX
Explanations
phrases related to specific names or titles of individuals
proper nouns and names of individuals
New Auto-Interp
Negative Logits
Canaver
-0.67
Angelo
-0.64
selves
-0.64
Constantine
-0.63
PROGRAM
-0.61
defin
-0.60
VIDEOS
-0.59
Egyptians
-0.58
Skydragon
-0.58
sep
-0.57
POSITIVE LOGITS
hai
0.85
atu
0.80
igham
0.77
Ô
0.77
liga
0.74
urst
0.73
istan
0.72
esi
0.72
jee
0.71
edu
0.69
Activations Density 0.268%