INDEX
Explanations
proper nouns
proper nouns, particularly names and places
New Auto-Interp
Negative Logits
ified
-0.98
rified
-0.95
lot
-0.83
riter
-0.81
ifies
-0.81
egg
-0.80
urgy
-0.79
imil
-0.79
binding
-0.78
blade
-0.77
POSITIVE LOGITS
UAL
0.87
elson
0.73
Centauri
0.72
querade
0.71
uality
0.71
Gupta
0.69
BILITY
0.69
OPLE
0.68
ñ
0.67
agement
0.67
Activations Density 0.205%