Explanations
names related to a specific individual, possibly related to a news or legal context
proper nouns and names, particularly referring to a specific individual
Negative Logits
Token       Logit
Bore        -0.81
Witt        -0.80
Ney         -0.76
Lov         -0.76
Davidson    -0.75
Ober        -0.74
Station     -0.74
minster     -0.73
Square      -0.73
Franch      -0.73
Positive Logits
Token       Logit
peel        1.64
Mal         1.21
mantle      1.19
isman       1.18
Vick        1.17
Xin         1.15
minion      1.13
Ming        1.06
Snapdragon  1.04
Cait        1.02
Activations Density: 0.068%