INDEX
Explanations
names of individuals
proper nouns, specifically names of people
Negative Logits
──          -0.72
Windsor     -0.71
Cecil       -0.68
Brittany    -0.67
Michigan    -0.67
Wayne       -0.66
LEASE       -0.66
stock       -0.65
Mercury     -0.64
Beacon      -0.63
Positive Logits
hari        1.08
ibaba       1.05
oub         0.98
zynski      0.97
itars       0.96
iani        0.95
imov        0.92
awar        0.89
dh          0.89
wal         0.89
Activations Density 0.174%