INDEX
Explanations
job titles or names of public figures
New Auto-Interp
Negative Logits
acular
-0.91
orius
-0.77
ition
-0.76
orative
-0.72
uously
-0.68
eele
-0.68
uality
-0.68
igm
-0.67
oshenko
-0.65
heses
-0.65
POSITIVE LOGITS
ancock
0.78
idays
0.76
mong
0.75
aida
0.73
IGH
0.72
iland
0.71
ospital
0.70
ISTORY
0.70
aday
0.70
ttp
0.68
Activations Density 0.146%