INDEX
Explanations
names of famous individuals or public figures
prominent public figures and their associations or actions
New Auto-Interp
Negative Logits
Ir
-0.80
Reviewer
-0.79
Ire
-0.79
Maria
-0.71
Irish
-0.69
ãĥĭ
-0.69
Northern
-0.67
FU
-0.66
Native
-0.65
URI
-0.64
POSITIVE LOGITS
steen
0.90
Jr
0.90
agher
0.80
gaard
0.75
Sr
0.75
bey
0.74
III
0.73
famously
0.72
ovich
0.71
imperson
0.70
Activations Density 0.152%