INDEX
Explanations
famous names or public figures
names of notable individuals, particularly in sports and entertainment
New Auto-Interp
Negative Logits
Ire
-0.85
Ir
-0.75
Reviewer
-0.74
Maria
-0.72
Els
-0.68
ãĥŁ
-0.67
URI
-0.66
etheless
-0.65
ãĥ¼ãĤ¯
-0.64
Ts
-0.63
POSITIVE LOGITS
Jr
1.01
steen
0.89
zinski
0.84
agher
0.81
aka
0.80
III
0.80
Sr
0.79
gaard
0.75
QC
0.73
ovich
0.73
Activations Density 0.164%