INDEX
Explanations
proper nouns, specifically names like "Gary"
mentions of the name "Gary"
New Auto-Interp
Negative Logits
semble
-0.76
xual
-0.76
enegger
-0.73
uality
-0.71
ĺħ
-0.71
joined
-0.71
ilation
-0.68
orship
-0.68
ylum
-0.67
itect
-0.67
POSITIVE LOGITS
Johnson
0.97
Neville
0.90
Cooper
0.89
Sheffield
0.86
Hir
0.81
Melvin
0.80
Becker
0.78
Cohn
0.77
Kub
0.76
Gy
0.76
Activations Density 0.013%