INDEX
Explanations
proper nouns
proper nouns, specifically names of individuals
New Auto-Interp
Negative Logits
agra
-0.86
arl
-0.82
jah
-0.82
esh
-0.80
reb
-0.79
roach
-0.77
mem
-0.75
aji
-0.74
chal
-0.73
quer
-0.72
POSITIVE LOGITS
Fowler
0.86
Paddock
0.85
acebook
0.75
Osw
0.75
Huntington
0.74
Hudson
0.71
subp
0.68
herty
0.68
Hodg
0.67
dinand
0.66
Activations Density 0.021%