INDEX
Explanations
proper names related to titles, positions, or individuals
the name "Brian" in various contexts
Negative Logits
TOR        -0.75
roxy       -0.75
USE        -0.66
orph       -0.65
artifacts  -0.65
INS        -0.64
UV         -0.64
uph        -0.64
minist     -0.64
allows     -0.62
Positive Logits
Moy          0.86
Cage         0.80
McCann       0.80
Blessed      0.79
rics         0.78
nie          0.77
lass         0.76
Shaw         0.76
Patrick      0.75
Fitzpatrick  0.75
Activations Density: 0.029%