INDEX
Explanations
names of individuals
proper names, specifically related to individuals and their roles or titles
New Auto-Interp
Negative Logits
ccording
-0.74
deviations
-0.72
entails
-0.67
suppose
-0.67
products
-0.66
shows
-0.64
glim
-0.64
embodiments
-0.64
necessities
-0.62
differed
-0.62
POSITIVE LOGITS
Jr
1.43
III
1.04
Sr
1.03
JR
1.01
hetti
0.90
oglu
0.89
kson
0.89
Jr
0.87
otti
0.87
son
0.86
Activations Density 0.176%