INDEX
Explanations
names of individuals, organizations, and locations
names and titles related to individuals and organizations
New Auto-Interp
Negative Logits
uers
-0.65
hip
-0.59
>>>>
-0.59
------------------------
-0.57
ADVERTISEMENT
-0.57
ornia
-0.56
ystem
-0.54
tml
-0.54
oward
-0.54
=(
-0.54
POSITIVE LOGITS
itself
1.62
herself
1.34
themselves
1.32
himself
1.24
Himself
1.02
ourselves
1.01
yourself
0.90
oneself
0.87
yourselves
0.82
myself
0.79
Activations Density 0.826%