INDEX
Explanations
keywords related to different topics such as sports, military, community, media, health, government, and politics
Negative Logits
himself       -0.91
herself       -0.87
tein          -0.80
oneself       -0.72
themselves    -0.68
Himself       -0.67
nikov         -0.65
auri          -0.65
Stern         -0.62
EntityItem    -0.60
Positive Logits
selves         1.18
ancestors      1.00
asses          0.92
counterparts   0.89
brethren       0.87
ourselves      0.86
partners       0.81
overl          0.79
cousins        0.78
motto          0.76
Activations Density 0.264%