INDEX
Explanations
information related to people's actions and events such as deaths, arrests, and followings
New Auto-Interp
Negative Logits
Locations
-0.81
divisions
-0.76
deployments
-0.69
reservoirs
-0.67
expansions
-0.67
shipments
-0.67
isSpecialOrderable
-0.66
closures
-0.64
highways
-0.64
Regulations
-0.64
POSITIVE LOGITS
whom
1.32
who
1.06
himself
0.89
who
0.89
whose
0.89
assassinated
0.84
hunt
0.80
died
0.79
nikov
0.79
opus
0.77
Activations Density 2.858%