INDEX
Explanations
references to historical events and societal structures
references to theater and cultural experiences
Negative Logits
ebin      -0.79
[+        -0.72
GoPro     -0.71
](        -0.68
pic       -0.66
@         -0.65
WATCH     -0.64
(@        -0.64
)</       -0.63
https     -0.63
Positive Logits
tended        0.91
postwar       0.87
mattered      0.81
reasoned      0.79
depended      0.76
lacked        0.76
outnumbered   0.75
despised      0.73
acquies       0.72
remained      0.71
Activations Density: 2.036%