INDEX
Explanations
historical and political events and figures, particularly related to elections and movements
New Auto-Interp
Negative Logits
mable
-0.26
swear
-0.23
bells
-0.23
directives
-0.22
bonded
-0.21
leases
-0.21
optimize
-0.21
handlers
-0.21
ratios
-0.21
undercut
-0.21
POSITIVE LOGITS
?]
0.37
edit
0.32
Pg
0.30
:]
0.30
!]
0.28
former
0.27
Israeli
0.26
vol
0.26
...]
0.25
cour
0.25
Activations Density 24.218%