INDEX
Explanations
the word "Sal" specifically
references to a specific individual named Sal
Negative Logits
ーティ         -0.81
Dominion     -0.77
hower        -0.77
Annotations  -0.71
derog        -0.71
lihood       -0.71
clipboard    -0.71
bom          -0.69
STEP         -0.69
schild       -0.69
Positive Logits
isbury       1.19
omon         1.12
mone         1.09
utations     1.06
adin         1.06
igon         1.03
afi          1.02
amon         1.01
utation      0.99
ient         0.98
Activations Density: 0.009%