INDEX
Explanations
instances where something is officially excluded as a possibility or a decision is made
words related to decisions or authoritative statements
NEGATIVE LOGITS
Token        Logit
issance      -0.86
akra         -0.82
velength     -0.75
vas          -0.73
rious        -0.72
ufact        -0.72
ription      -0.71
illin        -0.70
ãĥ¼ãĥĨãĤ£    -0.70
ãĥ¤          -0.64
POSITIVE LOGITS
Token          Logit
phas           0.76
differently    0.75
out            0.73
decisively     0.73
against        0.69
unanimously    0.68
definitively   0.68
doms           0.66
apart          0.65
maker          0.64
Activations Density: 0.028%
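
Two of the negative-logit tokens (ãĥ¼ãĥĨãĤ£ and ãĥ¤) look like encoding debris, but they are plausibly raw vocabulary strings from a byte-level BPE tokenizer. The sketch below assumes the model uses the GPT-2 style byte-to-character table, which this page does not state. The helper names (bytes_to_unicode, decode_token) are illustrative and not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build a byte -> printable character table in the GPT-2 byte-level BPE style.

    Assumption: the tokenizer behind this dashboard follows this convention.
    """
    # Bytes that already map to a visible character keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is shifted into the range starting at U+0100.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a vocabulary string back to raw bytes, then decode them as UTF-8."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Tokens can end mid-character, so invalid sequences are replaced, not raised.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥ¼ãĥĨãĤ£", "ãĥ¤"]:
        print(repr(tok), "->", decode_token(tok))
```

Under that assumption the two entries decode to the katakana fragments ーティ and ヤ. If the model uses a different tokenizer, the mapping does not apply and the strings should be read as shown in the table.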