Explanations
instances where an action or decision is necessary or required
the word "must" indicating obligations or necessities
Negative Logits
Token       Logit
Gw          -0.64
vironment   -0.63
Niet        -0.63
Pug         -0.60
Lif         -0.60
Wim         -0.59
Trop        -0.59
Nob         -0.59
Laden       -0.58
Yas         -0.58
Positive Logits
Token       Logit
ered        1.12
obey        1.00
ering       1.00
n           0.98
abide       0.96
comply      0.95
aches       0.95
surely      0.94
endure      0.90
angs        0.90
Activations Density: 0.040%
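
For orientation, the density figure can be read as the share of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only; it is not taken from the dashboard's own code. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical assumptions made for the example.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens with a
    nonzero activation. The dashboard's exact definition is not stated.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 4 active positions out of 10,000 tokens.
sample = [0.0] * 9996 + [1.3, 0.7, 2.1, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.040% corresponds to roughly 4 active tokens per 10,000 sampled, which fits a sparse feature tied to a narrow pattern such as the word "must".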