INDEX
Explanations
phrases related to requirements or obligations
imperative statements or obligations that emphasize necessity
New Auto-Interp
Negative Logits
vironment
-0.75
Hamp
-0.71
LP
-0.71
Hamb
-0.66
vertisements
-0.66
Guilty
-0.64
Marlins
-0.63
TED
-0.62
Trop
-0.62
Mem
-0.60
POSITIVE LOGITS
obey
0.96
abide
0.94
be
0.89
reproduce
0.89
surely
0.87
comply
0.85
overcome
0.84
ered
0.84
undergo
0.83
endure
0.83
Activations Density 0.039%