INDEX
Explanations
instances of coercion or being compelled to do something against one's will
instances of the word "forced" indicating coercion or compulsion
New Auto-Interp
Negative Logits
Offense
-0.60
Enh
-0.59
ership
-0.58
izoph
-0.57
Lights
-0.56
pport
-0.54
Prediction
-0.54
iosyncr
-0.54
aird
-0.53
etheless
-0.53
POSITIVE LOGITS
into
1.12
thereto
0.91
to
0.87
onto
0.87
INTO
0.83
awake
0.83
otom
0.82
into
0.81
aback
0.81
tto
0.75
Activations Density 0.056%