Explanations
phrases indicating taking control or responsibility over something
phrases that contain the term "take over."
Negative Logits
Forward                           -0.68
Scale                             -0.68
Bi                                -0.66
mpeg                              -0.63
Son                               -0.63
////////////////////////////////  -0.63
tu                                -0.62
uten                              -0.61
Lastly                            -0.60
Detect                            -0.60
Positive Logits
reins           0.76
lord            0.75
tones           0.73
erto            0.72
respons         0.70
drive           0.70
ordinate        0.69
rule            0.69
responsibility  0.69
arching         0.68
Activations Density 0.034%
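
The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, 3 of which activate the feature.
acts = [0.0] * 10000
acts[5] = 1.2
acts[800] = 0.4
acts[9001] = 2.7

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.034% would mean the feature fires on roughly 34 of every 100,000 token positions.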