INDEX
Explanations
phrases related to accountability and responsibility
instances of the word "hold" and its variations in various contexts
New Auto-Interp
Negative Logits
————
-0.70
ãĤ¡
-0.69
ghan
-0.67
endix
-0.66
zyk
-0.65
ibel
-0.64
ettel
-0.64
gnu
-0.64
shire
-0.63
nown
-0.62
POSITIVE LOGITS
sway
1.06
hold
1.01
erness
1.01
holders
0.95
holding
0.94
accountable
0.94
onto
0.92
overs
0.89
holder
0.89
fast
0.83
Activations Density 0.058%