Explanations
phrases related to accusations or responsibility
Negative Logits
horm                  -0.71
acquaintances         -0.68
types                 -0.67
onyms                 -0.67
guiActiveUnfocused    -0.67
ibl                   -0.66
fw                    -0.65
erm                   -0.64
await                 -0.64
ieties                -0.62
Positive Logits
overseeing            1.10
shaping               0.87
steering              0.86
organising            0.85
orchestr              0.84
brunt                 0.84
superv                0.80
organizing            0.78
coordinating          0.78
guiding               0.77
Activations Density: 5.720%