INDEX
Explanations
phrases where actions or achievements are associated with specific groups or individuals
the word 'with' and related phrases indicating association or accompaniment
Negative Logits
Token        Logit
chan         -0.71
obi          -0.70
former       -0.69
affected     -0.67
late         -0.67
obook        -0.64
ondo         -0.64
icator       -0.63
SPONSORED    -0.63
strate       -0.63
Positive Logits
Token        Logit
regards      1.35
regard       1.29
draw         1.05
impunity     1.00
respect      0.94
standing     0.92
holding      0.84
stood        0.82
vig          0.81
drawn        0.76
Activations Density: 0.171%