Negative Logits
AndGet        -0.12
_get          -0.10
Fam           -0.10
得            -0.10
acquainted    -0.10
acquaint      -0.09
få            -0.09
got           -0.09
getting       -0.09
familiar      -0.09
Positive Logits
rid           0.24
hold          0.19
past          0.17
noticed       0.14
her           0.14
rid           0.13
tings         0.13
past          0.12
-rich         0.12
away          0.12
Activations Density 0.027%