INDEX
Explanations
phrases with possessive or belonging connotations
occurrences of apostrophes or contractions
New Auto-Interp
Negative Logits
stacks
-0.68
slic
-0.65
payday
-0.64
wallets
-0.61
sided
-0.60
ciating
-0.60
uilt
-0.60
Transfer
-0.59
tle
-0.58
uberty
-0.58
POSITIVE LOGITS
oeuv
0.97
avez
0.86
Ag
0.86
Angelo
0.85
esp
0.85
Allah
0.85
Arc
0.85
hom
0.84
ét
0.82
Est
0.82
Activations Density 0.026%