INDEX
Explanations
phrases indicating ownership or belonging
phrases indicating possession or belonging
Negative Logits
arrests     -0.74
nel         -0.71
revolves    -0.70
orah        -0.65
ende        -0.65
xp          -0.64
ullivan     -0.63
chilling    -0.62
visor       -0.62
aukee       -0.62
Positive Logits
Sioux       0.68
pees        0.68
Species     0.66
uton        0.65
Kant        0.64
utable      0.63
srfAttach   0.63
Lank        0.63
Vander      0.63
Gong        0.63
Activations Density 0.079%