INDEX
Explanations
phrases related to social or cultural trends
phrases indicating ongoing actions or tendencies in people's behaviors and beliefs
New Auto-Interp
Negative Logits
ãĥ´ãĤ¡
-0.78
maximum
-0.69
Alright
-0.68
Kills
-0.63
player
-0.63
lasts
-0.62
predecessor
-0.62
Contains
-0.61
Resurrection
-0.61
Directions
-0.61
POSITIVE LOGITS
complain
1.24
perceive
1.24
shun
1.14
willingly
1.13
voluntarily
1.10
instinctively
1.10
migrate
1.09
flock
1.08
prefer
1.06
resent
1.06
Activations Density 0.515%