INDEX
Explanations
phrases related to the action of taking something
occurrences of the word "take" in various contexts
Negative Logits
holm         -0.69
ingen        -0.68
Cong         -0.66
Smile        -0.62
gian         -0.62
agre         -0.62
ese          -0.60
eers         -0.60
Democr       -0.59
dissatisf    -0.57
Positive Logits
advantage    1.26
aways        1.19
care         1.03
refuge       0.97
precautions  0.92
baths        0.91
aback        0.90
heed         0.89
offs         0.84
overs        0.84
Activations Density: 0.109%