Explanations
Instances of the word "taken", as well as related phrases such as "taken out" or "taken hold".
Negative Logits
Token         Logit
holm          -0.67
Cong          -0.65
Smile         -0.65
accompanies   -0.64
lich          -0.61
ese           -0.61
agre          -0.60
tch           -0.59
livest        -0.59
ingen         -0.57
Positive Logits
Token         Logit
advantage     1.31
aways         1.20
precedence    0.99
care          0.99
aback         0.99
refuge        0.97
heed          0.94
precautions   0.91
liberties     0.88
pains         0.86
Activations Density: 1.926%
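The source does not define the density figure. One plausible reading is that it is the fraction of sampled token positions on which the feature activates at all. The sketch below illustrates that reading only; the function name, the zero threshold, and the sample activations are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical per-token activations: the feature fires on 2 of 100 tokens.
sample = [0.0] * 98 + [0.7, 1.3]
print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 1.926% would mean the feature is active on roughly 1 in 52 sampled tokens.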